Runloop Benchmark API
Run and manage Benchmarks and Benchmark Runs — the evaluation framework for AI coding agents. Supports SWE-Bench, SWE-smith, and custom benchmark definitions, scenario aggregation, run lifecycle (start/cancel/complete), scoring, and log retrieval. Contact Support: Name: Runloop AI Support Email: [email protected]
Overview
Runloop Benchmark API is a Postman Collection published by Runloop on the APIs.io network.
Run and manage Benchmarks and Benchmark Runs — the evaluation framework for AI coding agents. Supports SWE-Bench, SWE-smith, and custom benchmark definitions, scenario aggregation, run lifecycle (start/cancel/complete), scoring, and log retrieval. Contact Support: Name: Runloop AI Support Email: [email protected]
The collection contains 28 requests organised into 28 folders.
Tagged areas include AI, AI Agents, Coding Agents, Sandboxes, and Devboxes.