Runloop · Postman Collection

Runloop Benchmark API

Run and manage Benchmarks and Benchmark Runs — the evaluation framework for AI coding agents. Supports SWE-Bench, SWE-smith, and custom benchmark definitions, scenario aggregation, run lifecycle (start/cancel/complete), scoring, and log retrieval. Contact Support: Name: Runloop AI Support Email: [email protected]

28
Requests
28
Folders
View on GitHub Raw JSON AIAI AgentsCoding AgentsSandboxesDevboxesCode ExecutionEvaluationBenchmarksSWE-BenchMCPSnapshotsmicroVMEnterpriseSOC 2Postman Collection

Overview

Runloop Benchmark API is a Postman Collection published by Runloop on the APIs.io network.

Run and manage Benchmarks and Benchmark Runs — the evaluation framework for AI coding agents. Supports SWE-Bench, SWE-smith, and custom benchmark definitions, scenario aggregation, run lifecycle (start/cancel/complete), scoring, and log retrieval. Contact Support: Name: Runloop AI Support Email: [email protected]

The collection contains 28 requests organised into 28 folders.

Tagged areas include AI, AI Agents, Coding Agents, Sandboxes, and Devboxes.

Requests & Folders

v1

Related API Specs

Runloop Devbox API (OpenAPI) Runloop Blueprint API (OpenAPI) Runloop Benchmark API (OpenAPI) Runloop Scenario API (OpenAPI) Runloop Agents API (OpenAPI) Runloop Axons API (OpenAPI) Runloop Storage Objects API (OpenAPI) Runloop Secrets API (OpenAPI) Runloop Network Policies API (OpenAPI) Runloop Gateway Configs API (OpenAPI) Runloop MCP Configs API (OpenAPI) Runloop API Keys API (OpenAPI) Runloop Executions Streaming API (OpenAPI)
Back to Runloop · All Collections · GitHub