Rate Limits
Requests are rate-limited per IP address. Limits vary by endpoint type:
| Endpoint | Rate Limit |
|---|---|
| Scan endpoints (/evaluate, /info, /export) | 30 requests/minute |
| LLM endpoints (/optimize, /triggers, /judge) | 5 requests/minute |
| Other endpoints | 60 requests/minute |
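
To stay under these limits, a client can throttle itself before sending requests. The following is a minimal sketch of a sliding-window limiter, not part of the API itself. The category names in `LIMITS_PER_MINUTE` and the `SlidingWindowLimiter` class are hypothetical and introduced only for illustration; the per-minute values are taken from the table above.

```python
import time
from collections import deque

# Per-minute limits copied from the table above. The category names are
# hypothetical labels chosen for this sketch, not API identifiers.
LIMITS_PER_MINUTE = {"scan": 30, "llm": 5, "other": 60}


class SlidingWindowLimiter:
    """Client-side throttle: at most `limit` calls per `window` seconds."""

    def __init__(self, limit, window=60.0, clock=time.monotonic, sleep=time.sleep):
        self.limit = limit
        self.window = window
        self._clock = clock
        self._sleep = sleep
        self._sent = deque()  # timestamps of calls still inside the window

    def wait_time(self):
        """Return the number of seconds until the next call is allowed."""
        now = self._clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window:
            self._sent.popleft()
        if len(self._sent) < self.limit:
            return 0.0
        return self.window - (now - self._sent[0])

    def acquire(self):
        """Block until a call is allowed, then record it."""
        while True:
            delay = self.wait_time()
            if delay <= 0:
                break
            self._sleep(delay)
        self._sent.append(self._clock())
```

A client would keep one limiter per endpoint category, for example `SlidingWindowLimiter(LIMITS_PER_MINUTE["llm"])`, and call `acquire()` before each request. Because the server counts requests per IP address, several clients behind the same address share one budget, which a per-process limiter cannot see.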
The API also fetches repository data through the GitHub REST API v3. GitHub's own rate limits apply on the server side:
| Server Config | GitHub Rate Limit |
|---|---|
| No GitHub token | 60 requests/hour |
| With token | 5,000 requests/hour |
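
GitHub reports the remaining quota on each REST response through the `X-RateLimit-Remaining` and `X-RateLimit-Reset` headers. For operators who want to monitor that budget, the sketch below reads those two headers from a response. The function name `github_budget` is hypothetical, and how this API's server actually tracks GitHub quota is not described here.

```python
import time


def github_budget(headers, now=None):
    """Return (remaining_requests, seconds_until_reset) from GitHub headers.

    `headers` is any mapping of response header names to string values.
    X-RateLimit-Reset is a UTC epoch timestamp in seconds.
    """
    now = time.time() if now is None else now
    # Header names are case-insensitive, so normalize before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    remaining = int(lowered["x-ratelimit-remaining"])
    reset_epoch = int(lowered["x-ratelimit-reset"])
    return remaining, max(0.0, float(reset_epoch - now))
```

When `remaining` reaches zero, further GitHub calls are rejected until the reset time, so fresh evaluations may be unavailable until then even if the per-IP limits above have not been reached.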
Caching
Evaluation results are cached on the server, keyed by owner/repo/commit_sha (and path for single-skill requests).
- If the repository's HEAD has not changed, the cached result is returned instantly
- Cache TTL is 1 hour; after expiry, the next request triggers a fresh evaluation
- Full-repo and single-skill results are cached independently
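
The caching rules above can be sketched as a small in-memory cache. This is an illustration under stated assumptions, not the server's actual implementation: `cache_key`, `TTLCache`, and the in-memory dictionary are hypothetical, while the key fields and the 1-hour TTL come from the description above.

```python
import time

CACHE_TTL_SECONDS = 3600  # 1 hour, as stated above


def cache_key(owner, repo, commit_sha, path=None):
    """Build a cache key; `path` is set only for single-skill requests."""
    return (owner, repo, commit_sha, path)


class TTLCache:
    """Hypothetical in-memory cache whose entries expire after `ttl` seconds."""

    def __init__(self, ttl=CACHE_TTL_SECONDS, clock=time.monotonic):
        self.ttl = ttl
        self._clock = clock
        self._store = {}  # key -> (stored_at, value)

    def get(self, key):
        """Return the cached value, or None if missing or expired."""
        entry = self._store.get(key)
        if entry is None:
            return None
        stored_at, value = entry
        if self._clock() - stored_at >= self.ttl:
            del self._store[key]  # expired: the caller should re-evaluate
            return None
        return value

    def put(self, key, value):
        """Store a value together with the time it was cached."""
        self._store[key] = (self._clock(), value)
```

Because the commit SHA is part of the key, a new commit on HEAD produces a different key and therefore a cache miss. Including `path` in the key is what keeps full-repo and single-skill results separate.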