Rate Limits

Rate limits are enforced per API key at the Nginx gateway. Limits apply independently per region: a request to the US region and a request to the JP region each count only against that region's limit.

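| Tier  | Requests/min | Requests/sec | Concurrent Streams |
|-------|--------------|--------------|--------------------|
| Basic | 100          | 2            | 1                  |
| Pro   | 120,000      | 200          | 10                 |
| Quant | Unlimited    | Unlimited    | Unlimited          |
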
  • Global Nginx limit: 5,000 req/s per API key
  • Tier-specific limits apply within the global cap
  • Batch requests count as one request but are limited to 100 calls per batch
  • Streaming RPCs count as one request at connection time
  • Individual messages within a stream are not rate-limited
  • Connection limits per tier apply
  • SSE connections count against concurrent stream limits
  • Event throughput is not rate-limited once connected
  • Reconnect attempts count as new requests
  • Quant only — no rate limits
  • One connection per API key
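
Because a batch counts as one request but may hold at most 100 calls, a client with a longer list of calls has to split it before sending. The sketch below shows one way to do that; `chunk_calls` and `MAX_CALLS_PER_BATCH` are hypothetical names chosen for illustration, and the calls are treated as opaque items rather than any particular request format.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")

# Taken from the batch limit stated above; the constant name is illustrative.
MAX_CALLS_PER_BATCH = 100


def chunk_calls(calls: Sequence[T], size: int = MAX_CALLS_PER_BATCH) -> Iterator[List[T]]:
    """Yield successive batches containing at most `size` calls each."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(calls), size):
        yield list(calls[start:start + size])
```

With 250 pending calls this yields batches of 100, 100, and 50, so the work costs three requests against the rate limit instead of 250.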

Rate-limited responses include:

| Header                | Description                                |
|-----------------------|--------------------------------------------|
| X-RateLimit-Limit     | Maximum requests per window                |
| X-RateLimit-Remaining | Remaining requests in window               |
| X-RateLimit-Reset     | Unix timestamp when window resets          |
| Retry-After           | Seconds until next request allowed (on 429) |
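
A client can read these headers to see how much of the window is left. The following is a minimal sketch, not an official client: `RateLimitStatus` and `parse_rate_limit_headers` are hypothetical names, and it assumes `X-RateLimit-Reset` is expressed in whole seconds since the Unix epoch.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class RateLimitStatus:
    limit: Optional[int]      # X-RateLimit-Limit
    remaining: Optional[int]  # X-RateLimit-Remaining
    reset_at: Optional[int]   # X-RateLimit-Reset (assumed: Unix seconds)

    def seconds_until_reset(self, now: float) -> Optional[float]:
        """Seconds left in the window, or None if no reset time was sent."""
        if self.reset_at is None:
            return None
        return max(0.0, float(self.reset_at) - now)


def _to_int(value: Optional[str]) -> Optional[int]:
    """Parse an integer header value; return None if missing or malformed."""
    if value is None:
        return None
    try:
        return int(value)
    except ValueError:
        return None


def parse_rate_limit_headers(headers: Mapping[str, str]) -> RateLimitStatus:
    # HTTP header names are case-insensitive, so normalize before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    return RateLimitStatus(
        limit=_to_int(lowered.get("x-ratelimit-limit")),
        remaining=_to_int(lowered.get("x-ratelimit-remaining")),
        reset_at=_to_int(lowered.get("x-ratelimit-reset")),
    )
```

Passing the current time explicitly (for example `time.time()`) keeps the helper easy to test.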

A 429 response returns a JSON body such as:

{
  "error": "rate_limit_exceeded",
  "message": "Rate limit exceeded. Retry after 1 second.",
  "retry_after": 1
}

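
The wait time after a 429 is available both in the `Retry-After` header and in the `retry_after` field of the body. The sketch below shows one way a client might pick a value. The function name `retry_delay_seconds`, the header-first precedence, and the 1-second default are assumptions made for illustration, not documented behaviour.

```python
import json
from typing import Mapping


def retry_delay_seconds(headers: Mapping[str, str], body: str, default: float = 1.0) -> float:
    """Return how many seconds to wait after a 429 response."""
    lowered = {name.lower(): value for name, value in headers.items()}

    # Assumed precedence: trust the Retry-After header first.
    header_value = lowered.get("retry-after")
    if header_value is not None:
        try:
            return max(0.0, float(header_value))
        except ValueError:
            pass  # not a plain number of seconds; fall back to the body

    # Otherwise look for the retry_after field in the JSON error body.
    try:
        payload = json.loads(body)
    except ValueError:
        return default
    if isinstance(payload, dict) and payload.get("error") == "rate_limit_exceeded":
        value = payload.get("retry_after")
        if isinstance(value, (int, float)) and not isinstance(value, bool):
            return max(0.0, float(value))
    return default
```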
  1. Use streaming protocols (gRPC, SSE, WebSocket) instead of polling
  2. Implement exponential backoff on 429 responses
  3. Cache responses client-side where appropriate
  4. Use the region closest to your infrastructure to avoid wasted round-trips
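
The exponential backoff recommended above can be sketched as follows. This is an illustrative outline under stated assumptions, not a reference client: `call_with_backoff`, `backoff_delay`, the `Response` tuple shape, the 0.5-second base, the 30-second cap, and the use of random jitter are all choices made for the example. The `send` callable stands in for whatever function actually issues the request.

```python
import random
import time
from typing import Callable, Optional, Tuple

# Hypothetical response shape: (status_code, retry_after_seconds or None, body)
Response = Tuple[int, Optional[float], str]


def backoff_delay(attempt: int, base: float = 0.5, cap: float = 30.0) -> float:
    """Exponential delay for a 0-based retry number, capped at `cap` seconds."""
    return min(cap, base * (2 ** attempt))


def call_with_backoff(
    send: Callable[[], Response],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
    jitter: Callable[[float], float] = lambda d: random.uniform(0, d),
) -> Response:
    """Call `send` until it returns a non-429 status or attempts run out."""
    response = send()
    for attempt in range(max_attempts - 1):
        status, retry_after, _ = response
        if status != 429:
            return response
        # Randomize the exponential delay so many clients do not retry in step.
        delay = jitter(backoff_delay(attempt))
        # Never retry sooner than the server asked for.
        if retry_after is not None:
            delay = max(delay, retry_after)
        sleep(delay)
        response = send()
    # Either a success or the last 429 once attempts are exhausted.
    return response
```

Injecting `sleep` and `jitter` keeps the retry loop deterministic in tests. Since reconnect attempts count as new requests, the same delay logic is also a reasonable fit for stream reconnects.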