Are rate limits per API key or per account?
Limits are typically enforced per account or API key. Check your dashboard for exact limits.
Reference
Rate limits cap request and token throughput to keep the service reliable.
Handle 429 responses with backoff and spread retries across workers.
A rate limit is a provider-enforced cap on requests or tokens within a time window.
HTTP/1.1 429 Too Many Requests Retry-After: 2
FAQ
Limits are typically enforced per account or API key. Check your dashboard for exact limits.
No. Wait for Retry-After or use exponential backoff with jitter before retrying.
Batch low-priority work, reduce parallelism, and cache repeated responses.