Documentation Index
Fetch the complete documentation index at: https://docs.polinate.ai/llms.txt
Use this file to discover all available pages before exploring further.
Rate Limits
We use a token bucket model to ensure fair usage and stability.Model overview
- A bucket holds a limited number of tokens.
- Tokens are consumed per request.
- Tokens are refilled over time at a fixed rate.
- When the bucket is empty, requests are rate-limited with HTTP 429.
Handling 429 responses
- Back off and retry after a short delay.
- 429 responses always include a
Retry-Afterheader in seconds. Honor this value before retrying. - Use batching and idempotency where possible.
Example 429 response
Headers
Retry-After is expressed in seconds.
