Overview
Our API implements rate limiting to ensure fair usage and maintain service stability for all users. Rate limits are measured in queries (not HTTP requests — see How Queries Are Counted below). When you exceed the rate limit, you’ll receive a429 Too Many Requests response with helpful headers to guide your retry strategy.
Rate Limits
- Default Rate Limit: 200 queries per minute per API key
- Rate Limit Window: 60-second sliding window
How Queries Are Counted
Rate limits are counted in queries, not in HTTP requests. The number of queries a single API call consumes depends on the endpoint type:- Single-entity endpoints: one request equals one query.
- Bulk / batch endpoints (such as the enrichment and match endpoints): each row (entity) in the request payload counts as a separate query. A single HTTP request that submits 50 entities consumes 50 queries against your limit — not 1.
Examples
| API call | Endpoint type | Entities in payload | Queries counted |
|---|---|---|---|
| Enrich a batch of 50 businesses | Bulk | 50 | 50 |
| Match a batch of 10 prospects | Bulk | 10 | 10 |
| Enrich a single business | Single | 1 | 1 |
X-RateLimit-Remaining header decrements by the number of entities in your request — not by 1 per call. Size your batches accordingly to stay within the 200-queries-per-minute limit.
Rate Limit Headers
When making requests to our API, the following headers are included in every response to help you track your usage:| Header | Description | Example |
|---|---|---|
X-RateLimit-Limit | Maximum number of queries allowed in the current window | 200 |
X-RateLimit-Remaining | Number of queries remaining in the current window | 150 |
X-RateLimit-Reset | Unix timestamp when the rate limit window resets | 1234567890 |
Retry-After | Number of seconds to wait before retrying (only included in 429 responses) | 60 |
Rate Limit Response
When you exceed the rate limit, you’ll receive the following response:HTTP
Implementing Retry Logic
Best Practices
- Always check for 429 responses and implement exponential backoff
- Respect the Retry-After header - this tells you the minimum time to wait
- Monitor your usage using the
X-RateLimit-Remainingheader - Account for batch size - remember each entity in a bulk request counts as a query
- Implement preemptive throttling when
X-RateLimit-Remainingis low
Example Implementation
Monitoring Rate Limits
Preemptive Rate Limiting
To avoid hitting rate limits, monitor theX-RateLimit-Remaining header and implement throttling. Remember that X-RateLimit-Remaining reflects queries remaining, so a single bulk request can consume many at once:
JavaScript
Frequently Asked Questions
What happens if I consistently hit rate limits?
What happens if I consistently hit rate limits?
Repeated rate limit violations may result in temporary suspension of API access. Implement proper retry logic and consider upgrading your plan if you need higher limits.
How is the rate limit window calculated?
How is the rate limit window calculated?
We use a sliding window approach that tracks queries over the past 60 seconds. Each query timestamp is recorded, and queries older than 60 seconds are removed from the count.
How are queries counted for bulk or batch endpoints?
How are queries counted for bulk or batch endpoints?
The rate limit counts queries, not HTTP requests. For bulk and batch endpoints (such as enrichment and match), every row (entity) in your request payload counts as one query. For example, a single request that submits 50 entities to a batch endpoint consumes 50 queries from your per-minute limit — not 1. Single-entity endpoints count as one query per request.
Are rate limits applied per API key or per IP address?
Are rate limits applied per API key or per IP address?
Rate limits are applied per API key. Each API key has its own independent rate limit counter.
Do different endpoints have different rate limits?
Do different endpoints have different rate limits?
Currently, all endpoints share the same rate limit of 200 queries per minute. Keep in mind that bulk and batch endpoints count each entity in the payload as a separate query, so a single call to those endpoints can consume multiple queries. Some endpoints may have additional specific limits which will be documented separately.