Rate Limits, Retries, and Reliability¶

The wrapper keeps reliability behavior intentionally small. It parses PRC rate-limit metadata, raises typed errors, optionally performs one safe retry on 429, and enables dynamic pre-request limiting by default with rate_limited=True. It does not include cache backends, circuit breakers, tracing, metrics sinks, request replay, or request coalescing.

Request Headers¶

Every server request sends:

Server-Key: the per-server API key.
Authorization: only when global_key= is configured.
User-Agent: wrapper version and Python runtime by default.

429 Behavior¶

On a 429, the transport raises RateLimitError.

from erlc_api import RateLimitError

try:
    await api.players()
except RateLimitError as exc:
    print(exc.retry_after_s, exc.reset_epoch_s, exc.bucket)

Useful fields:

Field	Meaning
`retry_after_s`	Seconds to wait, parsed from headers or body when available.
`reset_epoch_s`	Epoch reset time parsed from rate-limit headers.
`bucket`	PRC bucket name when provided.
`error_code`	PRC error code when present.
`body_excerpt`	Short safe body excerpt for diagnostics.

Retry Policy¶

Default constructor behavior:

api = AsyncERLC("server-key", retry_429=True)

When timing information exists, the wrapper sleeps once and retries once. It does not perform exponential backoff or infinite retries.

Disable the built-in retry when your app has its own scheduler:

api = AsyncERLC("server-key", retry_429=False)

Dynamic Rate Limiter¶

Dynamic limiting is enabled by default, including for apps that poll or have bursty command handlers:

api = AsyncERLC("server-key")

Behavior:

learns from X-RateLimit-* and Retry-After headers on every response;
waits before requests when an observed bucket has no remaining capacity until reset;
updates state from actual 429 responses;
tracks global-key requests separately from server-key-only requests;
stores state in memory only.

Opt out only when your application already coordinates rate limits:

api = AsyncERLC("server-key", rate_limited=False)

Inspect current state:

snapshot = api.rate_limits
if snapshot is not None:
    print(snapshot.to_dict())

Use erlc_api.ratelimit directly only for custom transports:

from erlc_api.ratelimit import AsyncRateLimiter

limiter = AsyncRateLimiter()

Polling Guidance¶

Use erlc_api.limits for conservative planning:

from erlc_api.limits import poll_plan

plan = poll_plan(server_count=2, endpoint_count=3, timeout_s=120)

This module does not claim official PRC rate limits. It only helps avoid overly aggressive polling in your own app.

Read Caching¶

For dashboards and bots that repeat the same reads, use explicit memory TTL caching instead of polling harder:

from erlc_api.cache import AsyncCachedClient

cached = AsyncCachedClient(api, ttl_s=5)
players = await cached.players()

CachedClient and AsyncCachedClient cache read endpoints only. Commands are never cached.

Multi-Server Fanout¶

For multiple private servers, use bounded concurrency so one dashboard refresh does not send an unbounded burst:

from erlc_api.multiserver import AsyncMultiServer, ServerRef

servers = [ServerRef("main", "key-1"), ServerRef("training", "key-2")]
manager = AsyncMultiServer(api, servers, concurrency=3)
statuses = await manager.status()

Per-server errors are collected by default, so one failed server does not hide the state of every other server.

For larger deployments, see Scaling Your App.

Reliability Boundaries¶

The wrapper handles:

decoding successful responses;
mapping known error codes to typed exceptions;
parsing rate-limit metadata;
default dynamic pre-request waiting when rate_limited=True;
explicit read caching through erlc_api.cache;
bounded read fanout through erlc_api.multiserver;
closing sync and async HTTP clients.

Your application should handle:

persistence and retries across process restarts;
user-visible degraded status;
queueing high-volume bot commands;
idempotency for webhook delivery.

Common Mistakes¶

Running many watchers at one-second intervals across multiple servers.
Treating retry_429=True as a full retry policy.
Expecting the process-local limiter to coordinate multiple Python processes.
Swallowing RateLimitError without slowing future calls.
Assuming advisory safe_interval() values are official PRC limits.
Caching command results. Cache helpers intentionally skip command execution.
Using unbounded multi-server fanout in a bot command.

Previous Page: Security and Secrets | Next Page: Scaling Your App