Aevon API Docs.
Aevon is an OpenAI-compatible AI gateway giving you access to 30+ frontier models — Claude, GPT-4o, DeepSeek, Gemini, and more — through a single API key and base URL. Drop it into any existing client in under 60 seconds.
Quickstart
Get up and running in 4 steps. No SDK needed — works with any OpenAI-compatible client.
1. Get your API key
Sign up at aevon.sh/register with your GitHub account. Your API key is generated instantly — you'll receive $100 in free credits.
2. Set your base URL
Replace your existing base URL with Aevon's endpoint. That's it — your API key works across all models.
export OPENAI_BASE_URL="https://aevon.sh/v1"
export OPENAI_API_KEY="sx_your_key_here"
export ANTHROPIC_BASE_URL="https://aevon.sh"
export ANTHROPIC_API_KEY="sx_your_key_here"
3. Make your first request
import openai
client = openai.OpenAI(
base_url="https://aevon.sh/v1",
api_key="sx_your_key_here"
)
response = client.chat.completions.create(
model="claude-sonnet-4-5",
messages=[
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)
import OpenAI from 'openai';
const client = new OpenAI({
baseURL: 'https://aevon.sh/v1',
apiKey: 'sx_your_key_here',
});
const response = await client.chat.completions.create({
model: 'gpt-4o',
messages: [{ role: 'user', content: 'Hello!' }],
});
console.log(response.choices[0].message.content);
curl https://aevon.sh/v1/chat/completions \
-H "Authorization: Bearer sx_your_key_here" \
-H "Content-Type: application/json" \
-d '{
"model": "deepseek-r1",
"messages": [
{"role": "user", "content": "Hello!"}
]
}'
Authentication
All requests require a Bearer token in the Authorization header. Your API key starts with sx_.
Authorization: Bearer sx_your_key_here
⚠
Keep your API key secret.
Never expose it in frontend code, public repos, or client-side JavaScript. Regenerate immediately if compromised.
Chat Completions
The primary endpoint. Fully compatible with the OpenAI Chat Completions API.
POSThttps://aevon.sh/v1/chat/completions
Request Parameters
| Parameter |
Type |
Required |
Description |
| model |
string |
Required |
Model ID. See full list below. |
| messages |
array |
Required |
Array of message objects with role and content. |
| max_tokens |
integer |
Optional |
Maximum tokens to generate. Default: model max. |
| temperature |
float |
Optional |
Sampling temperature 0–2. Default: 1. |
| stream |
boolean |
Optional |
Enable streaming via SSE. Default: false. |
| system |
string |
Optional |
System prompt (for supported models). |
Available Models
Pass any of these model IDs in the model parameter. Switch models by changing a single string — no other code changes needed.
Anthropic
claude-sonnet-4-5
Anthropic
200K
Latest
claude-opus-4-5
Anthropic
200K
Pro
OpenAI
gpt-4o
OpenAI
128K
Pro
gpt-4o-mini
OpenAI
128K
Free
DeepSeek
deepseek-r1
DeepSeek
64K
Hot
deepseek-v2-lite
DeepSeek
32K
Free
Others
gemini-2.0-pro
Google
1M
Pro
qwen3-235b
Alibaba
32K
New
glm-4-5-air
Zhipu
128K
Free
llama-3.3-70b
Meta
128K
Free
Cursor Integration
Use Aevon as a drop-in replacement in Cursor. Go to Settings → Models → OpenAI API Key and set:
API Key: sx_your_key_here
Base URL: https://aevon.sh/v1
✓
All Cursor models work.
GPT-4o, Claude, and DeepSeek are all accessible once configured.
Claude Code Integration
Run Claude Code through Aevon by overriding the Anthropic base URL:
export ANTHROPIC_BASE_URL="https://aevon.sh"
export ANTHROPIC_API_KEY="sx_your_key_here"
claude
Python SDK
pip install openai
import openai
client = openai.OpenAI(
base_url="https://aevon.sh/v1",
api_key="sx_your_key_here"
)
stream = client.chat.completions.create(
model="claude-sonnet-4-5",
messages=[{"role": "user", "content": "Write a haiku"}],
stream=True
)
for chunk in stream:
print(chunk.choices[0].delta.content, end="")
Node.js SDK
npm install openai
import OpenAI from 'openai';
const client = new OpenAI({
baseURL: 'https://aevon.sh/v1',
apiKey: 'sx_your_key_here',
});
const models = ['claude-sonnet-4-5', 'gpt-4o', 'deepseek-r1'];
for (const model of models) {
const res = await client.chat.completions.create({
model,
messages: [{ role: 'user', content: 'Hello' }],
});
console.log(model, res.choices[0].message.content);
}
Error Codes
| Code | Meaning | Fix |
| 401 |
Unauthorized |
Invalid or missing API key. |
| 402 |
Insufficient Credits |
Top up your balance at aevon.sh/billing. |
| 429 |
Rate Limited |
Slow down requests or upgrade plan. |
| 500 |
Gateway Error |
Provider issue. Retry or switch models. |
Credits & Billing
Aevon is usage-based — you're charged per token consumed, not per month. Free tier models consume zero credits.
ℹ
$100 free on signup.
No credit card needed. Covers approximately 1–2M tokens depending on the model used.
Rate Limits
| Tier | Requests/min | Tokens/min |
| Free |
20 RPM |
100K TPM |
| Credits |
60 RPM |
500K TPM |
| Pro |
300 RPM |
2M TPM |