Documentation · V1.0

Aevon API Docs.

Aevon is an OpenAI-compatible AI gateway giving you access to 30+ frontier models — Claude, GPT-4o, DeepSeek, Gemini, and more — through a single API key and base URL. Drop it into any existing client in under 60 seconds.

Quickstart

Get up and running in 4 steps. No SDK needed — works with any OpenAI-compatible client.

1. Get your API key

Sign up at aevon.sh/register with your GitHub account. Your API key is generated instantly — you'll receive $100 in free credits.

2. Set your base URL

Replace your existing base URL with Aevon's endpoint. That's it — your API key works across all models.

Shell
# Set environment variables export OPENAI_BASE_URL="https://aevon.sh/v1" export OPENAI_API_KEY="sx_your_key_here" # For Claude Code specifically export ANTHROPIC_BASE_URL="https://aevon.sh" export ANTHROPIC_API_KEY="sx_your_key_here"

3. Make your first request

Python
Node.js
cURL
import openai client = openai.OpenAI( base_url="https://aevon.sh/v1", api_key="sx_your_key_here" ) response = client.chat.completions.create( model="claude-sonnet-4-5", # or any model messages=[ {"role": "user", "content": "Hello!"} ] ) print(response.choices[0].message.content)

Authentication

All requests require a Bearer token in the Authorization header. Your API key starts with sx_.

HTTP Header
Authorization: Bearer sx_your_key_here
Keep your API key secret. Never expose it in frontend code, public repos, or client-side JavaScript. Regenerate immediately if compromised.

Chat Completions

The primary endpoint. Fully compatible with the OpenAI Chat Completions API.

Endpoint
POSThttps://aevon.sh/v1/chat/completions

Request Parameters

Parameter Type Required Description
model string Required Model ID. See full list below.
messages array Required Array of message objects with role and content.
max_tokens integer Optional Maximum tokens to generate. Default: model max.
temperature float Optional Sampling temperature 0–2. Default: 1.
stream boolean Optional Enable streaming via SSE. Default: false.
system string Optional System prompt (for supported models).

Available Models

Pass any of these model IDs in the model parameter. Switch models by changing a single string — no other code changes needed.

Anthropic

claude-sonnet-4-5 Anthropic 200K Latest
claude-opus-4-5 Anthropic 200K Pro

OpenAI

gpt-4o OpenAI 128K Pro
gpt-4o-mini OpenAI 128K Free

DeepSeek

deepseek-r1 DeepSeek 64K Hot
deepseek-v2-lite DeepSeek 32K Free

Others

gemini-2.0-pro Google 1M Pro
qwen3-235b Alibaba 32K New
glm-4-5-air Zhipu 128K Free
llama-3.3-70b Meta 128K Free

Cursor Integration

Use Aevon as a drop-in replacement in Cursor. Go to Settings → Models → OpenAI API Key and set:

Cursor Settings
API Key: sx_your_key_here Base URL: https://aevon.sh/v1
All Cursor models work. GPT-4o, Claude, and DeepSeek are all accessible once configured.

Claude Code Integration

Run Claude Code through Aevon by overriding the Anthropic base URL:

Shell
export ANTHROPIC_BASE_URL="https://aevon.sh" export ANTHROPIC_API_KEY="sx_your_key_here" claude # run claude code normally

Python SDK

Python
pip install openai import openai client = openai.OpenAI( base_url="https://aevon.sh/v1", api_key="sx_your_key_here" ) # Streaming example stream = client.chat.completions.create( model="claude-sonnet-4-5", messages=[{"role": "user", "content": "Write a haiku"}], stream=True ) for chunk in stream: print(chunk.choices[0].delta.content, end="")

Node.js SDK

JavaScript
npm install openai import OpenAI from 'openai'; const client = new OpenAI({ baseURL: 'https://aevon.sh/v1', apiKey: 'sx_your_key_here', }); // Switch any model instantly const models = ['claude-sonnet-4-5', 'gpt-4o', 'deepseek-r1']; for (const model of models) { const res = await client.chat.completions.create({ model, messages: [{ role: 'user', content: 'Hello' }], }); console.log(model, res.choices[0].message.content); }

Error Codes

CodeMeaningFix
401 Unauthorized Invalid or missing API key.
402 Insufficient Credits Top up your balance at aevon.sh/billing.
429 Rate Limited Slow down requests or upgrade plan.
500 Gateway Error Provider issue. Retry or switch models.

Credits & Billing

Aevon is usage-based — you're charged per token consumed, not per month. Free tier models consume zero credits.

$100 free on signup. No credit card needed. Covers approximately 1–2M tokens depending on the model used.

Rate Limits

TierRequests/minTokens/min
Free 20 RPM 100K TPM
Credits 60 RPM 500K TPM
Pro 300 RPM 2M TPM