40+ model providers
99.98% uptime, 90-day
278 ms median TTFT, Beijing → GPT-4o
12B+ tokens routed / month
One gateway. Forty providers. Wired up in minutes.
We don't rewrite your prompts and we don't cache your responses. We route each request to the original vendor API, just faster and without the payment headache.
Change one line. Ship.
Keep the SDK you already use. LLMoxy passes through to the upstream vendor — your code barely changes.
- 1. Sign up with Alipay or WeChat. 0 minutes. No card required.
- 2. Copy your API key. Scoped, rate-limitable, revocable.
- 3. Point your SDK at llmoxy.com. Every request shows up in the console, priced to the cent.
from openai import OpenAI

# Only the base URL and the key change; the rest is the stock OpenAI SDK.
client = OpenAI(
    base_url="https://llmoxy.com/v1",
    api_key="sk-llmoxy-...",
)

# The model name selects the upstream vendor the request is routed to.
resp = client.chat.completions.create(
    model="claude-3.5-sonnet",
    messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)

Why not just call the vendor directly?
If you're building from APAC, serving APAC users, or juggling more than one vendor, direct is the painful path.
Compared: LLMoxy (best for APAC), OpenRouter, Vendor direct, Self-hosted proxy.
Features compared:
- Alipay / WeChat Pay
- APAC-direct routing (OpenRouter: Partial)
- Anthropic native /v1/messages
- Per-token upstream pass-through
- Self-hostable console
- No monthly minimum
- 40+ models, one key (Vendor direct: One vendor)
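As a rough illustration of the "Anthropic native /v1/messages" row, here is a minimal sketch of what a native Messages API request through the gateway might look like. The header names and body shape follow Anthropic's public Messages API. The URL `https://llmoxy.com/v1/messages`, the use of an LLMoxy key in `x-api-key`, and the model name (reused from the SDK example above) are assumptions, not details confirmed on this page. The sketch only builds the request and does not send it.

```python
import json
import urllib.request

# Assumption: the gateway serves the Anthropic Messages API at this URL
# and accepts an LLMoxy key in the x-api-key header.
LLMOXY_MESSAGES_URL = "https://llmoxy.com/v1/messages"


def build_messages_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) an Anthropic-style Messages API request."""
    payload = {
        # Assumption: model name copied from the OpenAI-compatible example;
        # the native route may expect a different identifier.
        "model": "claude-3.5-sonnet",
        "max_tokens": 256,  # required by the Messages API
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        LLMOXY_MESSAGES_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "x-api-key": api_key,
            "anthropic-version": "2023-06-01",
            "content-type": "application/json",
        },
        method="POST",
    )


req = build_messages_request("sk-llmoxy-...", "Hello")
print(req.get_method(), req.full_url)
```

To actually send it, pass `req` to `urllib.request.urlopen` with a real key; in practice most callers would use an SDK with its base URL overridden instead.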
Pay-as-you-go. No monthly minimum.
LLMoxy charges per token, tracking the upstream provider rate. Start free, top up as you grow.
Frequently asked.
Pay-as-you-go per token, tracking upstream provider rates. No monthly minimum, no commitment. Every request surfaces input/output token counts and the exact upstream cost in the console.
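To make per-token billing concrete, here is a small sketch of how the cost of one request could be estimated from its input and output token counts. It assumes cost is linear in tokens with separate input and output rates, which is how most upstream vendors publish prices. The rates below are hypothetical placeholders, not LLMoxy's or any vendor's actual prices, and `request_cost` is an illustrative helper, not part of any SDK.

```python
from decimal import Decimal

# Hypothetical rates in USD per million tokens; real upstream rates vary by model.
INPUT_RATE_PER_MTOK = Decimal("3.00")
OUTPUT_RATE_PER_MTOK = Decimal("15.00")
MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts."""
    input_cost = Decimal(input_tokens) * INPUT_RATE_PER_MTOK
    output_cost = Decimal(output_tokens) * OUTPUT_RATE_PER_MTOK
    return (input_cost + output_cost) / MILLION


# Example: 1,200 input tokens and 350 output tokens.
cost = request_cost(1200, 350)
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of floats so that sub-cent amounts add up exactly when many requests are summed.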
