API Docs

OpenAI-compatible. Set base_url to https://api.eastx.ai/v1 and your existing SDK code works.

Quickstart

One chat completions call — pick your favorite client.

curl https://api.eastx.ai/v1/chat/completions \
  -H "Authorization: Bearer $EASTX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-6",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'

Authentication

All endpoints require an API key in the Authorization header:

Authorization: Bearer sk-eastx-<your-key>

• Keys are issued at /keys and shown to you ONCE — store them securely.
• Optional per-key spend cap and RPM limit can be set when creating a key.
• Revoke compromised keys immediately from the same page; revocation is instant.

GET/v1/models

List models

Returns the catalog of models your account can call. Filters out coming-soon and disabled models.

Example

curl https://api.eastx.ai/v1/models \
  -H "Authorization: Bearer $EASTX_API_KEY"

• Each entry has OpenAI-shape fields (id, object, created, owned_by) plus extras: display_name, input_price_per_1m, output_price_per_1m. Most OpenAI SDKs ignore unknown fields.

GET/v1/models/{id}

Retrieve a model

Lookup a single model by id. Used by some SDKs during client initialization to validate the model name.

Example

curl https://api.eastx.ai/v1/models/claude-sonnet-4-6 \
  -H "Authorization: Bearer $EASTX_API_KEY"

• 404 model_not_found if the id is unknown or disabled.

POST/v1/chat/completions

Chat completions

OpenAI-compatible chat-completions endpoint. Supports streaming (SSE) and non-streaming. Token usage is tracked at the upstream and your balance is debited at the end of each request.

Request body

{
  "model": "claude-sonnet-4-6",
  "messages": [
    {"role": "system", "content": "You are concise."},
    {"role": "user", "content": "Explain JWT in one sentence."}
  ],
  "max_tokens": 256,
  "temperature": 0.2,
  "stream": false
}

Example

curl https://api.eastx.ai/v1/chat/completions \
  -H "Authorization: Bearer $EASTX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-6",
    "messages": [{"role":"user","content":"hi"}],
    "max_tokens": 32
  }'

• For streaming, set "stream": true and read SSE chunks until "data: [DONE]".
• Vision and tool/function calling are passed through to upstream models that support them (e.g. Claude, qwen-vl-*, GPT-class).
• Balance is pre-checked against worst-case cost (input + max_tokens × output_price). Insufficient balance returns 402.

POST/v1/embeddings

Embeddings

Generate vector embeddings for one or more strings. Only models with modality=embedding are accepted here (chat models return 400 wrong_endpoint).

Request body

{
  "model": "text-embedding-v3",
  "input": "Hello, world."
}

Example

curl https://api.eastx.ai/v1/embeddings \
  -H "Authorization: Bearer $EASTX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "text-embedding-v3",
    "input": ["hello", "world"]
  }'

• input accepts a single string or an array of strings.
• Billing is by input tokens only (output_tokens = 0).

POST/v1/images/generations

Image generation

Generate images from a text prompt. We wrap Aliyun's async task API so the client sees a synchronous response — typical latency is 3-10s. Charged per image regardless of resolution.

Request body

{
  "model": "qwen-image-plus",
  "prompt": "a red apple on a wooden table",
  "n": 1,
  "size": "1024x1024"
}

Example

curl https://api.eastx.ai/v1/images/generations \
  -H "Authorization: Bearer $EASTX_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen-image-plus",
    "prompt": "a red apple on a wooden table",
    "n": 1,
    "size": "1024x1024"
  }'

• Response shape: {"created":<ts>,"data":[{"url":"<signed-oss-url>","revised_prompt":"..."}]}.
• n is capped at 4. Sizes supported depend on the model; 1024x1024 works everywhere.
• URLs are temporary signed OSS links — download and re-host within ~1 hour.
• Internal polling deadline is 90s; long jobs return 504 upstream_timeout.

Error codes

All errors return JSON of shape {"error":{"code":"...","message":"...","type":"..."}}. HTTP status maps to error code as follows.

HTTP	error.code	When you see it
400	bad_request	Required field missing or malformed body. Inspect error.message for specifics.
400	wrong_endpoint	Model modality does not match endpoint (e.g. chat model on /v1/embeddings).
401	missing_key	No Authorization header. Send "Authorization: Bearer sk-eastx-...".
401	invalid_key	Key not found or revoked. Generate a new one at /keys.
402	insufficient_balance	Worst-case cost exceeds account balance. Top up via /recharge.
402	cap_reached	Per-key spend cap (balanceCapUsd) reached. Raise it or rotate the key.
403	disabled	API key was disabled by an admin.
403	expired	API key expiresAt is in the past.
404	model_not_found	Unknown model id. Call /v1/models to see what is available.
429	rate_limit_exceeded	Per-key requests-per-minute (RPM) limit hit. Slow down or split across keys.
502	upstream_unreachable	Couldn't connect to the upstream provider. Usually transient — retry with backoff.
502	upstream_error	Upstream returned an error. The response message has upstream details + upstream_status.
503	upstream_unavailable	All upstream tokens for the channel are in cooldown or disabled. We are aware.
504	upstream_timeout	Image generation took longer than 90s. Submit again; same prompt may succeed.

Rate limits & quotas

• Per-key RPM: each API key has a requests-per-minute limit (default 60). Exceeding it returns 429 rate_limit_exceeded. Raise the limit on a key in the /keys page.
• Account balance: every successful request is debited from your USD balance. Insufficient balance returns 402; top up via /recharge.
• Per-key spend cap: optional balanceCapUsd cuts off a key after a cumulative spend. Useful for embedding short-lived keys in client apps.
• Upstream cooldowns: if all upstream tokens for a channel are throttled or erroring, the gateway returns 503 upstream_unavailable. Retry on backoff.