Gateway API

OpenAI-compatible Gateway endpoints for chat, embeddings, and images.

Chat Completions

Base: POST /v1/chat/completions · Auth: API Key

json
{
  "model": "gpt-4o",
  "messages": [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain quantum computing briefly."}
  ],
  "max_tokens": 500,
  "temperature": 0.7
}

Streaming

Set stream: true for Server-Sent Events response:

bash
curl -N -X POST "https://<BASE_URL>/v1/chat/completions" \
  -H "Authorization: Bearer <API_KEY>" \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4o", "messages": [{"role":"user","content":"Hi"}], "stream": true}'

Models

Available models depend on your tenant configuration. Common models include gpt-4o, claude-3-5-sonnet, gemini-pro, deepseek-chat.

Rate Limits

Rate limits are enforced per API key and plan tier. 429 responses include a Retry-After header.