Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.savegate.ai/llms.txt

Use this file to discover all available pages before exploring further.

Overview

SaveGate provides access to 208+ AI models from leading providers. All models are accessible through a unified API using the same authentication and request format. Pricing shown is per 1M tokens (input/output).

OpenAI Models

Chat Models

Model IDMax InputMax OutputInput/1MOutput/1M
gpt-5.41.1M128K$2.50$15
gpt-5.4-2026-03-051.1M128K$2.50$15
gpt-5.4-mini272K128K$0.25$2
gpt-5.4-nano272K128K$0.05$0.40
gpt-5.3-chat-latest128K16.4K$1.75$14
gpt-5.2272K128K$1.75$14
gpt-5.2-chat-latest128K16.4K$1.75$14
gpt-5.2-2025-12-11272K128K$1.75$14
gpt-5.1272K128K$1.25$10
gpt-5.1-chat-latest128K16.4K$1.25$10
gpt-5.1-2025-11-13272K128K$1.25$10
gpt-5272K128K$1.25$10
gpt-5-chat128K16.4K$1.25$10
gpt-5-chat-latest128K16.4K$1.25$10
gpt-5-search-api272K128K$1.25$10
gpt-5-2025-08-07272K128K$1.25$10
gpt-5-search-api-2025-10-14272K128K$1.25$10
gpt-5-mini272K128K$0.25$2
gpt-5-mini-2025-08-07272K128K$0.25$2
gpt-5-nano272K128K$0.05$0.40
gpt-5-nano-2025-08-07272K128K$0.05$0.40
o4-mini200K100K$1.10$4.40
o4-mini-2025-04-16200K100K$1.10$4.40
chatgpt-4o-latest128K4.1K$5$15
gpt-4o128K16.4K$2.50$10
gpt-4o-search-preview128K16.4K$2.50$10
gpt-4o-2024-05-13128K4.1K$5$15
gpt-4o-2024-08-06128K16.4K$2.50$10
gpt-4o-2024-11-20128K16.4K$2.50$10
gpt-4o-search-preview-2025-03-11128K16.4K$2.50$10
gpt-4o-mini128K16.4K$0.15$0.60
gpt-4o-mini-search-preview128K16.4K$0.15$0.60
gpt-4o-mini-2024-07-18128K16.4K$0.15$0.60
gpt-4o-mini-search-preview-2025-03-11128K16.4K$0.15$0.60
gpt-4-turbo128K4.1K$10$30
gpt-4.11.0M32.8K$2$8
gpt-4-turbo-2024-04-09128K4.1K$10$30
gpt-4.1-2025-04-141.0M32.8K$2$8
gpt-4.1-mini1.0M32.8K$0.40$1.60
gpt-4.1-mini-2025-04-141.0M32.8K$0.40$1.60
gpt-4.1-nano1.0M32.8K$0.10$0.40
gpt-4.1-nano-2025-04-141.0M32.8K$0.10$0.40
gpt-48.2K4.1K$30$60
gpt-4-06138.2K4.1K$30$60
o3200K100K$2$8
o3-2025-04-16200K100K$2$8
o3-mini200K100K$1.10$4.40
o3-mini-2025-01-31200K100K$1.10$4.40
gpt-3.5-turbo16.4K4.1K$0.50$1.50
o1200K100K$15$60
o1-2024-12-17200K100K$15$60
gpt-realtime32K4.1K$4$16
gpt-realtime-1.532K4.1K$4$16
gpt-realtime-2025-08-2832K4.1K$4$16
gpt-realtime-mini128K4.1K$0.60$2.40
gpt-realtime-mini-2025-10-06128K4.1K$0.60$2.40
gpt-realtime-mini-2025-12-15128K4.1K$0.60$2.40

Responses Models

Model IDMax InputMax OutputInput/1MOutput/1M
gpt-5.4-pro1.1M128K$30$180
gpt-5.4-pro-2026-03-051.1M128K$30$180
gpt-5.3-codex272K128K$1.75$14
gpt-5.2-pro272K128K$21$168
gpt-5.2-pro-2025-12-11272K128K$21$168
gpt-5.2-codex272K128K$1.75$14
gpt-5.1-codex272K128K$1.25$10
gpt-5.1-codex-max272K128K$1.25$10
gpt-5.1-codex-mini272K128K$0.25$2
gpt-5-pro128K272K$15$120
gpt-5-pro-2025-10-06128K272K$15$120
gpt-5-codex272K128K$1.25$10
o4-mini-deep-research200K100K$2$8
o4-mini-deep-research-2025-06-26200K100K$2$8
o3-pro200K100K$20$80
o3-pro-2025-06-10200K100K$20$80
o3-deep-research200K100K$10$40
o3-deep-research-2025-06-26200K100K$10$40
o1-pro200K100K$150$600
o1-pro-2025-03-19200K100K$150$600
codex-mini-latest200K100K$1.50$6

Embedding Models

Model IDMax InputMax OutputInput/1MOutput/1M
text-embedding-3-large8.2K-$0.13-
text-embedding-3-small8.2K-$0.02-
text-embedding-ada-0028.2K-$0.10-
text-embedding-ada-002-v28.2K-$0.10-

Image Generation Models

Model IDMax InputMax OutputInput/1MOutput/1M
chatgpt-image-latest--$5-
standard/1024-x-1024/dall-e-3----
standard/1024-x-1792/dall-e-3----
standard/1792-x-1024/dall-e-3----
gpt-image-1--$5-
gpt-image-1.5--$5$10
gpt-image-1.5-2025-12-16--$5$10
gpt-image-1-mini--$2-
Image generation models also support resolution and quality variants (e.g., hd/1024-x-1024/dall-e-3). Use the /v1/models API endpoint to see all available variants.

Audio Speech Models

Model IDMax InputMax OutputInput/1MOutput/1M
gpt-4o-mini-tts--$2.50$10
gpt-4o-mini-tts-2025-03-20--$2.50$10
gpt-4o-mini-tts-2025-12-15--$2.50$10
tts-1----
tts-1-1106----
tts-1-hd----
tts-1-hd-1106----

Audio Transcription Models

Model IDMax InputMax OutputInput/1MOutput/1M
gpt-4o-transcribe16K2K$2.50$10
gpt-4o-transcribe-diarize16K2K$2.50$10
gpt-4o-mini-transcribe16K2K$1.25$5
gpt-4o-mini-transcribe-2025-03-2016K2K$1.25$5
gpt-4o-mini-transcribe-2025-12-1516K2K$1.25$5
whisper-1----

Video Generation Models

Model IDMax InputMax OutputInput/1MOutput/1M
sora-2-pro----
sora-2-pro-high-res----
sora-2----

Completion Models

Model IDMax InputMax OutputInput/1MOutput/1M
babbage-00216.4K4.1K$0.40$0.40

Moderation Models

Model IDMax InputMax OutputInput/1MOutput/1M
omni-moderation-latest32.8K0--
text-moderation-00732.8K0--
text-moderation-latest32.8K0--
text-moderation-stable32.8K0--
omni-moderation-2024-09-2632.8K0--

Anthropic Claude Models

Chat Models

Model IDMax InputMax OutputInput/1MOutput/1M
claude-opus-4-71M128K$5$25
claude-opus-4-61M128K$5$25
claude-sonnet-4-61M64K$3$15
claude-opus-4-5200K64K$5$25
claude-opus-4-5-20251101200K64K$5$25
claude-sonnet-4-5200K64K$3$15
claude-sonnet-4-5-20250929200K64K$3$15
claude-haiku-4-5200K64K$1$5
claude-haiku-4-5-20251001200K64K$1$5
claude-opus-4-20250514200K32K$15$75
claude-sonnet-4-202505141M64K$3$15
claude-opus-4-1200K32K$15$75
claude-opus-4-1-20250805200K32K$15$75
claude-4-opus-20250514200K32K$15$75
claude-4-sonnet-202505141M64K$3$15
claude-3-haiku-20240307200K4.1K$0.25$1.25

Google Gemini Models

Chat Models (OpenAI-Compatible)

Chat models are accessed via the OpenAI-compatible endpoint. Use the model IDs below directly — no prefix needed.
Model IDMax InputMax OutputInput/1MOutput/1M
gemini-2.5-pro1.0M64K$1.88$15
gemini-2.5-pro-thinking1.0M64K$1.88$15
gemini-2.5-pro-nothinking1.0M64K$1.88$15
gemini-2.5-flash1.0M64K$0.45$3.75
gemini-2.5-flash-nothinking1.0M64K$0.45$3.75

Image Generation Models (Native Gemini API)

Image generation models use the native Google Gemini API — not the OpenAI-compatible endpoint. Use the Google Gemini SDK with your SaveGate API key passed via the x-goog-api-key header.
Model IDCost per Image
gemini-3-pro-image-preview-4k$0.30
Example: Generate an image
curl -s 'https://api.savegate.ai/v1beta/models/gemini-3-pro-image-preview-4k:generateContent' \
  -H 'Content-Type: application/json' \
  -H 'x-goog-api-key: sg-xxx' \
  -d '{
    "contents": [
      {
        "parts": [
          {
            "text": "A futuristic city at sunset"
          }
        ]
      }
    ]
  }' \
| jq -r '.candidates[0].content.parts[0].inlineData.data' \
| base64 --decode > output.png

ElevenLabs Models

Text-to-Speech Models

Model IDModePricing
elevenlabs/eleven_v3Audio Speech$0.18 / 1K characters

Speech-to-Text Models

Model IDModePricing
elevenlabs/scribe_v1_experimentalAudio Transcription$0.22 / hour

Using Models

Simply specify the model ID in your API request:
response = client.chat.completions.create(
    model="gpt-5.4",  # Specify model here
    messages=[{"role": "user", "content": "Hello!"}]
)

Model Updates

SaveGate automatically updates model versions to the latest stable releases. For version pinning, use specific model IDs (with date suffix) when available.
Use the /v1/models API endpoint to get the full list of available models with their capabilities and pricing information in real-time.