
Overview

SaveGate provides access to 50+ state-of-the-art AI models from leading providers. All models are accessible through a unified API using the same authentication and request format.

OpenAI Models

GPT-5.1 Family (Latest)

GPT-5.1

Model ID: gpt-5.1
  • Most advanced GPT model
  • Superior reasoning capabilities
  • Enhanced multimodal understanding
  • Best for complex tasks

GPT-5.1 Mini

Model ID: gpt-5.1-mini
  • Faster and more affordable
  • Excellent performance/cost ratio
  • Good for most tasks

GPT-5.1 Reasoning

Model ID: gpt-5.1-reasoning
  • Specialized for complex reasoning
  • Extended thinking time
  • Best for analysis and problem-solving

GPT-4.2 Family

GPT-4.2

Model ID: gpt-4.2
  • Advanced capabilities
  • Improved performance
  • Great balance of speed and quality

GPT-4.2 Mini

Model ID: gpt-4.2-mini
  • Fast and efficient
  • Cost-effective
  • Good for high-volume tasks

GPT-4.2 Reasoning

Model ID: gpt-4.2-reasoning
  • Enhanced reasoning capabilities
  • Complex problem solving
  • Mathematical and logical tasks

GPT-4.1 Family

GPT-4.1

Model ID: gpt-4.1
  • Reliable performance
  • Balanced capabilities
  • Production-ready

GPT-4.1 Mini

Model ID: gpt-4.1-mini
  • Lightweight and fast
  • Good value
  • High throughput

GPT-4.1 Nano

Model ID: gpt-4.1-nano
  • Ultra-lightweight
  • Fastest response times
  • Most cost-effective

GPT-4o Family

GPT-4o

Model ID: gpt-4o
  • Multimodal (text + vision)
  • Fast and efficient
  • Context: 128K tokens

GPT-4o Mini

Model ID: gpt-4o-mini
  • Lightweight multimodal
  • Cost-effective
  • Good for simple tasks

GPT-4o Realtime

Model ID: gpt-4o-realtime
  • Real-time audio/video
  • Low latency
  • Interactive applications

GPT-4o Transcribe

Model ID: gpt-4o-transcribe
  • Audio transcription
  • High accuracy
  • Multiple languages

GPT-4o Mini Transcribe

Model ID: gpt-4o-mini-transcribe
  • Lightweight transcription
  • Fast processing
  • Cost-effective

O-Series (Reasoning Models)

O3

Model ID: o3
  • Advanced reasoning
  • Complex problem solving
  • Extended thinking time
  • Best for difficult tasks

O3 Mini

Model ID: o3-mini
  • Faster reasoning model
  • Good balance
  • Cost-effective reasoning

Classic GPT-4 Family

GPT-4

Model ID: gpt-4
  • Original GPT-4
  • Proven reliability
  • Context: 8K tokens

GPT-4 Turbo

Model ID: gpt-4-turbo
  • Faster than GPT-4
  • Context: 128K tokens
  • JSON mode support

GPT-3.5 Family

Model ID        Context   Best For
gpt-3.5-turbo   16K       General tasks, cost-effective

Whisper Models

Whisper Large v3

Model ID: whisper-large-v3
  • State-of-the-art speech recognition
  • 99 languages supported
  • High accuracy transcription
  • Timestamp support

Anthropic Claude Models

Claude 4.5 Family (Latest)

Claude Opus 4.5

Model ID: claude-opus-4.5
  • Most capable Claude model
  • Superior reasoning and analysis
  • Context: 200K tokens
  • Best for complex tasks

Claude Sonnet 4.5

Model ID: claude-sonnet-4.5
  • Latest Sonnet version
  • Excellent balance
  • Fast and capable
  • Great for coding

Claude 4.1 Family

Claude Opus 4.1

Model ID: claude-opus-4.1
  • High capability model
  • Advanced reasoning
  • Context: 200K tokens
  • Complex problem solving

Claude 4 Family

Claude Opus 4

Model ID: claude-opus-4
  • Powerful capabilities
  • Long context
  • Reliable performance

Claude Sonnet 4

Model ID: claude-sonnet-4
  • Balanced model
  • Good speed
  • Versatile

Claude 3.7 Family

Claude 3.7 Sonnet

Model ID: claude-3.7-sonnet
  • Enhanced Sonnet
  • Improved capabilities
  • Context: 200K tokens
  • Great for coding and analysis

Claude 3.5 Family

Claude 3.5 Sonnet

Model ID: claude-3.5-sonnet
  • Best balance of speed/capability
  • Context: 200K tokens
  • Excellent for coding

Claude 3.5 Haiku

Model ID: claude-3.5-haiku
  • Fastest Claude 3.5
  • Cost-effective
  • Good for simple tasks

Claude 3 Family

Claude 3 Opus

Model ID: claude-3-opus
  • Most capable Claude 3
  • Context: 200K tokens
  • Complex tasks

Claude 3 Sonnet

Model ID: claude-3-sonnet
  • Balanced performance
  • Good for most tasks
  • Context: 200K tokens

Claude 3 Haiku

Model ID: claude-3-haiku
  • Fastest Claude 3
  • Most cost-effective
  • Context: 200K tokens

Claude 2 Family

Model ID             Context   Best For
claude-2.1           200K      Long documents, analysis
claude-2             100K      General tasks
claude-instant-1.2   100K      Fast responses, high volume

Model Selection Guide

By Use Case

Best models for coding:
  1. claude-sonnet-4.5 - Best overall for coding
  2. gpt-5.1 - Advanced code generation
  3. claude-3.7-sonnet - Excellent code understanding
  4. gpt-4.2 - Strong alternative
Why: These models combine advanced reasoning with strong code understanding and generation.

By Budget

  • gpt-5.1 - Most advanced GPT
  • gpt-5.1-reasoning - Advanced reasoning
  • claude-opus-4.5 - Most capable Claude
  • o3 - Complex problem solving
Use when: Quality is critical, complex tasks

  • gpt-5.1-mini - Great performance/cost
  • claude-sonnet-4.5 - Best overall value
  • gpt-4.2 - Fast and capable
  • claude-3.7-sonnet - Excellent coding
Use when: Need quality at reasonable cost

  • gpt-4.1-nano - Ultra-lightweight
  • gpt-3.5-turbo - Fast and cheap
  • claude-3.5-haiku - Fastest Claude
  • gpt-4.2-mini - Good value
Use when: Simple tasks, high volume
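
The three budget groups above can be captured in a small lookup so that application code chooses a model by tier rather than hard-coding an ID. This is a minimal sketch: the tier names (premium, balanced, budget) and the pick_model helper are illustrative and not part of the SaveGate API, and list order only mirrors the order on this page, not a ranking.

```python
# Tier names are illustrative labels; the model IDs are copied from the lists above.
MODELS_BY_TIER = {
    "premium": ["gpt-5.1", "gpt-5.1-reasoning", "claude-opus-4.5", "o3"],
    "balanced": ["gpt-5.1-mini", "claude-sonnet-4.5", "gpt-4.2", "claude-3.7-sonnet"],
    "budget": ["gpt-4.1-nano", "gpt-3.5-turbo", "claude-3.5-haiku", "gpt-4.2-mini"],
}


def pick_model(tier: str, provider_prefix: str = "") -> str:
    """Return the first listed model in a tier whose ID starts with provider_prefix."""
    for model_id in MODELS_BY_TIER[tier]:
        if model_id.startswith(provider_prefix):
            return model_id
    raise ValueError(f"no model in tier {tier!r} matches prefix {provider_prefix!r}")


print(pick_model("premium"))           # gpt-5.1
print(pick_model("budget", "claude"))  # claude-3.5-haiku
```

Keeping the mapping in one place makes it easy to swap a tier's default model without touching call sites.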

Model Capabilities

Function Calling

Models that support function/tool calling:
  • ✅ All GPT-5.1, GPT-4.2, GPT-4.1 models
  • ✅ GPT-4o, GPT-4o Mini
  • ✅ GPT-4, GPT-4 Turbo
  • ✅ GPT-3.5 Turbo
  • ✅ All Claude 4.5, 4.1, 4, 3.7, 3.5, 3 models
  • ✅ Claude 2.1
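
As a rough illustration of what a function-calling request can look like, the sketch below builds a tool definition in the OpenAI-style tools format. It assumes SaveGate accepts that format for the models listed above, which this page does not confirm. The get_weather tool and the run_tool_call helper are hypothetical names used only for illustration.

```python
import json

# Hypothetical tool: get_weather is an illustrative name, not a SaveGate built-in.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# Request body in the OpenAI-style chat format (assumed to be accepted as-is).
request = {
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [weather_tool],
}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Dispatch a tool call returned by the model to local code."""
    args = json.loads(arguments_json)  # the model sends arguments as a JSON string
    if name == "get_weather":
        return f"Weather lookup requested for {args['city']}"  # stub, no real lookup
    raise ValueError(f"unknown tool: {name}")
```

In a real exchange the tool name and JSON arguments would come from the model's response, and the tool result would be sent back in a follow-up message.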

Vision (Multimodal)

Models that support image understanding:
  • ✅ GPT-5.1 family
  • ✅ GPT-4.2, GPT-4.1
  • ✅ GPT-4o, GPT-4o Mini, GPT-4o Realtime
  • ✅ All Claude 4.5, 4.1, 4 models
  • ✅ Claude 3.7, 3.5, 3 family
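
For the vision-capable models above, an image is typically sent as part of the user message. The sketch below builds such a message using OpenAI-style content parts with a base64 data URL. That SaveGate forwards this format unchanged to every vision model listed is an assumption, and image_message is a hypothetical helper name.

```python
import base64


def image_message(prompt: str, image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build a user message that pairs a text prompt with an inline image."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {
                "type": "image_url",
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }


# Stand-in bytes (the PNG file signature only), not a valid image.
placeholder_bytes = b"\x89PNG\r\n\x1a\n"

vision_request = {
    "model": "gpt-4o",
    "messages": [image_message("What is in this image?", placeholder_bytes)],
}
```

In practice the bytes would come from reading an image file in binary mode.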

Audio & Transcription

Models with audio capabilities:
  • whisper-large-v3 - Speech recognition
  • gpt-4o-transcribe - Audio transcription
  • gpt-4o-mini-transcribe - Fast transcription
  • gpt-4o-realtime - Real-time audio/video
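
A transcription call might look like the sketch below. It assumes an OpenAI-compatible Python client whose audio.transcriptions.create method is reachable through SaveGate, which this page does not confirm. The transcribe helper is a hypothetical wrapper.

```python
def transcribe(client, audio_path: str, model: str = "whisper-large-v3") -> str:
    """Transcribe one audio file and return the recognized text.

    `client` is assumed to be an already-configured OpenAI-compatible client.
    """
    with open(audio_path, "rb") as audio_file:
        result = client.audio.transcriptions.create(model=model, file=audio_file)
    return result.text
```

Passing model="gpt-4o-transcribe" or model="gpt-4o-mini-transcribe" would select one of the other transcription models listed above.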

Reasoning

Models specialized for complex reasoning:
  • gpt-5.1-reasoning - Advanced reasoning
  • gpt-4.2-reasoning - Enhanced reasoning
  • o3 - Complex problem solving
  • o3-mini - Efficient reasoning
  • Claude Opus models - Strong reasoning

Using Models

To use a model, pass its model ID as the model parameter of your API request:
# Assumes client is an already-configured API client.
response = client.chat.completions.create(
    model="gpt-5.1",  # Any model ID from this page
    messages=[{"role": "user", "content": "Hello!"}]
)
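
A fuller sketch of the same request is shown below, assuming the client is the OpenAI Python SDK pointed at SaveGate. The base URL is a placeholder and the environment variable names SAVEGATE_BASE_URL and SAVEGATE_API_KEY are hypothetical. Use the values from your own account instead.

```python
import os

# Placeholder endpoint and hypothetical variable names, not confirmed SaveGate values.
BASE_URL = os.environ.get("SAVEGATE_BASE_URL", "https://example.invalid/v1")


def build_chat_request(model: str, prompt: str) -> dict:
    """Return keyword arguments for client.chat.completions.create()."""
    return {
        "model": model,  # any model ID from this page
        "messages": [{"role": "user", "content": prompt}],
    }


def send(model: str, prompt: str) -> str:
    """Send one prompt and return the reply text (requires `pip install openai`)."""
    from openai import OpenAI  # imported lazily so the helper above works without it

    client = OpenAI(base_url=BASE_URL, api_key=os.environ["SAVEGATE_API_KEY"])
    response = client.chat.completions.create(**build_chat_request(model, prompt))
    return response.choices[0].message.content


# Switching providers only changes the model ID; the request shape stays the same.
gpt_request = build_chat_request("gpt-5.1", "Hello!")
claude_request = build_chat_request("claude-sonnet-4.5", "Hello!")
```

Calling send("gpt-5.1", "Hello!") would issue the request once real credentials and a real base URL are set.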

Model Updates

SaveGate automatically updates model versions to the latest stable releases. To pin a version, use a version-specific model ID where one is available.
See the pricing page for detailed cost information for each model.