Overview
SaveGate provides access to 50+ state-of-the-art AI models from leading providers. All models are accessible through a unified API using the same authentication and request format.OpenAI Models
GPT-4 Family
GPT-4
Model ID:
gpt-4- Most capable GPT-4 model
- Best for complex reasoning
- Context: 8K tokens
GPT-4 Turbo
Model ID:
gpt-4-turbo, gpt-4-turbo-preview- Faster and cheaper than GPT-4
- Context: 128K tokens
- JSON mode support
GPT-4o
Model ID:
gpt-4o, gpt-4o-mini- Multimodal (text + vision)
- Fast and efficient
- Context: 128K tokens
O1 Series
Model IDs:
o1-preview, o1-mini- Advanced reasoning
- Best for complex problems
- Extended thinking time
GPT-3.5 Family
| Model ID | Context | Best For |
|---|---|---|
gpt-3.5-turbo | 16K | General tasks, cost-effective |
gpt-3.5-turbo-16k | 16K | Longer conversations |
Anthropic Models
Claude 3 Family
Claude 3.5 Sonnet
Model ID:
claude-3-5-sonnet-20241022- Latest and most capable
- Best balance of speed/capability
- Context: 200K tokens
- Excellent for coding
Claude 3 Opus
Model ID:
claude-3-opus-20240229- Most capable Claude model
- Best for complex tasks
- Context: 200K tokens
Claude 3 Sonnet
Model ID:
claude-3-sonnet-20240229- Balanced performance
- Good for most tasks
- Context: 200K tokens
Claude 3 Haiku
Model ID:
claude-3-haiku-20240307- Fastest Claude model
- Most cost-effective
- Context: 200K tokens
Google Models
Gemini Family
Gemini 1.5 Pro
Model ID:
gemini-1.5-pro- Multimodal capabilities
- Context: 1M tokens
- Best for long context
Gemini 1.5 Flash
Model ID:
gemini-1.5-flash- Fast and efficient
- Context: 1M tokens
- Cost-effective
Gemini Pro
Model ID:
gemini-pro- General purpose
- Good performance
- Reliable choice
Gemini Pro Vision
Model ID:
gemini-pro-vision- Multimodal (text + images)
- Vision understanding
- Creative tasks
Meta Models
Llama 3 Family
| Model ID | Size | Context | Best For |
|---|---|---|---|
meta-llama/llama-3-70b-instruct | 70B | 8K | General tasks, reasoning |
meta-llama/llama-3-8b-instruct | 8B | 8K | Fast, cost-effective |
meta-llama/llama-2-70b-chat | 70B | 4K | Conversations |
meta-llama/llama-2-13b-chat | 13B | 4K | Lightweight chat |
Mistral Models
Mistral Large
Model ID:
mistral-large-latest- Most capable Mistral model
- Multilingual support
- Function calling
Mistral Medium
Model ID:
mistral-medium-latest- Balanced performance
- Good for most tasks
- Cost-effective
Mistral Small
Model ID:
mistral-small-latest- Fast and lightweight
- Simple tasks
- Most affordable
Mixtral 8x7B
Model ID:
mixtral-8x7b-instruct- Mixture of Experts
- Strong performance
- Open weights
Model Selection Guide
By Use Case
- Coding
- Writing
- Analysis
- Chat
- Long Context
Best Models:
claude-3-5-sonnet-20241022- Best overallgpt-4-turbo- Strong alternativeclaude-3-opus-20240229- Complex problems
By Budget
Premium (Best Performance)
Premium (Best Performance)
Balanced (Good Value)
Balanced (Good Value)
claude-3-5-sonnet-20241022- Best overall valuegpt-4-turbo- Fast and capablegemini-1.5-pro- Long context
Budget (Cost-Effective)
Budget (Cost-Effective)
gpt-3.5-turbo- Fast and cheapclaude-3-haiku-20240307- Fastest Claudegemini-1.5-flash- Fast Geminimistral-small-latest- Affordable
Model Capabilities
Function Calling
Models that support function/tool calling:- ✅ All GPT-4 models
- ✅ GPT-3.5 Turbo
- ✅ Claude 3 family
- ✅ Mistral Large
- ✅ Gemini Pro
Vision (Multimodal)
Models that support image understanding:- ✅ GPT-4o, GPT-4o Mini
- ✅ Claude 3 family (all models)
- ✅ Gemini Pro Vision
- ✅ Gemini 1.5 Pro
JSON Mode
Models with guaranteed JSON output:- ✅ GPT-4 Turbo
- ✅ GPT-3.5 Turbo
- ✅ Gemini 1.5 Pro
Using Models
Simply specify the model ID in your API request:Model Updates
SaveGate automatically updates model versions to the latest stable releases:gpt-4→ Latest GPT-4 versionclaude-3-5-sonnet-latest→ Latest Claude 3.5 Sonnetgemini-pro→ Latest Gemini Pro
claude-3-5-sonnet-20241022).
Check the pricing page for detailed cost information for each model.