Providers
The AI Gateway supports 16 AI providers with 213 certified models. All providers use a unified API interface.
Provider Summary
| Provider | Models | Capabilities |
|---|---|---|
| OpenAI | 45 | Chat, Embeddings, TTS, STT, Images |
| Anthropic | 9 | Chat with Vision |
| Google (Gemini) | 41 | Chat, Embeddings, TTS, Images, Video |
| Mistral | 54 | Chat, Embeddings, STT, Moderation |
| Cohere | 16 | Chat, Embeddings, Reranking |
| Grok (xAI) | 10 | Chat, Images |
| DeepSeek | 2 | Chat, Reasoning |
| Groq | - | Fast inference (uses certified models) |
| ElevenLabs | 12 | TTS, Music, Sound Effects |
| Mubert | - | Music Generation |
| Stability AI | 1 | Image Generation |
| Black Forest Labs | 11 | Image Generation (FLUX) |
| Runway | 10 | Video Generation |
| Luma AI | 2 | Video Generation |
| Custom/vLLM | - | Self-hosted models via vLLM |
| Kubernetes | - | Self-hosted models on Kubernetes |
OpenAI
Industry-leading models for chat, embeddings, audio, and image generation.
Capabilities: Chat, Embeddings, Text-to-Speech, Speech-to-Text, Image Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
openai/gpt-4o | Chat | General-purpose, vision, function calling |
openai/gpt-4o-mini | Chat | Fast, cost-effective tasks |
openai/o1 | Chat | Complex reasoning tasks |
openai/text-embedding-3-large | Embedding | High-quality embeddings |
openai/tts-1-hd | TTS | High-quality speech synthesis |
openai/whisper-1 | STT | Audio transcription |
openai/dall-e-3 | Image | High-quality image generation |
Anthropic
Claude models with excellent reasoning, coding, and long-context capabilities.
Capabilities: Chat with Vision, Function Calling
Featured Models
| Model | Type | Best For |
|---|---|---|
anthropic/claude-sonnet-4-20250514 | Chat | Balanced performance and speed |
anthropic/claude-opus-4-20250514 | Chat | Most capable, complex tasks |
anthropic/claude-3-5-sonnet-latest | Chat | Previous generation, still excellent |
anthropic/claude-3-5-haiku-latest | Chat | Fast, simple tasks |
Google (Gemini)
Multimodal models with massive context windows and diverse capabilities.
Capabilities: Chat, Embeddings, TTS, Image Generation, Video Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
gemini/gemini-2.0-flash | Chat | Fast multimodal processing |
gemini/gemini-1.5-pro | Chat | 2M context window |
gemini/gemini-1.5-flash | Chat | Fast, cost-effective |
gemini/text-embedding-004 | Embedding | Text embeddings |
gemini/imagen-3.0-generate-002 | Image | High-quality images |
gemini/veo-2 | Video | Video generation |
Note: In the backend provider enum, Google models use the gemini provider identifier. When configuring models via the API, use gemini as the provider value.
Mistral
European AI with strong multilingual and code capabilities.
Capabilities: Chat, Embeddings, Speech-to-Text, Moderation
Featured Models
| Model | Type | Best For |
|---|---|---|
mistral/mistral-large-latest | Chat | Most capable Mistral model |
mistral/mistral-small-latest | Chat | Fast, cost-effective |
mistral/pixtral-large-latest | Chat | Vision capabilities |
mistral/codestral-latest | Chat | Code generation |
mistral/mistral-embed | Embedding | Text embeddings |
Cohere
Enterprise-focused models with RAG and reranking specialization.
Capabilities: Chat, Embeddings, Reranking
Featured Models
| Model | Type | Best For |
|---|---|---|
cohere/command-r-plus | Chat | Complex tasks with RAG |
cohere/command-r | Chat | General chat with RAG |
cohere/command-a-03-2025 | Chat | Reasoning capabilities |
cohere/embed-english-v3.0 | Embedding | English text embeddings |
cohere/embed-multilingual-v3.0 | Embedding | Multilingual embeddings |
cohere/rerank-v3.5 | Rerank | Search result reranking |
Grok (xAI)
xAI's models with real-time knowledge and image generation.
Capabilities: Chat, Vision, Image Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
grok/grok-3 | Chat | Most capable Grok model |
grok/grok-3-fast | Chat | Fast inference |
grok/grok-3-mini | Chat | Efficient, smaller model |
grok/grok-2-latest | Chat | Vision capabilities |
grok/aurora | Image | Image generation |
DeepSeek
Chinese AI with strong reasoning and coding capabilities.
Capabilities: Chat, Reasoning
Featured Models
| Model | Type | Best For |
|---|---|---|
deepseek/deepseek-chat | Chat | General chat, coding |
deepseek/deepseek-reasoner | Chat | Complex reasoning tasks |
ElevenLabs
Industry-leading voice synthesis and audio generation.
Capabilities: Text-to-Speech, Music Generation, Sound Effects
Featured Models
| Model | Type | Best For |
|---|---|---|
elevenlabs/eleven_multilingual_v2 | TTS | 29-language voice synthesis |
elevenlabs/eleven_turbo_v2_5 | TTS | Fast, low-latency TTS |
elevenlabs/eleven_flash_v2_5 | TTS | Ultra-fast TTS |
elevenlabs/eleven_music_v1 | Music | Music generation |
elevenlabs/eleven_sfx_v1 | Audio | Sound effects |
Mubert
AI-powered music generation platform.
Capabilities: Music Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
mubert/mubert-render | Music | AI-generated music and soundtracks |
Stability AI
Open-source image generation models.
Capabilities: Image Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
stability/stable-diffusion-3.5-large | Image | High-quality image generation |
Black Forest Labs
FLUX models for state-of-the-art image generation.
Capabilities: Image Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
bfl/flux-2 | Image | Latest FLUX model |
bfl/flux-1.1-pro | Image | Professional quality |
bfl/flux-1.1-pro-ultra | Image | Ultra-high quality |
bfl/flux-1-dev | Image | Development/testing |
bfl/flux-1-schnell | Image | Fast generation |
Runway
Video generation for creative applications.
Capabilities: Video Generation, Image Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
runway/gen4-turbo | Video | Latest Runway model |
runway/gen3a-turbo | Video | Fast video generation |
runway/gen3a-turbo-img2vid | Video | Image-to-video |
Luma AI
AI video generation with Ray models.
Capabilities: Video Generation
Featured Models
| Model | Type | Best For |
|---|---|---|
luma/ray-2 | Video | High-quality video |
luma/ray-flash-2 | Video | Fast video generation |
Custom / vLLM / Kubernetes
Self-hosted models deployed on your own infrastructure.
Provider Identifiers:
custom- Generic custom provider with configurable endpointvllm- vLLM-based inference serverskubernetes- Models deployed on Kubernetes via the deployment system
Capabilities: Varies based on the model deployed
These providers are used for self-hosted model deployments. See Deploying Models for details on deploying your own models.
Using Models
When calling the AI Gateway, use the Strongly-generated model ID - the unique identifier assigned when a model is added to your account.
// Get your model ID from your configured models list
const modelId = '507f1f77bcf86cd799439011';
const response = await fetch('https://ai-gateway.strongly.ai/v1/chat/completions', {
method: 'POST',
headers: {
'Content-Type': 'application/json',
'X-User-Id': userId,
'X-App-Id': appId // or X-Workflow-Id or X-Workspace-Id
},
body: JSON.stringify({
model: modelId, // Strongly-generated model ID
messages: [{ role: 'user', content: 'Hello!' }]
})
});
Switching Providers
To switch between providers, use the Strongly ID for the model you want to use:
// Each model has its own unique Strongly ID
const openaiModel = '507f1f77bcf86cd799439011'; // Your GPT-4o instance
const anthropicModel = '507f1f77bcf86cd799439012'; // Your Claude instance
const googleModel = '507f1f77bcf86cd799439013'; // Your Gemini instance
// Use any model by passing its Strongly ID
const model = openaiModel;
Note: The same vendor model (e.g., GPT-4o) can be configured multiple times with different API keys. Each configuration gets a unique Strongly ID.