Available models

We currently offer the models listed below; the selection may change or expand over time. Each model is described along with its model-specific parameters.

| Model Name | Type | Modalities | Context (Tokens) | License |
|---|---|---|---|---|
| gpt-oss-120b | Chat + reasoning | Text, tool-calling | 131,072 | Apache 2.0 |
| Ministral-3-14B-Instruct-2512 | Chat + vision | Text, image, tool-calling | 262,144 | Apache 2.0 |
| Devstral-Small-2-24B-Instruct-2512 | Chat | Text, image, tool-calling | 262,144 | Apache 2.0 |
| Qwen3-Embedding-8B | Embedding | Text to vector | 32,768 | Apache 2.0 |
| whisper-large-v3-turbo | Speech-to-Text | Audio to text | N/A (audio-based) | MIT |
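
The context limits in the table can be used as a quick pre-flight check before sending a request. The following is a minimal sketch, not part of any official client: the names `CONTEXT_TOKENS` and `fits_context` are hypothetical, the token count is assumed to come from whatever tokenizer you already use, and the idea that generated output counts against the same window is an assumption that may not hold for every model.

```python
# Context windows (in tokens) copied from the table above.
# whisper-large-v3-turbo is audio-based, so it has no token limit here.
CONTEXT_TOKENS = {
    "gpt-oss-120b": 131_072,
    "Ministral-3-14B-Instruct-2512": 262_144,
    "Devstral-Small-2-24B-Instruct-2512": 262_144,
    "Qwen3-Embedding-8B": 32_768,
    "whisper-large-v3-turbo": None,
}


def fits_context(model: str, prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt (plus reserved output) fits the model's window.

    Hypothetical helper: assumes output tokens share the context window.
    """
    limit = CONTEXT_TOKENS[model]  # raises KeyError for unknown model names
    if limit is None:
        raise ValueError(f"{model} has no token-based context window")
    return prompt_tokens + reserved_output_tokens <= limit
```

For example, `fits_context("gpt-oss-120b", 120_000, reserved_output_tokens=8_000)` checks whether a 120,000-token prompt still leaves room for an 8,000-token reply.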

Picking models

  • Start with Ministral-3-14B-Instruct-2512 for broad, scalable, cost-conscious chat and basic multimodal (text + image) workflows.
  • Upgrade to Devstral-Small-2-24B-Instruct-2512 for demanding, nuanced, business-critical tasks where top answer quality, agentic workflows, and image understanding are vital.
  • Use gpt-oss-120b for complex text-centric workloads and advanced automations when precision and vast knowledge are required.
  • Choose Qwen3-Embedding-8B for all use cases involving search, recommendation, clustering, or knowledge graph building.
  • Use whisper-large-v3-turbo for any audio transcription or voice-command needs.
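
The guidance above can be captured as a simple lookup so that application code asks for a use case rather than hard-coding a model name. This is only an illustrative sketch: the use-case keys and the `pick_model` helper are hypothetical names chosen for this example, and the mapping simply mirrors the recommendations in the list.

```python
# Use-case keys are hypothetical labels; the model names come from the list above.
RECOMMENDED_MODEL = {
    "general_chat": "Ministral-3-14B-Instruct-2512",        # broad, cost-conscious chat and basic text + image
    "high_quality_agentic": "Devstral-Small-2-24B-Instruct-2512",  # demanding, business-critical, agentic tasks
    "complex_text": "gpt-oss-120b",                         # complex text-centric workloads, advanced automations
    "embedding": "Qwen3-Embedding-8B",                      # search, recommendation, clustering, knowledge graphs
    "speech_to_text": "whisper-large-v3-turbo",             # audio transcription, voice commands
}


def pick_model(use_case: str) -> str:
    """Return the recommended model name for a use case (hypothetical helper)."""
    try:
        return RECOMMENDED_MODEL[use_case]
    except KeyError:
        known = ", ".join(sorted(RECOMMENDED_MODEL))
        raise ValueError(f"unknown use case {use_case!r}; expected one of: {known}") from None
```

Keeping the mapping in one place makes it easier to follow the recommendation to start with the smaller model and upgrade later, since only the table entry changes.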

For more information about a specific model, see the following pages: