AI Models

Explore our comprehensive collection of AI models from leading providers. Find the perfect model for your needs.

anthropic
Claude Haiku 4.5
Fast, efficient, and surprisingly capable?Haiku delivers near-flagship performance at budget-friendly pricing. This model excels at coding, agent workflows, and computer use tasks with Claude's characteristic helpfulness and safety. With 200K context and support for files and images, it's perfect for production deployments, sub-agent systems, and any scenario where you need reliable intelligence without the premium cost. Lightning-fast response times make it ideal for real-time applications.
anthropic
Claude Opus 4.1
The previous Opus generation delivering superior coding and agentic performance with 74.5% on SWE-bench Verified. Excellent for complex multi-step problems requiring rigor and precision.
anthropic
Claude Opus 4.5
The most intelligent Claude model for the hardest problems. Opus 4.5 excels at extremely complex reasoning, advanced coding challenges, sophisticated research, and intricate multi-step planning. With enhanced general intelligence and vision capabilities, this model tackles problems that push the boundaries of AI capability. Perfect for cutting-edge research, complex system design, advanced agentic workflows, and situations where you absolutely need the highest quality thinking available. The ultimate problem-solver.
anthropic
Claude Sonnet 4.5
Our go-to model for sophisticated work that demands both intelligence and nuance. Sonnet 4.5 excels at complex coding projects, nuanced writing, detailed analysis, and thoughtful problem-solving. With thinking capabilities, multimodal support, and Claude's renowned ability to follow instructions precisely, this model strikes the perfect balance between capability and cost for professional-grade work. Ideal for when quality truly matters.
deepseek
DeepSeek V3.2
An innovative model pioneering breakthrough efficiency in long-context processing. Using revolutionary Sparse Attention technology, this model handles massive contexts (128K tokens) with exceptional speed and minimal resource use while maintaining quality. With thinking capabilities and an impressive 65K output window, it excels at tasks requiring extensive context understanding. Perfect for processing large documents, codebases, or datasets where traditional models slow down. Exceptional value for long-context work.
deepseek
DeepSeek V3.2 Exp
An innovative experimental model pioneering breakthrough efficiency in long-context processing. Using revolutionary Sparse Attention technology, this model handles massive contexts (164K tokens) with exceptional speed and minimal resource use while maintaining quality. With thinking capabilities and an impressive 65K output window, it excels at tasks requiring extensive context understanding. Perfect for processing large documents, codebases, or datasets where traditional models slow down. Exceptional value for long-context work.
deepseek
DeepSeek V3.2 Exp Thinking
The thinking-optimized variant of our experimental long-context model, designed for deep reasoning over extensive information. This model allocates more computational resources to explicit reasoning while processing massive contexts efficiently. With 164K context support, it excels at analytical tasks requiring both breadth of information and depth of thought. Choose this when you need thorough, reasoned analysis of large amounts of information. Extremely cost-effective for research and analysis.
deepseek
DeepSeek V3.2 Thinking
The thinking-optimized variant of our long-context model, designed for deep reasoning over extensive information. This model allocates more computational resources to explicit reasoning while processing massive contexts efficiently. With 128K context support, it excels at analytical tasks requiring both breadth of information and depth of thought. Choose this when you need thorough, reasoned analysis of large amounts of information. Extremely cost-effective for research and analysis.
google
Gemini 2.5 Flash
The perfect all-rounder combining intelligence, speed, and value. This thinking model delivers excellent performance across diverse tasks with a massive 1M context window and full multimodal support. Whether you need code generation, document analysis, visual understanding, or complex reasoning, Flash handles it with grace. The sweet spot between capability and cost makes it our recommended choice for most professional work. With strong thinking capabilities, it provides both quality and transparency.
google
Gemini 2.5 Flash Image (Nano Banana)
The first hybrid reasoning image generator combining speed, intelligence, and creative control. Nano Banana creates images from text, edits them conversationally across multiple turns, and generates interleaved text-and-image responses. With configurable thinking budgets, you control the balance between quality, cost, and speed. Locale-aware generation ensures culturally appropriate visuals for global audiences. Perfect for rapid creative iteration, conversational image editing, and projects requiring both visual and textual content together. Fast, flexible, and surprisingly capable.
google
Gemini 2.5 Flash Lite
Blazingly fast and incredibly affordable, without sacrificing capability. This lightweight model offers an extraordinary 1M token context window with multimodal support at breakthrough pricing. With configurable thinking and tool connectivity, it handles diverse tasks from quick queries to complex document analysis. The massive context window means you can process entire books, codebases, or datasets in a single request. Perfect for high-volume applications and cost-conscious projects that still need quality results.
google
Gemini 2.5 Pro
The previous-generation advanced Gemini reasoning model capable of solving complex problems with 1M context and comprehensive multimodal support including audio and video.
google
Gemini 3 Flash
Google's most intelligent model balanced for speed and cost, combining frontier intelligence with superior search and grounding. Gemini 3 Flash delivers exceptional reasoning capabilities across a massive 1M context window while maintaining fast response times. With full multimodal support including vision and tool use, it excels at complex analytical tasks, research, and code generation. The perfect choice when you need top-tier intelligence and speed or cost is a consideration.
google
Gemini 3 Pro Image (Nano Banana Pro)
Professional-grade image generation delivering studio-quality, production-ready visuals with unparalleled precision and control. Building on Nano Banana's foundation, the Pro version adds enhanced reasoning, deep world knowledge, sophisticated text rendering and translation within images, and studio-level fine controls. Create high-fidelity visuals with accurate text, cultural nuance, and functional design precision. Perfect for professional projects, marketing materials, product designs, and any work requiring publication-ready quality. The ultimate image generation model for serious creative work.
google
Gemini 3 Pro Preview
The most advanced Gemini model, pushing the boundaries of multimodal reasoning and complex problem-solving. This preview model excels at sophisticated analytical tasks with support for text, images, audio, video, and documents. With a 1M context window and enhanced reasoning capabilities, it tackles problems that require deep understanding across multiple modalities. Choose this for cutting-edge multimodal work, advanced research, or when you need the absolute best in visual and analytical reasoning. The future of multimodal AI.
zai
GLM 4.5 Air
A streamlined, efficient agent-focused model using Mixture-of-Experts architecture. With 106B total parameters but only 12B active per task, this model delivers impressive intelligence while remaining fast and cost-effective. Purpose-built for agentic applications, it excels at tool use and autonomous workflows. The thinking capabilities provide transparency in decision-making. With 128K context and 96K output, it handles substantial tasks comfortably. Perfect for production agent systems where you need reliability and efficiency without breaking the budget.
zai
GLM 4.6
The latest and most capable GLM model with comprehensive improvements across all domains. This versatile model excels at real-world coding, handles long contexts up to 200K tokens, and delivers strong performance in reasoning, research, writing, and agentic workflows. With thinking capabilities and an impressive 96K output window, it tackles diverse professional tasks with confidence. The well-rounded upgrade brings enhanced capabilities across the board while maintaining excellent value. Choose this for sophisticated work requiring versatility and depth.
zai
GLM 4.6V
A capable multimodal model achieving state-of-the-art visual understanding among models of similar scale. GLM 4.6V combines strong image analysis with the reasoning and tool use capabilities of the GLM family. With 128K context support and vision capabilities, it handles image understanding, document analysis, and visual reasoning tasks effectively. An excellent choice for multimodal workflows where you need reliable visual comprehension without premium pricing.
zai
GLM 4.7
The latest and most capable GLM model with comprehensive improvements across all domains. This versatile model excels at real-world coding, handles long contexts up to 205K tokens, and delivers strong performance in reasoning, research, writing, and agentic workflows. With thinking capabilities and an impressive 131K output window, it tackles diverse professional tasks with confidence. The well-rounded upgrade brings enhanced capabilities across the board while maintaining excellent value. Choose this for sophisticated work requiring versatility and depth.
openai
GPT-5
OpenAI's original GPT-5 flagship excelling at complex reasoning, broad knowledge, advanced coding, and multi-step agentic tasks.
openai
GPT-5 Mini
The sweet spot for everyday AI work?intelligent, fast, and affordable. This model excels at reasoning, conversation, and general tasks with an optimal balance of capability and cost. With 400K context, multimodal support, and thinking capabilities, it handles most professional work confidently. The GPT-5 architecture delivers reliable quality across coding, writing, analysis, and problem-solving. Our most popular choice for teams who need consistent, high-quality performance without premium pricing. An excellent general-purpose workhorse.
openai
GPT-5 Nano
Lightning-fast and incredibly cost-effective for high-throughput workloads. This model specializes in straightforward instructions and classification tasks where speed is essential. With a massive 400K context window and multimodal support, it processes large volumes of simple tasks efficiently. The thinking capability is tuned for quick, decisive responses rather than deep contemplation. Perfect for production systems handling thousands of simple requests, real-time classification, or any scenario requiring fast, economical processing with GPT-5 architecture.
openai
GPT-5 Pro
The ultimate thinking machine for problems that demand maximum intelligence and computational effort. GPT-5 Pro allocates massive compute resources to think deeply and thoroughly about the hardest challenges. With an extraordinary 272K output window and extensive thinking capabilities, this model tackles problems other AIs simply cannot solve. Requests may take minutes to complete as it works through complex reasoning chains. For cutting-edge research, groundbreaking problem-solving, and situations where correctness is paramount and time is secondary.
openai
GPT-5.1
OpenAI's flagship model with adaptive thinking that allocates computational effort based on question complexity. This model excels at sophisticated reasoning, deep real-world knowledge, advanced coding challenges, and complex multi-step workflows. It intelligently spends more time on hard problems while responding quickly to simpler ones. Perfect for professional work requiring OpenAI's best capabilities across reasoning, knowledge, and technical execution.
openai
GPT-5.1 Instant
A conversational variant of GPT-5 with warmer tone, improved instruction following, and adaptive reasoning. Designed for purely conversational applications rather than research.
openai
GPT-5.1 Thinking
OpenAI's premier thinking model with precisely tuned adaptive reasoning. This upgraded version excels at complex analytical tasks, sophisticated coding, and multi-step problem-solving with transparent thought processes.
openai
GPT-5.2
OpenAI's best general-purpose model, part of the GPT-5 flagship model family. GPT-5.2 is their most intelligent model yet for both general and agentic tasks. With a 400K context window, multimodal capabilities including image generation, and advanced reasoning, this model excels at sophisticated coding, complex analysis, and multi-step workflows. The ideal choice for professional work requiring OpenAI's cutting-edge capabilities.
openai
GPT-5.2 Pro
The ultimate version of GPT-5.2 that produces smarter and more precise responses. This model allocates massive compute resources to think deeply and thoroughly about the hardest challenges. With a 400K context window, multimodal capabilities including image generation, and maximum reasoning power, it tackles problems that require the highest quality thinking available. For cutting-edge research, groundbreaking problem-solving, and situations where precision and correctness are paramount.
xai
Grok 4
xAI's premier flagship model combining exceptional natural language understanding, mathematical prowess, and sophisticated reasoning. This well-rounded model excels across diverse domains?from creative writing to complex calculations to logical problem-solving. With 256K context for both input and output, thinking capabilities, and vision support, it handles virtually any task with intelligence and nuance. The true jack-of-all-trades that masters most of them. Perfect when you need a single model that performs excellently across the board.
xai
Grok 4 Fast
xAI's previous-generation fast multimodal model with 2M context and cost-efficient performance. Designed for rapid agentic workflows without extended reasoning.
xai
Grok 4 Fast Reasoning
xAI's previous-generation thinking model with 2M context and cost-efficient agentic performance. Combined rapid execution with reasoning capabilities.
xai
Grok 4.1 Fast
xAI's speed demon for agentic workflows requiring rapid, accurate tool execution. With an extraordinary 2M context window, this model processes massive amounts of information while maintaining blazing-fast response times. Optimized specifically for tool calling and task completion, it excels at real-world applications like customer support, financial analysis, and automated workflows where speed is critical. The non-reasoning variant prioritizes quick responses over extended thought processes. Choose this when you need rapid, reliable agent performance at incredible value.
xai
Grok 4.1 Fast Reasoning
xAI's intelligent agent combining massive context, thinking capabilities, and tool mastery. With a 2M context window and reasoning mode, this model thoughtfully navigates complex agentic workflows while maintaining speed. The perfect balance between rapid execution and intelligent decision-making for sophisticated real-world applications. Excels at scenarios requiring both tool orchestration and reasoning?like nuanced customer support, complex financial analysis, and adaptive workflows. Choose this when your agents need to think and act intelligently.
moonshotai
Kimi K2 Thinking Turbo
The ultimate autonomous thinking agent capable of executing hundreds of sequential tool calls with coherent reasoning throughout. This model can chain 200-300 tool operations without human intervention, maintaining logical consistency across complex multi-step problems. Built specifically as a thinking agent, it reasons step-by-step while acting, achieving state-of-the-art results on the hardest benchmarks. With a massive 262K equal input/output window, it handles truly extensive workflows. The Turbo variant delivers this capability at exceptional speed. For complex autonomous projects requiring persistent reasoning and action.
moonshotai
Kimi K2 Turbo
A massive 1T parameter MoE model (32B active) optimized for agentic capabilities, tool use, reasoning, and code synthesis. Achieved strong benchmark results in coding and reasoning tasks.
meta
Llama 3.3 70B
A balanced model combining performance with efficiency for conversational AI. Designed for content creation, enterprise applications, and research with strong language understanding. Handles summarization, classification, sentiment analysis, and code generation.
meta
Llama 4 Maverick 17B
A multimodal model from the Llama 4 collection with MoE architecture for text and image tasks. Designed for multimodal experiences with vision capabilities.
meta
Llama 4 Scout 17B
A compact multimodal model using mixture-of-experts architecture for text and image understanding. Designed for efficient multimodal experiences with vision support.
mistral
Magistral Medium
A sophisticated thinking model built for transparent, verifiable reasoning across languages. This model excels at complex analytical tasks where you need to see and understand the reasoning process. With exceptional multilingual capabilities, it maintains reasoning quality even when switching languages mid-conversation. The thinking mode provides clear, step-by-step explanations you can follow and verify. Perfect for international teams, academic work, and any task requiring transparent, logical reasoning across language barriers.
minimax
MiniMax M2
An efficiency masterpiece combining massive scale with minimal active compute. This Mixture-of-Experts model packs 230B parameters but activates only 10B per request, delivering elite performance at remarkable speed and cost. Purpose-built for coding and agentic work while maintaining strong general intelligence. With an extraordinary 205K equal input/output window and thinking capabilities, it handles massive contexts and generates extensive responses. Perfect for developers and teams who need powerful AI without the premium price tag. Exceptional value for sophisticated work.
minimax
MiniMax M2.1
MiniMax's latest model, optimized specifically for robustness in coding, tool use, instruction following, and long-horizon planning. With a 205K context window and 131K output capacity, it handles substantial tasks with confidence. The thinking capabilities enable transparent reasoning for complex problems. Perfect for developers and teams who need powerful AI for sophisticated coding and agentic workflows.
mistral
Mistral Large
Our flagship Mistral model designed for complex, specialized tasks requiring substantial reasoning power. This model excels at sophisticated code generation, synthetic text creation, RAG applications, and advanced agent workflows. With strong reasoning capabilities and vision support, it handles specialized professional tasks that demand both intelligence and precision. Choose this when you need Mistral's most capable model for challenging, domain-specific work. Ideal for production systems requiring reliable, high-quality outputs.
mistral
Mistral Medium 3.1
Exceptional performance at a fraction of the cost. This model delivers near-frontier capabilities (performing at 90%+ of top models on benchmarks) while being dramatically more affordable. With vision support and strong tool integration, it handles a wide range of professional tasks with impressive quality. The perfect choice when you need high-quality results without premium pricing. An outstanding value proposition for teams looking to maximize capability per dollar.
mistral
Mistral Small
Fast, efficient, and affordable for high-volume tasks. This model is purpose-built for bulk operations like classification, customer support, and text generation where you need to process many requests quickly and economically. With vision support and tool integration, it handles straightforward tasks reliably. Perfect for production systems processing thousands of requests, batch operations, and any scenario where speed and cost matter more than extreme intelligence. Excellent performance at the best price point.
mistral
Pixtral 12B
A compact 12B multimodal model with image understanding alongside text capabilities.
alibaba
Qwen3 235B
A powerful general-purpose model perfect for complex reasoning tasks, instruction following, and multilingual projects. With 235 billion parameters, this model excels at understanding nuanced instructions and maintaining context across long conversations. Ideal for research, creative writing, and sophisticated problem-solving where you need reliable, well-reasoned responses.
alibaba
Qwen3 Coder 30B
Your budget-friendly coding companion designed for everyday development work. This model specializes in code generation, debugging, and refactoring with strong tool integration. With a massive 262K context window, it can handle entire codebases and long documentation. Perfect for daily coding tasks where you need fast, reliable assistance without breaking the bank.
alibaba
Qwen3 Max
Our most advanced Qwen model, engineered specifically for sophisticated agent workflows and complex tool orchestration. This model achieves state-of-the-art performance in autonomous task completion, excelling at multi-step planning and execution. With a 262K context window and 32K output tokens, it handles extended agentic scenarios where other models struggle. Choose this when you need reliable, intelligent automation.
alibaba
Qwen3 Next 80B Thinking
A next-generation thinking model that balances power with efficiency through innovative architecture. This model shines when you need deep reasoning without the cost of larger models. With 164K context and explicit thinking capabilities, it works through complex problems step-by-step, making it ideal for analytical tasks, research, and situations where you want to see the reasoning process. Extremely cost-effective for thoughtful, deliberate problem-solving.
alibaba
Qwen3 VL 235B Thinking
Our premier vision-language model combining 235B parameters with exceptional visual understanding and reasoning. This model excels at analyzing images, charts, diagrams, and documents with a special focus on STEM and mathematical content. With dramatically improved OCR capabilities and visual perception, it handles everything from handwritten equations to complex technical diagrams. Perfect for research, education, and any task requiring sophisticated visual analysis.