← AI Hub

Models

Technical model profiles and strategy explainers — capabilities, deployment tradeoffs, and practical fit guidance.

AI model pages are point-in-time snapshots based on each page's last verified date. Current and preview entries are refreshed on the active maintenance cadence, while legacy and deprecated entries remain browseable as historical context.

Filters available

Filter by type, provider, status, and open-source availability. Deprecated entries stay hidden unless enabled.

Model Strategy Explainers

Constraint-led guidance for open-weight and proprietary choices across local, private, and managed deployment paths.

Model Families

Stable overviews of major model product lines. Use these as durable reference points.

Language Models

DeepSeek-Reasoner

DeepSeek · DeepSeek-R1

DeepSeek's reasoning-focused API endpoint with strong analytical performance and very aggressive token pricing.

language128K ctx
Mar 6, 2026

GLM-5

Zhipu AI · GLM

Zhipu's latest GLM flagship with long context, strong coding ability, and open-weight plus API access.

language200K ctxOpen
Mar 1, 2026

GPT-5

OpenAI · GPT-5

Original GPT-5 release entry, now superseded by newer GPT-5.3 and GPT-5.4 generation variants.

language400K ctxLegacy
Mar 6, 2026

GPT-5 mini

OpenAI · GPT-5

Cost-efficient GPT-5 variant for high-volume production workflows needing strong reasoning at lower cost.

language400K ctx
Feb 26, 2026

GPT-5 nano

OpenAI · GPT-5

Ultra-low-cost GPT-5 tier for high-throughput automation and lightweight reasoning tasks.

language400K ctx
Feb 26, 2026

GPT-5-Codex

OpenAI · GPT-5

Earlier GPT-5 Codex release entry, superseded by newer GPT-5.3-Codex in the current lineup.

language400K ctxLegacy
Mar 6, 2026

GPT-5.2

OpenAI · GPT-5

Earlier GPT-5.2 generation entry retained as historical context after GPT-5.3 and GPT-5.4.

language400K ctxLegacy
Mar 6, 2026

GPT-5.2-Codex

OpenAI · GPT-5

Earlier GPT-5 Codex variant retained as historical context after GPT-5.3-Codex.

language400K ctxLegacy
Mar 6, 2026

GPT-5.2-Pro

OpenAI · GPT-5

Earlier premium GPT-5.2 tier retained as historical context after GPT-5.4-Pro.

language400K ctxLegacy
Mar 6, 2026

GPT-5.3

OpenAI · GPT-5

Fast everyday GPT-5 tier tuned for general work, coding, and lower-cost production usage.

language400K ctx
Mar 6, 2026

GPT-5.3-Codex

OpenAI · GPT-5

Latest Codex-tuned GPT-5 model for repository-scale implementation, review, and agent workflows.

language400K ctx
Mar 6, 2026

GPT-5.4

OpenAI · GPT-5

Current high-capability GPT-5 tier for difficult professional tasks, coding, and longer agent workflows.

language400K ctx
Mar 6, 2026

GPT-5.4-Pro

OpenAI · GPT-5

Highest-capability GPT-5 tier for decision-ready analysis and the most demanding professional workflows.

language400K ctx
Mar 6, 2026

Grok 4

xAI · Grok

xAI's flagship Grok model focused on frontier reasoning, long context, and tool-oriented assistant workflows.

language256K ctx
Mar 6, 2026

Grok 4.20

xAI · Grok

xAI's latest model with improved reasoning and coding capabilities, building on Grok 4 with enhanced tool use and real-time data integration.

language256K ctx
Feb 22, 2026

Kimi K2.5

Moonshot AI · Kimi

Moonshot's Kimi K2.5 is an open-weight long-context model focused on agentic reasoning and tool use.

language256K ctxOpen
Mar 1, 2026

MiniMax-M2.5

MiniMax · MiniMax M

MiniMax's latest open-weight reasoning model with 200K+ context and strong coding, tool-use, and productivity benchmarks.

language205K ctxOpen
Mar 6, 2026

Mistral Small 3.2

Mistral AI · Mistral Small

Mistral's open 24B model balancing strong instruction quality with low API cost for production assistants.

language128K ctxOpen
Mar 1, 2026

o3

OpenAI · o-series

High-reasoning model optimized for difficult analysis, planning, and reliability-critical decisions.

language200K ctx
Mar 6, 2026

o4-mini

OpenAI · o-series

Cost-effective reasoning model retained as a legacy reference after retirement from ChatGPT defaults.

language200K ctxLegacy
Mar 6, 2026

Qwen3-Max

Alibaba · Qwen3

Alibaba's top Qwen API model for high-end multilingual reasoning, coding, and enterprise assistant workloads.

language262K ctx
Feb 26, 2026

Qwen3.5

Alibaba · Qwen

Alibaba's Qwen3.5 generation extends the Qwen line with stronger open-weight reasoning and coding performance.

language262K ctxOpen
Mar 1, 2026

Multimodal Models

Claude Haiku 4.5

Anthropic · Claude 4

Fast and efficient Claude tier for latency-sensitive assistant and automation workloads.

multimodal200K ctx
Mar 6, 2026

Claude Opus 4.6

Anthropic · Claude 4

Anthropic's flagship Claude model for high-difficulty reasoning, coding, and long-running agent workflows.

multimodal200K ctx
Mar 6, 2026

Claude Sonnet 4.5

Anthropic · Claude 4

Balanced Claude tier for production reasoning, coding, and long-context assistant workflows.

multimodal200K ctx
Mar 6, 2026

Claude Sonnet 4.6

Anthropic · Claude 4

Anthropic's balanced Claude model — strong reasoning and coding at moderate pricing, the default recommendation for most tasks.

multimodal200K ctx
Feb 22, 2026

Gemini 2.5 Flash

Google · Gemini 2.5

Fast Gemini tier balancing multimodal capability, latency, and cost for production assistants.

multimodal1M ctx
Mar 6, 2026

Gemini 2.5 Flash-Lite

Google · Gemini 2.5

Budget-oriented Gemini tier for large-scale assistant and automation workloads.

multimodal1M ctx
Mar 6, 2026

Gemini 2.5 Pro

Google · Gemini 2.5

High-capability Gemini tier for long-context multimodal reasoning and advanced enterprise workflows.

multimodal1M ctx
Mar 6, 2026

Gemini 3.1 Pro Preview

Google · Gemini 3

Google's current high-end Gemini preview model for multimodal reasoning, agentic workflows, and long-context analysis.

multimodal1M ctxPreview
Mar 6, 2026

GPT-4.1

OpenAI · GPT-4.1

Long-context multimodal model retained as a legacy reference after retirement from ChatGPT defaults.

multimodal1M ctxLegacy
Mar 6, 2026

GPT-4o

OpenAI · GPT-4o

Widely deployed multimodal model kept as a legacy reference after retirement from ChatGPT defaults.

multimodal128K ctxLegacy
Mar 6, 2026

GPT-4o mini

OpenAI · GPT-4o

Lower-cost GPT-4o tier for high-volume multimodal assistant and automation workloads.

multimodal128K ctx
Mar 6, 2026

Llama 4 Maverick

Meta · Llama 4

Open-weights Llama 4 tier for teams needing customization, control, and self-hosting flexibility.

multimodal262K ctxOpen
Mar 6, 2026

Llama 4 Scout

Meta · Llama 4

Efficiency-focused Llama 4 tier for customizable deployments with tighter compute budgets.

multimodal10M ctxOpen
Mar 6, 2026

Mistral Large 3

Mistral AI · Mistral Large

Mistral's flagship European multimodal model with long context and competitive enterprise API economics.

multimodal256K ctx
Mar 6, 2026

Image Models

Video Models

Audio Models