AI Models Catalog — 782 LLMs tracked

Qwen

125 variants

GPT

OpenAI · 65 variants

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sou...

MaziyarPanahi

47 variants

Unsloth

31 variants

Mistral

Mistral · 27 variants

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excel...

Llama

Meta · 24 variants

Llama Guard 3 is a Llama-3.1-8B pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be u...

DeepSeek

DeepSeek · 21 variants

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous vers...

LM Studio Community

21 variants

Nemotron

NVIDIA · 17 variants

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/mod...

TRL Testing

16 variants

Claude

Anthropic · 13 variants

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. Se...

Farbod Tavakkoli

13 variants

Gemini

Google · 12 variants

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while...

Gemma

Google · 12 variants

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). ...

Phi

Microsoft · 11 variants

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations w...

Grok

xAI · 10 variants

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text...

GLM

9 variants

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) arch...

SmolLM

9 variants

Bartowski

9 variants

OLMo

7 variants

OLMo-2 32B Instruct is a supervised instruction-finetuned variant of the OLMo-2 32B March 2025 base model. It excels in complex reasoning...

LFM

6 variants

Hermes

Nous · 6 variants

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better rolepl...

Sonar

Perplexity · 5 variants

Sonar is lightweight, affordable, fast, and simple to use — now featuring citations and the ability to customize sources. It is designed ...

Nova

Amazon · 5 variants

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. ...

ERNIE

5 variants

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptio...

MiniMax

5 variants

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion acti...

CyanKiwi

5 variants

RedHat AI

5 variants

Granite

4 variants

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by ...

Kimi

4 variants

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters...

GGML

4 variants

TheBloke

4 variants

Nano Banana

Google · 3 variants

Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual...

Command

Cohere · 3 variants

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, mult...

MLX Community

3 variants

Step

2 variants

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it select...