Support

AI-powered help

Welcome!

Please introduce yourself before we start.

    LLM Gateway
    Log InGet Started

    AI Models Directory

    Browse and compare 180+ AI models from OpenAI, Anthropic, Google, and 30+ providers — filter by capabilities, pricing, and context size.

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    107/259
    Models
    27/34
    Providers
    66
    Vision Models (filtered)
    104
    Tool-enabled (filtered)
    2
    Free Models (filtered)

    GLM-4.6V Flash

    glm
    glm-4.6v-flash
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $0.00
    /M tokens
    Cached
    $0.00
    /M tokens
    Output
    $0.00
    /M tokens
    Get Started

    GLM-4.6V FlashX

    glm
    glm-4.6v-flashx
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $0.04
    /M tokens
    Cached
    $0.00
    /M tokens
    Output
    $0.40
    /M tokens
    Get Started

    GLM-4.6V

    glm
    glm-4.6v
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $0.30
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $0.90
    /M tokens
    Get Started

    GLM-4.6

    glm
    glm-4.6
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    NovitaAI
    Context: 204.8k
    Input
    $0.55
    /M tokens
    Cached
    $0.11
    /M tokens
    Output
    $2.20
    /M tokens
    Get Started

    GLM-4.7 Flash (Free)

    glm
    glm-4.7-flash-free
    Streaming
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 200k
    Input
    $0.00
    /M tokens
    Cached
    $0.00
    /M tokens
    Output
    $0.00
    /M tokens
    Get Started

    GLM-4.7 FlashX

    glm
    glm-4.7-flashx
    Streaming
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 200k
    Input
    $0.07
    /M tokens
    Cached
    $0.01
    /M tokens
    Output
    $0.40
    /M tokens
    Get Started

    GLM-4.7

    glm
    glm-4.7
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Z AI
    Context: 200k
    Input
    $0.60
    /M tokens
    Cached
    $0.11
    /M tokens
    Output
    $2.20
    /M tokens
    + $0.010 per search
    Get Started

    GLM-4.5 X

    glm
    glm-4.5-x
    Streaming
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $2.20
    /M tokens
    Cached
    $0.45
    /M tokens
    Output
    $8.90
    /M tokens
    Get Started

    GLM-4.5V

    glm
    glm-4.5v
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Z AI
    Context: 128k
    Input
    $0.60
    /M tokens
    Cached
    $0.11
    /M tokens
    Output
    $1.80
    /M tokens
    Get Started

    GLM-4.5

    glm
    glm-4.5
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Z AI
    Context: 128k
    Input
    $0.60
    /M tokens
    Cached
    $0.11
    /M tokens
    Output
    $2.20
    /M tokens
    + $0.010 per search
    Get Started

    GLM-5

    glm
    glm-5
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Structured JSON Output
    Z AI
    Context: 202.8k
    Input
    $1.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $3.20
    /M tokens
    + $0.010 per search
    Get Started

    GLM-5.1

    glm
    glm-5.1
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Structured JSON Output
    Z AI
    Context: 200k
    Input
    $1.40
    /M tokens
    Cached
    $0.26
    /M tokens
    Output
    $4.40
    /M tokens
    + $0.010 per search
    Get Started

    Seed 1.8 (251228)

    bytedance
    seed-1-8-251228
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.25
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $2.00
    /M tokens
    Get Started

    Seed 1.6 Flash (250715)

    bytedance
    seed-1-6-flash-250715
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.07
    /M tokens
    Cached
    $0.01
    /M tokens
    Output
    $0.30
    /M tokens
    Get Started

    Seed 1.6 (250915)

    bytedance
    seed-1-6-250915
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.25
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $2.00
    /M tokens
    Get Started

    Seed 1.6 (250615)

    bytedance
    seed-1-6-250615
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    ByteDance
    Context: 256k
    Input
    $0.25
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $2.00
    /M tokens
    Get Started

    Qwen3.6 35B A3B

    alibaba
    qwen3.6-35b-a3b
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.25
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.48
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3.6 Plus

    alibaba
    qwen3.6-plus
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.50
    /M tokens
    Cached
    $0.05
    /M tokens
    Output
    $3.00
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3.6 Max Preview

    alibaba
    qwen3.6-max-preview
    Streaming
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $1.30
    /M tokens
    Cached
    $0.13
    /M tokens
    Output
    $7.80
    /M tokens
    Get Started

    Qwen3 Max 2026-01-23

    alibaba
    qwen3-max-2026-01-23
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $1.20
    /M tokens
    Cached
    $0.24
    /M tokens
    Output
    $6.00
    /M tokens
    Get Started

    Qwen3 VL 235B A22B Thinking

    alibaba
    qwen3-vl-235b-a22b-thinking
    Streaming
    Vision
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.50
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.00
    /M tokens
    Get Started

    QwQ Plus

    alibaba
    qwq-plus
    Streaming
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.80
    /M tokens
    Cached
    —
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    Qwen3.5 397B A17B

    alibaba
    qwen35-397b-a17b
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.60
    /M tokens
    Cached
    —
    /M tokens
    Output
    $3.60
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3 VL 30B A3B Thinking

    alibaba
    qwen3-vl-30b-a3b-thinking
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    NovitaAI
    Context: 131.1k
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.00
    /M tokens
    Get Started

    Qwen3.7 Plus

    alibaba
    qwen3.7-plus
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 1M
    Input
    $0.40
    /M tokens
    Cached
    $0.08
    /M tokens
    Output
    $1.60
    /M tokens
    Tiered Pricing
    IN
    CACHED
    OUT
    ≤256K tokens
    $0.40
    $0.08
    $1.60
    >256K tokens
    $1.20
    $0.24
    $4.80
    Get Started

    Qwen3.7 Max

    alibaba
    qwen3.7-max
    Streaming
    Tools
    Reasoning
    JSON Output
    Native Web Search
    Alibaba Cloud
    Context: 1M
    Input
    $2.50
    /M tokens
    Cached
    $0.50
    /M tokens
    Output
    $7.50
    /M tokens
    + $0.010 per search
    Get Started

    Qwen3 Max

    alibaba
    qwen3-max
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 256k
    Input
    $3.00
    /M tokens
    Cached
    $0.60
    /M tokens
    Output
    $15.00
    /M tokens
    Get Started

    Qwen3 Next 80B A3B Thinking

    alibaba
    qwen3-next-80b-a3b-thinking
    Streaming
    Tools
    Reasoning
    Alibaba Cloud
    Context: 131.1k
    Input
    $0.50
    /M tokens
    Cached
    —
    /M tokens
    Output
    $6.00
    /M tokens
    Get Started

    Qwen3 30B A3B Thinking 2507

    alibabaModel Deactivated
    qwen3-30b-a3b-thinking-2507
    Streaming
    Tools
    Reasoning
    JSON Output
    Nebius AI
    Context: 262k
    Deactivated since Apr 25, 2026
    Input
    $0.10
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.30
    /M tokens
    Get Started

    Qwen3 235B A22B Thinking 2507

    alibaba
    qwen3-235b-a22b-thinking-2507
    Streaming
    Tools
    Reasoning
    JSON Output
    NovitaAI
    Context: 131.1k
    Input
    $0.30
    /M tokens
    Cached
    —
    /M tokens
    Output
    $3.00
    /M tokens
    Get Started

    Kimi K2.6

    moonshot
    kimi-k2.6
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Moonshot AI
    Context: 262.1k
    Input
    $0.95
    /M tokens
    Cached
    $0.16
    /M tokens
    Output
    $4.00
    /M tokens
    Get Started

    Kimi K2.5

    moonshot
    kimi-k2.5
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Alibaba Cloud
    Context: 262.1k
    Input
    $0.57
    /M tokens
    Cached
    —
    /M tokens
    Output
    $3.01
    /M tokens
    Get Started

    Kimi K2 Thinking Turbo

    moonshot
    kimi-k2-thinking-turbo
    Streaming
    Tools
    Reasoning
    JSON Output
    Moonshot AI
    Context: 262.1k
    Input
    $1.15
    /M tokens
    Cached
    $0.15
    /M tokens
    Output
    $8.00
    /M tokens
    Get Started

    Kimi K2 Thinking

    moonshot
    kimi-k2-thinking
    Streaming
    Tools
    Reasoning
    JSON Output
    Moonshot AI
    Context: 262.1k
    Input
    $0.60
    /M tokens
    Cached
    $0.15
    /M tokens
    Output
    $2.50
    /M tokens
    Get Started

    MiniMax Text 01

    minimax
    minimax-text-01
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 1M
    Input
    $0.20
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.10
    /M tokens
    Get Started

    MiniMax M2.1 Lightning

    minimax
    minimax-m2.1-lightning
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 196.6k
    Input
    $0.12
    /M tokens
    Cached
    —
    /M tokens
    Output
    $0.48
    /M tokens
    Get Started

    MiniMax M2.1

    minimax
    minimax-m2.1
    Streaming
    Tools
    Reasoning
    JSON Output
    MiniMax
    Context: 196.6k
    Input
    $0.27
    /M tokens
    Cached
    —
    /M tokens
    Output
    $1.10
    /M tokens
    Get Started

    MiniMax M2

    minimax
    minimax-m2
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 196.6k
    Input
    $0.20
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $1.00
    /M tokens
    Get Started

    MiniMax M2.5 Highspeed

    minimax
    minimax-m2.5-highspeed
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    MiniMax M2.5

    minimax
    minimax-m2.5
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    NovitaAI
    Context: 204.8k
    Input
    $0.30
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $1.20
    /M tokens
    Get Started

    MiniMax M2.7 Highspeed

    minimax
    minimax-m2.7-highspeed
    Streaming
    Tools
    Reasoning
    MiniMax
    Context: 204.8k
    Input
    $0.60
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    MiniMax M2.7

    minimax
    minimax-m2.7
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    MiniMax
    Context: 204.8k
    Input
    $0.30
    /M tokens
    Cached
    $0.06
    /M tokens
    Output
    $1.20
    /M tokens
    Get Started

    MiniMax M3

    minimax
    minimax-m3
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    MiniMax
    Context: 1.0M
    Input
    $0.60
    /M tokens
    Cached
    $0.12
    /M tokens
    Output
    $2.40
    /M tokens
    Get Started

    DeepSeek V4 Flash

    deepseek
    deepseek-v4-flash
    Streaming
    Tools
    Reasoning
    JSON Output
    DeepInfra
    Context: 1M
    Input
    $0.14
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $0.28
    /M tokens
    Get Started

    DeepSeek V4 Pro

    deepseek
    deepseek-v4-pro
    Streaming
    Tools
    Reasoning
    JSON Output
    Structured JSON Output
    DeepSeek
    Context: 1.1M
    Input
    $0.43
    /M tokens
    Cached
    $0.00
    /M tokens
    Output
    $0.87
    /M tokens
    Get Started

    DeepSeek V3.2

    deepseek
    deepseek-v3.2
    Streaming
    Tools
    JSON Output
    Reasoning
    DeepSeek
    Context: 163.8k
    Deactivated since May 1, 2026
    Input
    $0.28
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $0.42
    /M tokens
    Get Started

    DeepSeek V3.1

    deepseek
    deepseek-v3.1
    Streaming
    Tools
    Reasoning
    ByteDance
    Context: 128k
    Input
    $0.56
    /M tokens
    Cached
    $0.11
    /M tokens
    Output
    $1.68
    /M tokens
    Get Started

    MiMo V2 Flash

    xiaomi
    mimo-v2-flash
    Streaming
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 256k
    Input
    $0.10
    /M tokens
    Cached
    $0.02
    /M tokens
    Output
    $0.30
    /M tokens
    Get Started

    MiMo V2.5

    xiaomi
    mimo-v2.5
    Streaming
    Vision
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 1M
    Input
    $0.14
    /M tokens
    Cached
    $0.03
    /M tokens
    Output
    $0.28
    /M tokens
    Get Started

    MiMo V2 Pro

    xiaomi
    mimo-v2-pro
    Streaming
    Tools
    Reasoning
    JSON Output
    Xiaomi
    Context: 1M
    Input
    $1.00
    /M tokens
    Cached
    $0.20
    /M tokens
    Output
    $3.00
    /M tokens
    Get Started
    Page 1 of 3