LLM (Large Language Models)

iSiri offers access to the latest closed-source large language models (LLMs) available in the market.

Users can select preferred model by navigating to General Settings > Reply > Model, which ensures that the chosen model is prioritized during interactions. Or click the incon , and select Model > Settings on the home page in the input box.

LLM Information

Input and output prices are consistent with the official API pricing from Google, Anthropic and OpenAI

Model Name

GPT-4o mini

Company

OpenAI

Input Price($/1k token)

0.00015

Output Price($/1k token)

0.0006

Description

OpenAI's affordable and intelligent small model for fast, lightweight tasks

Context window (K tokens)

128

Max output

16,384 tokens

Model Name

GPT-4o

Company

OpenAI

Input Price($/1k token)

0.005

Output Price($/1k token)

0.015

Description

OpenAI's high-intelligence flagship model for complex, multi-step tasks

Context window (K tokens)

128

Max output

4,096 tokens

Model Name

o1-preview

Company

OpenAI

Input Price($/1k token)

0.015

Output Price($/1k token)

0.06

Description

OpenAI's reasoning model trained with reinforcement learning to perform complex reasoning.

Context window (K tokens)

128

Max output

32,768 tokens

Model Name

o1-mini

Company

OpenAI

Input Price($/1k token)

0.003

Output Price($/1k token)

0.012

Description

OpenAI's scaled reasoning model but faster and cheaper, good at coding, math, and science.

Context window (K tokens)

128

Max output

65,536 tokens

Model Name

Claude 3.5 Sonnet

Company

Anthropic

Input Price($/1k token)

0.003

Output Price($/1k token)

0.015

Description

Anthropic's most updated intelligent model with highest capability, balanced for scaled deployments

Context window (K tokens)

200

Max output

8,192 tokens

Model Name

Claude 3 Haiku

Company

Anthropic

Input Price($/1k token)

0.00025

Output Price($/1k token)

0.00125

Description

Anthropic's fastest and most compact model for near-instant responsiveness, suitable for quick and accurate targeted tasks

Context window (K tokens)

200

Max output

4,096 tokens

Model Name

Claude 3 Opus

Company

Anthropic

Input Price($/1k token)

0.015

Output Price($/1k token)

0.075

Description

Anthropic' most powerful model for highly complex tasks, great performance in intelligence, fluency, and understanding

Context window (K tokens)

200

Max output

4,096 tokens

Model Name

Gemini 1.5 Pro

Company

Gemini

Input Price($/1k token)

0.0035

Output Price($/1k token)

0.0105

Description

Google's best multimodal model, suitable for various reasoning tasks

Context window (K tokens)

128

Max output

4,096 tokens

Model Name

Gemini 1.5 Flash

Company

Gemini

Input Price($/1k token)

0.000075

Output Price($/1k token)

0.0003

Description

Google's cost-efficient multimodal model, suitable for diverse and repetitive tasks

Context window (K tokens)

200

Max output

8,192 tokens

Last updated