Models

01.AI: Yi Large

The Yi Large model was designed by 01.AI with the following usecases in mind: knowledge search, data classification, human-like chat bots, and customer service. It stands out for its multilingual pr ...

01 ai 32K context $3/M input tokens $3/M output tokens

Mistral Large 2411

Text 2 text

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch anno ...

MistralAI 125K context $2/M input tokens $6/M output tokens

Mistral Large 2407

Text 2 text

This is Mistral AI's flagship model, Mistral Large 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch anno ...

MistralAI 125K context $2/M input tokens $6/M output tokens

Mistral: Pixtral Large 2411

Text image 2 text

Pixtral Large is a 124B open-weights multimodal model built on top of Mistral Large 2. The model is able to understand documents, charts and natural images. The mode ...

MistralAI 125K context $2/M input tokens $6/M output tokens $0.003/M image tokens

Perplexity: Llama 3.1 Sonar 70B

Text 2 text

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is a normal offline LLM, but the [online version](/perpl ...

Perplexity 128K context $1/M input tokens $1/M output tokens

Perplexity: Llama 3.1 Sonar 8B

Text 2 text

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is a normal offline LLM, but the [online version](/perpl ...

Perplexity 128K context $0.2/M input tokens $0.2/M output tokens

OpenChat 3.5 7B

Text 2 text

OpenChat 7B is a library of open-source language models, fine-tuned with "C-RLFT (Conditioned Reinforcement Learning Fine-Tuning)" - a strategy inspired by offline reinforcement learning. It has been ...

Openchat 8K context $0.055/M input tokens $0.055/M output tokens

OpenAI: GPT-3.5 Turbo 16k (older v1106)

Text 2 text

An older GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Sep 2021. ...

OpenAI 16K context $1/M input tokens $2/M output tokens

FREE

Toppy M 7B (free)

Text 2 text

# Free

A wild 7B parameter model that merges several models using the new task_arithmetic merge method from mergekit. List of merged models:NousResearch/Nous-Capybara-7B-V1.9 [HuggingFaceH4/zephyr-7b-b...

Undi95 4K context $0 input tokens $0 output tokens

Meta: LlamaGuard 2 8B

Text 2 text

This safeguard model has 8B parameters and is based on the Llama 3 family. Just like is predecessor, LlamaGuard 1, it can do both prompt and respons ...

Meta Llama 8K context $0.18/M input tokens $0.18/M output tokens

Mixtral 8x7B (base)

Text 2 text

A pretrained generative Sparse Mixture of Experts, by Mistral AI. Incorporates 8 experts (feed-forward networks) for a total of 47B parameters. Base model (not fine-tuned for instructions) - see [Mix ...

MistralAI 32K context $0.54/M input tokens $0.54/M output tokens

Mistral Small

Text 2 text

Cost-efficient, fast, and reliable option for use cases such as translation, summarization, and sentiment analysis. ...

MistralAI 31.25K context $0.2/M input tokens $0.6/M output tokens

Mistral Tiny

Text 2 text

This model is currently powered by Mistral-7B-v0.2, and incorporates a "better" fine-tuning than Mistral 7B, inspired by community work. It's best used for larg ...

MistralAI 31.25K context $0.25/M input tokens $0.25/M output tokens

Google: Gemini Pro 1.0

Text 2 text

Google's flagship text generation model. Designed to handle natural language tasks, multiturn text and code chat, and code generation. See the benchmarks and prompting guidelines from [Deepmind](htt ...

Google 31.99K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Llama 3 Lumimaid 70B

Text 2 text

The NeverSleep team is back, with a Llama 3 70B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necess ...

Meta Llama 8K context $3.375/M input tokens $4.5/M output tokens

Models

01.AI: Yi Large

Mistral Large 2411

Mistral Large 2407

Mistral: Pixtral Large 2411

Perplexity: Llama 3.1 Sonar 70B

Perplexity: Llama 3.1 Sonar 8B

OpenChat 3.5 7B

OpenAI: GPT-3.5 Turbo 16k (older v1106)

Toppy M 7B (free)

Meta: LlamaGuard 2 8B

Mixtral 8x7B (base)

Mistral Small

Mistral Tiny

Google: Gemini Pro 1.0

Llama 3 Lumimaid 70B

Categories

Tags