Models

Perplexity: R1 1776

Note: As this model does not return tags, thoughts will be streamed by default directly to the content field. R1 1776 is a version of DeepSeek-R1 that has been post-trained to remove censo ...

Perplexity 125K context $2/M input tokens $8/M output tokens

20% OFF

OpenAI: o3 Mini High

Text 2 text

# Discount

OpenAI o3-mini-high is the same model as o3-mini with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly ex ...

OpenAI 195.31K context $1.1/M input tokens $4.4/M output tokens

20% OFF

DeepSeek: R1

Text 2 text

# Hot # Discount

DeepSeek-R1 1. Introduction We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (R ...

DeepSeek 160K context $3/M input tokens $8/M output tokens

Google: Gemini Flash 2.0

Text image 2 text

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemini Pr ...

Google 976.56K context $0.1/M input tokens $0.4/M output tokens

DeepSeek: DeepSeek R1 Distill Llama 70B

Text 2 text

DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The m ...

DeepSeek 128K context $0.23/M input tokens $0.69/M output tokens

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge. Crea ...

Rifx.Online 8K context $0.03/M input tokens $0.06/M output tokens

Inflatebot: Mag Mell R1 12B

Text 2 text

Mag Mell is a merge of pre-trained language models created using mergekit, based on Mistral Nemo. It is a great roleplay and storytelling model which combines the best part ...

Rifx.Online 15.63K context $0.9/M input tokens $0.9/M output tokens

Meta: Llama 3.3 70B Instruct

Text 2 text

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model is optimiz ...

Meta Llama 128K context $0.13/M input tokens $0.4/M output tokens

text-embedding-3-small

Embedding

text-embedding-3-small is OpenAI's cost-effective text embedding model, serving as the lightweight version in the text-embedding-3 series. This model maintains good performance while offering a more ...

OpenAI $0.02/M input tokens $0 output tokens

Amazon: Nova Lite 1.0

Text image 2 text

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite can handle real-time cu ...

Amazon 292.97K context $0.06/M input tokens $0.24/M output tokens

Toppy M 7B

Text 2 text

A wild 7B parameter model that merges several models using the new task_arithmetic merge method from mergekit. List of merged models:NousResearch/Nous-Capybara-7B-V1.9 [HuggingFaceH4/zephyr-7b-b...

Undi95 4K context $0.07/M input tokens $0.07/M output tokens

ReMM SLERP 13B

Text 2 text

A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge ...

Undi95 4K context $1.125/M input tokens $1.125/M output tokens

Mistral: Pixtral 12B

Text image 2 text

The first image to text model from Mistral AI. Its weight was launched via torrent per their tradition: https://x.com/mistralai/status/1833758285167722836 ...

MistralAI 4K context $0.1/M input tokens $0.1/M output tokens $0.144/K image tokens

Phi-3.5 Mini 128K Instruct

Text 2 text

Phi-3.5 models are lightweight, state-of-the-art open models. These models were trained with Phi-3 datasets that include both synthetic data and the filtered, publicly available websites data, with a ...

Microsoft Azure 125K context $0.1/M input tokens $0.1/M output tokens

OpenAI: ChatGPT-4o

Text image 2 text

Dynamic model continuously updated to the current version of GPT-4o in ChatGPT. Intended for research and evaluation. Note: This model is currently experimental and not suitable fo ...

OpenAI 125K context $5/M input tokens $15/M output tokens $0.007/M image tokens

Models

Perplexity: R1 1776

OpenAI: o3 Mini High

DeepSeek: R1

Google: Gemini Flash 2.0

DeepSeek: DeepSeek R1 Distill Llama 70B

Sao10K: Llama 3 8B Lunaris

Inflatebot: Mag Mell R1 12B

Meta: Llama 3.3 70B Instruct

text-embedding-3-small

Amazon: Nova Lite 1.0

Toppy M 7B

ReMM SLERP 13B

Mistral: Pixtral 12B

Phi-3.5 Mini 128K Instruct

OpenAI: ChatGPT-4o

Categories

Tags