Google

Google

Google, a subsidiary of Alphabet Inc., stands at the forefront of artificial intelligence (AI) and large language model (LLM) development. Beyond its renowned search engine and internet services, Google has made breakthrough advances in AI.

Latest Model Developments

Google’s latest Gemini series models showcase its AI capabilities:

Gemini 1.5 Pro: Features a 2 million token context window, making it one of the most powerful models available. Accessible through both AI Studio and Vertex AI platforms.
Gemini 1.5 Flash: Focused on high performance, achieving 265-320 tokens per second output speed with just 0.25-0.41 seconds latency.
Gemini 1.0 Pro: Offers a 33k context window with approximately 100 tokens per second output speed.

Technical Advantages

High Performance: Gemini series models excel in output speed and latency
Large Context Window: Latest Pro versions support up to 2 million tokens context window
Versatility: Supports function calling and JSON mode
Competitive Pricing: Flash series models offer more economical pricing options

Innovation Journey

Google’s contributions to the LLM field include the development of crucial models like BERT, LaMDA, and PaLM. Through initiatives like Google AI and DeepMind, the company continues to drive innovation in computer vision, robotics, and quantum computing.

Future Outlook

As AI technology rapidly evolves, Google remains committed to optimizing model performance and expanding applications, aiming to provide smarter and more efficient AI solutions. The company particularly focuses on maintaining high performance while ensuring model ethics and practicality.

Google: Gemini Flash 8B 1.5 Experimental

Text image 2 text

Gemini 1.5 Flash 8B Experimental is an experimental, 8B parameter version of the Gemini 1.5 Flash model. Usage of Gemini is subject to Google's [Gemini Term ...

Google 976.56K context $0 input tokens $0 output tokens

Google: Gemini 1.5 Flash-8B

Text image 2 text

Gemini 1.5 Flash-8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is hig ...

Google 976.56K context $0.037/M input tokens $0.15/M output tokens

Google: Gemini Flash 1.5

Text image 2 text

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, ...

Google 976.56K context $0.075/M input tokens $0.3/M output tokens $0.04/K image tokens

Google: Gemini Pro 1.5

Text image 2 text

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Prob...

Google 1.91M context $1.25/M input tokens $5/M output tokens $0.003/M image tokens

Google: Gemini Pro Vision 1.0

Text image 2 text

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from [Deepmind](https:// ...

Google 16K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Google: Gemma 2 27B

Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of t ...

Google 8K context $0.27/M input tokens $0.27/M output tokens

Google: Gemma 2 9B

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empo ...

Google 8K context $0.06/M input tokens $0.06/M output tokens

Google: Gemini Flash 2.0

Text image 2 text

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemini Pr ...

Google 976.56K context $0.1/M input tokens $0.4/M output tokens

Google: Gemini 2.0 Flash Experimental

Gemini 2.0 Flash offers a significantly faster time to first token (TTFT) compared to Gemini 1.5 Flash, while maintaining quality on par with larger models like [Gemini 1.5 ...

Google 976.56K context $0.2/M input tokens $0.6/M output tokens

FREE

Google: Gemini 2.0 Flash Experimental (free)

Gemini 2.0 Flash offers a significantly faster time to first token (TTFT) compared to Gemini 1.5 Flash, while maintaining quality on par with larger models like [Gemini 1.5 ...

Google 976.56K context $0 input tokens $0 output tokens

Google: Gemini 2.0 Flash Lite

Text image 2 text

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemi ...

Google 1M context $0.075/M input tokens $0.3/M output tokens

FREE

Google: Gemini Flash Lite 2.0 Preview (free)

Text image 2 text

Gemini Flash Lite 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like [Gemin ...

Google 976.56K context $0 input tokens $0 output tokens

FREE

Google: Gemini 2.0 Flash Thinking Experimental (free)

Text image 2 text

Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stro ...

Google 39.06K context $0 input tokens $0 output tokens

FREE

Google: Gemini 2.0 Flash Thinking Experimental (free)

Text image 2 text

Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stro ...

Google 39.06K context $0 input tokens $0 output tokens

FREE

Google: Gemini Pro 2.0 Experimental (free)

Text image 2 text

Gemini 2.0 Pro Experimental is a bleeding-edge version of the Gemini 2.0 Pro model. Because it's currently experimental, it will be heavily rate-limited by Google. Usage of Gemini is subject to ...

Google 1.91M context $0 input tokens $0 output tokens

Google: Gemini 2.5 Flash Image Preview (Nano Banana)

Text image 2 text image

Gemini 2.5 Flash Image Preview, a.k.a. "Nano Banana," is a state of the art image generation model with contextual understanding. It is capable of image generation, edits, and multi-turn conversation ...

Google 32K context $0.3/M input tokens $2.5/M output tokens $0.001/M image tokens

Google: Gemini 2.5 Flash Lite

Text image 2 text

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and bette ...

Google 1M context $0.1/M input tokens $0.4/M output tokens

gemini-exp-1206

Text image 2 text

Experimental release (December 6, 2024) of Gemini. ...

Google 8K context $4/M input tokens $16/M output tokens

Google: Gemini 1.5 Flash-8B

Text image 2 text

Gemini 1.5 Flash-8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective ...

Google 976.56K context $0.037/M input tokens $0.15/M output tokens

Google: Gemini Flash 1.5

Text image 2 text

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and vide ...

Google 976.56K context $0.075/M input tokens $0.3/M output tokens $0.04/K image tokens

FREE

Google: Gemini Pro 1.5 Experimental

Text image 2 text

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Problem solving...

Google 1.91M context $0 input tokens $0 output tokens $0.003/M image tokens

Google: Gemini Pro 1.5

Text image 2 text

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Problem solving...

Google 1.91M context $1.25/M input tokens $5/M output tokens $0.003/M image tokens

Google: Gemini Pro Vision 1.0

Text image 2 text

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from [Deepmind](https://deepmind.googl ...

Google 16K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Google: Gemini Pro 1.0

Google's flagship text generation model. Designed to handle natural language tasks, multiturn text and code chat, and code generation. See the benchmarks and prompting guidelines from [Deepmind](htt ...

Google 31.99K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Google: Gemma 2 27B

Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of text generation ...

Google 8K context $0.27/M input tokens $0.27/M output tokens

Google: Gemma 2 9B

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empowers developer ...

Google 8K context $0.06/M input tokens $0.06/M output tokens

FREE

Google: Gemma 2 9B (free)

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empowers developer ...

Google 8K context $0 input tokens $0 output tokens

Google: Gemma 3 27B

Text image 2 text

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, ...

Google 125K context $0.3/M input tokens $0.5/M output tokens

FREE

Google: Gemma 3 27B (free)

Text image 2 text

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, ...

Google 125K context $0 input tokens $0 output tokens

Google: PaLM 2 Code Chat 32k

PaLM 2 fine-tuned for chatbot conversations that help with code-related questions. ...

Google 31.99K context $1/M input tokens $2/M output tokens

Google: PaLM 2 Chat 32k

PaLM 2 is a language model by Google with improved multilingual, reasoning and coding capabilities. ...

Google 31.99K context $1/M input tokens $2/M output tokens

Google: PaLM 2 Code Chat 32k

PaLM 2 fine-tuned for chatbot conversations that help with code-related questions. ...

Google 31.99K context $1/M input tokens $2/M output tokens

40% OFF

Gemini Flash 1.5

Text image 2 text

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and vid ...

Google 976.56K context $0.15/M input tokens $0.6/M output tokens $0.04/K image tokens

40% OFF

Gemini 1.5 Pro

Text image 2 text

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including:Code generation Text generation Text editing Problem solving...

Google 1.91M context $2.5/M input tokens $10/M output tokens $0.003/M image tokens