Google

Google, a subsidiary of Alphabet Inc., is a leading developer of artificial intelligence (AI) and large language models (LLMs). Beyond its search engine and internet services, the company has made major advances in AI.

Latest Model Developments

Google’s latest Gemini series models showcase its AI capabilities:

  • Gemini 1.5 Pro: Features a 2 million token context window, one of the largest available. Accessible through both the AI Studio and Vertex AI platforms.
  • Gemini 1.5 Flash: Optimized for speed, with an output rate of 265-320 tokens per second and a latency of just 0.25-0.41 seconds.
  • Gemini 1.0 Pro: Offers a 33k token context window with an output speed of approximately 100 tokens per second.
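
As a rough illustration of what these speed figures imply, the sketch below estimates how long a response might take. It assumes total time is approximately latency plus output tokens divided by output speed; the function name and the 500-token response length are illustrative assumptions, and real timings vary with load and prompt size.

```python
def estimate_response_seconds(output_tokens: int,
                              tokens_per_second: float,
                              latency_seconds: float) -> float:
    """Approximate wall-clock time: initial latency plus generation time."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return latency_seconds + output_tokens / tokens_per_second


# Gemini 1.5 Flash figures quoted above: 265-320 tokens/s, 0.25-0.41 s latency.
# The 500-token response length is a hypothetical example.
best_case = estimate_response_seconds(500, 320, 0.25)
worst_case = estimate_response_seconds(500, 265, 0.41)
print(f"500 tokens: {best_case:.2f}s to {worst_case:.2f}s")
```

Under this simple model, generation time dominates for anything longer than a few dozen tokens, so output speed matters more than latency for long responses.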

Technical Advantages

  • High Performance: Gemini series models excel in output speed and latency
  • Large Context Window: Latest Pro versions support a context window of up to 2 million tokens
  • Versatility: Supports function calling and JSON mode
  • Competitive Pricing: Flash series models offer more economical pricing options
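
To make the pricing point concrete, here is a minimal sketch that compares the cost of one request on Flash and Pro. The per-million-token prices are the ones shown in the Gemini Flash 1.5 and Gemini Pro 1.5 listings on this page; the function name and the workload (a 100,000-token prompt with a 2,000-token reply) are assumptions chosen for illustration, and image token charges are ignored.

```python
def request_cost_usd(input_tokens: int, output_tokens: int,
                     input_price_per_m: float, output_price_per_m: float) -> float:
    """Cost of one request given prices in USD per million tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# (input price, output price) in USD per million tokens, from the listings.
PRICES = {
    "Gemini Flash 1.5": (0.075, 0.30),
    "Gemini Pro 1.5": (1.25, 5.00),
}

# Hypothetical workload: 100,000 input tokens and 2,000 output tokens.
costs = {name: request_cost_usd(100_000, 2_000, *price)
         for name, price in PRICES.items()}
for name, cost in costs.items():
    print(f"{name}: ${cost:.4f}")
```

For this workload the Flash request comes to under one cent, while the same request on Pro costs about 13.5 cents.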

Innovation Journey

Google’s contributions to the LLM field include the development of crucial models like BERT, LaMDA, and PaLM. Through initiatives like Google AI and DeepMind, the company continues to drive innovation in computer vision, robotics, and quantum computing.

Future Outlook

As AI technology rapidly evolves, Google remains committed to optimizing model performance and expanding applications, aiming to provide smarter and more efficient AI solutions. The company places particular emphasis on maintaining high performance while keeping its models ethical and practical.

Gemini 1.5 Flash 8B Experimental is an experimental, 8B parameter version of the Gemini 1.5 Flash model. Usage of Gemini is subject to Google's [Gemini Term ...

Google: Gemini Flash 8B 1.5 Experimental
Google
976.56K context $0 input tokens $0 output tokens

Gemini 2.0 Flash offers a significantly faster time to first token (TTFT) compared to Gemini 1.5 Flash, while maintaining quality on par with larger models like [Gemini 1.5 ...

Google: Gemini 2.0 Flash Experimental
Google
976.56K context $0.2/M input tokens $0.6/M output tokens
FREE

Gemini 2.0 Flash Thinking Mode is an experimental model that's trained to generate the "thinking process" the model goes through as part of its response. As a result, Thinking Mode is capable of stro ...

Google: Gemini 2.0 Flash Thinking Experimental (free)
Google
39.06K context $0 input tokens $0 output tokens

Experimental release (December 6, 2024) of Gemini. ...

gemini-exp-1206
Google
8K context $4/M input tokens $16/M output tokens

Gemini 1.5 Flash-8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective ...

Google: Gemini 1.5 Flash-8B
Google
976.56K context $0.037/M input tokens $0.15/M output tokens

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and vide ...

Google: Gemini Flash 1.5
Google
976.56K context $0.075/M input tokens $0.3/M output tokens $0.04/K image tokens
FREE

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including: code generation, text generation, text editing, problem solving...

Google: Gemini Pro 1.5 Experimental
Google
1.91M context $0 input tokens $0 output tokens $0.003/M image tokens

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including: code generation, text generation, text editing, problem solving...

Google: Gemini Pro 1.5
Google
1.91M context $1.25/M input tokens $5/M output tokens $0.003/M image tokens

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from [Deepmind](https://deepmind.googl ...

Google: Gemini Pro Vision 1.0
Google
16K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Google's flagship text generation model. Designed to handle natural language tasks, multiturn text and code chat, and code generation. See the benchmarks and prompting guidelines from [Deepmind](htt ...

Google: Gemini Pro 1.0
Google
31.99K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of text generation ...

Google: Gemma 2 27B
Google
8K context $0.27/M input tokens $0.27/M output tokens

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empowers developer ...

Google: Gemma 2 9B
Google
8K context $0.06/M input tokens $0.06/M output tokens
FREE

Gemma 2 9B by Google is an advanced, open-source language model that sets a new standard for efficiency and performance in its size class. Designed for a wide variety of tasks, it empowers developer ...

Google: Gemma 2 9B (free)
Google
8K context $0 input tokens $0 output tokens

PaLM 2 fine-tuned for chatbot conversations that help with code-related questions. ...

Google: PaLM 2 Code Chat 32k
Google
31.99K context $1/M input tokens $2/M output tokens

PaLM 2 is a language model by Google with improved multilingual, reasoning and coding capabilities. ...

Google: PaLM 2 Chat 32k
Google
31.99K context $1/M input tokens $2/M output tokens

40% OFF

Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and vid ...

Gemini Flash 1.5
Google
976.56K context $0.15/M input tokens $0.6/M output tokens $0.04/K image tokens
40% OFF

Google's latest multimodal model, supporting image and video in text or chat prompts. Optimized for language tasks including: code generation, text generation, text editing, problem solving...

Gemini 1.5 Pro
Google
1.91M context $2.5/M input tokens $10/M output tokens $0.003/M image tokens
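
The context figures in these listings appear to be raw token limits divided by 1,024 (for K) or 1,048,576 (for M), which would explain why a 2 million token window is shown as 1.91M and a 1 million token window as 976.56K. This is an inference from the numbers rather than something the listing states. The helper below is a hypothetical sketch of that formatting, not the site's actual code.

```python
def format_context(tokens: int) -> str:
    """Format a token count with binary K (1024) and M (1024**2) units."""
    if tokens >= 1024 ** 2:
        value, unit = tokens / 1024 ** 2, "M"
    else:
        value, unit = tokens / 1024, "K"
    # Two decimal places, with trailing zeros dropped so 8192 shows as "8K".
    text = f"{value:.2f}".rstrip("0").rstrip(".")
    return text + unit


# Assumed raw limits; only the formatted values appear in the listings.
for tokens in (2_000_000, 1_000_000, 40_000, 8_192):
    print(tokens, "->", format_context(tokens))
```

If the assumption holds, the raw limits of 2,000,000, 1,000,000, 40,000 and 8,192 tokens correspond to the listed values 1.91M, 976.56K, 39.06K and 8K.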