multimodal-model

Google的旗舰多模态模型，支持在文本或聊天提示中使用图像和视频，以获得文本或代码响应。请参阅Deepmind提供的基准和提示指南。使用Gemini需遵循Google的Gemini使用条款。 #multimodal ...

Google 16K context $0.5/M input tokens $1.5/M output tokens $0.003/M image tokens

Google最新的多模态模型，支持在文本或聊天提示中使用图像和视频。针对以下语言任务进行了优化：代码生成文本生成文本编辑问题解决推荐信息提取数据提取或生成 AI代理使用Gemini需遵循Google的Gemin使用条款。 #multimodal ...

Google 1.91M context $1.25/M input tokens $5/M output tokens $0.003/M image tokens

FREE

Google最新的多模态模型，支持在文本或聊天提示中使用图像和视频。针对以下语言任务进行了优化：代码生成文本生成文本编辑问题解决推荐信息提取数据提取或生成 AI代理使用Gemini需遵循Google的Gemin使用条款。 #multimodal ...

Google 1.91M context $0 input tokens $0 output tokens $0.003/M image tokens

Multimodal model