Models
A cost-effective, fast, and reliable option for use cases such as translation, summarization, and sentiment analysis. ...
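This model is currently powered by Mistral-7B-v0.2 and incorporates a "better" fine-tuning than Mistral 7B, inspired by community work. It is best suited to high-volume batch processing tasks where cost is a significant factor but reasoning capability is not critical. ...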
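Google's flagship text generation model. Designed to handle natural language tasks, multi-turn text and code chat, and code generation. See the benchmarks and prompting guidelines from Deepmind. Usage of Gemini is subject to Google's Gemini Terms of Use. ...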
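The NeverSleep team is back with a Llama 3 70B fine-tune trained on their curated roleplay data. Lumimaid strikes a balance between eRP and RP and is designed to stay serious when necessary while remaining unrestricted. To improve its overall intelligence and chat capability, roughly 40% of the training data was not roleplay. This gives the model a broad base of knowledge to draw on while keeping roleplay as its primary strength. Usage of this model is subject to [Meta's Acceptable Use Policy](https://llama ...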
A large LLM created by merging two fine-tuned Llama 70B models into a single 120B model. Combines Xwin and Euryale. Credits to @chargoddard for developing the framework used to merge the models - mergekit. [@Undi95](h...
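Google's flagship multimodal model, supporting images and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from Deepmind. Usage of Gemini is subject to Google's Gemini Terms of Use. #multimodal ...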
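WizardLM-2 7B is the smaller variant of Microsoft AI's latest Wizard model. It is the fastest, and its performance is comparable to that of leading open-source models 10x its size. It is a fine-tune of Mistral 7B Instruct, using the same technique as WizardLM-2 8x22B. To read more ...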
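Google's latest multimodal model, supporting images and video in text or chat prompts. Optimized for the following language tasks: code generation, text generation, text editing, problem solving, recommendations, information extraction, data extraction or generation, and AI agents. Usage of Gemini is subject to Google's Gemini Terms of Use. #multimodal ...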
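command-r-plus-08-2024 is an update of Command R+ with roughly 50% higher throughput and 25% lower latency than the previous Command R+ version, while keeping the hardware footprint the same. Read the launch post here. ...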
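DBRX is a new open-source large language model developed by Databricks. At 132B parameters, it outperforms existing open-source LLMs such as Llama 2 70B and Mixtral-8x7b on standard industry benchmarks for language understanding, programming, math, and logic. It uses a fine-grained mixture-of-experts (MoE) architecture. 36B parameters are active on any input. On 12T of text and code data, it was ...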
The Jamba-Instruct model, introduced by AI21 Labs, is an instruction-tuned variant of their hybrid SSM-Transformer Jamba model, specifically optimized for enterprise applications. 256K Context Win...
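Euryale 70B v2.1 is a model focused on creative roleplay from Sao10k. Better prompt adherence. Better anatomy and spatial awareness. Adapts better to unique and custom formatting and reply formats. Very creative, with many unique styles. Not restrictive during roleplay...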
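A high-performing, industry-standard 7.3B parameter model, optimized for speed and context length. Mistral 7B Instruct has multiple version variants, and this is the latest version. ...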
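Phi-3 Mini is a powerful 3.8B parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference adjustments, it excels in tasks involving common sense, mathematics, logical reasoning, and code processing. At the time of release, Phi-3 Medium demonstrated state-of-the-art performance among lightweight models. The model is static, trained on an offline dataset with a cutoff date of October 2023. ...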
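Phi-3 128K Medium is a powerful 14-billion parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference adjustments, it excels in tasks involving common sense, mathematics, logical reasoning, and code processing. At the time of release, Phi-3 Medium demonstrated state-of-the-art performance among lightweight models. In the MMLU-Pro evaluation, the model even comes close to a Llama3 70B level of performance. For 4k context length, try [Phi-3 ...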
Categories
Tags
- Multilingual chatbots
- Data classification
- Machine learning
- 01 ai
- Natural language processing
- Programming
- Customer support ai
- Data science
- Chatbots
- Yi large
- Knowledge retrieval
- Generative ai
- Ssm transformer
- Jamba 1 5 large
- Resource efficiency
- Ai21
- Document summarization
- Technology
- Context window
- Mamba based model
- Multilingual analysis
- 256k context window
- Jamba 1 5 mini
- Jamba
- Instruction tuning
- Enterprise optimization
- Large document processing
- Safety features
- Goliath 120b
- Model merging
- Large language model
- Mergekit
- Fine tuned llama
- Alpindale
- Prose quality
- Claude 3 alternative
- Qwen2 based
- Magnum 72b
- Roleplay data
- Roleplay
- New
- Document analysis
- Visual question answering
- Multimodal processing
- Amazon
- Real time interaction
- Nova lite v1
- Computer vision
- Translation
- Interactive chat model
- Nova micro v1
- Low latency text
- Text summarization tool
- Cost effective nlp
- Financial document processing
- Nova pro v1
- Video understanding
- Multimodal analysis
- Claude 3 quality
- Qwen 25 integration
- Magnum v4 72b
- Fine tuned model
- Anthracite org
- Prose generation
- Instant responsiveness
- Compact model
- Targeted performance
- Multimodal
- Anthropic
- Claude 3 haiku
- Deep understanding
- Complex task solving
- Claude 3 opus
- Advanced intelligence
- High fluency
- Claude 3 sonnet
- Scaled deployments
- Enterprise workloads
- Cost effective ai
- Benchmark results
- Real time moderation
- Claude 35 haiku
- Code completion
- Rapid response
- Data extraction
- Predictive analytics
- Data science expertise
- Visual processing
- Agentic tasks
- Claude 35 sonnet
- Autonomous systems
- Autonomous coding
- Text generation
- Baichuan3 turbo
- Baichuan
- Technologyweb
- Conversational systems
- Multilingual support
- Baichuan4
- Contextual understanding
- Conversational ai
- Ethics
- Multilingual ai
- High throughput
- Hardware efficiency
- Cohere
- Command r plus
- Low latency
- Performance upgrade
- Command r
- Complex workflows
- Conversational language tasks
- Retrieval augmented generation
- Code generation
- Instruction following
- Programmingscripting
- Long context
- Language tasks
- Command
- Language understanding
- Databricks
- Code pre training
- Dbrx
- Mixture of experts
- Multi token prediction
- Deepseek chat v3
- Deepseek
- Hot
- Load balancing strategy
- Multi head latent attention
- Mit licensed software
- Commercial use ai model
- Open source ai model
- Deepseek r1
- Technical report ai
- Voice assistants
- Free
- Discount
- Eva unit 01
- Roleplay model
- Eva llama 333 70b
- Creative finetune
- Narrative generation
- Storywriting ai
- Open source
- Robotics
- Complex instructions
- Fast ttft
- Gemini 20 flash exp
- Coding capabilities
- Multimodal understanding
- Thinking process generation
- Advanced reasoning
- Reasoning capabilities
- Gemini 20 flash thinking exp
- Experimental ai model
- Real time processing
- Gemini flash 15 8b
- Chat transcription
- Cost effective translation
- Visual understanding
- Content generation
- Gemini flash 15
- High frequency tasks
- Text editing
- Multimodal model
- Ai agents
- Gemini pro 15
- Text code response
- Gemini pro vision
- Image video processing
- Chat prompts
- Multiturn chat
- Gemini pro
- Gemma 2 27b it
- Reasoning
- Summarization
- Question answering
- Open source nlp
- Efficient ai
- Language model
- Performance optimization
- Gemma 2 9b it
- Code chatbot
- Palm 2 codechat bison 32k
- Developer support
- Coding qa
- Programming assistant
- Advanced small model
- Sota intelligence
- Multimodal inputs
- Gpt 4o mini
- Openai
- Text and image processing
- Gpt 4o
- Fast ai model
- Llama 2 integration
- Roleplay ai
- Extended context
- Fine tuning
- Gryphe
- Mythomax l2 13b
- Fictional narrative model
- Creative writing tool
- Mergekit language model
- Rifxonline
- Mn mag mell r1
- Roleplay storytelling
- Emotional intelligence
- Customer support
- Safety
- Inflection
- Chatbot safety
- Emotional intelligence chatbot
- Inflection 3 pi
- Roleplay scenarios
- Task optimization
- Precise guidelines
- Json output
- Inflection 3 productivity
- Multilingual
- Closed source comparison
- Meta llama
- Human evaluations
- Pre trained model
- Meta policy
- Llama 31 405b
- Multimodal integration
- Image captioning
- Visual linguistic ai
- Llama 32 11b vision
- Efficient language processing
- Multilingual text analysis
- Dialogue summarization
- Low resource nlp
- Llama 32 1b
- Dialogue generation
- Llama 32 3b
- Complex reasoning
- Text summarization
- Multilingual language model
- Multimodal ai
- Visual reasoning
- Llama 32 90b vision
- 70b language model
- Instruction tuned llm
- Multilingual text generation
- Llama 33 70b
- Multilingual dialogue model
- Llama guard 2 8b
- Safety classification
- Prompt response analysis
- Content moderation
- Llama 3 family
- Logical reasoning
- Code processing
- Advanced mathematics
- Phi 3 medium 128k
- Microsoft azure
- Mathematics tasks
- Phi 3 mini 128k
- Supervised fine tuning
- Dense transformer
- High quality datasets
- Phi 35 mini 128k
- Fast performance
- Model finetuning
- Mistral 7b
- Wizardlm 2 7b
- Model optimization
- Image understanding
- Minimax 01
- Vit mlp llm
- Text generation model
- Lightning attention
- Mistralai
- Codestral mamba
- Code reasoning model
- Transformer alternative
- Infinite sequence inference
- Large context window
- Parameter model
- Context length
- Industry standard ai
- Speed optimization
- Long context window
- Coding languages
- Mistral large
- Reasoning model
- Multilingual model
- Large context length
- 12b parameters
- Function calling
- Mistral nemo
- Mistral small
- Fast translation model
- Sentiment analysis ai
- Cost efficient nlp
- Fine tuned ai
- Large scale tasks
- Cost effective processing
- Mistral tiny
- Batch processing model
- Generative model
- Pretrained experts
- Sparse mixture of experts
- Feed forward networks
- Mixtral 8x7b
- Image to text
- Pixtral 12b
- Mistral ai
- Torrent release
- October 2023 data
- Natural image processing
- Pixtral large 2411
- Chart interpretation
- Chatbot integration
- Customer support automation
- Moonshot v1 8k
- Moonshot
- Education
- Semantic understanding
- Uncensored
- Intelligent dialogue
- Llama 3 lumimaid 70b
- Curated data
- Uncensored chatbot
- Language processing
- Improved dataset
- Finetune model
- Chat optimization
- Llama 31 lumimaid 8b
- Multi turn conversation
- Structured output
- Code generation skills
- Powerful steering capabilities
- Advanced agentic capabilities
- Nousresearch
- Hermes 3 llama 31 405b
- Hermes 3 llama 31 70b
- Roleplaying enhancement
- Science
- Chatgpt 4o latest
- Research evaluation tool
- Rate limited chatbot
- Dynamic language model
- Parallel function calling
- Reproducible outputs
- Gpt 35 turbo
- Improved instruction following
- Json mode
- Experimental model
- Rate limited
- O1 mini
- Phd level accuracy
- Stem optimization
- Advanced scientific computing
- O1 preview
- O1
- Chain of thought reasoning
- Reinforcement learning
- Conditioned reinforcement learning
- Mixed quality data
- Openchat
- Language models library
- Openchat 7b
- Offline ai model
- Cost efficient llm
- Perplexity
- High performance language model
- Llama 31 sonar large 128k chat
- Fast processing llm
- Llama 31 sonar small 128k chat
- Qwen
- Multilingual understanding
- Coding and reasoning
- Swiglu activation
- Qwen 2 7b
- Transformer model
- Video question answering
- Multilingual text recognition
- Mobile robot integration
- Qwen 2 vl 72b
- Qwen 2 vl 7b
- Structured data understanding
- Long context processing
- Qwen 25 72b
- Chatbot role play
- Qwen 25 7b
- Qwq 32b preview
- Safety considerations
- Ai reasoning
- Recursive loops
- Language mixing
- 40off
- Unique formatting
- L3 euryale 70b
- Creative roleplay
- Prompt adherence
- Spatial awareness
- Creative logic
- Strategic merge
- General knowledge
- Roleplaying model
- L3 lunaris 8b
- Storytelling ai
- Character interaction
- L31 euryale 70b
- L33 euryale 70b
- Character simulation
- Multilingual embeddings
- Semantic search
- Text embedding 3 large
- Similarity matching
- Text similarity matching
- Budget friendly nlp
- Cost effective embeddings
- Text embedding 3 small
- Creative writing model
- Engaging prose
- Rocinante 12b
- Thedrummer
- Undi95
- Mythomax l2 b13
- Recreation trial
- Remm slerp l2 13b
- Updated models
- Merge model
- Task arithmetic
- Toppy m 7b
- Uncensored ai
- Parameter blending
- Visual comprehension
- Object recognition
- X ai
- Style analysis
- Grok 2 vision