Parameters
Mini MiniCPM-o 2.6: The 8B Parameter Multimodal LLM Beating GPT-4o
In a groundbreaking development, Mini CPM-o has taken the world of multimodal large language models (LLMs) by storm. With its 8-billion parameter architecture, it not only outperforms GPT-4o on
Read MoreDeepSeek: The Chinese Rival to ChatGPT Aiming for AI Market Leadership
- Rifx.Online
- Natural Language Processing , Machine Learning , Ethics
- 29 Dec, 2024
DeepSeek V3, China’s bold AI model, challenges GPT-4 with 671B parameters, cost-efficient training, and innovation under U.S. sanctions.Ali Shaker- The Chinese startup DeepSeek has
Read MoreDramatically Reduce Inference Costs with DeepSeek-V3: A New Era in Open-Source LLMs
Introduction DeepSeek-V3 has emerged as the new heavy weight for open-source enthusiasts and enterprise users alike. Developed by a Chinese AI research company with a commitment to an
Read MoreDeepSeek V3: The best Open-source LLM | by Mehul Gupta | Data Science in your pocket | Dec, 2024 | Medium
Better than Claude 3.5 Sonnet, GPT-4o, Llama3.1 405B The year is about to end and just now, China’s DeepSeek has released its open-sourced model DeepSeek-v3, which has outperformed al
Read MoreMeta’s Llama 3.3: The Evolution of Open-Source Large Language Models
Meta’s recent release of Llama 3.3 represents a milestone in the development of large language models (LLMs). It introduces improvements in scale, efficiency, and safety, while remaining open
Read MoreAlibaba QwQ: Better than OpenAI-o1 for reasoning?
32b open-sourced model beats o1 mini and competes with o1-preview A few days back, Alibaba released Marco-o1, a 7b reasoning model. Now, they have released another, improved version cal
Read MoreThe Most Ambitious AI Crypto Project Ever is Here
- Rifx.Online
- Technology , Machine Learning , Blockchain
- 16 Nov, 2024
AI & Blockchains: A Match Made in Heaven, or a Scam? One of the founding fathers of modern AI wants to use the blockchain to train the world's largest open-source Large Language Model (LLM),
Read MoreMeet Qwen2.5-Coder-32B-Instruct -Coder -open source better than gpt4o.
- Rifx.Online
- Programming , Generative AI , Data Science
- 14 Nov, 2024
Meet Qwen2.5-Coder-32B-Coder, Your New AI Coding Buddy Have you ever wished that coding was a little easier, faster, and maybe even more fun? So, prepare to meet your new AI coding friend, Qw
Read MoreSmolLM2: Very Good Alternatives to Qwen2.5 and Llama 3.2
- Rifx.Online
- Technology , Machine Learning , Data Science
- 10 Nov, 2024
And it's fully open! Hugging Face has doubled down on their SmolLM initiative. They released SmolLM2: 1.7B, 360M, and 135M models trained on 11T tokens (against 1T for SmolLM). They release
Read MoreGoogle Releases Gemma — A Lightweight And Open Source Model
- Rifx.Online
- Natural Language Processing , Programming , Chatbots
- 29 Oct, 2024
In just a week, the world has witnessed the most groundbreaking AI advancements from two tech giants. OpenAI introduced its jaw-dropping AI video generator, [Sora](https://readmedium.com/3d1638
Read More