Natural language processing
DeepSeek-R1-Lite-Preview: A New Era in AI Reasoning
Artificial Intelligence (AI) is no stranger to breaking barriers, but every once in a while, a new innovation comes along that makes us pause and take note. One such innovation is **DeepSeek-R1
Read MoreHow Good is DeepSeek-R1-Lite Preview at Reasoning
DeepSeek-R1-Lite-Preview (R1LP) is a LRM (Large Reasoning Model) recently released by the Chinese company DeepSeek. According to the mode
Read MoreThe Best Free Text-to-Speech Model for Commercial Use
In a groundbreaking development for text-to-speech (TTS) technology, the Kokoro TTS model has emerged as the number one free and commercially available TTS solution. Built on top of the widely p
Read MoreMini MiniCPM-o 2.6: The 8B Parameter Multimodal LLM Beating GPT-4o
In a groundbreaking development, Mini CPM-o has taken the world of multimodal large language models (LLMs) by storm. With its 8-billion parameter architecture, it not only outperforms GPT-4o on
Read MoreUsing agents to breathe life into NPCs | by MichaelT Shomsky | Dec, 2024 | Medium
- Rifx.Online
- Programming , Roleplay , Natural Language Processing
- 20 Jan, 2025
OutlineSimulating 2d societies Create a 2d top down platformer Setup non playable characters that maintain a personal history Begin a framework to allow NPCs to conduct themselves in
Read MoreHow Agentic Hybrid Search Creates Smarter RAG Apps
By Ryan Michael If you’re building a [retrieval-augmented generation (RAG)](https://www.datastax.com/guides/what-is-retrieval-augmented-ge
Read MoreBuilding Human-Facing Agentic Systems: The Psychology and Sociology of Super Intelligence
Soundcloud Podcast Executive Summary“Power is in tearin
Read MoreA Developer’s Roadmap to Getting Started with AI in 2025
In my last article, I wrote about a learning path of AI for beginners, in an attempt to demystify its tools and applications for day-to-day tasks. This time, we’re taking a sharp left, so to
Read MoreSky-T1–32B-Preview : Open-sourced LLM outperforms OpenAI-o1
UC Berkley’s Sky-T1–32B-Preview details So, the first big Generative AI release is here where UC Berkeley’s open-sourced LLM, Sky-T1–32B has beaten OpenAI’s o1 on Maths and Coding benc
Read MoreSmolagents + Web Scraper + DeepSeek V3 Python = Powerful AI Research Agent
- Rifx.Online
- Programming , Natural Language Processing , Chatbots
- 19 Jan, 2025
In this video, I have a super quick tutorial showing you how to create a multi-agent chatbot with Smolagents, Web Scraper and DeepSeek V3 to make a powerful Agent Chatbot for your business or p
Read MoreMy GPT-evaluator got 1000% better with this simple trick.
I wish I had known this trick sooner. All my articles are free to read. [Non-members can read for free by clicking this link.](https://readmedium.com/my-gpt-evaluator-performance-impr
Read MoreAI to Code Like Engineers by 2025, Predicts Zuckerberg
- Rifx.Online
- Programming , Natural Language Processing , Ethics
- 14 Jan, 2025
In an era where technology evolves at breakneck speed, Mark Zuckerberg, the visionary behind Meta, has made a bold prediction: by 2025, artificial intelligence will code like mid-level engineer
Read MoreBuilding Graph RAG for structured and unstructured data.
RAG architecture is, by far, the most adapted and sophisticated solution for missing contextualisation of LLM’s. With no overhead of fine tuning, to a huge extent problems concerning the usage o
Read MoreCreate AI Agent CRUD Application with PydanticAI: Step by Step
It’s good to be back on Skolo Online! In our first tutorial of 2025 we will take a deep dive into PydanticAI. In this article we will cover the following:How to get started with Pydantic AI
Read MoreBuilding RAG research Multi-Agent with LangGraph
❓Introduction — Naive vs Agentic RAG 🧠 Project overview 📊 Results 🔚 ConclusionIn this article, we present a practical project featuring a RAG Research Multi-Agent tool deve
Read MoreAI Agents Simplified: How AI Agents Answer Questions Using Domain Knowledge
Demystify how enterprise AI agents are tailored to answer client questions using domain knowledge and strictly confine answers to domain knowledge. Background Have you ever wondered w
Read MoreUnlocking AI’s Potential: The Power of Chain of Thought Prompting
- Rifx.Online
- Generative AI , Chatbots , Natural Language Processing
- 14 Jan, 2025
The AI That Thinks Like a Human Have you ever been amazed by how ChatGPT can give lightning-fast responses but felt disappointed that the answers sometimes seem shallow or incomplete? You mi
Read MoreA Multimodal AI Assistant: Combining Local and Cloud Models
Use LangGraph, mlx and Florence2 to build an agent that answers complex image questions, with the option to run everything locally. *In this article we’ll use LangGraph in conjunction with
Read MoreBuilding Autonomous Multi-Tool Agents with Gemini 2.0 and LangGraph
A practical tutorial with full code examples for building and running multi-tool agents LLMs are remarkable — they can memorize vast amounts of information, answer general knowledge quest
Read MoreLet’s build a Text Analysis Pipeline with LangGraph Agents
In this article, I will present to you LangGraph, an incredible framework for building applications using graph-based workflows that would otherwise be impossible. I will share my experience wi
Read MoreTraditional vs. Generative AI for Sentiment Classification
5 ways to classify text (even without train data) Introduction This article focuses on the sentiment analysis of product reviews from the [Flipkart Customer Review dataset](https:
Read MoreGoogle Chrome just leveled up with Al!
It can now see and explain anything on your screen. Here are 11 powerful features you’ll wish you discovered earlier: 1. Google Lens in Chrome - Add words to your Google Lens s
Read MoreData Exploration with Agentic AI: Exploring the Titanic Dataset using SmolAgents
When I began my journey into machine learning a decade ago, like many of us, I started with the Titanic dataset. I vividly recall the thrill of performing my first exploratory data analysis (EDA
Read MoreA Practical Guide: Fine-Tuning Large Language Models with HuggingFace
Co-authors: Srijith Rajamohan, Ahmed Salhin, Todd Cook, Josh Frazier Every new announcement of Large Language Models (LLMs) tends to push performance to new heights, often [surpassing the pr
Read MorePhi-4: Microsoft’s Newest Small Language Model Specializing in Complex Reasoning
Phi-4: A Compact Powerhouse Redefining AI Reasoning for Real-World Applications Microsoft’s Phi-4 represents a remarkable advancement in the realm of small language models, excelling in
Read MoreExploring DeepSeek Version 3: A Technical Overview
The launch of DeepSeek Version 3 has sparked considerable excitement within the AI community, thanks to its remarkable capabilities and cost-efficiency. As an advanced open-weight large language
Read MoreTop 21 Famous RAG Frameworks in 2025
Revolutionizing AI with Advanced Retrieval-Augmented Generation Disclosure: I use GPT search. The entire article is still drafted by me.My writing style tends to be assertive and anal
Read MoreMastering CrewAI: Chapter 6 — Memory Basics | by Okan Yenigün | Jan, 2025 | Medium
Memory Types in CrewAI In CrewAI, the memory system enhances AI agents by enabling them to remember, reason, and learn from past interactions. Previous Chapter: This system comprises sev
Read MoreMastering CrewAI: Chapter 3— Knowledge
Knowledge — CrewAI In CrewAI, “knowledge” is the system that allows AI agents to access and utilize external information sources during their tasks. We can think of it as giving our agents
Read MoreBreak Into AI in 2025: Your Complete 6-Month Learning Path
📊 LinkedIn | ✍️ Medium | 💻 GitHub |🤝 Fiverr As
Read MoreQuickest Way to Learn AI in 2025
Why go long when short works? This blog is here to guide you on a clear, shortcut path to getting started in the AI world, while also let you explore the new AI products at the same time, to
Read MoreOpenAI’s Swarm (Part 2): A straightforward, local-first approach with Ollama and Pydantic
- Rifx.Online
- Programming , Natural Language Processing , Chatbots
- 03 Jan, 2025
A short code reference to build upon. TLDR: Combining the Ollama and Swarm frameworks presents a local-first approach to building intelligent AI agents. Ollama can run large language
Read MoreDeepSeek: The Chinese Rival to ChatGPT Aiming for AI Market Leadership
- Rifx.Online
- Natural Language Processing , Machine Learning , Ethics
- 29 Dec, 2024
DeepSeek V3, China’s bold AI model, challenges GPT-4 with 671B parameters, cost-efficient training, and innovation under U.S. sanctions.Ali Shaker- The Chinese startup DeepSeek has
Read MoreDramatically Reduce Inference Costs with DeepSeek-V3: A New Era in Open-Source LLMs
Introduction DeepSeek-V3 has emerged as the new heavy weight for open-source enthusiasts and enterprise users alike. Developed by a Chinese AI research company with a commitment to an
Read MoreMy LLM’s outputs got 1000% better with this simple trick.
I wish I had known this trick sooner. When I interned at Adobe Research (Bangalore) last summer, my job was to make open-source LLMs more aligned with the context. That means that no matte
Read MoreDeepSeek V3: The best Open-source LLM | by Mehul Gupta | Data Science in your pocket | Dec, 2024 | Medium
Better than Claude 3.5 Sonnet, GPT-4o, Llama3.1 405B The year is about to end and just now, China’s DeepSeek has released its open-sourced model DeepSeek-v3, which has outperformed al
Read MoreImage Inference through Multi-Modal LLM Models
MULTIMODAL AI | LLM | OPENAI | GEMINI | VISION This blog explores the capabilities of multi-modal models in image inference, highlighting their ability to integrate visual and text
Read MoreLlm Fine Tuning Guide: Do You Need It and How to Do It
Working with LLMs, one of the most popular questions we get is about fine-tuning. Every second client asks if they should do additional training on their model. In most cases the answer is no, t
Read MoreMulti-Agent AI Architecture with Large Language Models (LLMs)
End To End Multi-Agent Implementation With LLMs A multi-agent architecture consists of multiple autonomous agents collaborating to accomplish complex tasks. With the recent advancements in
Read MoreQwen QVQ-72B: Best open-sourced Image Reasoning LLM
Visual Reasoning LLM by Alibaba So. before ending 2024, Qwen (by Alibaba) is back with a bang and has released another open-sourced LLM, Qwen QVQ-72B which is a visual reasoning LLM i.e.
Read MoreQwen2.5: Redefining Efficiency in Large Language Models
Scaling Smarter and Learning Better with Reinforcement Learning A New Era of Language Models: Introducing Qwen2.5 Building on the breakthroughs in large language models (LLMs), **Qwe
Read MoreUse Gemini 2.0 to Build a Realtime Chat App with Multimodal Live API
- Rifx.Online
- Programming , Chatbots , Natural Language Processing
- 27 Dec, 2024
Gemini Development Tutorial Google launched Gemini 2.0 with the preview model Gemini 2.0 Flash Experimental, and you must have learned about it from videos and articles. This model has g
Read MoreAgentic Reflection for Natural Language to SQL Generation
Authors: Atul Varshneya, Toby Fotherby, Shweta Keshavanarayana Introduction Natural language to SQL (NL2SQL) conversion promises to democratize data access for non-technical users. Howeve
Read MoreBeyond Text2SQL: A New Frontier in Natural Language Database Queries
Introducing TAG, a Unified Model for Complex Question Answering Bridging the Gap Between AI and Databases The Promise and the Bottleneck Artificial intelligence (AI) is rapidly
Read MoreClaude Conversing With Neo4j Via MCP
Give LLMs the Power of Knowledge Graphs With Anthropic’s Model Context Protocol Last week, Anthropic published a new [Model Context Protocol](https://www.anthropic.com/news/model-context-pro
Read MoreConversational BI: Text2SQL Challenges & Solutions
The art of querying SQL Databases / Warehouses in Natural Language Introduction the future of BI is Conversational — this is what Gartner and other analysts have been telling us for th
Read MoreFrom Posts to Reports: Leveraging LLMs for Social Media Data Mining
How to instruct LLMs to filter restaurant posts and extract critical insights for business growth. Application Overview We’re living in the golden age of automation, powered by the r
Read MoreHow I Built an LLM App Based on Graph-RAG System with ChromaDB and Chainlit
End-to-end app with GUI and storing new knowledge on vector database in just 3 scripts Large language models (LLMs) and knowledge graphs are valuable tools to work with natural language proce
Read MoreHow I Fix Prompts for Flawless AI Responses Every Time
When ChatGPT first launched, prompt templates for nearly every industry and profession flooded the internet. You’ve probably come across posts like “The Best [N] ChatGPT Prompts for [Sector/Pr
Read MoreHow to build Agentic RAG using CrewAI and Langchain
In the rapidly evolving landscape of AI, the ability to provide accurate, context-aware responses to user queries is a game-changer. Retrieval-Augmented Generation (RAG) has emerged as a powerf
Read MoreHow to Scrape and Analyse Data for Free using AI: From Collection to Insight
Learn how to combine web scraping, proxies, and AI-powered language models to automate data extraction and gain actionable insights effortlessly. While some websites are straightforward t
Read MoreMarkItDown: A Powerful and must-have Toolkit for LLMs projects, tested with OpenAI and Gemini 2.0
*MarkItDown is a versatile utility developed by Microsoft that transforms various file formats into Markdown, making it an essential tool for developers working with Large Language Models (LLMs).
Read MoreMultilingual Vision Captioning: A Multi-Model Multimodal Approach to Image and Video Captioning and…
Using a combination of Meta’s Llama 3.2 11B Vision Instruct, Facebook’s 600M NLLB-200, and LLaVA-Next-Video 7B models to produce multilingual image and video captions, descriptive tags, a
Read MoreOpenAI Unveils O3 and O3-Mini: Transforming Language Processing
- Rifx.Online
- Natural Language Processing , Generative AI , Ethics
- 26 Dec, 2024
I’ve been following OpenAI’s updates closely, and I’m excited to share some breaking news: OpenAI just announced two of their newest language models, O3 and O3-mini. After watching a recent demon
Read MoreThe Rise of AI-Driven UX: Balancing Automation and Human-Centered Design in 2025
If there is one thing that’s altering the way we create user experience (UX) designs and conduct research in 2024, it is definitely artificial intelligence (AI). UX experts have already integrated
Read MoreBuild Smarter AI Agents with Long-Term, Persistent Memory and Atomic Agents
One of the most useful features in any AI assistant is undoubtedly the ability to have a persistent long-term memory. Whether the goal is to learn more about the user, adjust the AI’s b
Read MoreDeepSeek-VL2: Advancing Multimodal Understanding with Mixture-of-Experts Vision-Language Models
DeepSeek-VL2 represents a significant leap forward in the field of vision-language models, offering advanced capabilities for multimodal understanding. This innovative series of large Mixture-o
Read MoreBuilding a Research Assistant That Can Write to Google Docs (Part 2)
- Rifx.Online
- Programming , Technology , Natural Language Processing
- 19 Dec, 2024
A tool that might help with your homework ***This article is the second of a two part series where we use LangGraph and Tavily to build a simple research agent, which writes and refines sh
Read MoreBuilding a Research Agent That Can Write to Google Docs (Part 1)
A tool that might help with your homework ***This article is the first of a two part series where we use LangGraph and Tavily to build a simple research agent, which writes and refines sho
Read MoreQwen2-VL: A Vision Language Model That Runs Locally
This is an introduction to「Qwen2-VL」, a machine learning model that can be used with ailia SDK. You can easily use this model to create AI applications using [ailia SDK](h
Read MoreHow to Fine-Tune Llama-3.2 on your own data: A detailed guide
Introduction The release of Llama-3.2 collection from Meta marked an important milestone in the open-s
Read MoreHow to Use Structured Generation for LLM-as-a-Judge Evaluations
Structured generation is fundamental to building complex, multi-step reasoning agents in LLM evaluations — especially for open source models *Disclosure: I am a maintainer of [Opik](https
Read MoreStep by Step Midjourney Prompts Generation using ChatGPT 4o’s Vision Analysis
You know that feeling when you’re trying to create the perfect Midjourney prompt, but the words just don’t capture what’s in your mind? I’ve been there too. After countless hours of experimentat
Read MoreThe Role of Small Models in the LLM Era
A recent study extensively explored the role of Small Language Models (SLMs) in modern AI. The research provided a comprehensive analysis of SLMs, focusing on their capabilities, applicatio
Read MoreMeta’s Llama 3.3: The Evolution of Open-Source Large Language Models
Meta’s recent release of Llama 3.3 represents a milestone in the development of large language models (LLMs). It introduces improvements in scale, efficiency, and safety, while remaining open
Read MoreGoogle Gemini-Exp-1206: The new Best LLM
Beats GPT-4.0, OpenAI-o1, Claude3.5 Sonnet and Gemini 1.5 on LMArena Google Gemini, after a lacklustre debut in the GenAI space some months back, is picking up pace quite fast. It has
Read MoreMeta Llama3.3 : 70B model alternate for Llama3.1 405B
3rd model series in Llama after Llama3.1, Llama3.2 So Meta, in their final release for the year, launched Llama3.3, a 70B GenAI model that has shown some great results and has already b
Read MoreOpenAI’s O1 Model: A Detailed Exploration into the Future of AI
IntroductionArtificial intelligence has rapidly evolved over the last decade, leading to breakthroughs in natural language processing (NLP), machine learning, and multimodal applications. Op
Read MoreTesting OpenAI’s o1 Models: A Look at Chain-of-Thought Prompting for Journalism Tasks
New LLMs are touted as smarter problem-solvers — but how do they fare in real-world journalistic tasks like data visualization and headline selection? Last month, OpenAI released two new la
Read MoreSmarter and Faster: OpenAI o1 and o1 pro mode
Just 12 hours ago, OpenAI rolled out the new o1 model and o1 with pro mode. As you may already know, o1 models are the first series of models designed to think before answering, providing more det
Read MoreOptimizing Workflow Efficiency with LangGraph and Agents: Key Features, Use Cases, and Integration…
In the context of Generative AI and large language models (LLMs), agents and LangGraph are tools and frameworks that enhance the functionality of LLMs by allowing them to perform tasks,
Read MoreElevenLabs Conversational AI: Build Smart, Scalable Voice Agents in Minutes
Hello, creators! 🗣️ The way we interact with technology is changing, and conversational AI is at the forefront of this revolution. ElevenLabs has introduced a cut
Read MoreAnthropic’s Claude AI Can Now Copy Your Writing Style
I’ve been working as an AI ambassador for various organizations for months, and if there’s one thing most users keep asking, it’s how to make AI chatbots write like them. Essentially, they want
Read MoreMastering the 25 Types of RAG Architectures: When and How to Use Each One
Introduction Retrieval-Augmented Generation (RAG) architectures have revolutionized the way we approach information retrieval. These architectures bridge the gap between generating responses
Read MoreImplement A Simple YouTube Video Summarizer Application using Cursor AI
Introduction Cursor AI is revolutionizing software development by integrating advanced artificial intelligence into a familiar coding environment. As a fork of
Read MoreAlibaba QwQ: Better than OpenAI-o1 for reasoning?
32b open-sourced model beats o1 mini and competes with o1-preview A few days back, Alibaba released Marco-o1, a 7b reasoning model. Now, they have released another, improved version cal
Read MoreApple’s LLM Siri: A New Era in AI-Powered Conversations?
- Rifx.Online
- Generative AI , Chatbots , Natural Language Processing
- 30 Nov, 2024
ARTIFICIAL INTELLIGENCE The world of artificial intelligence has been rapidly evolving in recent years, with generative AI (genAI) seemingly being added to everything. Apple’s Siri, once a
Read MoreAlibaba Marco-o1 : Open-source alternative for OpenAI-o1
LLM for Open-Ended problems OpenAI-o1 was a revolutionary release, enabling detailed reasoning by LLMs for complex tasks like maths or physics problem, not limiting GenAI to just generic
Read MoreBuilding a Powerful Local AI Web Search Assistant with Llama 3.2:
Combine Large Language Models with Real-Time Web Search for Up-to-Date Answers and Context-Aware Insights 👨🏾💻 GitHub ⭐️ | 👔 [LinkedIn](https://www
Read MoreComplex SQL Joins with LangGraph and Waii
In the rapidly evolving landscape of data analytics, the ability to interact with data through natural language is becoming increasingly valuable. Conversational analytics aims to make complex d
Read MoreHow to make more reliable reports using AI — A Technical Guide
Technical guide, sharing my experience of working with AI Over the past year, I’ve been running my own AI software development and consulting agency, working with a variety of clients — fr
Read MoreIntroducing LightRAG: A New Era in Retrieval Augmented Generation
In the rapidly evolving field of natural language processing, Retrieval-Augmented Generation (RAG) systems have become essential for providing accurate, contextually rich responses. However, trad
Read MoreWhich AI Agent framework should i use? (CrewAI, Langgraph, Majestic-one and pure code)
With the advancement of large language models, it is now possible for artificial intelligence models to reason through a problem. At the start, we believed that the models would not be able to acc
Read MoreChatGPT 4 vs Claude 3.5 Sonnet: Who’s Better? Let’s Review:
I Ran Tests — ChatGPT 4 vs Claude 3 Sonnet, Who Wins? The buzz is on there’s a new chatbot player in town, the Claude 3 Sonnet. It’s been described as being better than ChatGPT, but there h
Read MoreClaude 3.5 Sonnet Unleashed: The AI Revolution You Can’t Afford to Miss | by Ahmed Bahaa Eldin | Medium
The technological area known as Artificial Intelligence is a relatively young field that has been progressing exceptionally fast, and among the newest examples of its development is known as [**
Read MoreClaude 3.5 vs GPT-4o: Key Differences You Need to Know
Anthropic’s latest release, Claude 3.5 Sonnet, enters a market where OpenAI’s GPT-4o has set a high benchmark, with [92% of Fortune 500](https://www.techbusinessnews.com.au/news/92-of-fortune-
Read MoreHow to Use Claude 3.5 Sonnet API for Creative Writing
Let’s talk about something that we all face during development: API Testing with Postman for your Development Team. Yeah, I’ve heard of it as well, [Postman is getting worse](https://www.reddit
Read MoreLlama 3.2 vs GPT-4 vs OpenAI O1 vs Gemini Ultra vs Claude 3.5: Which AI Model Is Right for You?
- Rifx.Online
- Generative AI , Natural Language Processing , Ethics
- 20 Nov, 2024
Artificial Intelligence has come a long way, and each new model seems to push the boundaries further. Today, I want to share my thoughts on five of the leading AI models out there: Meta’s Llama
Read More5 AI Projects You Can Build This Weekend (with Node.js)
5 Exciting AI Projects to Build in a Weekend with Node.js (Perfect for Beginners) Are you interested in building AI projects but short on time? With just Node.js and a weekend, you can dive into
Read MoreConversational AI for Customer Service: Best Practices and Key Steps for Success
In today’s fast-paced business environment, customer service plays a crucial role in building and maintaining customer loyalty. As businesses strive to offer personalized and efficient support,
Read MoreGet ChatGPT to Sound More Human: Essential Tips for Creating Natural, Engaging AI Conversations
Have you ever found your AI assistant sounding a bit too… mechanical? ChatGPT, while impressively capable, sometimes leans on overly formal or generic language. But with a few adjustments, you c
Read MoreIs Perplexity Pro a Smarter, More Efficient Way to Search the Web?
- Rifx.Online
- Chatbots , Natural Language Processing , Technology/Web
- 16 Nov, 2024
The Future of Search Is Perplexity Pro a Smarter, More Efficient Way to Search the Web? How does it compare to traditional search engines and is it worth the cost? Hey AI Friends and follo
Read MoreBuild Your Talking Voice AI Assistant Locally: Memory-Retaining Chatbot with Streamlit UI…
Step-by-Step Guide to Developing Your Own Voice AI with Context Memory and Real-Time Chat, Powered by Llama3.1 & Llama3.2 Models The concept of a voice-based personal assistant has gro
Read MoreBuilding a Reliable Text Classification Pipeline with LLMs: A Step-by-Step Guide
Overcoming common challenges in LLM-based text classification In this step-by-step tutorial, we’ll walk through how to use large language models (LLMs) to build a text classification pi
Read MoreComparing Leading Text-to-Image Generation Models for Adding Text to Images
A comparison of nine leading image generation models’ ability to render accurate text (words and phrases) within an image. In this post, we will assess the capabilities of nine state-of-the-a
Read MoreMultimodal AI for Conversational Human Motion
Written by Christian Safka and Keyu Chen In this exploration we’ll look at how multi
Read MoreQwen2.5-Coder, Cosmos Tokenizer, OpenCoder, and New SentenceTransformers: Great Times for Open…
- Rifx.Online
- Programming , Technology , Natural Language Processing
- 14 Nov, 2024
I want to highlight some standout open-source advancements that have really caught my eye:Qwen2.5-Coder Series: An open-source code LLM that’s giving GPT-4 a run for its money. **Co
Read MoreRAGate: Adaptive RAG for Conversational AI
Building Conversational AI systems is hard!!! It’s feasible but also complex, time-consuming, and resource-intensive. The challenge lies in designing systems that can understand and genera
Read MoreGLM-4-Voice 9B — Real-time Multilingual Voice Conversation AI — Install Locally in Minutes
How to set up GLM-4-Voice 9B for seamless real-time voice interaction in English and Chinese, and explore its unique architecture, low-latency response, and customizable voice attributes.
Read MoreBuild a RAG-based scientific ChatBot with LangChain, Streamlit & PubMed — Part 4(Put it all…
- Rifx.Online
- Chatbots , Natural Language Processing , Science
- 13 Nov, 2024
Hello and welcome to the last part of the series to build a scientific ChatBot with Langchain, Streamlit, and PubMed! In the previous part, we built the data persistence and RAG pipeline with v
Read MoreWhat’s new with Claude Sonnet 3.5 & Claude 3.5 Haiku?
First off, what is Claude? Claude is a language model created by Anthropic and it’s designed to help with tasks like answering quest
Read MoreWhy embedding matters when building a non-English RAG system — Multilingual embeddings
Why Embeddings are Key Embeddings are a cornerstone of modern generative AI, silently driving the functionality of many systems we interact with daily. At their simplest, embeddings are **num
Read MoreGemma vs. Llama vs. Mistral: Exploring Smaller AI Models
A Comparative Study of Small-Scale Language Models: Evaluating Gemma, Llama 3, and Mistral in Reading Comprehension Tasks Introduction Large Language Models (LLMs) have been evolving rapidl
Read MoreGoogle Gemini: Are Big Context Windows the Killer Feature?
Goggle’s upcoming LLM makes a massive move Barely eight months ago, a leaked Google email revealed the company was struggling to outpace its AI rivals. Not only was there [no moat](https://www
Read MoreLlama 3.2: The Next Generation of Lightweight, Instruction-Tuned Language Models: A Hands-On…
Discover LLaMA 3.2’s Key Innovations in Pruning, Knowledge Distillation, and Multilingual Performance, Plus a Hands-On Tutorial to Run Locally or Through Google Colab 👨🏾💻 [GitHub](https://
Read MoreAI-Powered OCR with Phi-3-Vision-128K: The Future of Document Processing
In the fast-evolving world of artificial intelligence, multimodal models are setting new standards for integrating visual and textual data. One of the latest breakthroughs is the **Phi-3-Visi
Read MoreArtifacts: Top Mindblowing uses of Claude 3.5 Sonent
Anthropic recently launched its most advanced LLM, “Claude 3.5 Sonnet,” and it’s mindblowing. People on social media called this model the most advanced LLM currently available. This AI model out
Read MoreLongRAG: Giving AI a Bigger Net to Catch More Fish in the Sea of Information
In my previous article, I introduced whether RAG would become obsolete due to long-context LLMs. Today, let’
Read MoreVisualize your RAG Data — Evaluate your Retrieval-Augmented Generation System with Ragas
How to use UMAP dimensionality reduction for Embeddings to show multiple evaluation Questions and their relationships to source documents with Ragas, OpenAI, Langchain and ChromaDB Retrieval
Read MoreHow to Improve LLMs with RAG
A beginner-friendly introduction w/ Python code This article is part of a larger series on using large language mo
Read MoreUnlocking Mixture-of-Experts (MoE) LLM : Your MoE model can be embedding model for free
Mixture-of-experts (MoE) LLM can be used as an embedding model for free. I recently found an interesting paper titled “Your Mixture-of-Experts LLM is Secretly an Embedding Model for Free.”
Read MoreUsers Will Fall in Love With OpenAI’s New GPT-4o Model. Literally.
- Rifx.Online
- Generative AI , Chatbots , Natural Language Processing
- 01 Nov, 2024
The company’s new GPT-4o can understand and mimic human speech and emotion In the iconic 2013 film Her, the protagonist develops an intense relationship — which morphs into a love affair
Read MoreOpenAI’s ‘Leaked’ GPT2 Model Has Everyone Stunned.
- Rifx.Online
- Chatbots , Generative AI , Natural Language Processing
- 01 Nov, 2024
On-Purpose leak? The influence that OpenAI has on the AI industry can’t be understated. Every move or decision makes headlines automatically… even if they don’t actually announce the thin
Read MoreOpenAI Confirms the Arrival of GPT-5, Poised to Bring Huge Improvements to Artificial Intelligence…
A netizen posted a GPT5 countdown post on x, saying that it was a conclusion drawn from clues from various platforms. The comment section has reached a climax with all kinds of opinions coming out
Read MoreHow to create an AI team to write compelling stories with CrewAI and Gemini Pro
Are you fascinated by the idea of AI generating stories that capture the imagination? If so, you’re not alone! In this article, we’ll dive into an introductory project that combines the powers of C
Read MoreRetrieval-Augmented Generation: Approaches, State of the Art, and Optimization Strategies
⭐ RAG is particularly useful in knowledge-intensive scenarios or domain-specific applications that require knowledge that’s continually updating. RAG has been popularized recently with its app
Read MoreFine-tuning LLama 3 with Unsloth
In this article I will show you how to fine-tune an LLM (Llama 3 from Meta) using Unsloth. I will also provide a way to use your own custom dataset. **Note
Read MoreQwen2.5 1.5b: the future of Mobile AI?
Local Testing and Evaluation of Alibaba Cloud’s Latest LLM. With llama-cpp-python and a DIY prompt catalog. In part one we explored together the innovations from Alibaba Cloud’s team with th
Read MoreIntroduction to LLaVA: A Multimodal AI Model
LLaVA is an end-to-end trained large multimodal model that is designed to understand and generate content based on both visual inputs (images) and textual instructions. It combines the capabil
Read MoreGoogle Releases Gemma — A Lightweight And Open Source Model
- Rifx.Online
- Natural Language Processing , Programming , Chatbots
- 29 Oct, 2024
In just a week, the world has witnessed the most groundbreaking AI advancements from two tech giants. OpenAI introduced its jaw-dropping AI video generator, [Sora](https://readmedium.com/3d1638
Read MoreClaude 3.5 Sonnet V/S GPT-4O: Which one is better
In November 2022, OpenAI launched ChatGPT, a model that has revolutionized how we search and interact with information. Next year, in March, an American startup,” Anthropic,” founded by ex-OpenAI
Read MoreAlibaba’s Open-Source Qwen: How It’s Revolutionizing AI and How You Can Use It
Alibaba has recently made waves in the AI world by open-sourcing its Qwen 2.5 models during the 2024 Apsara Conference. With over 100 models, Qwen spans multiple modalities including language, vi
Read MoreGenerating structured data from an image with GPT vision and Langchain
In today’s world, where visual data is abundant, the ability to extract meaningful information from images is becoming increasingly valuable. Langchain, a powerful framework for building applica
Read More