Natural language processing

DeepSeek-R1-Lite-Preview: A New Era in AI Reasoning

Artificial Intelligence (AI) is no stranger to breaking barriers, but every once in a while, a new innovation comes along that makes us pause and take note. One such innovation is **DeepSeek-R1

How Good is DeepSeek-R1-Lite Preview at Reasoning

DeepSeek-R1-Lite-Preview (R1LP) is an LRM (Large Reasoning Model) recently released by the Chinese company DeepSeek. According to the mode

The Best Free Text-to-Speech Model for Commercial Use

In a groundbreaking development for text-to-speech (TTS) technology, the Kokoro TTS model has emerged as the number one free and commercially available TTS solution. Built on top of the widely p

Mini MiniCPM-o 2.6: The 8B Parameter Multimodal LLM Beating GPT-4o

In a groundbreaking development, Mini CPM-o has taken the world of multimodal large language models (LLMs) by storm. With its 8-billion parameter architecture, it not only outperforms GPT-4o on

Using agents to breathe life into NPCs | by MichaelT Shomsky | Dec, 2024 | Medium

Outline: Simulating 2D societies; create a 2D top-down platformer; set up non-playable characters that maintain a personal history; begin a framework to allow NPCs to conduct themselves in

How Agentic Hybrid Search Creates Smarter RAG Apps

By Ryan Michael If you’re building a [retrieval-augmented generation (RAG)](https://www.datastax.com/guides/what-is-retrieval-augmented-ge

A Developer’s Roadmap to Getting Started with AI in 2025

In my last article, I wrote about a learning path of AI for beginners, in an attempt to demystify its tools and applications for day-to-day tasks. This time, we’re taking a sharp left, so to

Sky-T1–32B-Preview : Open-sourced LLM outperforms OpenAI-o1

UC Berkeley’s Sky-T1–32B-Preview details So, the first big Generative AI release is here where UC Berkeley’s open-sourced LLM, Sky-T1–32B has beaten OpenAI’s o1 on Maths and Coding benc

Smolagents + Web Scraper + DeepSeek V3 Python = Powerful AI Research Agent

In this video, I have a super quick tutorial showing you how to create a multi-agent chatbot with Smolagents, Web Scraper and DeepSeek V3 to make a powerful Agent Chatbot for your business or p

My GPT-evaluator got 1000% better with this simple trick.

I wish I had known this trick sooner. All my articles are free to read. [Non-members can read for free by clicking this link.](https://readmedium.com/my-gpt-evaluator-performance-impr

AI to Code Like Engineers by 2025, Predicts Zuckerberg

In an era where technology evolves at breakneck speed, Mark Zuckerberg, the visionary behind Meta, has made a bold prediction: by 2025, artificial intelligence will code like mid-level engineer

Building Graph RAG for structured and unstructured data.

RAG architecture is, by far, the most adopted and sophisticated solution for missing contextualisation of LLMs. With no overhead of fine tuning, to a huge extent problems concerning the usage o

Create AI Agent CRUD Application with PydanticAI: Step by Step

It’s good to be back on Skolo Online! In our first tutorial of 2025 we will take a deep dive into PydanticAI. In this article we will cover the following: How to get started with Pydantic AI

Building RAG research Multi-Agent with LangGraph

❓ Introduction: Naive vs Agentic RAG 🧠 Project overview 📊 Results 🔚 Conclusion. In this article, we present a practical project featuring a RAG Research Multi-Agent tool deve

AI Agents Simplified: How AI Agents Answer Questions Using Domain Knowledge

Demystify how enterprise AI agents are tailored to answer client questions using domain knowledge and strictly confine answers to domain knowledge. Background Have you ever wondered w

Unlocking AI’s Potential: The Power of Chain of Thought Prompting

The AI That Thinks Like a Human Have you ever been amazed by how ChatGPT can give lightning-fast responses but felt disappointed that the answers sometimes seem shallow or incomplete? You mi

A Multimodal AI Assistant: Combining Local and Cloud Models

Use LangGraph, mlx and Florence2 to build an agent that answers complex image questions, with the option to run everything locally. *In this article we’ll use LangGraph in conjunction with

Building Autonomous Multi-Tool Agents with Gemini 2.0 and LangGraph

A practical tutorial with full code examples for building and running multi-tool agents LLMs are remarkable — they can memorize vast amounts of information, answer general knowledge quest

Let’s build a Text Analysis Pipeline with LangGraph Agents

In this article, I will present to you LangGraph, an incredible framework for building applications using graph-based workflows that would otherwise be impossible. I will share my experience wi

Traditional vs. Generative AI for Sentiment Classification

5 ways to classify text (even without train data) Introduction This article focuses on the sentiment analysis of product reviews from the [Flipkart Customer Review dataset](https:

Google Chrome just leveled up with AI!

It can now see and explain anything on your screen. Here are 11 powerful features you’ll wish you discovered earlier: 1. Google Lens in Chrome - Add words to your Google Lens s

Data Exploration with Agentic AI: Exploring the Titanic Dataset using SmolAgents

When I began my journey into machine learning a decade ago, like many of us, I started with the Titanic dataset. I vividly recall the thrill of performing my first exploratory data analysis (EDA

A Practical Guide: Fine-Tuning Large Language Models with HuggingFace

Co-authors: Srijith Rajamohan, Ahmed Salhin, Todd Cook, Josh Frazier Every new announcement of Large Language Models (LLMs) tends to push performance to new heights, often [surpassing the pr

Phi-4: Microsoft’s Newest Small Language Model Specializing in Complex Reasoning

Phi-4: A Compact Powerhouse Redefining AI Reasoning for Real-World Applications Microsoft’s Phi-4 represents a remarkable advancement in the realm of small language models, excelling in

Exploring DeepSeek Version 3: A Technical Overview

The launch of DeepSeek Version 3 has sparked considerable excitement within the AI community, thanks to its remarkable capabilities and cost-efficiency. As an advanced open-weight large language

Top 21 Famous RAG Frameworks in 2025

Revolutionizing AI with Advanced Retrieval-Augmented Generation Disclosure: I use GPT search. The entire article is still drafted by me. My writing style tends to be assertive and anal

Mastering CrewAI: Chapter 6 — Memory Basics | by Okan Yenigün | Jan, 2025 | Medium

Memory Types in CrewAI In CrewAI, the memory system enhances AI agents by enabling them to remember, reason, and learn from past interactions. Previous Chapter: This system comprises sev

Mastering CrewAI: Chapter 3— Knowledge

Knowledge — CrewAI In CrewAI, “knowledge” is the system that allows AI agents to access and utilize external information sources during their tasks. We can think of it as giving our agents

Quickest Way to Learn AI in 2025

Why go long when short works? This blog is here to guide you on a clear, shortcut path to getting started in the AI world, while also letting you explore the new AI products at the same time, to

OpenAI’s Swarm (Part 2): A straightforward, local-first approach with Ollama and Pydantic

A short code reference to build upon. TLDR: Combining the Ollama and Swarm frameworks presents a local-first approach to building intelligent AI agents. Ollama can run large language

DeepSeek: The Chinese Rival to ChatGPT Aiming for AI Market Leadership

DeepSeek V3, China’s bold AI model, challenges GPT-4 with 671B parameters, cost-efficient training, and innovation under U.S. sanctions. Ali Shaker - The Chinese startup DeepSeek has

Dramatically Reduce Inference Costs with DeepSeek-V3: A New Era in Open-Source LLMs

Introduction DeepSeek-V3 has emerged as the new heavyweight for open-source enthusiasts and enterprise users alike. Developed by a Chinese AI research company with a commitment to an

My LLM’s outputs got 1000% better with this simple trick.

I wish I had known this trick sooner. When I interned at Adobe Research (Bangalore) last summer, my job was to make open-source LLMs more aligned with the context. That means that no matte

DeepSeek V3: The best Open-source LLM | by Mehul Gupta | Data Science in your pocket | Dec, 2024 | Medium

Better than Claude 3.5 Sonnet, GPT-4o, Llama3.1 405B The year is about to end and just now, China’s DeepSeek has released its open-sourced model DeepSeek-v3, which has outperformed al

Image Inference through Multi-Modal LLM Models

MULTIMODAL AI | LLM | OPENAI | GEMINI | VISION This blog explores the capabilities of multi-modal models in image inference, highlighting their ability to integrate visual and text

LLM Fine-Tuning Guide: Do You Need It and How to Do It

Working with LLMs, one of the most popular questions we get is about fine-tuning. Every second client asks if they should do additional training on their model. In most cases the answer is no, t

Multi-Agent AI Architecture with Large Language Models (LLMs)

End To End Multi-Agent Implementation With LLMs A multi-agent architecture consists of multiple autonomous agents collaborating to accomplish complex tasks. With the recent advancements in

Qwen QVQ-72B: Best open-sourced Image Reasoning LLM

Visual Reasoning LLM by Alibaba So, before ending 2024, Qwen (by Alibaba) is back with a bang and has released another open-sourced LLM, Qwen QVQ-72B which is a visual reasoning LLM i.e.

Qwen2.5: Redefining Efficiency in Large Language Models

Scaling Smarter and Learning Better with Reinforcement Learning A New Era of Language Models: Introducing Qwen2.5 Building on the breakthroughs in large language models (LLMs), **Qwe

Use Gemini 2.0 to Build a Realtime Chat App with Multimodal Live API

Gemini Development Tutorial Google launched Gemini 2.0 with the preview model Gemini 2.0 Flash Experimental, and you must have learned about it from videos and articles. This model has g

Agentic Reflection for Natural Language to SQL Generation

Authors: Atul Varshneya, Toby Fotherby, Shweta Keshavanarayana Introduction Natural language to SQL (NL2SQL) conversion promises to democratize data access for non-technical users. Howeve

Beyond Text2SQL: A New Frontier in Natural Language Database Queries

Introducing TAG, a Unified Model for Complex Question Answering Bridging the Gap Between AI and Databases The Promise and the Bottleneck Artificial intelligence (AI) is rapidly

Claude Conversing With Neo4j Via MCP

Give LLMs the Power of Knowledge Graphs With Anthropic’s Model Context Protocol Last week, Anthropic published a new [Model Context Protocol](https://www.anthropic.com/news/model-context-pro

Conversational BI: Text2SQL Challenges & Solutions

The art of querying SQL Databases / Warehouses in Natural Language Introduction the future of BI is Conversational — this is what Gartner and other analysts have been telling us for th

From Posts to Reports: Leveraging LLMs for Social Media Data Mining

How to instruct LLMs to filter restaurant posts and extract critical insights for business growth. Application Overview We’re living in the golden age of automation, powered by the r

How I Built an LLM App Based on Graph-RAG System with ChromaDB and Chainlit

End-to-end app with GUI and storing new knowledge on vector database in just 3 scripts Large language models (LLMs) and knowledge graphs are valuable tools to work with natural language proce

How I Fix Prompts for Flawless AI Responses Every Time

When ChatGPT first launched, prompt templates for nearly every industry and profession flooded the internet. You’ve probably come across posts like “The Best [N] ChatGPT Prompts for [Sector/Pr

How to build Agentic RAG using CrewAI and Langchain

In the rapidly evolving landscape of AI, the ability to provide accurate, context-aware responses to user queries is a game-changer. Retrieval-Augmented Generation (RAG) has emerged as a powerf

How to Scrape and Analyse Data for Free using AI: From Collection to Insight

Learn how to combine web scraping, proxies, and AI-powered language models to automate data extraction and gain actionable insights effortlessly. While some websites are straightforward t

MarkItDown: A Powerful and must-have Toolkit for LLMs projects, tested with OpenAI and Gemini 2.0

*MarkItDown is a versatile utility developed by Microsoft that transforms various file formats into Markdown, making it an essential tool for developers working with Large Language Models (LLMs).

Multilingual Vision Captioning: A Multi-Model Multimodal Approach to Image and Video Captioning and…

Using a combination of Meta’s Llama 3.2 11B Vision Instruct, Facebook’s 600M NLLB-200, and LLaVA-Next-Video 7B models to produce multilingual image and video captions, descriptive tags, a

OpenAI Unveils O3 and O3-Mini: Transforming Language Processing

I’ve been following OpenAI’s updates closely, and I’m excited to share some breaking news: OpenAI just announced two of their newest language models, O3 and O3-mini. After watching a recent demon

The Rise of AI-Driven UX: Balancing Automation and Human-Centered Design in 2025

If there is one thing that’s altering the way we create user experience (UX) designs and conduct research in 2024, it is definitely artificial intelligence (AI). UX experts have already integrated

Build Smarter AI Agents with Long-Term, Persistent Memory and Atomic Agents

One of the most useful features in any AI assistant is undoubtedly the ability to have a persistent long-term memory. Whether the goal is to learn more about the user, adjust the AI’s b

DeepSeek-VL2: Advancing Multimodal Understanding with Mixture-of-Experts Vision-Language Models

DeepSeek-VL2 represents a significant leap forward in the field of vision-language models, offering advanced capabilities for multimodal understanding. This innovative series of large Mixture-o

Building a Research Assistant That Can Write to Google Docs (Part 2)

A tool that might help with your homework ***This article is the second of a two part series where we use LangGraph and Tavily to build a simple research agent, which writes and refines sh

Building a Research Agent That Can Write to Google Docs (Part 1)

A tool that might help with your homework ***This article is the first of a two part series where we use LangGraph and Tavily to build a simple research agent, which writes and refines sho

Qwen2-VL: A Vision Language Model That Runs Locally

This is an introduction to「Qwen2-VL」, a machine learning model that can be used with ailia SDK. You can easily use this model to create AI applications using [ailia SDK](h

How to Fine-Tune Llama-3.2 on your own data: A detailed guide

Introduction The release of Llama-3.2 collection from Meta marked an important milestone in the open-s

How to Use Structured Generation for LLM-as-a-Judge Evaluations

Structured generation is fundamental to building complex, multi-step reasoning agents in LLM evaluations — especially for open source models *Disclosure: I am a maintainer of [Opik](https

Step by Step Midjourney Prompts Generation using ChatGPT 4o’s Vision Analysis

You know that feeling when you’re trying to create the perfect Midjourney prompt, but the words just don’t capture what’s in your mind? I’ve been there too. After countless hours of experimentat

The Role of Small Models in the LLM Era

A recent study extensively explored the role of Small Language Models (SLMs) in modern AI. The research provided a comprehensive analysis of SLMs, focusing on their capabilities, applicatio

Meta’s Llama 3.3: The Evolution of Open-Source Large Language Models

Meta’s recent release of Llama 3.3 represents a milestone in the development of large language models (LLMs). It introduces improvements in scale, efficiency, and safety, while remaining open

Google Gemini-Exp-1206: The new Best LLM

Beats GPT-4.0, OpenAI-o1, Claude3.5 Sonnet and Gemini 1.5 on LMArena Google Gemini, after a lacklustre debut in the GenAI space some months back, is picking up pace quite fast. It has

Meta Llama3.3 : 70B model alternate for Llama3.1 405B

3rd model series in Llama after Llama3.1, Llama3.2 So Meta, in their final release for the year, launched Llama3.3, a 70B GenAI model that has shown some great results and has already b

OpenAI’s O1 Model: A Detailed Exploration into the Future of AI

Introduction Artificial intelligence has rapidly evolved over the last decade, leading to breakthroughs in natural language processing (NLP), machine learning, and multimodal applications. Op

Testing OpenAI’s o1 Models: A Look at Chain-of-Thought Prompting for Journalism Tasks

New LLMs are touted as smarter problem-solvers — but how do they fare in real-world journalistic tasks like data visualization and headline selection? Last month, OpenAI released two new la

Smarter and Faster: OpenAI o1 and o1 pro mode

Just 12 hours ago, OpenAI rolled out the new o1 model and o1 with pro mode. As you may already know, o1 models are the first series of models designed to think before answering, providing more det

Optimizing Workflow Efficiency with LangGraph and Agents: Key Features, Use Cases, and Integration…

In the context of Generative AI and large language models (LLMs), agents and LangGraph are tools and frameworks that enhance the functionality of LLMs by allowing them to perform tasks,

ElevenLabs Conversational AI: Build Smart, Scalable Voice Agents in Minutes

Hello, creators! 🗣️ The way we interact with technology is changing, and conversational AI is at the forefront of this revolution. ElevenLabs has introduced a cut

Anthropic’s Claude AI Can Now Copy Your Writing Style

I’ve been working as an AI ambassador for various organizations for months, and if there’s one thing most users keep asking, it’s how to make AI chatbots write like them. Essentially, they want

Mastering the 25 Types of RAG Architectures: When and How to Use Each One

Introduction Retrieval-Augmented Generation (RAG) architectures have revolutionized the way we approach information retrieval. These architectures bridge the gap between generating responses

Implement A Simple YouTube Video Summarizer Application using Cursor AI

Introduction Cursor AI is revolutionizing software development by integrating advanced artificial intelligence into a familiar coding environment. As a fork of

Alibaba QwQ: Better than OpenAI-o1 for reasoning?

32b open-sourced model beats o1 mini and competes with o1-preview A few days back, Alibaba released Marco-o1, a 7b reasoning model. Now, they have released another, improved version cal

Apple’s LLM Siri: A New Era in AI-Powered Conversations?

ARTIFICIAL INTELLIGENCE The world of artificial intelligence has been rapidly evolving in recent years, with generative AI (genAI) seemingly being added to everything. Apple’s Siri, once a

Alibaba Marco-o1 : Open-source alternative for OpenAI-o1

LLM for Open-Ended problems OpenAI-o1 was a revolutionary release, enabling detailed reasoning by LLMs for complex tasks like maths or physics problems, not limiting GenAI to just generic

Building a Powerful Local AI Web Search Assistant with Llama 3.2:

Combine Large Language Models with Real-Time Web Search for Up-to-Date Answers and Context-Aware Insights 👨🏾‍💻 GitHub ⭐️ | 👔 [LinkedIn](https://www

Complex SQL Joins with LangGraph and Waii

In the rapidly evolving landscape of data analytics, the ability to interact with data through natural language is becoming increasingly valuable. Conversational analytics aims to make complex d

How to make more reliable reports using AI — A Technical Guide

Technical guide, sharing my experience of working with AI Over the past year, I’ve been running my own AI software development and consulting agency, working with a variety of clients — fr

Introducing LightRAG: A New Era in Retrieval Augmented Generation

In the rapidly evolving field of natural language processing, Retrieval-Augmented Generation (RAG) systems have become essential for providing accurate, contextually rich responses. However, trad

Which AI Agent framework should I use? (CrewAI, Langgraph, Majestic-one and pure code)

With the advancement of large language models, it is now possible for artificial intelligence models to reason through a problem. At the start, we believed that the models would not be able to acc

ChatGPT 4 vs Claude 3.5 Sonnet: Who’s Better? Let’s Review:

I Ran Tests — ChatGPT 4 vs Claude 3 Sonnet, Who Wins? The buzz is on there’s a new chatbot player in town, the Claude 3 Sonnet. It’s been described as being better than ChatGPT, but there h

Claude 3.5 Sonnet Unleashed: The AI Revolution You Can’t Afford to Miss | by Ahmed Bahaa Eldin | Medium

The technological area known as Artificial Intelligence is a relatively young field that has been progressing exceptionally fast, and among the newest examples of its development is known as [**

Claude 3.5 vs GPT-4o: Key Differences You Need to Know

Anthropic’s latest release, Claude 3.5 Sonnet, enters a market where OpenAI’s GPT-4o has set a high benchmark, with [92% of Fortune 500](https://www.techbusinessnews.com.au/news/92-of-fortune-

How to Use Claude 3.5 Sonnet API for Creative Writing

Let’s talk about something that we all face during development: API Testing with Postman for your Development Team. Yeah, I’ve heard of it as well, [Postman is getting worse](https://www.reddit

Llama 3.2 vs GPT-4 vs OpenAI O1 vs Gemini Ultra vs Claude 3.5: Which AI Model Is Right for You?

Artificial Intelligence has come a long way, and each new model seems to push the boundaries further. Today, I want to share my thoughts on five of the leading AI models out there: Meta’s Llama

5 AI Projects You Can Build This Weekend (with Node.js)

5 Exciting AI Projects to Build in a Weekend with Node.js (Perfect for Beginners) Are you interested in building AI projects but short on time? With just Node.js and a weekend, you can dive into

Conversational AI for Customer Service: Best Practices and Key Steps for Success

In today’s fast-paced business environment, customer service plays a crucial role in building and maintaining customer loyalty. As businesses strive to offer personalized and efficient support,

Get ChatGPT to Sound More Human: Essential Tips for Creating Natural, Engaging AI Conversations

Have you ever found your AI assistant sounding a bit too… mechanical? ChatGPT, while impressively capable, sometimes leans on overly formal or generic language. But with a few adjustments, you c

Is Perplexity Pro a Smarter, More Efficient Way to Search the Web?

The Future of Search Is Perplexity Pro a Smarter, More Efficient Way to Search the Web? How does it compare to traditional search engines and is it worth the cost? Hey AI Friends and follo

Build Your Talking Voice AI Assistant Locally: Memory-Retaining Chatbot with Streamlit UI…

Step-by-Step Guide to Developing Your Own Voice AI with Context Memory and Real-Time Chat, Powered by Llama3.1 & Llama3.2 Models The concept of a voice-based personal assistant has gro

Building a Reliable Text Classification Pipeline with LLMs: A Step-by-Step Guide

Overcoming common challenges in LLM-based text classification In this step-by-step tutorial, we’ll walk through how to use large language models (LLMs) to build a text classification pi

Comparing Leading Text-to-Image Generation Models for Adding Text to Images

A comparison of nine leading image generation models’ ability to render accurate text (words and phrases) within an image. In this post, we will assess the capabilities of nine state-of-the-a

Multimodal AI for Conversational Human Motion

Written by Christian Safka and Keyu Chen In this exploration we’ll look at how multi

Qwen2.5-Coder, Cosmos Tokenizer, OpenCoder, and New SentenceTransformers: Great Times for Open…

I want to highlight some standout open-source advancements that have really caught my eye: Qwen2.5-Coder Series: An open-source code LLM that’s giving GPT-4 a run for its money. **Co

RAGate: Adaptive RAG for Conversational AI

Building Conversational AI systems is hard!!! It’s feasible but also complex, time-consuming, and resource-intensive. The challenge lies in designing systems that can understand and genera

GLM-4-Voice 9B — Real-time Multilingual Voice Conversation AI — Install Locally in Minutes

How to set up GLM-4-Voice 9B for seamless real-time voice interaction in English and Chinese, and explore its unique architecture, low-latency response, and customizable voice attributes.

Hello and welcome to the last part of the series to build a scientific ChatBot with Langchain, Streamlit, and PubMed! In the previous part, we built the data persistence and RAG pipeline with v

Read More
What’s new with Claude Sonnet 3.5 & Claude 3.5 Haiku?

First off, what is Claude? Claude is a language model created by Anthropic and it’s designed to help with tasks like answering quest

Read More
Why embedding matters when building a non-English RAG system — Multilingual embeddings

Why Embeddings are Key Embeddings are a cornerstone of modern generative AI, silently driving the functionality of many systems we interact with daily. At their simplest, embeddings are **num

Read More
Gemma vs. Llama vs. Mistral: Exploring Smaller AI Models

A Comparative Study of Small-Scale Language Models: Evaluating Gemma, Llama 3, and Mistral in Reading Comprehension Tasks Introduction Large Language Models (LLMs) have been evolving rapidl

Read More
Google Gemini: Are Big Context Windows the Killer Feature?

Google’s upcoming LLM makes a massive move Barely eight months ago, a leaked Google email revealed the company was struggling to outpace its AI rivals. Not only was there [no moat](https://www

Read More
Llama 3.2: The Next Generation of Lightweight, Instruction-Tuned Language Models: A Hands-On…

Discover LLaMA 3.2’s Key Innovations in Pruning, Knowledge Distillation, and Multilingual Performance, Plus a Hands-On Tutorial to Run Locally or Through Google Colab 👨🏾‍💻 [GitHub](https://

Read More
AI-Powered OCR with Phi-3-Vision-128K: The Future of Document Processing

In the fast-evolving world of artificial intelligence, multimodal models are setting new standards for integrating visual and textual data. One of the latest breakthroughs is the **Phi-3-Visi

Read More
Artifacts: Top Mindblowing uses of Claude 3.5 Sonnet

Anthropic recently launched its most advanced LLM, “Claude 3.5 Sonnet,” and it’s mindblowing. People on social media called this model the most advanced LLM currently available. This AI model out

Read More
LongRAG: Giving AI a Bigger Net to Catch More Fish in the Sea of Information

In my previous article, I discussed whether RAG would become obsolete due to long-context LLMs. Today, let’

Read More
Visualize your RAG Data — Evaluate your Retrieval-Augmented Generation System with Ragas

How to use UMAP dimensionality reduction for Embeddings to show multiple evaluation Questions and their relationships to source documents with Ragas, OpenAI, Langchain and ChromaDB Retrieval

Read More
How to Improve LLMs with RAG

A beginner-friendly introduction w/ Python code This article is part of a larger series on using large language mo

Read More
Unlocking Mixture-of-Experts (MoE) LLM: Your MoE model can be an embedding model for free

A mixture-of-experts (MoE) LLM can be used as an embedding model for free. I recently found an interesting paper titled “Your Mixture-of-Experts LLM is Secretly an Embedding Model for Free.”

Read More
Users Will Fall in Love With OpenAI’s New GPT-4o Model. Literally.

The company’s new GPT-4o can understand and mimic human speech and emotion In the iconic 2013 film Her, the protagonist develops an intense relationship — which morphs into a love affair

Read More
OpenAI’s ‘Leaked’ GPT2 Model Has Everyone Stunned.

On-Purpose leak? The influence that OpenAI has on the AI industry can’t be overstated. Every move or decision makes headlines automatically… even if they don’t actually announce the thin

Read More
OpenAI Confirms the Arrival of GPT-5, Poised to Bring Huge Improvements to Artificial Intelligence…

A netizen posted a GPT-5 countdown on X, saying that it was a conclusion drawn from clues from various platforms. The comment section has reached a climax with all kinds of opinions coming out

Read More
How to create an AI team to write compelling stories with CrewAI and Gemini Pro

Are you fascinated by the idea of AI generating stories that capture the imagination? If so, you’re not alone! In this article, we’ll dive into an introductory project that combines the powers of C

Read More
Retrieval-Augmented Generation: Approaches, State of the Art, and Optimization Strategies

⭐ RAG is particularly useful in knowledge-intensive scenarios or domain-specific applications that require knowledge that’s continually updating. RAG has been popularized recently with its app

Read More
Fine-tuning Llama 3 with Unsloth

In this article I will show you how to fine-tune an LLM (Llama 3 from Meta) using Unsloth. I will also provide a way to use your own custom dataset. **Note

Read More
Qwen2.5 1.5b: the future of Mobile AI?

Local Testing and Evaluation of Alibaba Cloud’s Latest LLM. With llama-cpp-python and a DIY prompt catalog. In part one we explored together the innovations from Alibaba Cloud’s team with th

Read More
Introduction to LLaVA: A Multimodal AI Model

LLaVA is an end-to-end trained large multimodal model that is designed to understand and generate content based on both visual inputs (images) and textual instructions. It combines the capabil

Read More
Google Releases Gemma — A Lightweight And Open Source Model

In just a week, the world has witnessed the most groundbreaking AI advancements from two tech giants. OpenAI introduced its jaw-dropping AI video generator, [Sora](https://readmedium.com/3d1638

Read More
Claude 3.5 Sonnet V/S GPT-4O: Which one is better

In November 2022, OpenAI launched ChatGPT, a model that has revolutionized how we search and interact with information. The following year, in March, an American startup, “Anthropic,” founded by ex-OpenAI

Read More
Alibaba’s Open-Source Qwen: How It’s Revolutionizing AI and How You Can Use It

Alibaba has recently made waves in the AI world by open-sourcing its Qwen 2.5 models during the 2024 Apsara Conference. With over 100 models, Qwen spans multiple modalities including language, vi

Read More
Generating structured data from an image with GPT vision and Langchain

In today’s world, where visual data is abundant, the ability to extract meaningful information from images is becoming increasingly valuable. Langchain, a powerful framework for building applica

Read More