Ethics

Can AI Really Think? DeepSeek’s R1 Says “Yes” (and Shows You How)

Can AI Really Think? DeepSeek’s R1 Says “Yes” (and Shows You How)

Rifx.Online
Artificial Intelligence , Ethics , Machine Learning
20 Jan, 2025

Imagine a computer that doesn’t just crunch numbers and follow instructions, but actually thinks things through, step-by-step, just like you do. That’s the exciting promise of “reasoning mod

Building Human-Facing Agentic Systems: The Psychology and Sociology of Super Intelligence

Building Human-Facing Agentic Systems: The Psychology and Sociology of Super Intelligence

Rifx.Online
Ethics , Natural Language Processing , Autonomous Systems
19 Jan, 2025

Soundcloud Podcast Executive Summary“Power is in tearin

AI to Code Like Engineers by 2025, Predicts Zuckerberg

AI to Code Like Engineers by 2025, Predicts Zuckerberg

Rifx.Online
Programming , Natural Language Processing , Ethics
14 Jan, 2025

In an era where technology evolves at breakneck speed, Mark Zuckerberg, the visionary behind Meta, has made a bold prediction: by 2025, artificial intelligence will code like mid-level engineer

70% OFF

Qwen QwQ-32B-Preview

Introduction QwQ-32B-Preview is an experimental research model developed by the Qwen Team, focused on advancing AI reasoning capabilities. As a preview release, it demonstrates promising ana ...

Qwen 32K context $0.12/M input tokens $0.18/M output tokens

The Rapid Rise of ‘o3’: A New Turning Point in the AGI Debate

The Rapid Rise of ‘o3’: A New Turning Point in the AGI Debate

This week, the AI community has been abuzz with discussions surrounding a new frontier: OpenAI’s “o3,” a breakthrough model that has catapulted the conversation around Artificial General Int

OpenAI’s O1 Model: A Detailed Exploration into the Future of AI

OpenAI’s O1 Model: A Detailed Exploration into the Future of AI

Rifx.Online
Natural Language Processing , Machine Learning , Technology/Web
12 Dec, 2024

IntroductionArtificial intelligence has rapidly evolved over the last decade, leading to breakthroughs in natural language processing (NLP), machine learning, and multimodal applications. Op

baichuan4

Baichuan4 Model Introduction Baichuan4 is a state-of-the-art artificial intelligence language model designed to enhance natural language understanding and generation capabilities. Built on cutti ...

Baichuan 31.25K context $14.3/M input tokens $14.3/M output tokens

Meta: LlamaGuard 2 8B

This safeguard model has 8B parameters and is based on the Llama 3 family. Just like is predecessor, LlamaGuard 1, it can do both prompt and respons ...

Meta Llama 8K context $0.18/M input tokens $0.18/M output tokens

Llama 3 Lumimaid 70B

The NeverSleep team is back, with a Llama 3 70B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necess ...

Meta Llama 8K context $3.375/M input tokens $4.5/M output tokens

Cohere: Command R+

command-r-plus-08-2024 is an update of the Command R+ with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keepin ...

Cohere 125K context $2.85/M input tokens $14.25/M output tokens

Cohere: Command

Command is an instruction-following conversational model that performs language tasks with high quality, more reliably and with a longer context than our base generative models. Use of this model is ...

Cohere 4K context $0.95/M input tokens $1.9/M output tokens

Qwen: QwQ 32B Preview

QwQ-32B-Preview is an experimental research model focused on AI reasoning capabilities developed by the Qwen Team. As a preview release, it demonstrates promising analytical abilities while having se ...

Qwen 32K context $0.15/M input tokens $0.6/M output tokens

Meta: Llama 3.1 405B (base)

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This is the base 405B pre-trained version. It has demonstrated strong performance compared to leading closed-sour ...

Meta Llama 128K context $2/M input tokens $2/M output tokens

FREE

Meta: Llama 3.2 11B Vision Instruct (free)

Text image 2 text

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answ ...

Meta Llama 128K context $0 input tokens $0 output tokens $0.079/K image tokens

Meta: Llama 3.2 11B Vision Instruct

Text image 2 text

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answ ...

Meta Llama 128K context $0.055/M input tokens $0.055/M output tokens $0.079/K image tokens

Lumimaid v0.2 8B

Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged. Usage of this model ...

Meta Llama 128K context $0.188/M input tokens $1.125/M output tokens

The Rise of the AI Agent Product Manager and AI Agent Engineer

The Rise of the AI Agent Product Manager and AI Agent Engineer

Imagine a future where Generative AI doesn’t just respond to queries but proactively solves complex problems across every facet of business. This isn’t science fiction; it’s the rapidly approach

OpenAI GPT-5: Ph.D.-Level Intelligence Expected by 2025

OpenAI GPT-5: Ph.D.-Level Intelligence Expected by 2025

Rifx.Online
Machine Learning , Ethics , Data Science
01 Nov, 2024

After months of speculation, OpenAI has finally unveiled details about the highly anticipated GPT-5. Initially expected in 2024, its release has been postponed to late 2025 or early 2026. Mir

Lumimaid v0.2 70B

Lumimaid v0.2 70B is a finetune of Llama 3.1 70B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged. Us ...

Neversleep 128K context $3.375/M input tokens $4.5/M output tokens

Ministral 8B

Ministral 8B is an 8B parameter model featuring a unique interleaved sliding-window attention pattern for faster, memory-efficient inference. Designed for edge use cases, it supports up ...

Mistralai 125K context $0.1/M input tokens $0.1/M output tokens

Nvidia: Llama 3.1 Nemotron 70B Instruct

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging Llama 3.1 70B architect ...

Nvidia 128K context $0.35/M input tokens $0.4/M output tokens

Lumimaid v0.2 8B

Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged. Usage ...

Neversleep 128K context $0.188/M input tokens $1.125/M output tokens

Cohere: Command R+ (08-2024)

command-r-plus-08-2024 is an update of the Command R+ with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version ...

Cohere 125K context $2.375/M input tokens $9.5/M output tokens

Meta: Llama 3.1 70B Instruct

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrate ...

Meta llama 128K context $0.3/M input tokens $0.3/M output tokens

Meta: Llama 3.1 8B Instruct

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compar ...

Meta llama 128K context $0.055/M input tokens $0.055/M output tokens

Qwen 2 7B Instruct

Qwen2 7B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV ...

Qwen 32K context $0.054/M input tokens $0.054/M output tokens

Qwen 2 7B Instruct (free)

Qwen2 7B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV ...

Rifx.Online 8K context $0 input tokens $0 output tokens

Dolphin 2.9.2 Mixtral 8x22B 🐬

Dolphin 2.9 is designed for instruction following, conversational, and coding. This model is a finetune of Mixtral 8x22B Instruct. It features a 64k ...

Cognitivecomputations 64K context $0.9/M input tokens $0.9/M output tokens

Dolphin 2.6 Mixtral 8x7B 🐬

This is a 16k context fine-tune of Mixtral-8x7b. It excels in coding tasks due to extensive training with coding data and is known for its obedience, although ...

Cognitivecomputations 32K context $0.5/M input tokens $0.5/M output tokens