Articles tagged: LLMs

44 articles

Local models

Mistral's Latest Updates: New Local Models and Open-Source Advances

Mistral AI has released new local models with improved efficiency and performance. These updates include enhanced reasoning capabilities a...

Jul 5, 20266 min
AI tools

Time-Series LLMs, Explained with t0-alpha

Time-series LLMs like t0-alpha leverage transformer architectures to analyze sequential data. This article explains how t0-alpha handles f...

Jul 5, 20266 min
AI agents

AI Agents Explained: What Is a ReAct Loop and How Does It Work?

The ReAct loop combines reasoning and acting to enable AI agents to solve complex tasks iteratively. By alternating between thought, actio...

Jul 4, 20268 min
AI agents

Introducing OpenWiki: An Open Source Agent for Automated Repository Documentation

OpenWiki is a new open source AI agent that automatically generates, updates, and maintains documentation for code repositories. It integr...

Jul 3, 20267 min
AI research

ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration

ScarfBench introduces a standardized benchmark to evaluate AI agents on migrating enterprise Java frameworks. It tests code refactoring, d...

Jul 2, 20268 min
AI tools

Prompt Engineering Fails Quietly

Prompt regression causes AI outputs to degrade over time without warning. Learn why it happens, how to detect it, and practical strategies...

Jul 1, 20268 min
AI tools

We Built a Routing Layer to Cut Our AI Costs. It Broke the Product.

A team built a custom AI routing layer to reduce API costs, but it introduced latency, errors, and unpredictable behavior that degraded th...

Jun 30, 20267 min
AI research

The Next Frontier in AI Research: Beyond Large Language Models

Explore groundbreaking AI research trends beyond LLMs, including neuro-symbolic reasoning, self-supervised learning, and energy-efficient...

Jun 29, 20269 min
AI agents

Tail Control: The Counterintuitive Engineering of Reliable Agentic Workflows

Reliable AI agents often fail due to over-engineering the 'head' (reasoning). Tail control flips this: by constraining the agent's actions...

Jun 29, 20268 min
AI research

The Frontier of Artificial Intelligence: Current Research and Future Directions

AI research explores cutting-edge topics like deep learning, reinforcement learning, and AI safety. This article examines key breakthrough...

Jun 28, 20268 min
AI research

The Evolution of Artificial Intelligence: From Narrow Tasks to General Intelligence

AI research is advancing from narrow, task-specific algorithms toward general intelligence. This article explores key breakthroughs, inclu...

Jun 27, 20268 min
AI agents

Prompt Caching with Deep Agents

Prompt caching reduces latency and cost in AI agents by storing and reusing processed prompts. This technique enables faster multi-step re...

Jun 27, 20267 min
Local models

Mistral OCR 4: Redefining Document Understanding on Local Hardware

Mistral OCR 4 brings powerful, privacy-first document OCR to local models. This article explores its architecture, performance on consumer...

Jun 27, 20267 min
AI research

The Next Frontier in AI Research: Beyond Generative Models

AI research is shifting from scaling generative models to building efficient, reasoning-driven systems. New paradigms like neuro-symbolic...

Jun 26, 20268 min
AI agents

From Local LLM to Tool-Using Agent

Learn how to transform a local large language model into a powerful agent by integrating external tools like web search, APIs, and code ex...

Jun 26, 20268 min
AI agents

Vector RAG Isn’t Enough

Standard vector retrieval fails multi-agent systems. Discover how adding a context graph layer enables agents to share structured memory,...

Jun 26, 20268 min
Local models

Run a vLLM Server on HF Jobs in One Command

Learn how to launch a vLLM inference server on Hugging Face Jobs with a single command. This guide covers setup, configuration, and practi...

Jun 26, 20266 min
AI tools

An LLM as Arbiter in RAG Retrieval: Picking the Right Candidate with Reasons

Explore how to use an LLM as an intelligent arbiter to select the best document from RAG retrieval candidates, enhancing accuracy with con...

Jun 26, 20268 min
AI agents

3 Agents. 3 LLMs. 1 Aging GPU: Engineering Parallel Inference on Bare Metal

Learn how to run three AI agents with separate LLMs simultaneously on a single outdated GPU. This article covers bare-metal parallel infer...

Jun 25, 20267 min
AI agents

Build Real Agentic Apps Using CUGA: Two Dozen Working Examples on a Lightweight Harness

Discover how CUGA, a lightweight harness, powers two dozen practical agentic applications. Learn to build autonomous AI agents with code e...

Jun 23, 20269 min
Local models

We Got Local Models to Triage the OpenClaw Repo for FREE!*

Discover how we used local AI models to automate issue triage on the OpenClaw repository at zero cost, enhancing efficiency and reducing m...

Jun 23, 20266 min
AI research

The Frontier of Artificial Intelligence: Unraveling the Mysteries of AI Research

AI research is advancing rapidly, exploring machine learning, neural networks, and ethics. This article delves into current breakthroughs,...

Jun 22, 20267 min
AI research

The Frontier of Artificial Intelligence: Current Trends in AI Research

AI research is rapidly evolving, from large language models and multimodal systems to breakthroughs in reasoning and safety. This article...

Jun 21, 20267 min
AI agents

Tool Calling, Explained: How AI Agents Decide What to Do Next

Discover how AI agents use tool calling to decide their next action. This article breaks down the decision-making process, from function s...

Jun 21, 20269 min
AI research

The Evolution of Artificial Intelligence: From Theory to Transformative Research

Explore how AI research has evolved from foundational theories to cutting-edge breakthroughs, including deep learning and reinforcement le...

Jun 20, 20267 min
Guides

Testing Mythos and Fable: Moving Beyond SWE-bench with Nvidia’s Open Contender

Explore how Nvidia’s new open-source framework challenges SWE-bench dominance. Learn to test AI models with Mythos and Fable for real-worl...

Jun 20, 20268 min
Guides

Structured Outputs with LLMs: JSON Mode, Function Calling, and When to Use Each

Learn how to get structured data from large language models using JSON mode and function calling. This guide compares both approaches with...

Jun 19, 20268 min
AI research

MosaicLeaks: Can your research agent keep a secret?

MosaicLeaks reveals how AI research agents can inadvertently reconstruct sensitive information from fragmented data. This article explores...

Jun 19, 20268 min
AI research

The Frontiers of Artificial Intelligence: Current Trends in AI Research

AI research is rapidly evolving, focusing on areas like generative models, reinforcement learning, and ethical frameworks. These advances...

Jun 18, 20268 min
Guides

LLMs Inside the Product: A Practical Field Guide

A hands-on guide to integrating large language models into products, covering architecture patterns, prompt engineering, cost optimization...

Jun 17, 20266 min
AI agents

From the Hugging Face Hub to Robot Hardware with Strands Agents and LeRobot

Discover how Strands Agents and LeRobot bridge the gap between AI models on Hugging Face Hub and real-world robot hardware, enabling seaml...

Jun 17, 20267 min
AI research

The Next Frontier: Exploring the Frontiers of Artificial Intelligence Research

AI research is rapidly pushing boundaries, from generative models to reasoning systems. This article explores key breakthroughs, including...

Jun 16, 20269 min
AI agents

LLM Fallbacks Break Agent Pipelines

When LLM calls fail, agent pipelines silently break. This article introduces a recovery layer that catches fallbacks, retries intelligentl...

Jun 16, 20268 min
Local models

Run a Local LLM with OpenClaw on Your Mac Mini

Learn how to install and run OpenClaw on a Mac Mini for private, offline AI inference. Step-by-step guide covers setup, model loading, and...

Jun 16, 20266 min
AI coding

How We Made Coding Agent Spend Predictable

Discover how we engineered a cost-predictable coding agent by combining token budgets, early stopping, and adaptive context management. Le...

Jun 16, 20266 min
Guides

DeepSeek Sharpens Its Reasoning: DeepSeek-R1, an Affordable Rival to OpenAI’s o1

DeepSeek-R1 brings advanced reasoning capabilities at a fraction of the cost of OpenAI’s o1. Learn how this open-source model matches o1 i...

Jun 15, 20266 min
Guides

Vision LLMs are PDF Parsers Too: Reading Charts and Diagrams for RAG

Learn how vision LLMs extract data from charts and diagrams in PDFs for RAG pipelines. This guide covers practical examples using multimod...

Jun 14, 20268 min
AI agents

GPU Time-Slicing for Concurrent LLM Agents on Kubernetes

Learn how GPU time-slicing enables concurrent LLM agents on Kubernetes, maximizing GPU utilization and reducing costs. This article covers...

Jun 14, 20266 min
AI research

The Next Frontier in AI Research: From Deep Learning to Autonomous Reasoning

AI research is shifting from scaling deep learning models to developing systems capable of autonomous reasoning and causal inference. This...

Jun 13, 202610 min
AI research

The Frontier of AI Research: From Deep Learning to Autonomous Reasoning

AI research is advancing beyond deep learning into areas like autonomous reasoning, causal inference, and neuro-symbolic systems. These de...

Jun 13, 20268 min
AI research

olmo-eval: An evaluation workbench for the model development loop

olmo-eval is an evaluation workbench designed to integrate seamlessly into the model development loop, enabling rapid iteration and system...

Jun 12, 20267 min
AI tools

Increase Recommendation Systems’ Precision with LLMs, Using Python

A clear and practical article about artificial intelligence for a professional audience.

Jun 9, 20264 min
AI agents

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

A clear and practical article about artificial intelligence for a professional audience.

Jun 7, 20264 min
AI tools

Automate Writing Your LLM Prompts

A clear and practical article about artificial intelligence for a professional audience.

Jun 6, 20264 min