LLM agents: all you need to know in 2025

LLM agents have applications ranging from business agents to optimization. Their rise is a watershed moment - even in the context of Deep Learning and transformers. One of the best things about LLM agents is that people with little background in ML can also understand them.

What is an LLM agent?

An LLM agent is a system built on top of a Large Language Model which is designed to perform some specific task(s) and interact with its environment.

It can perform a specific task. For example, it can suggest travel destinations based on parameters like weather and budget or give mental health advice* based on the user’s condition.
It can interact with the environment. One of the limitations surrounding LLMs was their inaccessibility to the outer world. Not long ago, ChatGPT used to reply with something like, “Sorry, I am updated as of [date] and cannot provide up-to-date information about that.” This is no longer the case - LLM agents are able to interact with external sources like the internet, databases, APIs, or even other LLMs.

✳️

Note: AI isn’t error-free and can't replace physicians or psychiatrists. For both mental and physical health, please refer to experts.

Chain of Thoughts reasoning

LLMs are pretty good at language understanding but often struggle with basic questions. They made researchers wonder if LLMs can reason. This curiosity led to an interesting observation: LLMs can reason by breaking down tasks into sub-tasks without requiring explicit fine-tuning if prompted with a chain of thoughts. Surprisingly, just a few prompts are sufficient enough for an LLM to begin CoT reasoning.

Standard Prompting vs CoT Prompting – CoT prompting allows the LLM to reason. (Credits: CoT Paper) — Standard Prompting vs. CoT Prompting – CoT prompting allows the LLM to reason. (Credits: CoT Paper)

This prompting is quite similar to guiding students to solve a mathematical problem rather than providing only the answer keys. It's remarkable how LLMs (thanks to the vast amount of data they’re trained on) require far fewer prompts than human learners before they can reason about unseen tasks.

📌

The word reason refers to agentic reasoning only. There’s still a wide gap between human reasoning capabilities and agentic reasoning.

An example of CoT: taking steps to solve the given quadratic equation (Credits: IBM)

ReAct

The fewer prompts required for CoT reasoning are cost-effective, but they may cause hallucinations. To make CoT work as an agent, we need the environment interaction. Combining these reasoning and acting capabilities has led to an advanced type of agent called ReAct (Reasoning and Acting).

ReAct works similarly to how we have those “inner thoughts” while performing some task. “Now I’ve covered ReAct a bit; I must move to their benefits part now. Oh wait, I’m yet to conclude this section [starts writing the conclusion]” is an example of the way I just reasoned and acted here.

Types of LLM agents

LLM agents are a broad and diverse species that can be further categorized into a few types. Here, we're categorizing them based on their functionality.

Agentic AI

While LLM agents are supposed to be truly agentic (fully autonomous), many of them require further prompting or user feedback (like asking the user to pick the most suitable recipe from the suggestions). Agentic AIs, on the other hand, are meant to execute tasks with minimal (ideally no) human input. These agents involve iterative reasoning and external validation. There's also an increase in techniques not requiring any external validation like Universal Self-Consistency.

Agents like O1, Gemini, DeepSeek, and Claude are capable of handling diverse data types. By combining different AI models (trained on images, audio, text, etc), they provide a unified interface for working with different types of data.

Personal assistants

Gemini, Siri, and Alexa are off-the-top-of-the-head examples of personal assistants. But there are many more dedicated PAs too, like Notion AI, WHOOP AI Coach, Cleo AI, etc. In fact, with a little customization, you can build your own personal assistant.

💡

One of the main questions you should consider before using these agents for personal tasks is data privacy.

Embodied agents

One of the most exciting applications is embodied agents. LLMs are empowering robots, autonomous-driving cars, smart home systems, drones, and some virtual assistants. Examples include Ameca, Tesla Optimus, and Wayve.

Capabilities of LLM agents

The increasing interest in these agents isn’t overhyped. Here are a few definite reasons for this.

Automation: Every agent is meant to automate tasks. Be it the classical Royal Mail automated delivery system or Apify Actors, they all are aimed at minimizing human intervention and automating tasks, which is both time and cost-effective. This is where LLM agents overlap with general AI agents, as both are capable of automating the desired tasks (with LLM agents having more capabilities).
User interaction: Chatbots were among the first LLM agents. Beyond them, LLM agents are also capable of interactions, like voice assistants, customer support, or digital tutors.
Integration: Not long ago, I was wondering if the LLM agent I'm designing would be able to integrate with the existing code. But they surpassed my expectations: LLM agents are capable of integrating with APIs, existing code (I've tested it on Python), databases, and even other LLMs.
Personalization: By integrating with the database, LLM agents can maintain conversation history and provide contextual responses. This is in contrast to the normal or even rule-based AI agents that either don’t maintain memory or have limited memory.
Natural language understanding: Often, our applications don’t require a hardcore AI task but just an interpretation of the user’s query. This interpreted query can be used in some functions for the ultimate application. A good example is Singapore Air’s Flight Recommender system.
Optimization: LLM agents’ capabilities seem to include their use as optimizers in complex problems like the Travelling Salesman Problem (TSP).

💡

Learn how to build, publish, and monetize AI agents on the Apify platform in this complete step-by-step guide

LLM agent frameworks

The rise of LLM agents also means a rise in agent frameworks - both for development and deployment.

Development

LangChain/LangGraph: LangChain and LangGraph are the go-to frameworks for LLM application development. Their vast coverage of functions and the ability to integrate with several other libraries make them the first choice for LLM agent development.

LlamaIndex: LlamaIndex can be the preferred option for RAG-based agents. It also provides the option to host these LLM agents locally, which can be interesting from the data privacy aspect.

Haystack: Originally launched as a normal NLP tasks library, Haystack has quickly adapted to LLM agents. It provides several integrations for different vector stores and models.

💡

LangChain, LlamaIndex, Haystack, and Hugging Face have respective Apify integrations

AutoGen: AutoGen, developed by Microsoft, uses a layered approach that lets developers use both high-level and low-level APIs. In addition to other benefits like message-passing and external API integration framework, it also supports .Net (yet to launch as of Feb 2025).

SmolAgent: SmolAgent is a simple framework by Hugging Face. The huge range of transformers on Hugging Face means one can use any of those models for developing LLM agents. The real power of SmolAgents lies in the concept of CodeAct, where we provide code snippets instead of Text (or even JSON) for the agent’s reasoning.

Traditional agents vs CodeAct (Taken from the CodeAct paper [ICML 2025])

Deployment

Anthropic Claude API: Despite the presence of tech giants like Google, Microsoft, and OpenAI, Anthropic Claude API is still gaining popularity as an LLM agent deployment framework. It (at least theoretically) centers around ethical AI assistants.

Relevance AI: Relevance AI is focused towards “no-coding” agents. These agents can be developed with little (or no) coding and can integrate with some APIs.

Microsoft Copilot Studio: Copilot Studio is especially useful for office productivity. It provides a range of OpenAI models and integrates seamlessly with the Microsoft 365 apps.

LLM agent challenges

Being a rapidly growing area means LLM agents have room for further exploration and face some challenges as well.

Hallucinations

Hallucinations are always a problem with LLM agents. Despite the facility of interaction with the internet, LLM agents often make up their own facts. Fine-tuning them is quite costly but Retrieval Augmented Generation (RAG) can be an effective (both in cost and efficiency) solution. RAG is a huge application where we can use web scraping efficiently if it's combined with some expert feedback/guidance.

Privacy

ChatGPT maintains a history of our conversations (which is true for most agents). While it's helpful in maintaining context, it comes at a cost, as it can expose our sensitive information. Sensitive information goes beyond passwords and credit card details. It may include metadata (like location and search trends) and semi-sensitive information related to business or family.

Jailbreaks

Jailbreaking LLMs has proven to be easier than it seemed at first. With a little tweak in the prompt, one can exploit LLMs for negative purposes. Becuase these agents are available to anyone from school kids to outlaws (mainly via mobile apps), these jailbreaks can share some unwanted information.

An example of a Jailbreak prompt: with a little tweak we can exploit LLMs (Credits: “Do Anything Now” paper)

These challenges, especially jailbreaks, remind me of the Radar problem we studied in childhood. As radars kept advancing, so did stealth techniques, leading to advancements in both industries. So, these jailbreaks may sound daunting, but they (their identifications) will help enhance LLM agents’ capabilities even more in the long term.

Conclusion

LLM agents remind me of a friend from my university days who used to play down the need for extensive programming, saying, “Don’t ye worry! Agent systems will soon take over.” The joke was on us, as time has proven how right he was!

The way LLM agents have changed the way we approach not only software applications but also the whole AI/Deep Learning canvas is unbelievable (and the future is even more exciting).

LLM agents are making it possible to automate a range of tasks smartly with minimal setup. I say "smartly" rather than "intelligently," as hallucinations and some other challenges still need to be solved, and they remain one of the researchers’ focus points, too.

More about AI agents

How to build an AI agent - A complete step-by-step guide to creating, publishing, and monetizing AI agents on the Apify platform.
AI agent workflow - building an agent to query Apify datasets - Learn how to extract insights from datasets using simple natural language queries without deep SQL knowledge or external data exports.
AI agent orchestration with OpenAI Agents SDK - Learn to build an effective multiagent system with AI agent orchestration.
11 AI agent use cases (on Apify) - 10 practical applications for AI agents, plus one meta-use case that hints at the future of agentic systems.
The state of MCP - The latest developments in Model Context Protocol and solutions to key industry challenges.
How to use MCP with Apify Actors - Learn how to expose over 4,500 Apify Actors to AI agents like Claude and LangGraph, and configure MCP clients and servers.
10 best AI agent frameworks - 5 paid platforms and 5 open-source options for building AI agents.
What are AI agents? - The Apify platform is turning the potential of AI agents into practical solutions.
AI agent architecture in 1,000 words - A comprehensive overview of AI agents' core components and architectural types.
5 open-source AI agents on Apify that save you time - These AI agents are practical tools you can test and use today, or build on if you're creating your own automation.
7 real-world AI agent examples in 2025 you need to know - From goal-based assistants to learning-driven systems, these agents are powering everything from self-driving cars to advanced web automation.
7 types of AI agents you should know about - What defines an AI agent? We go through the agent spectrum, from simple reflexive systems to adaptive multi-agent networks.