
AI Agent execution within an AI Guardian

Understand how an AI Agent executes within a controlled "guardian" environment


AI agents use generative AI models (large language models, or LLMs), and all LLMs share some common core capabilities:

  • The ability to hold contextual, conversational interactions and follow user-provided instructions.

  • "Knowledge" of a broad collection of information from public sources (though usually a few months stale).

  • The ability to understand and generate content (documents, images, videos).

"Agent Environment" vs "LLM"

It is important to distinguish between AI agents (things that do work on behalf of the user) and the large language models (LLMs) that power them. The LLM used in Thunk.AI could be GPT-4o, Claude Sonnet, Google Gemini, or eventually other LLMs.

An AI agent is a directed and controlled use of an LLM to achieve a specific task. This is implemented by running the LLM under the control of an "AI agent environment". In Thunk.AI, the platform runtime is responsible for implementing and managing this environment.

The environment sets up the appropriate context and messages to send to the LLM. It then interprets the LLM's response, takes actions based on it, and repeats this iteratively until it decides the work is done. The LLM can only respond by asking the agent environment to invoke one or more "AI tools". The tools are a way for the LLM to indicate that extra information should be fetched or an action should be taken on its behalf.
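As a concrete illustration, a tool-call response might be represented by a structure like the one below. This is a hypothetical sketch in Python, not Thunk.AI's actual wire format; the tool name and fields are invented for the example.

```python
from dataclasses import dataclass

@dataclass
class ToolCall:
    """A structured request from the LLM to the agent environment.
    The LLM cannot act on its own; it can only name a tool and its
    arguments and wait for the environment to respond."""
    tool_name: str
    arguments: dict

# Example: the LLM asking for extra information. "lookup_customer"
# is an invented tool name used only for this illustration.
proposed = ToolCall(tool_name="lookup_customer",
                    arguments={"customer_id": "C-1042"})
```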

The important takeaways are:

  • The LLM never accesses information directly. Any information needed by the LLM is provided by the AI agent, either as part of setting up the AI agent environment or by responding to tool call requests.

  • The LLM never updates data directly. Any such changes are made by the AI agent environment by responding to tool call requests.
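A minimal sketch of that mediation, continuing the example above: every read and write goes through a handler owned by the agent environment, and the LLM can only name one of the registered tools. The registry and handlers here are hypothetical, not Thunk.AI's actual implementation.

```python
# Hypothetical tool registry: every read and write is performed by a
# handler owned by the agent environment, never by the LLM itself.
def lookup_customer(arguments: dict) -> dict:
    # A real environment would query an actual data source here.
    return {"customer_id": arguments["customer_id"], "status": "active"}

def update_customer(arguments: dict) -> dict:
    # A real environment would apply the change to an actual data source.
    return {"customer_id": arguments["customer_id"], "updated": True}

TOOL_REGISTRY = {
    "lookup_customer": lookup_customer,
    "update_customer": update_customer,
}

def execute_tool_call(call: ToolCall) -> dict:
    """Run a proposed tool call on the LLM's behalf. Unknown tools are
    rejected, so the LLM is limited to the capabilities the environment
    explicitly exposes."""
    handler = TOOL_REGISTRY.get(call.tool_name)
    if handler is None:
        return {"error": f"unknown tool: {call.tool_name}"}
    return handler(call.arguments)
```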

The Thunk.AI Agent Environment

An LLM can only respond to the instructions it is given. It is up to the AI agent environment to provide meaningful instructions ("Steering"), meaningful data ("Grounding"), and meaningful capabilities via AI tools. However, this is only part of the required behavior: it is just as important to handle the LLM's responses appropriately ("Validation"). Collectively, this is where the AI agentic layer of the Thunk.AI platform plays a crucial role.
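Roughly, those responsibilities might look like the sketch below: the environment assembles exactly what the LLM sees (steering and grounding) and checks the shape of what comes back (validation). The message format and function names are illustrative assumptions, not the platform's actual API.

```python
def build_messages(task_instructions: str, grounding_data: dict) -> list[dict]:
    """Steering and grounding: the environment decides exactly what
    instructions and data the LLM sees on each iteration."""
    return [
        {"role": "system", "content": task_instructions},                 # steering
        {"role": "user", "content": f"Relevant data: {grounding_data}"},  # grounding
    ]

def validate_response(response: dict) -> bool:
    """Validation: accept only a well-formed, structured tool-call
    response; anything else is rejected rather than acted upon."""
    return (isinstance(response.get("tool_name"), str)
            and isinstance(response.get("arguments"), dict))
```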

Every AI agent runs within, and is controlled and constrained by, a control environment called the AI Guardian.

The AI Guardian

Control, reliability, and security are enabled by the AI Guardian. This agent environment performs several critical functions, sketched in code after the list:

  1. It invokes the AI agent in a loop until the task is complete.

  2. It sets up the appropriate context and instructions to provide the AI agent. This context varies dynamically as the loop progresses, reflecting the AI agent's progress through the task at hand and keeping it focused on the current aspect of the task.

  3. It limits the responses of the AI agent to a specific set of structured AI tool calls.

  4. It vets and validates every proposed tool call, executes it, checks its result, and passes it back to the AI agent for the next iteration of processing.

  5. It implements several mechanisms that mitigate common flaws in LLMs (such as hallucination, inconsistency, and early termination).
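Tying the earlier sketches together, a Guardian-style loop might look roughly like the code below. The numbered comments map to the functions above; `call_llm` stands in for the underlying model invocation, the `finish` tool and iteration cap are illustrative assumptions, and none of this is Thunk.AI's actual implementation.

```python
def run_guardian_loop(task_instructions: str, grounding_data: dict,
                      call_llm, max_iterations: int = 20) -> list[dict]:
    """Drive the LLM through a vetted tool-call loop until it signals
    completion. `call_llm` is a stand-in for the real model call."""
    transcript = build_messages(task_instructions, grounding_data)
    results = []
    for _ in range(max_iterations):              # 5. bound runaway behavior
        response = call_llm(transcript)          # 1. invoke the agent
        transcript.append({"role": "assistant", "content": str(response)})
        if response.get("tool_name") == "finish":  # the LLM reports it is done
            break
        if not validate_response(response):      # 3./4. constrain and vet responses
            transcript.append({"role": "user",
                               "content": "Invalid response; call a registered tool."})
            continue
        call = ToolCall(response["tool_name"], response["arguments"])
        result = execute_tool_call(call)         # 4. execute and check the tool call
        results.append(result)
        transcript.append({"role": "user",       # 2. context evolves with progress
                           "content": f"Tool result: {result}"})
    return results
```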

The AI Guardian is the most important and novel innovation in the Thunk.AI platform. AI reliability stems from the effective use of the AI Guardian. Read more about AI reliability in Thunk.AI here.
