Agentic AI represents the next step in the development of large language models. But the reality is more complex than the promise:
- Just how robust are LLMs' planning capabilities, really?
- Where are cost-effective models sufficient, and where is flagship capacity needed?
- How do you design robust systems when their behavior differs fundamentally from that of traditional, deterministic software?
The productive use of AI agents requires a thorough understanding of their capabilities and limitations.
We develop AI agents that can independently plan and execute multi-step tasks while always remaining within clearly defined parameters. Through the interaction of specialized tools and data sources, they handle collaborative workflows along predefined processes, independently selecting the appropriate methods to achieve their goals. By connecting to internal knowledge bases and existing applications, they operate in a context-aware manner, precisely tailored to the respective application domain.
We investigate how the often abstractly formulated requirements from regulatory frameworks, such as the EU AI Act, as well as organizational guidelines, can be translated into technically verifiable architectural principles. Under the term “Agent Policies,” we define which decisions an agent is permitted to make, how it must justify its actions, and where binding guidelines apply. The goal is not to treat transparency, explainability, and governance as afterthoughts, but to embed them as structuring principles within the software—documentable, verifiable, and traceable throughout the entire lifecycle.
This allows even complex workflows to be reliably automated without deviating from the technical and organizational framework.