Ladder of intentions: unifying agent architectures for explainability and transferability

Within the field of Autonomous Agents, the predominant paradigm is that agents perceive, reflect, reason, and act on an environment, employing some specific decision mechanism to pick actions. Nonetheless, the process that originates the decisions may differ depending on the agent, as this paradigm is agnostic about its concrete action selection inference. However, the need for being able to explain these decisions is constantly increasing, and the heterogeneity of the internal processes of agents has resulted in different ad hoc techniques for each architecture, for providing explanations with disparate validation mechanisms, hindering efforts at comparing mechanisms. To tackle this, in this contribution, we propose a unifying architecture framework based on causality, beliefs, and intentions. This framework allows for the examination of heterogeneous agents (from BDI and RL to LLM-based agents) without modification. This approach clearly decouples declarative and procedural knowledge, as well as designer-given versus learnt representations. It categorises what kind of questions can be answered by each agent reasoning component and allows a more seamless workflow for transferring knowledge between diverse agent architectures.

This work has been partially supported by the HUMANE (Grant agreement ID: 952026) and V. Gimenez-Abalos’ fellowship within the “Generación D” initiative, Red.es, MTDFP, for talent atraction (C005/24-ED CV1). Funded by the European Union NextGenerationEU funds, through PRTR.

Peer Reviewed

Postprint (author's final draft)

Document Type

Conference lecture

Language

English

Subjects and keywords

Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Agents intel·ligents; XAI; Intentions; Agent explainability; Knowledge representation; Knowledge transfer; Cognitive architecture; Telic explanations; Explainable agency; RL; BDI; Agentic AI

Publisher

Springer

Related items

https://link.springer.com/chapter/10.1007/978-3-032-01399-6_8

Recommended citation

This citation was generated automatically.

Export

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Rights

Restricted access - publisher's policy

This item appears in the following Collection(s)

E-prints [73124]

Ladder of intentions: unifying agent architectures for explainability and transferability

Author

Other authors

Publication date

Share

Abstract