Universitat Politècnica de Catalunya. Doctorat en Intel·ligència Artificial
Universitat Politècnica de Catalunya. Departament de Ciències de la Computació
Universitat Politècnica de Catalunya. IDEAI-UPC - Intelligent Data sciEnce and Artificial Intelligence Research Group
2025
Within the field of Autonomous Agents, the predominant paradigm is that agents perceive, reflect, reason, and act on an environment, employing some specific decision mechanism to pick actions. Nonetheless, the process that originates the decisions may differ depending on the agent, as this paradigm is agnostic about its concrete action selection inference. However, the need for being able to explain these decisions is constantly increasing, and the heterogeneity of the internal processes of agents has resulted in different ad hoc techniques for each architecture, for providing explanations with disparate validation mechanisms, hindering efforts at comparing mechanisms. To tackle this, in this contribution, we propose a unifying architecture framework based on causality, beliefs, and intentions. This framework allows for the examination of heterogeneous agents (from BDI and RL to LLM-based agents) without modification. This approach clearly decouples declarative and procedural knowledge, as well as designer-given versus learnt representations. It categorises what kind of questions can be answered by each agent reasoning component and allows a more seamless workflow for transferring knowledge between diverse agent architectures.
This work has been partially supported by the HUMANE (Grant agreement ID: 952026) and V. Gimenez-Abalos’ fellowship within the “Generación D” initiative, Red.es, MTDFP, for talent atraction (C005/24-ED CV1). Funded by the European Union NextGenerationEU funds, through PRTR.
Peer Reviewed
Postprint (author's final draft)
Conference lecture
English
Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Agents intel·ligents; XAI; Intentions; Agent explainability; Knowledge representation; Knowledge transfer; Cognitive architecture; Telic explanations; Explainable agency; RL; BDI; Agentic AI
Springer
https://link.springer.com/chapter/10.1007/978-3-032-01399-6_8
Restricted access - publisher's policy
E-prints [73124]