Natural language models for learning assessment from unstructured data

Espasa Rosell, Jordi; Espasa Rosell, Jordi

Natural language models for learning assessment from unstructured data

To access the full text documents, please follow this link: https://hdl.handle.net/2117/445762

Author

Espasa Rosell, Jordi

Other authors

Universitat Politècnica de Catalunya. Departament de Ciències de la Computació

Sallés Rius, Anna

Publication date

2025-10-20

Abstract

This Master's Thesis optimizes large language models (LLMs) for multiple-choice question answering (MCQA) to evaluate employee performance from spoken transcripts in personalized training platforms. Current LLMs achieve only 63% accuracy in dynamic assessments due to biases, reasoning failures, and inefficiencies. We develop a systematic framework balancing precision, cost, and execution time through iterative evaluation refinement, corpus preparation, baseline selection, and phased experiments, including single-factor screening (OFAT), multi-factor interactions, and parameter-efficient fine-tuning (PEFT). Key factors assessed include model scale, in-context learning, chain-of-thought (CoT), chain-of-density (CoD), self-correction, and agentic ensembles. Contributions encompass a replicable optimization pipeline and strategies to mitigate biases like positional and literal interpretation errors. Results show improvements from 63% to 80% accuracy and enhanced F1-scores, enabling ethical, scalable AI-driven assessments for enterprise individualized learning.

Document Type

Master thesis

Language

English

Subjects and keywords

Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Aprenentatge automàtic; Deep learning (Machine learning); Questions and answers; Models de llenguatge de gran escala; Resposta a preguntes d'opció múltiple; Avaluació del rendiment d'empleats; Transcripcions orals; Plataformes de formació personalitzada; Optimització de models; Precisió en avaluacions dinàmiques; Biaixos en models d'IA; Fallades de raonament; Marc sistemàtic; Refinament iteratiu d'avaluació; Preparació de corpus; Selecció de línia base; Experiments per fases; Cribratge d'un sol factor; Large language models; Multiple-choice question answering; Employee performance evaluation; Spoken transcripts; Mersonalized training platforms; Model optimization; Accuracy in dynamic assessments; AI model biases; Reasoning failures; Systematic framework; Iterative evaluation refinement; Corpus preparation; Baseline selection; Phased experiments; One-factor-at-a-time screening; Multi-factor interactions; Parameter-efficient fine-tuning; Model scale; In-context learning; Chain-of-thought; Aprenentatge profund (Aprenentatge automàtic); Preguntes i respostes

Publisher

Universitat Politècnica de Catalunya

Recommended citation

This citation was generated automatically.

Export

DIDL MARC MARC_CCUC METS OAI_DC ORE QDC RDF

Rights

Open Access

This item appears in the following Collection(s)

Treballs acadèmics [82542]

Natural language models for learning assessment from unstructured data

Author

Other authors

Publication date

Share

Abstract

Document Type

Language

Subjects and keywords

Publisher

Recommended citation

Export

Rights

This item appears in the following Collection(s)