Prompt engineering for medical foundational models

Bayarri Planas, Jordi

Prompt engineering for medical foundational models

dc.contributor

Universitat Politècnica de Catalunya. Departament de Ciències de la Computació

dc.contributor

Garcia Gasulla, Dario

dc.contributor.author

Bayarri Planas, Jordi

dc.date.issued

2024-06-27

dc.identifier

https://hdl.handle.net/2117/413969

dc.identifier

189860

dc.description.abstract

Foundational models have rapidly emerged in recent years, demonstrating remarkable capabilities across a wide array of tasks, predominantly in natural language processing. Significant efforts have been dedicated to this field, resulting in the frequent release of new, increasingly sophisticated models. This thesis explores the efficacy of advanced prompting strategies applied to these foundational models within the medical question-answering domain, focusing on the potential of open-source models enhanced through sophisticated prompt engineering. An efficient and functional evaluation framework, named "prompt\_engine", has been developed to study the potential of two prompting strategies: Self-Consistency Chain of Thought and a Medprompt-based technique. Through this framework, a comprehensive range of experiments was conducted, leading to significant performance enhancements through the strategic combination and optimization of these prompting techniques. Key findings reveal that the performance of open-source models can be significantly enhanced, allowing them to outperform current state-of-the-art private models in existing medical question-answering benchmarks.

dc.format

application/pdf

dc.format

application/pdf

dc.language

eng

dc.publisher

Universitat Politècnica de Catalunya

dc.rights

Open Access

dc.subject

Àrees temàtiques de la UPC::Informàtica::Llenguatges de programació

dc.subject

Natural language processing (Computer science)

dc.subject

Models fundacionals

dc.subject

models de llenguatge

dc.subject

engineyria d'instruccions

dc.subject

avaluació mèdica

dc.subject

Foundational models

dc.subject

large language models

dc.subject

prompt engineering

dc.subject

medical benchmarks

dc.subject

Tractament del llenguatge natural (Informàtica)

dc.title

Prompt engineering for medical foundational models

dc.type

Master thesis

Fitxers en aquest element

Fitxers	Grandària	Format	Visualització
No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)

Treballs acadèmics [82549]