Abstract
This study extends the exploration of ontology enrichment by evaluating the performance of various open-sourced Large Language Models (LLMs) on the task of predicting hierarchical relationships (IS-A) in medical ontologies including SNOMED CT Clinical Finding and Procedure hierarchies and the human Disease Ontology. With the previous finetuned BERT models for hierarchical relationship prediction as the baseline, we assessed eight open-source generative LLMs for the same task. We observed only three models, without finetuning, demonstrated comparable or superior performance compared to the baseline BERT -based models. The best performance model OpenChat achieved a macro average F1 score of 0.96 (0.95) on SNOMED CT Clinical Finding (Procedure) hierarchy, an increase over 7% from the baseline 0.89 (0.85). On human Disease Ontology, OpenChat excels with an F1 score of 0.91, outperforming the second-best performance model Vicuna (0.84). Notably, some LLMs prove unsuitable for hierarchical relationship prediction tasks or appliable for concept placement of medical ontologies. We also explored various prompt templates and ensemble techniques to uncover potential confounding factors in applying LLMs for IS-A relation predictions for medical ontologies.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 248-256 |
| Number of pages | 9 |
| ISBN (Electronic) | 9798350383737 |
| DOIs | |
| State | Published - 2024 |
| Event | 12th IEEE International Conference on Healthcare Informatics, ICHI 2024 - Orlando, United States Duration: 3 Jun 2024 → 6 Jun 2024 |
Publication series
| Name | Proceedings - 2024 IEEE 12th International Conference on Healthcare Informatics, ICHI 2024 |
|---|
Conference
| Conference | 12th IEEE International Conference on Healthcare Informatics, ICHI 2024 |
|---|---|
| Country/Territory | United States |
| City | Orlando |
| Period | 3/06/24 → 6/06/24 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Hieratical Relation Prediction
- Large Language Models
- Medical Ontology
- Prompts Design
- SNOMED CT
Fingerprint
Dive into the research topics of 'Using Generative Large Language Models for Hierarchical Relationship Prediction in Medical Ontologies'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver