Projects per year
Abstract
In health sciences, high-quality text embeddings may augment qualitative data analysis of large amounts of text by enabling, e.g., searching and clustering of health information. This study aimed to evaluate three different sentence-level embedding methods in clustering sentences in nursing narratives from individual patients' hospital care episodes. Two of these embeddings are generated from language models based on the BERT framework, and the third on the Sent2Vec method. These embedding methods were used to cluster sentences from 20 patient care episodes and the results were manually evaluated. Findings suggest that the best clusters were produced by the embeddings from a BERT model fine-tuned for the proxy task of predicting subject headings for nursing text.
Original language | English |
---|---|
Title of host publication | Challenges of Trustable AI and Added-Value on Health - Proceedings of MIE 2022 |
Editors | Brigitte Seroussi, Patrick Weber, Ferdinand Dhombres, Cyril Grouin, Jan-David Liebe, Jan-David Liebe, Jan-David Liebe, Sylvia Pelayo, Andrea Pinna, Bastien Rance, Bastien Rance, Lucia Sacchi, Adrien Ugon, Adrien Ugon, Arriel Benis, Parisis Gallos |
Publisher | IOS Press |
Pages | 854-858 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-64368-284-6 |
DOIs | |
Publication status | Published - 25 May 2022 |
MoE publication type | A4 Conference publication |
Event | Medical Informatics Europe Conference - Nice, France Duration: 27 May 2022 → 30 May 2022 Conference number: 32 |
Publication series
Name | Studies in Health Technology and Informatics |
---|---|
Publisher | IOS Press |
Volume | 294 |
ISSN (Print) | 0926-9630 |
ISSN (Electronic) | 1879-8365 |
Conference
Conference | Medical Informatics Europe Conference |
---|---|
Abbreviated title | MIE |
Country/Territory | France |
City | Nice |
Period | 27/05/2022 → 30/05/2022 |
Keywords
- electronic health records
- natural language processing
- nursing documentation
- sentence embeddings
- Text clustering
Fingerprint
Dive into the research topics of 'Clustering Nursing Sentences-Comparing Three Sentence Embedding Methods'. Together they form a unique fingerprint.-
INTERVENE: International consortium for integrative genomics prediction
Kaski, S. (Principal investigator)
01/01/2021 → 31/12/2025
Project: EU H2020 Framework program
-
eMOM: CleverHealth Network: eMOM GDM -Project
Marttinen, P. (Principal investigator)
05/02/2018 → 31/01/2023
Project: Business Finland: Other research funding