Abstract
Human coders assign standardized medical codes to clinical documents generated during patients’ hospitalization, which is error prone and labor intensive. Automated medical coding approaches have been developed using machine learning methods, such as deep neural networks. Nevertheless, automated medical coding is still challenging because of complex code association, noise in lengthy documents, and the imbalanced class problem. We propose a novel neural network, called the Multitask Balanced and Recalibrated Neural Network, to solve these issues. Significantly, the multitask learning scheme shares the relationship knowledge between different coding branches to capture code association. A recalibrated aggregation module is developed by cascading convolutional blocks to extract high-level semantic features that mitigate the impact of noise in documents. Also, the cascaded structure of the recalibrated module can benefit learning from lengthy notes. To solve the imbalanced class problem, we deploy focal loss to redistribute the attention on low- and high-frequency medical codes. Experimental results show that our proposed model outperforms competitive baselines on a real-world clinical dataset called the Medical Information Mart for Intensive Care (MIMIC-III).
| Original language | English |
|---|---|
| Article number | 17 |
| Pages (from-to) | 1–20 |
| Number of pages | 20 |
| Journal | ACM Transactions on Intelligent Systems and Technology |
| Volume | 14 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 9 Nov 2022 |
| MoE publication type | A1 Journal article-refereed |
Fingerprint
Dive into the research topics of 'Multitask Balanced and Recalibrated Network for Medical Code Prediction'. Together they form a unique fingerprint.Projects
- 2 Finished
-
INTERVENE: International consortium for integrative genomics prediction
Kaski, S. (Principal investigator), Moen, H. (Project Member), Cui, T. (Project Member), Raj, V. (Project Member), Safinianaini, N. (Project Member), Wharrie, S. (Project Member) & Mäkinen, L. (Project Member)
01/01/2021 → 31/12/2025
Project: EU H2020 Framework program
-
DATALIT: Data Literacy for Responsible Decision-Making
Marttinen, P. (Principal investigator), Tiwari, P. (Project Member), Kumar, Y. (Project Member), Raj, V. (Project Member), Ojala, F. (Project Member), Gröhn, T. (Project Member), Pöllänen, A. (Project Member), Honkamaa, J. (Project Member) & Ji, S. (Project Member)
01/10/2020 → 30/09/2023
Project: RCF SRC (STN)
Equipment
Press/Media
-
Study Data from Aalto University Provide New Insights into Networks (Multitask Balanced and Recalibrated Network for Medical Code Prediction)
21/04/2023
1 item of Media coverage
Press/Media: Media appearance
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver