Related papers: A Concept Annotation System for Clinical Records

LTR-ICD: A Learning-to-Rank Approach for Automatic ICD Coding

Clinical notes contain unstructured text provided by clinicians during patient encounters. These notes are usually accompanied by a sequence of diagnostic codes following the International Classification of Diseases (ICD). Correctly…

Machine Learning · Computer Science 2025-10-17 Mohammad Mansoori , Amira Soliman , Farzaneh Etminani

MICA: Towards Explainable Skin Lesion Diagnosis via Multi-Level Image-Concept Alignment

Black-box deep learning approaches have showcased significant potential in the realm of medical image analysis. However, the stringent trustworthiness requirements intrinsic to the medical field have catalyzed research into the utilization…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Yequan Bie , Luyang Luo , Hao Chen

EHRmonize: A Framework for Medical Concept Abstraction from Electronic Health Records using Large Language Models

Electronic health records (EHRs) contain vast amounts of complex data, but harmonizing and processing this information remains a challenging and costly task requiring significant clinical expertise. While large language models (LLMs) have…

Computation and Language · Computer Science 2024-07-02 João Matos , Jack Gallifant , Jian Pei , A. Ian Wong

TERMinator: A system for scientific texts processing

This paper is devoted to the extraction of entities and semantic relations between them from scientific texts, where we consider scientific terms as entities. In this paper, we present a dataset that includes annotations for two tasks and…

Computation and Language · Computer Science 2022-09-30 Elena Bruches , Olga Tikhobaeva , Yana Dementyeva , Tatiana Batura

Automatic Ontology Learning from Domain-Specific Short Unstructured Text Data

Ontology learning is a critical task in industry, dealing with identifying and extracting concepts captured in text data such that these concepts can be used in different tasks, e.g. information retrieval. Ontology learning is non-trivial…

Information Retrieval · Computer Science 2019-03-12 Yiming Xu , Dnyanesh Rajpathak , Ian Gibbs , Diego Klabjan

Beyond Long Context: When Semantics Matter More than Tokens

Electronic Health Records (EHR) store clinical documentation as base64 encoded attachments in FHIR DocumentReference resources, which makes semantic question answering difficult. Traditional vector database methods often miss nuanced…

Computation and Language · Computer Science 2025-10-31 Tarun Kumar Chawdhury , Jon D. Duke

Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes

The large amount of time clinicians spend sifting through patient notes and documenting in electronic health records (EHRs) is a leading cause of clinician burnout. By proactively and dynamically retrieving relevant notes during the…

Information Retrieval · Computer Science 2023-08-17 Sharon Jiang , Shannon Shen , Monica Agrawal , Barbara Lam , Nicholas Kurtzman , Steven Horng , David Karger , David Sontag

Information Extraction of Clinical Trial Eligibility Criteria

Clinical trials predicate subject eligibility on a diversity of criteria ranging from patient demographics to food allergies. Trials post their requirements as semantically complex, unstructured free-text. Formalizing trial criteria to a…

Computation and Language · Computer Science 2020-07-29 Yitong Tseo , M. I. Salkola , Ahmed Mohamed , Anuj Kumar , Freddy Abnousi

Concept-Centric Visual Turing Tests for Method Validation

Recent advances in machine learning for medical imaging have led to impressive increases in model complexity and overall capabilities. However, the ability to discern the precise information a machine learning method is using to make…

Machine Learning · Computer Science 2020-03-17 Tatiana Fountoukidou , Raphael Sznitman

Phenotyping of Clinical Notes with Improved Document Classification Models Using Contextualized Neural Language Models

Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions. However, this detail is not replicated within the structured data of electronic health systems.…

Computation and Language · Computer Science 2020-09-18 Andriy Mulyar , Elliot Schumacher , Masoud Rouhizadeh , Mark Dredze

Semantic Analysis of SNOMED CT Concept Co-occurrences in Clinical Documentation using MIMIC-IV

Clinical notes contain rich clinical narratives but their unstructured format poses challenges for large-scale analysis. Standardized terminologies such as SNOMED CT improve interoperability, yet understanding how concepts relate through…

Computation and Language · Computer Science 2025-09-05 Ali Noori , Somya Mohanty , Prashanti Manda

PhenoTagger: A Hybrid Method for Phenotype Concept Recognition using Human Phenotype Ontology

Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research. Previous works that address the task typically use dictionary-based matching methods, which can achieve high…

Computation and Language · Computer Science 2021-01-26 Ling Luo , Shankai Yan , Po-Ting Lai , Daniel Veltri , Andrew Oler , Sandhya Xirasagar , Rajarshi Ghosh , Morgan Similuk , Peter N. Robinson , Zhiyong Lu

Multi-task Learning for Personal Health Mention Detection on Social Media

Detecting personal health mentions on social media is essential to complement existing health surveillance systems. However, annotating data for detecting health mentions at a large scale is a challenging task. This research employs a…

Computation and Language · Computer Science 2022-12-13 Olanrewaju Tahir Aduragba , Jialin Yu , Alexandra I. Cristea

Inferring Conceptual Relationships When Ranking Patients

Searching patients based on the relevance of their medical records is challenging because of the inherent implicit knowledge within the patients' medical records and queries. Such knowledge is known to the medical practitioners but may be…

Information Retrieval · Computer Science 2017-02-02 Nut Limsopatham , Craig Macdonald , Iadh Ounis

Learning Interpretable Concept-Based Models with Human Feedback

Machine learning models that first learn a representation of a domain in terms of human-understandable concepts, then use it to make predictions, have been proposed to facilitate interpretation and interaction with models trained on…

Machine Learning · Computer Science 2020-12-08 Isaac Lage , Finale Doshi-Velez

Structured Semantics from Unstructured Notes: Language Model Approaches to EHR-Based Decision Support

The advent of large language models (LLMs) has opened new avenues for analyzing complex, unstructured data, particularly within the medical domain. Electronic Health Records (EHRs) contain a wealth of information in various formats,…

Information Retrieval · Computer Science 2025-06-10 Wu Hao Ran , Xi Xi , Furong Li , Jingyi Lu , Jian Jiang , Hui Huang , Yuzhuan Zhang , Shi Li

Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future

Annotated data is an essential ingredient in natural language processing for training and evaluating machine learning models. It is therefore very desirable for the annotations to be of high quality. Recent work, however, has shown that…

Computation and Language · Computer Science 2022-09-27 Jan-Christoph Klie , Bonnie Webber , Iryna Gurevych

Ontology-supported processing of clinical text using medical knowledge integration for multi-label classification of diagnosis coding

This paper discusses the knowledge integration of clinical information extracted from distributed medical ontology in order to ameliorate a machine learning-based multi-label coding assignment system. The proposed approach is implemented…

Machine Learning · Computer Science 2010-04-09 Phanu Waraporn , Phayung Meesad , Gareth Clayton

An Analysis of Attention over Clinical Notes for Predictive Tasks

The shift to electronic medical records (EMRs) has engendered research into machine learning and natural language technologies to analyze patient records, and to predict from these clinical outcomes of interest. Two observations motivate…

Computation and Language · Computer Science 2019-04-09 Sarthak Jain , Ramin Mohammadi , Byron C. Wallace

A Relation Extraction Approach for Clinical Decision Support

In this paper, we investigate how semantic relations between concepts extracted from medical documents can be employed to improve the retrieval of medical literature. Semantic relations explicitly represent relatedness between concepts and…

Information Retrieval · Computer Science 2019-05-06 Maristella Agosti , Giorgio Maria Di Nunzio , Stefano Marchesin , Gianmaria Silvello