Related papers: A Concept Annotation System for Clinical Records

200 papers

Clinical notes contain unstructured text provided by clinicians during patient encounters. These notes are usually accompanied by a sequence of diagnostic codes following the International Classification of Diseases (ICD). Correctly…

Machine Learning · Computer Science · 2025-10-17 · Mohammad Mansoori, Amira Soliman, Farzaneh Etminani

Black-box deep learning approaches have showcased significant potential in the realm of medical image analysis. However, the stringent trustworthiness requirements intrinsic to the medical field have catalyzed research into the utilization…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-17 · Yequan Bie, Luyang Luo, Hao Chen

Electronic health records (EHRs) contain vast amounts of complex data, but harmonizing and processing this information remains a challenging and costly task requiring significant clinical expertise. While large language models (LLMs) have…

Computation and Language · Computer Science · 2024-07-02 · João Matos, Jack Gallifant, Jian Pei, A. Ian Wong

This paper is devoted to the extraction of entities and semantic relations between them from scientific texts, where we consider scientific terms as entities. In this paper, we present a dataset that includes annotations for two tasks and…

Computation and Language · Computer Science · 2022-09-30 · Elena Bruches, Olga Tikhobaeva, Yana Dementyeva, Tatiana Batura

Ontology learning is a critical task in industry, dealing with identifying and extracting concepts captured in text data such that these concepts can be used in different tasks, e.g. information retrieval. Ontology learning is non-trivial…

Information Retrieval · Computer Science · 2019-03-12 · Yiming Xu, Dnyanesh Rajpathak, Ian Gibbs, Diego Klabjan

Electronic Health Records (EHR) store clinical documentation as base64 encoded attachments in FHIR DocumentReference resources, which makes semantic question answering difficult. Traditional vector database methods often miss nuanced…

Computation and Language · Computer Science · 2025-10-31 · Tarun Kumar Chawdhury, Jon D. Duke

The large amount of time clinicians spend sifting through patient notes and documenting in electronic health records (EHRs) is a leading cause of clinician burnout. By proactively and dynamically retrieving relevant notes during the…

Information Retrieval · Computer Science · 2023-08-17 · Sharon Jiang, Shannon Shen, Monica Agrawal, Barbara Lam, Nicholas Kurtzman, Steven Horng, David Karger, David Sontag

Clinical trials predicate subject eligibility on a diversity of criteria ranging from patient demographics to food allergies. Trials post their requirements as semantically complex, unstructured free-text. Formalizing trial criteria to a…

Computation and Language · Computer Science · 2020-07-29 · Yitong Tseo, M. I. Salkola, Ahmed Mohamed, Anuj Kumar, Freddy Abnousi

Recent advances in machine learning for medical imaging have led to impressive increases in model complexity and overall capabilities. However, the ability to discern the precise information a machine learning method is using to make…

Machine Learning · Computer Science · 2020-03-17 · Tatiana Fountoukidou, Raphael Sznitman

Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions. However, this detail is not replicated within the structured data of electronic health systems.…

Computation and Language · Computer Science · 2020-09-18 · Andriy Mulyar, Elliot Schumacher, Masoud Rouhizadeh, Mark Dredze

Clinical notes contain rich clinical narratives but their unstructured format poses challenges for large-scale analysis. Standardized terminologies such as SNOMED CT improve interoperability, yet understanding how concepts relate through…

Computation and Language · Computer Science · 2025-09-05 · Ali Noori, Somya Mohanty, Prashanti Manda

Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research. Previous works that address the task typically use dictionary-based matching methods, which can achieve high…

Computation and Language · Computer Science · 2021-01-26 · Ling Luo, Shankai Yan, Po-Ting Lai, Daniel Veltri, Andrew Oler, Sandhya Xirasagar, Rajarshi Ghosh, Morgan Similuk, Peter N. Robinson, Zhiyong Lu

Detecting personal health mentions on social media is essential to complement existing health surveillance systems. However, annotating data for detecting health mentions at a large scale is a challenging task. This research employs a…

Computation and Language · Computer Science · 2022-12-13 · Olanrewaju Tahir Aduragba, Jialin Yu, Alexandra I. Cristea

Searching patients based on the relevance of their medical records is challenging because of the inherent implicit knowledge within the patients' medical records and queries. Such knowledge is known to the medical practitioners but may be…

Information Retrieval · Computer Science · 2017-02-02 · Nut Limsopatham, Craig Macdonald, Iadh Ounis

Machine learning models that first learn a representation of a domain in terms of human-understandable concepts, then use it to make predictions, have been proposed to facilitate interpretation and interaction with models trained on…

Machine Learning · Computer Science · 2020-12-08 · Isaac Lage, Finale Doshi-Velez

The advent of large language models (LLMs) has opened new avenues for analyzing complex, unstructured data, particularly within the medical domain. Electronic Health Records (EHRs) contain a wealth of information in various formats,…

Information Retrieval · Computer Science · 2025-06-10 · Wu Hao Ran, Xi Xi, Furong Li, Jingyi Lu, Jian Jiang, Hui Huang, Yuzhuan Zhang, Shi Li

Annotated data is an essential ingredient in natural language processing for training and evaluating machine learning models. It is therefore very desirable for the annotations to be of high quality. Recent work, however, has shown that…

Computation and Language · Computer Science · 2022-09-27 · Jan-Christoph Klie, Bonnie Webber, Iryna Gurevych

This paper discusses the knowledge integration of clinical information extracted from distributed medical ontology in order to ameliorate a machine learning-based multi-label coding assignment system. The proposed approach is implemented…

Machine Learning · Computer Science · 2010-04-09 · Phanu Waraporn, Phayung Meesad, Gareth Clayton

The shift to electronic medical records (EMRs) has engendered research into machine learning and natural language technologies to analyze patient records, and to predict from these clinical outcomes of interest. Two observations motivate…

Computation and Language · Computer Science · 2019-04-09 · Sarthak Jain, Ramin Mohammadi, Byron C. Wallace

In this paper, we investigate how semantic relations between concepts extracted from medical documents can be employed to improve the retrieval of medical literature. Semantic relations explicitly represent relatedness between concepts and…

Information Retrieval · Computer Science · 2019-05-06 · Maristella Agosti, Giorgio Maria Di Nunzio, Stefano Marchesin, Gianmaria Silvello