English
Related papers

Related papers: Generative AI for automatic topic labelling

200 papers

Topic modeling has become a crucial method for analyzing text data, particularly for extracting meaningful insights from large collections of documents. However, the output of these models typically consists of lists of keywords that…

Information Retrieval · Computer Science 2025-02-27 Trishia Khandelwal

The purpose of this study is to assess how large language models (LLMs) can be used for fact-checking and contribute to the broader debate on the use of automated means for veracity identification. To achieve this purpose, we use AI…

Computation and Language · Computer Science 2025-03-12 Elizaveta Kuznetsova , Ilaria Vitulano , Mykola Makhortykh , Martha Stolze , Tomas Nagy , Victoria Vziatysheva

We explore generating factual and accurate tables from the parametric knowledge of large language models (LLMs). While LLMs have demonstrated impressive capabilities in recreating knowledge bases and generating free-form text, we focus on…

Computation and Language · Computer Science 2024-06-18 Yevgeni Berkovitch , Oren Glickman , Amit Somech , Tomer Wolfson

Unsupervised machine learning techniques, such as topic modeling and clustering, are often used to identify latent patterns in unstructured text data in fields such as political science and sociology. These methods overcome common concerns…

Computation and Language · Computer Science 2025-04-25 Anna Lieb , Maneesh Arora , Eni Mustafaraj

In the rapidly evolving field of artificial intelligence (AI), the application of large language models (LLMs) in agriculture, particularly in pest management, remains nascent. We aimed to prove the feasibility by evaluating the content of…

Computation and Language · Computer Science 2024-03-19 Shanglong Yang , Zhipeng Yuan , Shunbao Li , Ruoling Peng , Kang Liu , Po Yang

Artificial intelligence (AI) is widely deployed to solve problems related to marketing attribution and budget optimization. However, AI models can be quite complex, and it can be difficult to understand model workings and insights without…

Computation and Language · Computer Science 2024-04-23 Yilin Gao , Sai Kumar Arava , Yancheng Li , James W. Snyder

Ontologies of research topics are crucial for structuring scientific knowledge, enabling scientists to navigate vast amounts of research, and forming the backbone of intelligent systems such as search engines and recommendation systems.…

Digital Libraries · Computer Science 2025-06-12 Tanay Aggarwal , Angelo Salatino , Francesco Osborne , Enrico Motta

Labeling data is essential for training text classifiers but is often difficult to accomplish accurately, especially for complex and abstract concepts. Seeking an improved method, this paper employs a novel approach using a generative…

Computation and Language · Computer Science 2024-12-31 Sergio Pelaez , Gaurav Verma , Barbara Ribeiro , Philip Shapira

Educational materials such as survey articles in specialized fields like computer science traditionally require tremendous expert inputs and are therefore expensive to create and update. Recently, Large Language Models (LLMs) have achieved…

Computation and Language · Computer Science 2024-05-24 Fan Gao , Hang Jiang , Rui Yang , Qingcheng Zeng , Jinghui Lu , Moritz Blum , Dairui Liu , Tianwei She , Yuang Jiang , Irene Li

The surge in scientific submissions has placed increasing strain on the traditional peer-review process, prompting the exploration of large language models (LLMs) for automated review generation. While LLMs demonstrate competence in…

Computation and Language · Computer Science 2025-09-25 Ruochi Li , Haoxuan Zhang , Edward Gehringer , Ting Xiao , Junhua Ding , Haihua Chen

While most generative models show achievements in image data generation, few are developed for tabular data generation. Recently, due to success of large language models (LLM) in diverse tasks, they have also been used for tabular data…

Machine Learning · Computer Science 2024-10-30 Dang Nguyen , Sunil Gupta , Kien Do , Thin Nguyen , Svetha Venkatesh

Topic modelling is a popular unsupervised method for identifying the underlying themes in document collections that has many applications in information retrieval. A topic is usually represented by a list of terms ranked by their…

Information Retrieval · Computer Science 2020-06-02 Areej Alokaili , Nikolaos Aletras , Mark Stevenson

Generative AI offers a simple, prompt-based alternative to fine-tuning smaller BERT-style LLMs for text classification tasks. This promises to eliminate the need for manually labeled training data and task-specific model training. However,…

Computation and Language · Computer Science 2024-08-19 Martin Juan José Bucher , Marco Martini

Suicide remains a pressing global health crisis, with over 720,000 deaths annually and millions more affected by suicide ideation (SI) and suicide attempts (SA). Early identification of suicidality-related factors (SrFs), including SI, SA,…

Computation and Language · Computer Science 2025-07-24 Ming Huang , Zehan Li , Yan Hu , Wanjing Wang , Andrew Wen , Scott Lane , Salih Selek , Lokesh Shahani , Rodrigo Machado-Vieira , Jair Soares , Hua Xu , Hongfang Liu

Sensitive information detection is crucial in content moderation to maintain safe online communities. Assisting in this traditionally manual process could relieve human moderators from overwhelming and tedious tasks, allowing them to focus…

The limitations sections of scientific articles play a crucial role in highlighting the boundaries and shortcomings of research, thereby guiding future studies and improving research methods. Analyzing these limitations benefits…

Computation and Language · Computer Science 2025-03-17 Ibrahim Al Azhar , Venkata Devesh Reddy , Hamed Alhoori , Akhil Pandey Akella

Artificially intelligent (AI) co-scientists must be able to sift through research literature cost-efficiently while applying nuanced scientific reasoning. We evaluate Small Language Models (SLMs, <= 8B parameters) for classifying medical…

Computational Engineering, Finance, and Science · Computer Science 2025-12-09 Muhammed Muaaz Dawood , Mohammad Zaid Moonsamy , Kaela Kokkas , Hairong Wang , Robert F. Breiman , Richard Klein , Emmanuel K. Sekyi , Bruce A. Bassett

Although supervised machine learning is popular for information extraction from clinical notes, creating large annotated datasets requires extensive domain expertise and is time-consuming. Meanwhile, large language models (LLMs) have…

Computation and Language · Computer Science 2024-01-26 Madhumita Sushil , Travis Zack , Divneet Mandair , Zhiwei Zheng , Ahmed Wali , Yan-Ning Yu , Yuwei Quan , Atul J. Butte

Automated text annotation is a compelling use case for generative large language models (LLMs) in social media research. Recent work suggests that LLMs can achieve strong performance on annotation tasks; however, these studies evaluate LLMs…

Computation and Language · Computer Science 2024-09-24 Nicholas Pangakis , Samuel Wolken

Ontologies and taxonomies of research fields are critical for managing and organising scientific knowledge, as they facilitate efficient classification, dissemination and retrieval of information. However, the creation and maintenance of…

Digital Libraries · Computer Science 2025-08-29 Tanay Aggarwal , Angelo Salatino , Francesco Osborne , Enrico Motta
‹ Prev 1 2 3 10 Next ›