Related papers: Generative AI for automatic topic labelling

Using LLM-Based Approaches to Enhance and Automate Topic Labeling

Topic modeling has become a crucial method for analyzing text data, particularly for extracting meaningful insights from large collections of documents. However, the output of these models typically consists of lists of keywords that…

Information Retrieval · Computer Science 2025-02-27 Trishia Khandelwal

Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information

The purpose of this study is to assess how large language models (LLMs) can be used for fact-checking and contribute to the broader debate on the use of automated means for veracity identification. To achieve this purpose, we use AI…

Computation and Language · Computer Science 2025-03-12 Elizaveta Kuznetsova, Ilaria Vitulano, Mykola Makhortykh, Martha Stolze, Tomas Nagy, Victoria Vziatysheva

Generating Tables from the Parametric Knowledge of Language Models

We explore generating factual and accurate tables from the parametric knowledge of large language models (LLMs). While LLMs have demonstrated impressive capabilities in recreating knowledge bases and generating free-form text, we focus on…

Computation and Language · Computer Science 2024-06-18 Yevgeni Berkovitch, Oren Glickman, Amit Somech, Tomer Wolfson

Creating Targeted, Interpretable Topic Models with LLM-Generated Text Augmentation

Unsupervised machine learning techniques, such as topic modeling and clustering, are often used to identify latent patterns in unstructured text data in fields such as political science and sociology. These methods overcome common concerns…

Computation and Language · Computer Science 2025-04-25 Anna Lieb, Maneesh Arora, Eni Mustafaraj

GPT-4 as Evaluator: Evaluating Large Language Models on Pest Management in Agriculture

In the rapidly evolving field of artificial intelligence (AI), the application of large language models (LLMs) in agriculture, particularly in pest management, remains nascent. We aimed to prove the feasibility by evaluating the content of…

Computation and Language · Computer Science 2024-03-19 Shanglong Yang, Zhipeng Yuan, Shunbao Li, Ruoling Peng, Kang Liu, Po Yang

Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning

Artificial intelligence (AI) is widely deployed to solve problems related to marketing attribution and budget optimization. However, AI models can be quite complex, and it can be difficult to understand model workings and insights without…

Computation and Language · Computer Science 2024-04-23 Yilin Gao, Sai Kumar Arava, Yancheng Li, James W. Snyder

Large Language Models for Scholarly Ontology Generation: An Extensive Analysis in the Engineering Field

Ontologies of research topics are crucial for structuring scientific knowledge, enabling scientists to navigate vast amounts of research, and forming the backbone of intelligent systems such as search engines and recommendation systems.…

Digital Libraries · Computer Science 2025-06-12 Tanay Aggarwal, Angelo Salatino, Francesco Osborne, Enrico Motta

Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents

Labeling data is essential for training text classifiers but is often difficult to accomplish accurately, especially for complex and abstract concepts. Seeking an improved method, this paper employs a novel approach using a generative…

Computation and Language · Computer Science 2024-12-31 Sergio Pelaez, Gaurav Verma, Barbara Ribeiro, Philip Shapira

Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts

Educational materials such as survey articles in specialized fields like computer science traditionally require tremendous expert inputs and are therefore expensive to create and update. Recently, Large Language Models (LLMs) have achieved…

Computation and Language · Computer Science 2024-05-24 Fan Gao, Hang Jiang, Rui Yang, Qingcheng Zeng, Jinghui Lu, Moritz Blum, Dairui Liu, Tianwei She, Yuang Jiang, Irene Li

Unveiling the Merits and Defects of LLMs in Automatic Review Generation for Scientific Papers

The surge in scientific submissions has placed increasing strain on the traditional peer-review process, prompting the exploration of large language models (LLMs) for automated review generation. While LLMs demonstrate competence in…

Computation and Language · Computer Science 2025-09-25 Ruochi Li, Haoxuan Zhang, Edward Gehringer, Ting Xiao, Junhua Ding, Haihua Chen

Generating Realistic Tabular Data with Large Language Models

While most generative models show achievements in image data generation, few are developed for tabular data generation. Recently, due to success of large language models (LLM) in diverse tasks, they have also been used for tabular data…

Machine Learning · Computer Science 2024-10-30 Dang Nguyen, Sunil Gupta, Kien Do, Thin Nguyen, Svetha Venkatesh

Automatic Generation of Topic Labels

Topic modelling is a popular unsupervised method for identifying the underlying themes in document collections that has many applications in information retrieval. A topic is usually represented by a list of terms ranked by their…

Information Retrieval · Computer Science 2020-06-02 Areej Alokaili, Nikolaos Aletras, Mark Stevenson

Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification

Generative AI offers a simple, prompt-based alternative to fine-tuning smaller BERT-style LLMs for text classification tasks. This promises to eliminate the need for manually labeled training data and task-specific model training. However,…

Computation and Language · Computer Science 2024-08-19 Martin Juan José Bucher, Marco Martini

Multi-Label Classification with Generative AI Models in Healthcare: A Case Study of Suicidality and Risk Factors

Suicide remains a pressing global health crisis, with over 720,000 deaths annually and millions more affected by suicide ideation (SI) and suicide attempts (SA). Early identification of suicidality-related factors (SrFs), including SI, SA,…

Computation and Language · Computer Science 2025-07-24 Ming Huang, Zehan Li, Yan Hu, Wanjing Wang, Andrew Wen, Scott Lane, Salih Selek, Lokesh Shahani, Rodrigo Machado-Vieira, Jair Soares, Hua Xu, Hongfang Liu

Large Language Models for Automatic Detection of Sensitive Topics

Sensitive information detection is crucial in content moderation to maintain safe online communities. Assisting in this traditionally manual process could relieve human moderators from overwhelming and tedious tasks, allowing them to focus…

Computation and Language · Computer Science 2024-09-04 Ruoyu Wen, Stephanie Elena Crowe, Kunal Gupta, Xinyue Li, Mark Billinghurst, Simon Hoermann, Dwain Allan, Alaeddin Nassani, Thammathip Piumsomboon

LimTopic: LLM-based Topic Modeling and Text Summarization for Analyzing Scientific Articles limitations

The limitations sections of scientific articles play a crucial role in highlighting the boundaries and shortcomings of research, thereby guiding future studies and improving research methods. Analyzing these limitations benefits…

Computation and Language · Computer Science 2025-03-17 Ibrahim Al Azhar, Venkata Devesh Reddy, Hamed Alhoori, Akhil Pandey Akella

Small Language Models Can Use Nuanced Reasoning For Health Science Research Classification: A Microbial-Oncogenesis Case Study

Artificially intelligent (AI) co-scientists must be able to sift through research literature cost-efficiently while applying nuanced scientific reasoning. We evaluate Small Language Models (SLMs, <= 8B parameters) for classifying medical…

Computational Engineering, Finance, and Science · Computer Science 2025-12-09 Muhammed Muaaz Dawood, Mohammad Zaid Moonsamy, Kaela Kokkas, Hairong Wang, Robert F. Breiman, Richard Klein, Emmanuel K. Sekyi, Bruce A. Bassett

A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification

Although supervised machine learning is popular for information extraction from clinical notes, creating large annotated datasets requires extensive domain expertise and is time-consuming. Meanwhile, large language models (LLMs) have…

Computation and Language · Computer Science 2024-01-26 Madhumita Sushil, Travis Zack, Divneet Mandair, Zhiwei Zheng, Ahmed Wali, Yan-Ning Yu, Yuwei Quan, Atul J. Butte

Keeping Humans in the Loop: Human-Centered Automated Annotation with Generative AI

Automated text annotation is a compelling use case for generative large language models (LLMs) in social media research. Recent work suggests that LLMs can achieve strong performance on annotation tasks; however, these studies evaluate LLMs…

Computation and Language · Computer Science 2024-09-24 Nicholas Pangakis, Samuel Wolken

Leveraging Large Language Models for Generating Research Topic Ontologies: A Multi-Disciplinary Study

Ontologies and taxonomies of research fields are critical for managing and organising scientific knowledge, as they facilitate efficient classification, dissemination and retrieval of information. However, the creation and maintenance of…

Digital Libraries · Computer Science 2025-08-29 Tanay Aggarwal, Angelo Salatino, Francesco Osborne, Enrico Motta