Related papers: Knowledge Engineering for Planning-Based Hypothesi…

A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models

Hypothesis generation is a fundamental step in scientific discovery, yet it is increasingly challenged by information overload and disciplinary fragmentation. Recent advances in Large Language Models (LLMs) have sparked growing interest in…

Computation and Language · Computer Science 2025-04-09 Atilla Kaan Alkan, Shashwat Sourav, Maja Jablonska, Simone Astarita, Rishabh Chakrabarty, Nikhil Garuda, Pranav Khetarpal, Maciej Pióro, Dimitrios Tanoglidis, Kartheik G. Iyer, Mugdha S. Polimera, Michael J. Smith, Tirthankar Ghosal, Marc Huertas-Company, Sandor Kruk, Kevin Schawinski, Ioana Ciucă

Scientific Hypothesis Generation by a Large Language Model: Laboratory Validation in Breast Cancer Treatment

Large language models (LLMs) have transformed AI and achieved breakthrough performance on a wide range of tasks. In science, the most interesting application of LLMs is for hypothesis formation. A feature of LLMs which results from their…

Quantitative Methods · Quantitative Biology 2025-05-09 Abbi Abdel-Rehim, Hector Zenil, Oghenejokpeme Orhobor, Marie Fisher, Ross J. Collins, Elizabeth Bourne, Gareth W. Fearnley, Emma Tate, Holly X. Smith, Larisa N. Soldatova, Ross D. King

Hypothesis generation and updating in large language models

Large language models (LLMs) increasingly help people solve problems, from debugging code to repairing machinery. This process requires generating plausible hypotheses from partial descriptions, then updating them as more information…

Machine Learning · Computer Science 2026-05-08 Hua-Dong Xiong

Hypothesis Generation with Large Language Models

Effective generation of novel hypotheses is instrumental to scientific progress. So far, researchers have been the main powerhouse behind hypothesis generation by painstaking data analysis and thinking (also known as the Eureka moment). In…

Artificial Intelligence · Computer Science 2024-12-20 Yangqiaoyu Zhou, Haokun Liu, Tejes Srivastava, Hongyuan Mei, Chenhao Tan

Automatically Generating Hard Math Problems from Hypothesis-Driven Error Analysis

Numerous math benchmarks exist to evaluate LLMs' mathematical capabilities. However, most involve extensive manual effort and are difficult to scale. Consequently, they cannot keep pace with LLM development or easily provide new instances…

Artificial Intelligence · Computer Science 2026-04-07 Jiayu Fu, Mourad Heddaya, Chenhao Tan

Hypothesis Generation via LLM-Automated Language Bias for ILP

Inductive Logic Programming (ILP) is a principled approach for generalizing regularities from data and constructing hypotheses as interpretable logic programs. However, a key limitation is its reliance on expert-crafted language bias - the…

Artificial Intelligence · Computer Science 2026-01-21 Yang Yang, Jiemin Wu, Yutao Yue

HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation

There is growing interest in hypothesis generation with large language models (LLMs). However, fundamental questions remain: what makes a good hypothesis, and how can we systematically evaluate methods for hypothesis generation? To address…

Artificial Intelligence · Computer Science 2026-02-12 Haokun Liu, Sicong Huang, Jingyu Hu, Yangqiaoyu Zhou, Chenhao Tan

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation

The rapid growth of biomedical knowledge has outpaced our ability to efficiently extract insights and generate novel hypotheses. Large language models (LLMs) have emerged as a promising tool to revolutionize knowledge interaction and…

Computation and Language · Computer Science 2024-07-16 Biqing Qi, Kaiyan Zhang, Kai Tian, Haoxiang Li, Zhang-Ren Chen, Sihang Zeng, Ermo Hua, Hu Jinfang, Bowen Zhou

Generative AI-Based Effective Malware Detection for Embedded Computing Systems

One of the pivotal security threats for the embedded computing systems is malicious software, a.k.a. malware. With efficiency and efficacy, Machine Learning (ML) has been widely adopted for malware detection in recent times. Despite being…

Cryptography and Security · Computer Science 2024-04-16 Sreenitha Kasarapu, Sanket Shukla, Rakibul Hassan, Avesta Sasan, Houman Homayoun, Sai Manoj Pudukotai Dinakarrao

Sparse Autoencoders for Hypothesis Generation

We describe HypotheSAEs, a general method to hypothesize interpretable relationships between text data (e.g., headlines) and a target variable (e.g., clicks). HypotheSAEs has three steps: (1) train a sparse autoencoder on text embeddings to…

Computation and Language · Computer Science 2025-06-10 Rajiv Movva, Kenny Peng, Nikhil Garg, Jon Kleinberg, Emma Pierson

Toward Reliable Scientific Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models

Large language models (LLMs) have shown significant potential in scientific disciplines such as biomedicine, particularly in hypothesis generation, where they can analyze vast literature, identify patterns, and suggest research directions.…

Computation and Language · Computer Science 2025-06-10 Guangzhi Xiong, Eric Xie, Corey Williams, Myles Kim, Amir Hassan Shariatmadari, Sikun Guo, Stefan Bekiranov, Aidong Zhang

Incident Response Planning Using a Lightweight Large Language Model with Reduced Hallucination

Timely and effective incident response is key to managing the growing frequency of cyberattacks. However, identifying the right response actions for complex systems is a major technical challenge. A promising approach to mitigate this…

Cryptography and Security · Computer Science 2025-08-08 Kim Hammar, Tansu Alpcan, Emil C. Lupu

Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions

Large Language Models (LLMs) are transforming scientific hypothesis generation and validation by enabling information synthesis, latent relationship discovery, and reasoning augmentation. This survey provides a structured overview of…

Computation and Language · Computer Science 2025-05-09 Adithya Kulkarni, Fatimah Alotaibi, Xinyue Zeng, Longfeng Wu, Tong Zeng, Barry Menglong Yao, Minqian Liu, Shuaicheng Zhang, Lifu Huang, Dawei Zhou

A Bayesian generative neural network framework for epidemic inference problems

The reconstruction of missing information in epidemic spreading on contact networks can be essential in the prevention and containment strategies. The identification and warning of infectious but asymptomatic individuals (i.e., contact…

Social and Information Networks · Computer Science 2022-11-21 Indaco Biazzo, Alfredo Braunstein, Luca Dall'Asta, Fabio Mazza

Literature Meets Data: A Synergistic Approach to Hypothesis Generation

AI holds promise for transforming scientific processes, including hypothesis generation. Prior work on hypothesis generation can be broadly categorized into theory-driven and data-driven approaches. While both have proven effective in…

Artificial Intelligence · Computer Science 2025-01-10 Haokun Liu, Yangqiaoyu Zhou, Mingxuan Li, Chenfei Yuan, Chenhao Tan

Strategic Planning for Network Data Analysis

As network traffic monitoring software for cybersecurity, malware detection, and other critical tasks becomes increasingly automated, the rate of alerts and supporting data gathered, as well as the complexity of the underlying model,…

Artificial Intelligence · Computer Science 2013-05-14 Kartik Talamadupula, Octavian Udrea, Anton Riabov, Anand Ranganathan

Large Language Models for Automated Open-domain Scientific Hypotheses Discovery

Hypothetical induction is recognized as the main reasoning type when scientists make observations about the world and try to propose hypotheses to explain those observations. Past research on hypothetical induction is under a constrained…

Computation and Language · Computer Science 2024-06-13 Zonglin Yang, Xinya Du, Junxian Li, Jie Zheng, Soujanya Poria, Erik Cambria

Learning to Predict with Supporting Evidence: Applications to Clinical Risk Prediction

The impact of machine learning models on healthcare will depend on the degree of trust that healthcare professionals place in the predictions made by these models. In this paper, we present a method to provide people with clinical expertise…

Machine Learning · Computer Science 2021-03-05 Aniruddh Raghu, John Guttag, Katherine Young, Eugene Pomerantsev, Adrian V. Dalca, Collin M. Stultz

Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents

Materials discovery and design are essential for advancing technology across various industries by enabling the development of application-specific materials. Recent research has leveraged Large Language Models (LLMs) to accelerate this…

Computation and Language · Computer Science 2025-02-11 Shrinidhi Kumbhar, Venkatesh Mishra, Kevin Coutinho, Divij Handa, Ashif Iquebal, Chitta Baral

Hypothesis Generation and Inductive Inference in Children and Language Models

Real world decision-making requires constructing mental models under uncertainty over evidence, over the underlying causal rules, and over the state of the world itself. Which computational principles underpin human inference under such…

Artificial Intelligence · Computer Science 2026-05-26 Jeffrey Qin, Wasu Top Piriyakulki, Zhuangfei Gao, Mia Radovanovic, Jessica Sommerville, Kevin Ellis, Marta Kryven