English
Related papers

Related papers: BOOST: Bootstrapping Strategy-Driven Reasoning Pro…

200 papers

Language models for program synthesis are usually trained and evaluated on programming competition datasets (MBPP, APPS). However, these datasets are limited in size and quality, while these language models are extremely data hungry.…

Software Engineering · Computer Science 2025-07-23 Noah van der Vleuten

Large language models (LLMs) can achieve highly effective performance on various reasoning tasks by incorporating step-by-step chain-of-thought (CoT) prompting as demonstrations. However, the reasoning chains of demonstrations generated by…

Computation and Language · Computer Science 2024-03-18 Jiashuo Sun , Yi Luo , Yeyun Gong , Chen Lin , Yelong Shen , Jian Guo , Nan Duan

Prompting a language model (LM) is an increasingly important research topic for better utilization of large language models (LLMs). While simple prompting is effective for single-step questions, it fails to activate the correct knowledge…

Artificial Intelligence · Computer Science 2025-10-09 Iok Tong Lei , Ziyu Zhu , Han Yu , Yige Yao , Zhidong Deng

Few-shot prompting elicits the remarkable abilities of large language models by equipping them with a few demonstration examples in the input. However, the traditional method of providing large language models with all demonstration…

Computation and Language · Computer Science 2023-12-27 Yu Ji , Wen Wu , Yi Hu , Hong Zheng , Liang He

Generating step-by-step "chain-of-thought" rationales improves language model performance on complex reasoning tasks like mathematics or commonsense question-answering. However, inducing language model rationale generation currently…

Machine Learning · Computer Science 2022-05-23 Eric Zelikman , Yuhuai Wu , Jesse Mu , Noah D. Goodman

Multi-hop Question Generation is the task of generating questions which require the reader to reason over and combine information spread across multiple passages using several reasoning steps. Chain-of-thought rationale generation has been…

Computation and Language · Computer Science 2022-11-17 Saurabh Kulshreshtha , Anna Rumshisky

Large language models (LLMs) have recently been shown to deliver impressive performance in various NLP tasks. To tackle multi-step reasoning tasks, few-shot chain-of-thought (CoT) prompting includes a few manually crafted step-by-step…

Computation and Language · Computer Science 2023-05-29 Lei Wang , Wanyu Xu , Yihuai Lan , Zhiqiang Hu , Yunshi Lan , Roy Ka-Wei Lee , Ee-Peng Lim

Chain-of-thought (CoT) prompting with large language models has proven effective in numerous natural language processing tasks, but designing prompts that generalize well to diverse problem types can be challenging, especially in the…

Computation and Language · Computer Science 2023-06-12 Zhanming Jie , Wei Lu

The reasoning performance of Large Language Models (LLMs) on a wide range of problems critically relies on chain-of-thought prompting, which involves providing a few chain of thought demonstrations as exemplars in prompts. Recent work,…

Computation and Language · Computer Science 2025-01-08 Sijia Chen , Baochun Li , Di Niu

Methods such as chain-of-thought prompting and self-consistency have pushed the frontier of language model reasoning performance with no additional training. To further improve performance, we propose a prompt ensembling method for large…

Computation and Language · Computer Science 2023-04-13 Silviu Pitis , Michael R. Zhang , Andrew Wang , Jimmy Ba

Few-shot question answering (QA) aims at precisely discovering answers to a set of questions from context passages while only a few training samples are available. Although existing studies have made some progress and can usually achieve…

Computation and Language · Computer Science 2023-06-08 Xiusi Chen , Yu Zhang , Jinliang Deng , Jyun-Yu Jiang , Wei Wang

Diffusion models have demonstrated excellent potential for generating diverse images. However, their performance often suffers from slow generation due to iterative denoising. Knowledge distillation has been recently proposed as a remedy…

Computer Vision and Pattern Recognition · Computer Science 2023-06-12 Jiatao Gu , Shuangfei Zhai , Yizhe Zhang , Lingjie Liu , Josh Susskind

Large Language Models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks, yet generating reliable reasoning processes remains a significant challenge. We present a unified probabilistic framework that formalizes LLM…

Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. However, most current methods complete the data collection and training steps in a single round, which may overlook the…

Computation and Language · Computer Science 2024-06-28 Haoyu Wang , Guozheng Ma , Ziqiao Meng , Zeyu Qin , Li Shen , Zhong Zhang , Bingzhe Wu , Liu Liu , Yatao Bian , Tingyang Xu , Xueqian Wang , Peilin Zhao

Machine learning models achieve state-of-the-art performance on many supervised learning tasks. However, prior evidence suggests that these models may learn to rely on shortcut biases or spurious correlations (intuitively, correlations that…

Machine Learning · Computer Science 2021-08-31 Sindhu C. M. Gowda , Shalmali Joshi , Haoran Zhang , Marzyeh Ghassemi

Reinforcement learning systems have the potential to enable continuous improvement in unstructured environments, leveraging data collected autonomously. However, in practice these systems require significant amounts of instrumentation or…

Robotics · Computer Science 2022-03-30 Abhishek Gupta , Corey Lynch , Brandon Kinman , Garrett Peake , Sergey Levine , Karol Hausman

Prevailing methods for mapping large generative language models to supervised tasks may fail to sufficiently probe models' novel capabilities. Using GPT-3 as a case study, we show that 0-shot prompts can significantly outperform few-shot…

Computation and Language · Computer Science 2021-02-16 Laria Reynolds , Kyle McDonell

Fact-checking real-world claims often requires collecting multiple pieces of evidence and applying complex multi-step reasoning. In this paper, we present Program-Guided Fact-Checking (ProgramFC), a novel fact-checking model that decomposes…

Computation and Language · Computer Science 2023-05-23 Liangming Pan , Xiaobao Wu , Xinyuan Lu , Anh Tuan Luu , William Yang Wang , Min-Yen Kan , Preslav Nakov

Abc-boost is a new line of boosting algorithms for multi-class classification, by utilizing the commonly used sum-to-zero constraint. To implement abc-boost, a base class must be identified at each boosting step. Prior studies used a very…

Machine Learning · Computer Science 2010-06-28 Ping Li

Boosting is a general method of generating many simple classification rules and combining them into a single, highly accurate rule. In this talk, I will review the AdaBoost boosting algorithm and some of its underlying theory, and then look…

Machine Learning · Computer Science 2013-01-07 Robert E. Schapire
‹ Prev 1 2 3 10 Next ›