Computation and Language · Computer Science
Lost in the Pipeline: How Well Do Large Language Models Handle Data Preparation?
Matteo Spreafico, Ludovica Tassini, Camilla Sancricca, Cinzia Cappiello
2025-12-01
Software Engineering · Computer Science
Quality Assessment of Tabular Data using Large Language Models and Code Generation
Ashlesha Akella, Akshar Kaul, Krishnasuri Narayanam, Sameep Mehta
2025-09-23
Databases · Computer Science
Data Context Informed Data Wrangling
Martin Koehler, Alex Bogatu, Cristina Civili, Nikolaos Konstantinou +5
2018-11-26
Computation and Language · Computer Science
Evolution without Large Models: Training Language Model with Task Principles
Minghang Zhu, Shen Gao, Zhengliang Shi, Jiabao Fang +4
2025-07-09
Machine Learning · Computer Science
Iterative Data Programming for Expanding Text Classification Corpora
Neil Mallinar, Abhishek Shah, Tin Kam Ho, Rajendra Ugrani +1
2020-02-05
Software Engineering · Computer Science
Semantically Aligned Question and Code Generation for Automated Insight Generation
Ananya Singha, Bhavya Chopra, Anirudh Khatry, Sumit Gulwani +5
2024-05-06
Computation and Language · Computer Science
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea, Qiaozhu Mei
2020-08-03
Machine Learning · Computer Science
Unlock the Potential of Large Language Models for Predictive Tabular Tasks in Data Science with Table-Specific Pretraining
Yazheng Yang, Yuqi Wang, Yaxuan Li, Sankalok Sen +3
2026-04-23
Computation and Language · Computer Science
A Survey on Data Selection for Language Models
Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre +10
2024-08-05
Computation and Language · Computer Science
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
Xi Fang, Weijie Xu, Fiona Anting Tan, Jiani Zhang +6
2024-06-25
Artificial Intelligence · Computer Science
Understanding the Capabilities of Large Language Models for Automated Planning
Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi +4
2023-05-26
Computation and Language · Computer Science
Deep Sequence Models for Text Classification Tasks
Saheed Salahudeen Abdullahi, Sun Yiming, Shamsuddeen Hassan Muhammad, Abdulrasheed Mustapha +4
2022-07-20
Computation and Language · Computer Science
Skill Learning Using Process Mining for Large Language Model Plan Generation
Andrei Cosmin Redis, Mohammadreza Fani Sani, Bahram Zarrin, Andrea Burattin
2024-10-18
Computation and Language · Computer Science
Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems
Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier +1
2020-11-05
Computation and Language · Computer Science
Empowering Large Language Models for Textual Data Augmentation
Yichuan Li, Kaize Ding, Jianling Wang, Kyumin Lee
2024-04-30