English

Using ChatGPT for Data Science Analyses

Machine Learning 2026-03-03 v2 Computation and Language Computation

Abstract

As a result of recent advancements in generative AI, the field of data science is prone to various changes. The way practitioners construct their data science workflows is now irreversibly shaped by recent advancements, particularly by tools like OpenAI's Data Analysis plugin. While it offers powerful support as a quantitative co-pilot, its limitations demand careful consideration in empirical analysis. This paper assesses the potential of ChatGPT for data science analyses, illustrating its capabilities for data exploration and visualization, as well as for commonly used supervised and unsupervised modeling tasks. While we focus here on how the Data Analysis plugin can serve as co-pilot for Data Science workflows, its broader potential for automation is implicit throughout.

Keywords

Cite

@article{arxiv.2404.08480,
  title  = {Using ChatGPT for Data Science Analyses},
  author = {Ozan Evkaya and Miguel de Carvalho},
  journal= {arXiv preprint arXiv:2404.08480},
  year   = {2026}
}

Comments

19 pages with figures and appendix

R2 v1 2026-06-28T15:52:31.419Z