English

Minimalist Data Wrangling with Python

Machine Learning 2022-11-10 v1

Abstract

Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.

Keywords

Cite

@article{arxiv.2211.04630,
  title  = {Minimalist Data Wrangling with Python},
  author = {Marek Gagolewski},
  journal= {arXiv preprint arXiv:2211.04630},
  year   = {2022}
}

Comments

Release: v1.0.2.9001 (2022-11-09T12:17:50+1100)