SOTAVerified

Minimalist Data Wrangling with Python

2022-11-09Code Available1· sign in to hype

Marek Gagolewski

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.

Tasks

Reproductions