Minimalist Data Wrangling with Python
2022-11-09
Marek Gagolewski
Code
- github.com/gagolews/datawranglingpy
- github.com/gagolews/teaching-data
Abstract
Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.