Introduction to Pandas for data science (LUNARC)

13-14 May 2024

Pandas is a powerful, popular Python package for cleaning, manipulating, and statistically analyzing large tabular data sets. It is particularly useful in preparation for AI/ML applications and publication-ready visualization. Originally developed for financial panel data, it is now used by data scientists in a huge variety of fields, from marketing to medicine to astronomy. Pandas is capable of handling data sets of several Gigabytes.

This course will introduce the core Pandas data types, basic input/output routines, data selection and filtering, data inspection and cleaning methods, built-in and user-defined functions for data manipulation, hierarchical data structures, and some built-in visualization methods. There will be a mix of static examples and live demonstrations via Jupyter notebook, and exercises will be provided to complement the lecture materials.

For more information and access to registration, please visit
https://www.lunarc.lu.se/learning-more/training-courses/introduction-to-pandas-for-data-science-13-14-may-2024/.