This is the repo for Intro to Python: Pandas for Metadata Transformation and Cleanup workshop hosted by Northeast Institutional Repository Day (NIRD) in March 2022.
Website: https://mjanowiecki.github.io/intro-pandas-metadata/intro.html
Workbook: https://github.com/mjanowiecki/pandas-workbook
This workshop covers some basics of the pandas library. The pandas library is an open-source Python library that is very popular for data manipulation and analytics acrosss a wide array of displicines. Here are some tasks that pandas is great at:
- Reading and writing data between different formats (CSV, JSON, XML, Excel, SQL, and more)
- Merging and joining data
- Reshaping and pivoting data
- Handling missing data
- Getting quick overviews of data values
- Literally anything data analysis and visualization!