
21 Jan 2022
Data Wrangling with Computational Notebooks
ABOUT EVENT
Registration: http://go.hawaii.edu/VQ5
Workshop Description
This workshop will be an introduction to understanding how to utilize Jupyter Notebooks to create reproducible computational workflows. Attendees will learn how to build notebooks that contain a combination of explanatory markdown formatted text and python code. Lessons will address how to load data from formats such as .csv or .tsv files, how to clean up data that has been loaded, and how to analyze this data tracking provenance and allowing reproducibility of analysis. The workshop will conclude with attendees utilizing the skills they are taught to wrangle a real-life dataset and visualize its contents. Python libraries such as Pandas and Matplotlib will be utilized.
Prerequisites:
- Basic understanding of Python
- Basic understanding of Markdown language
- Basic understanding of Unix-like file systems
- Computer with internet connections
- Have an account on Mana (incase of problems with Binder)
- MFA/DUO enabled on your UH Account
Learning Outcomes:
By the end of this workshop attendees will be able to:
- Understand why Jupyter notebooks are useful and how to document workflows with them
- Apply the Python library ‘Pandas’ to load and clean data from different file formats
- Use built in tools in ‘Pandas’ to analyze data that has been loaded
- Visualize results in ‘Pandas’ with the ‘matplotlib’ plotting library
Tools used in this workshop:
- Python
- Pandas
- Matplotlib
- Jupyter Notebook