03 Feb 2023

Data Wrangling with Computational Notebooks

ABOUT EVENT

Workshop Description

This workshop will be an introduction to understanding how to utilize Jupyter Notebooks to create reproducible computational workflows. Attendees will learn how to build notebooks that contain a combination of explanatory markdown formatted text and python code. Lessons will address how to load data from formats such as .csv or .tsv files, how to clean up data that has been loaded, and how to analyze this data tracking provenance and allowing reproducibility of analysis. The workshop will conclude with attendees utilizing the skills they are taught to wrangle a real-life dataset and visualize its contents. Python libraries such as Pandas and Matplotlib will be utilized.

Prerequisites:

  • Basic understanding of Python
  • Basic understanding of Markdown language
  • Basic understanding of Unix-like file systems
  • Computer with internet connections
  • Have an account on Mana (incase of problems with Binder)
  • MFA/DUO enabled on your UH Account

Learning Objectives:

By the end of this workshop attendees will be able to:

  • Understand why Jupyter notebooks are useful and how to document workflows with them
  • Apply the Python library ‘Pandas’ to load and clean data from different file formats
  • Use built in tools in ‘Pandas’ to analyze data that has been loaded
  • Visualize results in ‘Pandas’ with the ‘matplotlib’ plotting library

Tools used in this workshop:

  • Python
  • Pandas
  • Matplotlib
  • Jupyter Notebook

Registration for : Data Wrangling with Computational Notebooks

    Register Now!

    Share This Event