Activity 3: Introduction to Python pandas
Due: On git by Tuesday, Jan. 31, 2017 at 9:30am
Team: This is a solo assignment.
Grading: This assignment is worth 20 points.

Overview

In lecture, you started using pandas -- an extremely powerful tool for data science. In this activity, you will use pandas to find out GPAs at The University of Illinois.


Initial Files

A new directory has been created on the release git repository, which you should complete this assignment in. To merge this into your repository, navigate to your workbook directory using a command line and run the following commands:

git fetch release
git merge release/activity3 master -m "merge"


Assignment

You will complete this assignment in a jupyter notebook inside of the activity3 directory. To open up the jupyter notebook, run the following command:

jupyter notebook

The dataset you will use is a favorite, containing GPAs from every course in Spring 2016. The CSV file can be downloaded here:

Download Spr2016GPA.csv

Using pandas, create a jupyter notebook that analyzes GPAs. Specifically, you should find the following things:

  1. What was the average GPA across all sections of your favorite class in Spring 2016?
  2. What was the average GPA across all sections of CS 225 in Spring 2016?
  3. What was the average GPA across all sections of MATH in Spring 2016?
  4. What was the average GPA across all sections at UIUC in Spring 2016?

Make sure to save your notebook before closing your browser or command line!


Submission

This activity is submitted digitally via git. View detailed submission instructions here.