GPA Dataset (Spring 2010 through Spring 2020)


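The "GPA Dataset" contains the grade distributions, and therefore the data needed to compute GPAs, for course sections at the University of Illinois over a ten-year period from Spring 2010 through Spring 2020. Each row describes one course section.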

  • Dataset Format: Well-formatted CSV with column headers as the first row
  • Dataset Size: 53,933 rows × 21 columns
  • CSV File Location: https://waf.cs.illinois.edu/discovery/gpa.csv
  • Dataset Variables:
    • Year : number ➜ The year the course section was offered
    • Term : string ➜ The term the course section was offered (Fall, Spring, Summer, or Winter)
    • YearTerm : string ➜ A formatted combination of the year and term
    • Subject : string ➜ 2-4 letter subject code for the course; ex: HIST for History, CS for Computer Science, etc.
    • Number : number ➜ 1-3 digit course number
    • Course Title : string ➜ A short title describing the course
    • A+ : number ➜ The number of A+ grades given in the course section
    • A : number ➜ The number of A grades given in the course section
    • A- : number ➜ The number of A- grades given in the course section
    • B+, B, B-, C+, C, C-, D+, D, D- : number ➜ The number of grades of each of these letters given in the course section (one column per letter grade)
    • F : number ➜ The number of F grades given in the course section
    • Primary Instructor : string ➜ The primary instructor of record in the course section
    • Students : number ➜ The number of students per course section
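
Because the dataset stores counts of each letter grade rather than a GPA value, an average GPA per course section has to be derived from those counts. The sketch below shows one way to do that as a weighted average. The grade-point scale in GRADE_POINTS is an assumption (a conventional 4.0 scale, which this page does not state), the helper name section_gpa is hypothetical, and the two sample rows are made up for illustration rather than taken from the dataset.

```python
import pandas as pd

# ASSUMPTION: a conventional 4.0 grade-point scale; the dataset page does not define one.
GRADE_POINTS = {
    "A+": 4.00, "A": 4.00, "A-": 3.67,
    "B+": 3.33, "B": 3.00, "B-": 2.67,
    "C+": 2.33, "C": 2.00, "C-": 1.67,
    "D+": 1.33, "D": 1.00, "D-": 0.67,
    "F": 0.00,
}


def section_gpa(df: pd.DataFrame) -> pd.Series:
    """Average grade points per row, weighted by the letter-grade counts."""
    counts = df[list(GRADE_POINTS)]
    # Multiplying by a Series aligns its index with the column labels.
    total_points = (counts * pd.Series(GRADE_POINTS)).sum(axis=1)
    total_grades = counts.sum(axis=1)
    # Rows with no letter grades yield NaN (0 / 0).
    return total_points / total_grades


# Made-up rows: every grade column is 0 except the ones overridden.
sample = pd.DataFrame([
    {**dict.fromkeys(GRADE_POINTS, 0), "A": 2, "B": 2},
    {**dict.fromkeys(GRADE_POINTS, 0), "A+": 3},
])
sample["GPA"] = section_gpa(sample)
print(sample["GPA"])
```

With the real dataset, the same helper could be applied to the DataFrame returned by pd.read_csv, provided the grade columns carry the names listed above.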

Using the GPA Dataset in Python

The dataset can be loaded using the pandas library in Python:

import pandas as pd
df = pd.read_csv("https://waf.cs.illinois.edu/discovery/gpa.csv")
df
       Year  Term    YearTerm  Subject  Number  Course Title                   A+ ... W  Primary Instructor
0      2020  Spring  2020-sp   AAS      100     Intro Asian American Studies   ...       Boonsripaisal, Simon
1      2020  Spring  2020-sp   AAS      100     Intro Asian American Studies   ...       Boonsripaisal, Simon
2      2020  Spring  2020-sp   AAS      100     Intro Asian American Studies   ...       Uhall, Michael B
3      2020  Spring  2020-sp   AAS      100     Intro Asian American Studies   ...       Uhall, Michael B
4      2020  Spring  2020-sp   AAS      100     Intro Asian American Studies   ...       Kang, Yoonjung
...    ...   ...     ...       ...      ...     ...                            ...       ...
53928  2010  Summer  2010-su   STAT     410     Statistics and Probability II  ...       Stepanov, Alexei G
53929  2010  Summer  2010-su   STAT     440     Statistical Data Management    ...       Unger, David
53930  2010  Summer  2010-su   TAM      212     Introductory Dynamics          ...       Morgan, William T
53931  2010  Summer  2010-su   TAM      251     Introductory Solid Mechanics   ...       Ott-Monsivais, Stephanie
53932  2010  Summer  2010-su   TAM      335     Introductory Fluid Mechanics   ...       Keane, Richard D

The full GPA dataset stored in a DataFrame (53,933 rows).
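
Once the data is in a DataFrame, a common next step is to check its shape and aggregate the grade counts by group. The following is a minimal sketch of a groupby aggregation that totals the A-range grades per subject. The helper name a_grades_by_subject and the derived "A-range" column are hypothetical, and the small sample frame uses made-up counts so the example runs without downloading the CSV.

```python
import pandas as pd


def a_grades_by_subject(df: pd.DataFrame) -> pd.DataFrame:
    """Total A+, A, and A- grades per Subject, largest total first."""
    a_cols = ["A+", "A", "A-"]
    totals = df.groupby("Subject")[a_cols].sum()
    # "A-range" is a hypothetical derived column, not part of the dataset.
    totals["A-range"] = totals.sum(axis=1)
    return totals.sort_values("A-range", ascending=False)


# Made-up rows that reuse the dataset's column names.
sample = pd.DataFrame({
    "Subject": ["AAS", "AAS", "STAT"],
    "A+": [1, 0, 5],
    "A": [2, 8, 10],
    "A-": [1, 7, 2],
})

print(sample.shape)  # (3, 4)
print(a_grades_by_subject(sample))
```

To run this against the full dataset, pass the df loaded with pd.read_csv above in place of sample.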

Pages Using the GPA Dataset

  1. Learn Page: Python for Data Science: Introduction to DataFrames
  2. Video Walk-Through & Worksheet: Learn Page: GPA Dataset and Operations on Groups
  3. Video Walk-Through & Worksheet: Learn Page: Aggregation Functions for groupby Groups
  4. Video Walk-Through & Worksheet: Learn Page: Examples Aggregating groupby Groups
  5. Video Walk-Through & Worksheet: Learn Page: Reading Histograms (Part 1)
  6. Video Walk-Through & Worksheet: Learn Page: Histograms, Bar Charts, and Box Plots