GPA Dataset (Spring 2010 through Spring 2020)
The "GPA Dataset" contains grade distributions for course sections at the University of Illinois over a ten-year period from Spring 2010 through Spring 2020.
- Dataset Format: Well-formatted CSV with column headers as the first row
- Dataset Size: 53,933 rows × 21 columns
- CSV File Location: https://waf.cs.illinois.edu/discovery/gpa.csv
- Dataset Variables:
  - Year: number ➜ The year the course section was offered
  - Term: string ➜ The term the course section was offered (Fall, Spring, Summer, or Winter)
  - YearTerm: string ➜ A formatted combination of the year and term
  - Subject: string ➜ 2-4 letter subject code for the course; ex: HIST for History, CS for Computer Science, etc.
  - Number: number ➜ 1-3 digit course number
  - Course Title: string ➜ A short title describing the course
  - A+: number ➜ The number of A+ grades given in the course section
  - A: number ➜ The number of A grades given in the course section
  - A-: number ➜ The number of A- grades given in the course section
  - B+, B, B-, C+, C, C-, D+, D, D-: number ➜ The number of grades given in the course section
  - F: number ➜ The number of F grades given in the course section
  - Primary Instructor: string ➜ The primary instructor of record in the course section
  - Students: The number of students per course section
Using the GPA Dataset in Python
The dataset can be loaded using the pandas library in Python:
import pandas as pd
df = pd.read_csv("https://waf.cs.illinois.edu/discovery/gpa.csv")
df

| | Year | Term | YearTerm | Subject | Number | Course Title | A+ | A | A- | B+ | B | B- | C+ | C | C- | D+ | D | D- | F | W | Primary Instructor |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2020 | Spring | 2020-sp | AAS | 100 | Intro Asian American Studies | 12 | 12 | 1 | 0 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | Boonsripaisal, Simon |
| 1 | 2020 | Spring | 2020-sp | AAS | 100 | Intro Asian American Studies | 5 | 12 | 4 | 1 | 4 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | Boonsripaisal, Simon |
| 2 | 2020 | Spring | 2020-sp | AAS | 100 | Intro Asian American Studies | 0 | 8 | 7 | 3 | 6 | 3 | 2 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | Uhall, Michael B |
| 3 | 2020 | Spring | 2020-sp | AAS | 100 | Intro Asian American Studies | 8 | 6 | 2 | 0 | 4 | 1 | 1 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | Uhall, Michael B |
| 4 | 2020 | Spring | 2020-sp | AAS | 100 | Intro Asian American Studies | 5 | 4 | 3 | 1 | 4 | 1 | 1 | 2 | 1 | 1 | 0 | 1 | 1 | 0 | Kang, Yoonjung |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 53928 | 2010 | Summer | 2010-su | STAT | 410 | Statistics and Probability II | 5 | 10 | 2 | 2 | 5 | 1 | 0 | 1 | 3 | 0 | 0 | 0 | 2 | 1 | Stepanov, Alexei G |
| 53929 | 2010 | Summer | 2010-su | STAT | 440 | Statistical Data Management | 4 | 12 | 8 | 1 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Unger, David |
| 53930 | 2010 | Summer | 2010-su | TAM | 212 | Introductory Dynamics | 0 | 1 | 3 | 2 | 5 | 7 | 5 | 1 | 1 | 0 | 2 | 0 | 1 | 0 | Morgan, William T |
| 53931 | 2010 | Summer | 2010-su | TAM | 251 | Introductory Solid Mechanics | 1 | 2 | 2 | 1 | 5 | 0 | 3 | 3 | 2 | 0 | 0 | 1 | 1 | 0 | Ott-Monsivais, Stephanie |
| 53932 | 2010 | Summer | 2010-su | TAM | 335 | Introductory Fluid Mechanics | 5 | 8 | 9 | 4 | 5 | 1 | 0 | 4 | 0 | 0 | 1 | 0 | 2 | 0 | Keane, Richard D |
The full GPA dataset stored in a DataFrame (53,933 rows).
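Since each row stores grade counts rather than a GPA, a per-section GPA has to be derived from those counts. The following is a minimal sketch of one way to do that. It rests on assumptions not stated in the dataset description: the grade-point values in `GRADE_POINTS` are a common 4.0-scale mapping, W grades are left out of the average, and the function name `add_section_gpa` and the new column names `Graded` and `Section GPA` are hypothetical. The small `sample` DataFrame reuses the grade counts from rows 0 and 53930 of the table above so the example runs without downloading the CSV.

```python
import pandas as pd

# ASSUMPTION: a common 4.0-scale mapping. The dataset description does not
# define grade points, so adjust these values to the scale you need.
GRADE_POINTS = {
    "A+": 4.00, "A": 4.00, "A-": 3.67,
    "B+": 3.33, "B": 3.00, "B-": 2.67,
    "C+": 2.33, "C": 2.00, "C-": 1.67,
    "D+": 1.33, "D": 1.00, "D-": 0.67,
    "F": 0.00,
}


def add_section_gpa(df):
    """Return a copy of df with 'Graded' and 'Section GPA' columns added.

    Both column names are hypothetical. W grades are ignored (assumption).
    """
    grade_cols = list(GRADE_POINTS)
    out = df.copy()
    # Number of letter grades (A+ through F) given in each section.
    out["Graded"] = out[grade_cols].sum(axis=1)
    # Multiply each grade column by its point value, then sum across the row.
    weighted = out[grade_cols].mul(pd.Series(GRADE_POINTS), axis="columns").sum(axis=1)
    # Sections with zero letter grades produce NaN here.
    out["Section GPA"] = weighted / out["Graded"]
    return out


# Grade counts copied from rows 0 and 53930 of the table above.
columns = ["Subject", "Number", *GRADE_POINTS, "W"]
sample = pd.DataFrame(
    [
        ["AAS", 100, 12, 12, 1, 0, 2, 1, 0, 1, 0, 0, 0, 0, 1, 0],
        ["TAM", 212, 0, 1, 3, 2, 5, 7, 5, 1, 1, 0, 2, 0, 1, 0],
    ],
    columns=columns,
)

result = add_section_gpa(sample)
print(result[["Subject", "Number", "Graded", "Section GPA"]])
```

To apply the same calculation to the full dataset, pass the DataFrame loaded with `pd.read_csv` above: `add_section_gpa(df)`.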
Pages Using the GPA Dataset
- Learn Page: Python for Data Science: Introduction to DataFrames
- Video Walk-Through & Worksheet: Learn Page: GPA Dataset and Operations on Groups
- Video Walk-Through & Worksheet: Learn Page: Aggregation Functions for groupby Groups
- Video Walk-Through & Worksheet: Learn Page: Examples Aggregating groupby Groups
- Video Walk-Through & Worksheet: Learn Page: Reading Histograms (Part 1)
- Video Walk-Through & Worksheet: Learn Page: Histograms, Bar Charts, and Box Plots