From bowl games to GPAs: Can algorithms improve grading?

Gordon Scharf ’09’s search for a better grading system started at the dinner table of the Brown Hall Co-op. Engineering students were complaining that high grades were harder to earn in their classes, “but there was no good way of quantifying how much harder it was,” Scharf said.

The initial premise was not necessarily accurate — departments in the natural sciences award the fewest A’s — but the conversation pushed Scharf toward a new research topic. For his senior thesis, the operations research and financial engineering major decided to develop a solution.

He found an interested adviser, Robert Vanderbei, who happens to be a member of the University’s grading committee; and a constructive critic, physics professor Daniel Marlow, who helped to test Scharf’s model with several different distributions of grades. Vanderbei, Scharf, and Marlow have collaborated on a working paper titled “Assessing Inequity in Grading,” which presents a statistical model to assess both grade inflation and student achievement.

Grading data, Scharf said, form a “landscape of pairwise comparisons,” with thousands of students choosing from hundreds of course options. It’s comparable to major college football, where more than 100 teams compete and each plays just 12 games in a season. A team’s record of wins and losses can mean different things, depending on the strength of its competition, so the Bowl Championship Series has developed algorithms to compare teams and select the top candidates for postseason bowl games.

Using a similar approach, Vanderbei, Scharf, and Marlow’s model compares grades to calculate an “inflatedness” measure for each course and an “aptitude” measure (or adjusted grade-point average) for each student. When it has been run using actual Princeton grading data — masked for confidentiality — the model has been able to accurately predict individual grades removed from the data set.

Vanderbei said that the grading model could have several applications, from providing new data on which departments are the toughest and easiest graders to giving contextual data to the committees that select undergraduates for fellowships and honors.

The model’s “inflatedness” measures for each class also could eliminate the need for grading guidelines like the ones employed at Princeton. Professors could grade however they chose, and the model would correct for those who give inflated marks (as well as those who grade too severely). The faculty probably would not go down that path, Vanderbei said, because “it gets to be a little Big Brother-ish.”

On the Campus From bowl games to GPAs: Can algorithms improve grading?

Featured Content

What’s Next for Princeton Men’s Basketball After Ivy Madness Loss?

Mitchell ’24 Sets Rebounding Record, Women’s Basketball Back in Ivy Final

Former DOJ Lawyer Kathleen Bradish ’94 Continues to Fight Monopolies

PAWcast: Student Mental Health With Calvin Chin and Jess Deutsch ’91

PAWcast: Students Discuss Mental Health at Princeton

The Whole Student: Alumni Share Through Caring Tigers

The Pies That Bind

Q&A: Anthropology Professor Hanna Garth on Food Insecurity

Sister Inspires Maddie Offstein ’19 As Olympic Marathon Trials Approach

Matt McDonald ’15 Balances Postdoc Life with Elite Running Career

Caroline Nelson ’14 Is a Rancher With an Unconventional Model

PAW Podcasts

WMAP completes nine-year probe

A poet and nothing but a poet

Two Students Arrested at Pro-Palestine Demonstration at Princeton

April 23: Laura Coates ’01 Reports a Crisis Live at Trump Trial

PAWcast: How to Make People Care About Climate Change

Norwegian Sculler Jonas Juel ’22 Trains for Big Races Ahead

Photographer Carla Williams ’86’s Thesis Is Back in the Spotlight

Could New Jersey Be the Place Where AI Blossoms?

Featured Content

What’s Next for Princeton Men’s Basketball After Ivy Madness Loss?

Mitchell ’24 Sets Rebounding Record, Women’s Basketball Back in Ivy Final

Former DOJ Lawyer Kathleen Bradish ’94 Continues to Fight Monopolies

PAWcast: Student Mental Health With Calvin Chin and Jess Deutsch ’91

PAWcast: Students Discuss Mental Health at Princeton

The Whole Student: Alumni Share Through Caring Tigers

The Pies That Bind

Q&A: Anthropology Professor Hanna Garth on Food Insecurity

Sister Inspires Maddie Offstein ’19 As Olympic Marathon Trials Approach

Matt McDonald ’15 Balances Postdoc Life with Elite Running Career

Caroline Nelson ’14 Is a Rancher With an Unconventional Model

PAW Podcasts

PAW Podcasts

An editorially independent magazine by alumni for alumni since 1900

Search form

On the Campus From bowl games to GPAs: Can algorithms improve grading?