Please ensure Javascript is enabled for purposes of website accessibility
Read This to Pick the Upsets and a Perfect NCAA Bracket
The-Conversation
By The Conversation
Published 7 years ago on
March 13, 2018

Share

“Beware the Ides of March.” Yes, it’s finally that time of year again: When the emperors of college basketball must watch their backs, lest the lowly bottom seeds of the tournament strike.

Opinion

Matthew Osborne

Kevin Nowland
Before March 15, millions around the world will fill out their March Madness brackets. In 2017, ESPN received a record 18.8 million brackets.
The first step to a perfect bracket is correctly choosing the first round. Unfortunately, most of us can’t predict the future. Last year, only 164 of the submitted brackets were perfect through the first round – less than 0.001 percent.


Many brackets are busted when a lower-seeded team upsets the favored higher seed. Since the field expanded to 64 teams in 1985, at least eight upsets occur on average each year. If you want to win your bracket pool, you better pick at least a few upsets.
We’re two math Ph.D. candidates at the Ohio State University who have a passion for data science and basketball. This year, we decided it would be fun to build a computer program that uses a mathematical approach to predict first-round upsets. If we’re right, a bracket picked using our program should perform better through the first round than the average bracket.

Fallible Humans

It’s not easy to identify which of the first-round games will result in an upset.
Say you have to decide between the No. 10 seed and the No. 7 seed. The No. 10 seed has pulled off upsets in its past three tournament appearances, once even making the Final Four. The No. 7 seed is a team that’s received little to no national coverage; the casual fan has probably never heard of them. Which would you choose?
If you chose the No. 10 seed in 2017, you would have gone with Virginia Commonwealth University over Saint Mary’s of California – and you would have been wrong. Thanks to a decision-making fallacy called recency bias, humans can be tricked into to using their most recent observations to make a decision.
Recency bias is just one type of bias that can infiltrate someone’s picking process, but there are many others. Maybe you’re biased toward your home team, or maybe you identify with a player and desperately want him or her to succeed. All of this influences your bracket in a potentially negative way. Even seasoned professionals fall into these traps.

Modeling Upsets

Machine learning can defend against these pitfalls.
In machine learning, statisticians, mathematicians and computer scientists train a machine to make predictions by letting it “learn” from past data. This approach has been used in many diverse fields, including marketing, medicine and sports.
Machine learning techniques can be likened to a black box. First, you feed the algorithm past data, essentially setting the dials on the black box. Once the settings are calibrated, the algorithm can read in new data, compare it to past data and then spit out its predictions.

A black box view of machine learning algorithms.
Matthew Osborne, CC BY-SA

In machine learning, there are a variety of black boxes available. For our March Madness project, the ones we wanted are known as classification algorithms. These help us determine whether or not a game should be classified as an upset, either by providing the probability of an upset or by explicitly classifying a game as one.
Our program uses a number of popular classification algorithms, including logistic regression, random forest models and k-nearest neighbors. Each method is like a different “brand” of the same machine; they work as differently under the hood as Fords and Toyotas, but perform the same classification job. Each algorithm, or box, has its own predictions about the probability of an upset.
We used the statistics of all 2001 to 2017 first-round teams to set the dials on our black boxes. When we tested one of our algorithms with the 2017 first-round data, it had about a 75 percent success rate. This gives us confidence that analyzing past data, rather than just trusting our gut, can lead to more accurate predictions of upsets, and thus better overall brackets.

What advantages do these boxes have over human intuition? For one, the machines can identify patterns in all of the 2001-2017 data in a matter of seconds. What’s more, since the machines rely only on data, they may be less likely to fall for human psychological biases.
That’s not to say that machine learning will give us perfect brackets. Even though the box bypasses human bias, it’s not immune to error. Results depend on past data. For example, if a No. 1 seed were to lose in the first round, our model would not likely predict it, because that has never happened before.
Additionally, machine learning algorithms work best with thousands or even millions of examples. Only 544 first-round March Madness games have been played since 2001, so our algorithms will not correctly call every upset. Echoing basketball expert Jalen Rose, our output should be used as a tool in conjunction with your expert knowledge – and luck! – to choose the correct games.

Machine Learning Madness?

We’re not the first people to apply machine learning to March Madness and we won’t be the last. In fact, machine learning techniques may soon be necessary to make your bracket competitive.
The ConversationYou don’t need a degree in mathematics to use machine learning – although it helps us. Soon, machine learning may be more accessible than ever. Those interested can take a look at our models online. Feel free to explore our algorithms and even come up with a better approach yourself.
Matthew Osborne, Ph.D Candidate in Mathematics, The Ohio State University and Kevin Nowland, Ph.D Candidate in Mathematics, The Ohio State University
This article was originally published on The Conversation. Read the original article.

DON'T MISS

Kiké Hernández and World Series Champion Dodgers Finalize a $6.5 Million, 1-Year Contract

DON'T MISS

California Man Calls 375 Hoax Threats in Swatting Spree Will Face Prison Time

DON'T MISS

White House Fires USAID Inspector General After Warning About Funding Oversight

DON'T MISS

Red, White, and Blueland? GOP Congressman Proposes Bill to Rename Greenland

DON'T MISS

White House Bars AP Reporter From Oval Office Because of AP Style Policy on ‘Gulf of America’

DON'T MISS

Trump Steel, Aluminum Tariffs Likely to Drive up Car Costs, Industry Leaders Say

DON'T MISS

After Criminal Case Tossed, Madec Fights to Get Fresno CC Classroom Job Back

DON'T MISS

Atmospheric River Stays on Track to Soak Fresno With Rain

DON'T MISS

Blood Center Picks Long-Time Industry Leader as New CEO

DON'T MISS

Californians Picked Up in Recent ICE Raids Include Kids and Volunteers

UP NEXT

California Man Calls 375 Hoax Threats in Swatting Spree Will Face Prison Time

UP NEXT

White House Fires USAID Inspector General After Warning About Funding Oversight

UP NEXT

Red, White, and Blueland? GOP Congressman Proposes Bill to Rename Greenland

UP NEXT

White House Bars AP Reporter From Oval Office Because of AP Style Policy on ‘Gulf of America’

UP NEXT

Trump Steel, Aluminum Tariffs Likely to Drive up Car Costs, Industry Leaders Say

UP NEXT

After Criminal Case Tossed, Madec Fights to Get Fresno CC Classroom Job Back

UP NEXT

Atmospheric River Stays on Track to Soak Fresno With Rain

UP NEXT

Blood Center Picks Long-Time Industry Leader as New CEO

UP NEXT

Californians Picked Up in Recent ICE Raids Include Kids and Volunteers

UP NEXT

Denmark Wants to Buy California? Online Petition Hits 200,000 Signatures

Red, White, and Blueland? GOP Congressman Proposes Bill to Rename Greenland

13 hours ago

White House Bars AP Reporter From Oval Office Because of AP Style Policy on ‘Gulf of America’

14 hours ago

Trump Steel, Aluminum Tariffs Likely to Drive up Car Costs, Industry Leaders Say

14 hours ago

After Criminal Case Tossed, Madec Fights to Get Fresno CC Classroom Job Back

14 hours ago

Atmospheric River Stays on Track to Soak Fresno With Rain

15 hours ago

Blood Center Picks Long-Time Industry Leader as New CEO

16 hours ago

Californians Picked Up in Recent ICE Raids Include Kids and Volunteers

16 hours ago

Denmark Wants to Buy California? Online Petition Hits 200,000 Signatures

17 hours ago

Fresno Crash Kills Pedestrian. Driver Cooperates With Police.

17 hours ago

Can a Joint Fresno Committee Allay Immigration Fears? These Leaders Want to Try

17 hours ago

Kiké Hernández and World Series Champion Dodgers Finalize a $6.5 Million, 1-Year Contract

Kiké Hernández and the Los Angeles Dodgers finalized a $6.5 million, one-year contract on Tuesday that keeps the versatile infielder/outfiel...

12 hours ago

12 hours ago

Kiké Hernández and World Series Champion Dodgers Finalize a $6.5 Million, 1-Year Contract

Photo of an armored vehicle
12 hours ago

California Man Calls 375 Hoax Threats in Swatting Spree Will Face Prison Time

Flowers and a sign are placed outside the headquarters of the U.S. Agency for International Development, or USAID, Friday, Feb. 7, 2025, in Washington. (AP/Jose Luis Magana)
13 hours ago

White House Fires USAID Inspector General After Warning About Funding Oversight

13 hours ago

Red, White, and Blueland? GOP Congressman Proposes Bill to Rename Greenland

Elon Musk listens as President Donald Trump speaks with reporters in the Oval Office at the White House, Tuesday, Feb. 11, 2025, in Washington. (AP/Alex Brandon)
14 hours ago

White House Bars AP Reporter From Oval Office Because of AP Style Policy on ‘Gulf of America’

An employee works on the production line at the Martinrea auto parts manufacturing plant that supplies auto parts to Canada and U.S. plants, in Woodbridge, Ontario, Monday, Feb. 3, 2025. (AP File)
14 hours ago

Trump Steel, Aluminum Tariffs Likely to Drive up Car Costs, Industry Leaders Say

14 hours ago

After Criminal Case Tossed, Madec Fights to Get Fresno CC Classroom Job Back

photo of a rainbow colored umbrella
15 hours ago

Atmospheric River Stays on Track to Soak Fresno With Rain

Help continue the work that gets you the news that matters most.

Search

Send this to a friend