My Senior Thesis (completed in May 2024 at Syracuse University) analyzed which factors influence UFC judging and how individual judges value them differently. Since then, I’ve expanded the work into a full Live UFC Analytics Platform, which powers real-time judging insights and analysis for fans.

Senior Thesis Highlights:

Data & Methodology:

I wrote R scripts to scrape scorecard data from MMAdecisions.com and fight statistics from UFCStats.com.

The fight statistics included significant strikes landed with two different breakdowns:  

Target Breakdown:
  • Significant head strikes
  • Significant body strikes
  • Significant leg strikes
Positional Breakdown:
  • Significant strikes at distance
  • Significant clinch strikes
  • Significant ground strikes

Aside from significant strikes landed, the fight data also included:

  • knockdowns landed
  • non-significant strikes landed and attempted
  • takedowns landed and attempted
  • control time
  • reversals
  • submission attempts  

To remove red vs. blue corner bias, I randomly assigned each fighter’s data to one of two sets of columns. I then calculated the difference between each of the statistics mentioned above for the two fighters.

Binomial GLM Models:

Two models were built for each striking breakdown, using the majority scorecard winner as the response variable.

Target Model:
Target Model
Position Model:
Position Model

Some key takeaways from the models:

  • The target model has identified the head as the most important target to the judges
  • Both models show knockdowns are extremely impactful to winning round
  • Takedowns and reversals are valued very similarly in both models
  • The value of a takedown/reversal is around the same as 50 seconds of control time
  • Submission attempts are valued around twice as much as a takedown/reversal


Individual Judge Biases:

After building the GLM models, I used them to identify individual judges’ scoring tendencies by subsetting the data for each judge. I repeated this process for each of the judge data subsets:

  1. Rounds where the two judges not being examined had different scores were dropped
  2. Two new winner variables were created for each round:
    • Winnerj indicates who the selected judge had winning the round, and
    • Winnernj is the winner selected by the other two judges
  3. These new variables were used to create two models for each judge:
    • The judge model was fitted using the Winnerj variable
    • The non-judge model fitted using the Winnernj variable
  4. A Wald test was used to assess whether the coefficient differences between the two models were significant
  5. A graph was created to compare the coeficcients.  

This Process was also repeated using both the target & position model to examine all variables

Here is an example of the target output graphs for one of the judges (Derek Cleary): Image  
This output shows us the following about Derek Cleary:

  • He values significant head strikes lower than other judges
  • He also values significant body strikes lower than other judges
  • He also seems to value submission attempts a bit high

We can also examine the alternative graph for the position models: Image  
In this graph, the only statistically significant difference that can be seen is in significant distance strikes.

Decision Tree & Random Forest Models:

In addition to the binomial GLM model created, a decision tree and random forest were created to judge rounds. Most of the variables are the same here, but instead of the significant strikes being broken down by target (head, body legs), they are broken down by where the striking occured (distance, ground or clinch). This data will allow the decision trees to identify different types of fights, such as a round where one fighter dominated on the ground but lost on the feet. The main decision tree is shown below: Image You can see that this tree has multiple splits based on who won the distance striking as well as the grappling (through ground striking and control time). While this overfits the data, you can see how this approach makes sense for scoring. A random forest model using the same predictors was also created, and the following output shows the results of this: Image Just like in the decision tree significant distance strikes, control time, and significant ground strikes are the most important predictors. Significant clinch strikes do not have much of an impact, and the mean decrease gini is actually higher for non-significant strikes. Takedowns are not overly important due to most grappling splits occuring on control time and ground striking. Submission attempts, knockdowns, and reversals all occur infrequently so the random forest was unable to capture the importance of these.

 

Read my thesis: Analysis of UFC Judging Criteria (Spring 2024)