I used a sample of tracking data from Metrica sports and resources from Laurie Shaw (@EightyFivePoint) to build a model to identify and estimate the value added of off the ball runs (tracking data records a player’s position on the pitch at frequent intervals). Why is this useful? It’s main use would be to identify… Continue reading Off The Ball Runs
Tag: data
Predicting Titanic Survivors
Kaggle is a website which hosts machine learning competitions. I recently completed the Titanic competition on Kaggle. The task was to predict who died and who survived on the Titanic, given data on approximately 900 passengers. I did this by using a random forest model to classify the passengers. I managed to correctly classify 74%… Continue reading Predicting Titanic Survivors
Making Data Visualizations
I entered the data visualization competition run by Chance Analytics. We were given a full season’s data from the Chinese Super League and told to visualize it as best as possible. I wanted to write about my visualization and creating data visualizations in general. For my entry, I plotted the locations of low crosses which… Continue reading Making Data Visualizations