Please use this identifier to cite or link to this item:
https://dipositint.ub.edu/dspace/handle/2445/182252
Title: | Tracking de jugadors en imatges amb transformers |
Author: | Calvo Ventura, Enric |
Director/Tutor: | Radeva, Petia Tatjer i Montaña, Joan Carles |
Keywords: | Xarxes neuronals convolucionals Aprenentatge automàtic Programari Treballs de fi de grau Visió per ordinador Anàlisi numèrica Convolutional neural networks Machine learning Computer software Computer vision Numerical analysis Bachelor's theses |
Issue Date: | 20-Jun-2021 |
Abstract: | [en] Data is becoming increasingly important across the board, and so is within sports disciplines. Particularly, in football. For instance, the position of the different players on the pitch is in itself data which has been proved useful for applications such as injury prevention, player scouting and so on. This type of data, designated “tracking data”, is obtained at the time of this study using GPS technology on a professional level. Since local leagues and football institutions are responsible for the management of this data, it can prove to be difficult for outside actors to obtain access to it. In reality, this means this data ends up being accessible to high income and top league clubs and institutions only. For this very reason, the need to find alternative ways to generate and obtain such data arises. This project focuses its scope on using computer vision as an alternative to the previously stated. The aim of this work therefore, is to beforehand acquire a theoretical view and understanding in the branch of machine learning and convolutional neural networks and their application to detect people in football videos. In particular we will explore the use of Transformers, an architecture of convolutional neural networks that appeared very recently and involved a paradigm shift in state of the art models to process the source material and generate the raw data. With a specialized dataset of exclusively football images, we have been able to train a DETR model and compare its results with other existent models as a reference. With the results in hand, we have explored ways of improve such models. We have obtained a trained model that successfully manages to detect the players on a football match with the caveat of it being dependent on the source material’s quality. While it does succeed in most regular game play, it struggles in situations where the source material presents occlusions and accumulations of players in the image (for example during a corner kick, where many players can accumulate near the goal). We have been able to even slightly improve the results of the trained model by managing double detections, but the size of our dataset has proved to be a constraint in this direction. Finally, we have discussed some possible future lines of improvement to achieve better results, such as increasing our dataset or using a wider range of frames to reduce the margin of error when players are occluded. |
Note: | Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2021, Director: Petia Radeva i Joan Carles Tatjer i Montaña |
URI: | https://hdl.handle.net/2445/182252 |
Appears in Collections: | Programari - Treballs de l'alumnat Treballs Finals de Grau (TFG) - Matemàtiques Treballs Finals de Grau (TFG) - Enginyeria Informàtica |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
main.py | Codi font | 72.31 kB | Unknown | View/Open |
tfg_enric_calvo_ventura.pdf | Memòria | 16.92 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License