Please use this identifier to cite or link to this item:
https://dipositint.ub.edu/dspace/handle/2445/202048
Title: | Clasificación de comentarios hacia futbolistas en Instagram |
Author: | Vinagre Triguero, Jorge |
Director/Tutor: | Seguí Mesquida, Santi |
Keywords: | Sistemes classificadors (Intel·ligència artificial) Aprenentatge automàtic Programari Treballs de fi de grau Tractament del llenguatge natural (Informàtica) Xarxes socials en línia Learning classifier systems Machine learning Computer software Natural language processing (Computer science) Online social networks Bachelor's theses |
Issue Date: | 13-Jun-2023 |
Abstract: | [en] Currently, multiple machine learning models have been implemented for sentiment analysis that have the ability to classify text according to whether it is positive or negative, both individual words and complex sentences. However, the models with the highest hit rates have required high computational power to classify the text in question and also to be constantly updated with more examples. In this case, the aim is to classify the polarity of offensive comments on social networks, specifically on Instagram and directed towards professional footballers. Therefore, the objectives of this study have been defined firstly as the autonomous collection of data and the creation of a dataset to then train models. Following this thread, the next objectives are to investigate the different methodologies, technologies and models of the Python machine learning library, Scikit-learn. Finally, after making a comparison between the 5 selected models, one of these models will be chosen to determine the polarity of the comments previously extracted by sentiment classification (“sentiment analysis”). Despite the low level of personal knowledge available in the field of Natural Language Processing at the beginning, and the lack of computational capacity, the results of the model can be considered satisfactory, since a coherent classification based on a well-founded justification is being obtained. However, if the initial planning had been more accurate, the results could have been improved and if these data are intended to be used in another project, the model should be trained on a machine with higher computational capacity by which the model can be trained for a longer time with more advanced methods, such as some of those that are nowadays considered as part of the state of the art in this field. |
Note: | Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2023, Director: Santi Seguí Mesquida |
URI: | https://hdl.handle.net/2445/202048 |
Appears in Collections: | Treballs Finals de Grau (TFG) - Enginyeria Informàtica |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
tfg_vinagre_triguero_jorge.pdf | Memòria | 3.21 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License