Towards cyberbullying detection : building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter

Ferreira, Paula; Pereira, Nadia; Rosa, Hugo; Oliveira, Sofia; Coheur, Luisa; Francisco, Sofia; Souza, Sidclay; Ribeiro, Ricardo; Carvalho, Joao P.; Paulino, Paula; Trancoso, Isabel; Veiga-Simao, Ana Margarida

Towards cyberbullying detection : building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter

dc.contributor.author	Ferreira, Paula
dc.contributor.author	Pereira, Nadia
dc.contributor.author	Rosa, Hugo
dc.contributor.author	Oliveira, Sofia
dc.contributor.author	Coheur, Luisa
dc.contributor.author	Francisco, Sofia
dc.contributor.author	Souza, Sidclay
dc.contributor.author	Ribeiro, Ricardo
dc.contributor.author	Carvalho, Joao P.
dc.contributor.author	Paulino, Paula
dc.contributor.author	Trancoso, Isabel
dc.contributor.author	Veiga-Simao, Ana Margarida
dc.contributor.institution	HEI-LAB - Human Environment Interaction Lab
dc.date.accessioned	2025-06-18T15:50:01Z
dc.date.available	2025-06-18T15:50:01Z
dc.date.issued	2024-12-16
dc.description	Publisher Copyright: © 2010-2012 IEEE.
dc.description.abstract	Offense and hate speech are a source of online conflicts which have become common in social media and, as such, their study is a growing topic of research in machine learning and natural language processing. This article presents two Portuguese language offense-related datasets that deepen the study of the subject: an Aggressiveness dataset and a Conflicts/Attacks dataset. While the former is similar to other offense detection related datasets, the latter constitutes a novelty due to the use of the history of the interaction between users. Several studies were carried out to construct and analyze the data in the datasets. The first study included gathering expressions of verbal aggression witnessed by adolescents to guide data extraction for the datasets. The second study included extracting data from Twitter (in Portuguese) that matched the most frequent expressions/words/sentences that were identified in the previous study. The third study consisted in the development of the Aggressiveness dataset, the Conflicts/Attacks dataset, and classification models. In our fourth study, we proposed to examine whether online aggression and conflicts/attacks revealed any trend changes over time with a sample of 86 adolescents. With this study, we also proposed to investigate whether the amount of tweets sent over a period of 273 days was related to online aggression and conflicts/attacks. Lastly, we analyzed the percentage of participants who participated in the aggressions and/or attacks/conflicts.	en
dc.description.sponsorship	This work received national funding from FCT – Fundação para a Ciência e a Tecnologia, I.P., through the Research Center for Psychological Science of the Faculty of Psychology, University of Lisbon (PTDC/MHC/PED/3297/2014; PTDC/PSI-GER/1918/2020; UIDB/04527/2020; UIDP/04527/2020), and in collaboration with INESC-ID via project reference UIDB/50021/2020.
dc.identifier.citation	Ferreira, P, Pereira, N, Rosa, H, Oliveira, S, Coheur, L, Francisco, S, Souza, S, Ribeiro, R, Carvalho, J P, Paulino, P, Trancoso, I & Veiga-Simao, A M 2024, 'Towards cyberbullying detection : building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter', IEEE Transactions on Affective Computing, vol. 16, no. 3, pp. 1-15. https://doi.org/10.1109/TAFFC.2024.3518587
dc.identifier.doi	https://doi.org/10.1109/TAFFC.2024.3518587
dc.identifier.issn	1949-3045
dc.identifier.uri	http://hdl.handle.net/10437/15394
dc.identifier.url	https://www.scopus.com/pages/publications/85212842471
dc.language.iso	eng
dc.peerreviewed	yes
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof	IEEE Transactions on Affective Computing
dc.rights	openAccess
dc.subject	COMPUTER SCIENCE
dc.subject	CYBERBULLYING
dc.subject	DATASETS
dc.subject	HATE SPEECH
dc.subject	NATURAL LANGUAGE PROCESSING
dc.subject	SOCIAL NETWORKS
dc.subject	INFORMÁTICA
dc.subject	CYBERBULLYING
dc.subject	DADOS
dc.subject	DISCURSOS DE ÓDIO
dc.subject	LINGUAGEM NATURAL
dc.subject	REDES SOCIAIS
dc.subject	ULHT/HEI-Lab - Artigos de Revistas Internacionais com Arbitragem Científica
dc.title	Towards cyberbullying detection : building, benchmarking and longitudinal analysis of aggressiveness and conflicts/attacks datasets from Twitter	en
dc.type	article

Ficheiros

Principais

A mostrar 1 - 1 de 1

Nome:: Towards_Cyberbullying_Detection_Building_Benchmarking_and_Longitudinal_Analysis_of_Aggressiveness_and_Conflicts_Attacks_Datasets_from_Twitter.pdf
Tamanho:: 930.25 KB
Formato:: Adobe Portable Document Format

Ver/Abrir

Coleções

pure-collection
ULHT/HEI-Lab - Artigos de Revistas Internacionais com Arbitragem Científica