MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification

dc.contributor.authorPetukhova, Alina
dc.contributor.authorFachada, Nuno
dc.contributor.institutionFaculdade de Engenharia
dc.date.issued2023-04-23
dc.descriptionData
dc.description.abstractThis article presents a dataset of 10,917 news articles with hierarchical news categories collected between 1 January 2019 and 31 December 2019. We manually labeled the articles based on a hierarchical taxonomy with 17 first-level and 109 second-level categories. This dataset can be used to train machine learning models for automatically classifying news articles by topic. This dataset can be helpful for researchers working on news structuring, classification, and predicting future events based on released news. Keywords: news dataset; text classification; NLP; media topic taxonomypt
dc.description.statusNon peer reviewed
dc.formatapplication/pdf
dc.identifier.citationPetukhova , A & Fachada , N 2023 , ' MN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classification ' , Data .
dc.identifier.issn2306-5729
dc.language.isoeng
dc.publisherMDPI
dc.relation.ispartofData
dc.rightsopenAccess
dc.subjectRECOLHA DE DADOS
dc.subjectNOTÍCIAS
dc.subjectPROCESSAMENTO DA LINGUAGEM NATURAL
dc.subjectCOMUNICAÇÃO SOCIAL
dc.subjectPROCESSAMENTO DE DADOS
dc.subjectTAXONOMIA
dc.subjectINFORMÁTICA
dc.subjectDATA COLLECTION
dc.subjectNEWS
dc.subjectNATURAL LANGUAGE PROCESSING
dc.subjectMEDIA
dc.subjectDATA PROCESSING
dc.subjectTAXONOMY
dc.subjectCOMPUTER SCIENCE
dc.titleMN-DS: A Multilabeled News Dataset for News Articles Hierarchical Classificationen

Ficheiros

Principais
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
2023_data_mnds.pdf
Tamanho:
1.17 MB
Formato:
Adobe Portable Document Format
Descrição:
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descrição: