Overview of HOPE at IberLEF 2023Multilingual Hope Speech Detection

  1. Ureña López, Luis Alfonso
  2. Valencia García, Rafael
  3. Jiménez Zafra, Salud M.
  4. García Cumbreras, Miguel Ángel
  5. García-Baena, Daniel
  6. García-Díaz, José Antonio
  7. Chakravarthi, Bharathi Raja
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Año de publicación: 2023

Número: 71

Páginas: 371-381

Tipo: Artículo

Otras publicaciones en: Procesamiento del lenguaje natural

Resumen

Hope speech is the speech that is able to relax hostile environments and that helps, inspires and encourages people in times of illness, stress, loneliness or depression. Its automatic recognition can have a very significant effect fighting against sexual and racial discrimination or fostering less belligerent environments. In contrast to identifying and censoring negative or hate speech, hope speech detection is focused on recognizing and promoting positive speech online. In this paper we present an overview of the IberLEF 2023 shared task, HOPE: Multilingual Hope Speech Detection, consisting of identifying whether texts written in English or Spanish contain hope speech or not. The competition was organized through CodaLab and attracted 50 teams that registered. Finally, 12 submitted results and 8 presented working notes describing their systems.

Referencias bibliográficas

  • Ahani, Z., G. Sidorov, O. Kolesnikova, and A. Gelbukh. 2023. Zavira at HOPE2023IberLEF: Hope Speech Detection from Text using TF-IDF Features and Machine Learning Algorithms. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Balaji, V., A. Kannan, A. Balaji, and B. Bharathi. 2023. NLP SSN CSE at HOPE2023IberLEF: Multilingual Hope Speech Detection using Machine Learning Algorithms. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEURWS. org.
  • Burnap, P., G. Colombo, R. Amery, A. Hodorog, and J. Scourfield. 2017. Multi-class machine classification of suicide-related communication on twitter. Online social networks and media, 2:32–44.
  • Cañete, J., G. Chaperon, R. Fuentes, J.-H. Ho, H. Kang, and J. Pérez. 2020. Spanish pre-trained bert model and evaluation data. Pml4dc at iclr, (2020):1–10.
  • Cañete, J., S. Donoso, F. Bravo-Marquez, A. Carvallo, and V. Araujo. 2022. Albeto and distilbeto: Lightweight spanish language models. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 4291–4298.
  • Chakravarthi, B. R. 2020. Hopeedi: A multilingual hope speech detection dataset for equality, diversity, and inclusion. In Proceedings of the Third Workshop on Computational Modeling of People’s Opinions, Personality, and Emotion’s in Social Media, pages 41–53, Barcelona, Spain (Online), dec. Association for Computational Linguistics.
  • Chakravarthi, B. R., V. Muralidaran, R. Priyadharshini, S. Chinnaudayar Navaneethakrishnan, J. P. McCrae, M. A. García-Cumbreras, S. M. Jiménez-Zafra, R. Valencia-García, P. Kumar Kumaresan, R. Ponnusamy, D. García-Baena, and J. A. García-Díaz. 2022. Overview of the shared task on hope speech detection for equality, diversity, and inclusion. Association for Computational Linguistics, pages 378–388, may.
  • Conneau, A., K. Khandelwal, N. Goyal, V. Chaudhary, G. Wenzek, F. Guzmán, é. Grave, M. Ott, L. Zettlemoyer, and V. Stoyanov. 2020. Unsupervised crosslingual representation learning at scale. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 8440–8451.
  • Devlin, J., M.-W. Chang, K. Lee, and K. Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota, June. Association for Computational Linguistics.
  • Domínguez Olmedo, J. L., J. Mata Vázquez, and V. Pachón álvarez. 2023. I2C-Huelva at HOPE2023IberLEF: Simple Use of Transformers for Automatic Hope Speech Detection. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEURWS. org.
  • García-Baena, D., M. García-Cumbreras, S. M. Zafra, J. García-Díaz, and R. Valencia-García. 2023. Hope speech detection in spanish. Language Resources and Evaluation, pages 1–28, 03.
  • García-Díaz, J. A., á. Almela, G. Alcaraz- Mármol, and R. Valencia-García. 2020. Umucorpusclassifier: Compilation and evaluation of linguistic corpus for natural language processing tasks. Procesamiento del Lenguaje Natural, 65:139–142.
  • Gemeda Yigezu, M., G. Yohannis Bade, O. Kolensikova, G. Sidorov, and A. Gelbukh. 2023. Multilingual Hope Speech Detection using Machine Learning. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • He, P., X. Liu, J. Gao, and W. Chen. 2020. Deberta: Decoding-enhanced bert with disentangled attention. arXiv preprint arXiv:2006.03654.
  • Huertas-Tato, J., A. Martin, and D. Camacho. 2022. Bertuit: Understanding spanish language in twitter through a native transformer. arXiv preprint arXiv:2204.03465.
  • Jiménez-Zafra, S. M., F. Rangel, and M. Montes-y Gómez. 2023. Overview of IberLEF 2023: Natural Language Processing Challenges for Spanish and other Iberian Languages. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Kitzie, V. 2018. I pretended to be a boy on the internet: Navigating affordances and constraints of social networking sites and search engines for lgbtq+ identity work. First Monday.
  • Lan, Z., M. Chen, S. Goodman, K. Gimpel, P. Sharma, and R. Soricut. 2019. Albert: A lite bert for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942.
  • Liu, Y., M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, and V. Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
  • Milne, D. N., G. Pink, B. Hachey, and R. A. Calvo. 2016. Clpsych 2016 shared task: Triaging content in online peer-support forums. In Proceedings of the third workshop on computational linguistics and clinical psychology, pages 118–127.
  • Ngo, A. and H. T. H. Tran. 2023. Zootopi at HOPE2023IberLEF: Is Zero-Shot Chat- GPT the Future of Hope Speech Detection? In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEURWS. org.
  • no, A. G. F., J. A. Estapé, M. P`amies, J. L. Palao, J. S. Ocampo, C. P. Carrino, C. A. Oller, C. R. Penagos, A. G. Agirre, and M. Villegas. 2022. Maria: Spanish language models. Procesamiento del Lenguaje Natural, 68.
  • Palakodety, S., A. R. KhudaBukhsh, and J. G. Carbonell. 2019. Hope speech detection: A computational analysis of the voice of peace. arXiv preprint arXiv:1909.12940.
  • Pan, R., G. Alcaraz-Mármol, and F. García- Sánchez. 2023. UMUTeam at HOPE2023IberLEF: Evaluation of Transformer Model with Data Augmentation for Multilingual Hope Speech Detection. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Rodríguez-García, M. A., A. Riaño Martínez, and S. Montalvo Herranz. 2023. URJCTeam at HOPE2023IberLEF: Multilingual Hope Speech Detection Using Transformers Architecture. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.
  • Sanh, V., L. Debut, J. Chaumond, and T. Wolf. 2019. Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
  • Shahiki-Tash, M., J. Armenta-Segura, O. Kolesnikova, G. Sidorov, and A. Gelbukh. 2023. LIDOMA at HOPE2023IberLEF: Hope Speech Detection Using Lexical Features and Convolutional Neural Networks. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023), co-located with the 39th Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), CEUR-WS.org.