Overview of PoliticEs 2022:Spanish Author Profiling for Political Ideology

  1. García-Díaz, José Antonio
  2. Jiménez Zafra, Salud M.
  3. Martín Valdivia, María Teresa
  4. García-Sánchez, Francisco
  5. Ureña López, Luis Alfonso
  6. Valencia García, Rafael
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Año de publicación: 2022

Número: 69

Páginas: 265-272

Tipo: Artículo

Otras publicaciones en: Procesamiento del lenguaje natural

Resumen

Este artículo presenta la tarea PoliticEs 2022, organizada en el taller IberLEF 2022, en el marco de la 38 edición del Congreso Internacional de la Sociedad Española para el Procesamiento del Lenguaje Natural. Esta tarea tiene como objetivo extraer la ideología política de un usuario a partir de un conjunto de tuits publicados por él. En concreto, se centró en la identificación del género y la profesión, como rasgos demográficos, y la ideología política desde una perspectiva binaria y multiclase, como rasgo psicográfico. La tarea PoliticEs atrajo a 63 equipos que se inscribieron a través de CodaLab. Finalmente, 20 enviaron resultados y 14 presentaron artículos describiendo sus sistemas. La mayoría de los equipos propusieron enfoques basados en transformers, aunque algunos de ellos también utilizaron algoritmos tradicionales de aprendizaje automático o incluso una combinación de ambos enfoques.

Referencias bibliográficas

  • Baumgaertner, B., J. E. Carlisle, and F. Justwan. 2018. The influence of political ideology and trust on willingness to vaccinate. PloS one, 13(1):e0191728.
  • Bevendorff, J., B. Chulvi, G. L. D. L. Peña Sarracen, M. Kestemont, E. Manjavacas, I. Markov, M. Mayerl, M. Potthast, F. Rangel, P. Rosso, et al. 2021. Overview of PAN 2021: authorship verification, profiling hate speech spreaders on twitter, and style change detection. In International Conference of the CrossLanguage Evaluation Forum for European Languages, pages 419–431. Springer.
  • Cabrera, H., E. S. Tellez, and S. Miranda. 2022. INFOTEC-LaBD at PoliticES 2022: Low-dimensional Stacking Model for Political Ideology Profiling. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Cañete, J., G. Chaperon, R. Fuentes, J.-H. Ho, H. Kang, and J. Perez. 2020. Spanish pre-trained bert model and evaluation data. Pml4dc at iclr, 2020:1–10.
  • Cañete, J., S. Donoso, F. Bravo-Marquez, A. Carvallo, and V. Araujo. 2022. Albeto and distilbeto: Lightweight spanish language models. arXiv preprint arXiv:2204.09145.
  • Carrasco, S. S. and R. C. Rosillo. 2022. LosCalis at PoliticEs 2022: Political Author Profiling using BETO and MarIA. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • De la Rosa, J., E. G. Ponferrada, M. Romero, P. Villegas, P. G. de Prado Salas, and M. Grandury. 2022. Bertin: Efficient pretraining of a spanish language model using perplexity sampling. Procesamiento del Lenguaje Natural, 68:13–23.
  • Espin-Riofrio, C., J. Ortiz-Zambrano, and A. Montejo-Raez. 2022. SINAI at PoliticEs 2022: Exploring Relative Frequency of Words in Stylometrics for Profile Discovery. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Fatke, M. 2017. Personality traits and political ideology: A first global assessment. Political Psychology, 38(5):881–899.
  • Garcıa-Dıaz, J. A., A. Almela, G. Alcaraz- Marmol, and R. Valencia-Garcıa. 2020. UMUCorpusClassifier: Compilation and evaluation of linguistic corpus for Natural Language Processing tasks. Procesamiento del Lenguaje Natural, 65(0):139– 142.
  • Garcıa-Dıaz, J. A., R. Colomo-Palacios, and R. Valencia-Garcıa. 2022. Psychographic traits identification based on political ideology: An author analysis study on spanish politicians’ tweets posted in 2020. Future Generation Computer Systems, 130:59–74.
  • Garcıa-Ochoa Martın-Forero, A., A. Mas- sotti Lopez, and I. Segura-Bedmar. 2022. UC3MDeep at PoliticEs 2022: Exploring Traditional Machine Learning Algorithms for Political Ideology Detection. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Gutierrez Fandiño, A., J. Armengol Estape, M. P`amies, J. Llop Palao, J. Silveira Ocampo, C. Pio Carrino, C. Armentano Oller, C. Rodriguez Penagos, A. Gonzalez Agirre, and M. Villegas. 2022. MarIA: Spanish language models. Procesamiento del Lenguaje Natural, 68.
  • Holgado, C. G. and A. Sinha. 2022. HalBERT at PoliticEs 2022: Are Machine Learning Algorithms better for Author Profiling? In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Kenton, J. D. M.-W. C. and L. K. Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACLHLT, pages 4171–4186.
  • Liu, Y., M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, and V. Stoyanov. 2019. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
  • Manea, A.-A. and L. P. Dinu. 2022. UniRetro at PoliticEs@IberLef 2022: Political Ideology Profiling using Language Models. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Montes-y Gomez, M., J. Gonzalo, F. Rangel, M. Casavantes, M. A. Alvarez-Carmona, G. Bel-Enguix, H. Jair Escalante, L. Freitas, A. Miranda-Escalada, F. RodrıguezSanchez, A. Rosa, M. A. SobrevillaCabezudo, M. Taule, and R. ValenciaGarcıa, editors. 2022. Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022).
  • Mosquera, A. 2022. Alejandro Mosquera at PoliticEs 2022: Towards Robust Spanish Author Profiling and Lessons Learned from Adversarial Attacks. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Ochoa-Hernandez, J. L. and Y. Aleman. 2022. TeamMX at PoliticEs 2022: Analysis of Feature Sets in Spanish Author Profiling for Political Ideology. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Ramos, P. C., J. M. Vazquez, V. P. Alvarez, and J. L. D. Olmedo. 2022. I2C at PoliticEs 2022: Using Transformers to Identify Political Ideology in Spanish Tweets. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Rodrigo, A., H. Fabregat, and R. Centeno. 2022. UNED at PoliticEs 2022: Testing Approximate Nearest Neighbors and Spanish Language Models for Author Profiling in Political Ideology. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Rodrıguez-Garcıa, M. A., S. Montalvo Her- ranz, and R. Martınez Unanue. 2022. URJC-Team at PoliticEs 2022: Political Ideology Prediction using Linear Classifiers. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Santibañez-Cortes, E., A. CarrilloCabrera, Y. A. Castillo-Castillo, D. Moctezuma, and V. Muñiz-Sanchez. 2022. CIMAT 2021 at PoliticEs 2022: Ensemble Based Classification Algorithms for Author Profiling in Spanish Language. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Ta, H. T., A. B. S. Rahman, L. Najjar, and A. Gelbukh. 2022. THANGCIC at PoliticEs 2022: Term-based BERT for Extracting Political Ideology from Spanish Author Profiling. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.
  • Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L . Kaiser, and I. Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems, 30.
  • Verhulst, B., L. J. Eaves, and P. K. Hatemi. 2012. Correlation not causation: The relationship between personality traits and political ideologies. American journal of political science, 56(1):34–51.
  • Villa-Cueva, E., I. Gonzalez-Franco, F. Sanchez-Vega, and A. P. LopezMonroy. 2022. NLP-CIMAT at PoliticEs 2022: PolitiBETO, a Domain-Adapted Transformer for Multi-class Political Author Profiling. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022). CEUR Workshop Proceedings, CEUR-WS, A Coruña, Spain.