La negación en español: análisis y tipología de patrones de negación

  1. Martín Valdivia, María Teresa
  2. Jiménez Zafra, Salud M.
  3. Martí Antonín, Maria Antònia
  4. Taulé Delor, Mariona
  5. Nofre, Montserrat
  6. Marsó, Laia
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2016

Issue: 57

Pages: 41-48

Type: Article

More publications in: Procesamiento del lenguaje natural

Abstract

In this paper we present the criteria applied for the annotation of the SFU ReviewSP-NEGcorpus and the corresponding linguistic typology. This typology has the advantage that it is easy to express in terms of a tagset for corpus annotation: the types are clearly defined, which avoid the ambiguity in the annotation process, and they present a wide coverage (i.e. they covered/solved all the cases occurring in the corpus). The corpus consists of 400 reviews and 198,551 words. Currently, we have annotated 75% and from a total of 6,331 annotated sentences 2,953 contain at least one negation.

Bibliographic References

  • Blanco E. y D. Moldovan. 2013. Retrieving implicit positive meaning from negated statements. Natural Language Engineering, 20 (4): 501-535. Cambridge University Press. 10
  • Bosque I. y V. Demonte. 1999. Gramática Descriptiva de la Lengua Española, Vol. 2. Espasa Calpe, España.
  • Councill, I. G., R. McDonald, y L. Velikovich, L. 2010. What's great and what's not: learning to classify the scope of negation for improved sentiment analysis. Proceedings of the workshop on negation and speculation in natural language processing, páginas 51-59, Uppsala, ACL.
  • Erk K., y S. Padó. 2004. A powerful and versatileXML format forrepresenting rolesemantic annotation. Proceedings of 4th International Conference on Language Resources and Evaluation (LREC’04), Lisboa, Portugal.
  • Huddleston, R.D. y G. K. Pullum. 2002. The Cambridge Grammar of the English Language. Cambridge University Press, Cambridge, UK.
  • Kim J.D., T. Ohta y J. Tsujii. 2008. Corpus annotation for mining biomedical events from literature. BMC Bioinformatics, 9:10.
  • Konstantinova, N., S. C de Sousa, N. P. Díaz, N. P. Cruz, M. J. Maña, M. Taboada y R. Mitkov. 2012. A review corpus annotated for negation, speculation and their scope. Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’12), páginas 3190-3195, Turkey.
  • Konstantinova, N. y S. C de Sousa. 2011. Annotating Negation and Speculation: the Case of the Review Domain. Proceedings of the Student Research Workshop associated with RANLP 2011, páginas 139-144, Bulgaria.
  • Morante, R. y W. Daelemans. 2012. ConanDoyle-neg: Annotation of negation in Conan Doyle stories. Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’12), páginas 1563-1568, Turkey.
  • Morante, R. y C. Sporleder. 2012. Modality and negation: An introduction to the special issue. Computational linguistics, 38(2), 223260.
  • Palmer, M., P. Kingsbury y D. Gildea. 2005. The Proposition Bank: An Annotated Corpus of Semantic Roles, Computational Linguistics, 21 (1).
  • Pang, B., L. Lee, y S. Vaithyanathan, S. 2002.Thumbs up?: sentiment classification using machine learning techniques. Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10: 79-86. ACL.
  • Polanyi L., Zaenen, A. 2006. Contextual Valence Shifters. Computing affect and attitude in text: Theory and applications, 20: 1-10. The Information Retrieval Series.
  • Pyysala S., F. Ginter, J. Heimonen, J. Björne, J. Boberg, J. Járvinen y T. Salakosk. 2007. BioInfer: a corpus for information extraction in the biomedical domain. BMC Bioinformatics, 8: 50.
  • RAE. 2009. Nueva Gramática de la Lengua Española. Vol. 2. Espasa Libros, España.
  • Ruppenhofer J., C. Sporleder, R. Morante, C. Baker y M. Palmer. 2010. Semeval-2010 task 10: Linking events and their participants in discourse. Proceedings of the 5th Workshop on Semantic Evaluations (ACL 2010), páginas 45-50, Suecia.
  • Taboada, M., C. Anthony y K. Voll. 2006. Methods for creating semantic orientation dictionaries. Proceedings of the 5th Conference on Language Resources and Evaluation (LREC’06), páginas 427-432.
  • Vincze, V., Szarvas G., Farkas R., Móra G. y Csirik J. 2008. The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes. BMC Bioinformatics, 9:1-9.
  • Vincze, V. 2010. Speculation and negation annotation in natural language texts: what the case of bioscope might (not) reveal. Proceedings of the workshop on negation and speculation in natural language processing, páginas 51-59, Uppsala, ACL.
  • Wiegand, M., A. Balahur, B. Roth, D. Klakow, y A. Montoyo. 2010. A survey on the role of negation in sentiment analysis. In Proceedings of the workshop on negation and speculation in natural language processing, páginas 60-68, ACL.