SINAI at Twitter-Normalization 2013
- Arturo Montejo Ráez 1
- M. Carlos Diaz Galiano 1
- Eugenio Martínez Cámara 1
- M. Teresa Martín Valdivia 1
- Miguel A. García Cumbreras 1
- L. Alfonso Ureña López 1
1 Universidad de Jaén
- Alberto Díaz Esteban (coord.)
- Iñaki Alegria Loinaz (coord.)
- Julio Villena Román (coord.)
Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural
ISBN: 978-84-695-8349-4
Year of publication: 2013
Pages: 72-75
Congress: Sociedad Española para el Procesamiento del Lenguaje Natural. Congreso (29. 2013. Madrid)
Type: Conference paper
Abstract
In this paper, we present the Twitter-normalization system developed by the SINAI group. The system applies a series of conversions to the text using translation lexicons and a spell checker. It obtains a poor result, only 37.6% accuracy; an analysis of the errors indicates that the system should be improved in areas such as the treatment of diminutives and superlatives, entities, and abbreviations.
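As a rough illustration of the approach the abstract describes (a lexicon lookup followed by a spell-checking fallback), the following minimal Python sketch may help. It is not the SINAI system: the names `LEXICON`, `VOCABULARY`, `normalize_token`, and `normalize` are hypothetical, the toy word lists are invented for the example, and the standard-library `difflib` fuzzy matcher stands in for whatever spell checker the authors actually used.

```python
import difflib

# Hypothetical toy resources, invented for illustration only; the
# lexicons and spell checker used in the paper are not reproduced here.
LEXICON = {"q": "que", "xq": "porque", "tb": "también"}
VOCABULARY = ["que", "porque", "también", "casa", "mañana", "hola"]


def normalize_token(token):
    """Normalize one token: lexicon lookup first, then a fuzzy-match fallback."""
    lower = token.lower()
    # Step 1: direct conversion through the translation lexicon.
    if lower in LEXICON:
        return LEXICON[lower]
    # Step 2: tokens already in the vocabulary need no correction.
    if lower in VOCABULARY:
        return lower
    # Step 3: stand-in spell checker, picking the closest vocabulary word.
    matches = difflib.get_close_matches(lower, VOCABULARY, n=1, cutoff=0.8)
    # Unknown tokens (e.g. entities) are left unchanged.
    return matches[0] if matches else token


def normalize(text):
    """Normalize a whitespace-tokenized tweet."""
    return " ".join(normalize_token(t) for t in text.split())


print(normalize("xq tb holaa"))
```

A sketch like this leaves anything outside its small word lists untouched, which is consistent with the weaknesses the abstract reports for diminutives, superlatives, entities, and abbreviations.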