SINAI at Twitter-Normalization 2013

  1. Arturo Montejo Ráez 1
  2. M. Carlos Diaz Galiano 1
  3. Eugenio Martínez Cámara 1
  4. M. Teresa Martín Valdivia 1
  5. Miguel A. García Cumbreras 1
  6. L. Alfonso Ureña López 1
  1. 1 Universidad de Jaén
    Jaén, Spain

    ROR https://ror.org/0122p5f64

Book:
XXIX Congreso de la Sociedad Española de Procesamiento de Lenguaje Natural: SEPLN 2013
  1. Alberto Díaz Esteban (coord.)
  2. Iñaki Alegria Loinaz (coord.)
  3. Julio Villena Román (coord.)

Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural

ISBN: 978-84-695-8349-4

Year of publication: 2013

Pages: 72-75

Congress: Sociedad Española para el Procesamiento del Lenguaje Natural. Congreso (29. 2013. Madrid)

Type: Conference paper

Abstract

In this paper, we present the Twitter normalization system developed by the SINAI group. The system applies a series of conversions to the text using translation lexicons and a spell checker. It obtains a poor result, an accuracy of only 37.6%. Our analysis of the results indicates that the system should be improved in areas such as the treatment of diminutives and superlatives, entities, and abbreviations.
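
The abstract gives no implementation details, so the following is only a minimal sketch of what a lexicon-plus-spell-checker pipeline of this kind might look like. It is not the SINAI system. The lexicon entries, the vocabulary, the similarity cutoff, and the function names are all hypothetical, and the standard-library `difflib.get_close_matches` stands in for a real spell checker.

```python
from difflib import get_close_matches

# Hypothetical translation lexicon: informal Twitter forms -> standard forms.
LEXICON = {"q": "que", "xq": "porque", "tb": "también", "d": "de"}

# Hypothetical vocabulary standing in for a spell checker's dictionary.
VOCABULARY = ["que", "porque", "también", "de", "casa", "hola", "bueno", "gracias"]


def normalize_token(token: str) -> str:
    """Normalize one token: lexicon lookup first, then a spelling fallback."""
    lower = token.lower()
    # Step 1: direct conversion through the translation lexicon.
    if lower in LEXICON:
        return LEXICON[lower]
    # Step 2: words already in the vocabulary need no correction.
    if lower in VOCABULARY:
        return lower
    # Step 3: approximate spell check (assumed cutoff of 0.8).
    matches = get_close_matches(lower, VOCABULARY, n=1, cutoff=0.8)
    # Unknown tokens (for example, entities) are left untouched.
    return matches[0] if matches else token


def normalize(text: str) -> str:
    """Normalize a whitespace-tokenized text token by token."""
    return " ".join(normalize_token(t) for t in text.split())


print(normalize("xq tb grasias Madrid"))
```

In this sketch, tokens that match neither the lexicon nor the vocabulary pass through unchanged, which is one simple way to avoid corrupting entities; diminutives, superlatives, and abbreviations missing from the lexicon would need dedicated handling.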