Sentiment analysis in Arabicopinion polarity detection

  1. Rushdi Saleh, Mohammed
Dirigida por:
  1. María Teresa Martín Valdivia Directora
  2. Luis Alfonso Ureña López Director

Universidad de defensa: Universidad de Jaén

Fecha de defensa: 07 de octubre de 2013

Tribunal:
  1. Ruslan Mitkov Presidente/a
  2. José Manuel Perea Ortega Secretario/a
  3. José Antonio Troyano Jiménez Vocal
Departamento:
  1. INFORMÁTICA

Tipo: Tesis

Teseo: 363945 DIALNET lock_openRUJA editor

Resumen

Sentiment analysis is becoming increasingly important due the growing popularity of Web 2.0. This study focuses mainly on how to analyze opinions in Arabic language and predict their polarity. To achieve that, two corpora have been generated (OCA and EVOCA), OCA is an opinion corpus for Arabic movie reviews, while EVOCA is the translated version of OCA to English. Another corpus was created (SINAI-SA corpus) used with other corpora in order to predict sentiments in different domains. SINAI corpus was also used to study how to sort comments behave as textual information for the prediction of customer rates. Another question that was solved in this study is “How to treat with the neutral reviews”. Two main approaches have been investigated in this research, one based on semantic orientation and the other one based on machine learning algorithms like SVM or NB