Linguistic laws in speech: the case of Catalan and Spanish
- González Torre, Ivan 1
- Hernández-Fernández, Antoni 2
- Garrido, Juan-María 3
- Lacasa, Lucas 4
-
1
Universidad Politécnica de Madrid
info
-
2
Universitat Politècnica de Catalunya
info
-
3
Universidad Nacional de Educación a Distancia
info
-
4
Queen Mary University of London
info
Editor: Dryad
Año de publicación: 2019
Tipo: Dataset
Resumen
In this work we explore in an oral corpus of Catalan and Spanish (Glissando Corpus) four classical linguistic laws (Zipf's law, Herdan's law, Brevity law, and Menzerath-Altmann's law) in oral communication, both in physical units and in symbolic units measured in speech transcriptions, and we also reviewed two more laws recently reformulated: lognormality law and size-rank law. Our results reinforce with empirical evidence in two more languages the 'physical hypothesis' according to which linguistic laws could be explained by physical laws and the principles of information theory. In this sense, linguistic laws would have an oral origin and the evidences recovered in written texts would be a byproduct of the complexity that takes place in speech.