Polysemy and brevity versus frequency in language

Other authors

Universitat Politècnica de Catalunya. Departament de Ciències de la Computació

Universitat Politècnica de Catalunya. Institut de Ciències de l'Educació

Universitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge

Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural

Publication date

2019-11-01

Abstract

The pioneering research of G. K. Zipf on the relationship between word frequency and other word features led to the formulation of various linguistic laws. The most popular is Zipf’s law for word frequencies. Here we focus on two laws that have been studied less intensively: the meaning-frequency law, i.e. the tendency of more frequent words to be more polysemous, and the law of abbreviation, i.e. the tendency of more frequent words to be shorter. In a previous work, we tested the robustness of these Zipfian laws for English, roughly measuring word length in number of characters and distinguishing adult from child speech. In the present article, we extend our study to other languages (Dutch and Spanish) and introduce two additional measures of length: syllabic length and phonemic length. Our correlation analysis indicates that both the meaning-frequency law and the law of abbreviation hold overall in all the analyzed languages.


Peer Reviewed


Postprint (author's final draft)

Document Type

Article

Language

English

Publisher

Elsevier

Related items

https://www.sciencedirect.com/science/article/pii/S0885230817300414

info:eu-repo/grantAgreement/MINECO/TIN2016-77820-C3-3-R

info:eu-repo/grantAgreement/MINECO//TIN2014-57226-P/ES/APRENDIZAJE COMPUTACIONAL Y COMUNICACION/

info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-89244-R/ES/GESTION Y ANALISIS DE DATOS COMPLEJOS/

Recommended citation

This citation was generated automatically.

Rights

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

Open Access

Attribution-NonCommercial-NoDerivs 3.0 Spain

This item appears in the following Collection(s)

E-prints [72986]