arXiv Analytics

Sign in

arXiv:1811.03273 [cs.CL]AbstractReferencesReviewsResources

Information Flow in Pregroup Models of Natural Language

Peter M. Hines

Published 2018-11-08Version 1

This paper is about pregroup models of natural languages, and how they relate to the explicitly categorical use of pregroups in Compositional Distributional Semantics and Natural Language Processing. These categorical interpretations make certain assumptions about the nature of natural languages that, when stated formally, may be seen to impose strong restrictions on pregroup grammars for natural languages. We formalize this as a hypothesis about the form that pregroup models of natural languages must take, and demonstrate by an artificial language example that these restrictions are not imposed by the pregroup axioms themselves. We compare and contrast the artificial language examples with natural languages (using Welsh, a language where the 'noun' type cannot be taken as primitive, as an illustrative example). The hypothesis is simply that there must exist a causal connection, or information flow, between the words of a sentence in a language whose purpose is to communicate information. This is not necessarily the case with formal languages that are simply generated by a series of 'meaning-free' rules. This imposes restrictions on the types of pregroup grammars that we expect to find in natural languages; we formalize this in algebraic, categorical, and graphical terms. We take some preliminary steps in providing conditions that ensure pregroup models satisfy these conjectured properties, and discuss the more general forms this hypothesis may take.

Comments: In Proceedings CAPNS 2018, arXiv:1811.02701
Journal: EPTCS 283, 2018, pp. 13-27
Categories: cs.CL, cs.FL
Related articles: Most relevant | Search more
arXiv:1709.01634 [cs.CL] (Published 2017-09-06)
The Voynich Manuscript is Written in Natural Language: The Pahlavi Hypothesis
arXiv:1812.10549 [cs.CL] (Published 2018-12-18)
Automatic Summarization of Natural Language
arXiv:1405.2874 [cs.CL] (Published 2014-05-12, updated 2014-12-30)
A Study of Entanglement in a Categorical Framework of Natural Language