Skip to yearly menu bar Skip to main content


In-Person Poster presentation / poster accept

On the Word Boundaries of Emergent Languages Based on Harris's Articulation Scheme

Ryo Ueda · Taiga Ishii · Yusuke Miyao

MH1-2-3-4 #88

Keywords: [ Machine Learning for Sciences ] [ emergent communication ] [ Harris's Articulation Scheme ] [ Unsupervised Word Segmentation ] [ compositionality ] [ emergent language ]


Abstract:

This paper shows that emergent languages in signaling games lack meaningful word boundaries in terms of Harris's Articulation Scheme (HAS), a universal property of natural language. Emergent Languages are artificial communication protocols arising among agents. However, it is not obvious whether such a simulated language would have the same properties as natural language. In this paper, we test if they satisfy HAS. HAS states that word boundaries can be obtained solely from phonemes in natural language. We adopt HAS-based word segmentation and verify whether emergent languages have meaningful word segments. The experiment suggested they do not have, although they meet some preconditions for HAS. We discovered a gap between emergent and natural languages to be bridged, indicating that the standard signaling game satisfies prerequisites but is still missing some necessary ingredients.

Chat is not available.