Comparative analyses of the information content of letters, characters, and inter-word spaces across writing systems

Linjieqiong Huang, Erik D. Reichle, Xingshan Li*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

One difference among writing systems is how orthographic cues are used to demarcate words; although most alphabetic scripts use inter-word spaces, some Asian scripts do not explicitly mark word boundaries (e.g., Chinese). It is unclear whether these differences are arbitrary or whether they are designed to maximize reading efficiency. Here, we show that spaces inserted between words in non-demarcated scripts provide less information about word boundaries than spaces in demarcated scripts. Furthermore, despite the fact that less information is contained by inter-word spaces than characters/letters of the same size, the information content of inter-word spaces in demarcated scripts is closer to that of characters/letters compared to the information content of inter-word spaces that are inserted in non-demarcated scripts. These results suggest that the conventions used to demarcate word boundaries are sufficient to support efficient reading. Our findings provide new insights into the universals and variation across writing systems and shed light on the mental processes that support skilled reading.

Original languageEnglish
Pages (from-to)129-139
Number of pages11
JournalAnnals of the New York Academy of Sciences
Volume1537
Issue number1
DOIs
Publication statusPublished - Jul 2024

Keywords

  • information theory
  • reading
  • scripts
  • writing systems

Cite this