Toward a Quantitative Semiotics? (article)


This paper demonstrates that the lack of quantitative, data-based research on nonlinguistic symbol sets that nonetheless have internal structure is already adversely affecting linguistics itself. We survey recent attempts to distinguish written languages from nonlinguistic inscriptions using information-theoretic, physical, and usage-based metrics. We also list the nonlinguistic corpora used in current research, along with their shortcomings. Finally, we propose a possible new framework and a continuum-based concept of language-ness.

