Inferring Simplest Laws/Patterns: MINTYP and the Problem of Describing a Typology
Machine-Aided Linguistic Discovery - An Introduction and Some Examples - Vladimir Pericliev
Vladimir Pericliev [+ ]
Bulgarian Academy of Sciences
View Website
Vladimir Pericliev is Senior Researcher at the Institute of Mathematics & Informatics, Bulgarian Academy of Sciences, with over 60 publications in general and computational linguistics, Artificial Intelligence and philosophy of science.
Description
Given a dataset, a common problem in scientific knowledge discovery is to summarize this set by a collection of rules (laws, patterns, etc.) such that the resultant description is the simplest, or most economic. In linguistics, the problem occurs e.g. in attempts to describe a linguistic typology in terms of the smallest set of implicational universals that allow all actually attested, and none of the unattested, language types. In this chapter, I introduce the MINTYP (Minimum TYPological description) program, which handles this problem, illustrating it on the typologies in Greenberg’s Appendix II (Greenberg 1966a) and Hawkins’ Expanded Sample (Hawkins 1983).