Algorithmic typology and going from known to similar unknown categories within and across languages
2014 (English)In: Algorithmic typology and going from known to similar unknowncategories within and across languages: Linguistic Variation in Text and Speech / [ed] Benedikt Smrecsanyi & Bernhard Wälchli, Berlin: Walter de Gruyter, 2014, 355-393 p.Chapter in book (Refereed)
This paper introduces three algorithms for the extraction of lexical and grammatical markers in parallel texts. The starting point for all of them is that trigger distributions are used as semantic cues. Automatic processing chains apply the same procedures (so-called “procedural universals”) to directly comparable texts of all languages. The domain-internal distribution of markers is usually highly diverse cross-linguistically due to polymorphy (there are many markers instantiating the same domain, but which also expressother meanings at the same time). Polymorphy structures a domain into subdomains in cross-linguistically different ways, and this structure canbe used for the aggregation of markers into cross-linguistically recurrent marker types and for assessing the domain-specific similarity relationships between languages.
Place, publisher, year, edition, pages
Berlin: Walter de Gruyter, 2014. 355-393 p.
, Linguae & Litterae, ISSN 1869-7054 ; 28
General Language Studies and Linguistics
IdentifiersURN: urn:nbn:se:su:diva-100261DOI: 10.1515/9783110317558.355ISI: 000337522000013ISBN: 978-3-11-031739-8ISBN: 978-3-11-031755-8OAI: oai:DiVA.org:su-100261DiVA: diva2:692357