Author: Tobias Scheer.
Exhaustive list of words with initial Sonorant-Obstruent clusters in 13 Slavic languages.
The introductory part of the data base provides detailed explanation regarding the goal and scope of the corpus, how it was constructed, how it is organised, what you can do in order to improve it and where it has been used.
Goal and scope
Since 2000, I have attempted to establish a data base which provides the exhaustive record of words that begin with a sonorant-obstruent cluster in 13 Slavic languages : Czech, Slovak, Polish, Upper Sorbian, Lower Sorbian, Kashubian (West), Bulgarian, Macedonian, Bosno-Serbo-Croatian, Slovenian (South), Russian, Ukrainian, Bielorussian (Est).
The purpose is to dispose of a solid empirical basis that documents a typical typological feature for which Slavic languages are famous : the violation of sonority sequencing at the beginning of the word. While it is well known that Slavic languages have non-orthodox initial clusters, there does not seem to be a comparative record that would tell us precisely which languages have these clusters, how many of them are encountered, exactly which clusters occur and which do not. For the time being, the corpus does not include #RR and #TT clusters (R is shorthand for sonorants, T for obstruents). In a first step, I have privileged the clusters that are most offending for sonority sequencing, and this has turned out to be enough work. It is to be hoped that at some point the list of Slavic #RR- and #TT-words will also be available.