Textual links of words in mid-19th century Russian prose
https://doi.org/10.28995/2686-7249-2018-12-10-31
Abstract
A corpus of Russian prose (14 million running words) is divided into fragments of equal size (40 words each). When actual co-occurrence of two words exceeds signifcantly the expectation, calculated on the basis of the null hypothesis, the two words are said to be linked together. A huge network of textual links of words is constituted as a result of the analysis.
About the Author
A. Ya. ShaikevichRussian Federation
Anatolii Ya. Shaikevich
bld. 18/2, Volkhonka st., Moscow, 119019
References
1. Shaikevich AYa. Andrjuscenko VM., Rebetskaya NA. Distributional statistical analysis of the language of Russian prose 1850–1870. In 2 vols. Moscow: YaSK Publ.; 2013. (In Russ.)
2. Shaikevich AYa. Network of semantic textual links in the poetry of Pushkin and Mickiewicz // Slavic Studies. XIII Congress of Slavic Studies. Reports of the Russian delegation (Ljubljana, 2003). Moscow: Indrik Publ.; 2003. p. 576-88. (In Russ.)
Review
For citations:
Shaikevich A.Ya. Textual links of words in mid-19th century Russian prose. RSUH/RGGU Bulletin: “Literary Teory. Linguistics. Cultural Studies”, Series. 2018;(12):10-31. (In Russ.) https://doi.org/10.28995/2686-7249-2018-12-10-31