The Electronic Corpus of 17th- and 18th-century Polish Texts
Włodzimierz Gruszczyński, Dorota Adamiec, Renata Bronikowska, Witold Kieraś, Emanuel Modrzejewski, Aleksandra Wieczorek, Marcin Woliński
2021 · Open Access · DOI: https://doi.org/10.1007/s10579-021-09549-1 · OA: W3200575729
The paper describes the process of building the Electronic Corpus of 17th- and 18th-century Polish Texts, a relatively large, balanced, structurally and morphologically annotated resource for Middle Polish, searchable at https://www.korba.edu.pl. The corpus consists of samples extracted from over seven hundred texts written and published between 1601 and 1772, totaling 13.5 million tokens, which makes it one of the largest historical corpora for a Slavic language.