Polish-Ukrainian Corpus

Name

Polish-Ukrainian Parallel Corpus “2”: a bilingual (Polish-Ukrainian) parallel corpus of contemporary texts.

Description

The Polish-Ukrainian Corpus is a continuously developed bilingual resource. It contains manually aligned contemporary Polish and Ukrainian texts with a total volume of more than 1.2 million words. Fiction, colloquial style, and specialized language predominate in the Corpus; colloquial speech is represented by film dialogues. Most of the texts are translations from third languages. Work on fully annotating the resources is currently being completed, including lemmatization and morphosyntactic annotation of all word forms. A large translation resource representing press language and fiction, among others, will be added soon. The selection of corpus resources was guided by the principle of ensuring high representation of various lexemes and terms.

Bibliographic address of the main publication (in case of using the Polish-Ukrainian Corpus, please cite this publication)

Access

Corpus page with a downloadable version: https://clarin-pl.eu/dspace/handle/11321/535
Corpus search engine: https://kontext.clarin-pl.eu/run.cgi/first_form#

Examples of application

Sosnowski, Wojciech, Blagoeva, Diana and Tymoshuk, Roman (2018). New Bulgarian, Polish, Ukrainian Phraseology and language corpora. Cognitive Studies | Études cognitives, 18, 1–13. DOI: https://doi.org/10.11649/cs.1768

Sosnowski, W., Blagoeva, D., & Tymoshuk, R. (2018). New Bulgarian, Polish, and Ukrainian phraseology and language corpora. Cognitive Studies/Études cognitives, 2018(18), 1–13. DOI: https://doi.org/10.11649/cs.1768

Sosnowski, W., & Tymoshuk, R. (2017). Konfrontacja językowa polskich i ukraińskich jednostek frazeologicznych na przykładzie materiału z leksykonu aktywnej frazeologii polskiej i ukraińskiej. In D. Blagoeva, L. Andreĭchin (Eds.), Bilgarsko-polski studii (pp. 91–108). Bŭlgarska akademiya na naukite institut za bŭlgarski ezik.

Sosnowski, W., & Tymoshuk, R. (2017). Novi pidkhody do stvorennya suchasnykh frazeolohichnykh slovnykiv (na materiali «Leksykona pol’s’koyi ta ukrayins’koyi aktyvnoyi frazeolohiyi»). Movoznavstvo, 2, 69–77.

Sosnowski, W., & Tymoshuk, R. (2017). On „The dictionary of active Polish and Ukrainian phraseology”. Contrastive linguistics and culture. Cognitive Studies/ Études cognitives, 2017(17). DOI: https://doi.org/10.11649/cs.1317

Sosnowski, W., & Tymoshuk, R. (2017). O rabote nad „Leksikonom pol’skoj i ukrainskoj aktivnoj frazeologii”. In L. Janovec, R. K. Brabcová, V. Skibina, Z. Wildová (Eds.), Svět v obrazech a ve frazeologii / World in Pictures and in Phraseology (pp. 269–276). Univerzita Karlova, Pedagogická fakulta. ISBN 978-80-7290-964-3.

Jaskot, M., & Sosnowski, W. (2017). O fałszywych przyjaciołach tłumacza na przykładzie Leksykonu aktywnej frazeologii polskiej i ukraińskiej. In Barbara Borkowska-Kępska, Grzegorz Gwóźdź (Eds.), LSP Perspectives 2. Języki specjalistyczne – nowe perspektywy 2 (pp. 55–62). Dąbrowa Górnicza: Wyższa Szkoła Biznesu w Dąbrowie Górniczej. ISBN 978-83-65621-30-6.

Jaskot, M., Ganoszenko, Ju., Sosnowski, W., & Tymoshuk, R. (2017). Leksykon aktywnej frazeologii polskiej i ukraińskiej. Warszawa: KJV Digital. ISBN 978-83-946640-2-2.
