Corpus of 1000 short stories

Name

Corpus of 1000 short stories

Description

A corpus of 1000 short stories is available under an open licence, created in order to establish a comparative collection for tools for automatic classification of texts. The corpus contains texts in Polish written by Polish authors. Each text is placed in a separate txt file.

Bibliographic address of the main publication (in case of using Chronocorpus, please cite this publication):

If used in a publication, please cite the following source: Eder, Maciej; Rybicki, Jan; Młynarczyk, Ksenia; et al., 2016, 1000 Novels Corpus, CLARIN-PL digital repository, http://hdl.handle.net/11321/312.

Link to the manual

Examples of applications

http://ws.clarin-pl.eu/websty.shtml – a sample collection of texts from the Corpus of 1000 short stories