What's New

corpus
corpus
Author(s):
Description:
Cleaned Polish Oscar corpus (part: 128M above lines, 1.93 GB). Data was prepared with a few cleaning heuristics: - remove sentences shorter than - remove non-polish sentences - remove ungrammatical sentences ...
 This item contains no files.
corpus
corpus
Author(s):
Description:
Cleaned Polish Oscar corpus (part: 128M lines, 3.53 GB). Data was prepared with a few cleaning heuristics: - remove sentences shorter than - remove non-polish sentences - remove ungrammatical sentences ...
 This item contains no files.
corpus
corpus
Author(s):
Description:
Cleaned Polish Oscar corpus (part: 96M lines, 3.49 GB). Data was prepared with a few cleaning heuristics: - remove sentences shorter than - remove non-polish sentences - remove ungrammatical sentences ...
 This item contains no files.