dc.contributor.author |
Wołk, Krzysztof |
dc.date.accessioned |
2020-09-28T18:43:41Z |
dc.date.available |
2020-09-28T18:43:41Z |
dc.date.issued |
2020-09-28 |
dc.identifier.uri |
http://hdl.handle.net/11321/777 |
dc.description |
Big data language model built on subword units obtained with byte pair encoding (BPE), in RAW format |
dc.language.iso |
pol |
dc.publisher |
Polish-Japanese Academy of Information Technology |
dc.source.uri |
https://drive.google.com/file/d/1Sj6v9pZMQhpbz22Gww4mcXWeNJ6M3qWg/view?usp=sharing |
dc.subject |
language model |
dc.subject |
Polish |
dc.subject |
monolingual |
dc.title |
Big Data language model - subword - BPE - RAW |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
has.files |
no |
branding |
CLARIN-PL |
contact.person |
Krzysztof Wołk, kwolk@pja.edu.pl, Polish-Japanese Academy of Information Technology |
size.info |
39 GB |
files.size |
0 |
files.count |
0 |