| dc.contributor.author | Wołk, Krzysztof |
| dc.date.accessioned | 2020-09-28T18:43:41Z |
| dc.date.available | 2020-09-28T18:43:41Z |
| dc.date.issued | 2020-09-28 |
| dc.identifier.uri | http://hdl.handle.net/11321/777 |
| dc.description | Big data language model based on subword units, based on byte pair encoding in RAW format |
| dc.language.iso | pol |
| dc.publisher | Polish-Japanese Academy of Information Technology |
| dc.source.uri | https://drive.google.com/file/d/1Sj6v9pZMQhpbz22Gww4mcXWeNJ6M3qWg/view?usp=sharing |
| dc.subject | language model |
| dc.subject | Polish |
| dc.subject | monolingual |
| dc.title | Big Data language model - subword - BPE - RAW |
| dc.type | corpus |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| has.files | no |
| branding | CLARIN-PL |
| contact.person | Krzysztof Wołk kwolk@pja.edu.pl Polish-Japanese Academy of Information Technology |
| size.info | 39 gb |
| files.size | 0 |
| files.count | 0 |