dc.contributor.author | Kocoń, Jan |
dc.date.accessioned | 2018-09-28T12:37:54Z |
dc.date.available | 2018-09-28T12:37:54Z |
dc.date.issued | 2018-09-28 |
dc.identifier.uri | http://hdl.handle.net/11321/606 |
dc.description | Distributional language model (both textual and binary) for Polish (word embeddings) trained on KGR10 corpus (over 4 billion of words) using Fasttext with the following variants (all possible combinations): - dimension: 100, 300 - method: skipgram, cbow - tool: FastText, Magnitude - source text: plain, plain.lower, plain.lemma, plain.lemma.lower The link below leads to the NextCloud directory with all variants of embeddings. If you use it, please cite the following article: @article{kocon2018embeddings, author = {Koco\'{n}, Jan and Gawor, Micha{\l}}, title = {Evaluating {KGR10} {P}olish word embeddings in the recognition of temporal expressions using {BiLSTM-CRF}}, journal = {Schedae Informaticae}, volume = {27}, year = {2018}, url = {http://www.ejournals.eu/Schedae-Informaticae/2018/Volume-27/art/13931/}, doi = {10.4467/20838476SI.18.008.10413} } |
dc.language.iso | pol |
dc.publisher | Wroclaw University of Science and Technology |
dc.rights | GNU GPL3 |
dc.rights.uri | http://www.gnu.org/licenses/gpl-3.0.en.html |
dc.rights.label | PUB |
dc.subject | Polish |
dc.subject | embeddings |
dc.subject | word embeddings |
dc.subject | KGR10 |
dc.subject | Fasttext |
dc.subject | skipgram |
dc.subject | cbow |
dc.title | KGR10 FastText Polish word embeddings |
dc.type | languageDescription |
metashare.ResourceInfo#ContentInfo.detailedType | other |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | CLARIN-PL |
contact.person | Jan Kocoń jan.kocon@pwr.edu.pl Wroclaw University of Science and Technology |
sponsor | Ministry of Science and Higher Education (Poland) N/A CLARIN-PL nationalFunds |
files.size | 196 |
files.count | 1 |