dc.contributor.author Kocoń, Jan
dc.date.accessioned 2018-09-28T12:37:54Z
dc.date.available 2018-09-28T12:37:54Z
dc.date.issued 2018-09-28
dc.identifier.uri http://hdl.handle.net/11321/606
dc.description Distributional language models for Polish (word embeddings, in both textual and binary form) trained on the KGR10 corpus (over 4 billion words) using FastText. The models are provided in all possible combinations of the following variants:
- dimension: 100, 300
- method: skipgram, cbow
- tool: FastText, Magnitude
- source text: plain, plain.lower, plain.lemma, plain.lemma.lower
The link below leads to the NextCloud directory with all variants of the embeddings. If you use them, please cite the following article:
@article{kocon2018embeddings,
  author = {Koco\'{n}, Jan and Gawor, Micha{\l}},
  title = {Evaluating {KGR10} {P}olish word embeddings in the recognition of temporal expressions using {BiLSTM-CRF}},
  journal = {Schedae Informaticae},
  volume = {27},
  year = {2018},
  url = {http://www.ejournals.eu/Schedae-Informaticae/2018/Volume-27/art/13931/},
  doi = {10.4467/20838476SI.18.008.10413}
}
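The variant options named in the description can be enumerated programmatically to see how many models "all possible combinations" amounts to. The following is a minimal sketch: the option values are copied from the description, while the constant names, the function name `enumerate_variants`, and the dictionary keys are hypothetical and chosen only for illustration. It does not reflect the actual file or directory naming used in the NextCloud share.

```python
from itertools import product

# Option values copied from the record description.
# The constant names below are illustrative assumptions.
DIMENSIONS = (100, 300)
METHODS = ("skipgram", "cbow")
TOOLS = ("FastText", "Magnitude")
SOURCE_TEXTS = ("plain", "plain.lower", "plain.lemma", "plain.lemma.lower")


def enumerate_variants():
    """Return every combination of the four options as a list of dicts."""
    return [
        {"dimension": d, "method": m, "tool": t, "source_text": s}
        for d, m, t, s in product(DIMENSIONS, METHODS, TOOLS, SOURCE_TEXTS)
    ]


variants = enumerate_variants()
print(len(variants))  # 2 * 2 * 2 * 4 = 32
```

Under these assumptions the description corresponds to 32 distinct embedding models.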
dc.language.iso pol
dc.publisher Wroclaw University of Science and Technology
dc.rights GNU GPL3
dc.rights.uri http://www.gnu.org/licenses/gpl-3.0.en.html
dc.rights.label PUB
dc.subject Polish
dc.subject embeddings
dc.subject word embeddings
dc.subject KGR10
dc.subject Fasttext
dc.subject skipgram
dc.subject cbow
dc.title KGR10 FastText Polish word embeddings
dc.type languageDescription
metashare.ResourceInfo#ContentInfo.detailedType other
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding CLARIN-PL
contact.person Jan Kocoń jan.kocon@pwr.edu.pl Wroclaw University of Science and Technology
sponsor Ministry of Science and Higher Education (Poland) N/A CLARIN-PL nationalFunds
files.size 196
files.count 1


 Files in this item

This item is Publicly Available and licensed under: GNU GPL3

Name: fast_text_kgr_em.zip
Size: 196 bytes
Format: application/zip