CLARIN-PL repository About and Policies



CLARIN Mission Statement


The ultimate objective of CLARIN ERIC is to advance research in humanities and social sciences by giving researchers unified single sign-on access to a platform which integrates language-based resources and advanced tools at a European level. This shall be implemented by the construction and operation of a shared distributed infrastructure that aims at making language resources, technology and expertise available to the humanities and social sciences (henceforth abbreviated HSS) research communities at large.

To learn more about CLARIN ERIC, see CLARIN-ShortGuide.pdf.


CLARIN-PL Mission Statement


The Repository “CLARIN-PL Centre” gives access to language resources and tools from the Wroclaw University of Technology, Department of Computational Intelligence and other CLARIN-PL Members.
The CLARIN-PL Centre stores the resources and tools according to the mission:
  • To promote knowledge and use of the Polish language by conducting applied scientific research.
  • To stimulate and coordinate the scientific description of the Polish vocabulary and grammar in all its aspects through the ages.
  • To produce, link and give access to source material for Polish in the form of historical and current corpora, dictionaries, lexical digital databases, grammars and the accompanying tools.
Please check the provided link to see what we are currently working on.

Terms of Service

To achieve our mission, we set out some ground rules in the Terms of Service. By accessing or using any data or services provided by the Repository, you agree to abide by the Terms contained in that document.

Data in the CLARIN-PL repository are made available under the licence attached to each resource. Where no licence is attached, data are freely available for access, printing and download for the purposes of non-commercial research or private study. In any publication, users must acknowledge the Deposited Work using its persistent identifier (see Citing Data), its original author(s)/creator(s), and any publisher where applicable. Full items must not be harvested by robots except transiently for full-text indexing or citation analysis. Full items must not be sold commercially without formal permission of the copyright holders, unless this is explicitly granted by the attached licence.


About Repository

It is like a library for linguistic data and tools.

  • Search for data and tools and easily download them.
  • Deposit your data and be sure that it is safely stored and that everyone can find it, use it, and cite it correctly (giving you credit).

License Agreement and Contracts

At the moment, CLARIN-PL distinguishes three types of contracts.

  • For every deposit, we enter into a standard contract with the submitter, the so-called "Distribution License Agreement", in which we describe our rights and duties, and in which the submitter acknowledges that they have the right to submit the data and gives us (the repository centre) the right to distribute the data on their behalf.
  • Everyone who downloads data is bound by the licence assigned to the item. To download protected data, a user must be authenticated and must electronically sign the licence. A list of available licences in our repository can be found here.
  • Submitters can assign custom licences to items during the submission workflow.

Intellectual Property Rights

As mentioned in the section License Agreement and Contracts, we require the depositor of data or tools to sign a Distribution License Agreement, which specifies that they have the right to submit the data and gives us (the repository centre) the right to distribute the data on their behalf. This means that depositors are solely responsible for resolving any IPR issues before publishing data or tools by submitting them to us.
Should anyone have a suspicion that any of the datasets or tools in our repository violate Intellectual Property Rights, they should contact us immediately at our help desk.


Privacy Policy

Read our Privacy Policy in order to learn how we manage personal data collected by the CLARIN-PL repository and services.


Metadata Policy

Deposited content must be accompanied by sufficient metadata describing its content, provenance and formats in order to support its preservation and dissemination. Metadata are freely accessible and are distributed in the public domain (under CC0). However, we reserve the right to be informed about any commercial use of metadata from the CLARIN-PL repository; please send a description of your use case to the Help Desk.


Preservation Policy

CLARIN-PL is committed to the long-term care of items deposited in our repository and strives to adopt the current best practice in digital preservation.

To ensure the integrity of data sets, an MD5 checksum is computed for every deposited file, which allows us to detect defects in the data over the years. Once deposited, files in data sets are never changed, and only minor changes to the metadata are allowed, for example correcting spelling, making minor changes to documentation, or adding further documentation. Changes to the data themselves are issued as a new version of the dataset, which receives a new persistent identifier. Such changes are made only in close collaboration with the producer of the dataset.
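
As an illustration only, a checksum-based integrity check of this kind could be sketched in Python as follows. The function names md5_checksum and is_intact are hypothetical and chosen for this example; they do not describe the repository's actual implementation. The sketch assumes that the checksum recorded at deposit time is available as a hexadecimal string.

```python
import hashlib


def md5_checksum(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of the file at `path`.

    The file is read in chunks so that large files do not have to
    fit into memory at once.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_intact(path, recorded_checksum):
    """Return True if the file still matches the checksum recorded at deposit time."""
    return md5_checksum(path) == recorded_checksum.strip().lower()
```

A depositor or user could apply the same idea to a downloaded file by comparing the result of md5_checksum with the checksum published alongside the item, where one is provided.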


Authenticity Policy

Data producers hand over the materials to us. We do not change the data, except by adding metadata if required. If applicable, we create collection-level objects which provide a context for the embedded data sets. The repository maintains links to other relevant materials (e.g. articles, theses, documentation, related data) and to software and tools that have been used in the production of the data, if applicable. The identity of a depositor is verified through the required login via the CLARIN Service Provider Federation (SPF).


Citing Data Policy

Data Users must acknowledge and cite data sources properly in all publications and outputs.