Persistent identifiers

Persistent identifiers

To make your data easy to find and accessible, you must provide your data and metadata with a persistent identifier. A persistent identifier is a long-lasting reference to a digital resource and provides the information required to reliably identify, verify and locate your research data. 

What is a persistent identifier?

A persistent identifier (PID) is a long-lasting reference to a digital resource and provides the information required to reliably identify, verify and locate your research data eliminating many misunderstandings. A PID may also be connected to a set of metadata which describes a digital resource. 

Notable persistent identifiers are the Digital Object Identifier (DOI) and the Handle System which can both be assigned to data to identify them uniquely. The DOI system uses the Handle System, which is the best infrastructure component available today for managing digital objects.  While DOIs are mainly assigned to resources ready for public dissemination, Handles are in general used to persistently identify other categories of digital resources (e.g. those created in the labs) to make them referable by software, workflows etc.

Example 1

This is what a DOI and a Handle look like at the EUDAT Collaborative Data Infrastructure, one of the largest infrastructures of integrated data services and resources supporting research in Europe.

Example 2

See how the Language Technology Group ensures their data receive a persistent identifier.

Which data need a PID?

image description

Ensure reproducibility

To ensure reproducibility of your research data, assign persistent identifiers to your processed data

image description

Ensure reuse

To ensure your research data can be reused you can assign persistent identifiers to the raw data, and not as the last task before they get archived but immediately after data collection!