To make your data easy to find and accessible, you must provide your data and metadata with a persistent identifier. A persistent identifier is a long-lasting reference to a digital resource and provides the information required to reliably identify, verify and locate your research data.
A persistent identifier (PID) is a long-lasting reference to a digital resource and provides the information required to reliably identify, verify and locate your research data eliminating many misunderstandings. A PID may also be connected to a set of metadata which describes a digital resource.
Notable persistent identifiers are the Digital Object Identifier (DOI) and the Handle System which can both be assigned to data to identify them uniquely. The DOI system uses the Handle System, which is the best infrastructure component available today for managing digital objects. While DOIs are mainly assigned to resources ready for public dissemination, Handles are in general used to persistently identify other categories of digital resources (e.g. those created in the labs) to make them referable by software, workflows etc.
This is what a DOI and a Handle look like at the EUDAT Collaborative Data Infrastructure, one of the largest infrastructures of integrated data services and resources supporting research in Europe.
To make your data accessible and easy to find, you must provide your data and/or your metadata with a PID. This can be done by making a record in a trusted or recommended repository.
Find a repository that will provide a PID for your research data by:
Before assigning a PID, it is important to consider who has the responsibility and the rights for registering the PID and, later on, making updates if needed.
If your data and/or metadata are only stored internally at your institution or at another repository that does not issue a PID, they will not be FAIR.
To ensure reproducibility of your research data, assign persistent identifiers to your processed data.
To ensure your research data can be reused you can assign persistent identifiers to the raw data, and not as the last task before they get archived but immediately after data collection!