What is FAIR?

The FAIR universe is populated by concepts and acronyms that you may not be familiar with. You will learn about all of them, as you make your way through this website. On this page, you can read about three fundamental concepts: the FAIR principles, FAIR data, and FAIRification practices.  

We presuppose that you have some knowledge of research data management practices. If you are completely new to the field, we recommend that you start by taking our e-Learning course on research data management. 

image description

The FAIR principles

image description

1) Both humans and machines are intended as digesters of data.

This will lead to the creation of an ecosystem that is fast to respond to change and automatically adapts to new findings or changes: the Internet of FAIR Data and Services. This is the reason for focusing on standards for data, identification mechanisms, data availability, etc.

 

image description

2) The FAIR principles apply to both data and metadata.

Where metadata are descriptions of or records about data. This is why the term “(meta)data” is stated in the principles.

 

image description

3) The principles are not necessarily about open data.

You can work in a FAIR manner with data that is not intended for public availability.

 

image description

4) The FAIR principles are not rules or standards.

The FAIR principles must not be mistaken for rules or standards that you can use to evaluate tools, data, policies, etc. This would soon make the principles out-of-date and inapplicable across research disciplines. Adopting the FAIR principles will often be a gradual adaptation of work routines – but it could also be a huge leap, where you replace one type of infrastructure with another. It will be up to the different research areas and research communities to make the FAIR principles work in their respective contexts.

This text is adapted from the introduction of A FAIRy tale.

FAIR data

image description

Findable

means that the data can be discovered by both humans and machines, for instance by exposing meaningful machine-actionable metadata and keywords to search engines and research data catalogues. The data are referenced with unique and persistent identifiers (e.g. DOIs or Handles) and the metadata include the identifier of the data they describe.

image description

Accessible

means that the data are archived in long-term storage and can be made available using standard technical procedures. This does not mean that the data have to be openly available for everyone, but information on how the data could be retrieved (or not) has to be available. For example, data can be marked “Access only with explicit permission from the author” and include the author’s contact details. Ideally, though, the information about data accessibility can also be read by machines, e.g. by way of machine-readable standard licences.

image description

Interoperable

means that the data can be exchanged and used across different applications and systems — also in the future, for example, by using open file formats. It also means that the data can be integrated with other data from the same research field or data from other research fields. This is made possible by using metadata standards, standard ontologies, and controlled vocabularies as well as meaningful links between the data and related digital research objects.

image description

Reusable

means that the data are well documented and curated and provide rich information about the context of data creation. The data should conform to community standards and include clear terms and conditions on how the data may be accessed and reused, preferably by applying machine-readable standard licences. This allows others either to assess and validate the results of the original study, thus ensuring data reproducibility, or to design new projects based on the original results, in other words data reuse in the stricter sense. Reusable data encourage collaboration and avoid duplication of effort. 

 

Want to read more?

Check out the explanation of the FAIR principles by the GO FAIR initiative.

FAIRification practices

How you apply the FAIR principles, depends on your specific discipline and your way of doing research. But there are different activities you must consider within your research workflows, if you want to make your data FAIR. For instance: documenting your data, choosing appropriate file formats, adding metadata, giving access to the data, licensing the data or adding a persistent identifier. On this website, we call these activities FAIRification practices.

To learn more about the six FAIRification practices, have a look at our e-Learning module 2 on FAIR data

image description

Documentation

Documentation adds rich context to your data and makes the data easier to understand and reuse in the future. 

image description

File formats

File formats determine how data can be used. It is important to decide what file formats to use for data collection, data processing, data archiving, and long-term preservation. File formats are important to consider, when you want to combine datasets or make data readable by machines.

image description

Metadata

Metadata are data about data. Research data need metadata to become findable, accessible, interoperable and reusable - by humans and machines.  

image description

Access to data

Access to data means that you determine who you make your data available for, how you provide access, and under which conditions.

image description

Persistent identifiers

To make your data accessible and easy to find, you must provide your data and metadata with a persistent identifier (PID). A PID is a long-lasting reference to a digital resource and provides the information required to reliably identify, verify and locate your research data.

image description

Data licences

A data licence is a legal arrangement between the data creator and the data user that specifies what users can do with the data. It is one of the most effective ways to communicate permissions to potential data users.

Research Data Management e-Learning course

Within the framework of the Danish National Forum for Data Management, the Danish Universities have developed the following video e-Learning course on the importance of good Research Data Management.

The course below is divided into three independent modules. Each module consists of a 20 min video recording (including animations and interviews) that can be taken at a students' own pace and can be complemented with face-to-face training. The target group for the course is PhD students, researchers and research support staff who would like an introduction to the general concepts and terms used in research data management. However, others are also welcome to take the course.
It is open to all.

Holmstrand, K.F., den Boer, S.P.A., Vlachos, E., Martínez-Lavanchy, P.M., Hansen, K.K. (Eds.) (2019). Research Data Management (e-Learning course). doi:10.11581/dtu:00000047

Module 1: Introduction

Learning Objective: At the end of this module, the student can…

  1. understand the importance of managing research data
  2. understand how the most important players in research data management (RDM) can influence research
  3. identify different types of research data within different disciplines
  4. understand what RDM entails when looking at Research Data Lifecycle

Module 2: FAIR principles

Learning Objective: At the end of this module, the student can…

  1. identify key elements that help make research data discoverable, accessible, interoperable and reusable
  2. understand how these key elements are used in different research disciplines and different research workflows
  3. distinguish between FAIR data and open data

Module 3: Data Management Plans

Learning Objective: At the end of this module, the student can…

  1. understand the added value of making data management plans in research projects
  2. identify challenges in projects related to research data management
  3. identify stakeholders who require a DMP and know how to start making a DMP, including what topics to cover