The FAIR Principles#

The FAIR guiding principles for scientific data management and stewardship [WDA+16] were developed as guidelines to improve the Findability, Accessibility, Interoperability and Reusability of digital assets; all of which support research reproducibility. The FAIR principles play an important role in making your data available to others for reuse.

It is much easier to make data FAIR if you plan to do this from the beginning of your research project. You can plan for this in your Data Management Plan (DMP) (see points 4 and 5 of the Data Management Plan chapter).

Even though the FAIR principles have been defined to allow machines to find and use digital objects automatically, they improve the reusability of data by humans as well. The capacity of computational systems to find, access, interoperate, and reuse data, with none or minimal human intervention, is essential in today’s data-driven era, where humans increasingly rely on computational support to deal with data as a result of the increase in volume, velocity and variety.

This chapter provides an abstract and broad view of what the FAIR principles are. How to put the FAIR principles into practise is discussed in other sub chapters ( Data Organisation in Spreadsheets, Documentation and Metadata and Sharing and Archiving Data). You can also use the Wellcome Getting Started Guide or the How To FAIR website to find out more about the FAIR principles and how to get started.

Illustration of the FAIR principles to show the definition of being Findable, Accessible, Interoperable and Reusable.

Fig. 43 The Turing Way project illustration by Scriberia. Used under a CC-BY 4.0 licence. DOI: 10.5281/zenodo.3332807.#

Theory#

In brief, FAIR data should be:

Findable: The first step in (re)using data is to find it! Descriptive metadata (information about the data such as keywords) is essential.

Accessible: Once the user finds the data and software they need to know how to access it. Data could be openly available but it is also possible that authentication and authorisation procedures are necessary.

Interoperable: Data needs to be integrated with other data and interoperate with applications or workflows.

Reusable: Data should be well-described so that they can be used, combined, and extended in different settings.

You can find a more detailed overview of the FAIR principles by GO FAIR of what the FAIR principles recommend. You can also read A FAIRy tale for an understandable explanation of each principle.

Making data ‘FAIR’ is not the same as making it ‘open’. Accessible means that there is a procedure in place to access the data. Data should be as open as possible, and as closed as necessary.

It is also important to say that the FAIR principles are aspirational: they do not strictly define how to achieve a state of FAIRness, but rather describe a continuum of features, attributes, and behaviours that will move a digital resource closer to that goal.

The FAIR principles are also applied to software (see [LGK+20]and [HCH+20]). Watch a ten minute video on FAIR software for a short explanation.

FAIR principles and environmental sustainability#

“FAIR practices can result in highly efficient code implementations, reduce the need to retrain models, and reduce unnecessary data generation/storage, thus reducing the overall carbon footprint. As a result, green computing and FAIR practices may boht stimulate innovation and reduce financial costs.” - [LAB+23]

FAIR principles and accessibility#

The Accessible in FAIR is not equal to ensuring that your research objects are accessibles to all users. For this, the term “actually accessible” has been coined by [CGK23] to refer to data that is “easy to locate, obtain, interpret, use, share, and analyze for everybody, including disabled people.”

Community involvement#

Various online resources are provided for people who are working in the life sciences, to guide them in ensuring FAIRness in their data, providing them with tools and advice for good data management at various stages of their work. Two prominent ones include:

  • Under the FAIR Cookbook, several resources are offering guidance and assistance in FAIR data management. The FAIR Cookbook is designed to serve a variety of audience types and involved in different stages of data management life cycle. The FAIR Cookbook is developed and maintained by life sciences professionals, both in the academia and industry sectors, including members of the ELIXIR community.

  • Under ELIXIR Research Data Management Kit (RDMkit), resources are provided for life scientists to guide them in better management of their research data in adhering to the FAIR Principles. It is an attempt to help researchers work at different capacities, both in individual and collaborative workspaces. The RDMkit is open for suggestions from anyone, as long as they abide by the contributor responsibilities.

Many groups and organisations are working to define guidance and tools to help researchers and other stakeholders (like librarians, funders, publishers, and trainers) make data more FAIR. There are two global initiatives that act as umbrella organisations and reference points for many discipline-specific efforts, including the ones listed above: GOFAIR and the Research Data Alliance (RDA).

  • Under GOFAIR, there are many Implementation Networks (INs) committed to implementing the FAIR principles.

  • Under the RDA, there are several groups tackling different aspects relevant to the RDM life cycle. Among these, one group, the FAIR Data Maturity Model Working Group is reviewing existing efforts, building on them to define a standard set of common assessment criteria for the evaluation of FAIRness.

More information#