This document outlines the Energy Data Centre's approach to applying the FAIR data principles to the content held in the EDC.
F1 (meta)data are assigned a globally unique and persistent identifier.
For some of the research data held within the EDC, there is a DOI minted. For metadata only records, a UUID is assigned internally.
Recording ORCIDs and ROR is on the development roadmap.
F2 data are described with rich metadata (defined by R1 below).
Alongside the standard descriptive metadata, records are assigned terms from two additional domain-based schemes.
F3 metadata clearly and explicitly include the identifier of the data it describes.
The metadata records include the identifier of the data for internally held records. We seek to include the identifier as well as the link to the data for data held elsewhere.
F4 (meta)data are registered or indexed in a searchable resource.
The content within the EDC is searchable through the EDC and is indexed by search engines.
A1 (meta)data are retrievable by their identifier using a standardized communications protocol.
A1.1 the protocol is open, free, and universally implementable.
A1.2 the protocol allows for an authentication and authorization procedure, where necessary.
A2 metadata are accessible, even when the data are no longer available.
All of these are implemented through our standard web interface for humans.
I1 (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation.
EDC metadata is not currently machine accessible. This principle is not met.
Implementing this is on the development roadmap
I2 (meta)data use vocabularies that follow FAIR principles.
While the EDC does use controlled vocabularies that are publicly accessible through the EDC website, these are not machine actionable. This principle is partially met.
I3 (meta)data include qualified references to other (meta)data.
There a links between content within the EDC, currently these are not qualified references.
Extending this is on the development roadmap
R1 meta(data) are richly described with a plurality of accurate and relevant attributes.
As well as the descriptive data for the content, the EDC staff add additional domain based information to aid users to identify whether it is of use. All data is required to have a README file associated with the files to enable potential re-users to establish whether it is useful.
R1.1 (meta)data are released with a clear and accessible data usage license.
All metadata records have a clear license. For physical objects held in the EDC license information is recorded. For metadata only records pointing to other resources, license information will be record if it is found on the remote site.
R1.2 (meta)data are associated with detailed provenance.
Formal Internal audit trail has been implemented since 2023.
R1.3 (meta)data meet domain-relevant community standards.
Descriptive metadata follows general repository standards. Domain based standards in the Energy sector are still emerging.
Wilkinson, M., Dumontier, M., Aalbersberg, I. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016). https://doi.org/10.1038/sdata.2016.18