3 Workflow
The general workflow of making your data more FAIR and structured is shown in Figure 3.1 and shortly described in the following section.
To make your data more FAIR and structured, we recommend you to:
First, describe your data and thereby gather metadata.
Afterwards, you should decide whether your raw data should be mobilised to an online repository.
- If yes, you have to choose a suitable repository and upload your metadata and data there.
Next, you can start to standardise both the data and the metadata. With which of the two you start does not matter, as they can both be structured and standardised separately.
For the metadata:
First, choose a metadata standard.
Next, map the metadata to the selected standard.
For the data:
First, choose a suitable data standard.
Then, map the data to that standard.
For further standardisation of your data, use ontologies and biological taxonomies where possible to describe the contents of the data.
After standardising the data, you should choose a standardised structure for the data.
Restructure your data according to the chosen structure.
Once standards and structure are applied, structural metadata can be created that describes how the data is organised in its new form.
As a last step, your standardised data and metadata should be persistently stored in an online storage. If you have not chosen a repository before (or if you want to store the standardised data elsewhere), you should:
Choose a suitable repository that accepts your new data structure/format.
Mobilise the data and the metadata to the chosen repository.
Even though this guide is tailored to ecological data, the workflow presented here should be generally applicable to any type of data, only the implementations of each step will differ. In the following chapters, we will go through each of the steps and provide some general background, introduce some options for implementation for ecological data and considerations that can help to choose the best suited option for your data, as well as a short insight into our own choices for the example datasets.