...
...
...
...
...
...
...
...
Where do I publish “my” MOSAiC data?
- The default repository for MOSAiC is PANGAEA.
- My national funding agency requires depositing data in a special national repository. What should I do? These cases are handled as exceptions (see Data Policy) and are a legitimate reason for not publishing the data in PANGAEA. At the moment, written agreements have been signed with several repositories: Arctic Data Center (ADC), Atmospheric Radiation Measurement (ARM) data center, British Oceanographic Data Centre (BODC), UK Polar Data Centre, and Centre for Environmental Data Analysis (CEDA). When archiving data in these repositories, always acknowledge the MOSAiC project (see Data Policy). The agreements ensure FAIR data publication and future findability of MOSAiC data from a single access point (portal to project and data).
- Other exceptions are possible for special data types (e.g., genomics, source code, high-volume model data) for which PANGAEA is not a suitable repository and a dedicated community repository exists that fulfills the FAIR criteria. When archiving data in these repositories, always acknowledge the MOSAiC project (see Data Policy). Otherwise, findability of MOSAiC data from a single access point (portal) cannot be assured in the future. If you are unsure whether this applies, contact the PANGAEA team.
- PANGAEA does not assign its own DOIs to data sets published elsewhere, nor does it link its own DOIs to them.
When do I submit and publish “my” MOSAiC data?
- Submit your quality-controlled data sets as early as possible and before they are used for a paper. In PANGAEA, the data can be password-protected, which means that only the metadata are accessible and the data themselves cannot be viewed or downloaded. PANGAEA editors can nevertheless provide a temporary access key for reviewers. While the data set status is “in review”, the content can still be changed, and the DOI is already provided for use in your manuscript.
- Cite your data sets in any paper that uses them. Remember, data sets have full citations which can be used just like any other references.
- Like paper publication, data publication involves editorial work, which requires time (sometimes up to several weeks). Do not leave the submission of data for publication until the last minute. No data citation will be possible before the data are actually archived in the repository.
Data publishing workflow
The workflow drafted below includes publishing the data set in the repository before the paper is submitted. This enables correct cross-referencing between the two.
[Figure: data publishing workflow. Icon credits: icon king, BECRIS, and andriy matviychuk (www.freeicons.io); Data Icon #135890 (https://icon-library.com).]
...
...
How do I approach data publication?
Raw data / Primary data publication workflow
- Raw data publication is going to be semi-automatic in the near future, and the responsible PIs will be informed about the process.
- Primary data publication (calibrated data, data prepared for a paper publication) must always be initiated by the authors by opening a data submission ticket in PANGAEA or, if exceptions apply, in another designated repository.
- If the raw data have not yet been published in PANGAEA at the time of primary data publication and are needed, contact the PANGAEA team to initiate the raw data publication.
- During data publication, instruct the editors of your data repository to create links to other versions of the data (e.g., raw data), especially when they were or are being published in another repository.
- All published data must include a funding acknowledgment of MOSAiC in the following form: "Multidisciplinary drifting Observatory for the Study of the Arctic Climate (MOSAiC)" with the tag "MOSAiC20192020". Additionally, the Project ID given for the specific expedition must be mentioned. For the Polarstern expedition this is "AWI_PS122_00". Additional attributions, such as specific award or grant numbers, may be added.
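To keep the acknowledgment wording identical across data sets, the required elements can be assembled programmatically. The following is a minimal sketch only: the function name `mosaic_acknowledgment`, its parameters, and the layout of the resulting string are illustrative assumptions, not an official template. Only the project name, the tag, and the Polarstern Project ID are taken from this page; the grant number in the usage line is a made-up placeholder.

```python
# Values quoted on this page.
MOSAIC_NAME = (
    "Multidisciplinary drifting Observatory for the Study "
    "of the Arctic Climate (MOSAiC)"
)
MOSAIC_TAG = "MOSAiC20192020"
POLARSTERN_PROJECT_ID = "AWI_PS122_00"


def mosaic_acknowledgment(project_id=POLARSTERN_PROJECT_ID, grants=()):
    """Build an acknowledgment string (layout is an assumption)."""
    if not project_id:
        raise ValueError("a Project ID for the expedition is required")
    text = f"{MOSAIC_NAME}, tag {MOSAIC_TAG}, Project ID {project_id}"
    if grants:
        # Optional additional attributions such as award/grant numbers.
        text += "; additional awards: " + ", ".join(grants)
    return text


# "GRANT-0000" is a hypothetical placeholder, not a real award number.
print(mosaic_acknowledgment(grants=["GRANT-0000"]))
```

For expeditions other than Polarstern, pass the Project ID assigned to that expedition instead of relying on the default.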
Data publication in PANGAEA
Datasets in PANGAEA may be archived as stand-alone publications of data (e.g., https://doi.org/10.1594/PANGAEA.753658) or as supplements to an article (e.g., https://doi.org/10.1594/PANGAEA.846130). Data can be submitted to and published in PANGAEA with access restrictions in place for a predefined period (until article publication, or during an embargo period).
Metadata must be submitted together with the data. Minimal requirements are:
- dataset Author(s)
- PI, dataset title
- MOSAiC device operation ID(s) / Event labels associated with individual data / data files
- related institute(s) or publication(s)
...
...
...
...
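Before opening a submission, it can help to check your own metadata record against the minimal requirements. The sketch below covers only the items named in the list above; the dictionary keys (`authors`, `pi`, `title`, `event_labels`, `institutes`, `publications`) are hypothetical names chosen for illustration and are not PANGAEA field names.

```python
# Keys are hypothetical; they mirror the minimal requirements listed above.
REQUIRED_FIELDS = ("authors", "pi", "title", "event_labels")
RELATED_FIELDS = ("institutes", "publications")  # at least one is expected


def missing_metadata(meta):
    """Return the names of minimal metadata items that are absent or empty."""
    missing = [key for key in REQUIRED_FIELDS if not meta.get(key)]
    if not any(meta.get(key) for key in RELATED_FIELDS):
        missing.append("institutes or publications")
    return missing


record = {
    "authors": ["A. Author"],                # placeholder name
    "pi": "P. Investigator",                 # placeholder name
    "title": "Example dataset title",
    "event_labels": [],                      # still to be filled in
    "institutes": ["Example Institute"],
}
print(missing_metadata(record))
```

An empty list means that every item in this partial checklist is present; it does not replace the editorial review by the repository.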
The list of MOSAiC device operation ID(s) / events is available after the end of each leg on the PANGAEA page https://www.pangaea.de/expeditions/byproject/MOSAiC, where it can be viewed or downloaded via the "Event list: " link.
Within the data table, parameters (table header) should be submitted with full names and units. Data submitted in the form of videos, photos, geoTIFF, shape files, netCDF, sgy, etc. will be archived as is (e.g., https://doi.org/10.1594/PANGAEA.865445).
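A quick check of the table header can catch columns submitted without a unit. The sketch below assumes that each header is written as a full parameter name followed by the unit in square brackets, for example `Temperature, water [°C]`; this bracket convention and the function name `headers_without_units` are assumptions made for illustration, so adapt the pattern to the layout agreed with your data curator.

```python
import re

# Assumed layout: "<full parameter name> [<unit>]".
HEADER_PATTERN = re.compile(r"\s*[^\[\]]+?\s*\[[^\[\]]+\]\s*")


def headers_without_units(headers):
    """Return the headers that do not follow the assumed 'name [unit]' layout."""
    return [h for h in headers if not HEADER_PATTERN.fullmatch(h)]


columns = ["Temperature, water [°C]", "Salinity", "Depth, water [m]"]
print(headers_without_units(columns))
```

Columns that legitimately have no unit (for instance labels or dimensionless quantities) will also be reported, so treat the result as a list to review rather than a list of errors.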
More information on data submission can be found at https://wiki.pangaea.de/wiki/Data_submission.
If a published dataset needs to be updated, PANGAEA will upload a new version of this dataset, with new documentation and complete metadata (clearly providing information on the changes between the versions). Both versions can be linked but will have their own permanent DOI.
Data submission to PANGAEA
Publication of primary data sets in PANGAEA or other recommended repositories is the responsibility of each scientist (MOSAiC data policy).
The data can be submitted via https://www.pangaea.de/submit/. Sign in with your user name. If you are not yet a PANGAEA user, you need to register with your name and email address or, ideally, your ORCID iD. An ORCID iD is a unique identifier for academic authors. To find out more about ORCID, visit www.orcid.org.
After signing up, you will receive a link via email to activate your account and sign in. This login can later also be used to access your password-protected dataset during the review of your data publication.
Once logged in, click on “Submit data” and a data submission form will open.
In this form, you can enter the information from your checklist, including
- authors,
- title of your dataset: should reflect what has been measured, observed or calculated, when, where and how.
- description of your data (dataset abstract), and
- other information you deem important, for example your project label. For data from the MOSAiC project, please use the label "MOSAiC".
You can choose the license under which you plan to share your data. We recommend CC-BY, which is suitable for scientific data reuse.
Finally, you need to attach your dataset files. If your data files are larger than 100 MB, we can provide an upload link for large files of up to 10 GB per file. Please indicate this in the Description field.
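To see in advance which files need the upload link, the two size thresholds above can be applied to a list of files. This is a minimal sketch under stated assumptions: the function names and the returned labels are illustrative, and interpreting MB and GB as decimal units (10^6 and 10^9 bytes) is an assumption, since this page does not define them.

```python
from pathlib import Path

# Thresholds from the text; decimal units are an assumption.
DIRECT_ATTACH_LIMIT = 100 * 10**6   # 100 MB
UPLOAD_LINK_LIMIT = 10 * 10**9      # 10 GB per file


def upload_route(size_bytes):
    """Classify one file by size according to the thresholds above."""
    if size_bytes < 0:
        raise ValueError("file size cannot be negative")
    if size_bytes <= DIRECT_ATTACH_LIMIT:
        return "attach to form"
    if size_bytes <= UPLOAD_LINK_LIMIT:
        return "request upload link"
    return "exceeds per-file limit"


def routes_for(paths):
    """Map each existing file path to its upload route."""
    return {str(p): upload_route(Path(p).stat().st_size) for p in paths}


print(upload_route(250 * 10**6))
```

Files reported as exceeding the per-file limit are not covered by the text above, so raise them with the PANGAEA team in your submission.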
Once you have entered all necessary information and attached your data files, click on “Create”. Do not worry if you are uncertain about some fields or content: you will still be able to make final adjustments during the following steps, with the support of your data curator. First, the submission passes a brief editorial review to make sure it is complete. If questions remain, we will get back to you. The submission is then assigned to a data curator, who will guide you through the rest of the process.
After we’ve imported your data to PANGAEA, you’ll be asked to proofread your dataset, which is then “in review”. We lead you through this iterative process with our ticket system until the data submission is complete and approved by you. Once the dataset is approved, the digital object identifier, DOI, is registered and with that the dataset is officially published and citable.
If the related manuscript has not been published yet, a moratorium on access and publication can be put in place. At the same time, PANGAEA can provide a temporary key to enable access to your datasets, for example for anonymous reviewers. In general, a moratorium on MOSAiC data is possible until 2023-01-01.
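When planning an embargo, a requested end date can be compared with the general limit stated above. The sketch below is illustrative only: the function name `moratorium_end` is an assumption, and because the limit applies "in general", any exception would need to be agreed with the PANGAEA team rather than decided by code.

```python
from datetime import date

# General limit for a moratorium on MOSAiC data, as stated on this page.
MORATORIUM_LIMIT = date(2023, 1, 1)


def moratorium_end(requested):
    """Return the requested end date, capped at the general limit."""
    return min(requested, MORATORIUM_LIMIT)


print(moratorium_end(date(2022, 6, 30)))
```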
...