Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

How do I approach data publication?

Row data / Primary data publication workflow


Image Added

  • Raw data publication is going to be semi-automatic in the near future and the responsible PIs will be informed about the process.
  • Primary data publication (calibrated data, data ready made for a paper publication) needs to be always initiated by the authors by opening a data submission ticket in PANGAEA or other designated repository, if exceptions apply.
  • If the raw data wasn’t published with PANGAEA at the time of primary data publication yet, and is needed, contact the PANGAEA team for initiating the raw data publication.
  • During data publication instruct the editors in your data repository to create links to other versions of data (e.g., raw data), especially when they were or are being published in another repository.
  • All published data must include a funding acknowledgment of MOSAiC in the following form: "Multidisciplinary drifting Observatory for the Study of the Arctic Climate (MOSAiC)" with the tag "MOSAiC20192020”. Additionally, the Project ID given for specific expedition must be mentioned. For the Polarstern expedition this is "AWI_PS122_00". Additional attributions like specific award/grant numbers might be added.


Datasets in PANGAEA may be archived as stand-alone publications of data (e.g., https://doi.org/10.1594/PANGAEA.753658) or as supplements to an article (e.g., https://doi.org/10.1594/PANGAEA.846130). Data can be submitted to and published in PANGAEA with access restrictions in place for a predefined period (until article publication, or during an embargo period).

Metadata must be submitted together with the data. Minimal requirements are:

  • dataset Author(s)
  • PI, dataset title
  • MOSAiC device operation ID(s) / Event labels associated with individual data / data files
  • related institute(s) or publication(s))

Any documentation (e.g., MOSAiC Standard operating procedures, MSOPs) helping to understand the data can and should be linked to the dataset(s). If no persistent link to the documents can be provided, PANGAEA can archive the documents permanently alongside the data.
The granularity of the data is up to the author(s) of the dataset. Lower-granularity datasets can be combined in a (time-)series collection dataset as in https://doi.org/10.1594/PANGAEA.873032. During submission (https://www.pangaea.de/submit/), the connection with MOSAiC has to be clearly stated in the Label Field of the Data Submission ("MOSAiC"). The MOSAiC Project ID (for the Polarstern expedition this is "AWI_PS122_00") is internally associated as a grant number of the MOSAiC project and does not have to be inserted in the submission form additionally.

The MOSAiC Device operation ID(s) / Events list is available after the end of each leg from PANGAEA page https://www.pangaea.de/expeditions/byproject/MOSAiC (can be found for viewing or download under "Event list: " link).

Within the data table, parameters (table header) should be submitted with full names and units. Data submitted in the form of videos, photos, geoTIFF, shape files, netCDF, sgy, etc. will be archived as is (e.g., https://doi.org/10.1594/PANGAEA.865445).

More information on data submission can be found in https://wiki.pangaea.de/wiki/Data_submission.


If a published dataset needs to be updated, PANGAEA will upload a new version of this dataset, with new documentation and complete metadata (clearly providing information on the changes between the versions). Both versions can be linked but will have their own permanent DOI.


Data submission to PANGAEA

Are you planning to share your research data? That is a great idea. If you publish your data with PANGAEA, we will provide a long-term data archiving service using the FAIR principles:

  • Findable,
  • Accessible,
  • Interoperable,
  • Reusable

With PANGAEA your data will be found by others, re-used, and cited appropriately – ensuring that you get the deserved credit for it.

Data submission is easy. All you need is your data, some additional information called metadata, and the willingness to open up your data to the world. By granting access to your data in a citable format you can greatly increase the impact of your scientific work. PANGAEA is a fully curated data repository, so once you take the initial step, we will help you personally through the further submission process.

To make the submission process smoother, you can work through the following checklist to see if you have everything ready:

  • the final dataset files
  • information about geolocation 
  • a title for your dataset
  • list of  the authors of your dataset
  • and in case these data belong to an article, please have the tentative citation ready

With all this prepared, you can start your submission. Just go to www.pangaea.de and sign in. If you are not a user yet you need to register with your name and E-Mail address, or ideally your ORCID ID. An ORCID  ID is a unique identifier for academic authors. To find more about ORCiD, visit www.orcid.org.

This login can later also be used to access your password protected dataset during the review of your data publication. After signing up, you will receive a link via email to activate your account and sign in.

Once logged in, click on “Submit data” and a data submission form will open.

In this form, you can enter the information from your checklist, including

  • authors,
  • title of your dataset,
  • description of your data, and
  • other information you deem important, for example your project label.


You can choose the license under which you plan to share your data. We recommend CC-BY, which is suitable for scientific data reuse.

Finally and most importantly you need to attach your dataset files.  

If your data files are larger than 100 MB we can provide an upload link for large files  up to 10 GB per file. Please indicate this in the Description-field.


Once you think you have entered all necessary information and data files, you can click on “Create”.  Do not worry if you are uncertain about some fields or content, you will still be able to make final adjustments during the following steps and with support of your data curator.  

First the submission passes an editorial review to make sure the data submission fits the scope of PANGAEA and is complete. If questions remain, we will get back to you. Then the submission will be assigned to a data curator who will lead you through the further process.

After we’ve imported your data to PANGAEA, you’ll be asked to proofread your dataset, which is then “in review”. We lead you through this iterative process with our ticket system until the data submission is complete and approved by you.

Once the dataset is approved, the digital object identifier, DOI, is registered and with that the dataset is officially published and citable.


If the related manuscript has not been published yet, a moratorium on access and publication can be put in place. At the same time, PANGAEA can provide a temporary key to enable access to your datasets for example for anonymous reviewers.

To assure that the FAIR principles are met and that we can apply high quality standards, we provide professional and personalized data curation by skilled curators for every complete submission. This requires time. Thus, submit your data as early as possible to PANGAEA and we can keep your data under a moratorium until your publication has been accepted.

By sharing your data in PANGAEA you will increase the impact of your research data and receive proper credit for it.

See, it’s not difficult at all! Get your data ready and let’s get started at www.pangaea.de.