Adding external data to the ARC

last updated at 2023-07-07

About this guide

In this guide we recommend routines to properly add data from external sources to your ARC.

User: Advanced | Mode: Tutorial

Before we can start

☑️ You are familiar with the ARC concept and ISA file types

Research projects rarely start out of the blue. Rather, every project builds on previous findings and on published or unpublished datasets.

Add a study to store and describe the external data

To properly reuse and reference such a dataset, we recommend adding a study to your ARC. By default, every study comes with four parts:

└── <StudyName>
    ├── README.md
    ├── isa.study.xlsx
    ├── protocols
    └── resources

In the resources directory, you can add the data (e.g. supplemental data files).
In the protocols directory, you can add notes on how and from where you retrieved the data.
The study is registered in your ARC's isa.investigation.xlsx, which includes a "STUDY PUBLICATIONS" section for every study. In this section, you can add publication details (author, DOI, etc.) about the external data source.
Finally, the README.md is a good place to tell other viewers of your ARC about the source and details of the external study. This file is also prominently displayed in the respective folder in the DataHUB.

💡 The easiest way to add a new study is by using the ARC Commander's function:

arc study add --identifier <StudyName>
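
Once the study exists, filling it could look roughly like the sketch below. It is an illustration under assumptions, not a prescribed routine: the study name `ExternalStudy`, the file names, and the `studies/` location inside the ARC root are hypothetical and not taken from this guide. The `mkdir` call only stands in for the folders that `arc study add` would provide, so that the example runs on its own in a temporary directory.

```shell
#!/bin/sh
set -e

# Hypothetical names, for illustration only
STUDY="ExternalStudy"
ARC_ROOT="$(mktemp -d)"                 # throwaway stand-in for your ARC root
STUDY_DIR="$ARC_ROOT/studies/$STUDY"    # assumed location of studies inside an ARC

# Stand-in for the folders that `arc study add --identifier ExternalStudy` would provide
mkdir -p "$STUDY_DIR/resources" "$STUDY_DIR/protocols"

# Pretend this is a supplemental file downloaded from the external source
DOWNLOAD="$ARC_ROOT/supplement_table_S1.csv"
printf 'gene,value\nexample_gene,1\n' > "$DOWNLOAD"

# 1. The data itself goes into resources
cp "$DOWNLOAD" "$STUDY_DIR/resources/"

# 2. Notes on how and from where the data was retrieved go into protocols
cat > "$STUDY_DIR/protocols/data_retrieval.md" <<EOF
Retrieved: $(date +%Y-%m-%d)
Source: <URL or DOI of the external dataset>
File: supplement_table_S1.csv
EOF

# 3. The README tells viewers of the ARC where the study's data comes from
cat > "$STUDY_DIR/README.md" <<EOF
# $STUDY

External data reused in this ARC. See protocols/data_retrieval.md for the source.
EOF

# Show the resulting layout
ls -R "$STUDY_DIR"
```

In a real ARC you would replace the temporary directory with your ARC root and the placeholder source line with the actual reference. The publication details themselves still belong in the "STUDY PUBLICATIONS" section of isa.investigation.xlsx.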

💡 As with any other routine researchers use to share scientific results and data, the responsibility lies with the individual researcher: to uphold scientific integrity, to follow guidelines of good scientific practice, institutional guidelines for data handling, and the applicable licensing laws, and, if applicable, to properly reference or cite the data source.

💡 If you are unsure about the conditions for reusing data from an external source, you can add the datasets to the .gitignore file so that Git does not track them.
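
A minimal sketch of the .gitignore tip, under assumptions: the path `studies/ExternalStudy/resources/restricted_dataset.csv` is hypothetical, and a temporary directory stands in for your ARC root. The optional check at the end uses `git check-ignore` and is skipped if Git is not installed.

```shell
#!/bin/sh
set -e

ARC_ROOT="$(mktemp -d)"    # throwaway stand-in for your ARC root
cd "$ARC_ROOT"

# Hypothetical dataset whose reuse conditions are unclear
DATASET="studies/ExternalStudy/resources/restricted_dataset.csv"
mkdir -p "$(dirname "$DATASET")"
printf 'placeholder\n' > "$DATASET"

# Add the path to .gitignore so that Git does not track the file
printf '%s\n' "$DATASET" >> .gitignore

# Optional check: ask Git whether the path is ignored
if command -v git >/dev/null 2>&1; then
    git init -q .
    if git check-ignore -q "$DATASET"; then
        echo "ignored: $DATASET"
    fi
fi
```

The file stays on your disk but is left out of commits. Note that .gitignore only affects untracked files: a dataset that was already committed remains tracked until it is removed from the index.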

DataPLANT Support

Besides these technical solutions, DataPLANT supports you with community-engaged data stewardship. For further assistance, feel free to reach out via our helpdesk or contact us directly.
