ARC Commander QuickStart
last updated at 2022-08-05
This document is a work in progress. If you notice any inconsistencies, have questions, or would like to suggest additions, please feel free to contact us or open an issue in our Helpdesk.
We are very happy that you chose our tools and infrastructure to create and share your own ARCs. In this QuickStart we focus on how to use the "ARC Commander" to store and share your data.
Environment and setup
The command line
- Most of this QuickStart (especially the section ARC initialization) is based on the command line (Windows: PowerShell; Linux and Mac: terminal).
- On Windows, you can open PowerShell by typing powershell into the address bar of the File Explorer.
- Text formatted as code blocks represents commands to copy/paste into the command line:
echo "hello - I am a code block"
Note: If this is your first time using git on this computer, you need to set your git user name and email address, which are displayed on your git commits. You can update the settings with
git config --global user.name <your_name>
git config --global user.email <your_email>
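Before changing anything, it can help to see what git currently has stored. The following is a minimal sketch for a POSIX-like shell (Linux, macOS, or Git Bash on Windows); the function name show_git_identity is only illustrative and not part of git or the ARC Commander.

```shell
# Print the globally configured git identity, or a hint if a value is missing.
# show_git_identity is a hypothetical helper name used for illustration only.
show_git_identity() {
  for key in user.name user.email; do
    # `git config --get` exits non-zero when the key is unset
    value=$(git config --global --get "$key" 2>/dev/null) || value=""
    if [ -n "$value" ]; then
      echo "$key = $value"
    else
      echo "$key is not set"
    fi
  done
}

show_git_identity
```

If one of the two values is reported as not set, run the corresponding git config command from the note above.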
ARC initialization
- Create and navigate to a local folder that you want to initialize as an ARC.
- Initialize your ARC by executing
arc init
- This will create the general ARC folder structure.
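To inspect the folder structure on your own machine, a short listing can be used. This is a sketch for a POSIX-like shell, assuming a find implementation that supports -maxdepth (GNU and BSD find do); list_arc_tree is a hypothetical helper name, and the exact folders shown depend on your ARC Commander version.

```shell
# List files and folders up to two levels deep, skipping git internals.
# list_arc_tree is a hypothetical helper; $1 is the ARC folder (default: current directory).
list_arc_tree() {
  find "${1:-.}" -maxdepth 2 ! -path '*/.git' ! -path '*/.git/*' | sort
}

list_arc_tree .
```

Running the same listing again after adding studies and assays shows the subdirectories that were created for them.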
The ISA investigation (-i) workbook allows you to record administrative metadata of your project. Add the isa.investigation.xlsx workbook including an identifier to your ARC with
arc i create -i QuickStartInvestigation
ISA studies and assays
The ISA study (-s) and ISA assay (-a) workbooks allow you to annotate your experimental data.
- Add an isa.study.xlsx workbook including an identifier to your ARC with
arc s add -s QuickStartStudy
- Add an isa.assay.xlsx workbook including an identifier to your ARC with
arc a add -s QuickStartStudy -a QuickStartAssay
Note: An assay must be linked to a study. If a study does not exist, it will be created automatically in this step.
- The ARC Commander will add subdirectories for the new study and assay to the studies and assays folders.
- These steps can be repeated to add as many studies and assays as needed. Accordingly, more subdirectories will be added. Multiple assays can be grouped in a study when the same StudyIdentifier is used.
- Place the data for each assay in the respective dataset folder.
Sharing your ARC
DataPLANT registration and access
If you are not a member of DataPLANT yet, please sign up. Afterwards, you will be granted access to DataPLANT's DataHUB. The DataHUB allows you to share your ARCs with registered lab or project partners.
After successful registration, create and set an access token for ARC Commander synchronization using
arc remote accesstoken get -s https://git.nfdi4plants.org
A browser window will open, asking for your DataPLANT login. If you are already logged in, the browser will directly display a success message.
- Synchronize your ARC with the DataHUB using the arc sync command.
- If you have not connected your local ARC to a remote one so far, you can specify the remote address with the flag -r followed by a URL, e.g.,
arc sync -r https://git.nfdi4plants.org/martinkuhl/QuickStart
- If you want to create a new remote repository at this URL, the URL needs to be assembled like the example above, following the pattern https://git.nfdi4plants.org/<YourUserName>/<YourARC>.
- If no repository exists under the given URL, the ARC Commander will report an error so that you can check whether you spelled the URL correctly. To force synchronization, use
arc sync -f
- Check if the upload was successful by visiting your ARC at the respective URL in your browser.
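Because a misspelled remote address leads to an error, it can be convenient to assemble the URL from its parts before syncing. The sketch below only builds and prints the command; arc_remote_url is a hypothetical helper name, and the user and ARC names are the ones from the example above.

```shell
# Build the DataHUB remote URL from a user name and an ARC (repository) name.
# arc_remote_url is a hypothetical helper, not an ARC Commander command.
arc_remote_url() {
  # $1: DataHUB user name, $2: ARC name
  printf 'https://git.nfdi4plants.org/%s/%s\n' "$1" "$2"
}

url=$(arc_remote_url "martinkuhl" "QuickStart")

# Print the command for review instead of executing it right away
echo "arc sync -r $url"
```

Once the printed command looks right, copy it into the command line to run the actual synchronization.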
Note: Alternatively, you can first create a new blank repository in the DataHUB by clicking "New project/repository" in the plus drop-down menu of the navigation bar at the top. Afterwards, you can sync your local ARC to the respective repository by adapting the URL to the newly generated one.
Setting a git user
Some users might want to use different signatures for different repositories, e.g. for developing software on GitHub and working on ARCs on DataPLANT's DataHUB. Besides your global git configuration, you can store the information you want to use for editing ARCs within the ARC Commander config:
arc config set -g -n "general.gitname" -v "Name of choice"
arc config set -g -n "general.gitemail" -v "Email of choice"
To transfer the information from the global ARC Commander config to the local git config of the ARC use
arc config setgituser
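To confirm that the ARC now carries its own signature, you can read the local git config of the ARC folder. This is a minimal sketch for a POSIX-like shell; show_local_git_user is a hypothetical helper name, and it relies only on standard git options (-C and --local).

```shell
# Show the git identity stored in the local config of a repository.
# show_local_git_user is a hypothetical helper; $1 is the ARC folder (default: current directory).
show_local_git_user() {
  for key in user.name user.email; do
    # --local restricts the lookup to the repository's own config file
    value=$(git -C "${1:-.}" config --local --get "$key" 2>/dev/null) || value=""
    echo "$key = ${value:-<not set locally>}"
  done
}

show_local_git_user .
```

Values reported as not set locally mean that git falls back to your global configuration for this ARC.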
You can invite lab colleagues or project partners to join your ARC for collaboration. While inside your ARC on the DataHUB, click on Project information -> Members in the left navigation panel. Search for registered researchers and select a role for each individually. Each role comes with different rights.
- Guests: Have the fewest rights. This is recommended for people you ask for consultancy.
- Developers: The choice for most people you want to invite to your ARC. Developers have read and write access, but cannot maintain the project on the DataHUB, e.g. by inviting others.
- Maintainers: Gives the person the same rights as you have (except for removing you from your own project). This is recommended for inviting PIs or group leaders, allowing them to add their group members to the project for data upload or analysis as well.
The Minimalist's ARC-QuickStart
Voila! You are ready to follow these few steps to create a minimal ARC sharable via DataPLANT's DataHUB:
- Visit the DataHUB, create a new repository and copy the URL to your clipboard.
- Replace the <variables> in the following code block with your information and execute it in your command line.
# Create and navigate to your ARC folder
mkdir <YourARC>
cd <YourARC>
# Set up the ARC structure with one study and one assay
arc init
arc i create -i <YourInvestigation>
arc a add -s <YourStudy> -a <YourAssay>
arc sync -f -r https://git.nfdi4plants.org/<YourUserName>/<YourARC> -m "initialize ARC structure"
Note: A detailed usage instruction for the ARC Commander can be found here.
Besides these technical solutions, DataPLANT supports you with community-engaged data stewardship. For further assistance, feel free to reach out via our Helpdesk or by contacting us.