
squirrel vs BIDS

Understanding the differences between package formats

BIDS and squirrel are both file formats designed to store neuroimaging data. They are similar but differ in implementation. If you are familiar with BIDS, squirrel will be easy to understand.

squirrel vs BIDS objects

| squirrel | BIDS | Notes |
| --- | --- | --- |
| subject | sub- directory | The subject object. BIDS sub-* directories contain the ID; squirrel objects are identified by the ID. |
| study | ses- directory, *_sessions.tsv | Session/imaging study object. |
| series | *.nii.gz files, *.nii files, anat/func/fmap/ieeg/perf/eeg directories, *events.json file, *events.tsv file, <modality>.json file | Mapping series within BIDS can be tricky. There is limited mapping between squirrel and BIDS for this object. |
| analysis | derivatives directory, figures directory, motion directory, *_scans.tsv file | The analysis results object/directory. |
| pipeline | code directory | Code, pipelines, and scripts used to perform analysis on raw data. |
| experiment | task-*.json, task-*.tsv | Details on the experiment. |
| root -> description | dataset_description.json | Details about the dataset. |
| root -> changes | CHANGES | Any information about changes to this dataset from a previous version. |
| root -> readme | README, README.md | More details about the dataset. |
| subject -> demographics | participants.tsv, participants.json | Details about subject demographics. |

Specification v1.0

Format specification for v1.0

Overview

A squirrel package contains a JSON file with meta-data about all of the data in the package, and a directory structure to store files. While many data items are optional, a squirrel package must contain a JSON file and a data directory.

JSON File

JSON is JavaScript Object Notation; many tutorials are available on how to read and write JSON files. Within the squirrel format, keys are camel-case, for example dayNumber or dateOfBirth: each word in the key is capitalized except the first. The JSON file should be manually editable.

Data types

The JSON specification includes several data types; squirrel adds some derivative data types: date, datetime, and char. These are stored as the JSON string data type and should be enclosed in double quotes.

| Type | Notes | Example |
| --- | --- | --- |
| string | Regular string | "My string of text" |
| number | Any JSON-acceptable number | 3.14159 or 1000000 |
| datetime | Formatted as YYYY-MM-DD HH:MI:SS, where all numbers are zero-padded and use a 24-hour clock. Stored as a JSON string. | "2022-12-03 15:34:56" |
| date | Formatted as YYYY-MM-DD | "1990-01-05" |
| char | A single character | "F" |
| bool | true or false | true |
| JSON array | Item is a JSON array of any data type | |
| JSON object | Item is a JSON object | |
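As a quick illustration of the date and datetime types, here is a sketch in Python (the format itself is language-neutral; the squirrel library itself is C++):

```python
from datetime import datetime

def to_squirrel_datetime(dt: datetime) -> str:
    # squirrel 'datetime' type: YYYY-MM-DD HH:MI:SS, zero-padded, 24-hour clock
    return dt.strftime("%Y-%m-%d %H:%M:%S")

def to_squirrel_date(dt: datetime) -> str:
    # squirrel 'date' type: YYYY-MM-DD
    return dt.strftime("%Y-%m-%d")

stamp = datetime(2022, 12, 3, 15, 34, 56)
print(to_squirrel_datetime(stamp))  # 2022-12-03 15:34:56
print(to_squirrel_date(stamp))      # 2022-12-03
```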

Directory Structure

The JSON file squirrel.json is stored in the root directory. A directory called data contains any data described in the JSON file. Files can be of any type, with any file extension. Because of the broad range of environments in which squirrel packages are used, filenames must contain only alphanumeric characters, with no special characters or spaces, and must be less than 255 characters in length.
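A sketch of a filename check in Python. Note an assumption: the rule above says alphanumeric only, but the example package later in this document also uses '.', '_', and '-' in filenames, so those characters are accepted here; tighten the pattern for a stricter reading.

```python
import re

# Alphanumerics plus '.', '_', '-' (the latter three are an assumption based on
# the example package filenames such as 6028_1_1_00001.nii.gz)
FILENAME_RE = re.compile(r"^[A-Za-z0-9._-]+$")

def is_valid_squirrel_filename(name: str) -> bool:
    # under 255 characters, no spaces or special characters
    return 0 < len(name) < 255 and bool(FILENAME_RE.match(name))

print(is_valid_squirrel_filename("6028_1_1_00001.nii.gz"))  # True
print(is_valid_squirrel_filename("my file (copy).nii"))     # False
```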

Squirrel Package

A squirrel directory structure becomes a package once it is combined into a zip file. The compression level does not matter, as long as the file is a .zip archive. Once created, the package can be distributed to other instances of NiDB or squirrel readers, or simply unzipped and the data extracted manually. Packages can be created manually or exported using NiDB or squirrel converters.
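Creating a package manually amounts to zipping the assembled directory tree. A minimal sketch in Python (file and package names here are illustrative, not defined by the spec):

```python
import json
import zipfile
from pathlib import Path

def write_squirrel_package(package_dir: str, zip_path: str) -> None:
    # Zip the assembled tree (squirrel.json + data/) into a .zip archive.
    # Per the spec, the compression level does not matter.
    root = Path(package_dir)
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(root.rglob("*")):
            zf.write(path, path.relative_to(root))

# assemble a minimal package
root = Path("example_package")
(root / "data").mkdir(parents=True, exist_ok=True)
(root / "squirrel.json").write_text(json.dumps({"package": {"PackageName": "Example"}}))
write_squirrel_package(root, "example.zip")

with zipfile.ZipFile("example.zip") as zf:
    print(sorted(zf.namelist()))
```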

Package Specification

JSON resources:

  • JSON tutorial - https://www.w3schools.com/js/js_json_intro.asp

  • Wiki - https://en.wikipedia.org/wiki/JSON

  • JSON specification - https://www.json.org/json-en.html

Using the squirrel library

Overview of how to use the squirrel C++ library

The squirrel library is built using the Qt framework and gdcm. Both are available as open-source, and make development of the squirrel library much more efficient.

The Qt and gdcm libraries (or DLLs on Windows) will need to be redistributed along with any programs that use the squirrel library.

Including squirrel

The squirrel library can be included at the top of your program. Make sure the path to the squirrel library is in the INCLUDE path for your compiler.

#include "squirrel.h"

Reading

Create an object and read an existing squirrel package

squirrel *sqrl = new squirrel();
sqrl->SetPackagePath("/path/to/data.sqrl");
if (sqrl->Read()) {
    cout << "Successfully read squirrel package" << endl;
}
else {
    cout << "Error reading squirrel package. Log [" << sqrl->GetLog() << "]" << endl;
}

/* print the entire package */
sqrl->Print();

/* access individual package meta-data */
cout << sqrl->name;

/* delete squirrel object */
delete sqrl;

Iterating subject/study/series data

Functions are provided to retrieve lists of objects.

/* iterate through the subjects */
QList<squirrelSubject> subjects = sqrl->GetSubjectList();
foreach (squirrelSubject subject, subjects) {
    cout << "Found subject [" << qPrintable(subject.ID) << "]" << endl;

    /* get studies */
    QList<squirrelStudy> studies = sqrl->GetStudyList(subject.GetObjectID());
    foreach (squirrelStudy study, studies) {
        cout << "Found study [" << study.StudyNumber << "]" << endl;

        /* get series */
        QList<squirrelSeries> serieses = sqrl->GetSeriesList(study.GetObjectID());
        foreach (squirrelSeries series, serieses) {
            cout << "Found series [" << series.SeriesNumber << "]" << endl;
            int numfiles = series.files.size();
        }
    }
}
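Because the package metadata is plain JSON, the same subject/study/series traversal can be sketched without the C++ library at all. Here in Python, using the structure shown in the example package later in this document:

```python
import json

# minimal squirrel.json fragment (structure taken from the example package)
package = json.loads("""
{"data": {"subjects": [
  {"SubjectID": "6028", "studies": [
    {"StudyNumber": 1, "series": [{"SeriesNumber": 1}, {"SeriesNumber": 3}]}
  ]}
]}}
""")

def walk_package(pkg):
    # mirrors GetSubjectList / GetStudyList / GetSeriesList in the C++ library
    for subject in pkg.get("data", {}).get("subjects", []):
        for study in subject.get("studies", []):
            for series in study.get("series", []):
                yield subject["SubjectID"], study["StudyNumber"], series["SeriesNumber"]

for subj, study, series in walk_package(package):
    print(f"subject [{subj}] study [{study}] series [{series}]")
```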

Finding data

How to get a copy of an object, for reading or searching a squirrel package.

/* get a subject by ID; the returned object is a read-only copy */
qint64 subjectObjectID = sqrl->FindSubject("12345");
squirrelSubject subject = sqrl->GetSubject(subjectObjectID);
QString guid = subject.GUID;
subject.PrintDetails();

/* get a subject by SubjectUID (DICOM field) */
squirrelSubject subjectByUID = sqrl->GetSubject(sqrl->FindSubjectByUID("08.03.21-17:51:10-STD-1.3.12.2.1107.5.3.7.20207"));
subjectByUID.PrintDetails();

/* get a study by subject ID and study number */
squirrelStudy study = sqrl->GetStudy(sqrl->FindStudy("12345", 2));
QString studyEquipment = study.Equipment;
study.PrintDetails();

/* get a series by SeriesUID (DICOM field) */
squirrelSeries series = sqrl->GetSeries(sqrl->FindSeriesByUID("09.03.21-17:51:10-STD-1.3.12.2.1107.5.3.7.20207"));
QDateTime seriesDate = series.DateTime;
series.PrintDetails();

/* get a series by subject ID 12345, study number 2, and series number 15 */
squirrelSeries series2 = sqrl->GetSeries(sqrl->FindSeries("12345", 2, 15));
QString seriesProtocol = series2.Protocol;
series2.PrintDetails();

/* get an analysis by subject ID 12345, study 2, and analysis 'freesurfer' */
squirrelAnalysis analysis = sqrl->GetAnalysis(sqrl->FindAnalysis("12345", 2, "freesurfer"));

/* get other objects by their names */
squirrelDataDictionary dataDictionary = sqrl->GetDataDictionary(sqrl->FindDataDictionary("MyDataDict"));
squirrelExperiment experiment = sqrl->GetExperiment(sqrl->FindExperiment("MyExperiment"));
squirrelGroupAnalysis groupAnalysis = sqrl->GetGroupAnalysis(sqrl->FindGroupAnalysis("MyGroupAnalysis"));
squirrelPipeline pipeline = sqrl->GetPipeline(sqrl->FindPipeline("MyPipeline"));

How to modify existing objects in a package: iterate by index, as shown below, to obtain a read/write reference to the original object rather than a read-only copy.

Experiments and Pipelines

Access to these objects is similar to accessing subjects.

/* iterate by list to access copies of the objects (read-only) */
foreach (squirrelExperiment exp, sqrl->experimentList) {
    cout << qPrintable(exp.experimentName) << endl;
}
foreach (squirrelPipeline pipe, sqrl->pipelineList) {
    cout << qPrintable(pipe.pipelineName) << endl;
}

/* iterate by index to change the original object (read/write) */
for (int i=0; i < sqrl->experimentList.size(); i++) {
    sqrl->experimentList[i].numFiles = 0;
}
for (int i=0; i < sqrl->pipelineList.size(); i++) {
    sqrl->pipelineList[i].numFiles = 0;
}

Writing

Create a new squirrel package and add a subject

squirrel *sqrl = new squirrel();

/* set the package details */
sqrl->name = "LotsOfData";
sqrl->description = "My First squirrel package";
sqrl->datetime = QDateTime::currentDateTime();
sqrl->subjectDirFormat = "orig";
sqrl->studyDirFormat = "orig";
sqrl->seriesDirFormat = "orig";
sqrl->dataFormat = "nifti";

/* create a subject */
squirrelSubject sqrlSubject;
sqrlSubject.ID = "123456";
sqrlSubject.alternateIDs = QString("Alt1, 023043").split(",");
sqrlSubject.GUID = "NDAR12345678";
sqrlSubject.dateOfBirth = QDate::fromString("2000-01-01", "yyyy-MM-dd");
sqrlSubject.sex = "O";
sqrlSubject.gender = "O";
sqrlSubject.ethnicity1 = "hispanic"; /* example values; fill from your own data source */
sqrlSubject.ethnicity2 = "asian";

/* add the subject. This subject has only demographics; there are no studies or series yet */
sqrl->addSubject(sqrlSubject);
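The subject built above ends up in squirrel.json as a JSON entry. A sketch constructing the equivalent object directly in Python (field values are illustrative; key names follow the example squirrel.json later in this document):

```python
import json

subject = {
    "SubjectID": "123456",
    "AlternateIDs": ["Alt1", "023043"],
    "GUID": "NDAR12345678",
    "DateOfBirth": "2000-01-01",
    "Sex": "O",
    "Gender": "O",
    "Ethnicity1": "hispanic",   # illustrative value
    "Ethnicity2": "asian",      # illustrative value
    "studies": [],              # no studies or series yet
}
print(json.dumps(subject, indent=2))
```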

Add a study to existing subject

/* see if we can find a subject by ID */
int subjIndex = sqrl->GetSubjectIndex("123456");
if (subjIndex >= 0) {

    /* build the study object */
    squirrelStudy sqrlStudy;
    sqrlStudy.number = 1;
    sqrlStudy.dateTime = QDateTime::fromString("2023-06-19 15:34:56", "yyyy-MM-dd hh:mm:ss");
    sqrlStudy.ageAtStudy = 34.5;
    sqrlStudy.height = 1.5; // meters
    sqrlStudy.weight = 75.9; // kg
    sqrlStudy.modality = "MR";
    sqrlStudy.description = "MJ and driving";
    sqrlStudy.studyUID = "";
    sqrlStudy.visitType = "FirstVisit";
    sqrlStudy.dayNumber = 1;
    sqrlStudy.timePoint = 1;
    sqrlStudy.equipment = "Siemens 3T Prisma";
    
    sqrl->subjectList[subjIndex].addStudy(sqrlStudy);
}
else {
    cout << "Unable to find subject by ID [123456]" << endl;
}

Write package

QString outdir = "/home/squirrel/thedata"; /* output directory of the squirrel package */
QString zippath; /* the full filepath of the written zip file */

sqrl->write(outdir, zippath);

data

JSON object

This object contains information about the subjects, and potential future data.

JSON variables

| Variable | Type | Default | Description |
| --- | --- | --- | --- |
| GroupAnalysisCount 🟡 | number | | Number of group analyses. |
| SubjectCount 🟡 | number | | Number of subjects in the package. |
| subjects | JSON array | | Array containing the subjects. |
| group-analysis | JSON array | | Array containing the group analyses. |

🟡 Computed (squirrel writer/reader should handle these variables)

Directory structure

Files associated with this section are stored in the following directory, but actual binary data should be stored in the subjects or group-analysis subdirectories.

/data

Package root

JSON object

The package root contains all data and files for the package. The JSON root contains all JSON objects for the package.

JSON variables

| Variable | Type | Default | Description |
| --- | --- | --- | --- |
| package | JSON object | | Package information. |
| data | JSON object | | Raw and analyzed data. |
| pipelines | JSON object | | Methods used to analyze the data. |
| experiments | JSON object | | Experimental methods used to collect the data. |
| data-dictionary | JSON object | | Data dictionary containing descriptions, mappings, and key/value information for any variables in the package. |
| NumPipelines 🟡 | number | | Number of pipelines. |
| NumExperiments 🟡 | number | | Number of experiments. |
| TotalFileCount 🟡 | number | | Total number of data files in the package, excluding .json files. |
| TotalSize 🟡 | number | | Total size, in bytes, of the data files. |

🟡 Computed (squirrel writer/reader should handle these variables)

Directory structure

Files associated with this object are stored in the following directory.

/

series

JSON array

An array of series. Basic series information is stored in the main squirrel.json file. Extended information, including series parameters such as DICOM tags, is stored in a params.json file in the series directory.

JSON variables

| Variable | Type | Default | Description |
| --- | --- | --- | --- |
| BIDSEntity | string | | BIDS entity (anat, fmri, dwi, etc.) |
| BIDSSuffix | string | | BIDS suffix |
| BIDSTask | string | | BIDS task name |
| BIDSRun | number | | BIDS run number |
| BIDSPhaseEncodingDirection | string | | BIDS PE direction |
| Description | string | | Description of the series |
| ExperimentName | string | | Experiment name associated with this series. Links to the experiments section of the squirrel package. |
| Protocol | string | | Protocol name |
| Run | number | | The run identifies the order of acquisition in cases of multiple identical series. |
| SeriesDatetime | datetime | | Date and time of the series, usually taken from the DICOM header |
| SeriesNumber 🔵 | number | | Series number. May be sequential, correspond to the NiDB-assigned series number, or be taken from the DICOM header |
| SeriesUID | string | | From the SeriesUID DICOM tag |
| BehavioralFileCount 🟡 | number | | Total number of behavioral files (including files in subdirectories) |
| BehavioralSize 🟡 | number | | Size of the behavioral data, in bytes |
| FileCount 🟡 | number | | Total number of files (including files in subdirectories) |
| Size 🟡 | number | | Size of the data, in bytes |

🔵 Primary key  🔴 Required  🟡 Computed (squirrel writer/reader should handle these variables)

JSON file

Extended series parameters are stored as a JSON object in data/<SubjectID>/<StudyNum>/<SeriesNum>/params.json.

Directory structure

Files associated with this section are stored in the following directory, where <SubjectID>, <StudyNum>, and <SeriesNum> are the actual subject ID, study number, and series number; for example /data/S1234ABC/1/1.

/data/<SubjectID>/<StudyNum>/<SeriesNum>

Behavioral data is stored in

/data/<SubjectID>/<StudyNum>/<SeriesNum>/beh

Example package

Package contents (file and directory structure)

/
/squirrel.json
/data
/data/6028
/data/6028/1
/data/6028/1/1
/data/6028/1/1/6028_1_1_00001.nii.gz
/data/6028/1/2
/data/6028/1/2/6028_1_2_00001.nii.gz
/data/6028/1/3
/data/6028/1/3/6028_1_3_00001.nii.gz
/data/6028/1/4
/data/6028/1/4/6028_1_4_00001.nii.gz

... <break> ...

/data/7998/1/11
/data/7998/1/11/7998_1_11_00001.nii.gz
/data/7998/1/12
/data/7998/1/12/7998_1_12_00001.nii.gz

squirrel.json

{
    "TotalFileCount": 3342,
    "TotalSize": 25072523595,
    "data": {
        "SubjectCount": 217,
        "subjects": [
            {
                "AlternateIDs": [
                    ""
                ],
                "DateOfBirth": "",
                "Ethnicity1": "nothispanic",
                "Ethnicity2": "black",
                "GUID": "",
                "Gender": "F",
                "Notes": "",
                "Sex": "F",
                "StudyCount": 1,
                "SubjectID": "6028",
                "VirtualPath": "data/6028",
                "studies": [
                    {
                        "AgeAtStudy": 0,
                        "DayNumber": 0,
                        "Description": "Scan",
                        "Equipment": "MR-101",
                        "Height": 0,
                        "Modality": "MR",
                        "Notes": "",
                        "SeriesCount": 11,
                        "StudyDatetime": "2012-02-13 12:54:05",
                        "StudyNumber": 1,
                        "StudyUID": "",
                        "TimePoint": 0,
                        "VirtualPath": "data/6028/1",
                        "VisitType": "",
                        "Weight": 96.6151871001,
                        "series": [
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "localizer",
                                "FileCount": 2,
                                "Protocol": "localizer",
                                "Run": 1,
                                "SequenceNumber": 1,
                                "SeriesDatetime": "2012-02-13 12:54:37",
                                "SeriesNumber": 1,
                                "SeriesUID": "",
                                "Size": 57512,
                                "VirtualPath": "data/6028/1/1"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "ep2d_REST210",
                                "FileCount": 1,
                                "Protocol": "ep2d_REST210",
                                "Run": 1,
                                "SequenceNumber": 2,
                                "SeriesDatetime": "2012-02-13 12:55:47",
                                "SeriesNumber": 3,
                                "SeriesUID": "",
                                "Size": 27891631,
                                "VirtualPath": "data/6028/1/3"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "bas_MoCoSeries",
                                "FileCount": 1,
                                "Protocol": "ep2d_REST210",
                                "Run": 1,
                                "SequenceNumber": 3,
                                "SeriesDatetime": "2012-02-13 12:55:47",
                                "SeriesNumber": 4,
                                "SeriesUID": "",
                                "Size": 27951359,
                                "VirtualPath": "data/6028/1/4"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "intermediate t-Map",
                                "FileCount": 1,
                                "Protocol": "ep2d_REST210",
                                "Run": 1,
                                "SequenceNumber": 4,
                                "SeriesDatetime": "2012-02-13 12:56:20",
                                "SeriesNumber": 5,
                                "SeriesUID": "",
                                "Size": 28907911,
                                "VirtualPath": "data/6028/1/5"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "Mean_&_t-Maps",
                                "FileCount": 1,
                                "Protocol": "ep2d_REST210",
                                "Run": 1,
                                "SequenceNumber": 5,
                                "SeriesDatetime": "2012-02-13 13:01:47",
                                "SeriesNumber": 8,
                                "SeriesUID": "",
                                "Size": 234775,
                                "VirtualPath": "data/6028/1/8"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "MPRAGE",
                                "FileCount": 2,
                                "Protocol": "MPRAGE",
                                "Run": 1,
                                "SequenceNumber": 6,
                                "SeriesDatetime": "2012-02-13 13:11:32",
                                "SeriesNumber": 9,
                                "SeriesUID": "",
                                "Size": 21844580,
                                "VirtualPath": "data/6028/1/9"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "MPRAGE_repeat",
                                "FileCount": 2,
                                "Protocol": "MPRAGE_repeat",
                                "Run": 1,
                                "SequenceNumber": 7,
                                "SeriesDatetime": "2012-02-13 13:21:35",
                                "SeriesNumber": 10,
                                "SeriesUID": "",
                                "Size": 21587804,
                                "VirtualPath": "data/6028/1/10"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "MPRAGE",
                                "FileCount": 2,
                                "Protocol": "MPRAGE",
                                "Run": 1,
                                "SequenceNumber": 8,
                                "SeriesDatetime": "2012-02-13 13:31:08",
                                "SeriesNumber": 11,
                                "SeriesUID": "",
                                "Size": 21621118,
                                "VirtualPath": "data/6028/1/11"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "B1-Callibration Head",
                                "FileCount": 2,
                                "Protocol": "B1-Callibration Head",
                                "Run": 1,
                                "SequenceNumber": 9,
                                "SeriesDatetime": "2012-02-13 13:32:00",
                                "SeriesNumber": 12,
                                "SeriesUID": "",
                                "Size": 2223871,
                                "VirtualPath": "data/6028/1/12"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "B1-Calibration Body",
                                "FileCount": 2,
                                "Protocol": "B1-Calibration Body",
                                "Run": 1,
                                "SequenceNumber": 10,
                                "SeriesDatetime": "2012-02-13 13:33:32",
                                "SeriesNumber": 13,
                                "SeriesUID": "",
                                "Size": 3048390,
                                "VirtualPath": "data/6028/1/13"
                            },
                            {
                                "BIDSEntity": "",
                                "BIDSPhaseEncodingDirection": "",
                                "BIDSRun": "",
                                "BIDSSuffix": "",
                                "BIDSTask": "",
                                "BehavioralFileCount": 0,
                                "BehavioralSize": 0,
                                "Description": "Axial PD-T2 TSE",
                                "FileCount": 3,
                                "Protocol": "Axial PD-T2 TSE",
                                "Run": 1,
                                "SequenceNumber": 11,
                                "SeriesDatetime": "2012-02-13 13:35:29",
                                "SeriesNumber": 14,
                                "SeriesUID": "",
                                "Size": 9712437,
                                "VirtualPath": "data/6028/1/14"
                            }
                        ]
                    }
                ]
            },
            
... <break> ...

    },
    "package": {
        "Changes": "",
        "DataFormat": "nifti4dgz",
        "Datetime": "2025-03-11 17:24:26",
        "Description": "MR data from the major city site for the large project",
        "License": "",
        "Notes": "",
        "PackageFormat": "nifti4dgz",
        "PackageName": "Large dataset from major city",
        "Readme": "",
        "SeriesDirectoryFormat": "orig",
        "SquirrelBuild": "2025.2.350",
        "SquirrelVersion": "1.0",
        "StudyDirectoryFormat": "orig",
        "SubjectDirectoryFormat": "orig"
    }
}
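TotalFileCount and TotalSize in the package root above are computed values. A sketch of how a writer might derive them, counting every data file under the package and excluding .json files (per the package-root table):

```python
from pathlib import Path

def compute_totals(package_root: str):
    # TotalFileCount / TotalSize: all files in the package except .json files
    files = [p for p in Path(package_root).rglob("*")
             if p.is_file() and p.suffix != ".json"]
    return len(files), sum(p.stat().st_size for p in files)

# build a tiny package on disk to demonstrate
root = Path("pkg_demo")
(root / "data" / "6028" / "1" / "1").mkdir(parents=True, exist_ok=True)
(root / "squirrel.json").write_text("{}")
(root / "data" / "6028" / "1" / "1" / "6028_1_1_00001.nii.gz").write_bytes(b"\0" * 1024)

count, size = compute_totals(root)
print(count, size)  # 1 1024
```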

Squirrel data sharing format

The squirrel data format allows sharing of all information necessary to recreate an experiment and its results, from raw to analyzed data, and experiment parameters to analysis pipelines.

The squirrel format specification is implemented in NiDB. A DICOM-to-squirrel converter and a squirrel validator are also available.


squirrel utilities

The squirrel command line program

The squirrel command line program allows conversion of DICOM to squirrel and BIDS to squirrel, modification of existing squirrel packages, and listing of information from packages.

Installing squirrel utilities

# RHEL/CentOS
sudo yum localinstall --nogpgcheck squirrel-xxx.xx.xxx-1.elx.x86_64.rpm

# Debian/Ubuntu (p7zip is required by squirrel)
sudo apt install p7zip
sudo dpkg -i squirrel_xxxx.xx.xxx.deb

Too many open files error

If you encounter an error "too many open files", or you are unable to write squirrel packages, try increasing the open files limit within Linux

# increase open file limit (temporarily for the current session)
ulimit -n 2048

# increase open file limit (permanently)
# append these lines to /etc/security/limits.conf
*               soft    nofile            2048
*               hard    nofile            2048

Basic Command Line Usage

Convert DICOM to squirrel

# Default DICOM to squirrel conversion
squirrel dicom2squirrel /path/to/dicoms outPackageName.sqrl

# Specify the output format
squirrel dicom2squirrel /path/to/dicoms outPackage.sqrl --dataformat nifti4dgz

# Specify the package directory format
squirrel dicom2squirrel /path/to/dicoms outPackage.sqrl --dirformat seq

Convert BIDS to squirrel

squirrel bids2squirrel /path/to/bids outPackage.sqrl

Modify existing squirrel package

# add a subject to a package
squirrel modify /path/to/package.sqrl --add subject --datapath /path/to/new/data --objectdata 'SubjectID=S1234ABC&DateOfBirth=1999-12-31&Sex=M&Gender=M'

# remove a study (remove study 1 from subject S1234ABC)
squirrel modify /path/to/package.sqrl --remove study --subjectid S1234ABC --objectid 1

List information about a squirrel package

#list package information
[user@hostname]$ squirrel info ~/testing.sqrl
Squirrel Package: /home/nidb/testing.sqrl
  DataFormat: orig
  Date: Thu May 23 16:16:16 2024
  Description: Dataset description
  DirectoryFormat (subject, study, series): orig, orig, orig
  FileMode: ExistingPackage
  Files:
    314 files
    19181701506 bytes (unzipped)
  PackageName: Squirrel package
  SquirrelBuild: 2024.5.218
  SquirrelVersion: 1.0
  Objects:
    ├── 8 subjects
    │  ├── 8 measures
    │  ├── 0 drugs
    │  ├── 11 studies
    │  ├──── 314 series
    │  └──── 0 analyses
    ├── 0 experiments
    ├── 0 pipelines
    ├── 0 group analyses
    └── 0 data dictionary
    
# list subjects
[user@hostname]$ squirrel info ~/testing.sqrl --object subject
Subjects: sub-ASDS3050KAE sub-ASDS6316BWH sub-ASDS6634GJK sub-ASDS7478SKA sub-ASDS8498GQDCBT sub-HCS8276XPS sub-S4328FSC sub-S7508DDH

# list studies for a specific subject
[user@hostname]$ squirrel info ~/testing.sqrl --object study --subjectid sub-ASDS3050KAE
Studies: 1 2

#list all subjects as CSV format
[user@hostname]$ squirrel info ~/testing.sqrl --object subject --csv
ID, AlternateIDs, DateOfBirth, Ethnicity1, Ethnicity2, GUID, Gender, Sex
"sub-ASDS3050KAE","","","","","","U","U"
"sub-ASDS6316BWH","","","","","","U","U"
"sub-ASDS6634GJK","","","","","","U","U"
"sub-ASDS7478SKA","","","","","","U","U"
"sub-ASDS8498GQDCBT","","","","","","U","U"
"sub-HCS8276XPS","","","","","","U","U"
"sub-S4328FSC","","","","","","",""
"sub-S7508DDH","","","","","","",""

package

JSON object

This object contains information about the squirrel package.

JSON variables

| Variable | Type | Default | Description |
| --- | --- | --- | --- |
| Changes | string | | Any CHANGES files. |
| DataFormat | string | orig | Data format in which imaging data is written. Squirrel should attempt to convert to the specified format if possible: orig, anon, anonfull, nifti3d, nifti3dgz, nifti4d, nifti4dgz (see details below). |
| Datetime | datetime | | Datetime the package was created. |
| Description | string | | Longer description of the package. |
| License | string | | Any sharing or license notes, or LICENSE files. |
| NiDBVersion | string | | The NiDB version which wrote the package. |
| Notes | JSON object | | See details below. |
| PackageName | string | | Short name of the package. |
| PackageFormat | string | squirrel | Always squirrel. |
| Readme | string | | Any README files. |
| SeriesDirectoryFormat | string | orig | orig, seq (see details below). |
| SquirrelVersion | string | | Squirrel format version. |
| SquirrelBuild | string | | Build version of the squirrel library and utilities. |
| StudyDirectoryFormat | string | orig | orig, seq (see details below). |
| SubjectDirectoryFormat | string | orig | orig, seq (see details below). |

Variable options

SubjectDirectoryFormat, StudyDirectoryFormat, SeriesDirectoryFormat

  • orig - Original subject, study, series directory structure format. Example S1234ABC/1/1

  • seq - Sequential. Zero-padded sequential numbers. Example 00001/0001/00001
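The seq layout can be sketched as below. Note an assumption: the zero-padding widths (5, 4, 5) are inferred from the example 00001/0001/00001 above and are not otherwise spelled out by the spec.

```python
def seq_path(subject_num: int, study_num: int, series_num: int) -> str:
    # zero-padded sequential directory path, e.g. 00001/0001/00001
    # padding widths (5/4/5) inferred from the example in this section
    return f"{subject_num:05d}/{study_num:04d}/{series_num:05d}"

print(seq_path(1, 1, 1))     # 00001/0001/00001
print(seq_path(12, 3, 142))  # 00012/0003/00142
```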

DataFormat

  • nifti3d - Nifti 3D format

    • Example file001.nii, file002.nii, file003.nii

  • nifti3dgz - gzipped Nifti 3D format

    • Example file001.nii.gz, file002.nii.gz, file003.nii.gz

  • nifti4d - Nifti 4D format

    • Example file.nii

  • nifti4dgz - gzipped Nifti 4D format

    • Example file.nii.gz

Notes

Notes about the package are stored here. This includes import and export logs, and notes from imported files. This is generally a freeform object, but notes can be divided into sections.

Section
Description

import

Any notes related to import. BIDS files such as README and CHANGES are stored here.

merge

Any notes related to the merging of datasets, such as information about the renumbering of subject IDs.

export

Any notes related to the export process.

Directory structure

Files associated with this section are stored in the following directory

/
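A minimal package object might look like the following sketch; only a handful of the variables above are shown, and the name and description values are invented for illustration:

```python
import json
from datetime import datetime, timezone

# Minimal package object; values below are illustrative, not normative.
package = {
    "PackageName": "ExamplePackage",   # invented name
    "PackageFormat": "squirrel",       # always "squirrel"
    "SquirrelVersion": "1.0",
    "DataFormat": "orig",              # default
    "SubjectDirectoryFormat": "orig",  # default
    "StudyDirectoryFormat": "orig",    # default
    "SeriesDirectoryFormat": "orig",   # default
    "Datetime": datetime.now(timezone.utc).strftime("%Y-%m-%d %H:%M:%S"),
    "Description": "An example squirrel package.",
}

squirrel_json = json.dumps(package, indent=2)
```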

params

Separate JSON file - params.json

Series collection parameters are stored in a separate JSON file called params.json, stored in the series directory. The JSON object is an array of key-value pairs and can be used to store data collection parameters. All DICOM tags are acceptable parameters; see https://exiftool.org/TagNames/DICOM.html for a list of available DICOM tags. Variable keys can be either the hexadecimal format (ID) or string format (Name), for example 0018:1030 or ProtocolName. The params object can contain any number of key/value pairs.

JSON variables

Variable
Description
Example

{Key:Value}

A unique key, sometimes derived from the DICOM header.

Protocol: T1w
FieldStrength: 3.0

Directory structure

Files associated with this section are stored in the following directory. SubjectID, StudyNum, and SeriesNum are the actual subject ID, study number, and series number. For example /data/S1234ABC/1/1.

/data/<SubjectID>/<StudyNum>/<SeriesNum>/params.json
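A sketch of writing a params.json into the series directory; the keys are DICOM tag names and the values are illustrative (the temp directory stands in for the package root):

```python
import json
import tempfile
from pathlib import Path

# Illustrative series parameters; keys are DICOM tag names or IDs.
params = {"Protocol": "T1w", "FieldStrength": "3.0", "0018:1030": "T1w"}

# A throwaway tree standing in for the package root
root = Path(tempfile.mkdtemp())
series_dir = root / "data" / "S1234ABC" / "1" / "1"
series_dir.mkdir(parents=True)
(series_dir / "params.json").write_text(json.dumps(params, indent=2))

loaded = json.loads((series_dir / "params.json").read_text())
print(loaded["Protocol"])  # T1w
```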

observations

JSON array

Observations are collected from a participant in response to an experiment.

JSON variables

Variable
Type
Default
Description

DateEnd

datetime

End datetime of the observation.

DateRecordCreate

datetime

Date the record was created in the current database. The original record may have been imported from another database.

DateRecordEntry

datetime

Date the record was first entered into a database.

DateRecordModify

datetime

Date the record was modified in the current database.

DateStart

datetime

Start datetime of the observation.

Description

string

Longer description of the measure.

Duration

number

Duration of the measure in seconds, if known.

InstrumentName

string

Name of the instrument associated with this measure.

ObservationName

string

Name of the observation.

Notes

string

Detailed notes.

Rater

string

Name of the rater.

Value

string

Value (string or number).
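A single observation entry might be assembled as below; the instrument, rater, and values are invented, and Duration is derived from the start/end datetimes:

```python
from datetime import datetime

# One observations entry; field names follow the table above, values invented.
observation = {
    "ObservationName": "MADRS total",
    "InstrumentName": "MADRS",
    "DateStart": "2022-07-04 12:00:00",
    "DateEnd": "2022-07-04 12:30:00",
    "Rater": "A. Experimenter",
    "Value": "14",
}

# Duration in seconds, computed from the start/end datetimes
fmt = "%Y-%m-%d %H:%M:%S"
observation["Duration"] = (
    datetime.strptime(observation["DateEnd"], fmt)
    - datetime.strptime(observation["DateStart"], fmt)
).total_seconds()
print(observation["Duration"])  # 1800.0
```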

studies

JSON array

An array of imaging studies, with information about each study. An imaging study (or imaging session) is defined as a set of related series collected on a piece of equipment during a time period. An example is a research participant receiving an MRI exam. The participant goes into the scanner, has several MR images collected, and comes out. The time spent in the scanner and all of the data collected from it is considered to be a study.

JSON variables

Directory structure

Files associated with this section are stored in the following directory. SubjectID and StudyNum are the actual subject ID and study number, for example /data/S1234ABC/1.

/data/<SubjectID>/<StudyNum>

Download squirrel from https://github.com/gbook/squirrel/releases.

AgeAtStudy

number

Subject’s age in years at the time of the study.

Datetime

datetime

Date of the study.

DayNumber

number

For repeated studies and clinical trials, this indicates the day number of this study in relation to time 0.

Description

string

Study description.

Equipment

string

Equipment name, on which the imaging session was collected.

Height

number

Height in meters of the subject at the time of the study.

Modality

string

Defines the type of data. Valid squirrel modalities are derived from the DICOM standard and from NiDB modalities. Modality can be any string, but some squirrel readers may not correctly interpret the modality or may convert it to “other” or “unknown”. See the full list of modalities.

Notes

string

Any notes about the study

StudyNumber

number

Study number. May be sequential or correspond to NiDB assigned study number.

StudyUID

string

DICOM field StudyUID.

TimePoint

number

Similar to day number, but this should be an ordinal number.

VisitType

string

Type of visit. ex: Pre, Post.

Weight

number

Weight in kilograms of the subject at the time of the study.

AnalysisCount

number

Number of analyses for this study.

SeriesCount

number

Number of series for this study.

VirtualPath

string

Relative path to the data within the package.

series

JSON array

Array of series.

analyses

JSON array

Array of analyses.

subjects

JSON array

This object is an array of subjects, with information about each subject.

JSON variables

Variable
Type
Default
Description (and possible values)

AlternateIDs

JSON array

List of alternate IDs.

DateOfBirth

date

Subject’s date of birth. Used to calculate age-at-study. Value can be YYYY-00-00 to store year only, or YYYY-MM-00 to store year and month only.
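Since DateOfBirth may be partial (YYYY-00-00 or YYYY-MM-00), an age-at-study calculation has to tolerate zero month/day fields. A sketch under the assumption that a missing month or day falls back to mid-year or mid-month, so the age is approximately right; this fallback is not part of the spec:

```python
from datetime import date

# Age-at-study from a possibly partial DateOfBirth ("YYYY-00-00", "YYYY-MM-00").
# Fallbacks (July / 15th) are an assumption, not specified by the format.
def age_at_study(dob: str, study: date) -> float:
    y, m, d = (int(part) for part in dob.split("-"))
    m = m or 7   # unknown month -> mid-year
    d = d or 15  # unknown day -> mid-month
    return round((study - date(y, m, d)).days / 365.25, 1)

print(age_at_study("1990-00-00", date(2022, 7, 4)))  # 32.0
```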

Gender

char

Gender.

GUID

string

Globally unique identifier, from the NIMH Data Archive (NDA).

Ethnicity1

string

NIH defined ethnicity: Usually hispanic, non-hispanic

Ethnicity2

string

NIH defined race: americanindian, asian, black, hispanic, islander, white

Notes

string

Notes about this subject

Sex

char

Sex at birth (F,M,O,U).

SubjectID

string

Unique ID of this subject. Each subject ID must be unique within the package.

InterventionCount

number

Number of intervention objects.

ObservationCount

number

Number of observation objects.

StudyCount

number

Number of studies.

VirtualPath

string

Relative path to the data within the package.

studies

JSON array

Array of imaging studies/sessions.

observations

JSON array

Array of observations.

interventions

JSON array

Array of interventions.

Directory structure

Files associated with this section are stored in the following directory

/data/<SubjectID>
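A subject entry with its nested arrays might be sketched as below. The ID is invented, the count fields are the computed variables a squirrel writer would fill in, and the VirtualPath value is an assumption based on the directory structure above:

```python
# One subjects entry; the ID and values are invented for illustration.
subject = {
    "SubjectID": "S1234ABC",
    "AlternateIDs": ["sub-01"],
    "Sex": "U",
    "Gender": "U",
    "studies": [],
    "observations": [],
    "interventions": [],
}

# Computed variables a squirrel writer would fill in
subject["StudyCount"] = len(subject["studies"])
subject["ObservationCount"] = len(subject["observations"])
subject["InterventionCount"] = len(subject["interventions"])
subject["VirtualPath"] = f"data/{subject['SubjectID']}"  # assumed layout
print(subject["VirtualPath"])  # data/S1234ABC
```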

interventions

JSON array

Interventions represent any substances or procedures administered to a participant, whether through a clinical trial or through the participant’s use of prescription or recreational drugs. Detailed variables are available to record exactly how much of a drug is administered and when. This allows searching by dose amount or other variables.

JSON variables

Recording drug administration

The following examples show how common dosing language maps to the squirrel storage format.

esomeprazole 20mg capsule by mouth daily

2 puffs atrovent inhaler every 6 hours

analysis

JSON array

Analysis results, run on an imaging study level. Can contain files, directories, and variables.

JSON variables


esomeprazole 20mg capsule by mouth daily

  • InterventionName: esomeprazole

  • InterventionClass: PPI

  • DoseAmount: 20

  • DoseUnit: mg

  • DoseFrequency: daily

  • AdministrationRoute: oral

2 puffs atrovent inhaler every 6 hours

  • InterventionName: ipratropium

  • InterventionClass: bronchodilator

  • DoseAmount: 2

  • DoseUnit: puffs

  • DoseFrequency: every 6 hours

  • AdministrationRoute: inhaled
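Assembled as a complete interventions entry, the esomeprazole example might look like the sketch below; field names follow the variable table in this section, and the dates and rater are invented:

```python
# One interventions entry for "esomeprazole 20mg capsule by mouth daily".
# Dates and rater are invented for illustration.
intervention = {
    "InterventionName": "esomeprazole",
    "InterventionClass": "PPI",
    "DoseString": "esomeprazole 20mg capsule by mouth daily",
    "DoseAmount": 20,
    "DoseUnit": "mg",
    "DoseFrequency": "daily",
    "AdministrationRoute": "oral",
    "DateStart": "2022-07-01 08:00:00",  # invented
    "DateEnd": "2022-07-28 08:00:00",    # invented
    "Rater": "A. Experimenter",          # invented
}
```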


AdministrationRoute

string

Drug entry route (oral, IV, unknown, etc).

DateRecordCreate

string

Date the record was created in the current database. The original record may have been imported from another database.

DateRecordEntry

string

Date the record was first entered into a database.

DateRecordModify

string

Date the record was modified in the current database.

DateEnd

datetime

Datetime the intervention was stopped.

DateStart

datetime

Datetime the intervention was started.

Description

string

Longer description.

DoseString

string

Full dosing string. Examples tylenol 325mg twice daily by mouth, or 5g marijuana inhaled by volcano

DoseAmount

number

In combination with other dose variables, the quantity of the drug.

DoseFrequency

string

Description of the frequency of administration.

DoseKey

string

For clinical trials, the dose key.

DoseUnit

string

mg, g, ml, tablets, capsules, etc.

InterventionClass

string

Drug class.

InterventionName

string

Name of the intervention.

Notes

string

Notes about drug.

Rater

string

Rater/experimenter name.

Variable

Type

Default

Description

DateStart

date

Datetime of the start of the analysis.

DateEnd

date

Datetime of the end of the analysis.

DateClusterStart

date

Datetime the job began running on the cluster.

DateClusterEnd

date

Datetime the job finished running on the cluster.

Hostname

string

If run on a cluster, the hostname of the node on which the analysis ran.

PipelineName

string

Name of the pipeline used to generate these results.

PipelineVersion

number

1

Version of the pipeline used.

RunTime

number

0

Elapsed wall time, in seconds, to run the analysis after setup.

SeriesCount

number

0

Number of series downloaded/used to perform analysis.

SetupTime

number

0

Elapsed wall time, in seconds, to copy data and set up analysis.

Status

string

Status, should always be ‘complete’.

StatusMessage

string

Last running status message.

Successful

bool

Analysis ran to completion without error and expected files were created.

Size

number

Size in bytes of the analysis.

VirtualPath

string

Relative path to the data within the package.

group-analysis

JSON array

This object is an array of group analyses. A group analysis is considered an analysis involving more than one subject.

JSON variables

Variable
Type
Default
Description

Datetime

datetime

Datetime of the group analysis.

Description

string

Description.

GroupAnalysisName

string

Name of this group analysis.

Notes

string

Notes about the group analysis.

FileCount

number

Number of files in the group analysis.

Size

number

Size in bytes of the analysis.

VirtualPath

string

Path to the group analysis data within the squirrel package.

Directory structure

Files associated with this section are stored in the following directory, where <GroupAnalysisName> is the name of the analysis.

/group-analysis/<GroupAnalysisName>/

pipelines

JSON array

Pipelines are the methods used to analyze data after it has been collected. In other words, the experiment provides the methods to collect the data and the pipelines provide the methods to analyze the data once it has been collected.

JSON Variables

Variable
Type
Default
Description

ClusterType

string

Compute cluster engine (sge or slurm).

ClusterUser

string

Submit username.

ClusterQueue

string

Queue to submit jobs.

ClusterSubmitHost

string

Hostname to submit jobs.

CompleteFiles

JSON array

JSON array of completed files, with paths relative to the analysis root.

CreateDate

datetime

Date the pipeline was created.

DataCopyMethod

string

How the data is copied to the analysis directory: cp, softlink, hardlink.

DependencyDirectory

string

DependencyLevel

string

DependencyLinkType

string

Description

string

Longer pipeline description.

DirectoryStructure

string

Directory

string

Directory where the analyses for this pipeline will be stored. Leave blank to use the default location.

Group

string

ID or name of a group on which this pipeline will run

GroupType

string

Either subject or study

Level

number

subject-level analysis (1) or group-level analysis (2).

MaxWallTime

number

Maximum allowed clock (wall) time in minutes for the analysis to run.

ClusterMemory

number

Amount of memory in GB requested for a running job.

PipelineName

string

Pipeline name.

Notes

string

Extended notes about the pipeline

NumberConcurrentAnalyses

number

1

Number of analyses allowed to run at the same time. This number is managed by NiDB and is different from the grid engine queue size.

ClusterNumberCores

number

1

Number of CPU cores requested for a running job.

ParentPipelines

string

Comma separated list of parent pipelines.

ResultScript

string

Executable script to be run at completion of the analysis to find and insert results back into NiDB.

SubmitDelay

number

Delay in hours, after the study datetime, to submit to the cluster. Allows time to upload behavioral data.

TempDirectory

string

The path to a temporary directory if it is used, on a compute node.

UseProfile

bool

true if using the profile option, false otherwise.

UseTempDirectory

bool

true if using a temporary directory, false otherwise.

Version

number

1

Version of the pipeline.

PrimaryScript

string

SecondaryScript

string

DataStepCount

number

Number of data steps.

VirtualPath

string

Path of this pipeline within the squirrel package.

dataSteps

JSON array

Array of data steps.

Directory structure

Files associated with this section are stored in the following directory. PipelineName is the unique name of the pipeline.

/pipelines/<PipelineName>

experiments

JSON array

Experiments describe how data was collected from the participant. In other words, the methods used to get the data. This does not describe how to analyze the data once it’s collected.

JSON variables

Variable
Type
Default
Description

ExperimentName

string

Unique name of the experiment.

FileCount

number

Number of files contained in the experiment.

Size

number

Size, in bytes, of the experiment files.

VirtualPath

string

Path to the experiment within the squirrel package.

Directory structure

Files associated with this section are stored in the following directory, where ExperimentName is the unique name of the experiment.

/experiments/<ExperimentName>
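The computed FileCount and Size variables can be derived by walking the experiment directory. A sketch using a throwaway temp tree in place of a real package; the file names and contents are invented:

```python
import tempfile
from pathlib import Path

# Throwaway tree standing in for /experiments/<ExperimentName>
root = Path(tempfile.mkdtemp())
exp = root / "experiments" / "TaskA"
exp.mkdir(parents=True)
(exp / "design.txt").write_text("stimulus timing")          # invented content
(exp / "stim.csv").write_text("onset,duration\n0,2\n")      # invented content

# Compute the FileCount and Size variables by walking the directory
files = [p for p in exp.rglob("*") if p.is_file()]
experiment = {
    "ExperimentName": "TaskA",
    "FileCount": len(files),
    "Size": sum(p.stat().st_size for p in files),
    "VirtualPath": "experiments/TaskA",
}
print(experiment["FileCount"])  # 2
```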

data-dictionary

JSON object

The data-dictionary object stores information describing mappings or any other descriptive information about the data. This can also contain any information that doesn't fit elsewhere in the squirrel package, such as project descriptions.

Examples include mapping numeric values (1,2,3,...) to descriptions (F, M, O, ...)

JSON variables

data-dictionary

Variable
Type
Default
Description

DataDictionaryName

string

Name of this data dictionary.

NumFiles

number

Number of files contained in the data dictionary.

Size

number

Size, in bytes, of the data dictionary files.

VirtualPath

string

Path to the data-dictionary within the squirrel package.

data-dictionary-item

JSON array

Array of data dictionary items. See next table.

data-dictionary-item

Variable
Type
Default
Description

VariableType

string

Type of variable.

VariableName

string

Name of the variable.

Description

string

Description of the variable.

KeyValueMapping

string

List of possible key/value mappings in the format key1=value1, key2=value2. Example 1=Female, 2=Male
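The KeyValueMapping string can be parsed into a lookup table with a few lines; a sketch (the parsing rules assumed here are simply "comma between pairs, equals within a pair"):

```python
# Parse a KeyValueMapping string like "1=Female, 2=Male" into a dict.
def parse_mapping(s: str) -> dict:
    return {k.strip(): v.strip()
            for k, v in (pair.split("=", 1) for pair in s.split(","))}

mapping = parse_mapping("1=Female, 2=Male")
print(mapping)  # {'1': 'Female', '2': 'Male'}
```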

ExpectedTimepoints

number

Number of expected timepoints. For example, the study is expected to have 5 records of a variable.

RangeLow

number

For numeric values, the lower limit.

RangeHigh

number

For numeric values, the upper limit.

Directory structure

Files associated with this section are stored in the following directory.

/data-dictionary

Building Python Wrapper

How to build a Python wrapper for the squirrel library

Prerequisites

On RHEL8 Linux, install SWIG and the Python development headers:

sudo yum install swig python3-devel

Create the wrapper

swig -python gfg.i
gcc -c -fpic squirrel_wrap.c squirrel.cpp -I/usr/include/python3.6m

data-steps

JSON array

dataSpec describes the criteria used to find data when searching a database (NiDB, for example, since this pipeline is usually connected to a database). dataSpec is a JSON array of the following variables. Search variables specify how to find data in a database, and Export variables specify how the data is exported.

JSON variables

Modalities

This is a list of common modalities available within squirrel. However, squirrel does not restrict modality codes, so any modality could be used in a dataset.

Pipeline scripts

Details about how pipeline scripts are formatted for squirrel and NiDB

Pipeline scripts are meant to run in bash. They are traditionally formatted to run with a RHEL distribution such as CentOS or Rocky Linux. The scripts are bash compliant, but have some nuances that allow them to run more effectively under an NiDB pipeline setup.

The bash script is interpreted to run on a cluster. Some commands are added to your script to allow it to check in and give status to NiDB as it is running.

The script

Example script...

Before being submitted to the cluster, the script is passed through the NiDB interpreter, and the actual bash script will look like below. This script is running on subject S2907GCS, study 8, under the freesurferUnified6 pipeline. This script will then be submitted to the cluster.

... script is submitted to the cluster

How to interpret the altered script

  1. Details for the grid engine are added at the beginning

    • This includes max wall time, output directories, run-as user, etc

  2. Each command is changed to include logging and check-ins

    • nidb cluster -u pipelinecheckin checks the current step in to the database. This is displayed on the Pipelines --> Analysis webpage

    • Each command is also echoed to the grid engine log file, so you can check that log file for the status

    • The output of each command is redirected to a separate log file on the last line using the >> operator

Pipeline Variables

There are a few pipeline variables that are interpreted by NiDB when running. The variable is replaced with the value before the final script is written out. Each study on which a pipeline runs will have a different script, with different paths, IDs, and other variables listed below.

There is no need for a shebang line at the beginning (for example #!/bin/sh), because this script is only interested in the commands being run.

Modality Code
Description

AR

Autorefraction

ASMT

Content Assessment Results

AU

Audio

AUDIO

Audio

BDUS

Bone Densitometry (ultrasound)

BI

Biomagnetic Imaging

BMD

Bone Densitometry (X-ray)

CONSENT

Scanned image of a consent form

CR

Computed Radiography

CT

Computed Tomography

CTPROTOCOL

CT Protocol (Performed)

DG

Diaphanography

DOC

Document

DX

Digital Radiography

ECG

Electrocardiography

EEG

Electroencephalography

EPS

Cardiac Electrophysiology

ES

Endoscopy

ET

Eye tracking

FID

Fiducials

GM

General Microscopy

GSR

Galvanic skin response

HC

Hard Copy

HD

Hemodynamic Waveform

IO

Intra-Oral Radiography

IOL

Intraocular Lens Data

IVOCT

Intravascular Optical Coherence Tomography

IVUS

Intravascular Ultrasound

KER

Keratometry

KO

Key Object Selection

LEN

Lensometry

LS

Laser Surface Scan

MEG

Magnetoencephalography

MG

Mammography

MR

Magnetic Resonance

M3D

Model for 3D Manufacturing

NM

Nuclear Medicine

OAM

Ophthalmic Axial Measurements

OCT

Optical Coherence Tomography (non-Ophthalmic)

OP

Ophthalmic Photography

OPT

Ophthalmic Tomography

OPTBSV

Ophthalmic Tomography B-scan Volume Analysis

OPTENF

Ophthalmic Tomography En Face

OPV

Ophthalmic Visual Field

OSS

Optical Surface Scan

OT

Other

PLAN

Plan

PR

Presentation State

PT

Positron Emission Tomography (PET)

PX

Panoramic X-Ray

REG

Registration

RESP

Respiratory Waveform

RF

Radio Fluoroscopy

RG

Radiographic Imaging (conventional film/screen)

RTDOSE

Radiotherapy Dose

RTIMAGE

Radiotherapy Image

RTINTENT

Radiotherapy Intent

RTPLAN

Radiotherapy Plan

RTRAD

RT Radiation

RTRECORD

RT Treatment Record

RTSEGANN

Radiotherapy Segment Annotation

RTSTRUCT

Radiotherapy Structure Set

RWV

Real World Value Map

SEG

Segmentation

SM

Slide Microscopy

SMR

Stereometric Relationship

SR

Structured reporting (SR) Document

SRF

Subjective Refraction

STAIN

Automated Slide Stainer

SURGERY

Pre-surgical mapping plan

TG

Thermography

US

Ultrasound

VA

Visual Acuity

VIDEO

Video

XA

X-Ray Angiography

XC

External-camera Photography

export FREESURFER_HOME=/opt/freesurfer-6.0     #  The Freesurfer home directory (version) you want to use
export FSFAST_HOME=/opt/freesurfer-6.0/fsfast     #  Not sure if these next two are needed but keep them just in case
export MNI_DIR=/opt/freesurfer-6.0/mni     #  Not sure if these next two are needed but keep them just in case
source $FREESURFER_HOME/SetUpFreeSurfer.sh     #  MGH's shell script that sets up Freesurfer to run
export SUBJECTS_DIR={analysisrootdir}     #  Point to the subject directory you plan to use - all FS data will go there
freesurfer > {analysisrootdir}/version.txt     # {NOLOG} get the freesurfer version
perl /opt/pipeline/ImportFreesurferData.pl {analysisrootdir}/data analysis     #  import data. the perl program allows importing of multiple T1s
recon-all -hippocampal-subfields-T1 -no-isrunning -all -notal-check -subjid analysis     #  Autorecon all {PROFILE}
if tail -n 1 {analysisrootdir}/analysis/scripts/recon-all-status.log | grep 'finished without error' ; then touch {analysisrootdir}/reconallsuccess.txt; fi     # {NOLOG} {NOCHECKIN}
recon-all -subjid analysis -qcache     #  do the qcache step {PROFILE}
#!/bin/sh
#$ -N freesurferUnified6
#$ -S /bin/bash
#$ -j y
#$ -o /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/
#$ -V
#$ -u onrc
#$ -l h_rt=72:00:00
LD_LIBRARY_PATH=/opt/pipeline/nidb/; export LD_LIBRARY_PATH;
echo Hostname: `hostname`
echo Username: `whoami`

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s started -m 'Cluster processing started'
cd /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6;

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 1 of 10'
# The Freesurfer home directory (version) you want to use
echo Running export FREESURFER_HOME=/opt/freesurfer-6.0
export FREESURFER_HOME=/opt/freesurfer-6.0 >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step1

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 2 of 10'
# Not sure if these next two are needed but keep them just in case
echo Running export FSFAST_HOME=/opt/freesurfer-6.0/fsfast
export FSFAST_HOME=/opt/freesurfer-6.0/fsfast >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step2

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 3 of 10'
# Not sure if these next two are needed but keep them just in case
echo Running export MNI_DIR=/opt/freesurfer-6.0/mni
export MNI_DIR=/opt/freesurfer-6.0/mni >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step3

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 4 of 10'
# MGH's shell script that sets up Freesurfer to run
echo Running source $FREESURFER_HOME/SetUpFreeSurfer.sh
source $FREESURFER_HOME/SetUpFreeSurfer.sh >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step4

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 5 of 10'
# Point to the subject directory you plan to use - all FS data will go there
echo Running export SUBJECTS_DIR=/home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6
export SUBJECTS_DIR=/home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6 >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step5

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 6 of 10'
# get the freesurfer version
echo Running freesurfer > /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/version.txt
freesurfer > /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/version.txt

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 7 of 10'
# import data. the perl program allows importing of multiple T1s
echo Running perl /opt/pipeline/ImportFreesurferData.pl /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/data analysis
perl /opt/pipeline/ImportFreesurferData.pl /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/data analysis >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step7

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 8 of 10'
# Autorecon all {PROFILE}
echo Running recon-all -hippocampal-subfields-T1 -no-isrunning -all -notal-check -subjid analysis
/usr/bin/time -v recon-all -hippocampal-subfields-T1 -no-isrunning -all -notal-check -subjid analysis >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step8
if tail -n 1 /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/analysis/scripts/recon-all-status.log | grep 'finished without error' ; then touch /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/reconallsuccess.txt; fi

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'processing step 10 of 10'
# do the qcache step {PROFILE}
echo Running recon-all -subjid analysis -qcache
/usr/bin/time -v recon-all -subjid analysis -qcache >> /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/Step10

/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'Processing result script'
# Running result script
echo Running perl /opt/pipeline/ParseFreesurferResults.pl -r /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6 -p /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/analysis/stats -a 3151385     #  dump results back into ado2 > /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/stepResults.log 2>&1
perl /opt/pipeline/ParseFreesurferResults.pl -r /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6 -p /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/analysis/stats -a 3151385     #  dump results back into ado2 > /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6/pipeline/stepResults.log 2>&1
chmod -Rf 777 /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6
/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'Updating analysis files'
/opt/pipeline/nidb/nidb cluster -u updateanalysis -a 3151385
/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s processing -m 'Checking for completed files'
/opt/pipeline/nidb/nidb cluster -u checkcompleteanalysis -a 3151385
/opt/pipeline/nidb/nidb cluster -u pipelinecheckin -a 3151385 -s complete -m 'Cluster processing complete'
chmod -Rf 777 /home/pipeline/onrc/data/pipeline/S2907GCS/8/freesurferUnified6

{NOLOG}

This does not append >> to the end of a command to log the output

{NOCHECKIN}

This does not prepend a command with a check in, and does not echo the command being run. This is useful (necessary) when running multi-line commands like for loops and if/then statements

{PROFILE}

This prepends the command with a profiler to output information about CPU and memory usage.

{analysisrootdir}

The full path to the analysis root directory. ex /home/user/thePipeline/S1234ABC/1/

{subjectuid}

The UID of the subject being analyzed. Ex S1234ABC

{studynum}

The study number of the study being analyzed. ex 2

{uidstudynum}

UID and studynumber together. ex S1234ABC2

{pipelinename}

The pipeline name

{studydatetime}

The study datetime. ex 2022-07-04 12:34:56

{first_ext_file}

Replaces the variable with the first file (alphabetically) found with the ext extension

{first_n_ext_files}

Replaces the variable with the first N files (alphabetically) found with the ext extension

{last_ext_file}

Replaces the variable with the last file (alphabetically) found with the ext extension

{all_ext_files}

Replaces the variable with all files (alphabetically) found with the ext extension

{command}

The command being run. ex ls -l

{workingdir}

The current working directory

{description}

The description of the command. This is anything following the #, also called a comment

{analysisid}

The analysisID of the analysis. This is useful when inserting analysis results, as the analysisID is required to do that

{subjectuids}

[Second level analysis] List of subjectIDs

{studydatetimes}

[Second level analysis] List of studyDateTimes in the group

{analysisgroupid}

[Second level analysis] The analysisID

{uidstudynums}

[Second level analysis] List of UIDStudyNums

{numsubjects}

[Second level analysis] Total number of subjects in the group analysis

{groups}

[Second level analysis] List of group names contributing to the group analysis. Sometimes this can be used when comparing groups

{numsubjects_groupname}

[Second level analysis] Number of subjects within the specified groupname

{uidstudynums_groupname}

[Second level analysis] Number of studies within the specified groupname
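The substitution NiDB performs can be sketched as a simple template expansion: each {variable} is replaced with its per-study value before the final script is written out. The values below are invented for one subject/study:

```python
import re

# Invented per-study values for a few of the pipeline variables above
values = {
    "analysisrootdir": "/home/user/thePipeline/S1234ABC/1/",
    "subjectuid": "S1234ABC",
    "studynum": "2",
    "uidstudynum": "S1234ABC2",
}

line = "export SUBJECTS_DIR={analysisrootdir}     # subject dir for {subjectuid}"

# Replace each known {variable}; unknown variables are left untouched
expanded = re.sub(r"\{(\w+)\}",
                  lambda m: values.get(m.group(1), m.group(0)),
                  line)
print(expanded)
```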

🔴
🔵
🟡
🟡
🟡
🔴
🔴
🔴
🔵
🔴
🟡
🟡
🔴
🔵
🟡
🟡
🟡
🔴
🟡
🟡
🟡
🔴
🔴
🔵
🔵
🔴
pipeline scripts
pipeline scripts
data-steps
data specifications
shebang line

AssociationType

string

[Search] study, or subject.

BehavioralDirectory

string

[Export] If BehFormat writes data to a subdirectory, the subdirectory will have this name.

BehavioralDirectoryFormat

string

[Export] nobeh, behroot, behseries, behseriesdir

DataFormat

string

[Export] native, dicom, nifti3d, nifti4d, analyze3d, analyze4d, bids.

Enabled

bool

[Search] true if the step is enabled, false otherwise

Gzip

bool

[Export] true if converted Nifti data should be gzipped, false otherwise.

ImageType

string

[Search] Comma separated list of image types, often derived from the DICOM ImageType tag, (0008:0008).

DataLevel

string

[Search] Where the data comes from: nearestintime, samestudy.

Location

string

[Export] Directory, relative to the analysisroot, where this data item will be written.

Modality

string

[Search] Modality to search for.

NumberBOLDreps

string

[Search] If SeriesCriteria is set to usecriteria, then search based on this option.

NumberImagesCriteria

string

[Search]

Optional

bool

[Search] true if this data step is optional, false if this step is required; the analysis will not run if a required data step is not found.

Order

number

The numerical order of this data step.

PreserveSeries

bool

[Export] true to preserve series numbers or false to assign new ordinal numbers.

PrimaryProtocol

bool

[Search] true if this data step determines the primary study, from which subsequent analyses are run.

Protocol

string

[Search] Comma-separated list of protocol names.

SeriesCriteria

string

[Search] Criteria for which series are downloaded if more than one matches criteria: all, first, last, largest, smallest, usecriteria.

UsePhaseDirectory

bool

[Export] true to write data to a subdirectory based on the phase-encoding direction.

UseSeriesDirectory

bool

[Export] true to write each series to its own directory, false to write data to the root export directory.
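Taken together, a single data step might look like the following JSON sketch. The field names come from the table above; the values, and which fields appear together, are illustrative assumptions rather than a canonical example:

```json
{
  "Order": 1,
  "AssociationType": "study",
  "Protocol": "T1w,MPRAGE",
  "Modality": "MR",
  "SeriesCriteria": "largest",
  "Optional": false,
  "PrimaryProtocol": true,
  "DataFormat": "nifti4d",
  "Gzip": true,
  "Location": "anat",
  "UseSeriesDirectory": true,
  "PreserveSeries": false
}
```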

Building squirrel library and utils

Overview

The following OS configurations have been tested to build squirrel with Qt 6.5:

  • Compatible

    • RHEL compatible Linux 8 (not 8.6)

    • CentOS 8 (not CentOS 8 Stream)

    • CentOS 7

    • Windows 10/11

squirrel library and utils cannot be built on CentOS Stream 8 or Rocky Linux 8.6. These releases have kernel bugs that interfere with Qt's QProcess library, which can lead to inconsistent shell-command execution and qmake build errors.

Other OS configurations may work to build squirrel, but have not been tested.

Prepare Build Environment

Building the squirrel Library

Once the build environment is set up, the build process can be performed by script. The build.sh script builds the squirrel library files and the squirrel utils.

The first time you build squirrel on a machine, perform the following:

cd ~
wget https://github.com/gbook/squirrel/archive/main.zip
unzip main.zip
mv squirrel-main squirrel
cd squirrel
./build.sh

This will build gdcm (squirrel depends on GDCM for reading DICOM headers), squirrel lib, and squirrel-gui.

All subsequent builds on this machine can be done with the following

cd ~/squirrel
./build.sh

  • Using Github Desktop, clone the squirrel repository to C:\squirrel

  • Build GDCM

    • Open CMake

    • Set source directory to C:\squirrel\src\gdcm

    • Set build directory to C:\squirrel\bin\gdcm

    • Click Configure (click Yes to create the build directory)

    • Select Visual Studio 16 2019. Click Finish

    • After it's done generating, make sure GDCM_BUILD_SHARED_LIBS is checked

    • Click Configure again

    • Click Generate. This will create the Visual Studio solution and project files

    • Open the C:\squirrel\bin\gdcm\GDCM.sln file in Visual Studio

    • Change the build to Release

    • Right-click ALL_BUILD and click Build

  • Build squirrel library

    • Double-click C:\squirrel\src\squirrel\squirrellib.pro

    • Configure the project for Qt 6.4.2 as necessary

    • Switch the build to Release and build it

    • squirrel.dll and squirrel.lib will now be in C:\squirrel\bin\squirrel

  • Build squirrel-gui

    • Double-click C:\squirrel\src\squirrel-gui\squirrel-gui.pro

    • Configure the project for Qt 6.4.2 as necessary

    • Switch the build to Release and build it

Contributing to the squirrel Library

Setting up a development environment

Once you've been granted access to the squirrel project on GitHub, you'll need to add your server's SSH key to your GitHub account (github.com --> click your username --> Settings --> SSH and GPG keys). There are directions on the GitHub site for how to do this. Then you can clone the current source code onto your server.
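If the server does not already have an SSH key, you can generate one. A minimal sketch, in which the key filename and email comment are placeholders:

```shell
# Generate a new SSH key pair (filename and email are placeholders)
mkdir -p ~/.ssh
ssh-keygen -t ed25519 -C "you@example.com" -f ~/.ssh/id_ed25519_squirrel -N "" -q
# Print the public key; paste this into github.com -> Settings -> SSH and GPG keys
cat ~/.ssh/id_ed25519_squirrel.pub
```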

Cloning a new repository with SSH

cd ~
git clone git@github.com:gbook/squirrel.git squirrel

This will create a git repository called squirrel in your home directory.

Committing changes

cd ~/squirrel
git commit -am "Comments about the changes"
git push origin main
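Note that git commit -am only commits files that git already tracks; brand-new files must be staged with git add first. A self-contained sketch in a throwaway repository (paths and names are examples):

```shell
# Demonstrate staging a new file before committing, in a temporary repo
repo=$(mktemp -d)
cd "$repo"
git init -q
echo "demo" > newfile.txt
git add newfile.txt    # 'git commit -am' alone would not include this untracked file
git -c user.email=dev@example.com -c user.name=dev commit -qm "Add newfile"
git log --oneline
```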

Updating your repository

To keep your local copy of the repository up to date, you'll need to pull any changes from github.

cd ~/squirrel
git pull origin main

Troubleshooting

Build freezes

This may happen if the build machine does not have enough RAM or processors. More likely, the build is running inside a VM that does not have enough RAM or processors allocated.

Build fails with "QMAKE_CXX.COMPILER_MACROS not defined"

This error happens because of a kernel bug in RHEL 8.6. Downgrade to 8.5 or upgrade to 8.7.
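To confirm whether you are on an affected release, check the kernel version and, where present, the distribution release string (output will vary by machine):

```shell
# Print the running kernel version; RHEL 8.6 kernels exhibit the QProcess bug
uname -r
# Print the distribution release string on RHEL-compatible systems, if present
cat /etc/redhat-release 2>/dev/null || true
```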

Library error

This example is from the nidb executable, which uses the squirrel library. If you get an error similar to the following, you'll need to install the missing library:

./nidb: error while loading shared libraries: libsquirrel.so.1: cannot open shared object file: No such file or directory

You can check which libraries are missing by running ldd on the nidb executable

[nidb@ado2dev bin]$ ldd nidb
        linux-vdso.so.1 (0x00007ffd07fe4000)
        libSMTPEmail.so.1 => /lib/libSMTPEmail.so.1 (0x00007fdb4e2b0000)
        libsquirrel.so.1 => not found
        libgdcmMSFF.so.3.0 => /lib/libgdcmMSFF.so.3.0 (0x00007fdb4dd88000)
        libgdcmCommon.so.3.0 => /lib/libgdcmCommon.so.3.0 (0x00007fdb4db60000)
        libgdcmDICT.so.3.0 => /lib/libgdcmDICT.so.3.0 (0x00007fdb4d688000)
        libgdcmDSED.so.3.0 => /lib/libgdcmDSED.so.3.0 (0x00007fdb4d348000)

Copy the missing library file(s) to /lib as root. Then run ldconfig to register any new libraries.

Virtual Machine Has No Network

If you are building inside a virtual machine, there are a couple of known bugs in VMWare Workstation Player (and possibly other VMWare products) where the network adapters on a Linux guest simply stop working: they go offline and cannot be reactivated, or the network shows as connected but the VM is unreachable from the outside.

Try these fixes to get the network back:

  1. While the VM is running, suspend the guest OS. Wait for it to suspend and close itself, then resume the guest OS. For unknown reasons, this often restores the missing network adapter in Linux.

  2. Open the VM settings, go to Network, and click the button to edit the bridged adapters. Uncheck the VM adapter. This applies only if you are using bridged networking.

  3. Switch to NAT networking. This may work better if you are connected to public Wi-Fi.

Using the squirrel Library

Copy the squirrel library files to the /lib directory. The libraries will then be available system-wide.

cd ~/squirrel/bin/squirrel
sudo cp -uv libsquirrel* /lib/
sudo ldconfig

Install the following as root

dnf group install 'Development Tools'
dnf install cmake3
dnf install xcb*
dnf install libxcb*

Install Qt

  1. Download Qt open-source from https://www.qt.io/download-open-source

  2. Make the installer executable chmod 777 qt-unified-linux-x64-x.x.x-online.run

  3. Run ./qt-unified-linux-x64-x.x.x-online.run

  4. The Qt Maintenance Tool will start. An account is required to download Qt open source.

  5. On the components screen, select the checkbox for Qt 6.5.3 → Desktop gcc 64-bit

Install the following as root

yum group install 'Development Tools'
yum install cmake3
yum install xcb*
yum install libxcb*
yum install gcc-toolset-10

Install Qt

  1. Download Qt open-source from https://www.qt.io/download-open-source

  2. Make the installer executable chmod 777 qt-unified-linux-x64-x.x.x-online.run

  3. Run ./qt-unified-linux-x64-x.x.x-online.run

  4. The Qt Maintenance Tool will start. An account is required to download Qt open source.

  5. On the components screen, select the checkbox for Qt 6.5.3 → Desktop gcc 64-bit

Install the following as root

yum install epel-release
yum group install 'Development Tools'
yum install cmake3

Install Qt

  1. Download Qt open-source from https://www.qt.io/download-open-source

  2. Make the installer executable chmod 777 qt-unified-linux-x64-x.x.x-online.run

  3. Run ./qt-unified-linux-x64-x.x.x-online.run

  4. The Qt Maintenance Tool will start. An account is required to download Qt open source.

  5. On the components screen, select the checkbox for Qt 6.5.3 → Desktop gcc 64-bit

Install the following as root

apt install build-essential
apt install libxcb*
apt install make
apt install cmake

Install Qt

  1. Download Qt open-source from https://www.qt.io/download-open-source

  2. Make the installer executable chmod 777 qt-unified-linux-x64-x.x.x-online.run

  3. Run ./qt-unified-linux-x64-x.x.x-online.run

  4. The Qt Maintenance Tool will start. An account is required to download Qt open source.

  5. On the components screen, select the checkbox for Qt 6.5.3 → Desktop gcc 64-bit

Install build environment

  1. Install Qt 6.4.2 for MSVC2019 x64

Install Qt

  1. Download Qt open-source from https://www.qt.io/download-open-source

  2. Run the setup program.

  3. The Qt Maintenance Tool will start. An account is required to download Qt open source.

  4. On the components screen, select the checkbox for Qt 6.5.3 → MSVC 2019 64-bit

  • Install Visual Studio 2019 Community edition, available from Microsoft. Include the C++ extensions.

  • Install CMake3.

  • Install Github Desktop, TortoiseGit, or another Git interface.
