8.2. Generic BIDS DataGrabber for datalad.
This example uses a generic BIDS DataGrabber to get data from a BIDS dataset stored in a datalad remote sibling.
Authors: Federico Raimondo
License: BSD 3 clause
from junifer.datagrabber import PatternDataladDataGrabber
from junifer.utils import configure_logging
Set the logging level to info to see extra information
configure_logging(level="INFO")
2024-04-05 15:42:12,487 - JUNIFER - INFO - ===== Lib Versions =====
2024-04-05 15:42:12,488 - JUNIFER - INFO - numpy: 1.26.4
2024-04-05 15:42:12,488 - JUNIFER - INFO - scipy: 1.11.4
2024-04-05 15:42:12,488 - JUNIFER - INFO - pandas: 2.1.4
2024-04-05 15:42:12,488 - JUNIFER - INFO - nilearn: 0.10.2
2024-04-05 15:42:12,488 - JUNIFER - INFO - nibabel: 5.2.1
2024-04-05 15:42:12,488 - JUNIFER - INFO - junifer: 0.0.4
2024-04-05 15:42:12,488 - JUNIFER - INFO - ========================
The BIDS DataGrabber requires three parameters: the types of data we want, the specific pattern that matches each type, and the variables that will be replaced in the patterns.
types = ["T1w", "BOLD"]
patterns = {
    "T1w": {
        "pattern": "{subject}/anat/{subject}_T1w.nii.gz",
        "space": "native",
    },
    "BOLD": {
        "pattern": "{subject}/func/{subject}_task-rest_bold.nii.gz",
        "space": "MNI152NLin6Asym",
    },
}
replacements = ["subject"]
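To make the role of the replacement variable concrete, the following sketch expands each pattern for one subject using plain `str.format`. It only illustrates the idea of filling `{subject}` into a pattern; the helper `expand` is hypothetical and this is not necessarily how junifer resolves patterns internally.

```python
from pathlib import PurePosixPath

# Same patterns as in the example above
patterns = {
    "T1w": {
        "pattern": "{subject}/anat/{subject}_T1w.nii.gz",
        "space": "native",
    },
    "BOLD": {
        "pattern": "{subject}/func/{subject}_task-rest_bold.nii.gz",
        "space": "MNI152NLin6Asym",
    },
}


def expand(pattern, **values):
    """Hypothetical helper: fill the replacement variables into a pattern."""
    return PurePosixPath(pattern.format(**values))


# Expand every pattern for a single subject
paths = {
    dtype: expand(spec["pattern"], subject="sub-01")
    for dtype, spec in patterns.items()
}
for dtype, path in paths.items():
    print(dtype, path)
# T1w sub-01/anat/sub-01_T1w.nii.gz
# BOLD sub-01/func/sub-01_task-rest_bold.nii.gz
```

The resulting paths are relative; in the example they are resolved against the dataset directory.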
Additionally, a datalad-based DataGrabber requires the URI of the remote sibling and the location of the dataset within the remote sibling.
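The two values are not shown at this point, but the code below refers to them as `repo_uri` and `rootdir`. Judging from the output further down (the `uri` field in the metadata and the `example_bids` component of the file paths), they would be defined roughly as follows; treat this as a reconstruction from that output rather than the original lines.

```python
# URI of the datalad remote sibling (taken from the 'uri' field in the output)
repo_uri = "https://gin.g-node.org/juaml/datalad-example-bids"
# Location of the BIDS dataset inside the sibling
# (inferred from the '.../datadir/example_bids/sub-01/...' paths in the output)
rootdir = "example_bids"
```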
Now we can use the DataGrabber within a with context. One thing we can do with any DataGrabber is iterate over its elements. In this case, each element of the DataGrabber is one subject.
with PatternDataladDataGrabber(
    rootdir=rootdir,
    types=types,
    patterns=patterns,
    uri=repo_uri,
    replacements=replacements,
) as dg:
    for elem in dg:
        print(elem)
2024-04-05 15:42:12,488 - JUNIFER - INFO - `datadir` is None, creating a temporary directory
2024-04-05 15:42:12,489 - JUNIFER - INFO - `datadir` set to /tmp/tmpbm6ilvwj/datadir
sub-03
sub-08
sub-06
sub-05
sub-04
sub-09
sub-07
sub-02
sub-01
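As a rough mental model of where these elements come from, the sketch below scans a toy directory and keeps every subject for which all patterns point to an existing file. The function `find_subjects`, the toy layout, and the "all types must exist" rule are assumptions made for illustration; junifer's actual discovery logic may differ.

```python
import tempfile
from pathlib import Path

patterns = {
    "T1w": "{subject}/anat/{subject}_T1w.nii.gz",
    "BOLD": "{subject}/func/{subject}_task-rest_bold.nii.gz",
}


def find_subjects(root, patterns):
    """Hypothetical helper: subjects for which every pattern matches a file."""
    candidates = sorted(p.name for p in root.iterdir() if p.is_dir())
    return [
        subject
        for subject in candidates
        if all(
            (root / pattern.format(subject=subject)).exists()
            for pattern in patterns.values()
        )
    ]


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    # Toy layout (not the real dataset): sub-01 is complete, sub-02 lacks BOLD
    for rel in [
        "sub-01/anat/sub-01_T1w.nii.gz",
        "sub-01/func/sub-01_task-rest_bold.nii.gz",
        "sub-02/anat/sub-02_T1w.nii.gz",
    ]:
        target = root / rel
        target.parent.mkdir(parents=True, exist_ok=True)
        target.touch()
    found = find_subjects(root, patterns)

print(found)
# ['sub-01']
```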
Another feature of the DataGrabber is the ability to get a specific element by its name. In this case, we index sub-01 and we get the file paths for the two types of data we want (T1w and BOLD).
with PatternDataladDataGrabber(
    rootdir=rootdir,
    types=types,
    patterns=patterns,
    uri=repo_uri,
    replacements=replacements,
) as dg:
    sub01 = dg["sub-01"]
    print(sub01)
2024-04-05 15:42:15,818 - JUNIFER - INFO - `datadir` is None, creating a temporary directory
2024-04-05 15:42:15,818 - JUNIFER - INFO - `datadir` set to /tmp/tmpuitn0wo0/datadir
2024-04-05 15:42:17,996 - JUNIFER - INFO - Getting element sub-01
{'T1w': {'space': 'native', 'path': PosixPath('/tmp/tmpuitn0wo0/datadir/example_bids/sub-01/anat/sub-01_T1w.nii.gz'), 'meta': {'datagrabber': {'class': 'PatternDataladDataGrabber', 'patterns': {'T1w': {'pattern': '{subject}/anat/{subject}_T1w.nii.gz', 'space': 'native'}, 'BOLD': {'pattern': '{subject}/func/{subject}_task-rest_bold.nii.gz', 'space': 'MNI152NLin6Asym'}}, 'replacements': ['subject'], 'confounds_format': None, 'types': ['T1w', 'BOLD'], 'uri': 'https://gin.g-node.org/juaml/datalad-example-bids', 'datalad_dirty': False, 'datalad_commit_id': '522dfb203afcd2cd55799bf347f9b211919a7338', 'datalad_id': 'fec92475-d9c0-4409-92ba-f041b6a12c40'}, 'dependencies': set(), 'element': {'subject': 'sub-01'}}}, 'BOLD': {'space': 'MNI152NLin6Asym', 'path': PosixPath('/tmp/tmpuitn0wo0/datadir/example_bids/sub-01/func/sub-01_task-rest_bold.nii.gz'), 'meta': {'datagrabber': {'class': 'PatternDataladDataGrabber', 'patterns': {'T1w': {'pattern': '{subject}/anat/{subject}_T1w.nii.gz', 'space': 'native'}, 'BOLD': {'pattern': '{subject}/func/{subject}_task-rest_bold.nii.gz', 'space': 'MNI152NLin6Asym'}}, 'replacements': ['subject'], 'confounds_format': None, 'types': ['T1w', 'BOLD'], 'uri': 'https://gin.g-node.org/juaml/datalad-example-bids', 'datalad_dirty': False, 'datalad_commit_id': '522dfb203afcd2cd55799bf347f9b211919a7338', 'datalad_id': 'fec92475-d9c0-4409-92ba-f041b6a12c40'}, 'dependencies': set(), 'element': {'subject': 'sub-01'}}}}
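The printed dictionary is dense, so here is a small sketch of how one might navigate it. The `sub01` value below is a hand-trimmed copy of the output above (most of `meta` is omitted, and `PurePosixPath` stands in for `PosixPath` so the snippet runs on any platform); it is not produced by calling junifer.

```python
from pathlib import PurePosixPath

# Trimmed copy of the structure printed above: one entry per data type
sub01 = {
    "T1w": {
        "space": "native",
        "path": PurePosixPath(
            "/tmp/tmpuitn0wo0/datadir/example_bids/sub-01/anat/sub-01_T1w.nii.gz"
        ),
        "meta": {"element": {"subject": "sub-01"}},
    },
    "BOLD": {
        "space": "MNI152NLin6Asym",
        "path": PurePosixPath(
            "/tmp/tmpuitn0wo0/datadir/example_bids/sub-01/func/"
            "sub-01_task-rest_bold.nii.gz"
        ),
        "meta": {"element": {"subject": "sub-01"}},
    },
}

# Summarise each data type: its space and the file it points to
for dtype, entry in sub01.items():
    print(f"{dtype}: space={entry['space']}, file={entry['path'].name}")
# T1w: space=native, file=sub-01_T1w.nii.gz
# BOLD: space=MNI152NLin6Asym, file=sub-01_task-rest_bold.nii.gz

# Direct access to a single field
bold_path = sub01["BOLD"]["path"]
subject = sub01["BOLD"]["meta"]["element"]["subject"]
```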
Total running time of the script: (0 minutes 6.598 seconds)