8.2. Generic BIDS DataGrabber for datalad.

This example uses a generic BIDS DataGraber to get the data from a BIDS dataset store in a datalad remote sibling.

Authors: Federico Raimondo

License: BSD 3 clause

from junifer.datagrabber import PatternDataladDataGrabber
from junifer.utils import configure_logging

Set the logging level to info to see extra information

configure_logging(level="INFO")
2025-11-26 08:16:40,854 - JUNIFER - INFO - ===== Lib Versions =====
2025-11-26 08:16:40,854 - JUNIFER - INFO - numpy: 1.26.4
2025-11-26 08:16:40,854 - JUNIFER - INFO - scipy: 1.15.0
2025-11-26 08:16:40,854 - JUNIFER - INFO - pandas: 2.1.4
2025-11-26 08:16:40,854 - JUNIFER - INFO - nilearn: 0.10.4
2025-11-26 08:16:40,854 - JUNIFER - INFO - nibabel: 5.3.2
2025-11-26 08:16:40,854 - JUNIFER - INFO - junifer: 0.0.6
2025-11-26 08:16:40,854 - JUNIFER - INFO - ========================

The BIDS DataGrabber requires three parameters: the types of data we want, the specific pattern that matches each type, and the variables that will be replaced in the patterns.

types = ["T1w", "BOLD"]
patterns = {
    "T1w": {
        "pattern": "{subject}/anat/{subject}_T1w.nii.gz",
        "space": "native",
    },
    "BOLD": {
        "pattern": "{subject}/func/{subject}_task-rest_bold.nii.gz",
        "space": "MNI152NLin6Asym",
    },
}
replacements = ["subject"]

Additionally, a datalad-based DataGrabber requires the URI of the remote sibling and the location of the dataset within the remote sibling.

repo_uri = "https://gin.g-node.org/juaml/datalad-example-bids"
rootdir = "example_bids"

Now we can use the DataGrabber within a with context. One thing we can do with any DataGrabber is iterate over the elements. In this case, each element of the DataGrabber is one session.

2025-11-26 08:16:40,854 - JUNIFER - INFO - `datadir` is None, creating a temporary directory
2025-11-26 08:16:40,855 - JUNIFER - INFO - `datadir` set to /tmp/junifer/tmppaitq96b/datalad4ftm40k9/datadir
sub-09
sub-04
sub-02
sub-05
sub-08
sub-07
sub-06
sub-01
sub-03

Another feature of the DataGrabber is the ability to get a specific element by its name. In this case, we index sub-01 and we get the file paths for the two types of data we want (T1w and BOLD).

2025-11-26 08:16:46,163 - JUNIFER - INFO - `datadir` is None, creating a temporary directory
2025-11-26 08:16:46,163 - JUNIFER - INFO - `datadir` set to /tmp/junifer/tmppaitq96b/datalad2g2me6dl/datadir
2025-11-26 08:16:46,769 - JUNIFER - INFO - Getting element sub-01
2025-11-26 08:16:46,769 - JUNIFER - INFO - Resolving path from pattern for T1w
2025-11-26 08:16:46,769 - JUNIFER - INFO - Resolving path from pattern for BOLD
{'T1w': {'space': 'native', 'path': PosixPath('/tmp/junifer/tmppaitq96b/datalad2g2me6dl/datadir/example_bids/sub-01/anat/sub-01_T1w.nii.gz'), 'meta': {'datagrabber': {'class': 'PatternDataladDataGrabber', 'replacements': ['subject'], 'patterns': {'T1w': {'pattern': '{subject}/anat/{subject}_T1w.nii.gz', 'space': 'native'}, 'BOLD': {'pattern': '{subject}/func/{subject}_task-rest_bold.nii.gz', 'space': 'MNI152NLin6Asym'}}, 'partial_pattern_ok': False, 'confounds_format': None, 'types': ['T1w', 'BOLD'], 'uri': 'https://gin.g-node.org/juaml/datalad-example-bids', 'datalad_dirty': False, 'datalad_commit_id': '3f288c8725207ae0c9b3616e093e78cda192b570', 'datalad_id': '582b9696-f13f-42e4-9587-b4e62aa2a8e7'}, 'dependencies': set(), 'element': {'subject': 'sub-01'}}}, 'BOLD': {'space': 'MNI152NLin6Asym', 'path': PosixPath('/tmp/junifer/tmppaitq96b/datalad2g2me6dl/datadir/example_bids/sub-01/func/sub-01_task-rest_bold.nii.gz'), 'meta': {'datagrabber': {'class': 'PatternDataladDataGrabber', 'replacements': ['subject'], 'patterns': {'T1w': {'pattern': '{subject}/anat/{subject}_T1w.nii.gz', 'space': 'native'}, 'BOLD': {'pattern': '{subject}/func/{subject}_task-rest_bold.nii.gz', 'space': 'MNI152NLin6Asym'}}, 'partial_pattern_ok': False, 'confounds_format': None, 'types': ['T1w', 'BOLD'], 'uri': 'https://gin.g-node.org/juaml/datalad-example-bids', 'datalad_dirty': False, 'datalad_commit_id': '3f288c8725207ae0c9b3616e093e78cda192b570', 'datalad_id': '582b9696-f13f-42e4-9587-b4e62aa2a8e7'}, 'dependencies': set(), 'element': {'subject': 'sub-01'}}}}

Total running time of the script: (0 minutes 6.351 seconds)

Gallery generated by Sphinx-Gallery