Background & Summary

In cognitive neuroscience, the use of naturalistic stimuli such as commercial movies has advanced our understanding of the human brain. While simple, controlled experiments necessarily target a limited space of brain responses, naturalistic stimuli sample a much broader space1,2. Naturalistic stimuli are better suited to engage participants and hold their attention3. They evoke more reliable brain activity than controlled experiments in which the same stimuli are repeated multiple times under the same conditions4,5,6. Naturalistic stimuli allow researchers to compare highly similar brain responses across subjects1,4,5,7, and to study how complex representations are encoded in brain activity2,8,9.

Experiments with naturalistic stimuli are flexible because they can be analyzed with a variety of methods. Inter-Subject Correlation (ISC)10 can be used to study the similarity of brain activity across subjects. Multivariate Pattern Analysis (MVPA)11,12, including Representational Similarity Analysis (RSA)13,14, can be used to investigate information in population responses embedded in patterns of brain activity. Voxelwise encoding models can be used with naturalistic stimuli to create predictive models of brain activity and quantify complex, multidimensional voxel tuning15,16,17. Across subjects, brain activity is highly similar in response to naturalistic stimuli5, and such brain responses can be used as a basis for functional alignment (e.g., Hyperalignment1,7,18). Hyperalignment outperforms anatomical alignment for statistical analysis and, most importantly, preserves the information encoded in fine-grained topographies of brain activity, which facilitates the study of individual differences19,20,21.

As an additional advantage, a single naturalistic fMRI dataset can be reused multiple times to answer different experimental questions with a variety of analytical methods. When datasets with responses to naturalistic stimuli are publicly shared, they can be used by many different laboratories and researchers to address their specific questions of interest (for example, see http://studyforrest.org/22). It is important, however, to use a variety of naturalistic stimuli to sample brain activity more broadly. For example, multiple naturalistic movies can be used to test whether the experimental results of interest generalize beyond a specific stimulus set.

For this reason, in this paper we describe and share a dataset in which 25 subjects watched part of the feature film “The Grand Budapest Hotel” by Wes Anderson. Their brain activity was measured with a state-of-the-art 3 T scanner (Siemens Prisma) at the Dartmouth Brain Imaging Center. This movie was chosen to sample brain activity specifically related to social interactions and face processing. The movie has a large cast with many famous actors. Throughout the story, the camera highlights many different faces and expressions, which are fundamental to understanding the complex narrative of the movie. In a previous publication23, part of this dataset was used as a basis for hyperalignment to show that face-specific functional ROIs can be recovered in new subjects using existing data, paving the way for a novel method to recover functional ROIs in detail without time-consuming localizer tasks. This dataset adds to the existing neuroimaging datasets that sampled brain activity with naturalistic movies22,24,25,26, providing fMRI data for researchers especially interested in social interactions and face perception.

Methods

Participants

Twenty-five participants (including three of the authors, 13 females, mean age 27.52 years ± 2.26 SD) took part in the experiment. All had normal or corrected-to-normal vision. All participants provided written informed consent for participation in the study and for the release of their data. Twenty-one participants used a custom-fitted CaseForge headcase (https://caseforge.co) to minimize head motion in the scanner (see Table 1). The study was approved by the Dartmouth Committee for the Protection of Human Subjects.

Table 1 Demographic information about subjects.

Stimuli

The full-length feature movie “The Grand Budapest Hotel” by Wes Anderson (DVD UPC 024543897385) was divided into six parts of different durations. The movie was split at scene cuts to keep the narrative as intact as possible. The audio was post-processed using FFmpeg (https://www.ffmpeg.org) with an audio compressor filter to reduce the dynamic range and make dialogue easier to hear in the scanner. The code used to split and post-process the movie is available in the code repository.
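The exact commands are in the shared repository; for illustration, the following minimal sketch shows how one part of the movie could be extracted and its audio compressed with FFmpeg's acompressor filter from Python (file names and timestamps are hypothetical placeholders, not the actual cut points):

```python
import subprocess

# Minimal sketch (not the shared scripts): cut one part of the movie and
# compress the audio dynamic range with FFmpeg's acompressor filter.
def split_and_compress(src, dst, start_s, duration_s):
    subprocess.run(
        ["ffmpeg", "-i", src,
         "-ss", str(start_s), "-t", str(duration_s),  # segment boundaries (s)
         "-filter:a", "acompressor",                   # default compressor settings
         "-c:v", "copy",                               # leave the video stream untouched
         dst],
        check=True,
    )

# Hypothetical usage: extract a 10-minute part starting at 46 minutes.
split_and_compress("budapest.mkv", "budapest_part2.mkv", start_s=2760, duration_s=600)
```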

Procedure

Subjects took part in two experimental sessions, one behavioral and one in the fMRI scanner. In the behavioral session, participants watched the first part of the movie (approximately 46 minutes). Immediately after this session, participants went into the scanner and watched the remainder of the movie, divided into five parts. They were instructed to simply watch the movie, with no additional task.

Imaging session

The imaging session comprised one anatomical (T1w) scan, one gradient echo (GRE) fieldmap estimation scan, and five functional runs. During the anatomical scan, participants watched the last five minutes of the first part of the movie (which they had watched in the behavioral session) to calibrate the sound volume for the scanner. They were asked to use a button box to increase or decrease the volume so that they could easily hear the dialogue. The volume chosen by the subject was used throughout the session without further modification. The functional runs differed in duration depending on the part of the movie, ranging from approximately 9 to 13 minutes. Each run was padded with a 10 s fixation period at both the beginning and the end. In all runs but the first, the movie started with at least 10 s of footage that overlapped with the previous run. The movie was presented to the subjects on a back-projected screen and subtended approximately 16.27 × 9.17 (W × H) degrees of visual angle. The audio was delivered to the subject through MR-compatible in-ear headphones (Sensimetrics model S14).

Imaging parameters

All functional and structural volumes were acquired using a 3 T Siemens Magnetom Prisma MRI scanner (Siemens, Erlangen, Germany) with a 32-channel phased-array head coil at the Dartmouth Brain Imaging Center. Functional, blood oxygenation level-dependent (BOLD) images were acquired in an interleaved fashion using gradient-echo echo-planar imaging with pre-scan normalization, fat suppression, a multiband (i.e., simultaneous multi-slice; SMS) acceleration factor of 4 (using blipped CAIPIRINHA), and no in-plane acceleration (i.e., GRAPPA acceleration factor of 1): TR/TE = 1000/33 ms, flip angle = 59°, resolution = 2.5 mm isotropic voxels, matrix size = 96 × 96, FoV = 240 × 240 mm, 52 axial slices with full brain coverage and no gap, anterior–posterior phase encoding. At the beginning of each run, three dummy scans were acquired to allow for signal stabilization. At the beginning of the imaging session, a single dual-echo GRE (gradient echo) scan was acquired. This scan was used to obtain a fieldmap estimate for susceptibility distortion correction.

A T1-weighted structural scan was acquired using a high-resolution single-shot MPRAGE sequence with an in-plane acceleration factor of 2 using GRAPPA: TR/TE/TI = 2300/2.32/933 ms, flip angle = 8°, resolution = 0.9375 × 0.9375 × 0.9 mm voxels, matrix size = 256 × 256, FoV = 240 × 240 × 172.8 mm, 192 sagittal slices, ascending acquisition, anterior–posterior phase encoding, no fat suppression, 5 min 21 s total acquisition time.

Preprocessing

The description of the anatomical and functional preprocessing (sections Anatomical data preprocessing and Functional data preprocessing) was automatically generated by fMRIPrep27 and is copied here with minimal changes for style (see https://fmriprep.org/en/stable/citing.html#note-for-reviewers-and-editors for more information).

Results included in this manuscript come from preprocessing performed using fMRIPrep 20.1.127 (RRID:SCR_016216), which is based on Nipype 1.5.028,29 (RRID:SCR_002502).

Anatomical data preprocessing

The T1-weighted (T1w) image was corrected for intensity non-uniformity (INU) with N4BiasFieldCorrection30, distributed with ANTs 2.2.031 (RRID:SCR_004757), and used as T1w-reference throughout the workflow. The T1w-reference was then skull-stripped with a Nipype implementation of the antsBrainExtraction.sh workflow (from ANTs), using OASIS30ANTs as target template. Brain tissue segmentation of cerebrospinal fluid (CSF), white-matter (WM) and gray-matter (GM) was performed on the brain-extracted T1w using fast32 (FSL 5.0.9, RRID:SCR_002823).

Brain surfaces were reconstructed using recon-all33 (FreeSurfer 6.0.1, RRID:SCR_001847) and the brain mask estimated previously was refined with a custom variation of the method to reconcile ANTs-derived and FreeSurfer-derived segmentations of the cortical gray-matter of Mindboggle34 (RRID:SCR_002438). Volume-based spatial normalization to one standard space (MNI152NLin2009cAsym) was performed through nonlinear registration with antsRegistration (ANTs 2.2.0), using brain-extracted versions of both T1w reference and the T1w template. The following template was selected for spatial normalization: ICBM 152 Nonlinear Asymmetrical template version 2009c35 (RRID:SCR_008796; TemplateFlow ID: MNI152NLin2009cAsym).

Functional data preprocessing

For each of the five BOLD runs per subject, the following preprocessing was performed. First, a reference volume and its skull-stripped version were generated using a custom methodology of fMRIPrep. A B0-nonuniformity map (or fieldmap) was estimated based on a phase-difference map calculated with a dual-echo GRE (gradient-recalled echo) sequence, processed with a custom workflow of SDCFlows inspired by the epidewarp.fsl script (http://www.nmr.mgh.harvard.edu/~greve/fbirn/b0/epidewarp.fsl) and further improvements in HCP Pipelines36. The fieldmap was then co-registered to the target EPI (echo-planar imaging) reference run and converted to a displacements field map (amenable to registration tools such as ANTs) with FSL’s fugue and other SDCFlows tools. Based on the estimated susceptibility distortion, a corrected EPI reference was calculated for a more accurate co-registration with the anatomical reference. The BOLD reference was then co-registered to the T1w reference using bbregister (FreeSurfer), which implements boundary-based registration37. Co-registration was configured with six degrees of freedom. Head-motion parameters with respect to the BOLD reference (transformation matrices, and six corresponding rotation and translation parameters) were estimated before any spatiotemporal filtering using mcflirt38 (FSL 5.0.9). BOLD runs were slice-time corrected using 3dTshift from AFNI 2016020739 (RRID:SCR_005927). The BOLD time-series were resampled onto the fsaverage surface (FreeSurfer reconstruction nomenclature).

The BOLD time-series (including slice-timing correction when applied) were resampled onto their original, native space by applying a single, composite transform to correct for head-motion and susceptibility distortions. These resampled BOLD time-series will be referred to as preprocessed BOLD in original space, or just preprocessed BOLD.

Several confounding time-series were calculated based on the preprocessed BOLD: framewise displacement (FD), DVARS and three region-wise global signals. FD and DVARS are calculated for each functional run, both using their implementations in Nipype (following the definitions by Power et al.40). The three global signals are extracted within the CSF, the WM, and the whole-brain masks. Additionally, a set of physiological regressors were extracted to allow for component-based noise correction (CompCor41). Principal components are estimated after high-pass filtering the preprocessed BOLD time-series (using a discrete cosine filter with 128 s cut-off) for the two CompCor variants: temporal (tCompCor) and anatomical (aCompCor). tCompCor components are then calculated from the top 5% variable voxels within a mask covering the subcortical regions. This subcortical mask is obtained by heavily eroding the brain mask, which ensures it does not include cortical GM regions. For aCompCor, components are calculated within the intersection of the aforementioned mask and the union of CSF and WM masks calculated in T1w space, after their projection to the native space of each functional run (using the inverse BOLD-to-T1w transformation). Components are also calculated separately within the WM and CSF masks. For each CompCor decomposition, the k components with the largest singular values are retained, such that the retained components’ time series are sufficient to explain 50 percent of variance across the nuisance mask (CSF, WM, combined, or temporal). The remaining components are dropped from consideration.
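To make the variance-retention rule concrete, the following minimal sketch (illustrative only, not fMRIPrep's implementation) high-pass filters nuisance-mask time series with a discrete cosine basis and retains the leading principal components that cumulatively explain 50% of the variance:

```python
import numpy as np

def compcor_components(data, tr, cutoff=128.0, var_explained=0.5):
    """Illustrative CompCor-style sketch (not fMRIPrep's implementation).

    data: (n_timepoints, n_voxels) time series within a nuisance mask.
    Returns component time series explaining `var_explained` of the variance.
    """
    n_tp = data.shape[0]
    # Discrete cosine basis with a 128 s cutoff: keep drift frequencies
    # below 1/cutoff Hz (k / (2 * n_tp * tr) < 1 / cutoff).
    n_basis = int(2 * n_tp * tr / cutoff)
    n = np.arange(n_tp)
    dct = np.column_stack(
        [np.cos(np.pi * k * (2 * n + 1) / (2 * n_tp)) for k in range(1, n_basis + 1)]
    )
    # High-pass filter by regressing out the mean and the cosine drifts.
    X = np.column_stack([np.ones(n_tp), dct])
    resid = data - X @ np.linalg.lstsq(X, data, rcond=None)[0]
    # PCA via SVD; squared singular values give per-component variance.
    u, s, _ = np.linalg.svd(resid, full_matrices=False)
    cum_var = np.cumsum(s**2) / np.sum(s**2)
    k = int(np.searchsorted(cum_var, var_explained)) + 1
    return u[:, :k]
```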

The head-motion estimates calculated in the correction step were also placed within the corresponding confounds file. The confound time series derived from head motion estimates and global signals were expanded with the inclusion of temporal derivatives and quadratic terms for each42. Frames that exceeded a threshold of 0.5 mm FD or 1.5 standardized DVARS were annotated as motion outliers. All resamplings can be performed with a single interpolation step by composing all the pertinent transformations (i.e. head-motion transform matrices, susceptibility distortion correction when available, and co-registrations to anatomical and output spaces). Gridded (volumetric) resamplings were performed using antsApplyTransforms (ANTs), configured with Lanczos interpolation to minimize the smoothing effects of other kernels43. Non-gridded (surface) resamplings were performed using mri_vol2surf (FreeSurfer).
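As an illustration of the confound expansion and outlier annotation described above (a sketch under the stated thresholds, not fMRIPrep's code), the derivatives and quadratic terms quadruple the number of regressors, and outliers follow from simple thresholding:

```python
import numpy as np

def expand_confounds(conf):
    """Add temporal derivatives and quadratic terms for each regressor.

    conf: (n_timepoints, n_regressors), e.g., motion parameters and global
    signals. Illustrative sketch, not fMRIPrep's implementation.
    """
    deriv = np.vstack([np.zeros((1, conf.shape[1])), np.diff(conf, axis=0)])
    base = np.hstack([conf, deriv])
    return np.hstack([base, base**2])  # regressors, derivatives, and their squares

def motion_outliers(fd, std_dvars, fd_thresh=0.5, dvars_thresh=1.5):
    """Flag volumes exceeding 0.5 mm FD or 1.5 standardized DVARS."""
    return (fd > fd_thresh) | (std_dvars > dvars_thresh)
```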

Many internal operations of fMRIPrep use Nilearn 0.6.244 (RRID:SCR_001362), mostly within the functional processing workflow. For more details of the pipeline, see the section corresponding to workflows in fMRIPrep’s documentation (https://fmriprep.readthedocs.io/en/latest/workflows.html).

Functional data denoising

The functional data preprocessed by fMRIPrep was then denoised using custom Python scripts. The following nuisance parameters were regressed out of the functional time series using ordinary least-squares regression: the six motion parameters and their derivatives, global signal, framewise displacement40, the first six noise components estimated by aCompCor41, and polynomial trends up to second order. All metrics of interest were computed on data denoised as described, either in volume space or in surface space. No additional spatial smoothing or temporal filtering was performed.
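The denoising scripts are shared with the dataset; their core operation amounts to the following sketch (array and variable names are hypothetical):

```python
import numpy as np

def denoise_run(bold, confounds, polyord=2):
    """Regress nuisance time series out of the data with OLS (sketch only).

    bold: (n_timepoints, n_voxels); confounds: (n_timepoints, n_regressors)
    holding the six motion parameters and derivatives, global signal,
    framewise displacement, and the first six aCompCor components.
    """
    n_tp = bold.shape[0]
    t = np.linspace(-1, 1, n_tp)
    # Polynomial trends up to second order (constant, linear, quadratic).
    trends = np.column_stack([t**p for p in range(polyord + 1)])
    X = np.hstack([trends, confounds])
    beta, *_ = np.linalg.lstsq(X, bold, rcond=None)
    return bold - X @ beta  # residuals = denoised time series
```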

Hyperalignment

We functionally aligned the data using whole-brain searchlight hyperalignment1,7,18,20. The functional data, projected to the fsaverage surface template and resampled to a low-resolution surface (10,242 vertices per hemisphere, approximately 3 mm resolution), was split into two separate datasets so that hyperalignment and the quality metrics could be computed on independent splits. The first split included runs 1–3, and the second split included runs 4 and 5. Transformation matrices were estimated for disc searchlights of 15 mm radius, ignoring vertices in the medial wall. One subject (sub-sid000009) was used as the reference subject to create the hyperalignment common space. Data was z-scored before and after hyperalignment to normalize variance.
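At its core, each searchlight in hyperalignment solves an orthogonal Procrustes problem that maps one subject's response time series onto the reference space; the full algorithm iteratively builds a common space and aggregates the searchlight transformations across the cortex (see the cited publications and the shared scripts). A minimal sketch of that core step, on synthetic data:

```python
import numpy as np
from scipy.linalg import svd

def procrustes_map(source, target):
    """Orthogonal transformation R such that source @ R approximates target.

    source, target: (n_timepoints, n_vertices) z-scored time series from one
    searchlight. Sketch of the core alignment step only.
    """
    u, _, vt = svd(source.T @ target, full_matrices=False)
    return u @ vt

# Toy usage: a subject whose responses are a rotated version of the reference.
rng = np.random.default_rng(0)
ref = rng.standard_normal((300, 40))                 # 300 TRs x 40 vertices
rotation = np.linalg.qr(rng.standard_normal((40, 40)))[0]
sub = ref @ rotation + 0.1 * rng.standard_normal((300, 40))
aligned = sub @ procrustes_map(sub, ref)             # close to ref, up to noise
```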

Estimation of temporal signal-to-noise ratio (tSNR)

We first computed tSNR for each preprocessed functional run using data in each subject's native anatomical space, without template normalization, which would smooth the data spatially and affect tSNR. For each voxel, tSNR was calculated within each run as the temporal mean divided by the temporal standard deviation45. A tSNR map was generated for each subject by computing the median tSNR across runs within each voxel. To qualitatively visualize how tSNR varied across brain areas and to generate a group tSNR map, the same analysis was performed on functional data resampled to the fsaverage surface.
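A minimal sketch of this computation, assuming preprocessed runs stored as NIfTI files (file handling in the shared scripts may differ):

```python
import numpy as np
import nibabel as nib

def subject_tsnr_map(run_files):
    """Median-across-runs tSNR map (sketch of the procedure above).

    run_files: paths to one subject's preprocessed runs in native space.
    """
    maps = []
    for fn in run_files:
        data = nib.load(fn).get_fdata()                  # (x, y, z, t)
        mean, std = data.mean(axis=-1), data.std(axis=-1)
        maps.append(np.where(std > 0, mean / std, 0.0))  # tSNR = mean / sd
    return np.median(maps, axis=0)                       # per-voxel median over runs
```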

Inter-subject correlation

Inter-Subject Correlation was computed to estimate what proportion of the brain signal in response to the movie was consistent across subjects10. The BOLD time series were projected to the fsaverage template surface so that the data were spatially matched across subjects. Each subject's time series in a cortical node was correlated with the average time series of the other 24 subjects in the same node. This generated a map quantifying the similarity of an individual subject's response to the group response. The procedure was repeated for all subjects, and a median ISC map was computed at the group level.
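A minimal sketch of this leave-one-subject-out procedure (the shared analysis scripts implement the version used here):

```python
import numpy as np

def isc_map(data):
    """Leave-one-subject-out ISC per cortical node (illustrative sketch).

    data: (n_subjects, n_timepoints, n_nodes) time series on fsaverage.
    """
    n_subj = data.shape[0]
    # z-score over time so a mean of products equals a Pearson correlation.
    z = (data - data.mean(1, keepdims=True)) / data.std(1, keepdims=True)
    maps = []
    for s in range(n_subj):
        others = z[np.arange(n_subj) != s].mean(0)      # average of the rest
        zo = (others - others.mean(0)) / others.std(0)  # re-standardize
        maps.append((z[s] * zo).mean(0))                # Pearson r per node
    return np.median(maps, axis=0)                      # group median ISC map
```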

Time-segment classification

Time-segment classification was used to estimate how much signal is available in local patterns of brain activity across subjects. First, functional data projected to the fsaverage template was hyperaligned (see Methods) with sub-sid000009 as the reference subject. We used a nearest-neighbor classifier to distinguish between 15 s segments of the movie across subjects (chance level < 0.1%). The movie segments started 1 TR apart and could overlap (see previous publications for more details18,19). Classification was performed within surface searchlights with a radius of 10 mm. The data from 24 subjects was averaged and used as the training set, and the classifier was tested on the left-out subject. This process was repeated for all 25 subjects, and a final map was created by averaging across the 25 cross-validation folds.
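The following sketch illustrates the nearest-neighbor classification within a single searchlight (a simplification of the cited procedure, with hypothetical array inputs):

```python
import numpy as np

def segment_classification_accuracy(train, test, seg_len=15):
    """Between-subject time-segment classification in one searchlight.

    train: (n_timepoints, n_features) average of the 24 training subjects'
    hyperaligned data; test: same shape for the left-out subject. Segments
    are seg_len TRs long (TR = 1 s, hence 15 s) and start every TR;
    candidates overlapping the correct segment are excluded from the
    comparison, as in the cited procedure.
    """
    n_seg = train.shape[0] - seg_len + 1

    def segments(x):
        s = np.stack([x[i:i + seg_len].ravel() for i in range(n_seg)])
        s = s - s.mean(1, keepdims=True)
        return s / np.linalg.norm(s, axis=1, keepdims=True)

    sim = segments(test) @ segments(train).T  # correlation between segments
    correct = 0
    for i in range(n_seg):
        row = sim[i].copy()
        # Exclude overlapping candidates, but keep the matching segment.
        lo, hi = max(0, i - seg_len + 1), min(n_seg, i + seg_len)
        row[lo:hi] = -np.inf
        row[i] = sim[i, i]
        correct += int(np.argmax(row) == i)
    return correct / n_seg
```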

Data Records

The raw data was standardized following the Brain Imaging Data Structure46 (BIDS, version 1.3.0) to facilitate data sharing and the use of tools such as fMRIPrep and MRIQC47. The dataset48 is available on OpenNeuro (https://doi.org/10.18112/openneuro.ds003017.v1.0.2) and can be easily downloaded using DataLad49 from http://datasets.datalad.org/?dir=/labs/gobbini. While we cannot share the raw stimuli for copyright reasons, we provide the scripts that were used to preprocess the stimuli, with all the information needed for other researchers to generate the same stimuli. We also share presentation, preprocessing, and analysis scripts in the GitHub repository (https://doi.org/10.5281/zenodo.3942173, https://github.com/mvdoc/budapest-fmri-data).
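For example, the dataset can be fetched programmatically with DataLad's Python API. This is a usage sketch; the GitHub URL assumes OpenNeuro's standard mirror naming for ds003017 and the subject ID is an arbitrary example, so both should be checked against the dataset page:

```python
import datalad.api as dl

# Clone the dataset (metadata and file stubs), then fetch actual content
# for one subject's functional runs. Paths follow the BIDS layout.
ds = dl.clone(source="https://github.com/OpenNeuroDatasets/ds003017.git",
              path="budapest-fmri")
ds.get("sub-sid000005/func/")  # assumed example subject ID
```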

Technical Validation

The dataset was validated using different metrics that quantify data quality in separate domains. We analyzed subjects' head motion to quantify potential noise in the data caused by subjects' behavior. We estimated tSNR for each voxel separately to make sure that all subjects had comparable levels of SNR and to highlight areas with low SNR. We computed Inter-Subject Correlation (ISC) as a metric that is specific to experiments with naturalistic paradigms; we consider ISC a sanity check that the stimulus generated similar brain responses across subjects. All the metrics described so far provide information about data quality at the level of single voxels or surface nodes. To quantify data quality for multivariate analyses, we functionally aligned the data using searchlight hyperalignment and performed time-segment classification across subjects.

We first quantified motion in the dataset by inspecting the motion parameters estimated by fMRIPrep (see Methods). Overall, subject motion was low. The median framewise displacement across subjects was 0.09 mm (minimum median across subjects 0.06 mm, maximum 0.19 mm; see Fig. 1). Across subjects, the median percentage of volumes marked as motion outliers by fMRIPrep was 2.72% (min 0.03%, max 22.72%), with 20 out of 25 subjects having fewer than 5% of volumes marked as outliers (fMRIPrep defines an outlier as a volume in which framewise displacement is greater than 0.5 mm or standardized DVARS is greater than 1.5; see Methods).
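For reference, framewise displacement following Power et al.40 can be computed from the realignment parameters as in this sketch (fMRIPrep computes FD internally; note that column order differs across tools, e.g., mcflirt's .par files list rotations first):

```python
import numpy as np

def framewise_displacement(motion, radius=50.0):
    """FD following Power et al.: sum of absolute backward differences.

    motion: (n_timepoints, 6) with three translations (mm) followed by
    three rotations (radians); rotations are converted to arc length on
    a 50 mm sphere. Illustrative sketch only.
    """
    params = motion.copy()
    params[:, 3:] *= radius                 # radians -> mm on a 50 mm sphere
    fd = np.abs(np.diff(params, axis=0)).sum(axis=1)
    return np.concatenate([[0.0], fd])      # first volume has FD = 0

# Volumes with fd > 0.5 would be flagged as motion outliers.
```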

Fig. 1

Framewise displacement for each subject across all runs. Subject motion was low in this dataset, as indicated by a median framewise displacement well below 0.5 mm for all subjects (median across subjects 0.09 mm, minimum median 0.06 mm, maximum 0.19 mm). Twenty out of 25 subjects had fewer than 5% of volumes marked as motion outliers (fMRIPrep defines an outlier as a volume in which framewise displacement is greater than 0.5 mm or standardized DVARS is greater than 1.5; see Methods).

We estimated temporal SNR for all subjects, both in each subject's own anatomical space (to reduce interpolations that can affect tSNR) and in the fsaverage template space for a qualitative assessment of tSNR across cortical areas. Temporal SNR is expected to vary across areas because of susceptibility artifacts, differences in anatomy across subjects, and overall subject arousal levels during the scan45. The mean whole-brain tSNR across subjects was 74.42 ± 3.91, comparable to previous datasets50,51. As expected, temporal SNR varied across areas, with higher tSNR in dorsal areas and lower tSNR in anterior temporal cortex and orbito-frontal cortex (see Fig. 2).

Fig. 2

Temporal SNR across subjects. (a) Violin plots showing tSNR values across the brain. For each subject, a tSNR map was first generated by computing the median tSNR value across runs within each voxel. This plot shows the distribution of values in the tSNR map within a brain mask, computed in each subject's volumetric anatomical space. Subjects are ordered by increasing median tSNR. Across subjects, the mean tSNR was 74.42 ± 3.91. (b) Median tSNR across subjects computed on data projected to the fsaverage template surface. As expected, areas close to air-tissue boundaries, such as the anterior temporal lobe and orbito-frontal cortex, show signal dropout, while tSNR is high across the rest of the cortex.

We used Inter-Subject Correlation to highlight areas where brain activity in response to the movie was similar across subjects. As expected for an audio-visual movie, visual and auditory areas showed the highest ISC values (see Fig. 3). In addition, areas known to process social information, such as the precuneus, temporo-parietal junction (TPJ), and medial prefrontal cortex (MPFC)52, also showed positive ISC values. We speculate that this may reflect processing of the rich social information present in the movie, but future analyses will be required to investigate what further representations are encoded in these brain areas. Note that the ISC results in Fig. 3 provide a lower bound on what can be obtained with this dataset: the analysis reported here was performed after anatomical alignment, which is known to be suboptimal for between-subject analyses such as ISC when compared to hyperalignment1.

Fig. 3

Inter-subject correlation. As expected for an audio-visual movie, visual and auditory areas showed the largest correlation in brain responses across subjects. Areas belonging to the theory-of-mind network, such as the precuneus, temporo-parietal junction (TPJ), and medial prefrontal cortex (MPFC), also showed high correlation across subjects, as did prefrontal areas, possibly highlighting the richness of the social information available in the movie used for this dataset.

Finally, we performed between-subject time-segment classification to highlight areas whose response patterns encode information shared across subjects. We first split the movie data into two independent sets (split 1: runs 1–3; split 2: runs 4 and 5). Then, we used data from one split to functionally align the subjects' data with whole-brain surface-searchlight hyperalignment1,7,18,20. The data from the other split was then used to classify 15 s time segments across participants. The process was repeated for both splits (see Fig. 4). The average searchlight classification accuracy was 16.64%, and the maximum accuracy was 71.24%, with a chance level below 0.1% (split 1: mean accuracy 18.91%, max 77.43%; split 2: mean 14.37%, max 65.05%). Classification accuracy was higher than chance across the whole cortex. The highest accuracies were found in visual and auditory cortex, but also in prefrontal and medial areas such as the precuneus and medial prefrontal cortex.

Fig. 4

Between-subject time-segment classification on hyperaligned data. The left panel (split 1) shows results obtained by hyperaligning on the first half of the data (runs 1–3) and classifying on the second half (runs 4 and 5). The right panel shows the complementary analysis, that is, hyperaligning on the second half of the data and classifying on the first half. Despite differences in absolute classification accuracy due to the different amounts of data, the results are qualitatively similar. The highest classification accuracies were found in visual and auditory areas, as well as in theory-of-mind areas such as the precuneus, TPJ, and MPFC, and in prefrontal areas.

These analyses validate the quality of this dataset for both univariate and multivariate analyses. We found evidence of good subject compliance, as reflected by low motion during scanning, and comparable tSNR levels across subjects. Inter-subject correlation and time-segment classification analyses both revealed shared information in visual and auditory areas, as well as in the default-mode network53,54, which also plays a role in theory-of-mind processes52,55,56.