Object, face, body and scene representations in the human brain

$2,499,228ZIAFY2025MHNIH

National Institute Of Mental Health

Investigators

Linked publications, trials & patents

Paper 39319569 Paper 39075326 Paper 38864574 Paper 38820560 Paper 38820554 Paper 38527075 Paper 37745325 Paper 37662312 Paper 37126552 Paper 37012369 Paper 36847339 Paper 36791506 Paper 36459788 Paper 35868861 Paper 35597052 Paper 35074880 Paper 34913412 Paper 34792643 Paper 34727232 Paper 34099511 Paper 34031795 Paper 33500275 Paper 33497776 Paper 33383478 Paper 33285563 Paper 32601459 Paper 31983637 Paper 31613926 Paper 31305238 Paper 31264693 Paper 31226012 Paper 31054350 Paper 30605658 Paper 30602785 Paper 30504281 Paper 30106372 Paper 29960086 Paper 29867413 Paper 29777825 Paper 29621615 Paper 29513219 Paper 29384473 Paper 29382711 Paper 28890416 Paper 28782682 Paper 28053326 Paper 28044013 Paper 27940212 Paper 27769727 Paper 27699208 Paper 27588001 Paper 27227362 Paper 27105060 Paper 26912997 Paper 26849447 Paper 26283982 Paper 25742710 Paper 25665965 Paper 25632124 Paper 25057839 Paper 24999321 Paper 24567402 Paper 24409134 Paper 23906351 Paper 23817547 Paper 23265839 Paper 22473894 Paper 22383904 Paper 22319606 Paper 22164143 Paper 22040738 Paper 21593316 Paper 21415848 Paper 20400357 Paper 20351021 Paper 20208528 Paper 19945336 Paper 19396166 Paper 18978780 Paper 18660838 Paper 18262829

Abstract

The goal of this research is to understand how we see what we see: how does the brain analyze the light falling on the retina of the eye to reveal a world full of people, places, and things? Recent work has focused on three major areas: 1) Understanding the multidimensional nature of objects Real-world, natural scenes are incredibly complex and heterogeneous, yet we are able to understand the nature of those scenes effortlessly â we can categorize them and identify objects and people within those scenes and plan actions toward individual elements. While prior studies have identified brain regions that appear to be specialized for processing faces, object and scenes, it remains unclear what the precise roles of these different regions are, what information they contain, and how broadly representations are distributed. One of the major challenges in understanding visual perception in the brain is the wide range of different objects and scenes that we experience. Over the last few years we have run an ongoing set of studies aimed at capturing the breadth of our understanding of the visual world, which we call the THINGS Initiative. Despite this diversity of our visual experience, studies often use a small number of hand selected object or scene categories, but it becomes unclear how representative of real-world processing the subsequent findings are. To overcome this challenge, we developed a large-scale database (THINGS) of 1,854 diverse types of objects sampled systematically from concrete picturable and nameable nouns in the American English language. The THINGS database provides a rich resource of object concepts and object images and offers a tool for both systematic and large-scale naturalistic research in the fields of psychology, neuroscience, and computer science. We used this database in large-scale behavioral experiments using online crowdsourcing, sampling over 4 million trials in more than 5,000 participants. Applying a computational model of the task, we were able to identify 66 core dimensions of our internal mental representations of objects, providing a comprehensive and fine-grained characterization of this object representational space. In addition to the behavioral data we also collected extensive functional MRI and MEG data while people viewed images from the THINGS database. The model we developed based on the previously collected behavioral responses (described above) provides a critical framework for investigating the underlying neural representations of objects. Analysis of the fMRI data revealed that the individual dimensions are represented broadly across visual cortex, including both early visual and high-level cortex, as well in parietal and frontal regions. Further, these behaviorally relevant dimensions were superior to object category information at predicting cortical brain responses. Analysis of the MEG data revealed that the individual dimensions are rapidly represented, starting around 80-100 ms, with different dimensions showing different temporal dynamics that may reflect the representation of visual versus conceptual features of the objects. Further, early responses tended to generalize across subjects, suggesting they are more stimulus-driven, while later responses showed less generalization, suggesting they reflect more subject-specific representations that might be linked to individual experience. Beyond exploratory studies as part of the THINGS Initiative, we have also probed highly specific questions. First, we investigated how physical experience with real objects affects the representation of images of those objects. Prior to entering the MRI scanner, participants handled a set of objects for 30 seconds each. Then, while they were in the scanner they were shown heterogeneous images of those same objects as well as matched set of images for unexplored objects. We found that exploration of the objects had no detectable impact on visual cortex, but led to much stronger activation in medial parietal regions that have often been linked to episodic memory and explicit memory tasks. This work highlights the importance of considering real world objects and not just images of novel objects for revealing how our understanding of the natural world develops. 2) Impact of eye movements on object processing To successfully navigate our environment, we move our eyes every ~250 ms to bring items of interest from our peripheral vision to central vision. With each eye-movement, our visual system receives a new perspective, so how does the brain cope with this continuous update of visual information? Here, we examine if information in our peripheral vision is maintained across an eye-movement to interact with neural processing directly after the eye-movement, demonstrating an extraordinary efficiency of the visual system. Specifically, participants were asked to make eye-movements while viewing images on a screen in an MRI scanner. During an eye-movement the image on the screen (face or building) could either 1) stay the same, or 2) change. When the image changed our ability to determine what the visual system was processing was degraded relative to when the image stayed the same. This indicates that there must have been transference of feature information from prior to the eye-movement which impacts processing after the eye-movement. This study demonstrates the visual system is maintaining information across eye-movements to facilitate processing after an eye-movement. Characterizing neurotypical processing across eye movements in the visual system is critical as it is hypothesized these mechanisms are atypical in clinical populations (such as individuals with Schizophrenia). Using neurotypical data, we can pinpoint exactly when there is divergence with atypical behavior and neural processing to facilitate eventual development of rehabilitation techniques. 3) Replicating the effects of non-invasive brain stimulation Non-invasive brain stimulation (NIBS) techniques have the potential to demonstrate the causal impact of targeted brain regions on specific behaviors. Transcranial random noise stimulation (tRNS) is one form of transcranial electric stimulation in which an alternating current is passed between electrodes at random frequencies. High-frequency tRNS (hf-tRNS) is thought to enhance excitability and have facilitatory effects on behavior, demonstrated in healthy and clinical populations. Due to the potential application of tRNS, clear demonstrations of the efficacy and replicability of stimulation are critical. Here, we focused on replicating the facilitatory effect of hf-tRNS over the human middle temporal complex (hMT+) on contralateral motion processing. In prior work, the improvement in performance was specific to global motion processing in the visual field opposite to stimulation. However, our results indicated that hf-tRNS does not improve motion discrimination with n = 42. Specifically, we were unable to replicate a global motion processing facilitation following hf-tRNS to hMT+. Although our lack of replication could be due to minor changes in the protocol from the original study, for hf-tRNS to become a widely applied method, its modulatory effect should be robust to slight adjustments to the procedure. Collectively, these studies provide important information about the cognitive processes supporting our understanding of the visual world. Elucidating how the brain enables us to recognize objects, scenes, faces and bodies provides important insights into the nature of our internal representations of the world around us. Understanding these representations is vital to identify and characterize the underlying deficits in many mental health and neurological disorders.

View original record on NIH RePORTER →