- Mohamed Gallai
- Oam Khatavkar
- Pranavi Kolouju
- Ramez Masad
The human brain is complex, and it's activity can provide key insight into how we perceive and process the visuals of our natural world. However, it is an impossible feat to know what visual cues the human brain is interpreting with only medical imaging data. In this project, we will utalize fMRI data to reconstruct high resolution images accompanied by rich text captions. By coupling a diffusion model with the CLIP image encoder, we will be able to produce these results. Through this approach, aim to build a tool that can help "explain" the connection between our neural and visual systems using bimodal representations.
-
Generic Object Decoding
- Horikawa, T., & Kamitani, Y. (2019)
- fMRI scans from 5 subjects viewing images across multiple categories from ImageNet, performing a one-back image repetition task.
- Read here
-
BOLD5000
- Chang, N., Pyles, J., Gupta, A., Tarr, M., & Aminoff, E. (2018)
- fMRI scans from 4 subjects viewing 5,000 unique scene stimuli, performing a simple valence task.
- Read here
- Our datasets are uploaded onto Microsoft Azure because they include large amounts of data. We were not able to upload the data on GitHub directly due to the limits on the upload size. The data is accessed publicly through Azure's Blob Service Endpoint URL.
-
Identifying Natural Images from Human Brain Activity
- Kay, K., Naselaris, T., Prenger, R., & Gallant, J. L. (2008)
- Discusses how receptive-field models and fMRI data can be used to decode natural images viewed by individuals.
- Read here
-
High-resolution Image Reconstruction with Latent Diffusion Models from Human Brain Activity
- Takagi, Y., & Nishimoto, S. (2024)
- High-resolution image reconstruction from fMRI images using latent diffusion models without training or fine-tuning.
- Read here
-
Improving Visual Image Reconstruction using Diffusion Models
- Takagi, Y., & Nishimoto, S. (2024)
- Proposes the DiffMSR model with a Prior-Guide Large Window Transformer decoder.
- Read here
-
Controllable Mind Visual Diffusion Model
- Zeng, B., Li, S., Liu, X., et al. (2024)
- Introduces the Controllable Mind Visual Diffusion Model for extracting semantic and silhouette information from fMRI images.
- Read here