Accès libre

EEGVision: Reconstructing vision from human brain signals

   | 05 août 2024
À propos de cet article

Citez

The intricate mechanisms elucidating the interplay between human visual perceptions and cognitive processes remain elusive. Exploring and reconstructing visual stimuli from cerebral signals could help us better understand the processes by which the human brain generates visual imagery. However, the inherent complexity and significant noise in brain signals limit current efforts to reconstruct visual stimuli, resulting in low-granularity images that miss details. To address these challenges, this paper proposes EEGVision, a comprehensive framework for generating high-quality images directly from brain signals. Leveraging the recent strides in multi-modal models within the realm of deep learning, it is now feasible to bridge the gap between EEG data and visual representation. This process starts with a time-frequency fusion encoder in EEGVision, which quickly pulls out cross-domain and robust features from EEG signals. We then design two parallel pipelines to align EEG embeddings with image features at both perceptual and semantic levels. The process uses a stable diffusion-trained image-to-image pipeline that combines coarse and fine-grained data to get high-quality images back from EEG data. Both quantitative and qualitative assessments affirm that EEGVision surpasses contemporary benchmarks. This network architecture holds promise for further applications in the domain of neuroscience, aiming to unravel the genesis of human visual perception mechanisms. All code is accessible via https://github.com/AvancierGuo/EEGVision.

eISSN:
2444-8656
Langue:
Anglais
Périodicité:
Volume Open
Sujets de la revue:
Life Sciences, other, Mathematics, Applied Mathematics, General Mathematics, Physics