Optimization Driven Variational Autoencoder GAN for Artifact Reduction in EEG Signals for Improved Neurological Disorder and Disability Assessment
Published Online: Feb 24, 2025
Page range: 10 - 14
Received: Jun 04, 2024
Accepted: Jan 16, 2025
DOI: https://doi.org/10.2478/msr-2025-0002
Keywords
© 2025 Mohamed Yacin Sikkandar et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Numerous studies have shown that neurological problems are increasing at an alarming rate. The WHO states that one in four people worldwide will experience neurological problems at some point in their lives [1]. Neurological diseases are the second most common disease worldwide after ischemic heart disease. Neurological diseases affect both the brain and the nervous system of the human body [2]. Many neurological disorders are well-documented and rather prevalent, while many others are uncommon. Neurological difficulties include a variety of conditions, including epilepsy, learning disabilities, neuromuscular disorders, autism, Alzheimer's disease, attention deficit hyperactive disorder (ADHD), multiple sclerosis, Parkinson's disease, sleep problems, and cerebral palsy. Mental illnesses are classified as “psychiatric diseases” and are primarily characterized by abnormalities in cognition, emotion, or behavior that lead to suffering or functional impairment. A variety of brain-imaging modalities are available to diagnose neurological disorders, including positron emission tomography (PET), near infrared spectroscopy (NIRS), magnetoencephalography (MEG), electroencephalography (EEG), and functional magnetic resonance imaging (fMRI) [3]. This paper emphasizes EEG analysis due to its cost-effectiveness, non-invasiveness, and portability, making it a widely used approach. An EEG systematically monitors and records the brain's electrical activity to assess the cerebral processes. Many studies use EEG data to detect neurological diseases, neurodevelopmental problems, acute neurological events, and patient behavior [4]-[5]. Traumatic brain injury is the leading cause of disability and death in children worldwide. Over five million Americans are disabled as a result of a traumatic brain injury. Researchers believe that a computer-aided diagnosis (CAD) system trained on extensive patient data and physiologic signals and images using advanced signal processing and AI/ML techniques can help neurologists, neurosurgeons, radiologists, and other medical professionals improve clinical decision-making. Research in this area has increased significantly over the last ten years.
In [6], a generalized EEG neural network (GENet) architecture is developed based on a convolutional neural network, which is able to identify various neurological disorders based on EEG data. This paradigm facilitates the execution of the essential functions for the categorization process. In [7], a deep neural networks (DNN)-based hybrid ensemble feature selection (HEFS) Framework for Parkinson's disease identification was proposed. Multi-level dimensionality reduction (MLDR) is applied to HEFS matrices. After normalizing the matrix scores, merging the scores, reconstructing a new dataset, and reducing the features using neighborhood component analysis (NCA), an accuracy rate of 97.08 % and an F1-score of 98.10 % were achieved. In [8], a unique expert system was introduced that utilizes just EEG information for the early diagnosis of schizophrenia. A deep learning network was developed to improve the accuracy of the image categorization outcomes.
In [9]–[11], the use of variable-frequency complex demodulation (VFCDM) and convolutional neural networks (CNN) to differentiate between healthy, interictal, and ictal states was investigated using EEG data. Time frequency spectrum (TFS) shows frequency changes across different states that correspond to fluctuations in brain activity. The LOSO CV method routinely achieves good performance, ranging from 90 % to 99 % across different combinations of healthy and epileptic states. In [12]–[14], the EEG temporal spatial network (ETSNet) is introduced, which includes a Squeeze-and-Excitation Block and several CNNs tailored for eyes-open and closed resting states. Several limitations are evident form the above studies:
The lack of standardized assessment measures and datasets makes comparison difficult. The computational complexity of some deep learning models such as CNN with long short term memory (LSTM) may limit their practical use.
An optimization-enhanced variational autoencoder generative adversarial network (OE-VAE-GAN) for artifact reduction in EEG signals could be a robust approach for cleaning EEG data, especially in clinical and research contexts where artifact presence (e.g., due to muscle movements, eye blinks, or ambient noise) compromises the quality of data analysis, as shown in Fig. 1.

BSO-VAE-GAN architecture for artifact reduction.
First define the low-pass filter (
The samplers
The Gaussian distribution
A GAN consists of two CNNs, a generator and a discriminator, with opposing conditional arguments. In the discriminator, we used the Patch-GAN to classify each patch as true or generated. The discriminator should punish local signal patches to accurately mimic high-frequency components. The GAN training total loss function is:
The generator (G) minimizes the loss function
In this section, both study datasets are described. The CHB-MIT dataset [12] initially included 22 participants: 17 women aged 1.5–19 years and 5 men aged 3–22 years. The collection includes 198 seizures and 969 hours of EEG recordings. The number of seizures is lower than the number of seizure-free signals. The second dataset, KAU, was obtained at 256 Hz from two male scalp EEG patients aged 28 years. This dataset is similar to the CHB-MIT dataset. The subjects' ages were considered. The individuals in the CHB-MIT dataset are similar to these two cases. In both datasets, an age range of 1–28 years was chosen. This is significant as age considerably affects the clinical and electroencephalographic features of seizures [13]. Both individuals had 38-channel EEGs. They had two 495 s seizures and four 417 s seizures, respectively. The CHB-MIT dataset selects 18 out of 23 channels because they are similar to all recordings.
The time-step value in this study varied from 0 to
The
Table 1 shows the accuracy and error calculation for the proposed BrOpt_VAGAN method in terms of pseudo-clean and noisy input.
Accuracy performance of the proposed BrOpt_VAGAN model.
Mixtures of artifact components | Accuracy [%] | Error [%] | |
---|---|---|---|
Pseudo-clean | brain | 98.5 | 12.41 |
eye | 96.2 | 11.53 | |
muscle | 97.3 | 12.74 | |
Noisy input | brain | 98.6 | 11.84 |
eye | 95.9 | 11.90 | |
muscle | 93.5 | 12.56 |
Fig. 2 shows that BrOpt_VAGAN consistently achieves the lowest

Comparison of
Fig. 3 shows the

Comparison of
Fig. 4 compares the

Comparison of
The proposed study presents a BrOpt_VAGAN framework for automated classification of neurological disorders from raw EEG data. The experiments are performed with BrOpt_VAGAN, a publicly available benchmark dataset. The experiments were conducted under closed and open-eye conditions, as recommended in the CHB-MIT dataset publication and other research papers. The results show that the proposed method can improve the maximum performance by 98.7 % accuracy on the specified dataset using multiple channels. The performance improvement is shown for five-class classification, which confirms the effectiveness and efficiency of the BrOpt_VAGAN framework. Therefore, future work will focus on exploring other loss functions tailored to imbalanced data or incorporating ensemble methods that can also lead to improved accuracy. Furthermore, fine-tuning the self-supervised learning approach with larger and more diverse datasets could lead to better generalization and make the method more reliable in practical applications.