Diagnostic systems of cardiac arrhythmias face early and accurate detection challenges due to the overlap of electrocardiogram (ECG) patterns. Additionally, these systems must manage a huge number of features. This paper proposes a new classifier Kullback-Leibler classifier (KLC) that combines feature optimization and probabilistic Kullback-Leibler (KL) divergence. Particle swarm optimization (PSO) is used for optimizing the features of ECG data, while KL divergence counts the variance between training and testing probability distributions. The proposed framework led the new classifier to distinguish normal and abnormal rhythms accurately. MIT-BIH Standard Arrhythmia Dataset (MIT-BIH) is used to test the validity of the proposed model. The experimental results show the proposed classifier achieves results in precision (86.67%), recall (86.67%), and F1_Score (86.5%).
Fractal feature selection model for enhancing high-dimensional biological problems
BMC bioinformatics
Vol. 25
Issue 1
100249
2024
Fractal feature selection model for enhancing high-dimensional biological problems
Ali Hakem Alsaeedi, Haider Hameed R Al-Mahmood, Zainab Fahad Alnaseri, Mohammad R Aziz, Dhiah Al-Shammary, Ayman Ibaida, Khandakar Ahmed
The integration of biology, computer science, and statistics has given rise to the interdisciplinary field of bioinformatics, which aims to decode biological intricacies. It produces extensive and diverse features, presenting an enormous challenge in classifying bioinformatic problems. Therefore, an intelligent bioinformatics classification system must select the most relevant features to enhance machine learning performance. This paper proposes a feature selection model based on the fractal concept to improve the performance of intelligent systems in classifying high-dimensional biological problems. The proposed fractal feature selection (FFS) model divides features into blocks, measures the similarity between blocks using root mean square error (RMSE), and determines the importance of features based on low RMSE. The proposed FFS is tested and evaluated over ten high-dimensional bioinformatics datasets. The experiment results showed that the model significantly improved machine learning accuracy. The average accuracy rate was 79% with full features in machine learning algorithms, while FFS delivered promising results with an accuracy rate of 94%.
A novel brain EEG clustering based on Minkowski distance to improve intelligent epilepsy diagnosis
Informatics in Medicine Unlocked
Vol. 47
Issue 1
101492
2024
A novel brain EEG clustering based on Minkowski distance to improve intelligent epilepsy diagnosis
Dhiah Al-Shammary, Ekram Hakem, Ahmed M Mahdi, Ayman Ibaida, Khandakar Ahmed
This paper introduces a novel clustering approach based on Minkowski's mathematical similarity to improve EEG feature selection for classification and have efficient Particle Swarm Optimization (PSO) in the context of machine learning. Given the intricacy of high-dimensional medical datasets, feature selection plays a critical role in preventing disease and promoting public health. By employing Minkowski clustering, the objective is to group dataset records into two clusters exhibiting high feature coherence, thereby improving accuracy by applying optimization techniques like PSO to select the most optimal features. Furthermore, the proposed model can be extended to intelligent datasets, including EEG and others. As fewer features are needed for precise categorization, intelligent feature selection is an advanced step of machine learning. This paper investigates the key factors influencing feature selection in the EEG Bonn University dataset. The proposed system is compared against various optimization and feature selection methods, demonstrating superior performance in analyzing and classifying EEG signals based on accuracy measures. The experimental results have confirmed the effectiveness of the suggested model as a valuable tool for EEG data classification, achieving up to 100% accuracy. The outcomes of this research have the potential to benefit medical experts in related specialties by streamlining the process of identifying and diagnosing brain disorders. Technically, the machine learning algorithms RF, KNN, SVM, NB, and DT are employed to classify the selected features.