Название | Predicting Heart Failure |
---|---|
Автор произведения | Группа авторов |
Жанр | Медицина |
Серия | |
Издательство | Медицина |
Год выпуска | 0 |
isbn | 9781119813033 |
1.6.2.2 Unsupervised Learning
Unsupervised learning works with untagged data and its purpose is to create clusters based on the characteristics of the data. Unlike supervised learning, untagged data is used instead of labeled data. After the data are divided into groups according to their similarity or distance, labeling is done with the help of an expert. Two applications that stand out in unsupervised learning are clustering and association rule mining. Clustering is the assignment of data points to groups called clusters. It has two types: partitioned and hierarchical methods. In partitioned clustering, a data point can only be in one cluster. In hierarchical clustering, a point can be hierarchically located in more than one cluster. In association rules mining, association rules focused on finding rules based on relationships between events are used in mining relationships between attributes.
1.6.2.2.1 K-Means
The K-means segmentation clustering algorithm was first developed in 1967 by MacQueen [32]. The purpose of the algorithm is to divide the data into K clusters. Each cluster is presented with a center of gravity named centroid. The K value is determined by the user. An iterative method is used to divide the data into clusters. The distance function obtains the point to which each data point will be assigned.
1.6.2.2.2 Apriori Algorithm
The apriori algorithm is the prominent algorithm in association rules mining. It finds common patterns in the transaction database and performs rule generation [33]. With the help of the obtained rules, the occurrence of another event can be predicted after an event occurs. It is a frequently preferred algorithm especially because it helps to establish relationships between events.
1.6.3 Machine Learning Supported HF Studies
Machine learning, the most common application of artificial intelligence, reveals patterns in data by continuously improving the ability to learn from data and the prediction and diagnosis of cardiovascular disease [34]. When the machine learning based diagnosis system of HF is considered as input, process, and output modules, the modules can be presented as follows. The input module contains data to be used by the decision support system, such as physical examination data, laboratory results, clinical data, ECG monitoring data, and electrocardiography data. The transaction module is the module that contains machine learning algorithms, which are mainly supervised and unsupervised learning algorithms. In diagnosing HF the machine learning algorithms currently used include nearest neighbor, self-organizing maps, multilayer perceptron, classification and regression trees, random forests, SVMs, neural networks, logistic regression, decision trees, clustering, and fuzzy-genetic and neuro-fuzzy expert systems. In the output module, information such as the presence of HF, risk of HF events, evaluation of left ventricular deterioration, response to advanced therapies, and risk of death is attempted to be determined.
When the literature on machine learning methods (Table 1.2), which is an important option in diagnosing HF, is examined, it will be seen that the use of HRV stands out in many studies. In one of the case studies, Yang et al. [35] used a scoring method to diagnose HF. In the study, with the help of two SVM models, it was first checked whether the person has HF. If the result was normal, the second SVM model came into play and classified the person being examined as healthy or prone to HF. The scores were matched with the SVM model outputs and diagnostic outputs were obtained according to the score ranges.
The aim of the study by Son et al. [36] was to distinguish between CHF and shortness of breath problems. The study was initially made with 72 features; rough sets and logistic regression techniques were used to reduce the number of variables. The accuracy of the classification obtained according to the features selected with the help of coarse clusters was 97.5%, and the classification accuracy obtained with the features selected based on logistic regression was measured as 88.7%.
Masetic et al. 2016 [37] applied the random forest algorithm to ECG time series to detect CHF. The features on the ECG were extracted using the autoregressive Burg method. In the study, apart from the random forest algorithm, C4.5, SVM, ANN, and k-NN classifiers were used with the random forest algorithm giving the best performance.
Wu et al. [38] studied detecting HF prior to clinical diagnosis. Information such as electronic health records, health behavior, demographic data, clinical diagnosis, and clinical precautions were used to detect the disease in advance. SVM, boosting, and logistic regression were used for early detection of the disease. In addition, the contribution of feature selection to success was observed.
Aljaaf et al. [39] proposed a multilevel risk assessment for developing HF. With the help of the C4.5 classifier, estimates were made according to five different risk levels (1: No risk; 2: Low risk; 3: Moderate risk; 4: High risk; 5: Extremely high risk). The Cleveland heart disease data set was used in the study. A 10-fold cross-validation procedure was followed to evaluate the C4.5 classifier.
Zheng et al. [40] proposed a computer-aided diagnostic system for diagnosing HF. This system uses least-squares SVM (LS-SVM). The LS-SVM classifier gave better results than neural nets and hidden Markov models.
Pattekari et al. [41] designed a Naive Bayes-based smart system and developed a decision support system for HF prediction. With the web-based application, users were asked predefined questions and the estimation process was carried out by comparing their answers with the database.
Takcı [42] introduced a framework for the diagnosis of heart attack. In his study, in which the most successful classifier combination was sought with 12 different algorithms and four different feature selection methods, the most successful classifier was SVM using the linear kernel and the most successful feature selection method was the ReliefF algorithm. The obtained classification accuracy was reported as 84.81%.
Non-invasive techniques, such as electrocardiography, or invasive techniques, such as blood tests, which are used to diagnose HF, also measure irregularities in values. Imbalances and anomalies are measured with artificial intelligence techniques, such as the process performed with existing diagnostic techniques. Previously used conventional diagnostic techniques work by increasing capacity with the support of artificial intelligence. For example, it will be possible to increase the accuracy of diagnosis thanks to electrocardiography supported by artificial intelligence.
Table 1.2 Literature summary for artificial intelligence and machine learning techniques in HF.
Author | Method | Study |
---|---|---|
Guidi et al. [24] | ANN, SVM, decision tree, fuzzy genetic algorithm | Clinical decision support system for HF |
Elfadil et al. [25] | Neural nets and spectral analysis | HF patients grouping |
Gharehchopoghi et al. [26] | ANNs | Decision support system for HF |
Candelieri et al. [27] | Decision tree | To determine patient stabilization |
|