TY - JOUR
T1 - Simultaneous Feature Selection and Support Vector Machine Optimization Using the Grasshopper Optimization Algorithm
AU - Aljarah, Ibrahim
AU - Al-Zoubi, Ala’ M.
AU - Faris, Hossam
AU - Hassonah, Mohammad A.
AU - Mirjalili, Seyedali
AU - Saadeh, Heba
PY - 2018/6/1
Y1 - 2018/6/1
N2 - Support vector machine (SVM) is considered to be one of the most powerful learning algorithms and is used for a wide range of real-world applications. The efficiency of SVM algorithm and its performance mainly depends on the kernel type and its parameters. Furthermore, the feature subset selection that is used to train the SVM model is another important factor that has a major influence on it classification accuracy. The feature subset selection is a very important step in machine learning, specially when dealing with high-dimensional data sets. Most of the previous researches handled these important factors separately. In this paper, we propose a hybrid approach based on the Grasshopper optimisation algorithm (GOA), which is a recent algorithm inspired by the biological behavior shown in swarms of grasshoppers. The goal of the proposed approach is to optimize the parameters of the SVM model, and locate the best features subset simultaneously. Eighteen low- and high-dimensional benchmark data sets are used to evaluate the accuracy of the proposed approach. For verification, the proposed approach is compared with seven well-regarded algorithms. Furthermore, the proposed approach is compared with grid search, which is the most popular technique for tuning SVM parameters. The experimental results show that the proposed approach outperforms all of the other techniques in most of the data sets in terms of classification accuracy, while minimizing the number of selected features.
AB - Support vector machine (SVM) is considered to be one of the most powerful learning algorithms and is used for a wide range of real-world applications. The efficiency of SVM algorithm and its performance mainly depends on the kernel type and its parameters. Furthermore, the feature subset selection that is used to train the SVM model is another important factor that has a major influence on it classification accuracy. The feature subset selection is a very important step in machine learning, specially when dealing with high-dimensional data sets. Most of the previous researches handled these important factors separately. In this paper, we propose a hybrid approach based on the Grasshopper optimisation algorithm (GOA), which is a recent algorithm inspired by the biological behavior shown in swarms of grasshoppers. The goal of the proposed approach is to optimize the parameters of the SVM model, and locate the best features subset simultaneously. Eighteen low- and high-dimensional benchmark data sets are used to evaluate the accuracy of the proposed approach. For verification, the proposed approach is compared with seven well-regarded algorithms. Furthermore, the proposed approach is compared with grid search, which is the most popular technique for tuning SVM parameters. The experimental results show that the proposed approach outperforms all of the other techniques in most of the data sets in terms of classification accuracy, while minimizing the number of selected features.
KW - Feature selection
KW - GOA
KW - Grasshopper optimization algorithm
KW - Metaheuristics
KW - Optimisation
KW - Support vector machine
KW - SVM
UR - http://www.scopus.com/inward/record.url?scp=85040796296&partnerID=8YFLogxK
U2 - 10.1007/s12559-017-9542-9
DO - 10.1007/s12559-017-9542-9
M3 - Article
AN - SCOPUS:85040796296
SN - 1866-9956
VL - 10
SP - 478
EP - 495
JO - Cognitive Computation
JF - Cognitive Computation
IS - 3
ER -