Jian Liu
Click:
The Founding Time:..
The Last Update Time:..
· Scan attention
·Paper Publications
Journal: Journal of Instrumentation
Key Words: Data Cleaning, Machine Learning ECEI, Tokamak
Abstract: A new data cleaning procedure for the electron cyclotron emission imaging (ECEI) of the EAST tokamak is developed. Machine learning techniques, including support vector machine (SVM) and Decision Trees, are applied to the identification of saturated, zero, and weak signals of the ECEI raw data. As a result, the burden of data analysis is reduced, and the classification accuracy is improved. Proper training sets are sampled using the massive raw ECEI data from the EAST tokamak. The optimal window size of temporal signals, the kernel function, and other model parameters are obtained by the model training. Five-fold cross-validation (CV) is applied during modeling and an external testing set is employed to validate the prediction performance of models. The average recall rates on CV sets of saturated, zero, and weak signals are 95.9%, 96.72%, and 100%, respectively, which prove the accuracy of this procedure. Random Forest, as a comparative method, is also employed to deal with the same data sets. The average recall rates on CV sets of saturated, zero, and weak signals performed by Random Forest are 95.9%, 96.72%, and 95.88%. Our method has been proved to outperform Random Forest with small data sets.
Co-author: C Li,Ting Lan,Yulei Wang,Jian Liu,Jinlin Xie,Tao Lan,Hong LI,Hong Qin
Discipline: Natural Science
Volume: 13
Page Number: :P10029
Translation or Not: no
Date of Publication: 2018-10-24
Included Journals: SCI
Links to published journals: https://iopscience.iop.org/article/10.1088/1748-0221/13/10/P10029