个人信息Personal Information
副教授
教师拼音名称:Liu Jian
电子邮箱:
学历:博士研究生毕业
办公地点:西区科技实验东楼1402
学位:博士
毕业院校:北京大学
An automatic data cleaning procedure for electron cyclotron emission imaging on EAST tokamak using machine learning algorithm
点击次数:
发表刊物:Journal of Instrumentation
关键字:Data Cleaning, Machine Learning ECEI, Tokamak
摘要:A new data cleaning procedure for the electron cyclotron emission imaging (ECEI) of the EAST tokamak is developed. Machine learning techniques, including support vector machine (SVM) and Decision Trees, are applied to the identification of saturated, zero, and weak signals of the ECEI raw data. As a result, the burden of data analysis is reduced, and the classification accuracy is improved. Proper training sets are sampled using the massive raw ECEI data from the EAST tokamak. The optimal window size of temporal signals, the kernel function, and other model parameters are obtained by the model training. Five-fold cross-validation (CV) is applied during modeling and an external testing set is employed to validate the prediction performance of models. The average recall rates on CV sets of saturated, zero, and weak signals are 95.9%, 96.72%, and 100%, respectively, which prove the accuracy of this procedure. Random Forest, as a comparative method, is also employed to deal with the same data sets. The average recall rates on CV sets of saturated, zero, and weak signals performed by Random Forest are 95.9%, 96.72%, and 95.88%. Our method has been proved to outperform Random Forest with small data sets.
合写作者:C Li,Ting Lan,Yulei Wang,Jian Liu,Jinlin Xie,Tao Lan,Hong LI,Hong Qin
学科门类:理学
卷号:13
页面范围::P10029
是否译文:否
发表时间:2018-10-24
收录刊物:SCI
发布期刊链接:https://iopscience.iop.org/article/10.1088/1748-0221/13/10/P10029