Jian Liu

Click:

The Founding Time:..

The Last Update Time:..

· Scan attention

·Paper Publications

Current position: Home > Scientific Research > Paper Publications
An automatic data cleaning procedure for electron cyclotron emission imaging on EAST tokamak using machine learning algorithm
Release time:2022-05-10  Hits:

Journal: Journal of Instrumentation

Key Words: Data Cleaning, Machine Learning ECEI, Tokamak

Abstract: A new data cleaning procedure for the electron cyclotron emission imaging (ECEI) of the EAST tokamak is developed. Machine learning techniques, including support vector machine (SVM) and Decision Trees, are applied to the identification of saturated, zero, and weak signals of the ECEI raw data. As a result, the burden of data analysis is reduced, and the classification accuracy is improved. Proper training sets are sampled using the massive raw ECEI data from the EAST tokamak. The optimal window size of temporal signals, the kernel function, and other model parameters are obtained by the model training. Five-fold cross-validation (CV) is applied during modeling and an external testing set is employed to validate the prediction performance of models. The average recall rates on CV sets of saturated, zero, and weak signals are 95.9%, 96.72%, and 100%, respectively, which prove the accuracy of this procedure. Random Forest, as a comparative method, is also employed to deal with the same data sets. The average recall rates on CV sets of saturated, zero, and weak signals performed by Random Forest are 95.9%, 96.72%, and 95.88%. Our method has been proved to outperform Random Forest with small data sets.

Co-author: C Li,Ting Lan,Yulei Wang,Jian Liu,Jinlin Xie,Tao Lan,Hong LI,Hong Qin

Discipline: Natural Science

Volume: 13

Page Number: :P10029

Translation or Not: no

Date of Publication: 2018-10-24

Included Journals: SCI

Links to published journals: https://iopscience.iop.org/article/10.1088/1748-0221/13/10/P10029