Cleaning method for abnormal energy big data based on sparse self-coding

Sci Rep. 2024 Oct 14;14(1):24016. doi: 10.1038/s41598-024-74444-y.

Abstract

In order to reduce the interference of abnormal energy big data and improve the accuracy of anomaly cleaning and detection, this paper proposes a method for cleaning abnormal energy big database based on sparse self-coding. Firstly, the abnormal data detection method based on multi-criteria evaluation is used to analyze the spectral feature distribution of abnormal energy big data. By using chaotic time series reconstruction model, robust local weighted regression analysis and sparse self-coding method, the feature decomposition of time series of energy data is realized. Then, according to the periodicity of the original sequence, the original sequence is segmitated adaptively to represent the morphological characteristics of the abnormal energy data sub-sequences driven by dynamic carbon emissions, and the anomaly index of each sub-sequence is obtained by AFCM algorithm. Secondly, an energy anomaly evaluation model based on LOF value is established. Finally, the output decision function of data cleaning is constructed to realize the abnormal energy big data cleaning. The test results show that the error detection rate of this method is 0.24%, the missing detection rate is 0.27%, the cleaning rate can reach 99.49%, and the cleaning time is less than 2s.

Keywords: Abnormal energy big data; Abnormal wave crest; Cleaning; Dynamic driving of carbon emission; Sparse self-coding.