Journal of Harbin Institute of Technology(New Series)

An Improved Wind Turbine Bearing Fault Diagnosis Method Based on POSGMD and ICNN Under Strong Noise Scenarios

doi: 10.11916/j.issn.1005-9113.2024102

Weizhong Zhang ， Xiaoan Yan ， Maoyou Ye ， Xing Hua ， Dong Jiang

School of Mechatronics Engineering, Nanjing Forestry University, Nanjing 210037 , China

Funds: Sponsored by Jiangsu Association for Science and Technology Youth Talent Support Project (Grant No. JSTJ-2024-031), National Natural Science Foundation of China (Grant No. 52005265), and Natural Science Fund for Colleges and Universities in Jiangsu Province (Grant No. 20KJB460002).

Detailed information

Corresponding author

Xiaoan Yan, Ph.D, Associtate professor.E-mail: yanxiaoan89@sina.com.

CLC number: TM315,TH133

Document code: A

Article ID: 1005-9113(2026)01-0001-19

Abstract

Owing to the harsh conditions, wind turbine bearings are prone to faults, and the resulting fault information is easily submerged by strong noise disturbance, making conventional diagnosis challenging. Therefore, this study presents an innovative bearing fault diagnosis approach predicated on Parameter-Optimized Symplectic Geometry Mode Decomposition (POSGMD) and Improved Convolutional Neural Network (ICNN). Firstly, assisted by the relative entropy-based adaptive selection of embedding dimension, a POSGMD is presented to adaptively decompose the collected bearing vibration signals into various Symplectic Geometry Components (SGC), which can solve the problem of manual selection of the embedding dimension in the raw Symplectic Geometry Mode Decomposition (SGMD). Meanwhile, the signal reconstruction on the decomposed SGC is conducted based on kurtosis-weighted principle to obtain the reconstructed signals. Subsequently, the Continuous Wavelet Transform (CWT) of the reconstructed signals is calculated to generate the corresponding time-frequency images as sample set. Finally, an ICNN is introduced for model training and automatic recognition of bearing faults. Two case studies are used to validate the presented method's efficacy. Comparing the presented method with traditional fault diagnosis methods, experimental results show that it can achieve greater identification accuracy and superior anti-noise resilience. This work provides a practical and effective solution for fault diagnosis in wind turbine bearings, contributing to the timely detection of faults and the reliable operation of wind turbines or other rotational machinery in industrial applications.

Keywords

symplectic geometry mode decomposition / convolutional neural network / deep learning / rolling bearing / fault diagnosis / anti-noise robustness

0 Introduction 1 POSGMD 1.1 SGMD 1.2 POSGMD 2 ICNN 2.1 Traditional CNN 2.2 ICNN 3 POSGMD and ICNN⁃Based Rolling Bearing Fault Diagnostic Method 4 Experimental Verification 4.1 Case 1:Bearing Fault Diagnosis of CWRU Data 4.1.1 Introduction to the benchmark dataset 4.1.2 Fault diagnosis results of the proposed method 4.1.3 Analysis of ablation experiments 4.1.4 Comparative analysis with other representative fault diagnosis methods 4.2 Case2： Bearing Fault Diagnosis of Laboratory Data 4.2.1 Introduction to the experimental dataset 4.2.2 Fault diagnosis results of the proposed method 4.2.3 Analysis of ablation experiments 4.2.4 Comparative analysis with other similar fault diagnosis methods 4.3 Future Research Discussion 5 Conclusions

0 Introduction

As an integral mechanical element within a wind power generator, wind turbine bearings are extensively used to minimize frictional losses in wind power equipment. However, prolonged high-speed operation can lead to various degrees and types of damage in wind turbine bearings, resulting in wind power equipment malfunctions and operational disruptions ^[1]. Moreover, wind turbine bearing faults not only inflict significant economic losses on factories but also pose a serious threat to wind farm enterprise staff safety. Consequently, the development of effective methods for diagnosing wind turbine bearing failures is a key safeguard for the safe and dependable operation of wind power facilities and the prevention of injuries to personnel on site ^[2].

The operating conditions of wind power equipment are typically complex. When wind turbine bearing faults occur, the collected signals often show non-linear and non-stationary features, including the useful components and various noise interference^[3-4]. In existing research, many scholars have used adaptive signal decomposition techniques to analyze vibration signals acquired from bearings.The objective is to eliminate superfluous components while extracting pertinent fault feature information ^[5]. Empirical Mode Decomposition (EMD) is a prominent signal decomposition algorithm. However, during the decomposition process, EMD is prone to endpoint effects and mode mixing issues^[6]. To address these challenges, the Ensemble Empirical Mode Decomposition (EEMD) and its enhanced edition (i.e., Complementary Ensemble Empirical Mode Decomposition (CEEMD) ) are presented for suppressing the mode mixing phenomena through the introduction of white noise into the original signal. The EEMD method facilitates better decomposition of non-stationary signals, which can enhance the accuracy and reliability of signal decomposition. However, the introduction of variables during the addition of white noise lacks adaptability^[7-8]. Intrinsic Time-scale Decomposition (ITD) has good adaptability by utilizing a linear transformation approach to extract baseline signals. Nevertheless, during the decomposition process of ITD, the obtained components are susceptible to the spiky phenomenon, leading to distortions in instantaneous amplitude and frequency^[9]. Empirical Wavelet Transform (EWT) stands out as a multi-resolution signal analysis approach that can decompose the input signal into several sub-band signals and provide a signal reconstruction. However, EWT is prone to segmenting many invalid components and has the disadvantages of long computation time and mode aliasing phenomenon. The Variational Mode Decomposition (VMD) represents a contemporary approach to signal decomposition, which has witnessed significant utilization within the realm of bearing fault diagnosis. Unfortunately, although VMD has strong separation capabilities for non-stationary signals, two key parameters (i.e., number of modes and penalty factor) affect its decomposition performance. Besides, adjustments to these parameters often require empirical or trial-and-error methods^[10-11].

SGMD (Symplectic Geometry Mode Decomposition) is an inventive adaptive signal analysis approach compared with the traditional decomposition method (i.e., EMD, ITD and EEMD) . SGMD offers advantages such as preserving the inherent characteristics of the original time series, suppressing mode mixing, and demonstrating robustness against noise. As such, it has found numerous successful applications within the realm of bearing fault diagnosis^[12-13]. Wang et al.^[14] presented an inventive approach for gear fault diagnosis utilizing SGMD and Autogram. The superiority of this method was validated through simulation and gear fault data. Guo et al.^[15] introduced an adjustable SGMD approach for weak feature extraction and composite fault detection using periodic kurtosis entropy. Its effectiveness in fault diagnosis was corroborated through numerical simulations and experimental verification. Liu et al.^[16] presented a novel approach for rolling bearing fault diagnosis, utilizing Partially Reconstructed Symplectic Geometry Mode Decomposition (PRSGMD) , enhancing the robustness and efficacy of bearing fault diagnosis. Yan et al. ^[17] presented a smart fault diagnosis approach predicated upon SGMD, Improved Multiscale Symbolic Dynamic Entropy (IMSDE) , and Multiclass Relevance Vector Machine (MRVM) . Empirical findings showcased the elevated precision in identification afforded by this approach. However, in the aforementioned studies, the embedding dimension for SGMD is mostly determined to construct the trajectory matrix via manual experience or the power spectral density method. The manual determination of embedding dimension will introduce many uncertainties into the decomposition results of SGMD, thereby leading to some issues (e.g., the over-decomposition or under-decomposition) . Additionally, it easily leads to larger dimensions, the added decomposition components and the longer execution times via employing the power spectral density approach to determine the embedding dimension of SGMD. That said, when this method is used to select the embedding dimension, the curse of dimensionality and a lot of useless components are easily generated in the decomposition process of SGMD, thereby affecting the decomposition performance and operation efficiency of SGMD. Considering that the relative entropy can effectively calculate the discrepancy between probability distributions of two signals^[18]. Therefore, to enhance the decomposition efficacy of conventional SGMD and overcome the shortcomings of manual selection of embedding dimension of traditional SGMD, this study presents a POSGMD for adaptive signal decomposition and obtaining several SGC, where the relative entropy is employed to automatically ascertain the optimal embedding dimension of SGMD. Meanwhile, considering that the contribution degree of fault information of each SGC is different, the kurtosis-weighted principle is further introduced for signal reconstruction, which is aimed at more comprehensively utilizing fault information of each SGC and reducing the influence of the latent noise interference to a certain extent.

After signal decomposition and reconstruction, many scholars used some classification models to extract fault features from the processed signals and intelligently identify fault types. The used common classification models comprise Support Vector Machine (SVM) , and Back Propagation Neural Network (BPNN) , among others. However, these classification models belong to shallow models, and their recognition performance will sharply decline when faced with an increase in data volume. To deal with this problem, Deep Learning (DL) technology has been proposed and rapidly developed by other scholars and has been effectively implemented in bearing anomaly diagnosis^[19-20]. Particularly, compared with other DL models (e.g., Deep Belief Network (DBN) and autoencoder) , CNN is deemed to be a more representative and prominent DL algorithm distinguished by its superior feature extraction capabilities, and has garnered significant achievements in fault diagnosis^[21-22]. Cheng et al.^[23] suggested a sophisticated fault detection technique for rotating equipment utilizing a continuous wavelet transformation local binary CNN framework. Experimental results demonstrate its superiority in bearing faults and compound gearbox fault diagnosis. Xu et al.^[24] proposed to enhance the InceptionNet by combining Intrinsic Feature Extraction (IFE) and a Convolutional Block Attention Module (CBAM) . Their method offers a comprehensive approach to signal processing, culminating in the fault diagnosis process. Hu et al.^[25] presented a fault diagnosis approach predicated on CNN. This approach addresses the complexities of signal processing and professional expertise in traditional fault diagnosis methods. He et al. ^[26] presented a fault diagnosis approach for flywheel energy storage system bearings utilizing parameter-optimized VMD and energy entropy.

The data imbalance in actual industrial scenarios constrains the application of deep learning fault diagnosis methods. To address this issue, many researchers have explored integrating dynamic model responses into training processes. Feng et al.^[27] proposed a digital twin-driven intelligent health management method to monitor and assess gear surface degradation progression, enabling accurate Remaining Useful Life (RUL) prediction and predictive maintenance decision-making. Ming et al.^[28] introduced a digital twin-assisted framework to enhance rolling bearing fault diagnosis under imbalanced data by minimizing discrepancies between dynamic model responses and real measured data. Ni et al.^[29] presented a novel Physics-Informed Residual Network (PIResNet) , which integrates physical modal properties and a domain-conversion mechanism, the method demonstrates superior fault diagnosis performance under variable operating speeds and loads. Li et al.^[30] introduced the FCZD-IA framework, a zero-sample diagnostic method using infrared thermography and acoustic data to enhance gearbox fault detection. It combines a neuro-fuzzy system and deep learning for robust, interpretable diagnostics, showing superior performance in experiments. Nevertheless, in the real environments with high levels of noise, traditional CNN model is prone to the gradient vanishing and over-fitting issues, thereby affecting its robustness and generalization capability. To address this concern, by replacing the original convolutional layers of traditional CNN with the convolutional blocks and residual blocks, this study proposes an ICNN for fault feature extraction and automatic recognition of bearing fault types.

Existing studies on fault diagnosis for wind turbine bearings have achieved notable progress, particularly with the integration of deep learning techniques. However, several research gaps remain to be addressed. Firstly, many conventional methods exhibit limited robustness to strong noise interference, leading to reduced fault identification accuracy in practical scenarios. Secondly, the model complexity of some deep learning-based approaches is high, which increases computational cost and limits their applicability in real-time industrial environments. Thirdly, many algorithms rely heavily on manually selected parameters, such as embedding dimensions or filter parameters, which may introduce subjectivity and reduce repeatability. In addition， the emerging digital twin technology has shown great potential in fault diagnosis by bridging physical systems and virtual models. However, its application faces challenges such as high computational costs, reliance on domain knowledge for accurate modeling, and the need for substantial high-quality data.

In summary, to enhance the fault identification accuracy of wind turbine bearings under a strong noise scenario, this paper presents an improved bearing fault diagnosis method derived from POSGMD and ICNN. Firstly, POSGMD is introduced to adaptively decompose the collected original bearing vibration signal into various SGC, where the embedding dimension is automatically determined based on the relative entropy. Meanwhile, the signal reconstruction is performed based on kurtosis-weighted criteria to obtain the reconstructed signals. Subsequently, the CWT of the obtained reconstructed signals is calculated to generate the corresponding time-frequency images and to execute dataset partitioning. Finally, the proposed ICNN is introduced for model training and automatic recognition of bearing faults. The efficiency and ascendancy of the presented approach are validated through the analysis of two experimental cases. The primary contributions of this study are encapsulated below:

1) By introducing the relative entropy and kurtosis-weighted principle into the original SGMD, a POSGMD is presented for adaptive signal decomposition and signal reconstruction, which can both avoid the problem of manual selection of the embedding dimension of SGMD and mitigate the impact of strong noises to a certain degree.

2) By replacing the original convolutional layers of CNN with the convolutional blocks and residual blocks, an ICNN is introduced for automatic feature extraction and fault identification, which can both mitigate the over-fitting and address the gradient vanishing issue encountered during the training process of traditional CNN.

3) A cutting-edge bearing fault diagnostic methodology predicated on POSGMD and ICNN is proposed, which can enhance the fault diagnosis precision of bearings under noisy scenarios.

The ensuing sections of this paper are scheduled as follows. Section 1 displays the relevant theory of POSGMD. Section 2 elaborates on the related theory about ICNN. Section 3 presents an inventive bearing fault diagnosis approach predicated on POSGMD and ICNN and introduces its realization process. Section 4 substantiates the efficacy and supremacy of the presented approach through two sets of experiments. Finally, Section 5 concludes and discusses potential directions for further research.

1 POSGMD

1.1 SGMD

SGMD is a phase space geometric analysis method that employs symplectic geometry similarity transformations to compute the eigenvalues of Hamiltonian matrix. It reconstructs SGCs utilizing the corresponding eigenvectors, facilitating the adaptive decomposition of non-stationary signals. Throughout the decomposition process, SGMD preserves the invariance of the original temporal sequence, demonstrating robust decomposition capabilities^{[31, 32]}.The specific procedure of SGMD is outlined as follows:

1) Phase-space reconstruction. Assuming an input signal sequence as x = x₁, x₂, ..., x_n, where n represents the length of the signal data. Based on takens embedding theory, we employ the method of delay to process one-dimensional signals. By doing so, it becomes possible to reconstruct multidimensional signals, resulting in the generation of a trajectory matrix X .

X = [\begin{matrix} x_{1} & \dots & x_{1} + (d - 1) τ \\ \dots & \dots & \dots \\ x_{m} & \dots & x_{m} + (d - 1) τ \end{matrix}]

(1)

where d denotes the dimensionality of the embedding, τ stands for the delay time, and m = n- (d-1) τ. To ensure the rationality of reconstructing the matrix X, d is generally selected based on the Power Spectral Density (PSD) of the input signal x.

2) Symplectic geometric matrix transformation. The covariance matrix A is created as A = X ^T X, which results in the Hamiltonian matrix M .

Then, if F = M², the matrix F also becomes a Hamiltonian matrix, which can be acquired by using the symplectic orthogonal matrix Q .

Q^{T} F Q = [\begin{matrix} B & R \\ 0 & B^{T} \end{matrix}]

(2)

where

B

is an upper triangular matrix with elements

b_{i j} = 0 (i > j + 1)

． By computation，the eigenvalues of

B

are

λ_{1}, λ_{2}, \dots, λ_{d} .

Hence， the eigenvalues of A are represented as

σ_{i} = \sqrt{λ_{i}}

． Correspondingly， the associated eigenvectors are

Q_{i} . Let S = Q^{T} X, Z = Q S .

Derived from a sequence of initial single⁃component matrices

Z_{i},

the reconstructed trajectory matrix Z is meticulously crafted， such that

Z = Z_{1} + Z_{2} + \dots + Z_{d} .

Here，

Z_{i} = Q_{i} S_{i}

and

S_{i} = Q_{i}^{T} X^{T} .

Finally， the diagonal average results of d component signals are expressed as

Y = Y_{1} + Y_{2} + \dots + Y_{d} .

1.2 POSGMD

SGMD is an adaptive signal analysis algorithm． However， the embedding dimension of traditional SGMD is selected by using manual experience or the power spectral density method of the input signal，which exerts a substantial influence on its decomposition performance． Specifically， if the embedding dimension is set too large， too many unimportant components may be obtained，resulting in the over⁃decomposition problem and a lengthy processing time． On the contrary， if the embedding dimension is set too small， some useful components might be lost， resulting in the under⁃decomposition problem． Henceforth， in order to deal with thisproblem， this study suggests a POSGMD by using the relative entropy to automatically determine the embedding dimension in the decomposition process． Different from traditional SGMD， the proposed POSGMD includes both adaptive signal decomposition and signal reconstruction processes， which are aimed at reducing the interference of strong noises in the original signal． Fig．1 illustrates the flowchart of the parameter⁃optimized SGMD．

Fig.1The flowchart of POSGMD method

The precise steps involved in POSGMD are outlined below：

1） Load the collected bearing vibration signals and initialize the range of embedding dimension d of SGMD， and d is empirically chosen within the range of 2 to 10． In this step， the decomposition efficacy and computational efficiency are thoroughly evaluated in a holistic manner．

2） SGMD is adopted to disassemble the initial signal into a sequence of SGCs．

3） Calculate the relative entropy of each SGC obtained from SGMD and select the smallest relative entropy from them as the local minimum relative entropy． In this step， the relative entropy of each SGC obtained by SGMD can be calculated by the following equation ^[33]：

K_{L} (P | | Q) = \sum P (y_{i}) l o g \frac{P (y_{i})}{Q (y_{i})}

(3)

where

K_{L}

is the relative entropy， which is the difference in information entropy between two probability distributions，

y_{i}

is the i⁃th SGC，

P (y_{i})

and

Q (y_{i})

represent two probability distributions of the i⁃th SGC． In this step， the bigger relative entropy implies a greater difference between the decomposed component and the actual component （i．e．， worse decomposition results）， whereas a smaller relative entropy means a greater similarity between the decomposed component and the actual component （i．e．， better decomposition performance）． Therefore， the optimal embedding dimension of SGMD is adaptively determined based on the minimum relative entropy．

4）Ascertain whether the iteration condition is met． In particular， if the largest embedding dimension is achieved， stop the iterative process， select the smallest value from all local minimum relative entropies as the global minimum relative entropy， and output the embedding dimension corresponding to the global minimum relative entropy as the optimal embedding dimension of SGMD． Otherwise， set d ＝ d＋1， go back to Step 1） and carry out the iterative procedure again until the stop condition is met．

5） SGMD with the optimal embedding dimension d₀ is used to decompose the signals into various SGCs．

6） The kurtosis⁃weighted principle is introduced for adaptive signal reconstruction． Specifically， in this step， the correlation kurtosis of each SGC is firstly calculated by the following equation：

K_{c k} (T_{s}) = \frac{\sum_{n = 1}^{N} \prod_{m = 0}^{M} y_{n - m T_{s}}}{{(\sum_{n = 1}^{N} y_{n}^{2})}^{M + 1}}

(4)

where

K_{c k}

is the correlation kurtosis， N is the number of samples in the input signal， n is the n⁃th SGC， T_s is the deconvolution period and M denotes the displacement factor． Then， the kurtosis⁃weighted sum of all SGC is conducted to acquire the final reconstructed signal， which can be represented by

x_{r e} = \sum_{i = 1}^{d_{o}} γ_{i} S_{{S G C}_{i}} = \sum_{i = 1}^{d_{o}} \frac{K_{c k i}}{\sum_{i = 1}^{d_{o}} K_{c k i}} S_{{S G C}_{i}}

(5)

where

x_{r e} (t)

is the reconstructed signal，

d_{o}

is the number of SGCs，

γ_{i}

is the weighting coefficient of the i⁃th SGC，

K_{c k i}

indicates the correlation kurtosis of the i⁃th SGC， and

S_{S G C i}

means the i⁃th SGC．

2 ICNN

2.1 Traditional CNN

CNN is an effective deep learning model and can automatically extract fault characteristics from the raw signal， which has been used in several tasks （ e． g．，image classification， object detection and fault diagnosis）． CNN is mainly comprised of convolutional layers， max pooling layers， and fully connected layers． Firstly， the original signal is input into the convolutional layers with batch normalization and using Rectified Linear Unit （ReLU） as the activation function， where the convolutional operations with different sizes of kernels are used to extract local features from the source signal． Then， the features obtained by convolutional layers are fed into max pooling layers， where downsample operations are performed to screen some important features and reduce the parameter quantity． Finally， the fully connected layers are combined with softmax to classify the extracted features from the previous layer．

2.2 ICNN

When the local faults occur in rolling bearings， the fault⁃related information embedded in bearing vibration signals often gets submerged within high levels of noise． When traditional CNN is used to handle bearing vibration signals in high⁃noise scenarios， their feature learning capability tends to decline． Simultaneously， traditional CNN is prone to over⁃fitting and gradient vanishing issues when dealing with noisy signals， thereby easily affecting their generalization ability． To address the shortcomings of traditional CNN and ameliorate fault identification precision under a noisy scenario， this research presents an ICNN model． Fig．2 depicts the architecture of the presented ICNN， where its two pivotal blocks （ i． e．， the convolution block and residual block） are displayed in Fig．3．

Fig.2Architecture of ICNN model

Fig.3Convolution block and residual block

The following is a description of the specifics of the proposed ICNN：

1） Firstly， the input signal is fed into a convolutional layer with a3×3 kernel size for conducting the initial feature extraction． Meanwhile， after being batch⁃normalized， the collected features are passed into an activation function named Leaky ReLU to enhance the non⁃linear expressive capability of the network model．

2） Subsequently， the features extracted from the first convolutional layer are passed into two residual blocks and two convolutional⁃residual block units for enhancing feature learning and alleviating the vanishing gradient issue， where each convolutional⁃ residual block unit includes a convolutional block （as highlighted in blue in Fig．2） and a residual block （as highlighted in green in Fig．2）． Specifically， as shown in Fig．3， each residual block has two branches， with two convolutional layers on one branch， two batch normalization layers， and one Leaky ReLU activation layer， whereas another branch （ the residual branch） involves a skip connection． Different from the residual block， the residual branch of each convolutional block is not directly connected， but rather connected by a convolutional layer with a kernel size of 1×1． After deep feature learning，a global average pooling layer is used to further decrease the feature parameters and improve computing efficiency． Additionally， to mitigate the over⁃fitting problem， a dropout layer is incorporated to discard the outputs from a portion of neurons．

3） Finally， the extracted feature maps from the global average pooling layer are transformed into the final classification output by using the fully connected layer． Meanwhile， Softmax function is employed to convert each fault category into its corresponding probability， thereby determining the class label based on probability distribution and achieving fault identification．

In the proposed ICNN， the Leaky ReLU represents an enhanced iteration of the ReLU activation function， which can not only address the gradient vanishing problem associated with the sigmoid activation function but also mitigate the problem of the deactivation of certain neurons caused by zeroing out the input in ReLU． Therefore， this study selects the Leaky ReLU as the activation function of the ICNN， which can be defined as stated below：

h^{(i)} = m a x (w_{i} x, 0) = \{\begin{matrix} w_{i} x, w_{i} x > 0 \\ a w_{i} x, e l s e \end{matrix}

(6)

where a is set as 0．2， which is used to ensure a slightoutput when the input is less than 0．

In the training process of the presented ICNN， the difference between the actual labels and projected outputs is measured using the cross⁃entropy loss function． Besides， the Adam optimizer is utilized to renew the parameters of the network model， thereby minimizing the loss function and enhancing the learning efficacy of the network． Compared with the traditional CNN， the proposed ICNN model， with enhanced convolutional blocks and residual blocks， effectively mitigates the gradient vanishing issue during training． Additionally， in the proposed ICNN， the inclusion of Batch Normalization （BN） layers and dropout layers will contribute to enhancing the model􀆳 s anti⁃noise capacity and preventing the network􀆳 s over⁃fitting．

3 POSGMD and ICNN⁃Based Rolling Bearing Fault Diagnostic Method

This study suggests a novel rolling bearing fault diagnostic technique based on POSGMD and ICNN to increase fault detection accuracy in noisy scenarios， which mainly involves three procedures （i．e．， vibration signal acquisition， signal decomposition and reconstruction， and bearing intelligent fault diagnosis）． Fig．4 depicts the flowchart of the presented fault diagnosis approach．

The following are the exact phases of the presented fault diagnostic approach：

1） Vibration signal acquisition．In this step， one accelerometer is installed on the bearing fault simulation test bench， and a data acquisition card is utilized to gather vibration signals from bearings under various fault situations．

2） Signal decomposition and reconstruction． Firstly， POSGMD is initially adopted to decompose the gathered original bearing vibration signal into a string of SGC， where the optimal embedding dimension is adaptively determined with the help of the relative entropy． Subsequently， based on kurtosis⁃weighted criterion， the weighted sum of the obtained SGC are conducted to obtain the reconstructed signal． Finally， the CWT of the reconstructed signals is computed to generate the corresponding time⁃frequency images． Meanwhile， the received images are proportionally divided into the sample set （ i． e．， the training set， validation set and testing set） at a proportion of 7 ∶ 1 ∶ 2 by using random segmentation technique．

Fig.4Flowchart of the proposed bearing fault diagnosis method

3） Bearing intelligent fault diagnosis． In this step， the training and validation sets are fed into the presented ICNN for network model training and parameter tuning， whereas the testing set is fed into the well⁃trained ICNN model for model testing， thereby automatically outputting the results．

4 Experimental Verification

In this segment， two experimental cases were undertaken to verify the superiority and applicability of the presented approach for bearing fault identification under noisy conditions． The experiments were conducted on a PC platform running a version of Windows 10， equipped with an Intel

{Core}^{T M}

i5⁃8300H CPU@ 2．30GHz，16GB of memory． The utilized software for the experiments is Matlab2020b version．

4.1 Case 1:Bearing Fault Diagnosis of CWRU Data

4.1.1 Introduction to the benchmark dataset

The Case Western Reserve University （CWRU） bearing data center dataset was used to verify theefficacy of the presented approach． Fig． 5 illustrates the schematic diagram of the experimental setup， which primarily comprises a motor， test bearings， and power measurement instruments． In this experiment， in order to simulate bearing local faults， the Electric Discharge Machining （EDM） was employed to create three different diameters of faults （i．e．， 0．007 inches， 0．014 inches and 0．021 inches） on the surfaces of the Inner Race （IR）， Outer Race （OR）， and Ball （B） of normal bearings． In the data collection process， the motor speed was set to 1797 r/ min， with a sampling frequency of 48 kHz， and one accelerometer was mounted on the drive end of the test bearing （model： 6205⁃2RS JEM SKF） to collect the bearing vibration signals． Altogether 10 different operational conditions of bearing vibration signals were gathered， comprising9 different fault states and one normal state． Table1 provides detailed information on the sample dataset used to test bearings． Additionally， the collected data samples are randomly partitioned into training， validation， and testing sets at a ratio of 7 ∶ 1 ∶ 2． Thetime⁃domain waveform of a single sample under ten various operational scenarios is presented in Fig． 6． It can be observed from Fig． 6， as a result of the interference of noises， there is a certain similaritybetween different signal waveforms， making it challenging to accurately determine the type of bearing defects by direct signal waveform observation．

Fig.5Bearing fault simulation experimental device and its structure diagram

Table1Sample information of bearing vibration data in Case1

Fig.6Time⁃domain waveform of one sample under different bearing working states in Case1

4.1.2 Fault diagnosis results of the proposed method

To effectively distinguish bearing fault categories， the proposed method is applied to analyze the dataset from CWRU． In accordance with the flowchart in Fig．4， POSGMD is initially employed to disassemble the initial bearing vibration signal into multiple SGCs， where the embedding dimension is automatically chosen using relative entropy． Meanwhile， based on the kurtosis⁃weighted principle， signal reconstruction is conducted to acquire the reconstructed signals． Secondly， CWT of the reconstructed signals is calculated to acquire the corresponding time⁃frequency images as a sample set． Finally， the proposed ICNN is introduced for autonomous feature extraction and fault identification． Table2 lists the model parameters of the proposed ICNN． Fig．7 illustrates the network training curve and loss curve of the proposed ICNN．As shown in Fig．7， when the training iterations are larger than 100， the training accuracy can reach a stable value of 100％， whereas the training loss steadily decreases to zero． This indicates that the proposed ICNN model is well⁃ trained． Fig．8 presents the confusion matrix acquired by the proposed approach． As displayed in Fig． 8， none of the samples are misidentified， and the suggested approach can get 100％ identification accuracy． This outcome provides preliminary evidence for the proposed bearing fault diagnosis method􀆳 s efficacy．

Table2Model parameters of the proposed ICNN

Fig.7The training accuracy curve and loss curve of the proposed method in Case1

Fig.8Confusion matrix obtained by the proposed method in Case1

4.1.3 Analysis of ablation experiments

To demonstrate the efficacy of POSGMD employed in the presented approach， we executed the comparisons between the proposed POSGMD and four similar methods （ i． e．， SGMD， ITD， EWT and CEEMD）． In this comparison， all decomposition methods are used for processing the same experimental data． To ensure that these comparisons are equitable， in all signal decomposition methods， the kurtosis⁃ weighted principle is also conducted for signal reconstruction． Besides， due to mechanical equipment in actual engineering often operates at noisy environments，the Gaussian white noises with varyingSignal⁃to⁃Noise Ratio （ SNR） are artificially inserted into the experimental data for simulating the noisy data under different levels of noise， and then all methods are adopted to analyze these noisy data and identify bearing fault types under different levels of noise． Particularly， Gaussian white noises with SNR＝－6dB－ 6dB are inserted into the original signal， and the SNR can be computed by

S N R = 10 l o g (\frac{P_{signal}}{P_{noise}})

(7)

where

P_{signal}

and

P_{noise}

stand for the average power of the actual bearing vibration signal and the Gaussian white noise， respectively．

Fig.9 displays the fault diagnosis results acquiredby different decomposition approaches． It is apparent that the identification accuracy of different decomposition approaches will increase with the increase of SNR， as shown in Fig．9． Nevertheless， the proposed POSGMD can reach better recognition results than other decomposition approaches in scenarios with or without noise addition． Specifically， the discernment precision of the proposed POSGMD is evidently superior to that of alternative decomposition methodologies， particularly for a low SNR ＝－ 6dB． Therefore， the comparison results mean that the proposed POSGMD is adopted for signal decomposition and reconstruction， which is more reasonable and effective for bearing fault diagnosis．

Fig.9Fault identification accuracy of different decomposition methods in Case1

To demonstrate the feasibility and efficacy of the ICNN utilized in the proposed approach， we also conducted comparisons between the presented ICNN and four standard classification models （ i． e．， CNN， LeNet5， AlexNet and ResNet）． Concretely， in this comparison， all classification models are combined with POSGMD to extract fault features from the same experimental data and achieve fault classification． Namely， except for the classification model， the proposed procedure and the other processes are the same． The results of several classification models􀆳 fault identification at various noise levels are displayed in Fig．10． Similarly， as presented in Fig．10， when noises or no noises are inserted into the original signal， the identification accuracy of the presentedICNN model is all greater than that of other classification models． Particularly， when SNR is set as －6 dB， there is a greater difference in the identification precision between the presented ICNN and other classification models． Hence， the comparison results demonstrated the advantage of the presented ICNN model in identifying different types of bearing faults． In order to verify the advantages of the model used in this article in terms of time consumption and complexity， ICNN was compared with several typical deep learning models． Table3 shows that ICNN has certain advantages in training time， significantly shorter than other methods， and has a much smaller total number of model parameters than CNN， LeNet5， and Alexnet．

Fig.10Fault identification accuracy of different classification models in Case1

Table3Comparison of time consumption and complexity

4.1.4 Comparative analysis with other representative fault diagnosis methods

To verify the efficacy and advantage of the presented approach in bearing fault identification， several typical fault diagnosis approaches （ i． e．， CEEMD⁃SVM， CNN⁃LSTM， CWT⁃CNN， WDCNN， DACNN and MCMS⁃CNN） are selected for conducting comparative analysis． Table4 lists the specific details of six comparison approaches． Besides， the model parameters of these comparison methods are the sameas those in corresponding literature in Table4． Similarly， all fault diagnosis methods are used for analyzing the same experimental data． Fig． 11 displays the identification results of different fault diagnosis methods under various levels of noise． As evidenced by Fig．11， as the noise intensity increases， the fault discernment precision of the presented approach and other fault diagnosis approaches has a downward trend． However， whether the noiseless addition or noise addition condition， the presented approach can all achieve superior identification precision than other approaches． Even at an SNR of －6 dB，the proposed method can also accomplish the recognition accuracy of 95％． Compared with other methods， the accuracy rate is at least 10％ higher． Therefore， it can be inferred from this comparative analysis that the proposed method exhibits higher robustness to noise interference compared with some emblematic approaches for identifying bearing fault types under strong noise conditions．

Table4Detailed description of several comparison methods

Fig.11Comparative analysis results of different fault diagnosis methods in Case1

4.2 Case2： Bearing Fault Diagnosis of Laboratory Data

4.2.1 Introduction to the experimental dataset

Bearing fault data utilized in this experiment are collected from the ABLT⁃1A experimental equipment at Southeast University （SEU）， which is comprised of a computer control system， test head， lubricating system， conveyance system， loading system， motor control system， and data collection system， as depicted in Fig． 12． The motor control system is used to adjust the motor speed， whereas the loading system is adopted to apply the force on testing bearing． The type of testing bearing is HRB6205． In this experiment， the collected entire dataset consists of seven bearing working states （ i． e．， normal， Outer Race Fault （ORF）， Inner Race Fault （ IRF）， Ball Fault （BF）， Outer⁃Inner race compound Fault （OIF）， Outer race and Ball compound Fault （ OBF）， and Outer⁃Inner race⁃Ball compound Fault （ OIBF）． These bearing faults have a1 mm width， which are induced by applying Electrical Discharge Machining （ EDM）． Indata collection， two PCB⁃based accelerometers （Sensor⁃1 and Sensor⁃2） are first installed on the vertical direction of the bearing housing， and then the NI9234 data acquisition card is connected with the accelerometer to record the bearing vibration signal． The motor load is set as 5．1 kN， and the rotating speed and sampling frequency are designated as 1050 r/min and 12 kHz， respectively． Table5 presents the sample information of bearing vibration data． Altogether 700 samples are collected， which are randomly segmented into training， validation， and testing sets at a proportion of 7 ∶ 1 ∶ 2． The time⁃domain waveform of a single sample under different bearing operating conditions is shown in Fig．13． As shown in Fig．13， owing to the noise interference and the similarity of time⁃domain waveform under different bearing operating states， by looking at time⁃domain waveforms directly， bearing fault types are hard to discern． Therefore， it is imperative to embrace an efficacious approach for extracting the useful feature information and achieving automatic identification of bearing faults．

Fig.12ABLT⁃1A experimental equipment

Table5Sample information of bearing vibration data in Case2

Fig.13Time⁃domain waveform of one sample under different bearing working states in Case2

4.2.2 Fault diagnosis results of the proposed method

To showcase the availability of the presented method， the method is utilized to analyze the bearing dataset from Case2． In this case， except for the parameters set to kernel number （ 7） in the fully connected layer， other model parameters of the presentedICNN are identical to those of Case1． Fig．14 shows the network training curve and loss curve of the proposed ICNN． As shown in Fig．14， the presented ICNN can achieve a training precision of 100％ and a loss value close to zero when the training iterations are bigger than 100． Fig．15 plots the confusion matrix acquired by the presented approach． As presented in Fig．15， the identification precision of the presented method is 100％ for each bearing fault category， which shows that various bearing faults may be properly identified using the proposed method． To rephrase， the presented method􀆳 s efficacy are preliminarily verified．

4.2.3 Analysis of ablation experiments

Equal to Case1， to display the validity and reasonability of POSGMD adopted in the proposed approach， the performance comparisons between the presented POSGMD with four similar decomposition approaches （i．e．， SGMD， ITD， EWT and CEEMD） are also conducted under different noise levels． Fig．16 shows fault identification results of various signal decomposition approaches． It is evident from Fig．16 that the identification accuracy of all decomposition methods will decrease as the SNR decreases． When SNR is bigger than 2 dB，for the presented POSGMD and other decomposition approaches， there is no significant difference in accuracy． However， when SNR is less than －2 dB， the recognition precision of the proposed POSGMD far exceeds that of alternative decomposition approaches． Even at －6 dB， the proposed method can additionally attain a recognition precision of 95％． Therefore， these comparison results illustrate the advantages of the presented the POSGMD over decomposition approaches and the reasonability of using POSGMD method for signal processing in this study．

Fig.14The training accuracy curve and loss curve of the proposed method in Case2

Fig.15Confusion matrix obtained by the proposed method in Case 2

Equally， to present the efficacy and feasibility of ICNN applied in the presented method， the performancecomparisons between the ICNN with four standard classification approaches （ i． e．， CNN， LeNet5， AlexNet and ResNet） are also conducted under different noise levels． Fig．17 plots the fault identification results of various classification models． As depicted in Fig．17， the proposed ICNN model has high accuracy under different SNRs． Particularly， when SNR is less than － 2 dB， the presented ICNN can still achieve the identification precision of 94％ above， which is evidently superior to other classification approaches． Therefore， these comparative results further substantiate the efficacy and advantage of the presented ICNN for bearing fault identification under noisy conditions．

Fig.16Fault identification accuracy of different decomposition methods in Case2

Fig.17Fault identification accuracy of different classification models in Case2

4.2.4 Comparative analysis with other similar fault diagnosis methods

Equal to Case1， to display the availability and advantage of the presented approach in bearing faul identification， the presented approach is compared with several other typical fault diagnosis approaches （ i． e．， CEEMD⁃SVM， CNN⁃LSTM， CWT⁃CNN， WDCNN， DACNN and MCMS⁃CNN）． Fig．18displays the fault identification results of various fault diagnosis methods． As illustrated in Fig．18， when SNR is larger than －2 dB， the identification precision of the presented approach is relatively close to that of CWT⁃CNN． However， when SNR is lower than －2 dB， the presented approach has a greater identification precision， particularly at SNR ＝－6 dB． Besides， whether under the noiseless addition or noise addition condition （ e． g．，SNR rangs from －6 dB to 6 dB）， compared with other fault diagnostic approaches， the suggested method􀆳 s accuracy in fault detection appears to be higher． Consequently， from this comparative result， we can additionally infer that the presented approach behaves better for bearing fault identification than other fault diagnosis approaches under strong noise conditions．

4.3 Future Research Discussion

Through the aforementioned comparative analysis of various fault diagnosis methods， this study demonstrates the efficacy and advantages of the presented approach for bearing fault diagnosis． However， there are still some areas worth further research in the presented method， it can be summed up as stated below：

First and foremost， although the proposed POSGMD can be suitable for processing one⁃dimensional non⁃ stationary signals， it is currently unable to synchronously process multi⁃channel signals． Therefore， in our future work， illuminated by the idea of multi⁃channel signal analysis in Multivariate Empirical Mode Decomposition（ MEMD）， POSGMD will be further extended to the field of multi⁃channel signal analysis．

Fig.18Comparative analysis results of different fault diagnosis methods in Case2

Secondly， although the presented ICNN can efficiently extract fault feature information in noisy environments and accomplish fault discrimination， its model parameter settings require manual experience． Consequently， in our upcoming work， to avoid the manual selection of model parameters in the presented ICNN， some recently advanced optimizers （ e． g．， Dung Beetle Optimizer （DBO）， Pelican Optimization Algorithm （ POA）， Rat Swarm Optimizer （ RSO） and Nutcracker Optimizer Algorithm （NOA）） will be explored to automatically determine the ICNN model parameters， to enhance the model􀆳 s feature learning performance even more．

Finally， although the presented fault diagnosis method has been successfully employed for bearingfault identification at constant operating speed， its diagnostic capability is still unknown under variable speed conditions． Consequently， based on the improvement of existing research and experimental conditions， by continuously improving our designed network model architecture and the entire implementation process of the proposed algorithms， our future efforts will be directed towards bearing fault identification under varying operating conditions．

5 Conclusions

This paper presents an improved wind turbine bearing fault diagnosis method predicated on POSGMD and an ICNN， which can strengthen thefault identification capability of the wind turbine bearing under strong noise scenarios． Within the proposed method， with the help of relative entropy⁃ based embedding dimension selection， POSGMD is initially presented to adaptively decompose the collected raw bearing vibration signals into several SGCs． Meanwhile， signal reconstruction is conducted to obtain the reconstructed signal based on kurtosis⁃ weighted criteria． Subsequently， the CWT of the reconstructed signal is computed to generate the corresponding time⁃frequency images． Finally， the acquired images are input into an ICNN for model training and automatic fault identification of rolling bearings． The efficacy of the presented approach is validated through two case studies． Empirical findings demonstrate that the suggested approach can attain superior identification precision under noisy conditions when juxtaposed with various exemplary fault diagnosis methodologies． The following encapsulates the primary innovations and contributions of this research：

1）By incorporating the relative entropy and kurtosis⁃weighted criteria into the original SGMD， this study proposes a POSGMD， which mitigates the over⁃decomposition or under⁃decomposition issues caused by the inappropriate selection of the embedding dimension in SGMD．

2）By replacing the convolutional layers of conventional CNNs with the designed convolutional block and residual block， an ICNN is proposed for robust feature learning and automatic fault identification， which can not only alleviate the vanishing issue during the training process of traditional CNNs but also prevent the over⁃fitting problem in the network．

3）A novel bearing fault diagnosis method predicated on POSGMD and ICNN is proposed， which can enhance the fault identification precision under strong noise interference．

In future research， we aim to enhance the proposed method by incorporating multi⁃sensor information fusion technology． Single⁃sensor systems may miss critical fault information， while multi⁃sensor systems can capture complementary information to provide a more comprehensive view of equipment health． Specifically， we plan to develop an effective preprocessing strategy for multi⁃sensor signals， leveraging techniques such as Principal Component Analysis （ PCA） to reduce high⁃dimensional multi⁃channel data to three dimensions． These reduced⁃ dimensional datasets will then be transformed into RGB image samples， facilitating input into advanced deep learning models． Furthermore， we intend to design lightweight and efficient deep network architectures to improve the accuracy of fault diagnosis while reducing computational costs， making the approach more suitable for real⁃time industrial applications． In addition， future work will test the enhanced method under real⁃world operating conditions with diverse and complex noise scenarios， ensuring robustness and generalizability． These advancements will contribute to the development of a more powerful， reliable， and practical fault diagnosis system for wind turbine bearings and similar industrial equipment．

Acknowledgments

The authors want to thank to CWRU and SEU for supplying laboratory data．

Fig.1The flowchart of POSGMD method

Download: Full size image