# The Real-Time Spectrum Analysis Algorithm for the Weak Signal under Strong Noise Background and its Implementation on FPGA Platforms

Xingxing Feng<sup>a</sup>, Bo Yin<sup>b</sup>

School of Optoelectronic Engineering, the Chongqing University of Posts and Telecommunications, Chongqing 400065, China

<sup>a</sup>lq3838328@163.com, <sup>b</sup>1161839398@qq.com

# Abstract

The traditional spectrum analyzer does not have the function of real-time spectrum detection and inadequate capacity of analyzing the weak signal under strong noise background, so a real-time spectrum analytical approach is presented and implemented on FPGA hardware platform in this paper. A double channel ADC time interleaved parallel sampling, signal power extraction, frequency template triggering, poly-phase filter, spectrum zooming of Chirp Z Transform and spectrum analysis are adopted in this method to achieve the real-time spectrum analysis of signal. The configuration clock frequency of FPGA can be calculated according to the system delay analysis and the input arguments. The simulation result proves that this method can not only overcome the problem of traditional methods, based on power trigger and level triggered sampling, which have weak analysis ability of weak signal under strong noise background, and can not achieve specific frequency interval analysis, but also overcome the defects of mutual restriction between time-frequency resolution. For weak signal, it has a frequency measurement error less than 0.6% and power error less than 4.5%, the maximum system time delay is less than 37 $\mu$ s. Except for 22.558 $\mu$ s initial time delay, this design has the property of real-time processing of continuous data.

# Keywords

Real-Time Spectrum Analysis, Poly-phase Filter, Chirp Z Transform, Weak Signal, System Delay.

## 1. introduction

In the field of industrial monitoring, the method about judging the running condition of the rotating electromechanical equipment and the real-time fault detection through the analysis of the power spectral characteristics of vibration signals and noise signals related to shaft rotation frequency in rotating equipment is a very effective[1]. Therefore, frequency spectrum analysis is an important task in most disciplines, and the main objective is power spectrum measurement. Traditional methods of spectrum analysis are usually adopting power triggering or level triggered triggering. Besides, FFT and its improved algorithm or spectral Zoom Spectrum analysis frequency band, and using the frequency thinning algorithm can obtain the local detailed features of the spectrum. Considering frequency spectrum characteristic of the weak signal under strong noise background, hardware complexity and real-time. In this paper, FFT algorithm is used to obtain the signal spectrum profile to implement frequency template triggering, and then use the poly-phase filter to obtain analysis frequency band signal reconstruction, implement the spectrum refinement which is independent of time resolution through windowed CZT, and solve the side-lobe spectrum analysis error and noise error by using the cumulative mean and the spectrum analysis, and finally implement it on FPGA Platforms.

### 2. System overall design



Fig.1 Global View of Spectrum Analyzer

A global view of the design is shown in Fig.1. After analog down-conversion and ADC sampling, the RF signal is sent to FPGA to implement channel mismatch correction, DDC, real-time trigger control, memory control and baseband digital signal processing. In the real-time trigger module, FPGA firstly uses FFT to make the time-frequency conversion of the input signal, and then uses the data preprocessing module to extract the signal power, and then triggers the decision and timing. According to the trigger timing control of the real-time trigger module, the memory control module sends the captured valid data from the memory to the baseband digital signal processing module for real-time spectrum analysis. Baseband digital signal processing module uses poly-phase filter banks to realize user selection of frequency sub-band filtering and the simulation of long observation window to improve the frequency resolution. The appropriate window is used to ensure the frequency resolution and dynamic range requirements, by using CZT obtains the spectrum refinement, which is independent of time resolution, and the local spectrum zoom. Threshold analysis module can analyze the output spectrum of CZT and display the number of signals contained in the spectrum, center frequency of each signal and the signal description word(SDW) in the table form.

# 3. Analysis and implementation of key modules

### **3.1** Frequency trigger template principle and FPGA implementation



Fig. 2 structure chart of spectrum trigger template

The unit of frequency trigger module is data frame, the data through the digital down-conversion is divided into two parts. One part of data after FFT transform is used to extract the power of the square of the output complex spectrum modulo for comparison with trigger templates, if the power of a point signal, which is through the FFT transform, in power spectrum is larger than the corresponding power threshold of the frequency template, it will produce trigger. Another way the data is seamless captured directly, stored in DDR2 and wait for the trigger to generate. When the trigger occurs, by using the trigger control and timing module, manage the trigger memory addresses, and control instructions about the data in memory to write, freeze and read, and the sent the data to baseband

digital signal processing module to make a real-time spectrum analysis. The FPGA implementation status transition is shown in fig. 3.



Figure 3 spectrum triggered template FPGA to achieve the state transition diagram

#### **3.2** CZT algorithm principle and FPGA implementation

The combination and corrected sampling signal  $x(n)(n = 0, \dots, N-1)$  are realized by FPGA. According to the literature [2], the Z transform can be expressed as:

$$X(z_{k}) = W^{k^{2}/2} \sum_{n=0}^{N-1} f(n) h(k-n)$$
<sup>(1)</sup>

Where  $f(n) = x(n)A^{-n}W^{n^2/2}$ ,  $h(n) = W^{-n^2/2}$ ,  $A = A_0e^{i\theta_0}$ ,  $W = W_0e^{-i\phi_0}$ .  $A_0$  is the amplitude of the starting point for the z plane,  $\theta_0$  is the phase of the starting point (corresponding to the frequency of the starting point),  $W_0$  is the reciprocal of the amplitude gradient and  $\varphi_0$  is the phase increment (corresponding to the frequency resolution).  $A_0 = W_0 = 1$ , the CZT transform path is a section of circular arc on the unit circle, M=N (M is the number of spectral lines), and simplifies the computational complexity of CZT. By using  $\theta_0$  to adjust the starting frequency, and by using  $\varphi_0$  to determine the refinement ratio in the range of frequency selection.

The core of formula (1) is convolution calculation between f(n) and h(n). By the convolution theorem of Fourier transform, it can be concluded that cyclic convolution can be computed rapidly by FFT [3]. According to the CZT principle, the signal is related to input frequency, termination frequency, thinning factor and output line number. Therefore, the signal generation can be realized by two DDS modules and a data buffer RAM. In order to improve the system operation speed, the two DDS modules and two FFT are implemented in parallel architecture. In order to save resources and improve chip utilization, complex multiplier is multiplexed and multiplex FFT1 to FFT3. At this point, the FPGA implements the CZT module data flow, as shown in fig. 4.



Fig.4 Data flow diagram of CZT algorithm hardware implementation data flow diagram

#### 3.3 Window selection principle and FPGA implementation

The window choice is the most critical aspect of ensuring the successful application of the algorithm. The normalized frequency resolution of the system can be expressed as  $\Delta f_{\rm b} = \Delta f / \Delta f_1 = \Delta f \cdot M / (f_{\rm max} - f_{\rm min}) = \Delta f \cdot ND / f_{\rm s}$ , where D is frequency refinement multiples, The window must satisfy the following conditions:

1) Window SLL (sidelobe level) must satisfy: SLL > IDR. Otherwise, the algorithm will not be able to meet the IDR requirements.

2) The 6dB bandwidth of window needs to be satisfied:  $BW_{-6dB} < \Delta f_{\rm b}$ . To accomplish this situation when the frequency resolution is  $\Delta f$ , and correctly distinguish two signals of the same power, the typical standards  $BW_{-6dB} < \Delta f_{\rm b}$  are adopted[4].

3) Window amplitude drops, IDR dB bandwidth needs to be satisfied:  $BW_{-IDR} < 2\Delta f_b$ . To ensure the detection of a signal separated  $\Delta f$  from another signal IDR dB which has more power, the corresponding window width of the IDR must be narrower than  $2\Delta f_b$ .

In order to accomplish accurately detection of  $N_s$  signals, the worst side-lobe condition (i.e., the input signal is the combination  $N_s$  –1 same power signals with another signal whose power is IDR dB below) should be considered, and the window should satisfy the following conditions:

$$SLL > IDR + 20\log_{10}(N_s - 1) + SL$$
 (2)

where  $N_s > 1$ . The SL is window scalloping loss, i.e., the power loss when the signal frequency is located between two CZT frequency points[5].

According to the requirements of IDR, the normalized frequency resolution can be expressed as:

$$\Delta f_{\rm b} = \frac{BW_{\rm -IDR}}{2} = \frac{BW_{\rm null}}{2} \tag{3}$$

Where  $BW_{null}$  is the window main lobe width in bins.

#### 3.4 Principle of spectral analysis algorithm and FPGA implementation

In order to reduce the measurement error and obtain more accurate measurement results, three threshold parameters are designed in this paper. In this section, the threshold parameter setting theory is discussed in detail, and the specific parameters are shown in figure 5.



Fig.5 The Schematic diagram of Threshold parameter

1) window main width valve  $n_b$ : To simplify the hardware implementation of the algorithm, the main lobe of the signal after adding windows is assumed to be composed of odd point CZT resolution points,  $n_b = 2\lceil BW_{null}/2\rceil + 1$ . To avoid the situation that  $n_b$  is the even, reduce the error detection probability and eliminate the both side of the signal maximum  $n_b/2$  or  $(n_b - 1)/2$ .

2) Noise threshold T dB: In order to avoid the measurement error caused by noise, the noise threshold T (dB) is introduced here. Defined according to document [5]:

$$P_{\rm FA} = e^{-T/\sigma_{\rm T}^2} \tag{4}$$

where  $P_{\text{FA}}$  is the fixed value related to noise power  $\sigma_{\text{T}}^2$  and the error detection probability caused by noise. If  $\sigma_{\text{T}}^2$  is known, suppose  $X_{\text{P}}[k]$  is the power value which is located in k bins, M is the only noise power point,  $\gamma$  is related to the global error detection probability  $P_{\text{FA},\text{g}}$ , then the threshold T can be expressed as:

$$T = \gamma \frac{1}{M} \sum_{k=1}^{M} X_{\mathrm{P}}[k] \tag{5}$$

When  $M \gg 1$ , plug the formula (5) into (4):

$$P_{\rm FA}|_{M\gg1} \approx e^{-\gamma} \tag{6}$$

At this point,  $P_{FA}$  detector performance tends to the ideal detector (4).

If the number of spectral lines is  $M_{CZT}$  and obeys independent and identical distributed, then

$$P_{\text{FAg}} = 1 - (1 - P_{\text{FA}})^{M_{\text{CZT}}} \rightarrow P_{\text{FAg}} |_{P_{\text{FA}} \ll 1} \approx M_{\text{CZT}} P_{\text{FA}}$$

$$\tag{7}$$

According to formula (6), (7), it can be known :

$$\gamma \approx -\ln(P_{\rm FA,g}/M_{\rm CZT}) \tag{8}$$

Dynamic threshold S (dB): In order to ensure the requirements of IDR, avoid the detection error caused by the secondary lobe of the window, so set the dynamic threshold S = IDR + SL, in which SL is the scalloping loss of the window.

By cumulative averaging the output data of the CZT module, the transient error is reduced. And the results are sent to make a spectrum analysis to obtain SDW. The specific implementation steps are shown in figure 6.



Fig.6 The structure diagram of Spectrum Analysis

Create two memory whose length are  $N_s$ , storing power values and corresponding data locations, respectively. Spectral analysis can be achieved by the following steps:

Output the CZT result of M point, according to the parameters  $n_b$  to find the  $N_s$  maximum points, and its power and location corresponding to storage. In FPGA, by reading the cumulative average data in cache RAM, and searching the maximum point by the comparator in turn, the power value and the corresponding location are stored respectively. When the maximum value  $k_M^{(i)}$  is found in the i iteration,  $(n_b - 1)/2$  point data of  $k_M^{(i)}$  both the left and right are discarded at the time of reading, and the read result is written to the next level cache RAM. Similarly the same signal power spectrum data can be obtained.

2) Noise estimation: the noise power is not considered as the  $N_s$  average of the output power associated with the signals above :

$$P_n = \frac{1}{M_s} \sum_k X_P[k] b[k]$$
<sup>(9)</sup>

where (9),  $M_s = M - n_b N_s$ . b[k] is the blank window:

$$b[k] = \begin{cases} 0 & k = k_M^{(i)} - (n_b - 1)/2, \dots, k_M^{(i)} + (n_b - 1)/2, i = 1, \dots, N_s \\ 1 & \ddagger \& \end{cases}$$
(10)

where (10),  $k_M^{(i)}$  is the i storage value in the position memory.

3) Threshold application: according to the principle and noise estimation results above, noise threshold  $T = P_n(dB) + 10\log_{10}(\gamma)$  and dynamic threshold  $10\log_{10}(X_M^1) - S(dB)$  can be calculated and applied. The signal parameters in the memory need to make the logarithm operation, read and compared with the noise threshold and the dynamic threshold sequentially. When the signal power is less than the threshold, it is discarded or otherwise reserved.

### 3.5 System delay



Fig.7 The Schematic diagram of Spectrum Analyzer Time-Delay

Mathematical operations of baseband digital signal processing module mainly focused on three sub modules: multiphase filtering, CZT and threshold analysis. To achieve the pipeline processing of baseband digital signal, the design of the three modules of the data flow diagram as shown in fig.7. Two latency are defined here: latency between blocks  $l_{\rm B}$  (the time interval in when a new block can be introduced in the system) and the total delay  $l_{\rm T}$  (the time interval to obtain the block result SDW when the first sample is introduced). According to fig.7,  $l_{\rm T} = l_{\rm FIR} + l_{\rm CZT} + l_{\rm PA}$ , therefore,  $l_{\rm B} < l_{\rm T}$ . Since the modules are pipelined, the next block of data can enter the system before the previous block of data is processed. The time delay of the poly-phase filtering module is reconstructed from the resampling data filtering into the data block whose frame length is  $N_{\rm CZT}$ , and the processing time is  $N_{\rm CZT}$  clock cycle and  $N_{\rm CZT}$  window is added to the clock cycle, so  $l_{\rm B}$  can be expressed as:

$$l_{\rm B} = \frac{NT + N_{\rm CZT}}{f_{\rm s}} \tag{11}$$

The poly-phase filter module processes the data continuously at the sampling rate  $f_s$  and stores the results in the First input first output (FIFO) memory, so the delay of the module is equal to the block delay,  $l_{\text{FIR}} = l_{\text{B}}$ . When the first data block passes through the poly-phase filtering module, the CZT module will start to process the data, so the CZT module must process the data block before the next data block completes the poly-phase filtering, it means  $l_{\text{CZT}} \leq l_{\text{FIR}}$ . The delay of CZT module consists of 10 cycles address arithmetic delay, 3 cycles complex multiplier,  $N_{\text{CZT}}$  cycles FFT and  $N_{\text{CZT}}$  cycles IFFT delay,  $l_{\text{CZT}}$  can be expressed as:

$$l_{\rm CZT} = \frac{2 \cdot N_{\rm CZT} + 13}{f_{\rm CZT}} \tag{12}$$

Before the CZT module loads a new set of data, the spectral analysis module should complete the previous set of data processing, which means  $l_{PA} \leq l_{FIR}$ . Spectrum analysis first need to load the output data of CZT module into the spectrum analysis module, and then make  $N_s$  maximum search of  $N_{CZT}$  CZT output results, a noise estimation and 280 clock cycles of the other treatments (spectrum adjustment and SDW generation). However, because the cumulative average and maximum search is based on 1 cumulative RAM and 5 cache RAM implementations, the spectral analysis delay  $l_{PA}$  can be expressed as:

$$l_{\rm PA} = \frac{N_{\rm CZT} + 280}{f_{\rm PA}}$$
(13)

Through the above module delay analysis,  $l_{\rm T} \leq 3 \cdot l_{\rm FIR}$  can be seen.

### 4. System verification

In this section, we will verify the performance of the spectrum analyzer in the weak signal under strong noise background. The hardware platform uses Altera, Cyclone, IVE: EP4CE115F29I7 and FPGA.

In order to verify the detection performance of weak signal under strong noise background, the test signal is produce by MATLAB:

$$x_{1}(i) = \begin{cases} \sin(2\pi f_{1}i/f_{s}) + 0.5 \operatorname{randn}(\operatorname{size}(i)) & i \in [1, 2048] \\ \sin(2\pi f_{1}i/f_{s}) + 0.02 \sin(2\pi f_{2}i/f_{s}) \\ + 0.5 \operatorname{randn}(\operatorname{size}(i)) & i \in [2049, 5120] \end{cases}$$

Where:  $f_1 = 44$ MHz,  $f_2 = 65$ MHz,  $f_s = 200$ MHz, and randn is zero mean random noise signal. Since the weak signal frequency is 65MHz, MATLAB is used to generate a frequency spectrum template with a trigger frequency of 56.65MHz~68.75MHz and a normalized amplitude of 0.00032.

Based on the analysis of bandwidth and filter sub-band bandwidth, the filter sub-bands 7 and 10 are selected in this example. And the buffer delay period is set to 0 because the test signal is a stationary signal. In order to satisfy the performance of real-time spectrum analysis, the clock of CZT module

and threshold analysis module should be set properly. According to document [6], the maximum frequency of the FFT module is  $f_{\rm FFT} = 400$ MHz ,therefore,  $f_{\rm CZTmax} = 400$ MHz. By formula (11),  $l_{\rm FIR} = l_{\rm B} = 5.12 \mu s$  can be obtained. By formula (12) and continuous data stream processing module of CZT timing constraints,  $f_{\rm PAmin} = 154.69$ MHz is available. By formula (13) and the timing constraints of continuous data stream processing threshold analysis module,  $f_{\rm PAmin} = 154.69$ MHz is obtained. According to  $f_{\rm CZTmax} = 400$ MHz,  $f_{\rm PAmax} = 310$ MHz can be concluded. Therefore, to meet the requirements of the clock frequency as listed in table 1.Considering the system clock frequency is 300MHz, the system power consumption is reduced, and the real-time processing of the signal flow is guaranteed. In this case,  $f_{\rm CZT} = 250$ MHz,  $f_{\rm PA} = 200$ MHz, the CZT and threshold analysis module are configured, and the system delay  $l_{\rm T} = 13.23\mu s$  is achieved.

As the user input *IDR* = 30dB,  $N_s = 2$ , according to the formula (2) can be seen, *SLL*  $\ge$  32. The window is selected Hamming window (according to document [4], Hamming window, SSL=-43dB, SL=1.78dB and main valve count =4) to meet the requirements of IDR, and  $n_b = 5$ , S = 32dB. According to the starting and stop frequency, and the sampling rate of the system, D = 8 can be seen. The normalized resolution formula, frequency resolution and formula (3),  $N_{CZT} = M = 512$  are obtained. According to the relation between the initial frequency, the initial phase and  $N_{CZT} = 512$ ,  $\theta_0 = 2\pi \times f_{min} / f_s = 2\pi \cdot 86/512$ ,  $f_{PAmax} = 250$ MHz is obtained. Use  $f_{CZT} = 400$ MHz,  $f_{PA} = 250$ MHz to configure the CZT and PA modules, the system delays  $l_T = 38.27\mu$ s.

Table 1. baseband digital signal processing module clock configuration (weak signal performance test)

|                                         |     | ust)                    |       |       |
|-----------------------------------------|-----|-------------------------|-------|-------|
| CZT and threshold analysis module total |     | $f_{\rm czr}({ m MHz})$ |       |       |
| delay                                   |     | 250                     | 300   | 400   |
|                                         | 200 | 8.108                   | 7.417 | 6.553 |
| $f_{\rm PA}({ m MHz})$                  | 250 | 7.316                   | 6.625 | 5.761 |
|                                         | 300 | 6.788                   | 6.097 | 5.233 |



Fig.8 The timing simulation waveform of weak signal

The frequency values and power values of detection signals can be obtained from Fig.8. The input and detection signal frequencies and normalized power values are shown in table 2.

| Tab.2 Comparison of weak signal measurement results |       |        |  |  |
|-----------------------------------------------------|-------|--------|--|--|
| Input signal frequency $f_{in}$ (MHz)               | 44    | 65     |  |  |
| Measuring signal frequency $f_{test}$ (MHz)         | 44.07 | 65.01  |  |  |
| Absolute error(MHz)                                 | 0.07  | 0.01   |  |  |
| Normalized amplitude of input signal $A_{in}$       | 1     | 0.02   |  |  |
| Normalized amplitude of measured signal $A_{test}$  | 1     | 0.0205 |  |  |
| Relative error (%)                                  | 0     | 2.5    |  |  |

### 5. Conclusion

Based on the advantages of frequency template triggering method and CZT algorithm, the real-time measurement of weak signal under strong noise background power spectrum, a real-time spectrum based on frequency mask trigger and poly-phase filter bank, CZT transform and spectrum analysis of a combination of easy hardware implementation method is presented in this paper, and make a hardware implementation of this method by using FPGA platform. This method uses a frequency template to achieve a specific frequency range triggering, poly-phase filtering reconstruct the user focus spectrum, the CZT spectrum zooming algorithm to solve the limited window length on frequency resolution, spectral analysis algorithm provides accurate results of the spectrum analysis. The analysis result shows that the system can accurately detect the weak signal under strong background noise and spectrum analysis of the simulation results and error detection, the frequency error is less than 0.16% (70KHz), power error is less than 2.5% and the minimum delay system is less than 37  $\mu$ s. Simulation analysis shows that the design has good detection accuracy, time-frequency accuracy and real-time performance for weak signal frequency spectrum analysis.

## References

- [1] Medina L M C, De J R R, Cabal-Yepez E, et al. FPGA-Based Multiple-Channel Vibration Analyzer for Industrial Applications in Induction Motor Failure Detection [J]. IEEE Transactions on Instrumentation & Measurement, 2010, 59(1):63-72.
- [2] Hu Guangshu. Digital Signal Processing: Theory, Algorithm and Implementation [M] .2 edition. Beijing: Tsinghua University Press, 2012:192-195.
- [3] HU Guangshu. Digital signal processing: theory, algorithm and Implementation[M]. Version 2.Beijing: Tsinghua University Press,2012:192-195
- [4] MA Ke, ZHANG Yuan-an, ZHANG Kai-sheng.CZT and ZFFT spectral refinement performance analysis and FPGA implementation [J]. Computer Measurement and Control, 2016, 24(02):288-289.
- [5] MA Ke, ZHANG Yuanan, ZHANG Kaisheng. Performance Analysis for CZT and ZFFT Spectrum Zoom and Its FPGA Realization[J]. Computer Measurement &Control, 2016, 24(02):288-289.
- [6] Harris F J. On the use of harmonic analysis with the discrete Fourier transform [J]. Proceedings of the IEEE, 1978, 66(1):51-83.
- [7] Digital Communications Laboratory, Polytechnic University of Valencia. FPGA-based vectorial spectrum analyzer [EB/OL]. (2007)[2016-07-20]. http://www.gised.upv.es/anaspect.html.
- [8] Altera.(2011).External Memory Interface Handbook Volume 3.[Online].
- [9] https://www.altera.com.cn/content/dam/altera-www/global/zh\_CN/pdfs/literature/hb/externalmemory/emi\_ddr\_ug.pdf.