CN115295024A - Signal processing method, signal processing device, electronic apparatus, and medium - Google Patents

Signal processing method, signal processing device, electronic apparatus, and medium Download PDF

Info

Publication number
CN115295024A
CN115295024A CN202210377040.2A CN202210377040A CN115295024A CN 115295024 A CN115295024 A CN 115295024A CN 202210377040 A CN202210377040 A CN 202210377040A CN 115295024 A CN115295024 A CN 115295024A
Authority
CN
China
Prior art keywords
phase
spectrogram
frequency
frame
compensation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210377040.2A
Other languages
Chinese (zh)
Inventor
康东
刘良兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vivo Mobile Communication Co Ltd
Original Assignee
Vivo Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vivo Mobile Communication Co Ltd filed Critical Vivo Mobile Communication Co Ltd
Priority to CN202210377040.2A priority Critical patent/CN115295024A/en
Publication of CN115295024A publication Critical patent/CN115295024A/en
Priority to PCT/CN2023/086987 priority patent/WO2023197955A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Noise Elimination (AREA)
  • Complex Calculations (AREA)

Abstract

The application discloses a signal processing method, a signal processing device, electronic equipment and a medium, and belongs to the technical field of communication. The signal processing method comprises the following steps: acquiring a first phase spectrogram of M frames of signals, wherein M is a positive integer; and performing phase compensation on the first phase spectrogram according to the first phase compensation function to obtain a second phase spectrogram, wherein the second phase spectrogram is a phase characteristic spectrogram of the M frames of signals at the target frequency.

Description

Signal processing method, signal processing apparatus, electronic device, and medium
Technical Field
The present application belongs to the field of communication technologies, and in particular, to a signal processing method, apparatus, electronic device, and medium.
Background
With the development of communication technology, various signal detection is widely applied in the production and life of people. For example, signal detection such as harmonic detection, single tone detection, radar reflection frequency detection, biomedical signal frequency detection and the like of voice signals have important applications in scenes such as voice communication, industrial automation control, radar ranging, medical imaging and the like.
Since noise often accompanies a signal (e.g., a speech signal), in practical applications, it is necessary to distinguish the signal from the noise to obtain a valid signal. Generally, the curves of the energy of the signal and the noise can be reflected by the amplitude spectrogram, and since the energy of the signal is stronger than the energy of the noise, the signal and the noise can be distinguished according to the two curves in the amplitude spectrogram, that is, the signal indicated by the curve with stronger energy is determined as the signal of the target frequency.
However, when the electronic device is in a medium-low signal-to-noise ratio environment below-15 decibels (dB), since the energy of the signal is not much different from that of the noise, the curve in the amplitude spectrum shows a small energy difference, and thus the signal and the noise cannot be distinguished. As such, the signal of the target frequency cannot be accurately determined from the amplitude spectrogram.
Disclosure of Invention
The embodiment of the application aims to provide a signal processing method, a signal processing device, an electronic device and a medium, which can solve the problem that a signal of a target frequency cannot be accurately determined from an amplitude spectrogram in a medium-low signal-to-noise ratio environment.
In a first aspect, an embodiment of the present application provides a signal processing method, where the method includes: acquiring a first phase spectrogram of M frames of signals, wherein M is a positive integer; and performing phase compensation on the first phase spectrogram according to a first phase compensation function to obtain a second phase spectrogram, wherein the second phase spectrogram is a phase characteristic spectrogram of the M frame signal at the target frequency.
In a second aspect, an embodiment of the present application provides a signal processing apparatus, including: the device comprises an acquisition module and a processing module. The acquisition module is used for acquiring a first phase spectrogram of M frames of signals, wherein M is a positive integer. And the processing module is used for performing phase compensation on the first phase spectrogram acquired by the acquisition module according to the first phase compensation function to acquire a second phase spectrogram, wherein the second phase spectrogram is a phase characteristic spectrogram of the M frame signal at the target frequency.
In a third aspect, embodiments of the present application provide an electronic device, which includes a processor and a memory, where the memory stores a program or instructions executable on the processor, and the program or instructions, when executed by the processor, implement the steps of the method according to the first aspect.
In a fourth aspect, embodiments of the present application provide a readable storage medium, on which a program or instructions are stored, which when executed by a processor, implement the steps of the method according to the first aspect.
In a fifth aspect, an embodiment of the present application provides a chip, where the chip includes a processor and a communication interface, where the communication interface is coupled to the processor, and the processor is configured to execute a program or instructions to implement the method according to the first aspect.
In a sixth aspect, embodiments of the present application provide a computer program product, which is stored in a storage medium and executed by at least one processor to implement the method according to the first aspect.
In the embodiment of the application, a first phase spectrogram of M frames of signals is obtained, wherein M is a positive integer; according to a method, the method comprises: acquiring a first phase spectrogram of M frames of signals, wherein M is a positive integer; and performing phase compensation on the first phase spectrogram according to a phase compensation function to obtain a second phase spectrogram, wherein the second phase spectrogram is a phase characteristic spectrogram of the M frame signal at the target frequency. According to the scheme, after one phase spectrogram of at least one frame of signal is obtained, as the phase compensation is carried out on the one phase spectrogram according to the phase compensation function to obtain the other phase spectrogram, and as the other phase spectrogram is the phase characteristic spectrogram of the at least one frame of signal at the target frequency, the signal of the target frequency can be accurately determined from the other phase spectrogram. Therefore, when the signal processing method is in a medium-low signal-to-noise ratio environment, the signal of the target frequency can be distinguished through the signal processing method provided by the application.
Drawings
Fig. 1 is a schematic diagram of a signal processing method according to an embodiment of the present disclosure;
fig. 2 is a schematic diagram of a process of processing a phase spectrogram according to an embodiment of the present disclosure;
fig. 3 is a second schematic diagram illustrating a phase spectrogram processing process according to an embodiment of the present application;
fig. 4 is a third schematic diagram illustrating a process of processing a phase spectrogram according to an embodiment of the present disclosure;
fig. 5 is a fourth schematic view illustrating a phase spectrogram processing process according to an embodiment of the present application;
fig. 6 is a schematic diagram of time-frequency distribution according to an embodiment of the present disclosure;
fig. 7 is a second schematic diagram of a time-frequency distribution according to an embodiment of the present application;
fig. 8 is a third schematic diagram of time-frequency distribution according to an embodiment of the present application;
fig. 9 is a schematic structural diagram of a signal processing apparatus according to an embodiment of the present application;
fig. 10 is a schematic structural diagram of an electronic device provided in an embodiment of the present application;
fig. 11 is a hardware schematic diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be described clearly below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are some, but not all, embodiments of the present application. All other embodiments that can be derived by one of ordinary skill in the art from the embodiments given herein are intended to be within the scope of the present disclosure.
The terms first, second and the like in the description and in the claims of the present application are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that embodiments of the application may be practiced in sequences other than those illustrated or described herein, and that the terms "first," "second," and the like are generally used herein in a generic sense and do not limit the number of terms, e.g., the first term can be one or more than one. In addition, "and/or" in the specification and claims means at least one of connected objects, a character "/" generally means that a preceding and succeeding related objects are in an "or" relationship.
The signal processing method, the signal processing apparatus, the electronic device, and the medium according to the embodiments of the present application are described in detail below with reference to the accompanying drawings.
As shown in fig. 1, an embodiment of the present application provides a signal processing method, which includes the following steps S101 to S102.
S101, the signal processing device acquires a first phase spectrogram of the M frames of signals.
Wherein M is a positive integer.
Optionally, the signal processing method provided by the embodiment of the present application may be applied to a medium-low signal-to-noise ratio environment below-15 decibels (dB).
Optionally, in this embodiment of the present application, the M frame signal is a non-dense frequency signal. For example, the M-frame signal is a speech signal.
Optionally, the first phase spectrogram is a phase map of M frames of signals in a frequency domain.
Further, when M =1, namely a frame signal, the first phase spectrogram includes a phase-frequency curve; when M is greater than 1, i.e. a multi-frame signal, the first phase spectrogram comprises a plurality of phase frequency curves.
In addition, for the first phase spectrogram, the electronic device may directly obtain the first phase spectrogram from a server, or the electronic device may perform fourier transform on the time domain signal, which may specifically refer to an implementation manner in the related art, and this is not described in detail in this embodiment of the present application.
And S102, the signal processing device performs phase compensation on the first phase spectrogram according to the first phase compensation function to obtain a second phase spectrogram.
The second phase spectrogram is a phase characteristic spectrogram of the M-frame signal at the target frequency.
Optionally, the target frequency is determined according to a phase characteristic of the second phase spectrogram.
Optionally, before S102, the signal processing method provided in this embodiment of the present application may further include: and extracting the slope of the window function to obtain a first phase compensation function.
Alternatively, the window function may be set by a developer or customized by a user. For example, the window function may be a hanning window.
Specifically, since the window function is a multi-segment continuous curve, the slope of the window function can be extracted to form a straight line with equal slope. I.e. the slope lines are the first phase compensation function.
It should be noted that, by performing phase compensation processing on the first phase spectrogram according to the first phase compensation function, the wound phase spectrogram can be unwound, and a phase-frequency curve flattening effect can be achieved.
Exemplarily, M =3 is taken as an example. As shown in fig. 2, fig. 2 includes 3 subgraphs (a) to (c). Fig. 2 (a) is an amplitude spectrum of three adjacent frames, fig. 2 (b) is a first phase spectrum of the three adjacent frames, and fig. 2 (c) is a phase curve of the three adjacent frames, i.e., a second phase spectrum, after phase compensation is performed on the first phase spectrum by using a first phase compensation function.
Further, based on the analysis of the example in fig. 2, the region corresponding to the subband region with a relatively strong amplitude of the amplitude spectrogram in the second phase spectrogram exhibits a good flatness characteristic, i.e., the first phase compensation function has a good characteristic extraction function.
Exemplarily, calculating the difference between the first phase spectrogram and the first phase compensation function to obtain a phase after the coiled initial phase is subjected to primary compensation; then, 2 times of pi is carried out on the phase for residue taking, and the residue is divided by 2 to obtain another phase after residue taking; and finally, taking the sine of the other phase after the remainder and then taking the inverse sine to obtain a new phase spectrogram, namely the second phase spectrogram.
It should be noted that, according to the characteristic that the absolute value of the positive and negative phases of the sine signal is obtained, a method of obtaining sine and finally obtaining inverse sine is designed, and the interference of phase winding can be effectively removed. Further, considering that the phase spectrum information does not need to be concerned about the sign of the phase difference, dividing the phase information by 2 reduces the dynamic range to [0, pi ], which is more in line with the need to extract the target frequency.
It can be understood that after the phase compensation is performed on the first phase spectrogram according to the first compensation function to obtain the second phase spectrogram, the first phase compensation function has a good feature extraction effect because the region corresponding to the sub-band region of the second phase spectrogram, which has a relatively strong amplitude, shows a good flatness feature.
The embodiment of the present application provides a signal processing method, after obtaining one phase spectrogram of at least one frame of signal, because the phase compensation is performed on the one phase spectrogram according to a phase compensation function to obtain another phase spectrogram, and because the another phase spectrogram is a phase feature spectrogram of the at least one frame of signal at a target frequency, a signal of the target frequency can be accurately determined from the another phase spectrogram. Therefore, when the signal processing method is in a medium-low signal-to-noise ratio environment, the signal of the target frequency can be distinguished through the signal processing method provided by the application.
Optionally, before obtaining the second phase spectrum in S102, the signal processing method provided in the embodiment of the present application may further include S103 described below. Accordingly, the second phase spectrum obtained in S102 can be specifically obtained by S102A described below.
S103, the signal processing device obtains M second phase compensation functions.
Alternatively, S103 may be specifically realized by S103A and S103B described below.
S103A, the signal processing device determines a first estimation frequency based on the amplitude spectrogram of the M frames of signals.
The first estimated frequency is a frequency corresponding to a maximum amplitude value in the amplitude spectrogram.
Exemplarily, as shown in fig. 3, (a) in fig. 3 is an amplitude spectrogram of a 3-frame signal, where a frequency corresponding to a maximum amplitude in the amplitude spectrogram is 100, i.e., the first estimated frequency is 100.
S103B, the signal processing device determines M second phase compensation functions according to the first estimation frequency and the frame number of the M frames of signals.
Optionally, propagation time lengths corresponding to frame numbers of the M frame signals are respectively calculated, and M second phase compensation functions are obtained by combining the first estimation frequency.
It can be understood that, since each frame of the M frame signals corresponds to one frame number, M different second compensation functions are obtained.
According to the signal processing method provided by the embodiment of the application, the first estimation frequency is determined based on the amplitude spectrogram of the M frames of signals, and the M second phase compensation functions are determined according to the first estimation frequency and the frame number of the M frames of signals, so that the M second phase compensation functions can be used for performing interframe phase difference compensation on the M frames of signals, and the purpose of further feature extraction is achieved.
And S102A, the signal processing device performs inter-frame phase difference compensation on the third phase spectrogram according to the M second phase compensation functions to obtain a second phase spectrogram.
And one second phase compensation function performs phase compensation on one frame signal corresponding to the second phase compensation function. And the third phase spectrogram is obtained by performing phase compensation according to the first phase compensation function.
Alternatively, the one frame signal corresponding to the one second phase compensation function may be a signal after phase compensation is performed by using the first phase compensation function, or a signal without phase compensation performed by using the first phase compensation function. The method is determined according to actual conditions, and the method is not limited in the embodiment of the application.
The above method is to unwind the first phase spectrogram with winding, and then perform inter-frame phase difference compensation on the phase spectrogram obtained after unwinding to obtain a second phase spectrogram.
Optionally, in the step S102A, a third phase spectrogram is obtained after performing phase compensation on the first phase spectrogram according to the first phase compensation function; and performing inter-frame phase difference compensation on the third phase spectrogram according to the M second phase compensation functions to obtain a second phase spectrogram, which is exemplarily illustrated.
Further, when phase compensation needs to be performed on the first phase spectrogram according to the first phase compensation function and the M second phase compensation functions at the same time, inter-frame phase difference compensation can be performed on the first phase spectrogram according to the M second phase compensation functions to obtain a phase spectrogram; and according to the first phase compensation function, performing phase compensation on the phase spectrogram to obtain a second phase spectrogram.
It should be noted that, in the above manner, inter-frame phase difference compensation is performed on the first phase spectrogram, and then the phase spectrogram obtained after the inter-frame phase difference compensation is unwrapped to obtain the second phase spectrogram. It can be understood that the calculation amount can be effectively saved by adopting the method.
For example, the signal processing device may first calculate a difference between the first phase spectrogram and the second phase compensation function, where a phase of the one phase spectrogram is a phase obtained by performing one phase compensation on the initial phase with winding; then, calculating the difference between the phase spectrogram and the first phase compensation function to obtain another phase spectrogram, wherein the phase of the another phase spectrogram is the phase obtained after secondary phase compensation is carried out on the phase with winding; then, 2 times of pi is carried out on the phase of the other phase spectrogram for residue, and then 2 is divided to obtain the residue phase; and finally, taking the sine and taking the inverse sine of the rest phases to obtain a new phase, namely obtaining a second phase spectrogram.
Exemplarily, M =3 is taken as an example. Referring to fig. 2, as shown in fig. 3, (a) to (c) 3 sub-diagrams are included in fig. 3. Fig. 3 (a) is an amplitude spectrogram of adjacent three frames, and fig. 3 (c) is a phase curve, i.e., a second phase spectrogram, obtained by performing inter-frame phase difference compensation on fig. 3 (b) (i.e., fig. 2 (c)) by using a second phase compensation function.
Further, based on the analysis of the example in fig. 3, a region corresponding to a subband region of the second phase spectrogram, which has a relatively strong amplitude, shows a good inter-frame consistency, that is, the second phase compensation function has a good feature extraction effect.
In the embodiment of the application, after the phase compensation is performed on the first phase spectrogram according to the first compensation function, the inter-frame phase difference compensation is performed on the third phase spectrogram according to the M second phase compensation functions to obtain the second phase spectrogram, and as the region corresponding to the sub-band region with the stronger amplitude of the amplitude spectrogram in the second phase spectrogram has good inter-frame consistency, that is, the second phase compensation function has a good feature extraction effect.
Optionally, before obtaining the second phase spectrum in S102A, the signal processing method provided in this embodiment of the application may further include S104 described below. Accordingly, the second phase spectrum obtained in S102A described above can be specifically realized by S102A1 described below.
S104, the signal processing device acquires M first phase values corresponding to each frequency point in the fourth phase spectrogram.
And the fourth phase spectrogram is obtained by performing interframe phase difference compensation according to the M second phase compensation functions. And a first phase value is determined by calculating the standard deviation of P phase values, wherein the P phase values are the phase values of the P frame signals corresponding to a frequency point, and the P frame signals are signal frames adjacent to each frame signal in the M frame signals.
It should be noted that, each of the M first phase values corresponds to one frame of the M frames of signals.
S102A1, the signal processing device obtains a second phase spectrogram according to the M first phase values and the frequency points in the fourth phase spectrogram.
Optionally, the S102A1 specifically includes: and taking the M first phase values as phase values corresponding to the frequency points in the fourth phase spectrogram to obtain a second phase spectrogram.
It can be understood that the fourth phase spectrogram includes a plurality of consecutive frequency points, and each frequency point corresponds to M first phase values. Namely, the phase value of each frame signal at each frequency point is a first phase value, so as to obtain a second phase spectrogram.
Exemplarily, M =3 is taken as an example. Referring to fig. 3, fig. 4 includes (a) to (c) 3 sub-diagrams as shown in fig. 4. Fig. 4 (a) is an amplitude spectrum of adjacent three frames, and fig. 4 (c) is a phase curve, i.e., a second phase spectrum, obtained after performing the above-described S104 and S102A1 on fig. 4 (b) (i.e., fig. 3 (c)).
In the embodiment of the application, M first phase values corresponding to each frequency point in the second phase spectrogram can be obtained, and the second phase spectrogram is obtained according to the M first phase values and the frequency points in the fourth phase spectrogram, so that the area phase value corresponding to the sub-band area with the stronger amplitude of the amplitude spectrogram in the second phase spectrogram is obviously lower and tends to zero. I.e. by standard deviation calculation, further feature extraction can be performed.
Optionally, before obtaining the second phase spectrum in S102A1, the signal processing method provided in the embodiment of the present application may further include S105 described below. Accordingly, the second phase spectrum obtained in S102A1 may be specifically obtained in S102a described below.
And S105, the signal processing device acquires M second phase values corresponding to each frequency point in the fifth phase spectrogram.
And the fifth phase spectrogram is obtained according to the M first phase values and the frequency points in the fourth phase spectrogram. And a second phase value is determined by calculating the average value of the phase values of K adjacent frequency points, wherein the K adjacent frequency points are the frequency points adjacent to one frequency point, and K is an integer greater than 1.
It should be noted that each of the M second phase values corresponds to one frame of the M frames of signals.
And S102a, the signal processing device obtains a second phase spectrogram according to the M second phase values and the frequency points in the fifth phase spectrogram.
Optionally, the S102a specifically includes: and taking the M second phase values as phase values corresponding to the frequency points in the fifth phase spectrogram to obtain the second phase spectrogram.
It can be understood that the fifth phase spectrogram includes a plurality of consecutive frequency points, and each frequency point corresponds to M second phase values. Namely, the phase value of each frame signal at each frequency point is a second phase value, so as to obtain a second phase spectrogram.
Exemplarily, M =3 is taken as an example. Referring to fig. 4, as shown in fig. 5, (a) to (c) 3 sub-graphs are included in fig. 5. Fig. 5 (a) is an amplitude spectrogram of adjacent three frames, and fig. 5 (c) is a phase curve, i.e., a second phase spectrogram, obtained after performing the above-mentioned S105 and S102a on fig. 5 (b) (i.e., fig. 4 (c)).
In the embodiment of the application, M second phase values corresponding to each frequency point in the fifth phase spectrogram can be obtained, and the second phase spectrogram is obtained according to the M second phase values and the frequency points in the fifth phase spectrogram, so that the second spectrum phase diagram can further highlight that the region corresponding to the sub-band region with the stronger amplitude of the amplitude spectrogram has better single-frame stability and discrimination. I.e. by means of an average calculation, further feature extraction can be performed.
Optionally, after S102, the signal processing method provided in the embodiment of the present application may further include S106 described below.
And S106, sharpening the second phase spectrogram by the signal processing device to obtain a target time-frequency graph.
The target time-frequency diagram is a time-frequency distribution diagram of M frames of signals at a target frequency.
Optionally, the S106 specifically includes: and sharpening the phase spectrum information of the second phase spectrogram by adopting decibel scales to obtain a target time-frequency graph.
Specifically, the sharpening process of the second phase spectrogram in S106 specifically includes: and taking logarithm of the phase value of the second phase spectrogram, and then taking inverse number.
Illustratively, as shown in FIG. 6, the signal-to-noise ratio is-7 dB, the sampling frequency fs is 16 kilohertz (kHz) kHz, the lenFrame is 512, the lenFft is 2048, and the width4fnStd is 6. Fig. 6 (a) is an amplitude spectrum in which a dotted line box 01 is a target frequency; fig. 6 (b) is a target phase spectrum;
fig. 6 (c) is a target time-frequency graph after the target phase spectrogram is sharpened, and a dotted-line frame 02 in the target time-frequency graph is a target frequency. Therefore, the target time-frequency diagram reflects the distribution situation of the multi-frame signals at the target frequency of 100, so that the single-frame signals at the target frequency of 100 can be determined according to the target time-frequency diagram. Thus, compared with the amplitude spectrogram, the target time-frequency graph further highlights the distribution situation of the target frequency.
Illustratively, as shown in FIG. 7, the signal-to-noise ratio is-15 dB, the sampling frequency fs is 16 kilohertz (kHz), the lenFrame is 512, the lenFft is 2048, and the width4fnStd is 6. Fig. 7 (a) is an amplitude spectrum in which a dashed box 03 is a target frequency; fig. 7 (b) is a target phase spectrogram, fig. 7 (c) is a target time-frequency diagram after the target phase spectrogram is sharpened, and a dashed-line frame 04 in the target time-frequency diagram is a target frequency. Therefore, the target time-frequency diagram reflects the distribution situation of the multi-frame signals at the target frequency of 100, so that the single-frame signals at the target frequency of 100 can be determined according to the target time-frequency diagram. Thus, compared with the amplitude spectrogram, the target time-frequency graph further highlights the distribution situation of the target frequency.
Illustratively, as shown in FIG. 8, the signal-to-noise ratio is-18 dB, the sampling frequency fs is 16 kilohertz (kHz), the lenFrame is 512, the lenFft is 2048, and the width4fnStd is 6. Fig. 8 (a) is an amplitude spectrum in which a dashed box 05 is a target frequency; fig. 8 (b) is a target phase spectrogram, fig. 8 (c) is a target time-frequency diagram after the target phase spectrogram is sharpened, and a dashed frame 06 in the target time-frequency diagram is a target frequency. It can be known that, since the target time-frequency diagram reflects the distribution of the multi-frame signals at the target frequency of 100, a single-frame signal at the target frequency of 100 can be determined according to the target time-frequency diagram. Thus, compared with the amplitude spectrogram, the target time-frequency graph further highlights the distribution situation of the target frequency.
The signal processing method provided by the embodiment of the application can sharpen the second phase spectrogram to obtain the target time-frequency graph, and the target time-frequency graph is a time-frequency distribution graph of M frames of signals at the target frequency, so that the target frequency can be further highlighted, the signals of the target frequency can be determined according to the target time-frequency graph to distinguish noise, and thus higher noise resistance is achieved.
In the signal processing method provided by the embodiment of the application, the execution main body can be a signal processing device. In the embodiment of the present application, a method for executing signal processing by a signal processing apparatus is taken as an example, and the signal processing apparatus provided in the embodiment of the present application is described.
As shown in fig. 9, an embodiment of the present application provides a signal processing apparatus 200, which may include an acquisition module 201 and a processing module 202. The obtaining module 201 may be configured to obtain a first phase spectrogram of M frames of signals, where M is a positive integer. The processing module 202 may be configured to perform phase compensation on the first phase spectrogram obtained by the obtaining module 201 according to a first phase compensation function, so as to obtain a second phase spectrogram, where the second phase spectrogram is a phase characteristic spectrogram of the M-frame signal at the target frequency.
Optionally, the processing module may be further configured to extract a slope of the window function to obtain the first phase compensation function.
Optionally, the obtaining module may be further configured to obtain M second phase compensation functions. The processing module may be specifically configured to perform inter-frame phase difference compensation on the third phase spectrogram according to the M second phase compensation functions to obtain second phase spectrograms, where one second phase compensation function performs phase compensation on one frame of signals corresponding to one second phase compensation function; and the third phase spectrogram is obtained by performing phase compensation according to the first phase compensation function.
Optionally, the obtaining module may be specifically configured to determine a first estimated frequency based on an amplitude spectrogram of the M frames of signals, where the first estimated frequency is a frequency corresponding to a maximum amplitude in the amplitude spectrogram; and determining M second phase compensation functions according to the first estimation frequency and the frame number of the M frames of signals.
Optionally, the obtaining module may be further configured to obtain M first phase values corresponding to each frequency point in the fourth phase spectrogram, where the first phase values are determined by performing standard deviation calculation on P phase values, the P phase values are phase values corresponding to P frame signals at one frequency point, and the P frame signals are signal frames adjacent to each frame signal in the M frame signals. The processing module may be specifically configured to obtain a second phase spectrogram according to the M first phase values and frequency points in the fourth phase spectrogram; and the fourth phase spectrogram is obtained by performing interframe phase difference compensation according to the M second phase compensation functions.
Optionally, the obtaining module may be further configured to obtain M second phase values corresponding to each frequency point in the fifth phase spectrogram, where the second phase values are determined by performing mean value calculation on phase values of K adjacent frequency points, where the K adjacent frequency points are frequency points adjacent to one frequency point, and K is an integer greater than 1. The processing module may be specifically configured to obtain a second phase spectrogram according to the M second phase values and frequency points in the fifth phase spectrogram; and the fifth phase spectrogram is obtained according to the M first phase values and the frequency points in the fourth phase spectrogram.
Optionally, the processing module may be further configured to perform sharpening on the second phase spectrogram to obtain a target time-frequency map, where the target time-frequency map is a time-frequency distribution map of the M-frame signal at the target frequency.
The embodiment of the application provides a signal processing apparatus, after one phase spectrogram of at least one frame of signal is obtained, because phase compensation is performed on the one phase spectrogram according to a phase compensation function to obtain another phase spectrogram, and because the another phase spectrogram is a phase characteristic spectrogram of the at least one frame of signal at a target frequency, a signal of the target frequency can be accurately determined from the another phase spectrogram. Therefore, when the signal processing method is in a medium-low signal-to-noise ratio environment, the signal of the target frequency can be distinguished through the signal processing method provided by the application.
The signal processing apparatus in the embodiment of the present application may be an electronic device, and may also be a component in the electronic device, such as an integrated circuit or a chip. The electronic device may be a terminal, or may be a device other than a terminal. The electronic Device may be, for example, a Mobile phone, a tablet computer, a notebook computer, a palm top computer, a vehicle-mounted electronic Device, a Mobile Internet Device (MID), an Augmented Reality (AR)/Virtual Reality (VR) Device, a robot, a wearable Device, an ultra-Mobile personal computer (UMPC), a netbook or a Personal Digital Assistant (PDA), and the like, and may also be a server, a Network Attached Storage (Network Attached Storage, NAS), a personal computer (NAS), a Television (TV), a teller machine, a self-service machine, and the like, and the embodiments of the present application are not limited in particular.
The signal processing apparatus in the embodiment of the present application may be an apparatus having an operating system. The operating system may be an Android operating system, an ios operating system, or other possible operating systems, which is not specifically limited in the embodiment of the present application.
The signal processing apparatus provided in the embodiment of the present application can implement each process implemented in the method embodiments of fig. 1 to fig. 8, and is not described here again to avoid repetition.
Optionally, as shown in fig. 10, an electronic device 300 is further provided in the embodiment of the present application, and includes a processor 301 and a memory 302, where the memory 302 stores a program or an instruction that can be executed on the processor 301, and when the program or the instruction is executed by the processor 301, the steps of the signal processing method embodiment are implemented, and the same technical effects can be achieved, and are not described again here to avoid repetition.
It should be noted that the electronic device in the embodiment of the present application includes the mobile electronic device and the non-mobile electronic device described above.
Fig. 11 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
The electronic device 400 includes, but is not limited to: radio unit 401, network module 402, audio output unit 403, input unit 404, sensor 405, display unit 406, user input unit 407, interface unit 408, memory 409, and processor 410.
Those skilled in the art will appreciate that the electronic device 400 may further include a power source (e.g., a battery) for supplying power to various components, and the power source may be logically connected to the processor 410 through a power management system, so as to implement functions of managing charging, discharging, and power consumption through the power management system. The electronic device structure shown in fig. 11 does not constitute a limitation of the electronic device, and the electronic device may include more or less components than those shown, or combine some components, or arrange different components, and thus, the description is not repeated here.
The processor 410 is configured to obtain a first phase spectrogram of M frames of signals, where M is a positive integer; and the phase compensation is carried out on the first phase spectrogram according to the first phase compensation function to obtain a second phase spectrogram, wherein the second phase spectrogram is a phase characteristic spectrogram of the M frame signal at the target frequency.
Optionally, the processor 410 is further configured to extract a slope of the window function to obtain the first phase compensation function.
Optionally, the obtaining module is further configured to obtain M second phase compensation functions. The processor 410 may be specifically configured to perform inter-frame phase difference compensation on the third phase spectrogram according to the M second phase compensation functions to obtain second phase spectrograms, where one second phase compensation function performs phase compensation on one frame of signals corresponding to one second phase compensation function; and the third phase spectrogram is obtained by performing phase compensation according to the first phase compensation function.
Optionally, the processor 410 is configured to determine a first estimated frequency based on the amplitude spectrogram of the M frames of signals, where the first estimated frequency is a frequency corresponding to a maximum amplitude in the amplitude spectrogram; and determining M second phase compensation functions according to the first estimation frequency and the frame number of the M frames of signals.
Optionally, the processor 410 is configured to use M first phase values corresponding to each frequency point in the fourth phase spectrogram, where the first phase values are determined by performing standard deviation calculation on P phase values, where the P phase values are phase values corresponding to P frame signals at one frequency point, and the P frame signals are signal frames adjacent to each frame signal in the M frame signals; the second phase spectrogram is obtained according to the M first phase values and the frequency points in the fourth phase spectrogram; and the fourth phase spectrogram is obtained by performing interframe phase difference compensation according to the M second phase compensation functions.
Optionally, the processor 410 is configured to obtain M second phase values corresponding to each frequency point in the fifth phase spectrogram, where the second phase values are determined by performing mean calculation on phase values of K adjacent frequency points, where the K adjacent frequency points are frequency points adjacent to one frequency point, and K is an integer greater than 1; and the M second phase values and the frequency points in the fifth phase spectrogram are used for obtaining a second phase spectrogram; and the fifth phase spectrogram is obtained according to the M first phase values and the frequency points in the fourth phase spectrogram.
Optionally, the processor 410 is configured to perform sharpening on the second phase spectrogram to obtain a target time-frequency map, where the target time-frequency map is a time-frequency distribution map of the M frames of signals at the target frequency.
The embodiment of the application provides an electronic device, after one phase spectrogram of at least one frame of signal is obtained, because the phase compensation is performed on the one phase spectrogram according to a phase compensation function to obtain another phase spectrogram, and because the another phase spectrogram is a phase feature spectrogram of the at least one frame of signal at a target frequency, a signal of the target frequency can be accurately determined from the another phase spectrogram. Therefore, when the signal processing method is in a medium-low signal-to-noise ratio environment, the signal of the target frequency can be distinguished through the signal processing method provided by the application.
It should be understood that, in the embodiment of the present application, the input unit 404 may include a Graphics Processing Unit (GPU) 4041 and a microphone 4042, and the graphics processor 4041 processes image data of a still picture or a video obtained by an image capturing device (such as a camera) in a video capturing mode or an image capturing mode. The display unit 406 may include a display panel 4061, and the display panel 4061 may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like. The user input unit 407 includes at least one of a touch panel 4071 and other input devices 4072. A touch panel 4071, also referred to as a touch screen. The touch panel 4071 may include two parts, a touch detection device and a touch controller. Other input devices 4072 may include, but are not limited to, a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, and a joystick, which are not described in detail herein.
The memory 409 may be used to store software programs as well as various data. The memory 409 may mainly include a first storage area storing a program or an instruction and a second storage area storing data, wherein the first storage area may store an operating system, an application program or an instruction (such as a sound playing function, an image playing function, etc.) required for at least one function, and the like. Further, the memory 409 may comprise volatile memory or non-volatile memory, or the memory 409 may comprise both volatile and non-volatile memory. The non-volatile Memory may be a Read-Only Memory (ROM), a Programmable ROM (PROM), an Erasable PROM (EPROM), an Electrically Erasable PROM (EEPROM), or a flash Memory. The volatile Memory may be a Random Access Memory (RAM), a Static Random Access Memory (Static RAM, SRAM), a Dynamic Random Access Memory (Dynamic RAM, DRAM), a Synchronous Dynamic Random Access Memory (Synchronous DRAM, SDRAM), a Double Data Rate Synchronous Dynamic Random Access Memory (Double Data Rate SDRAM, ddr SDRAM), an Enhanced Synchronous SDRAM (ESDRAM), a Synchronous Link DRAM (SLDRAM), and a Direct Memory bus RAM (DRRAM). Memory 109 in the embodiments of the subject application includes, but is not limited to, these and any other suitable types of memory.
Processor 410 may include one or more processing units; optionally, the processor 410 integrates an application processor, which primarily handles operations related to the operating system, user interface, and applications, and a modem processor, which primarily handles wireless communication signals, such as a baseband processor. It will be appreciated that the modem processor described above may not be integrated into the processor 410.
The embodiment of the present application further provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or the instruction is executed by a processor, the program or the instruction implements each process of the signal processing method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here.
The processor is the processor in the electronic device described in the above embodiment. The readable storage medium includes a computer readable storage medium, such as a computer read only memory ROM, a random access memory RAM, a magnetic or optical disk, and the like.
The embodiment of the present application further provides a chip, where the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to execute a program or an instruction to implement each process of the above xxx method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here.
It should be understood that the chips mentioned in the embodiments of the present application may also be referred to as system-on-chip, system-on-chip or system-on-chip, etc.
Embodiments of the present application provide a computer program product, where the program product is stored in a storage medium, and the program product is executed by at least one processor to implement the processes of the foregoing signal processing method embodiments, and can achieve the same technical effects, and in order to avoid repetition, details are not repeated here.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one of 8230, and" comprising 8230does not exclude the presence of additional like elements in a process, method, article, or apparatus comprising the element. Further, it should be noted that the scope of the methods and apparatus of the embodiments of the present application is not limited to performing the functions in the order illustrated or discussed, but may include performing the functions in a substantially simultaneous manner or in a reverse order based on the functions involved, e.g., the methods described may be performed in an order different than that described, and various steps may be added, omitted, or combined. In addition, features described with reference to certain examples may be combined in other examples.
Through the description of the foregoing embodiments, it is clear to those skilled in the art that the method of the foregoing embodiments may be implemented by software plus a necessary general hardware platform, and certainly may also be implemented by hardware, but in many cases, the former is a better implementation. Based on such understanding, the technical solutions of the present application or portions thereof that contribute to the prior art may be embodied in the form of a computer software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (which may be a mobile phone, a computer, a server, or a network device, etc.) to execute the method according to the embodiments of the present application.
While the present embodiments have been described with reference to the accompanying drawings, it is to be understood that the present embodiments are not limited to those precise embodiments, which are intended to be illustrative rather than restrictive, and that various changes and modifications may be effected therein by one skilled in the art without departing from the scope of the appended claims.

Claims (12)

1. A method of signal processing, the method comprising:
acquiring a first phase spectrogram of M frames of signals, wherein M is a positive integer;
and performing phase compensation on the first phase spectrogram according to a first phase compensation function to obtain a second phase spectrogram, wherein the second phase spectrogram is a phase characteristic spectrogram of the M frame signal at a target frequency.
2. The method of claim 1, wherein prior to phase compensating the first phase profile according to the first phase compensation function, the method further comprises:
and extracting the slope of the window function to obtain a first phase compensation function.
3. The method according to claim 1, wherein before said obtaining the second phase spectrum, the method further comprises;
obtaining M second phase compensation functions;
the obtaining of the second phase spectrogram comprises:
performing inter-frame phase difference compensation on a third phase spectrogram according to the M second phase compensation functions to obtain the second phase spectrogram, wherein one second phase compensation function performs phase compensation on one frame of signals corresponding to the one second phase compensation function;
and the third phase spectrogram is obtained by performing phase compensation according to the first phase compensation function.
4. The method of claim 3, wherein obtaining M second phase compensation functions comprises:
determining a first estimation frequency based on the amplitude spectrogram of the M frames of signals, wherein the first estimation frequency is a frequency corresponding to the maximum amplitude in the amplitude spectrogram;
and determining M second phase compensation functions according to the first estimation frequency and the frame number of the M frame signals.
5. The method of claim 3, wherein before obtaining the second phase spectrum, the method further comprises:
acquiring M first phase values corresponding to each frequency point in a fourth phase spectrogram, wherein the first phase values are determined by calculating the standard deviation of P phase values, the P phase values are phase values corresponding to P frame signals at one frequency point, and the P frame signals are signal frames adjacent to each frame signal in the M frame signals;
the obtaining of the second phase spectrogram comprises:
obtaining the second phase spectrogram according to the M first phase values and the frequency points in the fourth phase spectrogram;
and the fourth phase spectrogram is obtained by performing interframe phase difference compensation according to the M second phase compensation functions.
6. The method of claim 5, wherein before obtaining the second phase profile, the method further comprises:
acquiring M second phase values corresponding to each frequency point in a fifth phase spectrogram, wherein the second phase values are determined by performing mean value calculation on phase values of K adjacent frequency points, the K adjacent frequency points are frequency points adjacent to one frequency point, and K is an integer greater than 1;
the obtaining of the second phase spectrum comprises:
obtaining the second phase spectrogram according to the M second phase values and the frequency points in the fifth phase spectrogram;
and the fifth phase spectrogram is obtained according to the M first phase values and the frequency points in the fourth phase spectrogram.
7. The method of any one of claims 1 to 6, wherein after obtaining the second phase spectrum, the method further comprises:
and sharpening the second phase spectrogram to obtain a target time-frequency graph, wherein the target time-frequency graph is a time-frequency distribution graph of the M frames of signals at the target frequency.
8. The signal processing device is characterized by comprising an acquisition module and a processing module;
the acquisition module is used for acquiring a first phase spectrogram of M frames of signals, wherein M is a positive integer;
the processing module is configured to perform phase compensation on the first phase spectrogram obtained by the obtaining module according to a first phase compensation function to obtain a second phase spectrogram, where the second phase spectrogram is a phase characteristic spectrogram of the M-frame signal at a target frequency.
9. The apparatus of claim 8, wherein the processing module is further configured to extract a slope of the window function to obtain the first phase compensation function.
10. The apparatus according to claim 9, wherein the obtaining module is further configured to obtain M second phase compensation functions;
the processing module is specifically configured to perform inter-frame phase difference compensation on a third phase spectrogram according to the M second phase compensation functions to obtain the second phase spectrogram, and one second phase compensation function performs phase compensation on one frame of signals corresponding to the one second phase compensation function;
and the third phase spectrogram is obtained by performing phase compensation according to the first phase compensation function.
11. An electronic device, comprising a processor and a memory, the memory storing a program or instructions executable on the processor, the program or instructions, when executed by the processor, implementing the steps of the signal processing method according to any one of claims 1 to 7.
12. A readable storage medium, characterized in that it stores thereon a program or instructions which, when executed by a processor, implement the steps of the signal processing method according to any one of claims 1 to 7.
CN202210377040.2A 2022-04-11 2022-04-11 Signal processing method, signal processing device, electronic apparatus, and medium Pending CN115295024A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202210377040.2A CN115295024A (en) 2022-04-11 2022-04-11 Signal processing method, signal processing device, electronic apparatus, and medium
PCT/CN2023/086987 WO2023197955A1 (en) 2022-04-11 2023-04-07 Signal processing method and apparatus, electronic device, and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210377040.2A CN115295024A (en) 2022-04-11 2022-04-11 Signal processing method, signal processing device, electronic apparatus, and medium

Publications (1)

Publication Number Publication Date
CN115295024A true CN115295024A (en) 2022-11-04

Family

ID=83821217

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210377040.2A Pending CN115295024A (en) 2022-04-11 2022-04-11 Signal processing method, signal processing device, electronic apparatus, and medium

Country Status (2)

Country Link
CN (1) CN115295024A (en)
WO (1) WO2023197955A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023197955A1 (en) * 2022-04-11 2023-10-19 维沃移动通信有限公司 Signal processing method and apparatus, electronic device, and medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114464208A (en) * 2015-09-16 2022-05-10 株式会社东芝 Speech processing apparatus, speech processing method, and storage medium
CN108735213B (en) * 2018-05-29 2020-06-16 太原理工大学 Voice enhancement method and system based on phase compensation
CN110797041B (en) * 2019-10-21 2023-05-12 珠海市杰理科技股份有限公司 Speech noise reduction processing method and device, computer equipment and storage medium
CN111508514A (en) * 2020-04-10 2020-08-07 江苏科技大学 Single-channel speech enhancement algorithm based on compensation phase spectrum
CN114757229A (en) * 2022-04-11 2022-07-15 维沃移动通信有限公司 Signal processing method, signal processing device, electronic apparatus, and medium
CN115295024A (en) * 2022-04-11 2022-11-04 维沃移动通信有限公司 Signal processing method, signal processing device, electronic apparatus, and medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023197955A1 (en) * 2022-04-11 2023-10-19 维沃移动通信有限公司 Signal processing method and apparatus, electronic device, and medium

Also Published As

Publication number Publication date
WO2023197955A1 (en) 2023-10-19

Similar Documents

Publication Publication Date Title
US11138992B2 (en) Voice activity detection based on entropy-energy feature
CN109767783B (en) Voice enhancement method, device, equipment and storage medium
EP2828856B1 (en) Audio classification using harmonicity estimation
US11315582B2 (en) Method for recovering audio signals, terminal and storage medium
CN110970051A (en) Voice data acquisition method, terminal and readable storage medium
EP4254408A1 (en) Speech processing method and apparatus, and apparatus for processing speech
CN110070884B (en) Audio starting point detection method and device
CN111508519B (en) Method and device for enhancing voice of audio signal
WO2023197933A1 (en) Signal processing method and apparatus, and electronic device and medium
CN111968662A (en) Audio signal processing method and device and storage medium
CN111739544B (en) Voice processing method, device, electronic equipment and storage medium
CN108234793A (en) A kind of means of communication, device, electronic equipment and storage medium
CN112259116A (en) Method and device for reducing noise of audio data, electronic equipment and storage medium
CN115295024A (en) Signal processing method, signal processing device, electronic apparatus, and medium
CN110070885B (en) Audio starting point detection method and device
CN105931648B (en) Audio signal solution reverberation method and device
CN114879929A (en) Multimedia file playing method and device
CN112669878B (en) Sound gain value calculation method and device and electronic equipment
CN110085214B (en) Audio starting point detection method and device
CN112969130A (en) Audio signal processing method and device and electronic equipment
CN109841232B (en) Method and device for extracting note position in music signal and storage medium
CN110755055A (en) Method and equipment for determining waveform evaluation information of pulse waveform
CN114783454B (en) Model training and audio noise reduction method, device, equipment and storage medium
CN115798506A (en) Voice processing method and device, electronic equipment and storage medium
CN115273822A (en) Audio processing method, device, electronic equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination