CN111489759A - Noise evaluation method based on optical fiber voice time domain signal waveform alignment - Google Patents
Noise evaluation method based on optical fiber voice time domain signal waveform alignment Download PDFInfo
- Publication number
- CN111489759A CN111489759A CN202010210101.7A CN202010210101A CN111489759A CN 111489759 A CN111489759 A CN 111489759A CN 202010210101 A CN202010210101 A CN 202010210101A CN 111489759 A CN111489759 A CN 111489759A
- Authority
- CN
- China
- Prior art keywords
- voice
- signal
- voice signal
- optical fiber
- fiber
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0224—Processing in the time domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
Abstract
The invention relates to a noise evaluation method based on optical fiber voice time domain signal waveform alignment, which comprises the following steps: processing an original voice time domain signal; building a laser microphone of the optical fiber ring cavity; the location where the fiber optic coil is deployed; inputting the voice signal after time domain processing into a laser microphone through an optical fiber coil, observing the condition of the output signal by using an oscilloscope, and obtaining a stable output voice signal by adjusting the output power of an optical fiber laser source; and aligning the time domains of the input voice signal and the output voice signal of the laser microphone according to the inserted square wave signal, denoising the voice signal passing through the cavity laser microphone, namely the noise-containing voice signal, and evaluating the voice quality.
Description
Technical Field
The invention relates to an optical fiber sensing technology and an optical fiber annular cavity laser microphone, and belongs to the field of voice signal noise evaluation.
Background
As a mainstream communication mode, voice obviously becomes an important man-machine interaction means in the future. However, noise is inevitable and varies in the process of communication between people or human-computer interaction. For example, environmental noise, mechanical noise, traffic noise such as cars when people make calls on the road, etc. in voice communication affect voice quality. In the rapid development of the information-based society, the overall quality requirement on voice is higher and higher, and the voice denoising technology is rapidly developed. With the introduction of the deep research and new idea of speech denoising, a series of speech denoising methods such as human auditory masking, artificial neural network and speech denoising algorithm based on wavelet transform also appear in succession.
For a voice signal after voice denoising, how to judge the denoising effect needs to introduce a voice denoising evaluation index, and people perform performance evaluation on a voice enhancement algorithm by two methods, namely subjective evaluation, wherein the evaluation index relates to the intelligibility and the perception quality of a speech. Perceptual quality generally refers to the degree of speech recognition, speech quality, timbre pitch, etc. The subjective evaluation mainly considers whether the speech intelligibility is clear or not and whether the information transmitted by the speech signal is complete or not. Another method is objective evaluation by examining the quality of speech coding and speech communication. The objective evaluation method is to compare the voice denoising performance according to specific data, and other factors except the data are not needed to measure the denoising quality. The objective evaluation indexes mainly comprise signal-to-noise ratio, segmented signal-to-noise ratio and log-spectrum distortion measure. The objective evaluation index can obviously reflect the denoising performance to a certain extent, and is very important for the denoising effect.
The objective speech assessment method is illustrated here by an indicator of the signal-to-noise ratio:
the signal-to-noise ratio is defined as follows:
wherein, s (n) represents original voice, which may be a voice file recorded in a quiet environment of a laboratory, or a clearer voice file on a mobile phone, a computer or other equipment, or a recording file in mp3 format;representing the image by means of a de-noising processThe latter speech, L, represents the number of sample points of the speech signal, L is a parameter set by itself in the experiment.
It can also be known from the formula that the basic idea of speech quality assessment is to compare two speech signals, so that the original speech and the denoised speech must be in the same time period and have the same time length during speech quality assessment, that is, the speech signal time domain alignment processing is required. The general voice alignment processing is realized by an SPPAS tool, an audio alignment algorithm or a manual alignment method, but the common voice alignment processing has obvious errors, so that differences can be obviously heard subjectively. Speech time domain alignment is the basis for speech signal processing.
Technical scheme
The invention aims to provide a new method for realizing time domain waveform alignment of optical fiber signals, which is applied to the denoising processing of laser microphone voice signals based on an optical fiber ring cavity and realizes the time synchronization of input original and noisy voice when the denoising effect is subjected to voice quality evaluation. The technical scheme is realized as follows:
a noise evaluation method based on optical fiber voice time domain signal waveform alignment comprises the following steps:
firstly, processing an original voice time domain signal. Reading the original voice signal by Matlab software to obtain the sequence information of the original voice signal, adding square wave sequence information at the proper position of the original voice sequence, and converting and storing the synthesized voice sequence as an audio file for inputting into a laser microphone.
The laser microphone comprises an optical fiber laser source, an erbium-doped optical fiber, an FFP filter, a 2 × 2 coupler, an optical fiber coil and a data acquisition part, wherein the optical fiber laser source is connected with the FFP filter and outside the optical fiber ring cavity, continuous laser is generated by the optical fiber laser source, an optical signal which can only be transmitted in one direction is formed after passing through an isolator, then the optical signal is amplified by the erbium-doped optical fiber and then transmitted to the optical fiber coil, the optical fiber coil is formed by winding optical fibers with the length of more than 1 kilometer and serves as a voice signal sensing device, the optical signal is transmitted back to the optical fiber laser source through the coupler and the FFP filter, the coupler is used for converting the optical signal and an electric signal, the signal is converted through the coupler, and an output voice signal is obtained through the data acquisition part;
thirdly, arranging the position of the optical fiber coil;
fourthly, inputting the voice signal after time domain processing into a laser microphone through an optical fiber coil, observing the condition of the output signal by using an oscilloscope, and obtaining a stable output voice signal by adjusting the output power of an optical fiber laser source;
fifthly, aligning the time domains of the input voice signal and the output voice signal of the laser microphone according to the inserted square wave signal, denoising the voice signal passing through the cavity laser microphone, namely the noise-containing voice signal, and evaluating the voice quality.
Drawings
Fig. 1 is a schematic structural diagram of a laser microphone of a fiber ring cavity.
FIG. 2 is a flow chart of noise quality assessment based on waveform alignment of a fiber optic voice time domain voice signal according to the present invention.
Fig. 3 (a) is a schematic diagram of time domain waveforms of an original speech and a square wave signal, and (b) is a schematic diagram of a speech signal after a square wave sequence is added in the present invention.
Detailed Description
The invention is further illustrated and described below with reference to the accompanying drawings and specific examples.
The method is applied to evaluating the voice signal collected by the laser microphone in the optical fiber annular cavity, namely the voice signal containing noise.
First, the new microphone will be described, and referring to fig. 1, the system can be divided into three parts.
The first part is the hardware part of the system, which comprises a 980nm laser source, which generates continuous laser to provide light energy for the system, an erbium-doped fiber (EDFA), which is an optical fiber doped with a small amount of rare earth element erbium and can amplify light in the 1550nm range, a tunable fiber Fabry-Perot (FFP) filter adopted by the filter, the FFP filter and the optical fiber system have good compatibility, a 2 × 2 coupler outside the annular cavity of the optical fiber is directly connected with the filter, the annular cavity is formed by the FFP filter, the rest light is fed back into the annular cavity, and due to the action of an isolator, laser with the same transmission wavelength as that of the FFP can be generated in one direction only in the annular cavity.
The second part is an optical fiber coil, is an induction part of a voice signal, is also an input position of the voice signal, is equivalent to a loudspeaker effect, and is formed by winding ordinary optical fibers of a plurality of kilometers (which can be between 1 kilometer and 10 kilometers).
The third part is a voice acquisition part which consists of a Photodiode (PD) and a data acquisition card (DAQ).
The voice signal collected by the laser microphone of the fiber ring cavity is a voice signal containing noise. Noise originates from interference in the environment, and may be caused by a person speaking, walking, eating, opening or closing a door, knocking, traffic outside a window, or natural wind, rain, etc. during the recording process. These noises degrade the speech quality, and in order to obtain a clearer speech signal, speech denoising is required. In the speech denoising quality evaluation, a time domain alignment process for the speech signal is required, which is also the object of the present invention.
Referring to fig. 2, the flow chart of the present invention, the corresponding steps are briefly described as follows:
(1) time-domain processing of the original speech signal. The original voice signal can be a voice file recorded in a quiet environment of a laboratory, or a clearer mp3 voice file downloaded by equipment such as a mobile phone and a computer, the original voice signal is read to the computer by Matlab to obtain corresponding sequence information, square wave sequence information is added at a proper position of a voice sequence, a position with a large change of a sequence value is generally selected for adding, and then the synthesized sequence signal is converted into the voice signal to be used as a result of voice time domain processing.
(2) And (5) building a laser microphone system. The laser microphone system of the fiber ring cavity comprises components such as a fiber laser source, an FFP filter, an isolator, an erbium-doped fiber, a fiber coil and the like. And a voice acquisition part consisting of PD and DAQ outside the annular cavity.
(3) And arranging the fiber coil position. The optical fiber coil is used for sensing a voice signal, is also an input position of the voice signal and is formed by winding 2 kilometers of common optical fibers.
(4) And setting system parameters. And after a microphone system is built and an optical fiber coil is arranged, the microphone system is connected with an oscilloscope, and the optical fiber laser source is adjusted under the condition of voice signal input, so that sensitive and stable signal output is obtained under proper power. Generally, the higher the output power of the fiber laser source, the faster and more sensitive the output signal will be, and in the experiment, the output power of the fiber laser source is set to be about 350 w.
The idea of the invention is to add a square wave voice sequence in an original voice sequence, wherein the voice signal has a time mark point, then input the synthesized voice signal into a laser microphone, and the signal passing through the laser microphone is a voice signal containing noise, thereby ensuring that the input and output voice signals are aligned in time domain. Then, the output noise-containing voice signal is subjected to noise processing and voice signal quality evaluation.
Referring to fig. 3, a diagram (a) shows that in an original speech signal, a peak position is selected as a time node to add a square wave signal, and after the addition, a speech signal of a diagram (b) can be obtained, and a synthesized speech signal shown in the diagram (b) can be used as an input speech of a microphone system.
The invention has the following beneficial effects:
(1) the invention realizes the time domain waveform alignment of the optical fiber acoustic signal and has higher accuracy and practicability.
(2) The portability is good, aiming at the optical fiber sound signal wavelengths of different forms, only standard square waves, sawtooth waves or sine waves and the like need to be added at proper positions, and experimental programs can be used universally under various operating systems.
Claims (1)
1. A noise evaluation method based on optical fiber voice time domain signal waveform alignment comprises the following steps:
firstly, processing an original voice time domain signal: reading the original voice signal by Matlab software to obtain the sequence information of the original voice signal, adding square wave sequence information at the proper position of the original voice sequence, and converting and storing the synthesized voice sequence as an audio file for inputting into a laser microphone.
The second step, building a laser microphone of the fiber ring cavity, which comprises a fiber laser source, an erbium-doped fiber, an FFP filter, a 2 × 2 coupler outside the fiber ring cavity connected with the FFP filter, a fiber coil and a data acquisition part, wherein the fiber laser source generates continuous laser, optical signals which can only be transmitted in one direction are formed after passing through an isolator, then the optical signals are amplified by the erbium-doped fiber and then transmitted to the fiber coil, the fiber coil is formed by winding fibers with the length of more than 1 kilometer and serves as a voice signal induction device, the optical signals are transmitted back to the fiber laser source through the coupler and the FFP filter, the coupler is used for converting the optical signals and the electric signals, the signals are converted through the coupler, and output voice signals are obtained through the data acquisition part;
thirdly, arranging the position of the optical fiber coil;
fourthly, inputting the voice signal after time domain processing into a laser microphone through an optical fiber coil, observing the condition of the output signal by using an oscilloscope, and obtaining a stable output voice signal by adjusting the output power of an optical fiber laser source;
fifthly, aligning the time domains of the input voice signal and the output voice signal of the laser microphone according to the inserted square wave signal, denoising the voice signal passing through the cavity laser microphone, namely the noise-containing voice signal, and evaluating the voice quality.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010210101.7A CN111489759A (en) | 2020-03-23 | 2020-03-23 | Noise evaluation method based on optical fiber voice time domain signal waveform alignment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010210101.7A CN111489759A (en) | 2020-03-23 | 2020-03-23 | Noise evaluation method based on optical fiber voice time domain signal waveform alignment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111489759A true CN111489759A (en) | 2020-08-04 |
Family
ID=71810808
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010210101.7A Pending CN111489759A (en) | 2020-03-23 | 2020-03-23 | Noise evaluation method based on optical fiber voice time domain signal waveform alignment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111489759A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113409820A (en) * | 2021-06-09 | 2021-09-17 | 合肥群音信息服务有限公司 | Quality evaluation method based on voice data |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002097791A1 (en) * | 2001-05-25 | 2002-12-05 | Dolby Laboratories Licensing Corporation | Method for time aligning audio signals using characterizations based on auditory events |
CN103474083A (en) * | 2013-09-18 | 2013-12-25 | 中国人民解放军电子工程学院 | Voice time warping method based on orthogonal sinusoidal impulse sequence locating label |
US20140029762A1 (en) * | 2012-07-25 | 2014-01-30 | Nokia Corporation | Head-Mounted Sound Capture Device |
US20150279351A1 (en) * | 2012-12-19 | 2015-10-01 | Google Inc. | Keyword detection based on acoustic alignment |
CN107389097A (en) * | 2017-07-25 | 2017-11-24 | 北京航空航天大学 | Optical fibre gyro Sagnac fiber optic loop eigenfrequency tracking measurement methods |
CN110289014A (en) * | 2019-05-21 | 2019-09-27 | 华为技术有限公司 | A kind of speech quality detection method and electronic equipment |
US20200162821A1 (en) * | 2016-12-09 | 2020-05-21 | The Research Foundation For The State University Of New York | Fiber microphone |
US20210027769A1 (en) * | 2018-05-28 | 2021-01-28 | Huawei Technologies Co., Ltd. | Voice alignment method and apparatus |
-
2020
- 2020-03-23 CN CN202010210101.7A patent/CN111489759A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002097791A1 (en) * | 2001-05-25 | 2002-12-05 | Dolby Laboratories Licensing Corporation | Method for time aligning audio signals using characterizations based on auditory events |
US20140029762A1 (en) * | 2012-07-25 | 2014-01-30 | Nokia Corporation | Head-Mounted Sound Capture Device |
US20150279351A1 (en) * | 2012-12-19 | 2015-10-01 | Google Inc. | Keyword detection based on acoustic alignment |
CN103474083A (en) * | 2013-09-18 | 2013-12-25 | 中国人民解放军电子工程学院 | Voice time warping method based on orthogonal sinusoidal impulse sequence locating label |
US20200162821A1 (en) * | 2016-12-09 | 2020-05-21 | The Research Foundation For The State University Of New York | Fiber microphone |
CN107389097A (en) * | 2017-07-25 | 2017-11-24 | 北京航空航天大学 | Optical fibre gyro Sagnac fiber optic loop eigenfrequency tracking measurement methods |
US20210027769A1 (en) * | 2018-05-28 | 2021-01-28 | Huawei Technologies Co., Ltd. | Voice alignment method and apparatus |
CN110289014A (en) * | 2019-05-21 | 2019-09-27 | 华为技术有限公司 | A kind of speech quality detection method and electronic equipment |
Non-Patent Citations (3)
Title |
---|
O.KILIC ET AL.: "《Fiber-optical acoustic sensor based on a photonic-crystal diaphragm》", 《TRANSDUCERS 2009 - 2009 INTERNATIONAL SOLID-STATE SENSORS, ACTUATORS AND MICROSYSTEMS CONFERENCE》 * |
江毅、唐才杰: "《光纤Fabry-Perot干涉仪原理及应用》", vol. 2, 国防工业出版社, pages: 143 - 102 * |
高椿明 等: "《光纤声传感器综述》", 《光电工程》, vol. 45, no. 9, pages 116 - 125 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113409820A (en) * | 2021-06-09 | 2021-09-17 | 合肥群音信息服务有限公司 | Quality evaluation method based on voice data |
CN113409820B (en) * | 2021-06-09 | 2022-03-15 | 合肥群音信息服务有限公司 | Quality evaluation method based on voice data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10891967B2 (en) | Method and apparatus for enhancing speech | |
CN102016984B (en) | System and method for dynamic sound delivery | |
JP3636460B2 (en) | Method and system for detecting and generating transients in acoustic signals | |
JP3964456B2 (en) | Method and apparatus for objective voice quality measurement of telecommunications equipment | |
US20130246059A1 (en) | System and method for producing an audio signal | |
KR20070000995A (en) | Frequency extension of harmonic signals | |
WO2001033550A1 (en) | Speech parameter compression | |
CN108597505A (en) | Audio recognition method, device and terminal device | |
Prego et al. | A blind algorithm for reverberation-time estimation using subband decomposition of speech signals | |
CN101233561B (en) | Enhancement of speech intelligibility in a mobile communication device by controlling the operation of a vibrator of a vibrator in dependance of the background noise | |
CN109243429A (en) | A kind of pronunciation modeling method and device | |
CN111489759A (en) | Noise evaluation method based on optical fiber voice time domain signal waveform alignment | |
Gaudron et al. | LPG-based optical fibre sensor for acoustic wave detection | |
JP3205560B2 (en) | Method and apparatus for determining tonality of an audio signal | |
US7013266B1 (en) | Method for determining speech quality by comparison of signal properties | |
CN111128219B (en) | Laser Doppler sound taking method and device | |
Barnwell III | Objective measures for speech quality testing | |
CN111261192A (en) | Audio detection method based on LSTM network, electronic equipment and storage medium | |
KR20090080777A (en) | Method and Apparatus for detecting signal | |
CN112233693B (en) | Sound quality evaluation method, device and equipment | |
JP2002507776A (en) | Signal processing method for analyzing transients in audio signals | |
Baumgarte | A physiological ear model for auditory masking applicable to perceptual coding | |
Voishvillo | Measurements and Perception of Nonlinear Distortion—Comparing Numbers and Sound Quality | |
JP2006119647A (en) | System for spuriously converting whispery voice to ordinary voiced sound | |
Gully et al. | The Lombard effect in MRI noise |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information | ||
CB03 | Change of inventor or designer information |
Inventor after: Lv Chengang Inventor after: Xiao Yanping Inventor after: Ma Jingjing Inventor after: Huo Ziqiang Inventor before: Lv Chengang Inventor before: Ma Jingjing Inventor before: Huo Ziqiang |