CN111489759A - Noise evaluation method based on optical fiber voice time domain signal waveform alignment - Google Patents

Noise evaluation method based on optical fiber voice time domain signal waveform alignment Download PDF

Info

Publication number
CN111489759A
CN111489759A CN202010210101.7A CN202010210101A CN111489759A CN 111489759 A CN111489759 A CN 111489759A CN 202010210101 A CN202010210101 A CN 202010210101A CN 111489759 A CN111489759 A CN 111489759A
Authority
CN
China
Prior art keywords
voice
signal
voice signal
optical fiber
fiber
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010210101.7A
Other languages
Chinese (zh)
Inventor
吕辰刚
马敬敬
霍紫强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN202010210101.7A priority Critical patent/CN111489759A/en
Publication of CN111489759A publication Critical patent/CN111489759A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0224Processing in the time domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/60Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals

Abstract

The invention relates to a noise evaluation method based on optical fiber voice time domain signal waveform alignment, which comprises the following steps: processing an original voice time domain signal; building a laser microphone of the optical fiber ring cavity; the location where the fiber optic coil is deployed; inputting the voice signal after time domain processing into a laser microphone through an optical fiber coil, observing the condition of the output signal by using an oscilloscope, and obtaining a stable output voice signal by adjusting the output power of an optical fiber laser source; and aligning the time domains of the input voice signal and the output voice signal of the laser microphone according to the inserted square wave signal, denoising the voice signal passing through the cavity laser microphone, namely the noise-containing voice signal, and evaluating the voice quality.

Description

Noise evaluation method based on optical fiber voice time domain signal waveform alignment
Technical Field
The invention relates to an optical fiber sensing technology and an optical fiber annular cavity laser microphone, and belongs to the field of voice signal noise evaluation.
Background
As a mainstream communication mode, voice obviously becomes an important man-machine interaction means in the future. However, noise is inevitable and varies in the process of communication between people or human-computer interaction. For example, environmental noise, mechanical noise, traffic noise such as cars when people make calls on the road, etc. in voice communication affect voice quality. In the rapid development of the information-based society, the overall quality requirement on voice is higher and higher, and the voice denoising technology is rapidly developed. With the introduction of the deep research and new idea of speech denoising, a series of speech denoising methods such as human auditory masking, artificial neural network and speech denoising algorithm based on wavelet transform also appear in succession.
For a voice signal after voice denoising, how to judge the denoising effect needs to introduce a voice denoising evaluation index, and people perform performance evaluation on a voice enhancement algorithm by two methods, namely subjective evaluation, wherein the evaluation index relates to the intelligibility and the perception quality of a speech. Perceptual quality generally refers to the degree of speech recognition, speech quality, timbre pitch, etc. The subjective evaluation mainly considers whether the speech intelligibility is clear or not and whether the information transmitted by the speech signal is complete or not. Another method is objective evaluation by examining the quality of speech coding and speech communication. The objective evaluation method is to compare the voice denoising performance according to specific data, and other factors except the data are not needed to measure the denoising quality. The objective evaluation indexes mainly comprise signal-to-noise ratio, segmented signal-to-noise ratio and log-spectrum distortion measure. The objective evaluation index can obviously reflect the denoising performance to a certain extent, and is very important for the denoising effect.
The objective speech assessment method is illustrated here by an indicator of the signal-to-noise ratio:
the signal-to-noise ratio is defined as follows:
Figure BDA0002422517910000011
wherein, s (n) represents original voice, which may be a voice file recorded in a quiet environment of a laboratory, or a clearer voice file on a mobile phone, a computer or other equipment, or a recording file in mp3 format;
Figure BDA0002422517910000012
representing the image by means of a de-noising processThe latter speech, L, represents the number of sample points of the speech signal, L is a parameter set by itself in the experiment.
It can also be known from the formula that the basic idea of speech quality assessment is to compare two speech signals, so that the original speech and the denoised speech must be in the same time period and have the same time length during speech quality assessment, that is, the speech signal time domain alignment processing is required. The general voice alignment processing is realized by an SPPAS tool, an audio alignment algorithm or a manual alignment method, but the common voice alignment processing has obvious errors, so that differences can be obviously heard subjectively. Speech time domain alignment is the basis for speech signal processing.
Technical scheme
The invention aims to provide a new method for realizing time domain waveform alignment of optical fiber signals, which is applied to the denoising processing of laser microphone voice signals based on an optical fiber ring cavity and realizes the time synchronization of input original and noisy voice when the denoising effect is subjected to voice quality evaluation. The technical scheme is realized as follows:
a noise evaluation method based on optical fiber voice time domain signal waveform alignment comprises the following steps:
firstly, processing an original voice time domain signal. Reading the original voice signal by Matlab software to obtain the sequence information of the original voice signal, adding square wave sequence information at the proper position of the original voice sequence, and converting and storing the synthesized voice sequence as an audio file for inputting into a laser microphone.
The laser microphone comprises an optical fiber laser source, an erbium-doped optical fiber, an FFP filter, a 2 × 2 coupler, an optical fiber coil and a data acquisition part, wherein the optical fiber laser source is connected with the FFP filter and outside the optical fiber ring cavity, continuous laser is generated by the optical fiber laser source, an optical signal which can only be transmitted in one direction is formed after passing through an isolator, then the optical signal is amplified by the erbium-doped optical fiber and then transmitted to the optical fiber coil, the optical fiber coil is formed by winding optical fibers with the length of more than 1 kilometer and serves as a voice signal sensing device, the optical signal is transmitted back to the optical fiber laser source through the coupler and the FFP filter, the coupler is used for converting the optical signal and an electric signal, the signal is converted through the coupler, and an output voice signal is obtained through the data acquisition part;
thirdly, arranging the position of the optical fiber coil;
fourthly, inputting the voice signal after time domain processing into a laser microphone through an optical fiber coil, observing the condition of the output signal by using an oscilloscope, and obtaining a stable output voice signal by adjusting the output power of an optical fiber laser source;
fifthly, aligning the time domains of the input voice signal and the output voice signal of the laser microphone according to the inserted square wave signal, denoising the voice signal passing through the cavity laser microphone, namely the noise-containing voice signal, and evaluating the voice quality.
Drawings
Fig. 1 is a schematic structural diagram of a laser microphone of a fiber ring cavity.
FIG. 2 is a flow chart of noise quality assessment based on waveform alignment of a fiber optic voice time domain voice signal according to the present invention.
Fig. 3 (a) is a schematic diagram of time domain waveforms of an original speech and a square wave signal, and (b) is a schematic diagram of a speech signal after a square wave sequence is added in the present invention.
Detailed Description
The invention is further illustrated and described below with reference to the accompanying drawings and specific examples.
The method is applied to evaluating the voice signal collected by the laser microphone in the optical fiber annular cavity, namely the voice signal containing noise.
First, the new microphone will be described, and referring to fig. 1, the system can be divided into three parts.
The first part is the hardware part of the system, which comprises a 980nm laser source, which generates continuous laser to provide light energy for the system, an erbium-doped fiber (EDFA), which is an optical fiber doped with a small amount of rare earth element erbium and can amplify light in the 1550nm range, a tunable fiber Fabry-Perot (FFP) filter adopted by the filter, the FFP filter and the optical fiber system have good compatibility, a 2 × 2 coupler outside the annular cavity of the optical fiber is directly connected with the filter, the annular cavity is formed by the FFP filter, the rest light is fed back into the annular cavity, and due to the action of an isolator, laser with the same transmission wavelength as that of the FFP can be generated in one direction only in the annular cavity.
The second part is an optical fiber coil, is an induction part of a voice signal, is also an input position of the voice signal, is equivalent to a loudspeaker effect, and is formed by winding ordinary optical fibers of a plurality of kilometers (which can be between 1 kilometer and 10 kilometers).
The third part is a voice acquisition part which consists of a Photodiode (PD) and a data acquisition card (DAQ).
The voice signal collected by the laser microphone of the fiber ring cavity is a voice signal containing noise. Noise originates from interference in the environment, and may be caused by a person speaking, walking, eating, opening or closing a door, knocking, traffic outside a window, or natural wind, rain, etc. during the recording process. These noises degrade the speech quality, and in order to obtain a clearer speech signal, speech denoising is required. In the speech denoising quality evaluation, a time domain alignment process for the speech signal is required, which is also the object of the present invention.
Referring to fig. 2, the flow chart of the present invention, the corresponding steps are briefly described as follows:
(1) time-domain processing of the original speech signal. The original voice signal can be a voice file recorded in a quiet environment of a laboratory, or a clearer mp3 voice file downloaded by equipment such as a mobile phone and a computer, the original voice signal is read to the computer by Matlab to obtain corresponding sequence information, square wave sequence information is added at a proper position of a voice sequence, a position with a large change of a sequence value is generally selected for adding, and then the synthesized sequence signal is converted into the voice signal to be used as a result of voice time domain processing.
(2) And (5) building a laser microphone system. The laser microphone system of the fiber ring cavity comprises components such as a fiber laser source, an FFP filter, an isolator, an erbium-doped fiber, a fiber coil and the like. And a voice acquisition part consisting of PD and DAQ outside the annular cavity.
(3) And arranging the fiber coil position. The optical fiber coil is used for sensing a voice signal, is also an input position of the voice signal and is formed by winding 2 kilometers of common optical fibers.
(4) And setting system parameters. And after a microphone system is built and an optical fiber coil is arranged, the microphone system is connected with an oscilloscope, and the optical fiber laser source is adjusted under the condition of voice signal input, so that sensitive and stable signal output is obtained under proper power. Generally, the higher the output power of the fiber laser source, the faster and more sensitive the output signal will be, and in the experiment, the output power of the fiber laser source is set to be about 350 w.
The idea of the invention is to add a square wave voice sequence in an original voice sequence, wherein the voice signal has a time mark point, then input the synthesized voice signal into a laser microphone, and the signal passing through the laser microphone is a voice signal containing noise, thereby ensuring that the input and output voice signals are aligned in time domain. Then, the output noise-containing voice signal is subjected to noise processing and voice signal quality evaluation.
Referring to fig. 3, a diagram (a) shows that in an original speech signal, a peak position is selected as a time node to add a square wave signal, and after the addition, a speech signal of a diagram (b) can be obtained, and a synthesized speech signal shown in the diagram (b) can be used as an input speech of a microphone system.
The invention has the following beneficial effects:
(1) the invention realizes the time domain waveform alignment of the optical fiber acoustic signal and has higher accuracy and practicability.
(2) The portability is good, aiming at the optical fiber sound signal wavelengths of different forms, only standard square waves, sawtooth waves or sine waves and the like need to be added at proper positions, and experimental programs can be used universally under various operating systems.

Claims (1)

1. A noise evaluation method based on optical fiber voice time domain signal waveform alignment comprises the following steps:
firstly, processing an original voice time domain signal: reading the original voice signal by Matlab software to obtain the sequence information of the original voice signal, adding square wave sequence information at the proper position of the original voice sequence, and converting and storing the synthesized voice sequence as an audio file for inputting into a laser microphone.
The second step, building a laser microphone of the fiber ring cavity, which comprises a fiber laser source, an erbium-doped fiber, an FFP filter, a 2 × 2 coupler outside the fiber ring cavity connected with the FFP filter, a fiber coil and a data acquisition part, wherein the fiber laser source generates continuous laser, optical signals which can only be transmitted in one direction are formed after passing through an isolator, then the optical signals are amplified by the erbium-doped fiber and then transmitted to the fiber coil, the fiber coil is formed by winding fibers with the length of more than 1 kilometer and serves as a voice signal induction device, the optical signals are transmitted back to the fiber laser source through the coupler and the FFP filter, the coupler is used for converting the optical signals and the electric signals, the signals are converted through the coupler, and output voice signals are obtained through the data acquisition part;
thirdly, arranging the position of the optical fiber coil;
fourthly, inputting the voice signal after time domain processing into a laser microphone through an optical fiber coil, observing the condition of the output signal by using an oscilloscope, and obtaining a stable output voice signal by adjusting the output power of an optical fiber laser source;
fifthly, aligning the time domains of the input voice signal and the output voice signal of the laser microphone according to the inserted square wave signal, denoising the voice signal passing through the cavity laser microphone, namely the noise-containing voice signal, and evaluating the voice quality.
CN202010210101.7A 2020-03-23 2020-03-23 Noise evaluation method based on optical fiber voice time domain signal waveform alignment Pending CN111489759A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010210101.7A CN111489759A (en) 2020-03-23 2020-03-23 Noise evaluation method based on optical fiber voice time domain signal waveform alignment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010210101.7A CN111489759A (en) 2020-03-23 2020-03-23 Noise evaluation method based on optical fiber voice time domain signal waveform alignment

Publications (1)

Publication Number Publication Date
CN111489759A true CN111489759A (en) 2020-08-04

Family

ID=71810808

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010210101.7A Pending CN111489759A (en) 2020-03-23 2020-03-23 Noise evaluation method based on optical fiber voice time domain signal waveform alignment

Country Status (1)

Country Link
CN (1) CN111489759A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113409820A (en) * 2021-06-09 2021-09-17 合肥群音信息服务有限公司 Quality evaluation method based on voice data

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002097791A1 (en) * 2001-05-25 2002-12-05 Dolby Laboratories Licensing Corporation Method for time aligning audio signals using characterizations based on auditory events
CN103474083A (en) * 2013-09-18 2013-12-25 中国人民解放军电子工程学院 Voice time warping method based on orthogonal sinusoidal impulse sequence locating label
US20140029762A1 (en) * 2012-07-25 2014-01-30 Nokia Corporation Head-Mounted Sound Capture Device
US20150279351A1 (en) * 2012-12-19 2015-10-01 Google Inc. Keyword detection based on acoustic alignment
CN107389097A (en) * 2017-07-25 2017-11-24 北京航空航天大学 Optical fibre gyro Sagnac fiber optic loop eigenfrequency tracking measurement methods
CN110289014A (en) * 2019-05-21 2019-09-27 华为技术有限公司 A kind of speech quality detection method and electronic equipment
US20200162821A1 (en) * 2016-12-09 2020-05-21 The Research Foundation For The State University Of New York Fiber microphone
US20210027769A1 (en) * 2018-05-28 2021-01-28 Huawei Technologies Co., Ltd. Voice alignment method and apparatus

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002097791A1 (en) * 2001-05-25 2002-12-05 Dolby Laboratories Licensing Corporation Method for time aligning audio signals using characterizations based on auditory events
US20140029762A1 (en) * 2012-07-25 2014-01-30 Nokia Corporation Head-Mounted Sound Capture Device
US20150279351A1 (en) * 2012-12-19 2015-10-01 Google Inc. Keyword detection based on acoustic alignment
CN103474083A (en) * 2013-09-18 2013-12-25 中国人民解放军电子工程学院 Voice time warping method based on orthogonal sinusoidal impulse sequence locating label
US20200162821A1 (en) * 2016-12-09 2020-05-21 The Research Foundation For The State University Of New York Fiber microphone
CN107389097A (en) * 2017-07-25 2017-11-24 北京航空航天大学 Optical fibre gyro Sagnac fiber optic loop eigenfrequency tracking measurement methods
US20210027769A1 (en) * 2018-05-28 2021-01-28 Huawei Technologies Co., Ltd. Voice alignment method and apparatus
CN110289014A (en) * 2019-05-21 2019-09-27 华为技术有限公司 A kind of speech quality detection method and electronic equipment

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
O.KILIC ET AL.: "《Fiber-optical acoustic sensor based on a photonic-crystal diaphragm》", 《TRANSDUCERS 2009 - 2009 INTERNATIONAL SOLID-STATE SENSORS, ACTUATORS AND MICROSYSTEMS CONFERENCE》 *
江毅、唐才杰: "《光纤Fabry-Perot干涉仪原理及应用》", vol. 2, 国防工业出版社, pages: 143 - 102 *
高椿明 等: "《光纤声传感器综述》", 《光电工程》, vol. 45, no. 9, pages 116 - 125 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113409820A (en) * 2021-06-09 2021-09-17 合肥群音信息服务有限公司 Quality evaluation method based on voice data
CN113409820B (en) * 2021-06-09 2022-03-15 合肥群音信息服务有限公司 Quality evaluation method based on voice data

Similar Documents

Publication Publication Date Title
US10891967B2 (en) Method and apparatus for enhancing speech
CN102016984B (en) System and method for dynamic sound delivery
JP3636460B2 (en) Method and system for detecting and generating transients in acoustic signals
JP3964456B2 (en) Method and apparatus for objective voice quality measurement of telecommunications equipment
US20130246059A1 (en) System and method for producing an audio signal
KR20070000995A (en) Frequency extension of harmonic signals
WO2001033550A1 (en) Speech parameter compression
CN108597505A (en) Audio recognition method, device and terminal device
Prego et al. A blind algorithm for reverberation-time estimation using subband decomposition of speech signals
CN101233561B (en) Enhancement of speech intelligibility in a mobile communication device by controlling the operation of a vibrator of a vibrator in dependance of the background noise
CN109243429A (en) A kind of pronunciation modeling method and device
CN111489759A (en) Noise evaluation method based on optical fiber voice time domain signal waveform alignment
Gaudron et al. LPG-based optical fibre sensor for acoustic wave detection
JP3205560B2 (en) Method and apparatus for determining tonality of an audio signal
US7013266B1 (en) Method for determining speech quality by comparison of signal properties
CN111128219B (en) Laser Doppler sound taking method and device
Barnwell III Objective measures for speech quality testing
CN111261192A (en) Audio detection method based on LSTM network, electronic equipment and storage medium
KR20090080777A (en) Method and Apparatus for detecting signal
CN112233693B (en) Sound quality evaluation method, device and equipment
JP2002507776A (en) Signal processing method for analyzing transients in audio signals
Baumgarte A physiological ear model for auditory masking applicable to perceptual coding
Voishvillo Measurements and Perception of Nonlinear Distortion—Comparing Numbers and Sound Quality
JP2006119647A (en) System for spuriously converting whispery voice to ordinary voiced sound
Gully et al. The Lombard effect in MRI noise

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Lv Chengang

Inventor after: Xiao Yanping

Inventor after: Ma Jingjing

Inventor after: Huo Ziqiang

Inventor before: Lv Chengang

Inventor before: Ma Jingjing

Inventor before: Huo Ziqiang