EP1244094A1 - Procédé et dispositif de détermination de la qualité d'un signal audio - Google Patents

Procédé et dispositif de détermination de la qualité d'un signal audio Download PDF

Info

Publication number
EP1244094A1
EP1244094A1 EP01810285A EP01810285A EP1244094A1 EP 1244094 A1 EP1244094 A1 EP 1244094A1 EP 01810285 A EP01810285 A EP 01810285A EP 01810285 A EP01810285 A EP 01810285A EP 1244094 A1 EP1244094 A1 EP 1244094A1
Authority
EP
European Patent Office
Prior art keywords
signal
audio signal
quality
determining
interruptions
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP01810285A
Other languages
German (de)
English (en)
Inventor
Pero Juric
Thomet Bendicht
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rohde and Schwarz SwissQual AG
Original Assignee
SwissQual AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=8183803&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=EP1244094(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by SwissQual AG filed Critical SwissQual AG
Priority to EP01810285A priority Critical patent/EP1244094A1/fr
Priority to EP02703438.8A priority patent/EP1386307B2/fr
Priority to AT02703438T priority patent/ATE289109T1/de
Priority to US10/101,533 priority patent/US6804651B2/en
Priority to DE50202226T priority patent/DE50202226D1/de
Priority to PCT/CH2002/000164 priority patent/WO2002075725A1/fr
Publication of EP1244094A1 publication Critical patent/EP1244094A1/fr
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/69Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals

Definitions

  • the invention relates to a method for determining a quality measure of an audio signal.
  • the invention further relates to a device for performing this method and a noise reduction module and an interrupt detection and interpolation module for use in such a device.
  • Assessing the quality of a telecommunications network is an important tool to achieve or maintain a desired quality of service.
  • One way that Assessing the quality of service of a telecommunications network is quality to determine a signal transmitted over the telecommunications network.
  • audio signals Various intrusive methods are known for this, particularly in the case of voice signals.
  • the system under test is used intervened by occupying a transmission channel and transmitting a reference signal therein becomes.
  • the quality assessment is then carried out by comparing the known one Reference signal with the received signal, for example, subjectively by or a large number of test subjects.
  • this is complex and therefore expensive.
  • EP 0 980 064 describes a further intrusive method for machine-assisted quality assessment described an audio signal, being used to assess the transmission quality a spectral similarity value of the known source signal and the received signal is determined. This similarity value is based on a calculation of the covariance the spectra of the source signal and the received signal and a division of the covariance by the standard deviations of the two spectra mentioned.
  • intrusive methods generally have the disadvantage that, as already mentioned, this too testing system must be intervened. To determine the signal quality namely at least one transmission channel is occupied and a reference signal is transmitted therein become. This transmission channel cannot be used for data transmission during this time be used. It is also in a broadcasting system such as a broadcasting service in principle possible, the signal source for transmission of test signals, since this occupies all channels and the test signal this would be extremely impractical for all recipients. Intrusive Methods are also unsuitable to simultaneously control the quality of a variety of transmission channels to monitor.
  • the object of the invention is to provide a method of the type mentioned above, which avoids the disadvantages of the prior art and in particular offers a possibility to assess the signal quality of a transmitted over a telecommunications network Signal without knowing the originally sent signal.
  • a reference signal is first determined from the audio signal. through Comparing the determined reference signal with the audio signal becomes a quality value determined, which is used to determine the quality measure.
  • the method according to the invention thus allows an assessment of the quality of an audio signal at any connection of the telecommunication network. That is, it allows thus the quality assessment of many transmission channels at the same time, even a simultaneous assessment of all channels would be possible.
  • the quality assessment takes place solely on the basis of the properties of the received signal, d. H. without knowing the source signal or the signal source.
  • the invention thus not only enables monitoring of the transmission quality of the Telecommunications network, but also, for example, quality-based cost allocation, quality-based routing in the network, a test of the coverage ratio for example in the case of mobile radio networks, a QOS (Quality of Service) control of the network nodes or a quality comparison within a network or across networks.
  • QOS Quality of Service
  • Signal information typically also includes unwanted components such as different noise components, which are not in the original source signal were present.
  • the reference signal is determined by the in received signal estimated interference signal and then from the received signal can be removed.
  • a noisy audio signal is determined, which is preferred as Reference signal is used to assess the transmission quality.
  • the audio signal could, for example, be passed through appropriate filters.
  • a preferred method is to remove the noise components from the audio signal however, a neural network is used for this.
  • the audio signal is not used directly as an input signal.
  • DWT discrete wavelet transform
  • This transformation provides a plurality of DWT coefficients of the audio signal, which the neural Network as an input signal.
  • the neural network delivers on Output a plurality of corrected DWT coefficients, from which with the inverse DWT the reference signal is obtained. This corresponds to the noiseless version of the Audio signal.
  • the coefficients of the neural network must be set in this way be that to the DWT coefficients of a noisy input signal provides the DWT coefficients of the corresponding noiseless input signal.
  • the neural network In order to the neural network delivers the desired coefficients, it must first with a Set of corresponding noisy or noisy signal pairs trained become.
  • any other information will also be taken into account. This can be both information contained in the audio signal, as well as information about the transmission channel or the telecommunications network itself.
  • the quality of the received audio signal is determined by the quality of the Transmission influenced codec's (coder - decoder) influenced. It is difficult to do such Determine signal degradations, for example if the codec bit rates are too low part of the original signal information is lost. However, have too small Codec bit rates result in a change in the fundamental frequency (pitch) of the audio signal, which is why the course and dynamics of the fundamental frequency in the audio signal are advantageously examined becomes. Because such changes are easiest based on audio signal sections with vowels examined, signal components in the audio signal are preferred detected with vowels and then examined for pitch variations.
  • the received audio signal can namely not only have unwanted signal components, it can also partially on the way desired information has been lost. So can the received audio signal for example, have more or less long signal interruptions.
  • the received audio signal can include various types of audio signals. So it can contain, for example, speech, music, noise or even quiet signal components.
  • the quality assessment can of course be based on all or part of this Signal components take place. In a preferred variant of the invention, the assessment the signal quality, however, is limited to the speech signal components. With an audio discriminator the speech signal components are therefore first extracted from the audio signal and only these speech signal components for determining the quality measure, d. H. to Determination of the reference signal used. To determine the quality value is in In this case, of course, the determined reference signal does not match the received audio signal, but only compared with the extracted speech signal component.
  • the device according to the invention for machine-assisted determination of a quality measure an audio signal comprises first means for determining a reference signal the audio signal, second means for determining a quality value by means of comparisons the determined reference signal with the audio signal and third means for determination the quality measure taking into account the quality value.
  • the first means for determining a reference signal from the audio signal can be several Include modules. So is preferably a noise reduction module and / or a Interrupt detection and interpolation module provided.
  • noise signal components can be received Suppress audio signal. It includes the means to carry out those already described Wavelet transformations and the neural network to determine the new one DWT coefficients.
  • the interrupt detection and interpolation module has those Means on the one hand for the detection of signal interruptions in the audio signal and on the other hand for the polynomial interpolation of short and for model-based interpolation of medium-long signal interruptions are required. That determined so The reference signal thus corresponds to a noisy version of the received audio signal and typically only shows major signal interruptions.
  • the information about the signal interruptions of the audio signal is not only used to determine a better reference signal, they can also be used for determination of a better quality measure can be used.
  • the third means of determination of the quality measure are therefore preferably designed such that information can be taken into account via signal interruptions in the audio signal.
  • the device therefore advantageously has fourth means for determining information about codec-related Signal distortion.
  • codec-related Signal distortion include, for example, a vowel detection module, with which signal components with vowels can be detected in the audio signal. These vowel signal components are passed on to an assessment module, which is based on this Signal components Information about codec-related signal distortions determines which can also be used to assess the signal quality.
  • the third means are appropriate formed such that this information about the codec-related signal distortion can be taken into account when determining the quality measure.
  • the device therefore has, in particular, fifth means for extracting the method Speech signal components from the audio signal. Accordingly, to determine the Reference signal not the audio signal itself, but only its voice signal component noisy and examined for interruptions. Likewise, of course, the audio signal, but only compared its voice signal component with this reference signal. In order to the quality measure is only determined on the basis of the information in the Speech signal component, whereby the information from the remaining signal components is not taken into account become.
  • FIG. 1 shows a block diagram of the method according to the invention.
  • a Audio signal 1 determines a quality measure 2, which is also used for evaluation, for example of the telecommunications network used (not shown).
  • the audio signal 1 is understood here to mean that signal which is a receiver after transmission over the telecommunications network.
  • This audio signal 1 typically does not match the one sent by the (not shown) transmitter Signal coincides, because on the way from the transmitter to the receiver the transmission signal opens varied ways. For example, it goes through various modules such as voice encoders and decoders, multiplexers and demultiplexers or even voice enhancers and echo cancellers.
  • the transmission channel itself can also be a big one Influence the signal, which can take the form of interference, fading, Express transmission interruptions or interruptions, echo generation etc.
  • the audio signal 1 thus not only contains the desired signal components, i. H. the original Transmitted signal, but also unwanted interference signal components. It may also be that Signal components of the transmission signal are missing, i. H. lost during transmission are.
  • the assessment of the signal quality is not based on of the entire audio signal 1, but only based on the contained therein Speech portion.
  • the audio signal 1 is initially based on an audio discriminator 3 Speech signal components 4 examined. Found speech signal components 4 become further Processing passed, whereas other signal components such as music 5.1, breaks 5.2 or strong signal interference 5.3 sorted out and processed further or can be discarded.
  • the audio signal 1 piecewise, d. H. to pieces a each about 100 ms to 500 ms, passed to the audio discriminator 3. This breaks these pieces down further individual buffers of about 20 ms in length, processes these buffers and then arranges them in each case one of the signal groups to be distinguished: voice signal, music, pause or strong interference to.
  • the audio discriminator 3 is used, for example, to assess the signal pieces an LPC (linear predictive coding) transformation, with which the coefficients of a adaptive filters corresponding to the human speech tract are calculated.
  • LPC linear predictive coding
  • the assignment of the signal pieces to the different signal groups is based on the Form of the transmission characteristics of this filter.
  • this speech signal component 4 now a reference signal 6, i. H. a best possible estimate of the original from the transmitter transmitted transmission signal, determined.
  • This reference signal estimation is carried out in several stages.
  • a noise suppression module 7 undesirable components are initially created Signal components such as stationary noise or impulse interference from the voice signal component 4 removed or suppressed. This is done with the help of a neural network, which was previously used as an input and a plurality of noisy signals each trained the corresponding noise-free version of the input signal as the target signal has been.
  • the noise-free speech signal 11 obtained in this way is sent to the second stage forwarded.
  • the interrupt detection and interpolation module 8 there are interruptions detected in the audio signal 1 or in its speech signal component 4 and if possible interpolated, d. H. the missing samples are replaced by suitably estimated values.
  • signal interruptions are detected by means of an examination discontinuities in the fundamental signal frequency (pitch tracing).
  • the interpolation is carried out depending on the length of the detected break.
  • d With short Interruptions, d. H. Interruptions of a few samples in length become a polynomial Interpolation such as a Lagrangian, Newton, Hermite, or Cubic Spline interpolation applied.
  • model-based interpolations such as a maximum a posteriori, an autoregressive or a frequency-time interpolation is used. With longer ones Signal interruptions is an interpolation or other signal reconstruction in generally no longer possible in a meaningful way.
  • an algorithm can be used for this comparison can be used as it is for example in intrusive methods for comparison of the known source signal is used with the received signal. Suitable are, for example, psychoacoustic models, the signals are perceptual, i. H. perceptible to compare.
  • the result of this comparison is an intrusive quality value of 10.
  • this intrusive quality value 10 becomes the input signals, ie the Speech signal component 4 and the reference signal 6, in signal pieces of about 20 to 30 ms Length broken down and a partial quality value calculated for each signal piece. After about 20 to 30 signal pieces, which corresponds to a signal duration of 0.5 seconds, becomes intrusive Quality value 10 determined as the arithmetic mean of these partial quality values.
  • the intrusive quality value 10 forms the output signal of the comparison module 9.
  • a speech encoder or speech decoder which the transmitted signal passes on its way from the transmitter to the receiver has an influence on the audio signal 1.
  • These influences exist, for example in that both the fundamental frequency and the frequencies of the higher harmonics of the signal vary. The lower the bit rate of the speech codecs used, the greater the frequency shifts and thus the signal distortions.
  • noisy speech signal 11 is first supplied to a vocal detector 12.
  • Vowel signals 13, i.e. H. Signal components which recognizes the neural network as vowels are forwarded to an evaluation module 14, other signal components are rejected.
  • the evaluation module 14 divides the vowel signal 13 into pieces of approximately 30 msec calculates a DFT (discrete Fourier transformation) with a frequency resolution of approximately 2 Hz at a sampling frequency of approximately 8 kHz. Leave with it then determine the fundamental frequency as well as the frequencies of the higher harmonics and examine for variations. Another feature to evaluate the codec-related Distortion forms the dynamics of the signal spectrum, with a smaller dynamic poor signal quality means.
  • the reference values for the dynamic evaluation are obtained from the sample signals for the individual vowels. From the information on the influence of codecs on frequency shifts and spectrum dynamics the audio signal 1 or the noisy speech signal 11 becomes a codec quality value 15 derived.
  • This value includes information about the length and the number of interruptions detected by the interruption detection and interpolation module 8, in a preferred embodiment of the invention only the information about the long breaks.
  • further quality information 18 about the received audio signal 1 or the noisy speech signal 11, which is determined with other modules or examinations are included in the calculations of quality measure 2.
  • the individual quality values are now scaled so that they are between 0 and 1 lie, with a quality value of 1 an undiminished quality and Values below 1 indicate a correspondingly reduced quality.
  • the quality measure 2 is finally calculated as a linear combination of the individual quality values, whereby the individual weighting coefficients are determined experimentally and determined in such a way that their sum is 1.
  • figure 2 shows the noise reduction module 7.
  • the speech signal component 4 of the audio signal 1 is first subjected to a known DWT 19 (discrete wavelet transformation). Similar to DFTs, DWTs are used for signal analysis. An essential one The difference is, however, in contrast to those used in a DFT, unlimited in time and thus sine or cosine waveforms not temporally localized, the use of so-called wavelets, i.e. H. limited and thus localized Average 0 waveforms.
  • the speech signal component 4 is divided into signal pieces of approximately 20 ms to 30 ms, which are each subject to DWT 19.
  • the result of DWT 19 is a set of DWT coefficients 20.1, which are fed as input vector to a neural network 20 become. Its coefficients have previously been trained to match a given one Set of DWT coefficients 20.1 of a noisy signal a new set of DWT coefficients 20.2 deliver the noiseless version of this signal.
  • This new set of DWT coefficient 20.2 is now the IDWT 21, i. H. subject to the DWT inverse to DWT 19. In this way, this IDWT 21 delivers a largely noiseless version of the Speech signal components 4, the desired, noiseless speech signal 11.
  • the training configuration of the neural network 20 is shown in FIG. 3. It will be with Trained pairs of noisy and noiseless versions of sample signals.
  • noisy example signal 22.1 is subjected to DWT 19 and it becomes a first one Obtained set 20.3 of DWT coefficients.
  • the noisy example signal 22.2 is also subjected to the same DWT 19 and a second set 20.4 of DWT coefficients generated, which is fed into the neural network 20.
  • the output vector of the neural Network 20, the new DWT coefficients 20.5, is in a comparator 23 with the first Theorem 20.3 of DWT coefficients compared. Because of the differences between These two sets of DWT coefficients are corrected 24 for the coefficients of the neural network 20.
  • This process is done with a variety of sample signal pairs repeated so that the coefficients of the neural network 20 perform the desired function perform more and more precisely.
  • the neural Network 20 uses sample signals 22.1, 22.2, which human sounds from different Represent languages. It is also an advantage to do this for both women and women Use men's and children's voices.
  • the mentioned size of the to be processed individually Signal pieces of 20 ms to 30 ms duration is selected so that the processing of the Speech signal portion 4 are carried out regardless of the language and the speaker can. Even pauses in speech and very quiet signal sections are trained with this these are also recognized correctly.
  • a multilayer perceptron was used as the neural network 20 with an input layer 25, a hidden layer 26 and one Output layer 27 used.
  • the perceptron was trained with a back propagation algorithm.
  • the input layer 25 has a plurality of input neurons 25.1, the hidden layer 26 a plurality of hidden neurons 26.1 and the Output layer 27 has a plurality of output neurons 27.1. Any input neuron 25.1 becomes one of the DWT coefficients 20.1 of the previous DWT 19 fed.
  • the respective values are determined with the set coefficients of the respective neurons and the value combinations are calculated in the individual neurons each output neuron 27.1 one of the new DWT coefficients 20.2.
  • the audio discriminator 3 divides the signal pieces into individual buffers Length 20 ms. At a sampling rate of 8 kHz, this corresponds to 160 samples.
  • a neural network 20 with 160 input and output neurons each can be used 25.1, 27.1 and about 50 to 60 hidden neurons 26.1 can be used.
  • time-frequency interpolation is used for signal reconstruction.
  • a short-term spectrum for signal frames with 64 samples Length (8 ms) calculated. This is done by placing the signal frames with Hamming windows be multiplied at an overlap of 50%.
  • the goal of interpolation is to treat this gap.
  • FIG. 5 shows such a signal 28 of approximately 200 samples in length.
  • About periodicity 5 shows the signal 28 in the temporal domain.
  • On the abscissa axis 32 is the number of samples and the ordinate axis 33 the magnitudes applied.
  • the interpolation takes place in the frequency-time domain.
  • the interruption 29 is easy to recognize as a gap of just under 10 samples in length.
  • the pitch period 30 of the signal 28 is first of all determined.
  • the interpolation will be information from the samples before and after the gap within this pitch period 30 is taken into account.
  • the signal areas 31.1, 31.2 show those Areas of the signal 28 a pitch period before or after the interruption 29.
  • This Signal areas 31.1, 31.2 are not identical to the original signal piece at break 29, but still show a high degree of similarity. For little ones Gaps up to about 10 samples are assumed to have enough signal information is present in order to be able to carry out correct interpolation. With longer gaps additional information from samples of the environment can be used.
  • the invention allows the signal quality of a judge received audio signal without knowing the original broadcast signal.
  • the signal quality can also affect the quality of the transmission channels used and thus concluded on the quality of service of the entire telecommunications network become.
  • the fast response times of the method according to the invention which are in the order of magnitude of approximately 100 ms to 500 ms, thus making different possible Applications such as general comparisons of the service quality of different ones Networks or subnetworks, a quality-based cost allocation or a quality-based one Routing in a network or across multiple networks using appropriate Control of the network nodes (gateways, routers etc.).
EP01810285A 2001-03-20 2001-03-20 Procédé et dispositif de détermination de la qualité d'un signal audio Withdrawn EP1244094A1 (fr)

Priority Applications (6)

Application Number Priority Date Filing Date Title
EP01810285A EP1244094A1 (fr) 2001-03-20 2001-03-20 Procédé et dispositif de détermination de la qualité d'un signal audio
EP02703438.8A EP1386307B2 (fr) 2001-03-20 2002-03-19 Procede et dispositif pour determiner un niveau de qualite d'un signal audio
AT02703438T ATE289109T1 (de) 2001-03-20 2002-03-19 Verfahren und vorrichtung zur bestimmung eines qualitätsmasses eines audiosignals
US10/101,533 US6804651B2 (en) 2001-03-20 2002-03-19 Method and device for determining a measure of quality of an audio signal
DE50202226T DE50202226D1 (de) 2001-03-20 2002-03-19 Verfahren und vorrichtung zur bestimmung eines qualitätsmasses eines audiosignals
PCT/CH2002/000164 WO2002075725A1 (fr) 2001-03-20 2002-03-19 Procede et dispositif pour determiner un niveau de qualite d'un signal audio

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP01810285A EP1244094A1 (fr) 2001-03-20 2001-03-20 Procédé et dispositif de détermination de la qualité d'un signal audio

Publications (1)

Publication Number Publication Date
EP1244094A1 true EP1244094A1 (fr) 2002-09-25

Family

ID=8183803

Family Applications (2)

Application Number Title Priority Date Filing Date
EP01810285A Withdrawn EP1244094A1 (fr) 2001-03-20 2001-03-20 Procédé et dispositif de détermination de la qualité d'un signal audio
EP02703438.8A Expired - Lifetime EP1386307B2 (fr) 2001-03-20 2002-03-19 Procede et dispositif pour determiner un niveau de qualite d'un signal audio

Family Applications After (1)

Application Number Title Priority Date Filing Date
EP02703438.8A Expired - Lifetime EP1386307B2 (fr) 2001-03-20 2002-03-19 Procede et dispositif pour determiner un niveau de qualite d'un signal audio

Country Status (5)

Country Link
US (1) US6804651B2 (fr)
EP (2) EP1244094A1 (fr)
AT (1) ATE289109T1 (fr)
DE (1) DE50202226D1 (fr)
WO (1) WO2002075725A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102004029421A1 (de) * 2004-06-18 2006-01-05 Rohde & Schwarz Gmbh & Co. Kg Verfahren und Vorrichtung zur Bewertung der Güte eines Signals

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7177430B2 (en) * 2001-10-31 2007-02-13 Portalplayer, Inc. Digital entroping for digital audio reproductions
US7746797B2 (en) * 2002-10-09 2010-06-29 Nortel Networks Limited Non-intrusive monitoring of quality levels for voice communications over a packet-based network
US20040167774A1 (en) * 2002-11-27 2004-08-26 University Of Florida Audio-based method, system, and apparatus for measurement of voice quality
GB2407952B (en) * 2003-11-07 2006-11-29 Psytechnics Ltd Quality assessment tool
US20050228655A1 (en) * 2004-04-05 2005-10-13 Lucent Technologies, Inc. Real-time objective voice analyzer
US7856355B2 (en) * 2005-07-05 2010-12-21 Alcatel-Lucent Usa Inc. Speech quality assessment method and system
WO2007098258A1 (fr) * 2006-02-24 2007-08-30 Neural Audio Corporation Système et procédé de conditionnement pour un codec audio
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
US20080244081A1 (en) * 2007-03-30 2008-10-02 Microsoft Corporation Automated testing of audio and multimedia over remote desktop protocol
JP5054205B2 (ja) * 2008-03-04 2012-10-24 カーディアック ペースメイカーズ, インコーポレイテッド 移植式の多重長rfアンテナ
JP4327886B1 (ja) * 2008-05-30 2009-09-09 株式会社東芝 音質補正装置、音質補正方法及び音質補正用プログラム
JP4327888B1 (ja) * 2008-05-30 2009-09-09 株式会社東芝 音声音楽判定装置、音声音楽判定方法及び音声音楽判定用プログラム
JP2013500498A (ja) * 2009-07-24 2013-01-07 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声品質の評価のための方法、コンピュータ、コンピュータプログラム、およびコンピュータプログラム製品
US20110178800A1 (en) 2010-01-19 2011-07-21 Lloyd Watts Distortion Measurement for Noise Suppression System
US9558755B1 (en) 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
US8239196B1 (en) * 2011-07-28 2012-08-07 Google Inc. System and method for multi-channel multi-feature speech/noise classification for noise suppression
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US9396738B2 (en) 2013-05-31 2016-07-19 Sonus Networks, Inc. Methods and apparatus for signal quality analysis
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
CN106797512B (zh) 2014-08-28 2019-10-25 美商楼氏电子有限公司 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质
CN106816158B (zh) * 2015-11-30 2020-08-07 华为技术有限公司 一种语音质量评估方法、装置及设备
US10490206B2 (en) * 2016-01-19 2019-11-26 Dolby Laboratories Licensing Corporation Testing device capture performance for multiple speakers
US10283140B1 (en) * 2018-01-12 2019-05-07 Alibaba Group Holding Limited Enhancing audio signals using sub-band deep neural networks
US10978091B2 (en) * 2018-03-19 2021-04-13 Academia Sinica System and methods for suppression by selecting wavelets for feature compression in distributed speech recognition

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0644526A1 (fr) * 1993-09-20 1995-03-22 ALCATEL ITALIA S.p.A. Procédé de réduction du bruit, notamment pour la reconnaissance automatique du langage et filtre pour complèter le procédé
US5583968A (en) * 1993-03-29 1996-12-10 Alcatel N.V. Noise reduction for speech recognition
WO2000072453A1 (fr) * 1999-05-25 2000-11-30 Algorex, Inc. Systeme universel de mesure de la qualite de signaux multimedia et d'autres signaux

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4897878A (en) * 1985-08-26 1990-01-30 Itt Corporation Noise compensation in speech recognition apparatus
DE3639753A1 (de) * 1986-11-21 1988-06-01 Inst Rundfunktechnik Gmbh Verfahren zum uebertragen digitalisierter tonsignale
US5446492A (en) * 1993-01-19 1995-08-29 Wolf; Stephen Perception-based video quality measurement system
US6122610A (en) * 1998-09-23 2000-09-19 Verance Corporation Noise suppression for low bitrate speech coder
US20020054685A1 (en) * 2000-11-09 2002-05-09 Carlos Avendano System for suppressing acoustic echoes and interferences in multi-channel audio systems
US6937978B2 (en) * 2001-10-30 2005-08-30 Chungwa Telecom Co., Ltd. Suppression system of background noise of speech signals and the method thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5583968A (en) * 1993-03-29 1996-12-10 Alcatel N.V. Noise reduction for speech recognition
EP0644526A1 (fr) * 1993-09-20 1995-03-22 ALCATEL ITALIA S.p.A. Procédé de réduction du bruit, notamment pour la reconnaissance automatique du langage et filtre pour complèter le procédé
WO2000072453A1 (fr) * 1999-05-25 2000-11-30 Algorex, Inc. Systeme universel de mesure de la qualite de signaux multimedia et d'autres signaux

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HAUENSTEIN M ET AL: "INSTRUMENTELLE SPRACHGUETEBEURTEILUNG", FUNKSCHAU,DE,FRANZIS-VERLAG K.G. MUNCHEN, vol. 71, no. 3, 23 January 1998 (1998-01-23), pages 61 - 64, XP000765678, ISSN: 0016-2841 *
LIANG J ET AL: "OUTPUT-BASED OBJECTIVE SPEECH QUALITY", PROCEEDINGS OF THE VEHICULAR TECHNOLOGY CONFERENCE,US,NEW YORK, IEEE, vol. CONF. 44, 8 June 1994 (1994-06-08), pages 1719 - 1723, XP000497716, ISBN: 0-7803-1928-1 *
SEOK JONG WON ET AL: "Speech enhancement with reduction of noise components in the wavelet domain", PROCEEDINGS OF THE 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, ICASSP. PART 2 (OF 5);MUNICH, GER APR 21-24 1997, vol. 2, 1997, ICASSP IEEE Int Conf Acoust Speech Signal Process Proc;ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Speech Processing 1997 IEEE, Piscataway, NJ, USA, pages 1323 - 1326, XP002170620 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102004029421A1 (de) * 2004-06-18 2006-01-05 Rohde & Schwarz Gmbh & Co. Kg Verfahren und Vorrichtung zur Bewertung der Güte eines Signals

Also Published As

Publication number Publication date
EP1386307B1 (fr) 2005-02-09
WO2002075725A1 (fr) 2002-09-26
EP1386307A1 (fr) 2004-02-04
ATE289109T1 (de) 2005-02-15
US20020191798A1 (en) 2002-12-19
DE50202226D1 (de) 2005-03-17
EP1386307B2 (fr) 2013-04-17
US6804651B2 (en) 2004-10-12

Similar Documents

Publication Publication Date Title
EP1386307B1 (fr) Procede et dispositif pour determiner un niveau de qualite d'un signal audio
EP1088300B1 (fr) Procede d'execution d'une evaluation automatisee de la qualite de transmission de signaux audio
DE60034026T2 (de) Sprachverbesserung mit durch sprachaktivität gesteuerte begrenzungen des gewinnfaktors
EP1953739B1 (fr) Procédé et dispositif pour réduire le bruit dans un signal décodé
DE2626793C3 (de) Elektrische Schaltungsanordnung zum Bestimmen des stimmhaften oder stimmlosen Zustandes eines Sprachsignals
EP0938831B1 (fr) Evaluation de la qualite, a adaptation auditive, de signaux audio
WO2007073949A1 (fr) Procede et dispositif pour elargir artificiellement la largeur de bande de signaux vocaux
DE69730721T2 (de) Verfahren und vorrichtungen zur geräuschkonditionierung von signalen welche audioinformationen darstellen in komprimierter und digitalisierter form
DE10017646A1 (de) Geräuschunterdrückung im Zeitbereich
DE60311619T2 (de) Datenreduktion in Audiokodierern unter Ausnutzung nichtharmonischer Effekte
DE60212617T2 (de) Vorrichtung zur sprachverbesserung
EP1634277B1 (fr) Extraction de sections de signaux d'essai pour la mesure de la qualite d'un signal audio
DE4343366C2 (de) Verfahren und Schaltungsanordnung zur Vergrößerung der Bandbreite von schmalbandigen Sprachsignalen
EP1382034B1 (fr) Procede de determination de valeurs caracteristiques d'intensite de bruits de fond dans des pauses de voix de signaux vocaux
DE60110541T2 (de) Verfahren zur Spracherkennung mit geräuschabhängiger Normalisierung der Varianz
EP0916206B1 (fr) Procede et dispositif pour evaluer la qualite d'un signal vocal transmis
DE3230391C2 (fr)
DE4445983C2 (de) Verfahren zur Rauschunterdrückung und Vorrichtungen zur Durchführung der Verfahren
DE102013005844B3 (de) Verfahren und Vorrichtung zum Messen der Qualität eines Sprachsignals
EP0787416B1 (fr) Procede de mesure de la fidelite de signaux audio stereophoniques et procede de reconnaissance de signaux audio stereophoniques codes ensemble
EP1366617B1 (fr) Procede et dispositif pour ameliorer la qualite de la voix sur des voies de transmission de telecommunications transparentes
DE19626329A1 (de) Verfahren zur Analyse von Schallereignissen
DE10136491B4 (de) Verfahren und Vorrichtung zur Verbesserung der Sprachqualität auf transparenten Telekommunikations-Übertragungswegen
DE102005019863A1 (de) Verfahren und Vorrichtung zur Geräuschunterdrückung
DE10016620A1 (de) Verfahren zur Herabsetzung von Geräuschkomponenten in Sprachsignalen

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

AKX Designation fees paid
REG Reference to a national code

Ref country code: DE

Ref legal event code: 8566

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20030325