KR970050779A

KR970050779A - Method for Evaluating the Signal-to-Noise Ratio of a Speech Signal and Method for Determining Whether a Speech Frame Is Active

Info

Publication number: KR970050779A
Application number: KR1019950050670A
Authority: KR
Inventors: 오기은; 김흥국; 김무영
Original assignee: 김광호; 삼성전자 주식회사
Priority date: 1995-12-15
Filing date: 1995-12-15
Publication date: 1997-07-29

Abstract

Disclosed are a new evaluation method that divides a speech signal into silent sections and active (non-silent) sections and calculates the signal-to-noise ratio only for the active sections, and a method suited to it for determining whether a speech frame is active.

A method of dividing a speech signal into active sections and silent sections and evaluating the signal-to-noise ratio only for the active sections, comprising: a blocking process of dividing the input original speech signal or processed speech signal into frames 15-20 ms long; an active-section detection process of taking each frame produced by the blocking process and determining whether it is a silent section; a frame-based signal-to-noise ratio calculation process of calculating a frame-based signal-to-noise ratio for each frame that the active-section detection process determined not to be silent; and an overall signal-to-noise ratio calculation process of combining the signal-to-noise ratios of the active sections calculated in the frame-based calculation process to obtain the signal-to-noise ratio of the entire signal.

By dividing the speech signal into silent and active sections and calculating the signal-to-noise ratio only for the active sections, the evaluation method according to the present invention accurately reflects the time-varying characteristics of the speech signal.

Description

Method for Evaluating the Signal-to-Noise Ratio of a Speech Signal and Method for Determining Whether a Speech Frame Is Active

Because this publication discloses only the essential parts of the application, the full text is not included.

FIG. 1 is a flowchart showing the method of evaluating the signal-to-noise ratio according to the present invention.

FIG. 2 is a flowchart showing the active-section detection method shown in FIG. 1.

Claims

A method of dividing a speech signal into active sections and silent sections and evaluating the signal-to-noise ratio only for the active sections, comprising: a blocking process of dividing the input original speech signal or processed speech signal into frames 15-20 ms long; an active-section detection process of taking each frame produced by the blocking process and determining whether it is a silent section; a frame-based signal-to-noise ratio calculation process of calculating a frame-based signal-to-noise ratio for each frame that the active-section detection process determined not to be silent; and an overall signal-to-noise ratio calculation process of combining the signal-to-noise ratios of the active sections calculated in the frame-based calculation process to obtain the signal-to-noise ratio of the entire signal.

The method of claim 1, wherein the frame-based signal-to-noise ratio calculation process calculates SNR_vad(m') by the following equation,

The overall signal-to-noise ratio calculation process calculates SNR_vad-seg by the following equation.

Here, M' is the total number of active-section frames.
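The two equations referred to in this claim are not reproduced in this record. As a rough illustration only, the sketch below assumes the conventional segmental form: a per-frame SNR in dB computed as reference energy over error energy, averaged over the M' frames judged active. The function names, the `is_active` callback, and the small `eps` guard are hypothetical and do not come from the patent.

```python
import math

def frame_snr_db(ref, deg):
    """Per-frame SNR in dB (assumed form: reference energy / error energy)."""
    signal = sum(x * x for x in ref)
    noise = sum((x - y) ** 2 for x, y in zip(ref, deg))
    eps = 1e-12  # assumption: guards against log(0) and division by zero
    return 10.0 * math.log10((signal + eps) / (noise + eps))

def segmental_snr_active(ref, deg, frame_len, is_active):
    """Average the per-frame SNR over the frames that is_active() accepts."""
    snrs = []
    # Blocking: split both signals into non-overlapping frames of frame_len samples.
    for start in range(0, len(ref) - frame_len + 1, frame_len):
        r = ref[start:start + frame_len]
        d = deg[start:start + frame_len]
        if is_active(r):  # silent frames are skipped entirely
            snrs.append(frame_snr_db(r, d))
    if not snrs:
        return None  # no active frame, so no SNR can be reported
    return sum(snrs) / len(snrs)  # divide by M', the number of active frames
```

At an 8 kHz sampling rate (an assumption, the claim states only 15-20 ms), `frame_len=160` would correspond to 20 ms frames. Any frame classifier can be passed as `is_active`.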

A method of determining whether speech blocked into frames is active, comprising: an average energy calculation process of calculating the average energy of a frame; an inter-frame energy change calculation process of calculating the change in average energy between the previous frame and the current frame from the average energy calculated in the average energy calculation process; a periodicity detection process of checking whether the current frame is periodic; a threshold adjustment process of adjusting a threshold according to the change in average energy; an active-section determination process of determining whether the frame is an active section using the threshold from the threshold adjustment process; and a hangover process applied to the result of the active-section determination process so that low-level speech is not judged to be silent.

The method of claim 3, wherein the average energy calculation process calculates the average energy E_current by the following equation,

The inter-frame energy change calculation process calculates the change E_var in average energy between the previous frame and the current frame by the following equation,

The periodicity detection process determines the periodicity ptch by the following procedure: 1) obtain the value of j that maximizes the correlation P(j) for the current frame,

2) determine the periodicity ptch by comparing j with j_prev, the value of j for the previous frame, under the following condition.

if {j_prev × 0.75 ≤ j ≤ j_prev × 1.25} or
   {0.5 j_prev × 0.75 ≤ j ≤ 0.5 j_prev × 1.25} or
   {2.0 j_prev × 0.75 ≤ j ≤ 2.0 j_prev × 1.25}
then ptch = true
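The condition above accepts the current frame as periodic when j lies within 25% of the previous frame's value, of half of it, or of twice it. A minimal sketch follows. The definition of P(j) is not reproduced in this record, so `best_lag` assumes a plain autocorrelation, and both function names and the lag search range are hypothetical.

```python
def best_lag(frame, min_lag, max_lag):
    """Return the j in [min_lag, max_lag] with the largest P(j).

    Assumption: P(j) is a plain autocorrelation; the patent's own
    definition of P(j) is not available in this record.
    """
    best_j, best_p = min_lag, float("-inf")
    for j in range(min_lag, max_lag + 1):
        p = sum(frame[n] * frame[n - j] for n in range(j, len(frame)))
        if p > best_p:  # strict comparison keeps the smallest j on ties
            best_j, best_p = j, p
    return best_j

def is_periodic(j, j_prev):
    """ptch: true if j is within 0.75x to 1.25x of j_prev, j_prev/2, or 2*j_prev."""
    for scale in (1.0, 0.5, 2.0):
        center = scale * j_prev
        if 0.75 * center <= j <= 1.25 * center:
            return True
    return False
```

Checking the halved and doubled values tolerates the octave errors that correlation-based estimators commonly make from one frame to the next.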

The threshold adjustment process adjusts the threshold according to the inter-frame change in average energy as follows,

if (ptch = true) then
    if |E_var| ≤ 10.0 thvad = thvad;
    else if |E_var| ≤ 20.0 thvad = thvad * (1 + 0.005 E_var);
    else if |E_var| ≤ 30.0 thvad = thvad * (1 + 0.003 E_var);
end

Here, thvad is a predetermined threshold for determining active sections.
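The threshold rule above can be written out as a small function. This is only a sketch of the stated cases: the function name is hypothetical, and leaving thvad unchanged when ptch is false or when |E_var| exceeds 30.0 is an assumption, since the claim gives no rule for those cases.

```python
def adapt_threshold(thvad, e_var, ptch):
    """Scale thvad by the inter-frame energy change, for periodic frames only."""
    if not ptch:
        return thvad  # assumption: non-periodic frames leave thvad unchanged
    magnitude = abs(e_var)
    if magnitude <= 10.0:
        return thvad
    if magnitude <= 20.0:
        return thvad * (1.0 + 0.005 * e_var)
    if magnitude <= 30.0:
        return thvad * (1.0 + 0.003 * e_var)
    return thvad  # assumption: no rule is given above 30.0

# Example: a periodic frame whose energy rose by 15 raises the threshold by 7.5%.
new_threshold = adapt_threshold(100.0, 15.0, True)
```

Note that the scaling uses the signed E_var, so a falling energy lowers the threshold while a rising energy raises it.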

The active-section determination process determines whether the frame is an active section as follows,

if (E_current > thvad) then tvad = true
else tvad = false

In the hangover process, once a predetermined number of sections have been determined to be active sections, a predetermined number of subsequent sections are also determined to be active sections.
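The decision and hangover steps can be sketched together as below. The claim does not state the two "predetermined numbers", so `trigger` and `hang` and their default values are hypothetical, as is the function name. For brevity the sketch also holds thvad fixed, whereas the claim adjusts it from frame to frame.

```python
def vad_with_hangover(energies, thvad, trigger=3, hang=2):
    """Return one active/silent decision per frame, with a hangover.

    trigger: consecutive active frames needed to arm the hangover (assumed value)
    hang:    silent frames forced to active afterwards (assumed value)
    """
    decisions = []
    run = 0        # consecutive frames whose energy exceeded thvad
    remaining = 0  # hangover frames still to be forced active
    for e_current in energies:
        tvad = e_current > thvad
        if tvad:
            run += 1
            if run >= trigger:
                remaining = hang  # arm (or re-arm) the hangover
        else:
            run = 0
            if remaining > 0:
                tvad = True  # keep low-level speech tails marked active
                remaining -= 1
        decisions.append(tvad)
    return decisions
```

With these assumed settings, a burst of three loud frames keeps the next two quiet frames marked active, while an isolated loud frame does not extend the decision.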

※ Note: The disclosure is based on the initial application.