EP0518742A1

EP0518742A1 - Method for detecting a noisy wanted signal

Info

Publication number: EP0518742A1
Application number: EP92401553A
Authority: EP
Inventors: Dominique Pastor
Original assignee: Thales Avionics SAS
Current assignee: Thales Avionics SAS
Priority date: 1991-06-14
Filing date: 1992-06-05
Publication date: 1992-12-16
Anticipated expiration: 2012-06-05
Also published as: FR2677828A1; US5337251A; DE69225090T2; EP0518742B1; WO1992022889A1; FR2677828B1; DE69225090D1; JPH06503185A

Abstract

In order to detect a noisy wanted signal, a measurement of the expected S/N ratio of this signal is taken over one time slice, a measurement of the estimated white noise alone is taken over another time slice without wanted signal, the mean energy of the noise and of the noisy signal are calculated, each in its time slice, the theoretical detection threshold and the ratio of these two energies are calculated, and the ratio is compared with the calculated threshold, this threshold being greater than 1 (ideal threshold).

Description

La présente invention se rapporte à un procédé de détection d'un signal utile bruité.The present invention relates to a method for detecting a noisy useful signal.

Un des problèmes importants en traitement du signal, simple quant à son énoncé, mais d'autant plus complexe quant à sa résolution, consiste à déterminer la présence ou l'absence d'un signal utile noyé dans un bruit additif.One of the important problems in signal processing, simple as for its statement, but all the more complex as for its resolution, consists in determining the presence or the absence of a useful signal drowned in an additive noise.

Diverses solutions sont envisageables. On peut utiliser comme variable l'amplitude instantanée du signal reçu ou traité par référence à un seuil déterminé expérimentalement.Various solutions are possible. The instantaneous amplitude of the signal received or processed can be used as a variable by reference to a threshold determined experimentally.

On peut aussi utiliser comme variable l'énergie du signal total sur une tranche temporelle de durée T, en seuillant, toujours expérimentalement, cette énergie.One can also use as variable the energy of the total signal on a time slice of duration T, by thresholding, always experimentally, this energy.

Ces seuillages permettent une première présomption sur la présence ou l'absence du signal. Ils sont de plus applicables à tout signal. Aussi, sont ils complétés par des systèmes de "confirmation", définissant des critères "quasi-certains", propres au type de signal utile, lorsque la nature de celui-ci est connue a priori.These thresholds allow a first presumption on the presence or absence of the signal. They are also applicable to any signal. Also, they are supplemented by "confirmation" systems, defining "almost certain" criteria, specific to the type of useful signal, when the nature of this is known a priori.

Un tel système complémentaire est largement utilisé en traitement de la parole et peut consister, par exemple, en une extraction de "pitch" ou en l'évaluation de l'énergie minimale d'une voyelle.Such a complementary system is widely used in speech processing and can consist, for example, in a pitch extraction or in the evaluation of the minimum energy of a vowel.

La présente invention a pour objet un procédé de détection d'un signal utile bruité, déterminant de façon la plus rigoureuse possible le seuil de détection, et pouvant fonctionner de façon autoadaptative.The subject of the present invention is a method for detecting a noisy useful signal, determining as rigorously as possible the detection threshold, and which can operate in a self-adaptive manner.

Selon l'invention, on dispose du rapport signal/bruit attendu du signal à traiter, et on dispose d'une mesure du bruit seul estimé, mesure numérisée sur M points, ce bruit étant blanc ou rendu blanc, on calcule l'énergie moyenne du bruit sur ces M points, on prend une tranche de N points de signal bruité, on calcule l'énergie moyenne de ces N points, on calcule le seuil de détection théorique, on calcule le rapport des deux dites énergies moyennes, et on compare ce rapport audit seuil.According to the invention, there is the expected signal / noise ratio of the signal to be processed, and there is a measurement of the estimated noise alone, measurement digitized on M points, this noise being white or made white, the average energy is calculated. noise on these M points, we take a slice of N noisy signal points, we calculates the average energy of these N points, the theoretical detection threshold is calculated, the ratio of the two said average energies is calculated, and this ratio is compared with said threshold.

La présente invention sera mieux comprise à la lecture de la description détaillée d'un mode de réalisation pris à titre d'exemple non limitatif.The present invention will be better understood on reading the detailed description of an embodiment taken by way of nonlimiting example.

On va d'abord expliquer comment doit se faire théoriquement, dans le cas idéal, la détection d'un signal bruité.We will first explain how to theoretically, in the ideal case, detect a noisy signal.

On dispose d'une première information u(n) pour une première tranche temporelle telle que : $u(n) = s(n) + x(n)$

n étant un nombre entier : 0 ≦ n ≦ N-1, s(n) étant un signal utile et x(n) un bruit. En outre, on dispose d'une autre information y(n), avec 0 ≦ n ≦ M-1, et M pouvant être égal à N ou différent de celui-ci. y(n) est une mesure du bruit x(n) sur une autre tranche temporelle exempte de signal utile.We have a first piece of information u (n) for a first time slot such as:

u (n) = s (n) + x (n)

n being an integer: 0 ≦ n ≦ N-1, s (n) being a useful signal and x (n) a noise. In addition, there is other information y (n), with 0 ≦ n ≦ M-1, and M can be equal to N or different from it. y (n) is a measure of the noise x (n) over another time slot free of useful signal.

On pose : $U = (u(0)² + u(1)² + ... + u(N)²)/N$

et

V = (y(0)² + y(1)² + ... + y(M)²)/M

et

Z = U/V

We ask:

U = (u (0) ² + u (1) ² + ... + u (N) ²) / N

and

V = (y (0) ² + y (1) ² + ... + y (M) ²) / M

and

Z = U / V

Ainsi, dans un cas idéal et irréaliste, on aurait, en notant RSB = rapport signal à bruit : $Z = 1 + RSB$

et le simple critère de détection serait :

Z > 1 : présence de signal utile

Z < 1 : absence de signal utile

Thus, in an ideal and unrealistic case, we would have, by noting RSB = signal to noise ratio:

Z = 1 + RSB

and the simple detection criterion would be:

Z> 1: presence of useful signal

Z <1: absence of useful signal

Selon la présente invention, on remplace le seuil théorique de 1 par un seuil µ, calculé de la façon expliquée ci-dessous, qui tient compte du fait que les signaux dont on dispose ne sont pas parfaitement ergodiques et que U et V ne sont que des estimées des valeurs vraies des variances σ_u2 et σ_x2.According to the present invention, the theoretical threshold of 1 is replaced by a threshold µ, calculated as explained below, which takes into account the fact that the signals available are not perfectly ergodic and that U and V are only estimates of the true values of the variances σ _u 2 and σ _x 2.

Pour effectuer ce calcul de µ, on procède de la façon suivante.To perform this calculation of µ, we proceed as follows.

On part du fait que les variables U et V sont de nature aléatoire, et que par conséquent Z l'est aussi. On calcule alors la densité de probabilité de Z (qui dépend du rapport signal sur bruit).We start from the fact that the variables U and V are random in nature, and therefore Z is too. We then calculate the probability density of Z (which depends on the signal-to-noise ratio).

Il s'agit ensuite, en faisant appel au principe du maximum de vraisemblance, de déterminer la meilleure estimation du rapport signal sur bruit après avoir calculé la variable Z.Next, using the maximum likelihood principle, determine the best estimate of the signal-to-noise ratio after calculating the variable Z.

A cet effet, on mesure sur une tranche temporelle la variable U(n) précitée, et on mesure la variable y(n) sur une autre tranche temporelle où l'on est sûr qu'il n'y a pas de signal utile, mais uniquement du bruit (indépendant et décorrélé de s(n)).To this end, the aforementioned variable U (n) is measured on a time slice, and the variable y (n) is measured on another time slice where it is certain that there is no useful signal, but only noise (independent and decorrelated from s (n)).

Pour déterminer la densité de la variable aléatoire Z (que l'on peut qualifier de variable observée), on procède de la façon suivante. Soient X₁ appartenant à N (m₁ ; σ₁²) et X₂ appartenant à N (m₂ ; σ₂²) deux variables aléatoires gaussiennes indépendantes pour lesquelles les probabilités P_r {X₁ < o} et P_r {X₂ < o} sont pratiquement nulles.
On pose : $m = m₁/m₂, σ² = σ₁²/σ₂², α = m₂/σ₂.$

To determine the density of the random variable Z (which can be called the observed variable), we proceed as follows. Let X₁ belonging to N (m₁; σ₁²) and X₂ belonging to N (m₂; σ₂²) two independent Gaussian random variables for which the probabilities P _r {X₁ <o} and P _r {X₂ <o} are practically zero.
We ask:

m = m₁ / m₂, σ² = σ₁² / σ₂², α = m₂ / σ₂.

La densité de probabilité f_x (x) de X est alors :

où U(x) = 1 si x ≧ o et U(x) = o si x < o.The probability density f _x (x) of X is then:

where U (x) = 1 if x ≧ o and U (x) = o if x <o.

Si $h(x) = α . \frac{x - m}{{(x² + σ²)}^{1/2}}$

on a : P(x) = P_r {X < x} = F [h(x)], expression dans laquelle F(x) désigne la fonction caractéristique de la variable gaussienne normalisée.Yes

h (x) = α. \frac{x - m}{{(x² + σ²)}^{1/2}}

we have: P (x) = P _r {X <x} = F [h (x)], expression in which F (x) denotes the characteristic function of the normalized Gaussian variable.

On suppose maintenant que les signaux s(n), x(n) et y(n) sont blancs, gaussiens et centrés.We now assume that the signals s (n), x (n) and y (n) are white, Gaussian and centered.

On pose

We pose

Ce dernier terme est donc, lui aussi, blanc, gaussien et centré ;
et on pose

This last term is therefore also white, Gaussian and centered;
and we ask

Puisque l'on définit σ_s² et σ_x², on suppose implicitement que le calcul de la densité de probabilité se fait à σ_s² et σ_x² connus. On évalue donc la densité de Z en connaissant σ_s² et σ_x². Dans ce cas, U et V suivent des lois du chi-2, et, pour N et M suffisamment grands, U et V sont approximées par des lois gaussiennes pratiquement toujours positives :

Z est donc le rapport de deux variables gaussiennes indépendantes. On peut facilement démontrer que U et V sont indépendantes.
Avec :

{m₁ = σ}_{u} {² , σ₁² = 2σ}_{u} {⁴/N , m₂ = σ}_{x} {² , σ₂² = 2σ}_{x} ⁴/M

il vient :

{m = σ}_{u} {²/σ}_{x} {² , σ² = (M/N) (σ}_{u} {²/σ}_{x} {²)², α = (M/2)}^{1/2} .

Or : σ_u²/σ_x²= 1+ r où r = σ_s²/σ_x² est le rapport signal à bruit. Soit k = M/N, il vient : m = r+1, σ²= k(r+1)².Since we define σ _s ² and σ _x ², we implicitly assume that the probability density is calculated at σ _s ² and σ _x ² known. We therefore evaluate the density of Z by knowing σ _s ² and σ _x ². In this case, U and V follow laws of chi-2, and, for N and M sufficiently large, U and V are approximated by Gaussian laws practically always positive:

Z is therefore the ratio of two independent Gaussian variables. We can easily demonstrate that U and V are independent.
With:

{m₁ = σ}_{u} {², σ₁² = 2σ}_{u} {⁴ / N, m₂ = σ}_{x} {², σ₂² = 2σ}_{x} ⁴ / M

he comes :

{m = σ}_{u} {² / σ}_{x} {², σ² = (M / N) (σ}_{u} {² / σ}_{x} {²) ², α = (M / 2)}^{1/2} .

Now: σ _u ² / σ _x ² = 1+ r where r = σ _s ² / σ _x ² is the signal to noise ratio. Let k = M / N, it comes: m = r + 1, σ² = k (r + 1) ².

La densité de probabilité de Z, connaissant σ_s² et σ_x², s'exprime donc par :

1°) x ≧ 0
2°) x ≦ 0 d'où: f_z(z : σ_s², σ_x²) = 0

On posera :

de sorte que : f_z(z:σ_s², σ_x²) = f_k,M(z,σ_s²/σ_x²)The probability density of Z, knowing σ _s ² and σ _x ², is therefore expressed by:

1 °) x ≧ 0
2 °) x ≦ 0 where: f _z (z: σ _s ², σ _x ²) = 0

We will ask:

so that: f _z (z: σ _s ², σ _x ²) = f _{k, M} (z, σ _s ² / σ _x ²)

D'après les résultats ci-dessus relatifs à la densité de probabilité f_x(x), on déduit la probabilité ${Pr { Z < z : σ}_{s} {², σ}_{x} ²}.$

From the above results relating to the probability density f _x (x), we deduce the probability

{Pr {Z <z: σ}_{s} {², σ}_{x} ²}.

Soit : $h_{k.M} {(x,r) = (M/2)}^{1/2} \frac{x - (r+1)}{{[x² + k(r+1)²]}^{1/2}}$

Il vient :

{Pr {Z < z : σ}_{s} {² ; σ}_{x} {²} = F{h}_{k,M} (x,r)}.

Is :

h_{kM} {(x, r) = (M / 2)}^{1/2} \frac{x - (r + 1)}{{[x² + k (r + 1) ²]}^{1/2}}

He comes :

{Pr {Z <z: σ}_{s} {²; σ}_{x} {²} = F {h}_{k, M} (x, r)}.

On va maintenant examiner le cas d'un signal quelconque s(n) et d'un bruit blanc gaussien.We will now examine the case of any signal s (n) and a white Gaussian noise.

On suppose toujours que les bruits x(n) et y(n) sont blancs, gaussiens avec σ_x² = E[x(n)²] = E[y(n)²]. Le signal utile s(n) est supposé quelconque, indépendant du bruit.We always assume that the noises x (n) and y (n) are white, Gaussian with σ _x ² = E [x (n) ²] = E [y (n) ²]. The useful signal s (n) is assumed to be arbitrary, independent of noise.

L'hypothèse nouvelle faite ici est de supposer que s(n) et x(n) sont non corrélés au sens temporel du terme, c'est-à-dire que : $c = \frac{Σ 0 ≦ n ≦ N-1 s (n) x (n)}{{(Σ 0 ≦ n ≦ N-1 s(n)²)}^{1/2} {(Σ 0 ≦ n ≦ N-1 x(n)²)}^{1/2}} = 0$

The new hypothesis made here is to suppose that s (n) and x (n) are uncorrelated in the temporal sense of the term, that is to say that:

c = \frac{Σ 0 ≦ n ≦ N-1 s (n) x (n)}{{(Σ 0 ≦ n ≦ N-1 s (n) ²)}^{1/2} {(Σ 0 ≦ n ≦ N-1 x (n) ²)}^{1/2}} = 0

On montre alors que U peut être approximée par : ${U = µ ²}_{s} + (1/N) Σ 0 ≦ n ≦ N-1 x(n)² et Z par :$

Z = \frac{µ_{s} ² + (1/N) Σ 0 ≦ n ≦ N-1 x(n)²}{(1/M) Σ 0 ≦ n ≦ M-1 y(n)²}

We then show that U can be approximated by:

{U = µ ²}_{s} + (1 / N) Σ 0 ≦ n ≦ N-1 x (n) ² and Z by:

Z = \frac{µ_{s} ² + (1 / N) Σ 0 ≦ n ≦ N-1 x (n) ²}{(1 / M) Σ 0 ≦ n ≦ M-1 y (n) ²}

De même que ci-dessous, le calcul de la densité de Z s'est fait en connaissant σ_s²et σ_x², ici le calcul se fera en connaissant µ_s² et σ_x². La densité à calculer sera notée par f_z(z:µ $\binom{2}{s}$

, σ

\binom{2}{x}

).As below, the calculation of the density of Z was done by knowing σ _s ² and σ _x ², here the calculation will be done by knowing µ _s ² and σ _x ². The density to be calculated will be noted by f _z (z: µ

\binom{2}{s}

, σ

\binom{2}{x}

).

Connaissant µ_s², U = µ_s² + (1/N) Σ 0≦n ≦ N-1 x(n)² appartient à

(µ_s² + σ_x²; (2/N) σ_x⁴). V appartient à

(σ_x² ; (2/M) σ_x⁴).Knowing µ _s ², U = µ _s ² + (1 / N) Σ 0 ≦ n ≦ N-1 x (n) ² belongs to

(µ _s ² + σ _x ²; (2 / N) σ _x ⁴). V belongs to

(σ _x ²; (2 / M) σ _x ⁴).

Z = U/V est donc approchée par le rapport de deux lois gaussiennes indépendantes. Comme U et V sont indépendantes, on applique donc le résultat concernant la densité de probabilité de X, avec : ${m₁ = µ}_{s} {² + σ}_{x} {², σ₁² = (2/N) σ}_{x} {⁴, m₂ = σ}_{x} {², σ₂² = (2/M)σ}_{x} ⁴$

Donc : m = r+1, σ² = k, α = (M/2)^1/2, avec k = M/N et r = µ_s²/σ_x².Z = U / V is therefore approximated by the ratio of two independent Gaussian laws. As U and V are independent, we therefore apply the result concerning the probability density of X, with:

{m₁ = µ}_{s} {² + σ}_{x} {², σ₁² = (2 / N) σ}_{x} {⁴, m₂ = σ}_{x} {², σ₂² = (2 / M) σ}_{x} ⁴

So: m = r + 1, σ² = k, α = (M / 2) ^1/2 , with k = M / N and r = µ _s ² / σ _x ².

La densité de probabilité de Z, connaissant µ_s² et σ_x² vaut donc :

On posera :

de sorte que : f_z(z:σ_s², σ_x²) = f_k,M(z, σ_s²/σ_x²)The probability density of Z, knowing µ _s ² and σ _x ² is therefore worth:

We will ask:

so that: f _z (z: σ _s ², σ _x ²) = f _{k, M} (z, σ _s ² / σ _x ²)

D'après les résultats ci-dessus concernant la densité de probabilité de X, on en déduit la probabilité ${Pr { Z < z : µ}_{s} {², σ}_{x} ²}.$

Soit :

h_{k,M} {(x,r) = (M/2)}^{1/2} \frac{x - (r+1)}{{[x² + k]}^{1/2}}

il vient :

{Pr { Z < z : µ}_{s} {, σ}_{x} {² } = F (h}_{k,M} (x,r))

From the above results concerning the probability density of X, we deduce the probability

{Pr {Z <z: µ}_{s} {², σ}_{x} ²}.

Is :

h_{k, M} {(x, r) = (M / 2)}^{1/2} \frac{x - (r + 1)}{{[x² + k]}^{1/2}}

he comes :

{Pr {Z <z: µ}_{s} {, σ}_{x} {²} = F (h}_{k, M} (x, r))

Selon la présente invention, on met en oeuvre la détection d'activité en faisant appel au maximum de vraisemblance.According to the present invention, the activity detection is implemented by using maximum likelihood.

Dans les cas de signaux traités, la densité de probabilité de la variable Z, connaissant les énergies du signal utile et du bruit, s'exprime par une fonction de la forme :
f_k,M(z,r) où r désigne le rapport signal à bruit. Cette probabilité dépend donc du rapport signal sur bruit. Aussi, la règle de décision ne peut se donner qu'à rapport signal sur bruit attendu. Soit donc r_o ce rapport signal à bruit attendu.In the case of processed signals, the probability density of the variable Z, knowing the energies of the useful signal and of the noise, is expressed by a function of the form:
f _{k, M} (z, r) where r denotes the signal to noise ratio. This probability therefore depends on the signal to noise ratio. Also, the decision rule can only be given with an expected signal-to-noise ratio. Let r _{o be} this expected signal-to-noise ratio.

On suppose que la probabilité d'absence de s(n) est π_o et que la probabilité de présence de s(n) est π₁.We suppose that the probability of absence of s (n) is π _o and that the probability of presence of s (n) is π₁.

Puisqu'on connaît la densité de probabilité f_k,M(z,r) la règle de décision optimale est fournie par la théorie générale de la détection et s'exprime par : $\frac{{π₁ f}_{k,M} {(z,r}_{o})}{π_{o} f_{k,M} (z,0)} > 1 ⇒ D = 1$

\frac{{π₁ f}_{k,M} {(z,r}_{o})}{π_{o} f_{k,M} (z,0)} < 1 ⇒ D = 0

Since we know the probability density f _{k, M} (z, r) the optimal decision rule is provided by the general theory of detection and is expressed by:

\frac{{π₁ f}_{k, M} {(z, r}_{o})}{π_{o} f_{k, M} (z, 0)} > 1 ⇒ D = 1

\frac{{π₁ f}_{k, M} {(z, r}_{o})}{π_{o} f_{k, M} (z, 0)} <1 ⇒ D = 0

On peut aussi exprimer cette régle de décision sous la forme : $(Z < µ ⇒ D = 0) et (Z > µ ⇒ D = 1).$

We can also express this decision rule in the form:

(Z <µ ⇒ D = 0) and (Z> µ ⇒ D = 1).

Il faut alors déterminer µ et résoudre l'équation : ${ln[f}_{k,M} {(z,r}_{o} {)] - ln[f}_{k,M} {(z, 0)] - ln(π}_{o} /π₁) = 0.$

We must then determine µ and solve the equation:

{ln [f}_{k, M} {(z, r}_{o} {)] - ln [f}_{k, M} {(z, 0)] - ln (π}_{o} / π₁) = 0.

On démontre alors que la probabilité d'errreur vaut : ${Pe = π}_{o} {[1 - F(h}_{k,M} {(µ,0))] + π₁ F(h}_{k,M} {(µ,r}_{o})).$

We then demonstrate that the probability of error is worth:

{Pe = π}_{o} {[1 - F (h}_{k, M} {(µ, 0))] + π₁ F (h}_{k, M} {(µ, r}_{o})).

On va maintenant examiner le cas de la détection d'un signal blanc gaussien dans un bruit lui-même blanc gaussien.We will now examine the case of the detection of a white Gaussian signal in a white Gaussian noise itself.

Les signaux s(n), x(n) et y(n) sont supposés blancs, gaussiens, centrés. Soit r_o le rapport signal à bruit attendu, et k = M/N. la probabilité d'absence de s(n) est π_o et la probabilité de présence de s(n) est π₁.The signals s (n), x (n) and y (n) are assumed to be white, Gaussian, centered. Let r _{o be} the expected signal-to-noise ratio, and k = M / N. the probability of absence of s (n) is π _o and the probability of presence of s (n) is π₁.

La règle de décision est alors :
Décision D = 1 lorsque :

Décision D = 0 lorsque :

The decision rule is then:
Decision D = 1 when:

Decision D = 0 when:

Le seuil étant déterminé pour l'égalité (au lieu d'inégalité) entre les termes de ces deux expressions.The threshold being determined for equality (instead of inequality) between the terms of these two expressions.

On peut aussi exprimer cette règle de décision sous la forme : $(Z < µ ⇒ D = 0) et (Z > µ ⇒ D = 1).$

On obtient par exemple pour µ, à M = N = 128, π_o = π₁ = 1/2 :

r_o en dB µ

-2 1,27 -1 1,34 0 1,41 1 1,50 2 1,68

We can also express this decision rule in the form:

(Z <µ ⇒ D = 0) and (Z> µ ⇒ D = 1).

We obtain for example for µ, at M = N = 128, π _o = π₁ = 1/2:

r _o in dB µ

-2 1.27 -1 1.34 0 1.41 1 1.50 2 1.68

La probabilité d'erreur est : ${Pe = π}_{o} {[1-F(h}_{k,M} {(µ,0))] + π₁ F(h}_{k,M} {(µ,r}_{o}))$

avec :

h_{k,M} {(x,r) = (M/2)}^{1/2} \frac{x - (r+1)}{{[x² + k(r+1)²]}^{1/2}}

The probability of error is:

{Pe = π}_{o} {[1-F (h}_{k, M} {(µ, 0))] + π₁ F (h}_{k, M} {(µ, r}_{o}))

with:

h_{k, M} {(x, r) = (M / 2)}^{1/2} \frac{x - (r + 1)}{{[x² + k (r + 1) ²]}^{1/2}}

Nous donnons ci-après quelques valeurs de Pe fonction de r_o . π_o et π₁ sont prises égales à 0,5. r_o en dB Pe -2 0,086 -1 0,052 0 0,028 1 0,013 2 0,005 We give below some values of Pe as a function of r _o . π _o and π₁ are taken equal to 0.5. r _o in dB Pe -2 0.086 -1 0.052 0 0.028 1 0.013 2 0.005

Dans un exemple de simulation, on a généré un bruit blanc gaussien de variance unité. Pour chaque trame de 128 points (N = M = 128), on a décidé aléatoirement de générer un bruit s(n) additif, présentant un rapport signal sur bruit défini préalablement. Les probabilités d'apparition et d'absence (π_o et π₁) sont égales à 0,5. On a généré un second bruit blanc gaussien de variance unité, qui a servi à calculer la variable aléatoire V. Pour chaque trame, on a calculé Z. On a appliqué alors la règle de décision et l'on a compté le nombre d'erreurs. r_o en dB Nombre d'erreurs sur 1000 itérations -2 73 -1 43 0 18 1 10 2 2 In a simulation example, a white Gaussian noise with unit variance was generated. For each frame of 128 points (N = M = 128), it was randomly decided to generate an additive noise s (n), having a signal to noise ratio defined beforehand. The probabilities of appearance and absence (π _o and π₁) are equal to 0.5. We generated a second Gaussian white noise with unit variance, which was used to calculate the random variable V. For each frame, we calculated Z. We then applied the decision rule and we counted the number of errors . r _o in dB Number of errors in 1000 iterations -2 73 -1 43 0 18 1 10 2 2

Ces résultats corroborent ceux prévus par le calcul théorique.These results corroborate those predicted by the theoretical calculation.

On suppose toujours que les bruits x(n) et y(n) sont blancs, gaussiens avec σ_x²= E[x(n)²]=E[y(n)²]. Le signal utile s(n) est supposé quelconque, indépendant du bruit. Soit r_o le rapport signal à bruit attendu, k = M/N. La probabilité d'absence de s(n) est π_o et celle de présence de s(n) est π₁.
La règle de décision est alors :
Décision D = 1 lorsque : $ln \frac{(r+1) z+k}{z+k} > (M/4) \frac{{[z-(r}_{o} +1)]² - (z-1)²}{z²+k} + ln \frac{π_{o}}{π₁}$

Décision D = 0 lorsque :

ln \frac{(r+1) z+k}{z+k} < (M/4) \frac{{[z-(r}_{o} +1)]² - (z-1)²}{z²+k} + ln \frac{π_{o}}{π₁}

We always assume that the noises x (n) and y (n) are white, Gaussian with σ _x ² = E [x (n) ²] = E [y (n) ²]. The useful signal s (n) is assumed to be arbitrary, independent of noise. Let r _{o be} the expected signal-to-noise ratio, k = M / N. The probability of absence of s (n) is π _o and that of presence of s (n) is π₁.
The decision rule is then:
Decision D = 1 when:

ln \frac{(r + 1) z + k}{z + k} > (M / 4) \frac{{[z- (r}_{o} +1)] ² - (z-1) ²}{z² + k} + ln \frac{π_{o}}{π₁}

Decision D = 0 when:

ln \frac{(r + 1) z + k}{z + k} <(M / 4) \frac{{[z- (r}_{o} +1)] ² - (z-1) ²}{z² + k} + ln \frac{π_{o}}{π₁}

We can also express this decision rule in the form:

(Z <µ ⇒ D = 0) and (Z> µ ⇒ D = 1).

On obtient pour µ les valeurs suivantes en fonction de r_o, pour M = N = 128, π_o = π₁ = 1/2. r_o en dB µ -2 1,30 -1 1,38 0 1,48 1 1,60 2 1,76 The following values are obtained for µ as a function of r _o , for M = N = 128, π _o = π₁ = 1/2. r _o in dB µ -2 1.30 -1 1.38 0 1.48 1 1.60 2 1.76

De plus, on obtient : ${Pe = π}_{o} {[ 1-F (h}_{k,M} {(µ,0)) ] + π₁ F (h}_{k,M} {(µ, r}_{o}))$

avec :

h_{k,M} {(x,r) = (M/2)}^{1/2} \frac{x - (r+1)}{{[x² + k ]}^{1/2}}

In addition, we get:

{Pe = π}_{o} {[1-F (h}_{k, M} {(µ, 0))] + π₁ F (h}_{k, M} {(µ, r}_{o}))

with:

h_{k, M} {(x, r) = (M / 2)}^{1/2} \frac{x - (r + 1)}{{[x² + k]}^{1/2}}

Nous donnons ci-après quelques valeurs de Pe en fonction de r_o. Les probabilités π_o et π_osont prises égales à 0,5. r_o en dB Pe -2 0,062 -1 0,032 0 0,013 1 0,004 2 0,001 We give below some values of Pe as a function of r _o . The probabilities π _o and π _o are taken equal to 0.5. r _o in dB Pe -2 0.062 -1 0.032 0 0.013 1 0.004 2 0.001

Dans un exemple de simulation, pour chaque trame de 128 points de bruit blanc généré (N = M = 128), on a décidé aléatoirement d'y ajouter s(n), qui est ici, une sinusoïde, présentant un rapport signal sur bruit défini préalablement. π₁ et π_o sont prises égales à 0,5.In a simulation example, for each frame of 128 points of white noise generated (N = M = 128), we decided randomly to add s (n), which is here, a sinusoid, presenting a signal to noise ratio defined beforehand. π₁ and π _o are taken equal to 0.5.

On a généré un second bruit blanc gaussien de variance unité, servant à calculer V. Pour chaque trame, on a calculé Z et on a appliqué la règle de décision précitée. On a compté le nombre d'erreurs.A second white Gaussian noise of unit variance was generated, used to calculate V. For each frame, Z was calculated and the above decision rule was applied. We counted the number of errors.

On a obtenu les résultats suivants r_o en dB Nombre d'erreurs sur 1000 itérations -2 70 -1 37 0 12 1 6 2 3 The following results were obtained r _o in dB Number of errors in 1000 iterations -2 70 -1 37 0 12 1 6 2 3

Les résultats précédents, parce que très généraux, permettent la détection de signaux noyés dans du bruit additif, même lorsque le rapport signal sur bruit est faible, voisin de 0 dB.The previous results, because they are very general, allow the detection of signals embedded in additive noise, even when the signal to noise ratio is low, close to 0 dB.

On va décrire ci-dessous une application dans laquelle ce type de détection peut se révéler très utile.We will describe below an application in which this type of detection can be very useful.

Les algorithmes présentés s'appliquent au cas de la parole, comme pré-système de détection d'activité vocale.The algorithms presented apply to the case of speech, as a pre-system for detecting voice activity.

Le choix du seuil de détection dépend du contexte.The choice of detection threshold depends on the context.

En ce qui concerne les bandes audio utilisées, une caractérisation préalable du bruit et de la parole, à l'aide de mesures basées sur l'estimation par maximum de vraisemblance montre que le signal vocal à détecter présente un rapport signal sur bruit d'au moins 6 dB.With regard to the audio tapes used, a preliminary characterization of noise and speech, using measurements based on the maximum likelihood estimation shows that the speech signal to be detected has a signal-to-noise ratio of at least minus 6 dB.

D'autre part, le système de traitement utilise des trames de signal de 128 points, la fréquence d'échantillonnage étant de 10 kHz.On the other hand, the processing system uses 128 point signal frames, the sampling frequency being 10 kHz.

Les variables U et V sont toutes les deux évaluées sur 128 points de sorte que M = N = 128.The variables U and V are both evaluated on 128 points so that M = N = 128.

D'après ce qui précède, on déduit le seuil théorique de détection à 3.From the above, the theoretical detection threshold is deduced from 3.

Cependant, on ne peut pas se contenter de cet unique seuil. En effet, si le bruit est relativement stationnaire, il présente des instationnarités à prendre en compte pour renouveler la variable V, ce qui permet de rendre l'algorithme partiellement adaptatif.However, we cannot be satisfied with this single threshold. In fact, if the noise is relatively stationary, it has instabilities to be taken into account to renew the variable V, which makes the algorithm partially adaptive.

On introduit donc un second seuil, qui permet de décider si la variable V va être renouvelée ou non.A second threshold is therefore introduced, which makes it possible to decide whether the variable V will be renewed or not.

Ce second seuil est choisi à 1,25, ce qui correspond à un bruit additif au bruit stationnaire présentant un rapport signal à bruit de -2 dB.This second threshold is chosen at 1.25, which corresponds to noise additive to stationary noise having a signal to noise ratio of -2 dB.

La règle de décision est alors :The decision rule is then:

If Z <1.25 :

Alors la trame traitée est composée du même bruit que celle utilisée comme référence. La variable V est remplacée par la valeur de l'énergie de la trame traitée.Then the processed frame is composed of the same noise as that used as a reference. The variable V is replaced by the value of the energy of the processed frame.

On notera que, puisque la décision est de considérer la trame traitée comme du bruit représentatif, on pourrait renouveler la variable V en faisant la moyenne de l'ancienne valeur de V et de l'énergie de la trame considérée. Ce qui amène à changer la valeur de M (nombre de points sur lequel est évalué V) mais cette opération peut induire un mauvais fonctionnement de l'algorithme.Note that since the decision is to consider the frame processed as representative noise, we could renew the variable V by averaging the old value of V and the energy of the frame considered. Which brings to change the value of M (number of points on which V is evaluated) but this operation can induce a malfunction of the algorithm.

If 1.25 <z> 3 :

La trame est considérée comme contenant une non-stationnarité du bruit, et exempte de parole.The frame is considered to contain non-stationarity of the noise, and free of speech.

If 3 <Z :

La trame est considéré comme de la parole.The frame is considered to be speech.

Des essais effectués sur des échantillons de signaux bruités ont validé cette détection.Tests carried out on samples of noisy signals have validated this detection.

Cependant, rappelons que cette détection vocale peut être améliorée par l'utilisation de critères propres au signal de parole, tel que le calcul de "pitch".However, remember that this voice detection can be improved by the use of criteria specific to the speech signal, such as the calculation of "pitch".

L'algorithme proposé ici concerne l'étude de quelques exemples de signaux. Il est évident que pour d'autres signaux de parole présentant des rapports signal à bruit différents, un nouveau choix de seuils est nécessaire.The algorithm proposed here concerns the study of some examples of signals. It is obvious that for other speech signals having different signal to noise ratios, a new choice of thresholds is necessary.

L'utilisation de deux seuils est généralement préférable.The use of two thresholds is generally preferable.

Une application de cet algorithme permet de créer des fichiers de référence corrects pour le système de reconnaissance vocale étudié. Une segmentation précise des élocutions est alors nécessaire.An application of this algorithm makes it possible to create correct reference files for the voice recognition system studied. A precise segmentation of speech is then necessary.

Dans une application, on a utilisé un alternat micro (ouverture et fermeture micro) qui fournit une segmentation grossière des élocutions.In one application, a micro alternation (micro opening and closing) was used which provides rough segmentation of speech.

L'algorithme précédent a été utilisé pour affiner cet alternat. Une première passe de l'algorithme a permis de préciser le début de l'élocution. Une seconde passe a consisté à lire le fichier de parole "à l'envers", c'est-à-dire en partant de la fermeture micro vers l'ouverture micro. Ce qui a permis alors de préciser la fin de l'élocution.The previous algorithm was used to refine this alternation. A first pass of the algorithm made it possible to specify the start of the speech. A second pass consisted in reading the speech file "upside down", that is to say starting from the microphone closure towards the microphone opening. This then made it possible to specify the end of the speech.

Cette utilisation non causale de l'algorithme est nécessaire, car la détection d'activité est suffisamment précise pour détecter, à l'intérieur des mots, la présence de silences, ce qui est préjudiciable à une mise en place d'une segmentation pour les apprentissages.This non-causal use of the algorithm is necessary, since the activity detection is precise enough to detect, within words, the presence of rests, which is detrimental to the establishment of segmentation for the learning.

Le même type d'application permet aussi de segmenter les fichiers de parole sur lesquels on effectue une reconnaissance.The same type of application also makes it possible to segment the speech files on which a recognition is carried out.

Cependant, cet algorithme n'est évidemment pas causal, ce qui est préjudiciable à une utilisation temps réel. D'où la nécessité de compléter cet algorithme par un calcul propre au traitement de la parole.However, this algorithm is obviously not causal, which is detrimental to real-time use. Hence the need to complete this algorithm with a calculation specific to speech processing.

Nous avons démontré l'existence de seuils optimaux de détection, ce qui permet d'avoir une approche théorique du problème de l'estimation du rapport signal sur bruit et, surtout de la détection, dans le cas d'un bruit blanc et d'un signal connu seulement par son énergie sur N points lorsque celle-ci reste relativement stationnaire.We have demonstrated the existence of optimal detection thresholds, which provides a theoretical approach to the problem of estimating the signal-to-noise ratio and, especially of detection, in the case of white noise and a signal known only by its energy on N points when it remains relatively stationary.

Claims

Method for detecting a noisy useful signal, for which there is the expected signal / noise ratio of the signal to be processed, and a measurement of the estimated noise alone, measurement digitized on M points, characterized by the fact that l average energy of the noise on these M points, we take a slice of N noisy signal points, we calculate the average energy of these N points, we calculate the theoretical detection threshold, we calculate the ratio (Z) of the two so-called average energies, and this ratio is compared to said threshold.

Method according to claim 1, characterized in that the only estimated noise is white or made white.

Method according to one of the preceding claims, characterized in that the theoretical detection threshold is determined for:

ln \frac{(r + 1) z + k}{z + k} = (M / 4) \frac{{[z- (r}_{o} +1)] ² - (z-1) ²}{z² + k} + ln \frac{π_{o}}{π₁}

r _o being the expected signal to noise ratio, k = M / N, π _o being the probability of absence of the useful signal and π₁ its probability of presence.

Method according to one of claims 1 or 2 for the detection of a white Gaussian signal, characterized in that the theoretical detection threshold is determined for:

Method according to one of claims 1 to 3, for the detection of speech, characterized in that, in addition to the theoretical detection threshold, a second decision threshold is used to update the measured portion of the estimated noise alone , in order to take account of the instarity of the noise.