FR3142639A1

FR3142639A1 - METHOD FOR CHARACTERIZING A FILTER FOR PROCESSING THE VOICE OF AN INDIVIDUAL, COMMUNICATION DEVICE

Info

Publication number: FR3142639A1
Application number: FR2212450A
Authority: FR
Inventors: Jonas GUERCHE; Gabriel GUERCHE; Denis EFIMOV; Rosane USHIROBIRA
Original assignee: 52 Hertz
Current assignee: 52 Hertz
Priority date: 2022-11-28
Filing date: 2022-11-28
Publication date: 2024-05-31
Also published as: WO2024115504A1

Abstract

PROCEDE DE CARACTERISATION D’UN FILTRE POUR LE TRAITEMENT D’UNE VOIX D’UN INDIVIDU, DISPOSITIF DE COMMUNICATION Procédé de caractérisation d’un filtre (F1) pour le traitement d’une voix d’un sujet donné, ladite voix étant acquise par un micro agencé dans un embout buccal, ledit procédé comportant : Acquisition d’un premier signal correspondant à une voix d’un sujet déformée ;Acquisition d’un second signal correspondant à une voix d’un sujet non déformé ;Echantillonnage du premier signal ;Estimation de valeurs moyennes de l’amplitude du premier signal ;Estimation de valeurs de vitesse du signal à partir d’une méthode du différenciateur homogène en temps-fini;Calcul d’un premier ensemble de coefficients ;Calcul d’une erreur entre un ensemble d’échantillons du second signal et les échantillons obtenus en sortie du premier modèle,Itération(s) du calcul des coefficients du premier modèle. Figure pour l’abrégé : Fig. 1METHOD FOR CHARACTERIZING A FILTER FOR PROCESSING A VOICE OF AN INDIVIDUAL, COMMUNICATION DEVICE Method for characterizing a filter (F1) for processing a voice of a given subject, said voice being acquired by a microphone arranged in a mouthpiece, said method comprising: Acquisition of a first signal corresponding to a distorted subject's voice; Acquisition of a second signal corresponding to an undistorted subject's voice; Sampling of the first signal; Estimation of average values of the amplitude of the first signal; Estimation of signal speed values using a finite-time homogeneous differentiator method; Calculation of a first set of coefficients; Calculation of an error between a set of samples of the second signal and the samples obtained at the output of the first model, Iteration(s) of the calculation of the coefficients of the first model. Figure for abstract: Fig. 1

Description

METHOD FOR CHARACTERIZING A FILTER FOR PROCESSING THE VOICE OF AN INDIVIDUAL, COMMUNICATION DEVICE

Field of the invention

L’invention concerne un procédé pour traiter et corriger un flux audio altéré provenant d’un individu ayant par exemple en bouche un embout buccal lui permettant de respirer sous l’eau et comportant un micro. Le domaine de l’invention se rapporte aux procédés mis en œuvre par un ordinateur embarqué dans un équipement électronique pour être exécuté en temps réel, sans connexon extérieure et configurés pour rendre plus intelligible un flux audio déformé par la présence d’un élément maintenu en bouche.The invention relates to a method for processing and correcting an altered audio stream coming from an individual having, for example, a mouthpiece in their mouth allowing them to breathe underwater and comprising a microphone. The field of the invention relates to methods implemented by a computer embedded in electronic equipment to be executed in real time, without an external connection and configured to make an audio stream distorted by the presence of an element held in place more intelligible. mouth.

State of the art

On connait le document US2012/0213034 décrivant un embout buccal de détendeur à mettre dans la bouche et comprenant un microphone ainsi qu’un écouteur à conduction osseuse faisant vibrer un plateau de conduction acoustique pour faire passer le signal par les dents de l’utilisateur.We know the document US2012/0213034 describing a regulator mouthpiece to be placed in the mouth and comprising a microphone as well as a bone conduction earpiece vibrating an acoustic conduction plate to pass the signal through the user's teeth.

Un défaut est que le signal capté est peu intelligible du fait de la partie labiale de l’embout buccal empêchant l’utilisateur de bouger les lèvres de manière suffisante. Il existe un besoin de reconstruire un son intelligible à partir d’un procédé de traitement de la voix.A fault is that the signal captured is hardly intelligible due to the labial part of the mouthpiece preventing the user from moving their lips sufficiently. There is a need to reconstruct intelligible sound using a voice processing process.

Un premier problème est que le flux audio dépend beaucoup de l’individu qui prononce les phrases qui sont déformées par la présence de l’embout. Un second problème est que certains algorithmes demandent des temps de calculs importants qui ne permettent pas d’obtenir un traitement en temps réel de la voix d’un individu, sur un système embarqué, rendant ainsi difficile la communication entre deux individus au moyen de ce type d’équipement. Tout au plus il est possible de récupérer la voix d’un nageur et de la traiter en différé.A first problem is that the audio flow depends a lot on the individual who pronounces the sentences which are distorted by the presence of the mouthpiece. A second problem is that certain algorithms require significant calculation times which do not allow real-time processing of an individual's voice, on an embedded system, thus making communication between two individuals by means of this difficult. type of equipment. At most it is possible to recover the voice of a swimmer and process it offline.

L’invention propose un procédé mis en œuvre par ordinateur, notamment un ordinateur embarqué, tel qu’une unité de commande électronique, appelé MCU, un microprocesseur, un microcontrôleur ou un FPGA. Le procédé de l’invention permet la reconstruction et/ou la correction de certaines syllabes par un traitement en temps réel du flux audio acquis par un microphone situé au voisinage de la bouche d’un plongeur. Le procédé de l’invention permet notamment de transmettre ou de traiter en réception le flux audio afin qu’un autre plongeur puisse écouter de manière intelligible le flux audio émis par un premier plongeur.The invention proposes a method implemented by computer, in particular an embedded computer, such as an electronic control unit, called MCU, a microprocessor, a microcontroller or an FPGA. The method of the invention allows the reconstruction and/or correction of certain syllables by real-time processing of the audio stream acquired by a microphone located in the vicinity of a diver's mouth. The method of the invention makes it possible in particular to transmit or process the audio stream in reception so that another diver can listen intelligibly to the audio stream emitted by a first diver.

Selon un premier aspect l’invention concerne un procédé de caractérisation d’un filtre pour le traitement d’une voix d’un sujet donné, ladite voix étant acquise par un micro agencé dans un dispositif d’aide à la respiration sous-marine destiné à être porté à proximité de la bouche d’un sujet, ledit procédé comportant :

Acquisition d’au moins un premier signal correspondant à la prononciation d’un premier ensemble de phonèmes par ledit sujet donné ; ledit premier signal correspondant à une voix d’un sujet acquise par le micro du dispositif d’aide à la respiration sous-marine lorsque ce dernier est porté par ledit sujet ;
Acquisition d’au moins un second signal correspondant à la prononciation du premier ensemble de phonèmes par ledit sujet ; ledit second signal correspondant à une voix d’un sujet acquise par un micro sans que le sujet porte un dispositif d’aide à la respiration sous-marine ;
Pour le premier signal, exécution des étapes suivantes :
- Echantillonnage du premier signal ;
- Estimation d’un premier ensemble de valeurs moyennes de l’amplitude du premier signal sur une pluralité de premières fenêtres prédéfinies, chaque première fenêtre comportant une pluralité d’échantillons du premier signal ;
- Estimation d’un second ensemble de valeurs de vitesse du signal à partir d’une méthode du différenciateur homogène en temps-fini sur la pluralité de premières fenêtres prédéfinies ;
- Calcul d’un premier ensemble de coefficients d’un système d’équations différentielles représentant un premier modèle dont les solutions sont le premier et le second ensemble de valeurs ;
Calcul d’une erreur entre au moins un ensemble d’échantillons du second signal et les échantillons obtenus en sortie du premier modèle exécuté avec des valeurs du premier signal et les coefficients estimés,
Itération(s) du calcul des coefficients du premier modèle en minimisant à chaque itération ladite erreur calculée, le nombre d’itérations étant configuré pour minimiser l’erreur en dessous d’un seuil prédéfini de manière à définir des coefficients d’exploitation, lesdits coefficients d’exploitation définissant les coefficients du filtre.

According to a first aspect, the invention relates to a method for characterizing a filter for processing a voice of a given subject, said voice being acquired by a microphone arranged in an underwater breathing aid device intended to be worn close to the mouth of a subject, said method comprising:

Acquisition of at least a first signal corresponding to the pronunciation of a first set of phonemes by said given subject; said first signal corresponding to a voice of a subject acquired by the microphone of the underwater breathing aid device when the latter is worn by said subject;
Acquisition of at least one second signal corresponding to the pronunciation of the first set of phonemes by said subject; said second signal corresponding to a subject's voice acquired by a microphone without the subject wearing an underwater breathing aid device;
For the first signal, perform the following steps:
- Sampling of the first signal;
- Estimation of a first set of average values of the amplitude of the first signal over a plurality of first predefined windows, each first window comprising a plurality of samples of the first signal;
- Estimation of a second set of signal speed values from a finite-time homogeneous differentiator method over the plurality of first predefined windows;
- Calculation of a first set of coefficients of a system of differential equations representing a first model whose solutions are the first and the second set of values;
Calculation of an error between at least one set of samples of the second signal and the samples obtained at the output of the first model executed with values of the first signal and the estimated coefficients,
Iteration(s) of calculating the coefficients of the first model by minimizing at each iteration said calculated error, the number of iterations being configured to minimize the error below a predefined threshold so as to define operating coefficients, said operating coefficients defining the filter coefficients.

Un avantage est de permettre une caractérisation d’un filtre audio afin de reconstruire ensuite un son intelligible lorsqu’une voix est déformé par un embout buccal. Un avantage de la méthode de l’invention est de proposer un algorithme peu gourmand en puissance de calcul et en mémoire pour être implémenté sur un calculateur embarqué. Un autre avantage est de permettre un encodage en lignes de code simple et comportant un minimum de lignes de codes. Un autre avantage est de permettre un algorithme adaptatif à l’utilisateur et à son environnement.One advantage is to allow characterization of an audio filter in order to then reconstruct an intelligible sound when a voice is distorted by a mouthpiece. An advantage of the method of the invention is to propose an algorithm that requires little computing power and memory to be implemented on an on-board computer. Another advantage is to allow encoding in simple lines of code and comprising a minimum of lines of code. Another advantage is to allow an algorithm that is adaptive to the user and their environment.

Selon un mode de réalisation, le procédé de caractérisation comprend une étape de filtrage du premier signal réalisée préalablement à l’échantillonnage.According to one embodiment, the characterization method comprises a step of filtering the first signal carried out prior to sampling.

Un avantage est de permettre une caractérisation d’un filtre audio s’affranchissant des bruits, c’est-à-dire une caractérisation de la déformation induite de la voix du fait de l’équipement présent en bouche.An advantage is to allow a characterization of an audio filter that eliminates noise, that is to say a characterization of the induced distortion of the voice due to the equipment present in the mouth.

Selon un mode de réalisation, l’échantillonnage du premier signal comprend les étapes suivantes :

Analyse de la densité spectrale du signal afin de déterminer des seuils caractéristiques d’amplitude ou de puissance ;
Sélection d’un ensemble de fréquences caractéristiques pour lesquelles les seuils sont dépassés ;
Echantillonnage de segments du premier signal autour de chaque fréquence caractéristique ;
Génération d’un ensemble d’échantillons pour le traitement du premier signal.

According to one embodiment, sampling the first signal comprises the following steps:

Analysis of the spectral density of the signal in order to determine characteristic amplitude or power thresholds;
Selection of a set of characteristic frequencies for which the thresholds are exceeded;
Sampling of segments of the first signal around each characteristic frequency;
Generation of a set of samples for processing the first signal.

Un avantage est de focaliser les temps de calculs sur les zones d’intérêts de la voix, c’est à dire les fréquences caractéristiques de la voix d’un individu, notamment celle qui un maximum de signal utile.An advantage is to focus the calculation times on the areas of interest of the voice, that is to say the characteristic frequencies of the voice of an individual, in particular that which provides a maximum of useful signal.

Selon un mode de réalisation, l’échantillonnage produit entre 400 et 96000 échantillons pendant une 1 seconde.According to one embodiment, sampling produces between 400 and 96,000 samples for 1 second.

Un intérêt est de permettre différents modes de fonctionnement selon si on souhaite privilégier le temps de calcul ou la qualité du signal.An advantage is to allow different operating modes depending on whether we wish to prioritize calculation time or signal quality.

Un échantillonnage à 4000 échantillons par seconde permet notamment de garder un spectre de la voix entre 300 Hz et 1300 Hz et réduire la puissance de calcul.Sampling at 4000 samples per second makes it possible to keep a voice spectrum between 300 Hz and 1300 Hz and reduce computing power.

Selon un mode de réalisation, le procédé est répété pour une pluralité d’acquisitions de premiers signaux, chaque premier signal acquis correspondant à la prononciation d’un nouvel ensemble de phonèmes par le même sujet, le calcul de l’erreur et sa minimisation étant réalisés pour chaque nouveau premier signal acquis.According to one embodiment, the method is repeated for a plurality of acquisitions of first signals, each first acquired signal corresponding to the pronunciation of a new set of phonemes by the same subject, the calculation of the error and its minimization being carried out for each new first signal acquired.

Un avantage est de réaliser le procédé de l’invention pour quelques phonèmes, c’est-à-dire quelques phrases pour permettre de caractériser un filtre rapidement. Un avantage est de permettre un entrainement rapide de l’algorithme à partir de 3 ou 4 phrases prononcées par un individu. Ainsi, l’équipement peut être rapidement prêt, configuré et entrainé juste avant son utilisation, par exemple juste avant de réaliser une plongée.An advantage is to carry out the method of the invention for a few phonemes, that is to say a few sentences to allow a filter to be characterized quickly. An advantage is to allow rapid training of the algorithm from 3 or 4 sentences spoken by an individual. Thus, the equipment can be quickly ready, configured and trained just before use, for example just before making a dive.

Selon un mode de réalisation, le système d’équations différentielles est du premier ordre.According to one embodiment, the system of differential equations is of the first order.

Un avantage est la simplification des calculs et donc le gain en temps de calcul.An advantage is the simplification of calculations and therefore the saving in calculation time.

Selon un mode de réalisation, le procédé de caractérisation comprend :

Estimation d’un troisième ensemble de valeurs d’accélération du signal à partir de la méthode du différenciateur homogène en temps-fini sur la pluralité de premières fenêtres prédéfinies ;
Calcul d’un premier ensemble de coefficients d’un système d’équations différentielles du premier ordre représentant un premier modèle dont les solutions sont le premier et le second ensemble de valeurs.

According to one embodiment, the characterization method comprises:

Estimation of a third set of signal acceleration values using the finite-time homogeneous differentiator method over the plurality of first predefined windows;
Calculation of a first set of coefficients of a system of first order differential equations representing a first model whose solutions are the first and second set of values.

Un avantage est l’amélioration de la caractérisation du filtre permettant une meilleure reconstruction de la voie déformée d’un individu.An advantage is the improvement in the characterization of the filter allowing better reconstruction of the deformed path of an individual.

Selon un mode de réalisation, le procédé de caractérisation comprend :

Estimation d’un N^ièmeensemble de valeurs d’une dérivée d’ordre N du signal à partir de la méthode du différenciateur homogène en temps-fini sur la pluralité de premières fenêtres prédéfinies ;
Calcul d’un premier ensemble de coefficients d’un système d’équations différentielles du N^ièmeordre représentant un premier modèle dont les solutions sont le premier et le second ensemble de valeurs.

According to one embodiment, the characterization method comprises:

Estimation of an ^Nth set of values of an N-order derivative of the signal using the finite-time homogeneous differentiator method over the plurality of first predefined windows;
Calculation of a first set of coefficients of a system of ^Nth order differential equations representing a first model whose solutions are the first and the second set of values.

Selon un mode de réalisation, le premier modèle est un modèle linéaire et s’écrit :

,

According to one embodiment, the first model is a linear model and is written:

,

où G, F et H sont des matrices dont les coefficients sont à déterminer, la dimension de la matrice étant déterminée par l’ordre du système linéaire d’équations différentielles, y(t) est le signal qu’on cherche à identifier approchant le second signal, et x(t) est une variable d’état interne au premier modèle.where G, F and H are matrices whose coefficients are to be determined, the dimension of the matrix being determined by the order of the linear system of differential equations, y(t) is the signal that we seek to identify approaching the second signal, and x(t) is a state variable internal to the first model.

Selon un mode de réalisation, le premier modèle est un modèle non-linéaire et s’écrit :According to one embodiment, the first model is a non-linear model and is written:

où G, F et H sont des matrices dont les coefficients sont à déterminer, la dimension de la matrice étant déterminée par l’ordre du système d’équations différentielles, NN désigne un filtre non-linéaire, y(t) est le signal qu’on cherche à identifier approchant le second signal, et x(t) est une variable d’état interne au premier modèle,

le procédé comportant en outre, un calcul d’un ensemble de coefficients d’un réseau de neurones à partir d’une régression configurée pour minimiser l’erreur.

where G, F and H are matrices whose coefficients are to be determined, the dimension of the matrix being determined by the order of the system of differential equations, NN designates a non-linear filter, y(t) is the signal that 'we seek to identify approaching the second signal, and x(t) is a state variable internal to the first model,

the method further comprising a calculation of a set of coefficients of a neural network from a regression configured to minimize the error.

Selon un mode de réalisation, le calcul de l’étape de minimisation de l’erreur est réalisé à partir de la mise en œuvre d’un réseau de neurones dont les coefficients sont calculés par une régression pour minimiser l’erreur.According to one embodiment, the calculation of the error minimization step is carried out from the implementation of a neural network whose coefficients are calculated by a regression to minimize the error.

Un avantage est la simplicité et la rapidité de l’apprentissage.An advantage is the simplicity and speed of learning.

Selon un mode de réalisation, à chaque itération une première condition est vérifiée, ladite première condition étant la vérification que l’erreur est inférieure à un seuil prédéfini et/ou qu’une seconde condition est vérifiée, ladite seconde condition étant la vérification qu’un nombre d’itérations prédéfini a été réalisé, le procédé comportant une étape de génération des coefficients du filtre dès qu’une des deux conditions est vérifiée.According to one embodiment, at each iteration a first condition is verified, said first condition being the verification that the error is less than a predefined threshold and/or that a second condition is verified, said second condition being the verification that a predefined number of iterations was carried out, the method comprising a step of generating the filter coefficients as soon as one of the two conditions is verified.

Selon un autre aspect, l’invention concerne un produit programme d’ordinateur comportant une mémoire et un calculateur pour exécuter des instructions de codes mettant en œuvre le procédé de l’invention.According to another aspect, the invention relates to a computer program product comprising a memory and a calculator for executing code instructions implementing the method of the invention.

Un avantage est de permettre une caractérisation d’un filtre sur n’importe quel type d’équipement par exemple un téléphone intelligent, appelé Smartphone, ou encore tout autre équipement électronique mobile à partir duquel le filtre de caractérisation peut être élaboré puis transféré sur un équipement de plongée.An advantage is to allow characterization of a filter on any type of equipment, for example a smart phone, called a Smartphone, or any other mobile electronic equipment from which the characterization filter can be developed then transferred to a diving equipment.

Selon un autre aspect, l’invention concerne un procédé de traitement d’une voix d’un sujet donné comprenant :

Acquisition d’un signal correspondant à une voix acquise par un micro agencé dans dispositif d’aide à la respiration sous-marine destiné à être porté dans la bouche d’un sujet, la voix acquise étant définie par un signal audio acquis en temps réel correspondant à une voix d’un sujet déformée par la présence de l’embout buccal en bouche ;
Application d’un premier modèle entrainé selon la méthode de caractérisation du filtre de l’invention, ladite application visant à obtenir un second signal de voix transformée à partir du premier signal de la voix déformée par un dispositif d’aide à la respiration sous-marine.

According to another aspect, the invention relates to a method of processing a voice of a given subject comprising:

Acquisition of a signal corresponding to a voice acquired by a microphone arranged in an underwater breathing aid device intended to be worn in the mouth of a subject, the acquired voice being defined by an audio signal acquired in real time corresponding to a subject's voice distorted by the presence of the mouthpiece in the mouth;
Application of a first model trained according to the filter characterization method of the invention, said application aiming to obtain a second voice signal transformed from the first voice signal distorted by a device for assisting underwater breathing. Marine.

Un avantage est d’équiper en avance des équipements avec une pré-configuration du filtre et d’adapter l’entrainement avec la voix d’un individu donné.An advantage is to equip equipment in advance with a pre-configuration of the filter and to adapt the training with the voice of a given individual.

Selon un autre aspect, l’invention concerne un dispositif de communication comprenant un embout buccal destiné à être porté dans la bouche d’un sujet comprenant :

un micro pour acquérir un signal acoustique provenant de la voix du sujet ;

According to another aspect, the invention relates to a communication device comprising a mouthpiece intended to be worn in the mouth of a subject comprising:

a microphone for acquiring an acoustic signal from the subject's voice;

une carte électronique comportant un filtre audio, un convertisseur analogique-numérique, un calculateur pour échantillonner le signal numérisé, une mémoire pour enregistrer les coefficients d’un filtre à appliquer au signal échantillonné et un calculateur permettant de générer un signal de sortie, lesdits coefficients enregistrés étant calculés à partir d’une méthode de l’invention.an electronic card comprising an audio filter, an analog-digital converter, a calculator for sampling the digitized signal, a memory for recording the coefficients of a filter to be applied to the sampled signal and a calculator for generating an output signal, said coefficients recorded being calculated from a method of the invention.

Description of the invention

L’invention se rapporte à un premier procédé mis en œuvre par ordinateur pour configurer un filtre de traitement et de correction d’un flux audio. Ce procédé est dénommé procédé de caractérisation d’un filtre pour le traitement d’une voix.The invention relates to a first computer-implemented method for configuring a filter for processing and correcting an audio stream. This process is called the process of characterizing a filter for processing a voice.

L’invention concerne également un second procédé mis en œuvre par ordinateur destiné à être embarqué et exécuté dans un équipement électronique pour communiquer notamment sous l’eau. Ce second procédé met en œuvre un filtre caractérisé par le premier procédé.The invention also relates to a second computer-implemented method intended to be embedded and executed in electronic equipment to communicate in particular underwater. This second method implements a filter characterized by the first method.

Ces deux procédés peuvent être exécutés l’un dans le prolongement de l’autre de manière successive ou de manière espacée dans le temps. Toutefois, on note que le premier procédé peut être exécuté au préalable du second procédé ou d’une alternative du second procédé. Néanmoins, il réalise le même objectif final de traiter le flux audio d’un plongeur ayant un embout buccal en bouche lorsqu’il s’exprime pour le rendre intelligible.These two processes can be carried out as an extension of the other successively or spaced out over time. However, it should be noted that the first process can be executed before the second process or an alternative to the second process. However, it achieves the same end goal of processing the audio stream of a diver with a mouthpiece in their mouth as they speak to make it intelligible.

L’invention concerne également le dispositif de communication formant un élément d’un système de Talkie-Walkie sous-marin. Il s’agit d’un appareil de communication vocal entre plongeurs. Ce dispositif comporte généralement un embout contenant l’électronique nécessaire pour enregistrer et restituer le son. L’embout comporte une interface électronique telle qu’un câble lui permettant d’être relié à un boîtier. Une carte électronique agencée dans le boitier comprend des composants pour enregistrer les coefficients du filtre définissant sa caractérisation et pour réaliser les traitements de données et d’émission et de réception du signal audio.The invention also relates to the communication device forming an element of an underwater Walkie-Talkie system. This is a voice communication device between divers. This device generally includes a tip containing the electronics necessary to record and reproduce sound. The tip includes an electronic interface such as a cable allowing it to be connected to a box. An electronic card arranged in the housing includes components for recording the coefficients of the filter defining its characterization and for carrying out data processing and transmission and reception of the audio signal.

Un intérêt de l’invention est de créer un appareil de communication vocale pour la plongée sous-marine en ne modifiant pas ou peu l’équipement du plongeur. Cette contrainte implique d’utiliser un embout de détendeur comme moyen de réception et d’émission du signal sonore audible. La voix étant modifiée par l’embout et son intelligibilité étant ainsi diminuée, l’invention permet de récupérer une partie de l’intelligibilité dégradée grâce au filtre qui a été caractérisé par le premier procédé.An advantage of the invention is to create a voice communication device for scuba diving with little or no modification to the diver's equipment. This constraint involves using a regulator tip as a means of receiving and emitting the audible sound signal. The voice being modified by the mouthpiece and its intelligibility thus being reduced, the invention makes it possible to recover part of the degraded intelligibility thanks to the filter which was characterized by the first process.

Microphone

Selon un mode de réalisation, le micro peut être fixé sur une partie mécanique conduisant le flux d’air créé par la voix du plongeur, cette partie mécanique comprend au moins un canal ou une lumière s’étendant entre l’embout et le détendeur. Ainsi, selon différents modes de réalisation, le micro peut être fixé entre l’embout et le détendeur de manière à capter le flux audio prononcé par un plongeur. L’invention se rapporte à tous ces modes de réalisation.According to one embodiment, the microphone can be fixed on a mechanical part conducting the air flow created by the diver's voice, this mechanical part comprises at least one channel or a light extending between the mouthpiece and the regulator. Thus, according to different embodiments, the microphone can be fixed between the mouthpiece and the regulator so as to capture the audio stream spoken by a diver. The invention relates to all of these embodiments.

Selon un autre exemple, le microphone peut être intégré dans un revêtement élastique peut être protégé par une paroi de protection agencée frontalement au capteur du microphone. La paroi de protection permet avantageusement de réduire les difficultés d’intégration du microphone dans le revêtement.According to another example, the microphone can be integrated into an elastic covering which can be protected by a protective wall arranged frontally on the microphone sensor. The protective wall advantageously reduces the difficulties of integrating the microphone into the covering.

L’embout buccal 1 comprend préférentiellement une portion labiale 42. La portion labiale 42 est destinée à être recouverte par les lèvres de l’utilisateur pendant utilisation. Ladite portion labiale 42 comprend une lumière ou des lumières permettant le passage de l’air de part et d’autre de la portion labiale. Avantageusement, cette lumière ou ces lumières permet(tent) le passage de l’air depuis le canal du détendeur vers la bouche du sujet pendant utilisation. La portion labiale 42 permet de créer avec les lèvres de l’utilisateur une fermeture étanche entre l’environnement extérieur et la ou les lumière(s) de passage d’air.The mouthpiece 1 preferably comprises a labial portion 42. The labial portion 42 is intended to be covered by the lips of the user during use. Said labial portion 42 comprises a light or lights allowing the passage of air on either side of the labial portion. Advantageously, this light or these lights allow the passage of air from the channel of the regulator towards the mouth of the subject during use. The labial portion 42 makes it possible to create with the user's lips a tight closure between the external environment and the air passage light(s).

La portion labiale 42 s’étend entre une zone de respiration comprenant les plateaux de conduction acoustique 20 et un moyen de connexion 43. Le moyen de connexion 43 permet avantageusement de faire coopérer l’embout avec un détendeur de plongée. Selon un mode de réalisation, le moyen de connexion 43 est de forme tubulaire dont les parois sont rigides ou souples.The labial portion 42 extends between a breathing zone comprising the acoustic conduction plates 20 and a connection means 43. The connection means 43 advantageously allows the mouthpiece to cooperate with a diving regulator. According to one embodiment, the connection means 43 is of tubular shape whose walls are rigid or flexible.

Procédé de caractérisation / procédé de traitementCharacterization process / treatment process

Les figures 1 et 2 se rapportent plus particulièrement aux étapes d’un procédé de traitement d’un flux audio PROC₁mettant en œuvre un filtre caractérisé par un procédé de caractérisation du filtre selon l’invention.Figures 1 and 2 relate more particularly to the steps of a process for processing an audio stream PROC ₁ implementing a filter characterized by a method for characterizing the filter according to the invention.

Le flux audio traité par le procédé de traitement PROC₁est noté S_A(t). Ce flux audio est acquis et filtré grâce au filtre F₁caractérisé par un premier procédé de caractérisation du filtre CARAC₁. L’application du filtre F₁permet de reconstruire une partie de l’intelligibilité perdue par la présence de l’embout EM₁en bouche d’un individu.The audio stream processed by the PROC ₁ processing method is denoted S _A (t). This audio stream is acquired and filtered using the filter F ₁ characterized by a first method of characterizing the filter CARAC ₁ . The application of the F ₁ filter makes it possible to reconstruct part of the intelligibility lost by the presence of the EM ₁ tip in an individual's mouth.

L’invention concerne donc un procédé noté PROC₁sur la qui consiste à appliquer APP₁le filtre F₁à une entrée acquise ACQ₀d’un flux audio correspondant à la voix d’un individu, le filtre F₁étant caractérisé par un premier procédé noté CARAC₁qui est décrit ci-après grâce aux figures 3 et 4.The invention therefore relates to a process denoted PROC ₁ on the which consists of applying APP ₁ the filter F ₁ to an acquired input ACQ ₀ of an audio stream corresponding to the voice of an individual, the filter F ₁ being characterized by a first process denoted CARAC ₁ which is described below thanks to in Figures 3 and 4.

Les figures 3 et 4 se rapportent donc plus particulièrement aux étapes d’un procédé de caractérisation CARAC₁d’un filtre à partir de quelques séquences prononcées par un utilisateur ayant un embout en bouche et les mêmes séquences prononcés sans embout en bouche. Le filtre étant ensuite utilisé pour l’équipement du plongeur.Figures 3 and 4 therefore relate more particularly to the steps of a CARAC ₁ characterization process of a filter from a few sequences spoken by a user with a mouthpiece in the mouth and the same sequences pronounced without a mouthpiece in the mouth. The filter is then used for the diver's equipment.

Les flux audio traités par le procédé de caractérisation CARAC₁d’un filtre F₁sont notés S₁(t) et S₂(t).The audio streams processed by the CARAC ₁ characterization process of a filter F ₁ are denoted S ₁ (t) and S ₂ (t).

Le procédé CARAC₁comporte un apprentissage d’une fonction caractérisant le filtre afin d’être adapté pour corriger la déformation induite par la présence d’un embout en bouche et pour prendre en compte un type de voix, sa nature, sa tessiture, son timbre, etc.The CARAC ₁ process includes learning a function characterizing the filter in order to be adapted to correct the deformation induced by the presence of a mouthpiece and to take into account a type of voice, its nature, its range, its stamp, etc.

La représente notamment le passage du flux audio et son en codage au sein de différents composants. A cet effet, la représente l’embout buccal EM₁, noté également 1 sur la , d’un dispositif de communication sous-marine. Il peut comprendre une portion à maintenir en bouche par exemple formé par deux plateaux de conductions acoustiques 20. L’embout 1 comprend préférentiellement une portion labiale 42 et une connexion 43 avec par exemple un détendeur (non représenté). Des ouvertures 61 et 62 sont préférentiellement ménagées pour le passage de l’air entre la bouche et le détendeur. Un microphone MIC (non représenté sur la ) permet d’acquérir les sons produits par un individu parlant dans l’embout buccal 1 avant de transmettre le signal à un convertisseur analogique numérique CAN₁. L’invention se rapporte à toute autre architecture mécanique d’un embout buccal 1 destiné à être raccordé à un détendeur. Notamment, le dispositif de l’invention peut comprendre différentes variantes, notamment sur le nombre de canaux envisagés, sur la présence ou non de plateaux de conductions acoustiques, leur disposition et leur matériau.There represents in particular the passage of the audio and sound flow in coding within different components. For this purpose, the represents the mouthpiece EM ₁ , also noted 1 on the , of an underwater communication device. It may include a portion to be held in the mouth, for example formed by two acoustic conduction plates 20. The tip 1 preferably comprises a labial portion 42 and a connection 43 with, for example, a regulator (not shown). Openings 61 and 62 are preferably provided for the passage of air between the mouth and the regulator. A MIC microphone (not shown on the ) makes it possible to acquire the sounds produced by an individual speaking in the mouthpiece 1 before transmitting the signal to an analog digital converter CAN ₁ . The invention relates to any other mechanical architecture of a mouthpiece 1 intended to be connected to a regulator. In particular, the device of the invention can include different variants, in particular on the number of channels envisaged, on the presence or not of acoustic conduction plates, their arrangement and their material.

Dans l’exemple de la , un élément vibrant 3 peut être utilisé pour retranscrire l’audio émis par un autre individu et ainsi permettre une écoute par conduction osseuse. Un autre système pourrait toutefois être utilisé afin de retranscrire la voix et son intelligibilité qui a été traitée par le procédé de l’invention.In the example of the , a vibrating element 3 can be used to transcribe the audio emitted by another individual and thus allow listening by bone conduction. Another system could, however, be used to transcribe the voice and its intelligibility which has been processed by the method of the invention.

Phonemes for learning

La décrit deux branches d’un procédé. La première branche comprend une étape ACQ₁d’acquisition d’un flux audio entrant S₁(t) comportant un premier ensemble de phonèmes déformé par l’embout buccal 1. La seconde branche comporte l’acquisition d’un flux audio entrant S₂(t) comportant le même ensemble de phonèmes définissant un second flux audio non déformé par la présence d’un embout buccal 1. Préférentiellement, les phonèmes sont les mêmes et sont préférentiellement acquis par le même utilisateur. En effet, afin de permettre un apprentissage et une définition des coefficients du filtre optimal, il est nécessaire d’isoler au maximum l’influence de la déformation liée à la présence de l’embout 1 en bouche. Ainsi, l’invention est particulièrement pertinente dans le cas où les deux séquences de phonèmes sont sensiblement les mêmes. L’apprentissage sera également d’autant plus performant lorsqu’un même individu prononcera les deux flux audio S₁(t) et S₂(t)There describes two branches of a process. The first branch comprises a step ACQ ₁ of acquiring an incoming audio stream S ₁ (t) comprising a first set of phonemes distorted by the mouthpiece 1. The second branch comprises the acquisition of an incoming audio stream S ₂ (t) comprising the same set of phonemes defining a second audio stream not distorted by the presence of a mouthpiece 1. Preferably, the phonemes are the same and are preferentially acquired by the same user. Indeed, in order to allow learning and definition of the coefficients of the optimal filter, it is necessary to isolate as much as possible the influence of the deformation linked to the presence of the tip 1 in the mouth. Thus, the invention is particularly relevant in the case where the two sequences of phonemes are substantially the same. Learning will also be even more efficient when the same individual pronounces the two audio streams S ₁ (t) and S ₂ (t)

Les phonèmes peuvent par exemple correspondre à quelques phrases à prononcer avec et sans embout buccal 1. L’invention propose ainsi un mode de réalisation dans lequel un individu prononce deux mêmes groupes de phrases avec et sans embout.The phonemes can for example correspond to a few sentences to be pronounced with and without a mouthpiece 1. The invention thus proposes an embodiment in which an individual pronounces two same groups of sentences with and without a mouthpiece.

Les phonèmes peuvent correspondre à un groupement de lettres, de syllabes ou de phrases. Les deux groupes comprennent préférentiellement un même groupe de phonèmes et un même ordonnancement de ces phonèmes au sein de la séquence prononcé par un individu, avec et sans appareil.Phonemes can correspond to a grouping of letters, syllables or sentences. The two groups preferably include the same group of phonemes and the same ordering of these phonemes within the sequence pronounced by an individual, with and without a device.

On comprend qu’un apprentissage d’un algorithme se basant sur quelques phrases prononcées, par exemple entre 1 et 10 phrases permet un apprentissage rapide pour l’utilisateur.We understand that learning an algorithm based on a few spoken sentences, for example between 1 and 10 sentences, allows rapid learning for the user.

L'enregistrement du flux audio réalisé avec l’embout buccal est traité notamment par un différenciateur. L’enregistrement du flux audio réalisé sans l’embout buccal est utilisé comme la sortie souhaitée lors de la création du filtre.The recording of the audio stream made with the mouthpiece is processed in particular by a differentiator. The recording of the audio stream made without the mouthpiece is used as the desired output when creating the filter.

Un différentiateur de l’invention peut comprendre un ensemble de composants pour effectuer les opérations mathématiques de différenciation, c'est-à-dire fournissant une sortie proportionnelle à la dérivée de l'entrée par rapport à une ou plusieurs variables.A differentiator of the invention may comprise a set of components for performing the mathematical operations of differentiation, i.e. providing an output proportional to the derivative of the input with respect to one or more variables.

Selon une première variante, l’invention permet de considérer dans l’algorithme l’ensemble des phrases prononcées comprenant une pluralité de lettres, syllabes ou phrases. Dans ce cas, elles sont toutes traitées ensemble.According to a first variant, the invention makes it possible to consider in the algorithm all of the pronounced sentences comprising a plurality of letters, syllables or sentences. In this case, they are all processed together.

Selon une seconde variante, l’invention permet de considérer chaque phrase dans l’algorithme de manière séquentielle et indépendante les unes des autres. Dans ce cas, chaque phrase prononcée comprend une pluralité de lettres, syllabes et chaque phrase est traitée indépendamment des autres.According to a second variant, the invention makes it possible to consider each sentence in the algorithm sequentially and independently of each other. In this case, each spoken sentence includes a plurality of letters, syllables and each sentence is processed independently of the others.

Les phrases peuvent être définies de manière à obtenir une répartition de syllabes les plus représentatives du langage pouvant être utilisé dans un environnement sous-marin et dans une langue donnée.The sentences can be defined in such a way as to obtain a distribution of syllables most representative of the language that can be used in an underwater environment and in a given language.

Selon un exemple de réalisation, les phrases sélectionnées pour l’apprentissage et la caractérisation du filtre F₁comprennent un échantillon représentatif des différentes labiales, notamment des syllabes comprenant les lettres suivantes : B, P, M, F, V.According to an exemplary embodiment, the sentences selected for learning and characterization of the filter F ₁ comprise a representative sample of the different labials, in particular syllables comprising the following letters: B, P, M, F, V.

Selon un mode de réalisation, les phrases peuvent être affichées sur un afficheur de l’équipement de plongée de sorte que ce dernier les lise. Selon un autre exemple, les phrases sont affichées sur un afficheur d’un terminal électronique tel que celui d’un Smartphone au moyen d’une application permettant de sélectionner une langue. Ainsi, on comprend que le procédé de caractérisation CARAC₁peut être effectué selon les modes de réalisation directement sur l’équipement de plongée ou sur un Smartphone. Lorsque le filtre est ainsi caractérisé à partir d’un Smartphone, différentes possibilités peuvent être mises en œuvre pour transférer le filtre caractérisé sur une mémoire d’une carte électronique disposée sur l’équipement de plongée, telle que l’embout buccal.According to one embodiment, the sentences can be displayed on a display of the diving equipment so that the latter reads them. According to another example, the sentences are displayed on a display of an electronic terminal such as that of a Smartphone by means of an application allowing a language to be selected. Thus, we understand that the CARAC ₁ characterization method can be carried out according to the embodiments directly on the diving equipment or on a Smartphone. When the filter is thus characterized from a Smartphone, different possibilities can be implemented to transfer the characterized filter to a memory of an electronic card placed on the diving equipment, such as the mouthpiece.

Selon un mode de réalisation, l’affichage des phrases comprend un curseur permettant d’indiquer une vitesse de lecture. L’affichage peut comprendre des indications graphiques permettant d’indiquer à un utilisateur une séquence de lecture avec ou sans embout en bouche afin d’acquérir les deux ensembles de phrases.According to one embodiment, the display of sentences includes a cursor making it possible to indicate a reading speed. The display may include graphic indications making it possible to indicate to a user a reading sequence with or without a mouthpiece in order to acquire the two sets of sentences.

F1 filter calculation resources

Afin de réaliser cette étape préliminaire, selon un premier mode de réalisation, un utilisateur peut utiliser un terminal électronique tel qu’un Smartphone, c’est-à-dire un téléphone intelligent ou tout autre disposition comportant un micro. Les deux séquences de phrases prononcées peuvent être enregistrées grâce au microphone du terminal électronique et être émises à un serveur distant pour réaliser le procédé de l’invention. Selon ce mode, le procédé est réalisé à distance et peut profiter de ressources de calculs importantes.In order to carry out this preliminary step, according to a first embodiment, a user can use an electronic terminal such as a Smartphone, that is to say a smart phone or any other arrangement comprising a microphone. The two sequences of spoken sentences can be recorded using the microphone of the electronic terminal and transmitted to a remote server to carry out the method of the invention. According to this mode, the process is carried out remotely and can benefit from significant calculation resources.

Notamment, selon un mode préféré, les caractéristiques du filtre peuvent être calculées par un serveur distant disposant de ressources de calculs importantes sans contraintes matérielles liées à un équipement mobile en particulier. A cet effet, le signal de la voix émise par un utilisateur peut être acquis par un dispositif électronique mobile, tel qu’un Smartphone, autrement appelé téléphone intelligent, et être transmis à un serveur distant. Le filtre calculé par le serveur distant peut dans un second temps être réémis vers le Smartphone ou l’équipement électronique embarqué de plongée.In particular, according to a preferred mode, the characteristics of the filter can be calculated by a remote server having significant calculation resources without hardware constraints linked to a particular mobile equipment. For this purpose, the voice signal emitted by a user can be acquired by a mobile electronic device, such as a Smartphone, otherwise called a smart phone, and transmitted to a remote server. The filter calculated by the remote server can then be re-transmitted to the Smartphone or on-board electronic diving equipment.

Selon un second mode de réalisation, un utilisateur peut utiliser l’équipement de plongée comportant un module électronique comportant un calculateur et une mémoire. Dans ce cas de figure, l’apprentissage est réalisé à partir de ressources moins importantes qu’un serveur. L’algorithme est alors optimisé pour être exécuté dans un environnement contraint. Un avantage est de n’avoir pas besoin de connexion réseau, ce cas peut se révéler intéressant dans un environnement de plongée en mer par exemple lorsqu’on se situe sur un bateau loin des côtes. Un autre avantage est d’utiliser le micro de l’équipement de plongée. Ainsi, l’apprentissage sera d’autant plus efficace qu’il s’affranchira des écarts liés aux différences de matériels entre l’équipement servant pour l’apprentissage et celui utilisé pour l’exploitation du filtre caractérisé.According to a second embodiment, a user can use diving equipment comprising an electronic module comprising a computer and a memory. In this case, learning is carried out using fewer resources than a server. The algorithm is then optimized to be executed in a constrained environment. An advantage is not needing a network connection, this case can be interesting in a sea diving environment for example when you are on a boat far from the coast. Another advantage is to use the microphone of the diving equipment. Thus, learning will be all the more effective as it overcomes the gaps linked to differences in equipment between the equipment used for learning and that used for operating the characterized filter.

Selon un troisième mode de réalisation hybride, le filtre peut être calculé sur un terminal électronique de type Smartphone à partir d’une application téléchargée permettant d’exploiter les ressources de calculs du terminal électronique. Un intérêt est de réaliser le filtre avec un peu plus de ressources que le cas d’une carte électronique embarquée dans l’équipement de plongée tout en permettant de réaliser l’opération localement, c’est-à-dire sans mise en œuvre d’un serveur distant.According to a third hybrid embodiment, the filter can be calculated on an electronic terminal of the Smartphone type from a downloaded application making it possible to exploit the calculation resources of the electronic terminal. An advantage is to produce the filter with a little more resources than the case of an electronic card embedded in the diving equipment while allowing the operation to be carried out locally, that is to say without implementing a 'a remote server.

Lorsque l’opération est réalisée par un terminal électronique de type Smartphone, il peut être prévu une liaison sans fil permettant le chargement du filtre F₁calculé par une autre ressource que celle de l’équipement de plongée pour qu’il soit enregistré dans une mémoire ou une ressource dudit équipement de plongée.When the operation is carried out by an electronic terminal of the Smartphone type, a wireless connection can be provided allowing the loading of the filter F ₁ calculated by a resource other than that of the diving equipment so that it is recorded in a memory or resource of said diving equipment.

Audio signal acquisition mode

Le signal issu du micro avec l’embout buccal agencé dans la bouche d’un individu est dénommé le premier signal S₁, le signal acquis par un micro sans que l’embout buccal soit agencé dans la bouche d’un individu est dénommé le second signal S₂. Ces signaux S₁et S₂sont acquis dans l’objectif de caractériser un filtre audio.The signal from the microphone with the mouthpiece arranged in the mouth of an individual is called the first signal S ₁ , the signal acquired by a microphone without the mouthpiece being arranged in the mouth of an individual is called the second signal S ₂ . These signals S ₁ and S ₂ are acquired with the aim of characterizing an audio filter.

Préférentiellement, un même et unique micro est utilisé pour acquérir les deux signaux S₁et S₂. Ce mode de réalisation permet de s’affranchir des caractéristiques intrinsèques du micro et de l’influence de ce dernier sur le signal acquis. Selon un mode de réalisation, ce micro est le micro de l’équipement de plongé, par exemple celui agencé dans la partir de l’embout distal situé entre la partie destinée à être positionnée en bouche et le détendeur. Selon un autre mode de réalisation, le micro est le micro d’un terminal électronique tel que celui d’un téléphone mobile de type Smartphone. Dans ce dernier cas, un individu positionne l’embout buccal 1 en bouche et dit les quelques phrases à proximité du micro du terminal électronique pour entrainer l’algorithme exécuté par ce dernier. Selon un mode de réalisation, un filtre correctif connu peut être utilisé afin de corriger le signal acquis par le Smartphone.Preferably, the same and single microphone is used to acquire the two signals S ₁ and S ₂ . This embodiment makes it possible to overcome the intrinsic characteristics of the microphone and the influence of the latter on the acquired signal. According to one embodiment, this microphone is the microphone of the diving equipment, for example that arranged in the part of the distal tip located between the part intended to be positioned in the mouth and the regulator. According to another embodiment, the microphone is the microphone of an electronic terminal such as that of a smartphone type mobile telephone. In the latter case, an individual positions the mouthpiece 1 in the mouth and says the few sentences near the microphone of the electronic terminal to train the algorithm executed by the latter. According to one embodiment, a known corrective filter can be used to correct the signal acquired by the Smartphone.

D’autres modes de réalisation peuvent être mis en œuvre, notamment des modes impliquant l’utilisation des signaux acquis par des différents micros.Other embodiments can be implemented, in particular modes involving the use of signals acquired by different microphones.

Selon un mode de réalisation, les deux signaux S₁et S₂encodés numériquement après leur conversion dans un convertisseur analogique – numérique, appelée CAN. La conversion et l’encodage comprennent avantageusement un échantillonnage de chaque signal.According to one embodiment, the two signals S ₁ and S ₂ encoded digitally after their conversion in an analog – digital converter, called CAN. The conversion and encoding advantageously include sampling of each signal.

De la même manière lors de l’acquisition du signal dans le procédé de traitement utilisant un filtre caractérisé, une étape de conversion analogique - numérique est mise en œuvre et un échantillonnage est opéré du signal audio acquis.In the same way during the acquisition of the signal in the processing method using a characterized filter, an analog - digital conversion step is implemented and sampling is carried out of the acquired audio signal.

Selon un mode de réalisation, le premier signal S₁correspondant aux phrases prononcées avec l’embout buccal en bouche sont échantillonnés selon un premier échantillonnage ECH₁. Les échantillonnages des signaux mis en œuvre dans le procédé de caractérisation CARAC₁du filtre F₁et dans le procédé de traitement PROC₁peuvent être identiques, c’est-à-dire que la fréquence d’échantillonnage, les mesures réalisées sur les échantillons par regroupement de ces derniers en considérant des fenêtres temporelles sont configurés identiquement. Toutefois, l’invention pourrait mettre en œuvre un échantillonnage propre à la caractérisation du filtre F₁et un échantillonnage propre au procédé de traitement du flux audio prononcé par un plongeur. Typiquement, dans ce mode de réalisation, les fréquences d’échantillonnage pourraient être différentes. Un intérêt est de consommer moins d’information lors des étapes de traitement lors d’une plongée dans laquelle on souhaite conserver le plus longtemps possible la liaison de communication alors que dans la phase d’apprentissage et de caractérisation du filtre F₁, un échantillonnage plus fin, c’est-à-dire permettant d’obtenir plus d’échantillons permet d’obtenir un filtre F₁plus performant.According to one embodiment, the first signal S ₁ corresponding to the sentences pronounced with the mouthpiece in the mouth are sampled according to a first ECH ₁ sampling. The sampling of the signals implemented in the CARAC ₁ characterization process of the F ₁ filter and in the PROC ₁ processing process can be identical, that is to say that the sampling frequency, the measurements carried out on the samples by grouping the latter by considering time windows are configured identically. However, the invention could implement sampling specific to the characterization of the filter F ₁ and sampling specific to the process of processing the audio stream spoken by a diver. Typically, in this embodiment, the sampling frequencies could be different. An advantage is to consume less information during the processing stages during a dive in which we wish to keep the communication link as long as possible while in the learning and characterization phase of the filter F ₁ , sampling finer, that is to say allowing more samples to be obtained, makes it possible to obtain a more efficient F ₁ filter.

Selon un exemple de réalisation, un échantillonnage aboutissant à l’enregistrement de valeurs numériques est configuré pour une valeur haute à 96000 échantillons par seconde. Toutefois, un nombre d’échantillon plus faible peut permettre d’obtenir de très bons résultats également avec une valeur comprise entre 4000 et 8000 échantillons par seconde.According to an exemplary embodiment, sampling leading to the recording of digital values is configured for a high value of 96,000 samples per second. However, a lower number of samples can also allow very good results to be obtained with a value between 4000 and 8000 samples per second.

Lors de l’acquisition du signal audio, un nombre d’échantillons glissants est conservé en mémoire pendant des cycles de calculs en temps réels des valeurs moyennes. Un nombre d’échantillons compris entre 200 et 400 échantillons est conservés à chaque cycle de calculs en temps réel.When acquiring the audio signal, a number of sliding samples is kept in memory during real-time calculation cycles of the average values. A number of samples of between 200 and 400 samples is kept for each real-time calculation cycle.

Additional band-pass or low-pass filter

Selon un exemple de réalisation, un filtrage préalable peut être réalisé avant l’échantillonnage. Ce dernier filtrage a pour objectif d’éliminer un bruit ou de restreindre la bande d’analyse du signal.According to an exemplary embodiment, prior filtering can be carried out before sampling. This last filtering aims to eliminate noise or restrict the signal analysis band.

Selon un exemple, l’invention comprend la mise en œuvre d’un filtre passe-bas dont la fréquence de coupure dépend de la fréquence d'échantillonnage. Selon un exemple de réalisation, un filtre adapté permet de filtrer l’effet produit par les bulles lors de la respiration ou lors d’émission d’un son par la bouche d’un plongeur. La fréquence spécifique peut être filtré de manière à améliorer l’intelligibilité en amont de la construction du filtre caractéristique F₁.According to one example, the invention comprises the implementation of a low-pass filter whose cutoff frequency depends on the sampling frequency. According to an exemplary embodiment, a suitable filter makes it possible to filter the effect produced by the bubbles during breathing or when emitting a sound through the mouth of a diver. The specific frequency can be filtered so as to improve intelligibility upstream of the construction of the characteristic filter F ₁ .

Sample selection

Selon un exemple de réalisation, afin de diminuer le temps de calcul, une sélection d’échantillons est choisie. A cette fin, la sélection d’un ensemble d’échantillons revient à choisir un ensemble de fenêtres temporelles Tf₁comportant chacune un sous ensemble d’échantillons à partir desquels les valeurs moyennes seront calculées.According to an exemplary embodiment, in order to reduce the calculation time, a selection of samples is chosen. To this end, the selection of a set of samples amounts to choosing a set of time windows Tf ₁ each comprising a subset of samples from which the average values will be calculated.

Afin de sélectionner les meilleurs échantillons ou les meilleurs groupes d’échantillons pour définir un filtre caractéristique F₁performant, une étape de calcul de fréquences caractéristiques du spectre du signal acquis peut être réalisée. Un intérêt est de sélectionner quelques fréquences comportant soit les plus grandes intensités de signaux ou les fréquences ayant la grande densité spectrale, par exemple en considérant la puissance spectrale du signal acquis. Ces fréquences sélectionnées permettent de sélectionner un intervalle autour de la fréquence pour retenir des échantillons caractéristiques du flux audio.In order to select the best samples or the best groups of samples to define an efficient characteristic filter F ₁ , a step of calculating characteristic frequencies of the spectrum of the acquired signal can be carried out. An advantage is to select a few frequencies comprising either the greatest signal intensities or the frequencies having the greatest spectral density, for example by considering the spectral power of the acquired signal. These selected frequencies make it possible to select an interval around the frequency to retain samples characteristic of the audio stream.

Cette méthode permet un gain de temps et de calculs et permet donc d’embarquer le procédé de l’invention dans un composant électronique sur lequel l’apprentissage peut être réalisé pour définir le filtre caractéristique F₁rapidement.This method saves time and calculations and therefore makes it possible to embed the method of the invention in an electronic component on which learning can be carried out to define the characteristic filter F ₁ quickly.

Selon un exemple entre 5 et 20 fréquences caractéristiques peuvent être configurés pour déterminer entre 5 et 20 intervalles autour de chaque fréquence permettant de définir entre 5 et 20 fenêtres temporelles d’échantillons.According to an example between 5 and 20 characteristic frequencies can be configured to determine between 5 and 20 intervals around each frequency making it possible to define between 5 and 20 sample time windows.

Speed and acceleration of spoken sounds

La représente une sortie S₁(t) correspondant au flux audio acquis avec un embout et une sortie S₂(t) correspondant au flux audio acquis sans embout. On rappelle que les phrases constituées de phonèmes dans chaque signal S₁(t) et S₂(t) sont identiques dans cet exemple. On remarque sur le graphique de la que la durée du flux audio S₁(t) est plus longue que la durée du flux audio S₂(t) du fait que la vitesse d’élocution est ralentie par la présence de l’embout en bouche.There represents an output S ₁ (t) corresponding to the audio stream acquired with a tip and an output S ₂ (t) corresponding to the audio stream acquired without a tip. We recall that the sentences made up of phonemes in each signal S ₁ (t) and S ₂ (t) are identical in this example. We notice in the graph of the that the duration of the audio stream S ₁ (t) is longer than the duration of the audio stream S ₂ (t) due to the fact that the speed of speech is slowed down by the presence of the mouthpiece in the mouth.

Le procédé de l’invention permet de prendre en compte dans le modèle utilisé la déformation du son induite par la vitesse d’élocution et donc du son du flux audio qui est enregistrée et filtré.The method of the invention makes it possible to take into account in the model used the distortion of the sound induced by the speed of speech and therefore of the sound of the audio stream which is recorded and filtered.

Un intérêt de l’invention est de prendre en considération dans la caractérisation du filtre les artefacts du signal liés à la déformation du signal induite par le changement de vitesse et de l’accélération du son prononcé par un plongeur.An advantage of the invention is to take into consideration in the characterization of the filter the signal artifacts linked to the deformation of the signal induced by the change of speed and the acceleration of the sound pronounced by a diver.

Estimation of average quantities

Le procédé de l’invention s’appuie sur la mise en œuvre d’une méthode reposant sur la théorie de l’homogénéité par la définition d’un différentiateur homogène. Cette méthode permet de réaliser une estimation en temps fini de signaux bruités. Un avantage de la méthode de l’invention est que les algorithmes mis en œuvre sont simples, particulièrement rapides et de nature non asymptotique contrairement aux méthodes traditionnelles.The process of the invention is based on the implementation of a method based on the theory of homogeneity by the definition of a homogeneous differentiator. This method makes it possible to carry out a finite-time estimation of noisy signals. An advantage of the method of the invention is that the algorithms implemented are simple, particularly fast and non-asymptotic in nature unlike traditional methods.

Cette méthode permet l’estimation des dérivées du signal selon l’ordre choisi en considérant l’ensemble des données échantillonnées. Afin de lisser les estimations et de réduire les artefacts liés à des erreurs d’acquisitions ou du signal bruité, des valeurs moyennes sont calculées afin de rendre le procédé de l’invention plus robuste.This method allows the estimation of the derivatives of the signal according to the chosen order by considering all of the sampled data. In order to smooth the estimates and reduce artifacts linked to acquisition errors or noisy signals, average values are calculated in order to make the method of the invention more robust.

Le procédé permet d’estimer EST₁un premier ensemble de valeurs d’une donnée relative au signal acquis sur des fenêtres de temps prédéfinies Tf₁, chaque première fenêtre Tf₁comportant un nombre donné d’échantillons du signal.The method makes it possible to estimate EST ₁ a first set of values of data relating to the signal acquired over predefined time windows Tf ₁ , each first window Tf ₁ comprising a given number of samples of the signal.

Ainsi, selon un exemple, la grandeur physique relative au signal acquis est l’amplitude du signal. Selon un mode de réalisation, le procédé permet d’estimer les valeurs moyennes de l’amplitude du signal prise sur un ensemble d’échantillon d’une fenêtre temporelle donnée.Thus, according to an example, the physical quantity relating to the acquired signal is the amplitude of the signal. According to one embodiment, the method makes it possible to estimate the average values of the amplitude of the signal taken over a set of samples of a given time window.

D’autres grandeurs physiques peuvent être utilisées ou traitées selon le procédé de l’invention, toutefois le mode de réalisation si après détaillé est relatif autre traitement des valeurs moyennes des amplitudes du signal acquis.Other physical quantities can be used or processed according to the method of the invention, however the embodiment if detailed afterward is relative to other processing of the average values of the amplitudes of the acquired signal.

Le procédé prend en considération une valeur liée à la dynamique du signal : sa vitesse. A cet effet, le procédé de l’invention permet d’estimer EST₂un second ensemble de grandeurs physiques relatif à la dynamique du signal. Selon un exemple, les valeurs de vitesse du premier signal S₁sont estimées. La vitesse correspond à la rapidité avec laquelle les phrases sont prononcées. Afin de calculer la vitesse du premier signal S₁, une méthode dite du différenciateur homogène en temps fini DHT est mise en œuvre. Le différenciateur est appliqué sur une pluralité de premières fenêtres temporelles prédéfinies comportant chacune un ensemble d’échantillons.The method takes into consideration a value linked to the dynamics of the signal: its speed. For this purpose, the method of the invention makes it possible to estimate EST ₂ a second set of physical quantities relating to the dynamics of the signal. According to one example, the speed values of the first signal S ₁ are estimated. Speed refers to how quickly sentences are spoken. In order to calculate the speed of the first signal S ₁ , a method called the finite-time homogeneous differentiator DHT is implemented. The differentiator is applied to a plurality of first predefined time windows each comprising a set of samples.

Selon un exemple amélioré, le procédé prend en considération une autre valeur liée à la dynamique du signal : son accélération. A cet effet, le procédé de l’invention permet d’estimer EST₃un troisième ensemble de grandeurs physiques relatif à la dynamique du signal. Selon un exemple, les valeurs d’accélération du premier signal S₁sont estimées. L’accélération correspond aux variations de la vitesse avec laquelle les phrases sont prononcées. Afin de calculer l’accélération du premier signal S₁, une méthode équivalente dite du différenciateur homogène en temps fini DHT est mise en œuvre également. Le différenciateur est appliqué sur une pluralité de premières fenêtres temporelles prédéfinies comportant chacune un ensemble d’échantillons. Les échantillons sont préférentiellement les mêmes que ceux considérés pour estimer les vitesses.According to an improved example, the method takes into consideration another value linked to the dynamics of the signal: its acceleration. For this purpose, the method of the invention makes it possible to estimate EST ₃ a third set of physical quantities relating to the dynamics of the signal. According to one example, the acceleration values of the first signal S ₁ are estimated. Acceleration refers to variations in the speed with which sentences are spoken. In order to calculate the acceleration of the first signal S ₁ , an equivalent method called the homogeneous finite time differentiator DHT is also implemented. The differentiator is applied to a plurality of first predefined time windows each comprising a set of samples. The samples are preferably the same as those considered to estimate the speeds.

De la même manière, lors du procédé de traitement PROC₁, les valeurs des dérivées et accélérations du signal en entrée S_A(t) qui est issue d’un signal correspondant à un flux audio d’un plongeur acquis avec un embout en bouche peuvent être calculées. Un intérêt est de générer le flux de sortie S_B(t) à partir du filtre F₁caractérisé par le procédé de caractérisation CARAC₁obtenu dans la phase d’apprentissage.In the same way, during the PROC ₁ processing method, the values of the derivatives and accelerations of the input signal S _A (t) which comes from a signal corresponding to an audio stream of a diver acquired with a mouthpiece in the mouth can be calculated. An interest is to generate the output stream S _B (t) from the filter F ₁ characterized by the CARAC ₁ characterization method obtained in the learning phase.

Dynamic system modeling

Le procédé de l’invention permet de définir un modèle basé sur la modélisation des systèmes dynamiques et de résoudre le problème posé par l’obtention d’une solution approchée en optimisant les calculs par itération en minimisant une erreur. L’erreur peut être définie par la connaissance de l’entrée du système et la sortie du système. Une problématique résolue par l’invention est de définir un modèle qui converge suffisamment rapidement pour définir un filtre F₁opérationnel et performant permettant de reconstruire par la suite le flux audio déformé par la présence de l’embout.The method of the invention makes it possible to define a model based on the modeling of dynamic systems and to solve the problem posed by obtaining an approximate solution by optimizing the calculations by iteration while minimizing an error. Error can be defined by knowing the system input and system output. A problem solved by the invention is to define a model which converges quickly enough to define an operational and efficient filter F ₁ making it possible to subsequently reconstruct the audio stream distorted by the presence of the tip.

Le signal S₂(t) désigne la voix normale en entrée qui n’a pas subi de déformation, et peut définir la sortie souhaitée du modèle. Le modèle de déformation de la voix du fait de la présence de l’embout peut être représenté par cette égalité .The signal S ₂ (t) designates the normal input voice which has not undergone distortion, and can define the desired output of the model. The model of distortion of the voice due to the presence of the mouthpiece can be represented by this equality .

Un intérêt de la modélisation par un système dynamique est d’introduire un opérateur de différentiation par rapport à la variable temps t.- Dans une telle modélisation, on peut écrire , où p est l’opérateur de différentiation par rapport à la variable temps, p = d/dt.An advantage of modeling by a dynamic system is to introduce a differentiation operator with respect to the time variable t.- In such modeling, we can write , where p is the differentiation operator with respect to the time variable, p = d/dt.

Le signal S₂(t) défini la sortie du système dynamique dans la phase d’apprentissage du modèle. Il convient donc de définir le problème inverse consistant à trouver S₂(t) à partir de S₁(t).The signal S ₂ (t) defines the output of the dynamic system in the model learning phase. It is therefore appropriate to define the inverse problem consisting of finding S ₂ (t) from S ₁ (t).

Le problème inverse revient à calculer la fonction inverse W^-1qui représente un modèle apprenant qu’on cherche à optimiser afin de calculer le signal S_B(t) représenté sur la correspondant au flux audio corrigé lorsque le signal S₂(t) connu ne sera plus disponible. Les premières séquences audio S₂(t) permettent donc de réaliser un entrainement du modèle afin de paramétrer un filtre F₁audio entrainé, c’est-à-dire dont les coefficients ont été produits par un algorithme comportant une opération de régression ou d’itération visant à minimiser une erreur.The inverse problem amounts to calculating the inverse function W ^-1 which represents a learning model that we seek to optimize in order to calculate the signal S _B (t) represented on the corresponding to the corrected audio stream when the known signal S ₂ (t) will no longer be available. The first audio sequences S ₂ (t) therefore make it possible to carry out training of the model in order to parameterize a trained audio filter F ₁ , that is to say whose coefficients have been produced by an algorithm comprising a regression operation or d iteration aimed at minimizing an error.

Les coefficients ainsi produits sont les coefficients d’exploitation permettant de définir un filtre F1 opérationnel.The coefficients thus produced are the operating coefficients making it possible to define an operational F1 filter.

L’invention permet donc de générer un « modèle appris » pour dans un second temps utiliser un filtre caractéristique F₁au sein de l’équipement pour corriger le flux audio déformé par la présence de l’embout en bouche.The invention therefore makes it possible to generate a “learned model” to subsequently use a characteristic filter F ₁ within the equipment to correct the audio stream distorted by the presence of the mouthpiece in the mouth.

Le problème inverse peut s’écrire : .The opposite problem can be written as: .

Linear model

Ce qui peut s’écrire en termes de la théorie des systèmes dynamiques à l’équation suivante selon un modèle linéaire MOD₁où x(t) est une variable interne au système:Which can be written in terms of the theory of dynamic systems as the following equation according to a linear model MOD ₁ where x(t) is a variable internal to the system:

Selon un mode de réalisation, afin de résoudre ce système, une première hypothèse est de considérer un signal suffisamment lisse S₂(t) afin de considérer une fonction dérivable ayant un ensemble de dérivées constituant différents ordres de la fonction S₂(t) = y(t) où y(t) est la sortie du modèle MOD₁ou MOD₂. Ainsi, l’ensemble des dérivées, s’écrit [y⁽ⁿ⁾], « n » étant l’ordre la dérivée du signal échantillonné et étant supérieure ou égal à 1.According to one embodiment, in order to solve this system, a first hypothesis is to consider a sufficiently smooth signal S ₂ (t) in order to consider a differentiable function having a set of derivatives constituting different orders of the function S ₂ (t) = y(t) where y(t) is the output of the MOD ₁ or MOD ₂ model. Thus, the set of derivatives is written [y ⁽ⁿ⁾ ], “n” being the order of the derivative of the sampled signal and being greater than or equal to 1.

La sortie recherchée peut être modélisée par un vecteur z défini par le système d’équations différentielles qu’on cherche à estimer :The desired output can be modeled by a vector z defined by the system of differential equations that we seek to estimate:

z = [y⁽¹⁾, y⁽²⁾, y⁽³⁾, … y⁽ⁿ ^-1 ⁾]^Tetz = [y ⁽¹⁾ , y ⁽²⁾ , y ⁽³⁾ , … y ⁽ⁿ ^-1 ⁾ ] ^T and

y⁽ⁿ⁾(t) = Thêta(t)y ⁽ⁿ⁾ (t) = Theta(t)

Il s’agit de l’ensemble des dérivées premières, secondes, troisièmes et ainsi de suite jusqu’à la n^ièmedérivée. Le nombre de dérivées considérées définies la dimension du système.This is the set of first, second, third derivatives and so on up to the ^nth derivative. The number of derivatives considered defines the dimension of the system.

En considérant un ordre des dérivées retenues égal à 2 pour simplifier les calculs il est possible de définir un modèle MOD₁simple du système qui soit performant. La suite est décrite pour un ordre n afin d’illustrer un mode plus général de réalisation.By considering an order of derivatives retained equal to 2 to simplify the calculations, it is possible to define a simple MOD ₁ model of the system which is efficient. The following is described for an order n in order to illustrate a more general embodiment.

On peut définir le système équivalent suivant :We can define the following equivalent system:

Dans laquelle MA est définie par la matrice suivante :In which MA is defined by the following matrix:

Et « C » est un vecteur défini par le vecteur suivant :And “C” is a vector defined by the following vector:

C = [1, 0, 0 …, 0]C = [1, 0, 0…, 0]

Et est un vecteur défini par le vecteur suivant :And is a vector defined by the following vector:

Phi(t) = [0, …, Thêta(t)]^T Phi(t) = [0, …, Theta(t)] ^T

On note , où alpha est strictement supérieur à 0 et x est un réel.We notice , where alpha is strictly greater than 0 and x is a real number.

Le différentiateur homogène en temps fini est défini par le système suivant pour i = 2, …, n-1:The finite-time homogeneous differentiator is defined by the following system for i = 2, …, n-1:

Où est l’estimation de zOr is the estimate of z

Les coefficients k₁, …, k_nforment un polynôme de Hurwitz.The coefficients k ₁ , …, k _n form a Hurwitz polynomial.

Il est possible dans la résolution de ce système de choisir de manière appropriée pour assurer l’homogénéité du système les coefficients (a₁, …a_n) dans l’espace des réels strictement positifs de dimension n.It is possible in the resolution of this system to choose in an appropriate manner to ensure the homogeneity of the system the coefficients (a ₁ , …a _n ) in the space of strictly positive real numbers of dimension n.

Selon un exemple, pour un A donné dans l’intervalle [1 – 1/(n-1), 1], les séquences (r₁, …, r_n) et (a₁, …a_n) peuvent être choisi ainsi :According to an example, for a given A in the interval [1 – 1/(n-1), 1], the sequences (r ₁ , …, r _n ) and (a ₁ , …a _n ) can be chosen as follows :

, avec 1 ≤ i ≤ n , with 1 ≤ i ≤ n

De manière à assurer une convergence à temps fini, selon un exemple, la méthode peut être modélisée en prenant en compte les conditions nécessaires à la théorie de l’homogénéité. Dans ce cette hypothèse, il est possible d’écrire queIn order to ensure convergence in finite time, according to an example, the method can be modeled by taking into account the conditions necessary for the theory of homogeneity. In this hypothesis, it is possible to write that

⁽ ^j ^-1)(t) = _j(t), j = 1, …, n après un temps fini des transitoires. ⁽ ^j ^-1) (t) = _j (t), j = 1, …, n after a finite time of the transients.

La dynamique des erreurs de différenciation prend la forme du système suivant :

(1)

The dynamics of differentiation errors takes the form of the following system:

(1)

On connait, en ayant choisi les coefficients qui puissent définir un polynôme de Hurwitz qu’il existe A dans l’intervalle [1-1/(n-1), 1] suffisamment proche de 1 tel que l’équation (1) est globalement stable à temps fini.We know, by having chosen the coefficients which can define a Hurwitz polynomial, that there exists A in the interval [1-1/(n-1), 1] sufficiently close to 1 such that equation (1) is generally stable at finite time.

L’un des principaux avantages des systèmes homogènes stables en temps fini est la rapidité de leur taux de convergence et leur robustesse par rapport à différentes perturbations.One of the main advantages of stable homogeneous systems in finite time is the speed of their convergence rate and their robustness with respect to different disturbances.

En raison du terme Thêta(t), il est impossible d’obtenir la convergence de l’erreur à zéro sans avoir des connaissances supplémentaires sur ce signal Thêta(t). Ce problème peut être surmonté en supposant que y(t) est localement polynomial et que sur un petit intervalle de temps, Thêta(t) = 0. Dans ce cas, il est alors possible de récupérer les dérivées.Due to the term Theta(t), it is impossible to get the error to converge to zero without having additional knowledge about this Theta(t) signal. This problem can be overcome by assuming that y(t) is locally polynomial and that over a small time interval, Theta(t) = 0. In this case, it is then possible to recover the derivatives.

Selon un mode de réalisation, le procédé comportant un nombre d’itérations visant à répéter la séquence de calcul des coefficients jusqu’à ce qu’une erreur calculée soit inférieure à un seuil donné.According to one embodiment, the method comprising a number of iterations aimed at repeating the coefficient calculation sequence until a calculated error is less than a given threshold.

Afin de quantifier la performance des résultats obtenus avec l’algorithme final, une erreur est définie basée sur la norme de la différence des sons reconstruits et sons originels.In order to quantify the performance of the results obtained with the final algorithm, an error is defined based on the standard of the difference of the reconstructed sounds and original sounds.

Selon un mode de réalisation, à chaque itération une première condition C₁est vérifiée. Selon un exemple la première condition C₁correspond à la vérification que l’erreur est inférieure à un seuil prédéfini. Selon un exemple, une seconde condition C₂est vérifiée. La seconde condition C₂est la vérification qu’un nombre d’itérations prédéfini a été réalisé.According to one embodiment, at each iteration a first condition C ₁ is verified. According to an example, the first condition C ₁ corresponds to the verification that the error is less than a predefined threshold. According to one example, a second condition C ₂ is verified. The second condition C ₂ is the verification that a predefined number of iterations has been carried out.

Avantageusement, le procédé comporte une étape de génération des coefficients du filtre dès qu’une des deux conditions C₁ou C₂est vérifiée.Advantageously, the method includes a step of generating the filter coefficients as soon as one of the two conditions C ₁ or C ₂ is verified.

La succession des étapes d’itérations correspond à l’étape d’apprentissage de l’algorithme qui aboutit à produire des coefficients du filtre permettant de reconstruire le signal S_B(t) à partir d’un signal d’entrée S_A(t) selon le procédé de traitement PROC₁de l’invention.The succession of iteration steps corresponds to the learning step of the algorithm which results in producing filter coefficients making it possible to reconstruct the signal S _B (t) from an input signal S _A (t ) according to the PROC ₁ treatment method of the invention.

Nonlinear model

Selon une alternative de réalisation, un modèle non linéaire MOD₂peut être utilisé pour mettre en œuvre l’invention.According to an alternative embodiment, a nonlinear MOD ₂ model can be used to implement the invention.

Où NN désigne un filtre non-linéaire.Where NN denotes a non-linear filter.

Dans cette mise en œuvre le filtre F₁= NN correspond à une correction additionnelle qui permet de corriger l’erreur de modélisation du modèle linéaire. Dans ce cas, l’erreur est minimisée en prenant en considération la partie non linéaire NN.In this implementation the filter F ₁ = NN corresponds to an additional correction which makes it possible to correct the modeling error of the linear model. In this case, the error is minimized by taking into consideration the nonlinear part NN.

Dans le cas d’une mise en œuvre d’un modèle non linéaire MOD₂, le filtre F₁est un filtre non linéaire. Selon un exemple, une méthode de calcul des coefficients du filtre non linéaire F₁peut être mise en œuvre à partir d’un réseau de neurones. À cette fin, un réseau de neurones à propagation avant est produit avec le résultat obtenu avec le filtre linéaire. Selon un exemple, l’invention comprend la mise en œuvre d’un réseau de type CNN, définissant un réseau de neurones convolutif. Dans cet exemple de mise en œuvre, un réseau d'anticipation conventionnel avec une couche cachée qui utilise des fonctions d'activation sigmoïdales permet d’obtenir de bonnes performances d’apprentissage.In the case of an implementation of a nonlinear model MOD ₂ , the filter F ₁ is a nonlinear filter. According to one example, a method for calculating the coefficients of the nonlinear filter F ₁ can be implemented using a neural network. For this purpose, a forward propagation neural network is produced with the result obtained with the linear filter. According to one example, the invention comprises the implementation of a CNN type network, defining a convolutional neural network. In this example implementation, a conventional feedforward network with a hidden layer that uses sigmoidal activation functions achieves good learning performance.

Dans le cas d’un filtre non linéaire, l’apprentissage comprend également la phase de calcul des coefficients du réseau par une régression effectué à chaque itération.In the case of a non-linear filter, learning also includes the phase of calculating the network coefficients by a regression carried out at each iteration.

Selon d’autres exemples, d’autres réseaux de neurones peuvent être utilisés.According to other examples, other neural networks can be used.

Le procédé de l’invention repose sur la mise en œuvre d’un algorithme permettant le calcul de coefficients intégrant des phases itératives comportant le calcul d’une erreur. Cet algorithme est dit « algorithme apprenant » ou « fonction apprenante » dans la mesure où il prend en compte des étapes intermédiaires permettant de converger vers un résultat aboutissant à la construction du filtre caractéristique F₁.The method of the invention is based on the implementation of an algorithm allowing the calculation of coefficients integrating iterative phases comprising the calculation of an error. This algorithm is called a “learning algorithm” or “learning function” insofar as it takes into account intermediate steps making it possible to converge towards a result leading to the construction of the characteristic filter F ₁ .

Une fois le filtre F₁déterminé l’ensemble des paramètres du modèle MOD₁pour le modèle linéaire et du modèle MOD₂pour le modèle non linéaire sont déterminés. Les modèles peuvent ensuite être appliqués de manière à prédire la sorite S_B(t) sur la .Once the filter F ₁ has been determined, all the parameters of the MOD ₁ model for the linear model and of the MOD ₂ model for the non-linear model are determined. The models can then be applied to predict the sorite S _B (t) on the .

Les figures 3 et 4 représentent l’étape de calcul de l’erreur er(t) à chaque itération à chaque en considérant S₂(t) acquis directement et la sortie y(t) estimé.Figures 3 and 4 represent the step of calculating the error er(t) at each iteration, considering S ₂ (t) acquired directly and the output y(t) estimated.

Selon un mode de réalisation, un filtre initial Fi est pré-entrainé et enregistré dans une mémoire d’un équipement. Un tel filtre initial Fi peut être entrainé avec des profils de voix différents tels que des profils de voix de femme, de voix d’homme, voix d’enfant. Certains tessiture ou timbre de voix peuvent être utilisé pour pré-entrainé un filtre initial Fi au sein de l’équipement. Un intérêt est de définir des coefficients initiaux permettant d’améliorer la convergence de l’erreur en dessous d’un seuil donné lors de l’apprentissage.According to one embodiment, an initial filter Fi is pre-trained and recorded in a device memory. Such an initial filter Fi can be trained with different voice profiles such as female voice profiles, male voice profiles, child voice profiles. Certain vocal range or timbre can be used to pre-train an initial Fi filter within the equipment. An interest is to define initial coefficients making it possible to improve the convergence of the error below a given threshold during learning.

Afin de définir un filtre initial Fi, la méthode de caractérisation CARAC₁de l’invention peut être utilisée avec une première configuration. La première configuration peut bénéficier de ressources de calculs supplémentaires, ainsi l’ordre du système peut être augmenté, ainsi que les échantillons ou encore on peut disposer d’un plus grand nombre de phrases d’entrainement. Dans cette configuration, un modèle non linéaire MOD₂peut être utilisé pour définir le filtre initial Fi et un ou le modèle linéaire peut être utilisé pour entrainer le modèle avec la véritable voix du plongeur.In order to define an initial filter Fi, the CARAC ₁ characterization method of the invention can be used with a first configuration. The first configuration can benefit from additional calculation resources, thus the order of the system can be increased, as well as the samples or even a greater number of training sentences can be available. In this configuration, a non-linear MOD ₂ model can be used to define the initial filter Fi and one or the linear model can be used to train the model with the real voice of the diver.

Selon un autre mode de réalisation, le procédé de traitement PROC1 de l’invention peut être mis en œuvre pour corriger une erreur également pendant la phase de plongée par exemple avec un entrainement supervisé ou non supervisé.According to another embodiment, the PROC1 processing method of the invention can be implemented to correct an error also during the diving phase, for example with supervised or unsupervised training.

Par exemple, une phrase répétée ou un signal indiquant une mauvaise compréhension peut être utilisé afin de labelliser des sorties du modèles afin d’améliorer la construction d’un filtre F2 lors du procédé de traitement PROC₁.For example, a repeated sentence or a signal indicating poor understanding can be used to label model outputs in order to improve the construction of a filter F2 during the PROC ₁ processing process.

Selon un exemple de réalisation, le procédé de l’invention peut comprendre trois entrainements successifs du modèle ou des modèles pouvant être réalisés à des moments espacés entre eux. Un premier entrainement du modèle permet de définir un filtre initial Fi à partir d’une première configuration du procédé de caractérisation CARAC₁par exemple avec une voix pré sélectionnée correspondant à un premier signal S2(t).According to an exemplary embodiment, the method of the invention can comprise three successive trainings of the model or models which can be carried out at times spaced apart from each other. A first training of the model makes it possible to define an initial filter Fi from a first configuration of the CARAC ₁ characterization process, for example with a pre-selected voice corresponding to a first signal S2(t).

Un second entrainement du modèle permet de définir un filtre caractéristique F₁à partir d’une seconde configuration du procédé de caractérisation CARAC₁avec la voix du plongeur correspondant à un second signal S2(t). Ce second entrainement peut être réalisé avec des coefficients du modèle initiaux qui ont été calculés à partir du premier entrainement.A second training of the model makes it possible to define a characteristic filter F ₁ from a second configuration of the CARAC ₁ characterization process with the diver's voice corresponding to a second signal S2(t). This second training can be carried out with initial model coefficients which were calculated from the first training.

Enfin, un troisième entrainement peut être réalisé lors des communications en utilisant le procédé de traitement PROC1 et une labellisation automatique des sorties qui peut être déduites de l’analyse du flux audio d’un ou des deux plongeurs, tels que par exemple :

Les répétitions de phrases prononcées à des mêmes vitesses ou à des vitesses différentes ;
L’identification de phrase prédéfinie telle que « pas compris », « je ne comprends pas », « peux-tu répéter », etc ;
Un changement de respiration ou une détection d’un taux de bulles ;
Etc.

Finally, a third training can be carried out during communications using the PROC1 processing method and automatic labeling of the outputs which can be deduced from the analysis of the audio stream of one or both divers, such as for example:

Repetitions of sentences spoken at the same or different speeds;
Predefined phrase identification such as “don’t understand”, “I don’t understand”, “can you repeat”, etc.;
A change in breathing or detection of a level of bubbles;
Etc.

Conversion du signal filtré et transmis pour conduction osseuseConversion of the filtered and transmitted signal for bone conduction

Selon un mode de réalisation, la sortie filtrée par le filtre F₁caractérisé par le procédé de caractérisation CARAC₁du filtre est transmise via une interface de communication sans fil à un autre dispositif de communication d’un autre nageur. Selon un exemple, la sortie audio filtrée est modulée sur une fréquence porteuse. L’invention est compatible de toute forme de transmission sans fil par voie électromagnétique telle qu’une liaison radio ou par émission d’ondes sonores ou ultrasonores. Différentes modulations peuvent être utilisées, que ce soit de la modulation de fréquence, d’amplitude. Selon un mode de réalisation, le signal audio filtré est transmis en étant modulé par une technique d’évasion de fréquences.According to one embodiment, the output filtered by the filter F ₁ characterized by the CARAC ₁ characterization method of the filter is transmitted via a wireless communication interface to another communication device of another swimmer. In one example, the filtered audio output is modulated to a carrier frequency. The invention is compatible with any form of wireless transmission by electromagnetic means such as a radio link or by emission of sound or ultrasonic waves. Different modulations can be used, whether frequency or amplitude modulation. According to one embodiment, the filtered audio signal is transmitted by being modulated by a frequency escape technique.

L’embout buccal 1 peut comprendre des moyens de transmission de signal connectés au microphone 5 et/ou à l’élément vibrant 3. Les moyens de transmission peuvent comprendre des câbles électroniques et/ou une nappe de conduction. Ces moyens de transmissions peuvent être au moins partiellement intégré dans un revêtement élastique.The mouthpiece 1 may comprise signal transmission means connected to the microphone 5 and/or the vibrating element 3. The transmission means may comprise electronic cables and/or a conduction layer. These transmission means can be at least partially integrated into an elastic covering.

Lorsque le signal transmis d’un équipement à l’autre est démodulé en réception, le signal audio peut être converti de manière faire vibrer un plateau de conduction osseuse. La représente un exemple de plateau de conduction acoustique 20 agencé de manière à être mordu par l’utilisateur portant l’embout buccal 1. Selon cet exemple, l’embout buccal 1 comprend en outre un élément vibrant de type transducteur. L’élément vibrant est connecté au plateau de conduction de manière à transmettre les vibrations de l’élément vibrant au plateau de conduction acoustique.When the signal transmitted from one piece of equipment to another is demodulated on reception, the audio signal can be converted to vibrate a bone conduction plate. There represents an example of an acoustic conduction plate 20 arranged so as to be bitten by the user wearing the mouthpiece 1. According to this example, the mouthpiece 1 further comprises a vibrating element of the transducer type. The vibrating element is connected to the conduction plate so as to transmit vibrations from the vibrating element to the acoustic conduction plate.

L’élément vibrant 3 est configuré pour convertir une entrée audio en une sortie acoustique et se coupler acoustiquement aux dents supérieures et/ou inférieures du plongeur pour conduire la sortie acoustique des dents supérieures du plongeur à travers le crâne pour propager le signal acoustique jusqu’à l’oreille interne du plongeur via les os du crâne et de la mâchoire lorsque le plongeur porte l'embout dans la bouche.The vibrating element 3 is configured to convert audio input to acoustic output and acoustically couple to the upper and/or lower teeth of the diver to conduct the acoustic output from the upper teeth of the diver through the skull to propagate the acoustic signal up to the diver's inner ear via the skull and jaw bones when the diver wears the tip in the mouth.

Le plateau de conduction acoustique 20 est conçu et agencé pour se coupler acoustiquement aux dents supérieures de l’utilisateur pour conduire la sortie acoustique des dents supérieures du plongeur à travers le crâne jusqu'à la cochlée afin de générer un son audible dans au moins une des oreilles internes du plongeur lorsque le plongeur porte l'embout buccal 1.The acoustic conduction plate 20 is designed and arranged to acoustically couple to the upper teeth of the user to conduct the acoustic output of the upper teeth of the diver through the skull to the cochlea to generate audible sound in at least one of the diver's inner ears when the diver wears the mouthpiece 1.

Le plateau de conduction acoustique 20 est configuré pour s'engager et se coupler acoustiquement à la surface des dents du plongeur et est configuré pour conduire les vibrations de l’élément vibrant 3 en réponse à un signal électrique. La vibration de l’élément vibrant 3 produit un signal de sortie acoustique qui est conduit acoustiquement vers les dents du plongeur, via le plateau de conduction acoustique, puis à travers les os de sa mâchoire et de son crâne jusqu'à l'oreille interne, y compris la cochlée, où il est perçu comme un son.The acoustic conduction plate 20 is configured to acoustically engage and couple with the surface of the plunger teeth and is configured to conduct the vibrations of the vibrating element 3 in response to an electrical signal. The vibration of the vibrating element 3 produces an acoustic output signal which is acoustically conducted to the diver's teeth, via the acoustic conduction plate, then through his jaw bones and skull to the inner ear , including the cochlea, where it is perceived as a sound.

Claims

Characterization process (CARAC₁) of a filter (F₁) for processing (PROC₁) of a voice of a given subject, said voice being acquired by a microphone arranged in an underwater breathing aid device intended to be worn near the mouth of a subject, said method comprising:

Acquisition (ACQ1) of at least a first signal (S ₁ ) corresponding to the pronunciation of a first set of phonemes by said given subject; said first signal (S ₁ ) corresponding to a voice of a subject acquired by the microphone of the underwater breathing aid device when the latter is worn by said subject;
Acquisition (ACQ2) of at least one second signal (S ₂ ) corresponding to the pronunciation of the first set of phonemes (PHR ₁ ) by said subject; said second signal (S ₂ ) corresponding to a subject's voice acquired by a microphone without the subject wearing an underwater breathing aid;
For the first signal (S ₁ ), execution of the following steps:
- Sampling (ECH ₁ ) of the first signal (S ₁ ) to generate a plurality of samples from the first signal (S ₁ );
- Estimation (EST ₁ ) of a first set (ENS ₁ ) of average values of the amplitude of the first signal (S ₁ ) over a plurality of first predefined windows, each first window comprising a plurality of samples of the first signal (S ₁ );
- Estimation (EST ₂ ) of a second set (ENS ₂ ) of speed values of the signal (S ₁ ) from a finite-time homogeneous differentiator method over the plurality of first predefined windows;
- Calculation (EST _F ) of a first set of coefficients of a system of differential equations representing a first model (MOD _1, MOD ₂ ) whose solutions are the first and the second set of values (ENS ₁ , ENS ₂ ) ;
Calculation of an error (Er) between at least one set of samples of the second signal (S₂) and the samples obtained at the output of the first model (MOD₁,MOD₂) executed with values of the first signal (S₁) and the estimated coefficients (G, F, H),
Iteration(s) of calculating the coefficients of the first model (MOD₁,MOD₂) by minimizing at each iteration said calculated error (Er), the number of iterations being configured to minimize the error below a predefined threshold so as to define operating coefficients, said operating coefficients defining the coefficients of the filter (F₁).

Characterization method according to claim 1, characterized in that it comprises a step of filtering the first signal (S ₁ ) carried out prior to sampling.

Characterization method according to any one of claims 1 to 2, characterized in that the sampling of the first signal (S₁) includes the following steps:

Analysis of the spectral density of the signal (S ₁ ) in order to determine characteristic amplitude or power thresholds;
Selection of a set of characteristic frequencies for which said determined thresholds are exceeded;
Sampling of segments of the first signal (S ₁ ) around each selected characteristic frequency;
Generation of a set of samples (ECH ₁ ) for processing the first signal (S ₁ ) from each sampled segment.

Characterization method according to any one of claims 1 to 3, characterized in that the sampling (ECH ₁ ) produces between 400 and 96,000 samples for 1 second.

Characterization method according to any one of claims 1 to 4 characterized in that the method is repeated for a plurality of acquisitions of first signals (S ₁ ), each first signal (S ₁ ) acquired corresponding to the pronunciation of a new set of phonemes by the same subject, the calculation of the error (Er) and its minimization being carried out for each new first signal (S ₁ ) acquired.

Characterization method according to any one of claims 1 to 5 characterized in that the system of differential equations is of the first order.

Characterization method according to any one of claims 1 to 6 characterized in that it comprises:

Estimation (EST ₃ ) of a third set (ENS ₃ ) of acceleration values of the signal (S ₁ ) using the finite-time homogeneous differentiator method over the plurality of first predefined windows;
Calculation (EST_F) of a first set of coefficients of a system of first order differential equations representing a first model (MOD₁,MOD₂) whose solutions are the first and second set of values (ENS₁, ENS_2,ENS₃).

Characterization method according to any one of claims 1 to 7 characterized in that it comprises:

Estimation (EST _N ) of an ^Nth set (ENS _N ) of values of a derivative of order N of the signal (S ₁ ) using the finite-time homogeneous differentiator method over the plurality of first predefined windows ;
Calculation (EST_F) of a first set of coefficients of a system of differential equations of N^thorder representing a first model (MOD₁,MOD₂) whose solutions are the first and second set of values (ENS₁, ENS_{2, …,}ENS_NOT).

Characterization method according to any one of claims 1 to 8 characterized in that the first model is a linear model (MOD1) and is written:

,

where G, F and H are matrices whose coefficients are to be determined, the dimension of the matrix being determined by the order of the linear system of differential equations, y(t) is the signal that we seek to identify approaching the second signal (S₂), and x(t) is a state variable internal to the first model (MOD₁).

Characterization method according to any one of claims 1 to 8 characterized in that the first model is a non-linear model (MOD₂) and is written:

,

where G, F and H are matrices whose coefficients are to be determined, the dimension of the matrix being determined by the order of the system of differential equations, NN designates a non-linear filter, y(t) is the signal that 'we seek to identify approaching the second signal (S₂), and x(t) is a state variable internal to the first model (MOD₂),

the method further comprising a calculation (EST _NN ) of a set of coefficients of a neural network from a regression configured to minimize the error (Er).

Characterization method according to any one of claims 1 to 10 characterized in that the calculation of the minimization step (EST _F , EST _NN ) of the error (Er) is carried out from the implementation of a neural network whose coefficients are calculated by regression to minimize the error (Er).

Characterization method according to any one of claims 1 to 11 characterized in that at each iteration a first condition is verified, said first condition being the verification that the error is less than a predefined threshold and/or that a second condition is verified, said second condition being the verification that a predefined number of iterations has been carried out, the method comprising a step of generating the filter coefficients as soon as one of the two conditions is verified.

Treatment process (PROC₁) in a voice of a given subject including:

Acquisition of a signal (S _A (t)) corresponding to a voice acquired by a microphone arranged in an underwater breathing aid device intended to be worn in the mouth of a subject, the acquired voice being defined by an audio signal acquired in real time ( _SA ) corresponding to a subject's voice distorted by the presence of the mouthpiece in the mouth;
Application (APP ₁ ) of a first model (MOD ₁ ) trained according to the method of characterizing the filter (F ₁ ) of any one of claims 1 to 12, said application (APP ₁ ) aiming to obtain a second signal ( S _B ) of voice transformed from the first signal (S _A ) of the voice distorted by an underwater breathing aid device (EB ₁ ).

Communication device comprising a mouthpiece (1) intended to be worn in the mouth of a subject comprising:

a microphone for acquiring an acoustic signal from the subject's voice;

an electronic card comprising an audio filter, an analog-digital converter, a calculator for sampling the digitized signal, a memory for recording the coefficients of a filter (F₁) to be applied to the sampled signal (S_HAS(t)) and a calculator making it possible to generate an output signal (SB(t)), said recorded coefficients being calculated from a method according to any one of claims 1 to 12.

Computer program product comprising instructions which, when the program is executed by the communication device, cause said communication device to implement the steps of the method of any one of claims 1 to 12.