EP0953969B1 - Procédé de restitution de la parole à régulation de silences - Google Patents
Procédé de restitution de la parole à régulation de silences Download PDFInfo
- Publication number
- EP0953969B1 EP0953969B1 EP19990400873 EP99400873A EP0953969B1 EP 0953969 B1 EP0953969 B1 EP 0953969B1 EP 19990400873 EP19990400873 EP 19990400873 EP 99400873 A EP99400873 A EP 99400873A EP 0953969 B1 EP0953969 B1 EP 0953969B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- silence
- restoration
- sound
- speech
- threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
Definitions
- speech is transmitted in the form a continuous stream of code words representing the instantaneous amplitude of the voice signal, cyclically sampled and digitized for this purpose.
- a communication has a permanent transmission channel, or circuit, to flow this continuous flow.
- the code words representing the voices are transmitted in packets, on a channel offering sufficient speed to be able to share the channel in time between several terminals.
- the crossing time of the network is essentially variable, because each package can follow a variable path and we cannot take count the packets as soon as they are received, because a packet can arrive before the end of the sound reproduction of the previous one, or can, on the contrary, happen after this one.
- packet crushing in the second case, dead time.
- the present invention aims to remedy these drawbacks.
- the invention relates to a method of sound reproduction of speech signals, received in successive packets representing slices successive speech periods originally comprising periods of silence and temporarily memorized before to be restored with sound, a process characterized by the fact that detects the presence of said silences, in the slices received, and that regulates the duration of restitution to reproduce, in a single piece, the signals of speech other than silences.
- the device of FIG. 1 comprises a microprocessor-based controller 1 controlling the operation of a reception and restitution chain sound of speech signals.
- the chain connected as input to line 2 of a packet communication network, has as input a circuit 3 for reading the received packets, which identifies the sound segments and the rests and reconstructs their temporal position.
- explicit landmarks may exist that distinguish the segments and indicate their dates, or such Explicit marks do not exist and circuit 3 reconstructs them from the serial number of packets and their content decoded into voice signal whose form it analyzes.
- Circuit 3 is followed by a buffer memory 4 of storage of the packets, which puts them back in their order of transmission and transmits to a circuit 5 of reproduction or sound reproduction commanding a headset 6.
- a transmitting terminal in communication with that of figure 1, analyzes the signal of its microphone by a vocoder for the code in compressed form.
- FIG. 2A represents the amplitude S of the voice signal as a function of time t .
- a vocoder seeks to delimit segments with a voice signal corresponding to a normalized sound in library, such as a vowel, a consonant or a quasi-silence. For the transmission of information representing the signal S, this is then replaced by a series of code words representing the sounds recognized there. The volume of information data is thus very reduced. In reception, the consultation of a library. similar allows the original signal S to be restored.
- the signal is not analyzed so finely for the coding, and it is circuit 3 of the receiver which analyzes the reconstructed signal to locate the segments of silence.
- the information coded is transmitted in packets P1, P2, P3, P4 each carrying a more or less long section of the speech.
- the signal S comprises, in the packet P1, the end S0 of a silence, a block of speech energy signal 11 followed of a silence S1 and of another block 12.
- the block 12 is continues, with reference 13, and is followed by two blocks 14 and 15 with rests S2 and S3 interposed.
- the package P3 includes the end of block 15, referenced 16, a silence S4, a block 17, a silence S5 and the beginning 18 of a block followed, packet P4, of an end 19 of the block then of the start S6 of a silence.
- Blocks 11, 12-13, 14, 15-16, 17, 18 here represent the six sequences respective: “and” “the invention” “is” “new” “and” “inventive” (figure 2B).
- block 12-13 (just like 15-16 and 18-19), spread over the two packets, P1 and P2, may be temporarily separated in two when of its sound reproduction. We therefore seek here to avoid it.
- the instants t0, t1, t2, t3 delimit the slices of the initial signal assigned to successive packets P1, P2, P3, P4.
- References t'0, t'1, t'2, t'3 (fig. 2C), uniformly translated with respect to the instants respective t0, t1, t2, t3, mark the corresponding theoretical dates of playback in the earpiece 6. Due to the fluctuation in the delay transmission, packets can arrive early or late. In this example, packet P2 arrives after a dead time or delay R following the instant t '1 of end of restitution of the packet P1. On the contrary, the P3 package, although arriving after t'2, arrives ahead of the end of the restitution of the P2 package.
- the rests S2 and S4 are here shortened, or abbreviated (.) to empty buffer memory 4 as quickly as possible in order to better tolerate early arrivals of following packages. This is of particular interest in the case for which the duration of restitution of sound sequence (s) triggered by the arrival of a packet is greater than the theoretical period. Indeed an arriving packet can complete an earlier sound block representing an uninterrupted signal duration spanning several time slices, which will then be returned.
- FIG. 3 illustrates the management for this purpose of the sound reproduction of packets received.
- step 23 we shorten, in step 24, the silence in course - or even delete it and restore the above sound sequence by transition to state 21. If not, in steps 22 and 23, we pass to a state 26 of silence reproduction.
- step 25 is detected as a negative overflow of memory 4 (no information available in memory, relating to next segment to be reproduced).
- step 27 it is also detected if the memory 4 is empty of the segment next to reproduce and we decide in such a case, step 27, to extend the silence beyond its normal duration, i.e. we insert a silence additional.
- step 28 if a contiguous sound sequence, like 12 - 13, is present and therefore available (P1 and P2 packets received) and represents a duration exceeding a threshold B, the silence in progress in state 26 (S1) will be Abridged.
- a block (12) of sequence sound has a delay in restitution exceeding threshold A, even in the absence of reception of the end (13) of the block. If, on the other hand, the threshold B is not reached, an additional silence is inserted.
- step 26 We leave state 26 when all the silence that was to be reproduced has been and that the following sound sequence should be reproduced. We delay it however by passing temporarily or durably through a state 29, emission of background noise, or background noise, sounding or tone vocal like "uhh", indicating that the speaker will speak again, which avoids being cut off by replacing an extended silence, or additional, by background noise.
- the two conditions of step 28 are also sought in state 29 and we pass, in yes, in state 21 of sound reproduction.
- step 20 if the delay in restitution exceeds a threshold C, by example of 1.5 times the threshold value.
- A which indicates an overflow memory positive 4.
- the oldest rests, restore first are reduced and possibly almost deleted. It can also be planned to delete the sound sequences the older or simply time slices thereof, which amounts to modulating, here accelerating, the speed of restitution of the sound signal.
Description
- la figure 1 est un schéma par blocs illustrant un dispositif de mise en oeuvre du procédé de l'invention,
- la figure 2, formée des figures 2A, 2B, 2C, 2D et 2E, illustre le découpage par paquets d'un signal de parole en fonction du temps t, et
- la figure 3 est un diagramme de cheminement illustrant le procédé.
Claims (8)
- Procédé de restitution sonore de signaux de parole (11, S1, 12, 13, S2) reçus par paquets successifs (P1, P2, P3) représentant des tranches temporelles successives de parole comportant à l'origine des périodes de silence et mémorisés temporairement (4) avant d'être restitués de façon sonore (5, 6), procédé caractérisé par le fait qu'on détecte la présence desdits silences (S1, S2), dans les tranches reçues, et qu'on en régule la durée de restitution pour restituer, d'un seul tenant, les signaux de parole (11, 12) autres que les silences.
- Procédé selon la revendication 1, dans lequel on abrège (24, 28) la restitution de tout silence (S1) dont au moins le début de la séquence sonore suivante est disponible.
- Procédé selon la revendication 1, dans lequel on abrège (23, 28) la restitution du silence (S1) si le retard à la restitution des signaux de parole dépasse un seuil (A).
- Procédé selon l'une des revendications 1 à 3, dans lequel on insère un silence supplémentaire (25, 27) lorsqu'aucune séquence (11) suivante n'est disponible en mémoire.
- Procédé selon l'une des revendications 1 à 4, dans lequel, lorsqu'un silence (S1) n'est pas suivi en mémoire d'une séquence sonore de durée supérieure à un seuil (B), on insère un silence supplémentaire.
- Procédé selon l'une des revendications 4 et 5, dans lequel on remplace le silence supplémentaire par un bruit de fond (29).
- Procédé selon la revendication 6, dans lequel le bruit de fond est à tonalité vocale.
- Procédé selon l'une des revendications 1 à 7, dans lequel on élimine (20), de la restitution, les signaux mémorisés les plus anciens lorsque leur retard pour la restitution dépasse un seuil (C).
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9805227A FR2778011B1 (fr) | 1998-04-27 | 1998-04-27 | Procede de restitution de la parole a regulation de silences |
FR9805227 | 1998-04-27 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0953969A1 EP0953969A1 (fr) | 1999-11-03 |
EP0953969B1 true EP0953969B1 (fr) | 2003-10-01 |
Family
ID=9525692
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP19990400873 Expired - Lifetime EP0953969B1 (fr) | 1998-04-27 | 1999-04-09 | Procédé de restitution de la parole à régulation de silences |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0953969B1 (fr) |
DE (1) | DE69911685T2 (fr) |
FR (1) | FR2778011B1 (fr) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5526353A (en) * | 1994-12-20 | 1996-06-11 | Henley; Arthur | System and method for communication of audio data over a packet-based network |
EP0756267A1 (fr) * | 1995-07-24 | 1997-01-29 | International Business Machines Corporation | Méthode et système pour enlever des silences dans la communication vocale |
US5682384A (en) * | 1995-10-31 | 1997-10-28 | Panagiotis N. Zarros | Apparatus and methods achieving multiparty synchronization for real-time network application |
-
1998
- 1998-04-27 FR FR9805227A patent/FR2778011B1/fr not_active Expired - Fee Related
-
1999
- 1999-04-09 DE DE1999611685 patent/DE69911685T2/de not_active Expired - Lifetime
- 1999-04-09 EP EP19990400873 patent/EP0953969B1/fr not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
FR2778011A1 (fr) | 1999-10-29 |
FR2778011B1 (fr) | 2000-06-09 |
DE69911685D1 (de) | 2003-11-06 |
DE69911685T2 (de) | 2004-07-29 |
EP0953969A1 (fr) | 1999-11-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8937963B1 (en) | Integrated adaptive jitter buffer | |
EP0082077B1 (fr) | Procédé de télédistribution d'informations enrégistrées, notamment d'oeuvres musicales, et système de mise en oeuvre | |
US6885987B2 (en) | Method and apparatus for encoding and decoding pause information | |
US6873954B1 (en) | Method and apparatus in a telecommunications system | |
KR100722707B1 (ko) | 멀티미디어 신호를 전송하기 위한 전송 시스템 | |
FR2488434A1 (fr) | Systeme de reproduction de signaux codes | |
EP0041895A1 (fr) | Procédé et dispositif de transmission de données numériques sous forme de paquets | |
CN101518001B (zh) | 用于补偿分组流中的抖动的方法 | |
EP0953969B1 (fr) | Procédé de restitution de la parole à régulation de silences | |
EP0251854B1 (fr) | Procédé de transmission de trains numériques sur des voies à débits plus élevés et dispositif de mise en oeuvre | |
CN109155680B (zh) | 当前音视频再现被中断覆盖后继续当前再现的方法和设备 | |
US6748000B1 (en) | Apparatus, and an associated method, for compensating for variable delay of a packet data in a packet data communication system | |
FR2848049A1 (fr) | Procede de traitement de paquets de donnees recus sur des reseaux asynchrones, et dispositif pour la mise en oeuvre du procede | |
JP5330183B2 (ja) | パケット挿入削除方法及び通話システム | |
FR2795548A1 (fr) | Procede pour la gestion du decodage et de la restitution d'un signal sonore dans un systeme de transmission asynchrone | |
EP0762639B1 (fr) | Dispositif de commande de volume sonore pour récepteur de signaux de parole codés par blocs | |
EP0194186B1 (fr) | Procédé de transmission de données par insertion dans un signal vocal analogique et dispositifs pour la mise en oeuvre de ce procédé | |
WO2001050692A1 (fr) | Dispositif de reception de paquets | |
EP0632922B1 (fr) | Dispositif haute fidelite de reproduction du son au cinema | |
FR2579047A1 (fr) | Procede de synchronisation par rattrapage de frequence et dispositif de mise en oeuvre du procede | |
JP2000092122A5 (fr) | ||
FR2859592A1 (fr) | Procede de commande d'un terminal multimodal, plate-forme de traitement et terminal multimodal | |
WO2020188097A1 (fr) | Procédé de restitution de contenus de personnalisation d'un flux radiophonique principal | |
FR2978322A1 (fr) | Procede et dispositif de generation d'un signal porteur d'un code numerique | |
KR970014249A (ko) | 다중언어신호의 지연재생장치 및 방법 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): DE GB |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
17P | Request for examination filed |
Effective date: 20000422 |
|
AKX | Designation fees paid |
Free format text: DE GB |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7G 10L 21/04 B Ipc: 7G 10L 19/00 A |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE GB |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REF | Corresponds to: |
Ref document number: 69911685 Country of ref document: DE Date of ref document: 20031106 Kind code of ref document: P |
|
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) |
Effective date: 20040120 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20040702 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: 732E Free format text: REGISTERED BETWEEN 20090319 AND 20090325 |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: 732E Free format text: REGISTERED BETWEEN 20090326 AND 20090401 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R082 Ref document number: 69911685 Country of ref document: DE Representative=s name: , Ref country code: DE Ref legal event code: R082 Ref document number: 69911685 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R081 Ref document number: 69911685 Country of ref document: DE Owner name: SAGEMCOM SAS, FR Free format text: FORMER OWNER: SAGEM S.A., PARIS, FR Effective date: 20120120 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R081 Ref document number: 69911685 Country of ref document: DE Owner name: SAGEMCOM SAS, FR Free format text: FORMER OWNER: SAGEM TELECOMMUNICATIONS S. A., PARIS, FR Effective date: 20130129 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20160324 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20160321 Year of fee payment: 18 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 69911685 Country of ref document: DE |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20170409 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20171103 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170409 |