CN1255255A - Echo reducing phone with state machine controlled switches - Google Patents
Echo reducing phone with state machine controlled switches Download PDFInfo
- Publication number
- CN1255255A CN1255255A CN 98804832 CN98804832A CN1255255A CN 1255255 A CN1255255 A CN 1255255A CN 98804832 CN98804832 CN 98804832 CN 98804832 A CN98804832 A CN 98804832A CN 1255255 A CN1255255 A CN 1255255A
- Authority
- CN
- China
- Prior art keywords
- state machine
- microphone
- finite state
- signal
- loud speaker
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M9/00—Arrangements for interconnection not involving centralised switching
- H04M9/08—Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
Abstract
The purpose of the present invention is thus to reduce the echo introduced by cross-talk. The problem described above, with how to reduce the echo introduced by cross-talk is solved by to the microphone and to the speaker introduce switches controlled by a state-machine which take as input the signal energy of the signal from the microphone, a VAD flag of the signal from the microphone, the signal energy of the signal to the speaker and a VAD flag of the signal to the speaker.
Description
The present invention relates generally to telecommunications, relate more specifically to the speech processes of the Speech Communication on the internet.
The PC of typical Internet telephony utilization band sound plate, microphone and two loud speakers.Microphone and loud speaker are put on the table usually toward each other.This configuration causes sounding the considerable amounts of cross talk that resembles echo at receiver end.For being used, Internet telephony must suppress this echo.
In GSM, knownly detect the mobile phone user whether in speech with VAD (voice activity detection).Bandwidth when utilizing this information can reduce the emission speech.In the discontinuousness voice coding according to VOX principle (transmission of voice operation), VAD is responsible for the unit detecting the voice whether sound sequence that is received represents the people.Two kinds of different states can be got in the VAD unit, wherein the first state representation sound sequence speech of behaving and speech that another state representation sound sequence is not the people.
If the VAD unit detects the speech that given sound sequence is represented the people, first status signal will be issued to the speech coding unit in this VAD unit, and the latter is coded in this sound sequence in the speech frame.Otherwise if given sound sequence is represented people's voice thing in addition, this VAD unit will be issued second status signal and give SID (silence descriptor) unit.Every N the frame in described SID unit sends a SID frame.May not send anything in the opportunity of transmit frame at all the other N-1.The SID frame comprises the information about the noise spectrum of the background noise estimated and estimation on the transmit leg.Can save the power of battery and radio bandwidth with this process.
When the SID unit changes to generation second status signal from generating first status signal, be from detecting voice to the time interval that detects non-voice, usually apply the so-called hangover, the speech coding unit continue to send speech frame during this, and seeming the sound sequence that is received still is people's voice.If the VAD unit still detects non-voice after the hang-over delay, just generate the SID frame.The reason of this process is the pause of the weak point between the word in people's the language not to be interpreted as non-voice, and the speech frame generator must enliven.
The invention discloses the method and apparatus of the echo of reducing the cross-talk introducing.
Thereby purpose of the present invention is for reducing the echo that cross-talk is introduced.
Above-mentioned is solved by the switch of state machine control by introducing to microphone and loud speaker about how reducing the echo problem that cross-talk introduces, this state machine with from the signal energy of the signal of microphone, from the VAD sign of the signal of microphone, to the signal energy of the signal of loud speaker and to the VAD sign of the signal of loud speaker as importing.
One of advantage of the present invention is to have reduced the echo that cross-talk introduces significantly and need not more computing capability.
From detailed description given below, will be conspicuous for other advantage of person skilled in the art person.
From detailed description given below, further range of application of the present invention will be conspicuous.However, it should be understood that most preferred embodiment of the present invention is exemplary, because from this detailed description, various changes within the scope of the present invention are conspicuous with revising person skilled in the art person.
Fig. 1 illustrates the block diagram of one embodiment of the present of invention.
Fig. 2 illustrates finite state digraph.
Microphone 101 is connected on the GSM encoder 102 in Fig. 1.Before signal arrived GSM encoder 102, it was digitized according to unshowned known technology among Fig. 1 and samples.At first code signal is transferred to not shown receiver from GSM encoder 102 by starting or ending the switch 103 that transmits.From GSM encoder 102 with ACF
E(autocorrection coefficient) passes to VAD unit 104.Also transmit long-term predictor lagged value N from the GSM frame
EGive VAD unit 104.The value P that will represent the energy of signal from VAD unit 104
EPass to finite state machine 105.VAD unit 104 also calculates the sign F whether indication VAD unit 104 has detected people's voice
ETo indicate F
EPass to finite state machine 105.If detect people's speech then indicate F
EFor very.
Also has the coded speech signal that receives and pass to the sampling of GSM decoder 106 from sender's (not shown) among Fig. 1.At first pass to loud speaker 107 from GSM decoder 106 by the sampling voice signal that can make or will decode by the switch 108 of voice signal arrival loud speaker.According to unshowned known technology among Fig. 1,, need the D/A conversion in order to make loud speaker energy operate as normal.From the coded speech signal that is received, derive long-term predictor lagged value N
0And pass to VAD unit 109.
Do not use the VAD unit because the decoding of GSM frame does not comprise usually, the GSM decoder lacks the call parameter that is used to calculate ACF.In order to calculate ACF, auto-correlation unit 110 receives the ACF that data and calculating from GSM decoder 106 pass to VAD unit 109
DAuto-correlation unit 110 is the part of the GSM encoder described in the standard.To arrive the indicated value P of the energy the voice signal of loud speaker from VAD unit 109
DPass to finite state machine 105.Also will indicate F from VAD unit 109
DPass to described finite state machine, whether indication VAD unit detects people's speech.
Finite state machine 106 comprises according to being input to the value configuration switch 103 of finite state machine and 109 function.
The state of the finite state machine shown in Fig. 2 among Fig. 1 and possible transfer.
Transfer between the state is carried out according to following description.Utilize following definitions:
F
E: the VAD sign during coding
F
D: the VAD sign during decoding
P
E: the signal energy during coding
P
D: the signal energy during decoding
Hangover: from the determine switch direction to the time of carrying out switch.This time, necessary long enough was with the compensation indoor echo.
201.F
E=1AND F
D=0 OR F
E=1 and P
E>P
D, hangover=0
202.F
E=0, hangover=600ms
203.F
D=1 AND F
E=0 OR F
D=1 and P
D>P
E, hangover=0
204.F
D=0, hangover=600ms
205.F
D=1 AND P
D>P
E, hangover=600ms
205.F
E=1 AND P
E>P
D, hangover=600ms
In state TRANSMITTING (transmission) 207, start-up control is from the switch of microphone transporting speech signal and by the switch of control transmission voice signal to loud speaker.In state RECEIVING (reception) 208, by controlling the switch that transmits to loud speaker from the switch and the start-up control of microphone transporting speech signal.In IDLE (free time) state 209, two switches all end.
Described the present invention like this, apparent available multiple mode changes the present invention.And this change does not think to depart from spirit of the present invention and scope, all is intended to be included within the scope of following claim for the conspicuous all such modifications of person skilled in the art person.
Claims (9)
1. method that is used for when the phone application transporting speech reducing echo, described phone application comprises loud speaker and microphone, it is characterized in that, finite state machine influences opening or closing of described loud speaker and microphone according to the signal characteristic that reaches described loud speaker from the signal characteristic of described microphone.
2. according to the method for claim 1, wherein said phone application comprises at least one VAD unit, a GSM encoder and a GSM decoder, it is characterized in that, to pass to described finite state machine from the VAD sign of the signal of microphone will represent to pass to described finite state machine from first value of the representation signal energy in the signal of microphone, the 2nd VAD sign that arrives the signal of loud speaker is passed to described finite state machine, second value of the energy in the signal of expression arrival loud speaker is passed to described finite state machine, according to described VAD sign, described the 2nd VAD sign, described first value and described second value, described finite state machine influence control is from first switch of the transmission of the described signal of described microphone, and described finite state machine is transferred to described secondary signal the second switch of described loud speaker.
3. according to the method for claim 2, it is characterized in that, to pass to described GSM encoder from the first sampling voice signal of described microphone, the first long-term predictor lagged value is passed to a VAD unit, to pass to a described VAD unit from first auto-correlation coefficient of a described GSM encoder, to pass to described finite state machine from first Boolean denotation of a described VAD unit, expression is passed to described finite state machine from first value of the energy of the signal of described microphone from a described VAD unit, receive the second sampling coded speech signal, described second voice signal is passed to the GSM decoder, to pass to the 2nd VAD unit from the second long-term predictor lagged value of described second voice signal, calculate second auto-correlation coefficient and pass to described the 2nd VAD unit, second value of the energy in described second voice signal of expression is passed to described finite state machine from described the 2nd VAD unit, second Boolean denotation is passed to described finite state machine from described VAD unit, and described finite state machine is according to described first Boolean denotation, described second Boolean denotation, described first value and described second value, the control influence is from first switch of the transmission of the described first sampling coded speech signal of described microphone, and the described second decoding voice signal of influence is to the second switch of the transmission of loud speaker.
4. according to the method for claim 2, it is characterized in that, if described finite state machine is got first state, control is arranged to allow this transmission from described first switch of the transmission of described microphone, and control transmission is arranged to not allow this transmission to the described second switch of loud speaker, if described finite state machine is got second state, control is arranged to not allow this transmission from described first switch of the transmission of described microphone, and will controls the described second switch of the transmission of loud speaker is arranged to allow this transmission.
5. according to the method for claim 4, it is characterized in that,, then described first and second switch all is arranged to identical state if described finite state machine is got the third state.
6. according to the method for claim 5, it is characterized in that, if if described first be masked as true with described second be masked as puppet or described first be masked as pseudo-and described first value greater than described second value, then described finite state machine switches to described first state from the described third state; If described first is masked as puppet and has pass by hang-over delay, then described finite state machine switches to the described third state from described first state; If if described second be masked as true and described first and be masked as puppet or described second and be masked as true and described second value greater than described first value, then described finite state machine switches to described second state from the described third state; If described second is masked as pseudo-and has pass by described hang-over delay, then described finite state machine switches to the described third state from described second state; If described second be masked as true and described second value greater than described first value with pass by described hang-over delay, then described finite state machine switches to described second state from described first state; If described first is masked as true and described first value greater than described second value and pass by described hang-over delay, then described finite state machine switches to described first state from described second state.
7. according to the method for claim 6, it is characterized in that described hang-over delay is 600ms.
8. device that is used for when the phone application transporting speech reducing echo, described phone application comprises loud speaker and microphone, it is characterized in that described phone application comprises the finite state machine that opens or closes that is configured to according to influence described loud speaker and microphone from the signal characteristic of described microphone and the signal characteristic that arrives described loud speaker.
9. one kind is configured to phone application transport and the personal computer that receives speech, described phone application comprises the echo that is used to reduce described speech, described phone application comprises loud speaker and microphone, it is characterized in that described phone application comprises that basis reaches the finite state machine that opens or closes that influences described loud speaker and microphone from the signal characteristic of described loud speaker from the signal characteristic of described microphone.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9700873A SE511650C2 (en) | 1997-03-11 | 1997-03-11 | Method and apparatus for reducing echo in a telephone application |
SE97008734 | 1997-03-11 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1255255A true CN1255255A (en) | 2000-05-31 |
Family
ID=20406109
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 98804832 Pending CN1255255A (en) | 1997-03-11 | 1998-02-24 | Echo reducing phone with state machine controlled switches |
Country Status (9)
Country | Link |
---|---|
EP (1) | EP0974205A1 (en) |
JP (1) | JP2001514823A (en) |
CN (1) | CN1255255A (en) |
AU (1) | AU735505B2 (en) |
BR (1) | BR9808240A (en) |
CA (1) | CA2283590A1 (en) |
SE (1) | SE511650C2 (en) |
TW (1) | TW407435B (en) |
WO (1) | WO1998040974A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101145803B (en) * | 2007-09-06 | 2012-09-05 | 杭州华三通信技术有限公司 | A method and device for separating echo reflection |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000056036A2 (en) * | 1999-03-15 | 2000-09-21 | Vocaltec Communications Ltd. | Echo suppression device and method for performing the same |
US6754337B2 (en) * | 2002-01-25 | 2004-06-22 | Acoustic Technologies, Inc. | Telephone having four VAD circuits |
US7020257B2 (en) * | 2002-04-17 | 2006-03-28 | Texas Instruments Incorporated | Voice activity identiftication for speaker tracking in a packet based conferencing system with distributed processing |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4897832A (en) * | 1988-01-18 | 1990-01-30 | Oki Electric Industry Co., Ltd. | Digital speech interpolation system and speech detector |
GB2256351B (en) * | 1991-05-25 | 1995-07-05 | Motorola Inc | Enhancement of echo return loss |
FI110826B (en) * | 1995-06-08 | 2003-03-31 | Nokia Corp | Eliminating an acoustic echo in a digital mobile communication system |
-
1997
- 1997-03-11 SE SE9700873A patent/SE511650C2/en not_active IP Right Cessation
-
1998
- 1998-02-24 BR BR9808240-0A patent/BR9808240A/en not_active IP Right Cessation
- 1998-02-24 EP EP98909895A patent/EP0974205A1/en not_active Withdrawn
- 1998-02-24 JP JP53949498A patent/JP2001514823A/en active Pending
- 1998-02-24 CA CA002283590A patent/CA2283590A1/en not_active Abandoned
- 1998-02-24 AU AU64264/98A patent/AU735505B2/en not_active Ceased
- 1998-02-24 WO PCT/SE1998/000332 patent/WO1998040974A1/en not_active Application Discontinuation
- 1998-02-24 CN CN 98804832 patent/CN1255255A/en active Pending
- 1998-03-06 TW TW87103302A patent/TW407435B/en not_active IP Right Cessation
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101145803B (en) * | 2007-09-06 | 2012-09-05 | 杭州华三通信技术有限公司 | A method and device for separating echo reflection |
Also Published As
Publication number | Publication date |
---|---|
BR9808240A (en) | 2000-05-16 |
JP2001514823A (en) | 2001-09-11 |
SE9700873D0 (en) | 1997-03-11 |
EP0974205A1 (en) | 2000-01-26 |
TW407435B (en) | 2000-10-01 |
WO1998040974A1 (en) | 1998-09-17 |
AU6426498A (en) | 1998-09-29 |
AU735505B2 (en) | 2001-07-12 |
CA2283590A1 (en) | 1998-09-17 |
SE9700873L (en) | 1998-09-12 |
SE511650C2 (en) | 1999-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1123118C (en) | Muting microphone in radiocommunication systems | |
CN1097360C (en) | Acoustic echo elimination in a digital mobile communications system | |
EP1346553B1 (en) | Audio signal quality enhancement in a digital network | |
EP1293072B1 (en) | Arrangement and method relating to communication of speech | |
JP3182032B2 (en) | Voice coded communication system and apparatus therefor | |
KR20020071851A (en) | Speech recognition technique based on local interrupt detection | |
CN1266312A (en) | Method and apparatus for provding speaking telephone operation in portable communication equipment | |
JPH05160773A (en) | Voice communication equipment | |
KR100421152B1 (en) | Acoustic echo cancellation method and apparatus in a communication system for tty/tdd service | |
US20100054454A1 (en) | Method and apparatus for the detection and suppression of echo in packet based communication networks using frame energy estimation | |
CN1551516A (en) | Apparatus and method for controlling noise in mobile communication terminal | |
JP2576690B2 (en) | Digital mobile phone | |
CN1255255A (en) | Echo reducing phone with state machine controlled switches | |
KR100298317B1 (en) | Communication method for portable phone | |
JP3173639B2 (en) | Background noise update system and method | |
JP2979859B2 (en) | Digital mobile radio equipment | |
JPH09200308A (en) | Communication terminal equipment | |
KR19980031897A (en) | Mailing method using voice mailbox | |
CN100496077C (en) | Device and method for reducing white noise of mobile communication terminal | |
JPH04278757A (en) | Device for transmitting voice data | |
JPH0766757A (en) | Method for canceling echo | |
JP2974427B2 (en) | Voice communication system and voice communication device | |
KR100283078B1 (en) | Signal Detection Method in Speech Coder and Packet Transmission Method Using the Same | |
JP2983829B2 (en) | Digital mobile phone | |
KR100927289B1 (en) | Mobile communication system and method for transmitting and receiving voice packet |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |