EP0996112A2 - Silence compression coding/decoding method and device - Google Patents

Silence compression coding/decoding method and device

Info

Publication number
EP0996112A2
EP0996112A2 EP99120614A EP99120614A EP0996112A2 EP 0996112 A2 EP0996112 A2 EP 0996112A2 EP 99120614 A EP99120614 A EP 99120614A EP 99120614 A EP99120614 A EP 99120614A EP 0996112 A2 EP0996112 A2 EP 0996112A2
Authority
EP
European Patent Office
Prior art keywords
background noise
compression coding
frame
threshold value
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP99120614A
Other languages
German (de)
English (en)
Other versions
EP0996112A3 (fr)
Inventor
Mieko Yui (NEC Corporation)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Electronics Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp
Publication of EP0996112A2
Publication of EP0996112A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012 Comfort noise or silence coding

Definitions

  • The present invention relates to a silence compression coding/decoding method and a device for the same. More particularly, in digital image/speech transmission, it relates to a method and device that select, at the transmission side, either silence compression coding or speech compression coding (hereinafter also referred to as regular coding processing) according to the background noise level of the input speech signal, transmit the coded signal, and decode it at the reception side.
  • In such a system, an analog image/speech input signal is sampled and quantized, then coded and compressed in conformity with the transmission protocol before transmission.
  • For example, a video signal with a 4 MHz band is PCM-coded and compressed to 1.5 Mbit/s (ITU-T / MPEG-1 Standard, Low Bit Rate Video Image Coding System Recommendation) for transmission through ISDN or PSTN.
  • At the reception side, the received compressed signal is decoded back into a time-discrete PAM pulse string, and the analog image signal is then generated by low-pass filtering.
  • An in-active interval is, for example, a break in conversation, during which only silence and background noise are present.
  • For such intervals, the speech transmission method raises the compression coding ratio and reduces the amount of coded data, greatly reducing the transmission band in order to enhance transmission efficiency.
  • Whether an interval is in-active or active is discriminated by establishing a threshold value and comparing the sampled values with it.
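The threshold discrimination described above can be sketched as follows. This is a minimal illustration rather than the procedure of any particular codec: the function name `is_active_frame`, the use of the peak sample magnitude as the compared quantity, and the threshold of 0.1 are all assumptions made for the example.

```python
def is_active_frame(samples, threshold):
    """Return True if the frame is treated as an active interval.

    Assumption: a frame counts as active when its largest sample
    magnitude exceeds the threshold; otherwise it is in-active.
    """
    if not samples:
        return False  # an empty frame carries no speech
    return max(abs(s) for s in samples) > threshold


speech_like = [0.0, 0.4, -0.6, 0.3]
pause_like = [0.01, -0.02, 0.015, 0.0]
print(is_active_frame(speech_like, 0.1))  # True
print(is_active_frame(pause_like, 0.1))   # False
```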
  • Because the compression ratio is large, the transmission band becomes narrow; that is, little data is transmitted. As a result, the difference between the input speech signal (original sound) before transmission and the data decompressed at decoding becomes large, and the sound quality of the background noise in particular deteriorates easily.
  • Fig. 1 is a block diagram showing the constitution of a device for a conventional silence compression coding/decoding method.
  • The device codes a digital input speech signal using a coder 1 and transmits it to a decoder 3 through a transmission line 2.
  • The decoded analog output speech signal is then transmitted.
  • The coder 1 and the decoder 3 are fixed devices, each provided with either a silence compression coding function or a non-silence-compression coding function.
  • The coder 1 notifies the decoder 3 of the active/in-active flags through the transmission line 2, and either regular decoding or decompression decoding is performed accordingly.
  • The device raises the compression coding ratio of the in-active interval, reducing the transmission band and enhancing transmission efficiency.
  • However, this processing is compression coding designed for the speech signal. Consequently, when the compression ratio is large, the sound quality of the background noise, as opposed to the speech signal, deteriorates markedly. This is a main cause of degraded background noise in video telephone and video conference systems.
  • Japanese Patent Application Laid-Open No. HEI 4-357735 discloses a device that, on detecting an in-active interval at the transmission side, transmits a packet carrying the noise level. At the reception side, background noise is inserted into the in-active interval according to the received noise level, so that regeneration is appropriate and no noise level difference arises at the transition from the in-active interval to the active interval.
  • Japanese Patent Application Laid-Open No. HEI 9-162892, "ATM IN-ACTIVE COMPRESSION METHOD", discloses a device that detects and stores in-active parts within the 47-byte payload data of an ATM cell and regenerates them on demand. The device thus avoids generating ATM cells that would be wasted transmission, reduces the sense of incompatibility at regeneration, and uses the transmission band efficiently.
  • In the conventional examples described above, appropriate regeneration (decoding) is possible, and no difference in noise level arises at the transition from the in-active interval to the active interval. Further, unnecessary ATM cells are not transmitted, so the transmission band is used efficiently and the sense of incompatibility at regeneration is reduced.
  • However, the conventional device always applies silence compression coding to the background noise before transmitting it. When this is decoded, the sound quality of the background noise therefore deteriorates at regeneration, and the call becomes disagreeable to listen to.
  • An object of the present invention is to provide a silence compression coding/decoding method and device capable of selecting either silence compression coding processing or speech compression coding processing (regular coding processing) according to the background noise level of the input speech signal, so that the sound quality of the background noise at decoding is improved, the transmission efficiency is improved, the device scale is reduced, and freedom of constitution (design) is obtained.
  • a compression coding / decoding method in which an in-active interval of an input speech signal is subjected to compression coding and subsequently to decompression decoding, comprising the steps of: picking out a background noise component in every input frame; comparing the background noise level of that component with a discrimination threshold value in every input frame; and selecting, in every input frame, regular coding processing when the background noise level exceeds the discrimination threshold value and compression coding processing when it does not.
  • a compression coding / decoding method in which an in-active interval of an input speech signal is subjected to compression coding and subsequently to decompression decoding, comprising the steps of: picking out a background noise component of a first frame of the input speech signal; comparing the background noise level of that component with a discrimination threshold value; and selecting, for subsequent input frames, regular coding processing when the background noise level of the first frame exceeds the discrimination threshold value and compression coding processing when it does not.
  • a silence compression coding / decoding method wherein the discrimination threshold value is any of a fixed value determined in advance, a semi-fixed value established manually, and a variable value established automatically by a control signal from an external section.
  • a silence compression coding / decoding method wherein the coding side notifies the decoding side of either an in-active flag or an active flag, indicating silence compression coding processing or speech compression coding processing respectively, and the decoding side performs either decompression decoding processing or non-decompression decoding processing according to this notification.
  • a silence compression coding / decoding device having a coder and decoder, for implementing compression coding to an in-active interval of an input speech signal, before implementing decompression decoding thereto
  • the coder comprises a coding means for coding an input speech signal to output in every frame, a compression coding means for compression coding the input speech signal to output in every frame, a background noise measuring means for picking out a background noise component in every frame of the input speech signal, a background noise level discrimination means for comparing a background noise level from the background noise measuring means with a discrimination threshold value in every frame, and a selecting means, in the background noise level discrimination means, for selecting the coding means in every frame when background noise component level exceeds the discrimination threshold value, while when the background noise level does not exceed the discrimination threshold value, the selecting means selects the compression coding means in every frame.
  • a silence compression coding / decoding device having a coder and decoder, for implementing compression coding to an in-active interval of an input speech signal, before implementing decompression decoding thereto
  • the coder comprises a coding means for coding an input speech signal to output, a compression coding means for compression coding the input speech signal to output, a background noise measuring means for picking out a background noise component from the input speech signal in a first frame, a background noise level discrimination means for comparing a background noise level from the background noise measuring means with a discrimination threshold value in the first frame, and a selecting means, in the background noise level discrimination means, for selecting the coding means in subsequent frame when background noise component level of the first frame exceeds the discrimination threshold value, while when the background noise level of the first frame does not exceed the discrimination threshold value, said selecting means selects said compression coding means in subsequent frame.
  • a silence compression coding / decoding device further comprising an establishing means for establishing the discrimination threshold value as a fixed value determined beforehand.
  • a silence compression coding / decoding device further comprising a manual operation variable establishing means for establishing the discrimination threshold value as a semi-fixed value according to manual operation establishment.
  • a silence compression coding / decoding device further comprising an automatic variable establishing means for automatically establishing variably the discrimination threshold value according to control signal from an external section.
  • a silence compression coding / decoding device wherein the background noise measuring means picks out the background noise component by extracting the speech band component from the frame of the input speech signal and then subtracting that speech band component from the frame.
  • a silence compression coding / decoding device wherein, when the coder performs coding processing, the coder transmits to the decoder either an in-active flag or an active flag indicating silence compression coding processing or speech compression coding processing, the decoder performs either decompression decoding processing or non-decompression decoding processing according to this notification, and the coder stores the in-active flag or the active flag in the frame before transmission in order to notify the decoder.
  • The device compares the background noise component level picked out in every frame of the input speech signal with the discrimination threshold value. When the level exceeds the threshold, the device selects regular coding processing; when it does not, the device selects silence compression coding processing. The device then transmits either the regular coded data or the silence compression coded data.
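The per-frame selection rule described above can be sketched as follows. This is only an illustration under stated assumptions: the names `CodingMode`, `select_mode` and `select_per_frame` are hypothetical, and the noise levels and threshold are arbitrary numbers standing in for the measured background noise component level and the discrimination threshold value.

```python
from enum import Enum


class CodingMode(Enum):
    REGULAR = "regular coding"
    SILENCE_COMPRESSION = "silence compression coding"


def select_mode(noise_level, threshold):
    """Regular coding when the noise level exceeds the threshold,
    silence compression coding otherwise (equality does not exceed)."""
    if noise_level > threshold:
        return CodingMode.REGULAR
    return CodingMode.SILENCE_COMPRESSION


def select_per_frame(noise_levels, threshold):
    """Apply the comparison independently to every frame."""
    return [select_mode(level, threshold) for level in noise_levels]


modes = select_per_frame([0.20, 0.05, 0.10], threshold=0.10)
print([m.value for m in modes])
```

Loud background noise is therefore sent through the regular coder, and only quiet background noise is silence-compressed.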
  • The device is thus capable of selecting either silence compression coding processing or speech compression coding processing (regular coding processing) according to the background noise level of the input speech signal in video image/speech transmission and the like.
  • The silence compression coding processing or the speech compression coding processing is selected by comparing the background noise component level 'Sm' with the discrimination threshold value data 'Srf', so that the decoded sound does not become disagreeable to listen to.
  • The transmission efficiency deteriorates slightly in comparison with the case where all background noise undergoes silence compression coding processing, but it does not deteriorate greatly, so the gain in transmission efficiency is largely retained. Further, the sound quality of the background noise does not deteriorate, so speech quality is improved.
  • In another aspect, the silence compression coding/decoding method and device of the present invention compare the background noise level of the background noise component in the first frame of the input speech signal with the discrimination threshold value. During a call the speaker does not move, and the background noise of the surrounding environment (for instance, the place where the video telephone is installed, the conference room, or the inside of a car) changes little. Therefore, either the silence compression coding processing or the speech compression coding processing is selected using the first frame and then kept fixed.
  • The transmission efficiency and the sound quality of the background noise are improved, and because regular coding processing or silence compression coding processing is not selected anew in every frame, the data processing is simplified and the device size is reduced.
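The first-frame variant can be sketched in the same spirit. The function name `select_once_from_first_frame` and the string labels are hypothetical, and applying the decision to the first frame itself as well as to the subsequent frames is an assumption of this sketch.

```python
def select_once_from_first_frame(noise_levels, threshold):
    """Decide the coding mode from the first frame only and keep it fixed.

    Assumption: the same decision is applied to the first frame itself
    and to every frame that follows it.
    """
    if not noise_levels:
        return []
    if noise_levels[0] > threshold:
        mode = "regular"
    else:
        mode = "silence_compression"
    # Later noise levels are deliberately ignored: no per-frame comparison.
    return [mode] * len(noise_levels)


print(select_once_from_first_frame([0.30, 0.01, 0.02], threshold=0.10))
```

Compared with per-frame selection, only one comparison is made per call, which is the simplification the text attributes to this variant.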
  • The discrimination threshold value is established as a fixed value, as a semi-fixed value by manual setting, or variably and automatically by a control signal from an external section.
  • The optimum discrimination threshold value can thus be established variably according to the background noise of the surrounding environment in which the device is used (for instance, the location of the video telephone, the location of the conference, or the inside of a car). When the device is applied to a video telephone system, video conference system, or digital telephone system, an optimum constitution (design) can be achieved that takes the background noise of each use environment into account.
  • Fig. 2 is a block diagram showing the constitution of a silence compression coding/decoding method and device according to a first embodiment of the present invention.
  • The first embodiment is applied to a video telephone system and a video conference system.
  • When the device codes and transmits the input speech signal 'Sd', a silence compression coding selector 11 transmits either a silence compression coding output speech signal 'Sno' or a regular coding output speech signal 'So' according to the background noise level.
  • The silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' is obtained by selecting and applying either the silence compression coding processing or the speech compression coding processing (regular coding processing) according to the background noise level.
  • The silence compression coding selector 11 stores an in-active/active flag in the frame in order to notify the decoding side.
  • The in-active/active flag indicates whether a frame of the input speech signal is in-active or active.
  • The present embodiment is further provided with a transmission line 12, which is a communication telephone network such as ISDN or PSTN or a private dedicated line, and a decoder 13. The decoder 13 decompresses and decodes the silence compression coding output speech signal 'Sno' received through the transmission line 12 from the silence compression coding selector 11, or decodes the regular coding output speech signal 'So', and transmits the resulting analog output speech signal 'Sa'.
  • the decoder 13 receives the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' through the transmission line 12 from the silence compression coding selector 11.
  • The decoder 13 discriminates whether the received signal 'Sno' or 'So' was produced by the silence compression coding processing or by the regular coding processing, according to the in-active/active flag that is stored in the frame and transmitted by the upper layer. According to this discrimination, it performs either decompression decoding or regular decoding and transmits the resulting analog output speech signal 'Sa'.
  • The silence compression coding selector 11 is provided with a background noise measuring section 21 and a background noise level discrimination section 22. The background noise measuring section 21 filters every frame of the input speech signal 'Sd', using a band pass filter or the like, to separate the speech band from the background noise, and transmits a background noise component level 'Sm' indicating the volume (level) of the background noise. The background noise level discrimination section 22 compares the background noise component level 'Sm' with discrimination threshold value data 'Srf' and transmits a switching signal 'Sw' that selects either the silence compression coding processing or the regular coding processing for the input speech signal 'Sd' (frame).
  • The silence compression coding selector 11 further comprises: a silence compression correspondence coder 23, which transmits the silence compression coding output speech signal 'Sno' produced by compression coding every frame of the input speech signal 'Sd'; a coder 24, which transmits the regular coding output speech signal 'So' produced by regular coding processing, without silence compression coding, of every frame of the input speech signal 'Sd'; a continuous frame discrimination section 25, which transmits a continuous frame discrimination signal 'Sp' indicating that the input speech signal 'Sd' is a continuous signal; a discrimination threshold value establishing section 26, which transmits the discrimination threshold value data 'Srf' used as the comparison reference for selecting either the silence compression coding processing or the regular coding processing according to the background noise component level 'Sm'; and switches SW1 and SW2, which use the switching signal 'Sw' to select the input side and the output side of either the silence compression correspondence coder 23 or the coder 24.
  • the discrimination threshold value establishing section 26 is provided with a variable establishing section VR for variably establishing level (threshold value) of the discrimination threshold value data 'Srf'.
  • The input speech signal 'Sd' is supplied to the movable contact of the switch SW1, whose fixed contacts are connected to the silence compression correspondence coder 23 and the coder 24. Switched by the switching signal 'Sw' from the background noise level discrimination section 22, SW1 supplies the input speech signal 'Sd' to one of the two coders.
  • The movable contact of the switch SW2 is likewise switched by the switching signal 'Sw' from the background noise level discrimination section 22 and connected to one of its fixed contacts.
  • These fixed contacts are connected to the output terminals of the silence compression correspondence coder 23 and the coder 24, so that either the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' is selected and transmitted to the transmission line 12.
  • The input speech signal 'Sd' is supplied to the background noise measuring section 21 and the continuous frame discrimination section 25.
  • A continuous frame discrimination signal 'Sp', indicating that the input speech signal 'Sd' consists of continuous frames, is output to the discrimination threshold value establishing section 26.
  • The speech band component is extracted from every input frame by band pass filtering. The background noise component is then picked out by a balance processing in which the speech band component data is phase-inverted and added to the input speech signal 'Sd'.
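The extraction step described above (band-pass the frame, phase-invert the speech band component, add it back to the input) can be sketched as follows. This is a minimal illustration under assumptions, not the filter of the embodiment: the band edges of 300 Hz and 3400 Hz, the idealized DFT-based band pass, the RMS level measure, and the function names are all hypothetical choices for the example.

```python
import cmath
import math


def extract_speech_band(frame, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Idealized band pass: zero every DFT bin outside [low_hz, high_hz]."""
    n = len(frame)
    spectrum = [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    for k in range(n):
        freq = min(k, n - k) * sample_rate / n  # fold negative frequencies
        if not (low_hz <= freq <= high_hz):
            spectrum[k] = 0
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
             for k in range(n)) / n).real
        for t in range(n)
    ]


def background_noise_level(frame, sample_rate):
    """RMS of what remains after the speech band is cancelled."""
    band = extract_speech_band(frame, sample_rate)
    inverted = [-b for b in band]                      # phase inversion
    residual = [x + i for x, i in zip(frame, inverted)]  # addition
    return math.sqrt(sum(r * r for r in residual) / len(residual))


fs, n = 8000, 160
tone = [math.sin(2 * math.pi * 1000 * t / fs) for t in range(n)]      # in band
hum = [0.1 * math.sin(2 * math.pi * 50 * t / fs) for t in range(n)]   # out of band
mixed = [a + b for a, b in zip(tone, hum)]
print(round(background_noise_level(mixed, fs), 4))
```

In this toy frame the 1000 Hz tone is cancelled and only the 50 Hz hum contributes to the measured level.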
  • the background noise component level 'Sm' picked out from the background noise measuring section 21 is inputted to the background noise level discrimination section 22.
  • The discrimination threshold value establishing section 26 transmits the discrimination threshold value data 'Srf' to the background noise level discrimination section 22 in response to the continuous frame discrimination signal 'Sp', that is, continuously in every frame.
  • The discrimination threshold value data 'Srf' from the discrimination threshold value establishing section 26 is adjusted and established by a variable establishing section VR described later.
  • When the background noise component level 'Sm' from the background noise measuring section 21 exceeds the discrimination threshold value data 'Srf' from the discrimination threshold value establishing section 26, the background noise level discrimination section 22 transmits to the switches SW1 and SW2 the switching signal 'Sw' that selects the coder 24. When the background noise component level 'Sm' does not exceed the discrimination threshold value data 'Srf', it transmits to the switches SW1 and SW2 the switching signal 'Sw' that selects the silence compression correspondence coder 23.
  • That is, the regular coding processing is selected when 'Sm' exceeds 'Srf', and the silence compression coding processing is selected when it does not.
  • Either the regular coding output speech signal 'So' from the coder 24 or the silence compression coding output speech signal 'Sno' from the silence compression correspondence coder 23 is transmitted to the decoder 13 through the transmission line 12.
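The routing performed by the switching signal and the two switches can be sketched as a small selector. The two coders below are toy stand-ins introduced only for illustration (they are not real codecs), and the names `make_selector`, `toy_regular_coder` and `toy_silence_coder` are hypothetical.

```python
def make_selector(silence_coder, regular_coder):
    """Build an encoder that routes each frame to one of two coders."""
    def encode(frame, sm, srf):
        select_regular = sm > srf  # plays the role of switching signal 'Sw'
        # SW1: route the input frame; SW2: take the same coder's output.
        coder = regular_coder if select_regular else silence_coder
        return coder(frame)
    return encode


def toy_regular_coder(frame):
    """Stand-in for coder 24: passes the samples through unchanged."""
    return ("So", list(frame))


def toy_silence_coder(frame):
    """Stand-in for coder 23: keeps only the frame length."""
    return ("Sno", len(frame))


encode = make_selector(toy_silence_coder, toy_regular_coder)
print(encode([1, 2, 3], sm=0.5, srf=0.1))   # ('So', [1, 2, 3])
print(encode([1, 2, 3], sm=0.05, srf=0.1))  # ('Sno', 3)
```

Because both switches follow the same signal, the input and output sides always refer to the same coder.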
  • When the regular coding output speech signal 'So' is input from the coder 24 of the silence compression coding selector 11, the decoder 13 performs regular decoding processing, without decompression processing, and transmits the analog output speech signal 'Sa'.
  • When the silence compression coding output speech signal 'Sno' is input from the silence compression correspondence coder 23 of the silence compression coding selector 11, the decoder 13 performs decompression processing and transmits the analog output speech signal 'Sa'.
  • The silence compression coding selector 11 stores the in-active/active flag, indicating whether the frame is in-active or active, in the frame in order to notify the decoder 13.
  • According to the notified active flag, the decoder 13 treats the regular coding output speech signal 'So' as speech compression coding data, decodes it, and transmits the analog output speech signal 'Sa'.
  • According to the notified in-active flag, the decoder 13 treats the silence compression coding output speech signal 'Sno' as silence compression coding data, decompresses and decodes it, and transmits the analog output speech signal 'Sa'.
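The flag-driven decoding described above can be sketched end to end. Everything specific here is an assumption for illustration: the `CodedFrame` layout, the flag values, the pass-through stand-in for regular coding, and the stand-in silence compression that keeps only a mean magnitude and is "decompressed" as a constant-level fill.

```python
from dataclasses import dataclass
from typing import List

ACTIVE_FLAG = 1    # frame carries regular (speech compression) coded data
INACTIVE_FLAG = 0  # frame carries silence compression coded data


@dataclass
class CodedFrame:
    flag: int             # stored in the frame to notify the decoder
    length: int           # number of samples in the original frame
    payload: List[float]


def encode_frame(samples, noise_level, threshold):
    if noise_level > threshold:
        # Stand-in regular coding: pass the samples through.
        return CodedFrame(ACTIVE_FLAG, len(samples), list(samples))
    # Stand-in silence compression: keep one number per frame.
    mean_abs = sum(abs(s) for s in samples) / len(samples) if samples else 0.0
    return CodedFrame(INACTIVE_FLAG, len(samples), [mean_abs])


def decode_frame(frame):
    if frame.flag == ACTIVE_FLAG:
        return list(frame.payload)               # regular decoding
    return [frame.payload[0]] * frame.length     # decompression decoding


samples = [0.5, -0.5, 0.25, -0.25]
print(decode_frame(encode_frame(samples, noise_level=0.3, threshold=0.1)))
print(decode_frame(encode_frame(samples, noise_level=0.05, threshold=0.1)))
```

The decoder never inspects the noise level itself; it relies only on the flag carried with the frame.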
  • Either the silence compression coding processing or the speech compression coding processing is selected by comparing the background noise component level 'Sm' with the discrimination threshold value data 'Srf', so that the decoded sound does not become disagreeable to listen to.
  • When the background noise level is large, the background noise is coded by the speech compression coding processing and transmitted. The transmission efficiency therefore deteriorates slightly in comparison with the case where all background noise undergoes the silence compression coding processing, but it does not deteriorate greatly, so the gain in transmission efficiency is largely retained. Further, the background noise in the analog output speech signal 'Sa' decoded by the decoder 13 does not suffer deterioration of sound quality, so speech quality is improved.
  • By variable establishment in the discrimination threshold value establishing section 26, which is provided with the variable establishing section VR, the discrimination threshold value data 'Srf' is set to a level at which the selection between the silence compression coding processing and the speech compression coding processing ensures that the decoded analog output speech signal 'Sa' does not have a sound quality that is disagreeable to listen to.
  • The variable establishing section VR is set manually. Alternatively, when the device is applied to a video telephone system or a video conference system, the variable establishing section VR is set automatically, taking into account the background noise of the installation place, as described in a third embodiment.
  • Fig. 3 is a block diagram showing a constitution of the second embodiment.
  • the second embodiment is also applied to a video telephone system and a video conference system.
  • When the device codes and transmits the input speech signal 'Sd', a silence compression coding selector 11 transmits either a silence compression coding output speech signal 'Sno' or a regular coding output speech signal 'So'.
  • The silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' is obtained by selecting and applying either the silence compression coding processing or the speech compression coding processing (regular coding processing) according to the background noise level.
  • The silence compression coding selector 11 stores an in-active/active flag in the frame in order to notify the decoding side.
  • The in-active/active flag indicates whether a frame of the input speech signal is in-active or active.
  • The second embodiment is further provided with a transmission line 12, which is a communication telephone network such as ISDN or PSTN or a private dedicated line, and a decoder 13. The decoder 13 decompresses and decodes the silence compression coding output speech signal 'Sno' received through the transmission line 12 from the silence compression coding selector 11, or decodes the regular coding output speech signal 'So', and transmits the resulting analog output speech signal 'Sa'.
  • the decoder 13 receives the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' through the transmission line 12 from the silence compression coding selector 11.
  • the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' is discriminated whether each of them should be processed by the silence compression coding processing or the regular coding processing according to the in-active / active flag which is stored in this frame and is transmitted by upper layer. According to this discrimination, there is implemented either decompression decoding or regular decoding, thus decompression decoded analog output speech signal 'Sa' or regular decoded analog output speech signal 'Sa' is transmitted.
  • the silence compression coding selector 11 is provided with a background noise measuring section 21 for transmitting background noise component level 'Sm' indicating the volume (level) of the background noise while performing filtering-processing using band pass filter and so forth about speech band and the background noise in every frame of the input speech signal 'Sd', and a background noise level discrimination section 22 for transmitting switching signal 'Sw' for selecting either the silence compression coding processing or the regular coding processing concerning the input speech signal 'Sd' (frame), while comparing the background noise component level 'Sm' from the background noise measuring section 21 with discrimination threshold value data 'Srf".
  • the silence compression coding selector 11 further comprises a silence compression correspondence coder 23 for transmitting the silence compression coding output speech signal 'Sno' obtained by compression coding processing in the first frame of the input speech signal 'Sd', a coder 24 for transmitting the regular coding output speech signal 'So' obtained by regular coding processing, in which the silence compression coding processing is not implemented, in every frame of the input speech signal 'Sd', a first frame discrimination section 30 for discriminating the first frame of the input speech signal 'Sd' and transmitting a first frame discrimination signal 'Spa' indicating the discrimination result, a discrimination threshold value establishing section 26 for transmitting discrimination threshold value data 'Srf', which serves as the comparison reference for selecting either the silence compression coding processing or the regular coding processing according to the background noise component level 'Sm' from the background noise measuring section 21, and switches SW1, SW2 for selecting the input side and the output side of either the silence compression correspondence coder 23 or the coder 24 using the switching signal 'Sw'.
  • the discrimination threshold value establishing section 26 is provided with a variable establishing section VR for variably establishing level (threshold value) of the discrimination threshold value data 'Srf'.
  • the input speech signal 'Sd' is supplied to the movable contact of the switch SW1, and the switch SW1 is switched by the switching signal 'Sw' from the background noise level discrimination section 22 so that the input speech signal 'Sd' is supplied to either the silence compression correspondence coder 23 or the coder 24, which are connected to its fixed contacts.
  • the movable contact of the switch SW2 is switched by the switching signal 'Sw' from the background noise level discrimination section 22 and is thereby connected to one of its fixed contacts.
  • the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' from the output terminal of the silence compression correspondence coder 23 or the coder 24, which are connected to the fixed contacts, is thus selected and transmitted to the transmission line 12.
  • the first frame discrimination section 30 discriminates the first frame of the input speech signal 'Sd' and then transmits the first frame discrimination signal 'Spa' to the discrimination threshold value establishing section 26.
  • the first frame discrimination section 30 transmits the first frame discrimination signal 'Spa' only for the first frame and does not transmit it again after this first transmission.
  • the discrimination threshold value establishing section 26 transmits the discrimination threshold value data 'Srf' established in the variable establishing section VR toward the background noise level discrimination section 22 in the subsequent frame according to the first frame discrimination signal 'Spa'.
  • when the background noise component level 'Sm' measured in the first frame is inputted from the background noise measuring section 21, the background noise level discrimination section 22 compares it with the discrimination threshold value data 'Srf', and the resulting selection continues in the subsequent frames. Namely, when the background noise component level 'Sm' of the first frame exceeds the discrimination threshold value data 'Srf', the switching signal 'Sw' for selecting the coder 24 is transmitted to the switches SW1, SW2. Further, when the background noise component level 'Sm' of the first frame does not exceed the discrimination threshold value data 'Srf', the switching signal 'Sw' for selecting the silence compression correspondence coder 23 is transmitted to the switches SW1, SW2.
  • either the silence compression coding processing or the speech compression coding processing is selected using only the first frame, and this selection continues in the subsequent frames.
  • this relies on the background noise not changing during the speech. Namely, the speaker does not move from the place in the middle of the speech, and the surrounding environment (for instance, the place where the video telephone is installed, the place of a conference, the inside of a car, and so forth) shows little change in the background noise; therefore, either the silence compression coding processing or the speech compression coding processing is selected and fixed using the first frame. Consequently, the data processing does not become complicated, and the device size is reduced.
  • Fig. 4 is a block diagram showing a constitution of the third embodiment.
  • the third embodiment is applied to a video telephone system and a video conference system.
  • when the device codes and transmits the input speech signal 'Sd', a silence compression coding selector 11 transmits either the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So'.
  • the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' is obtained by selecting and performing either the silence compression coding processing or the speech compression coding processing (regular coding processing) according to the background noise level.
  • the silence compression coding selector 11 stores an in-active / active flag in the frame to notify the decoding side.
  • the in-active / active flag indicates whether a frame of the input speech signal is in-active or active.
  • the present third embodiment is provided with a transmission line 12 of a telephone communication network such as ISDN or PSTN, or a private dedicated line, and a decoder 13 for decompressing and decoding the silence compression coding output speech signal 'Sno' received through the transmission line 12 from the silence compression coding selector 11, or decoding the regular coding output speech signal 'So', and for transmitting the resulting analog output speech signal 'Sa'.
  • the decoder 13 receives the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' through the transmission line 12 from the silence compression coding selector 11.
  • for the received silence compression coding output speech signal 'Sno' or regular coding output speech signal 'So', the decoder 13 discriminates whether the frame should be processed as silence compression coded or as regular coded, according to the in-active / active flag which is stored in the frame and is transmitted by the upper layer. According to this discrimination, either decompression decoding or regular decoding is implemented, and the resulting analog output speech signal 'Sa' is transmitted.
  • the silence compression coding selector 11 is provided with a background noise measuring section 21, which filters the speech band and the background noise in every frame of the input speech signal 'Sd' using a band pass filter and so forth and transmits a background noise component level 'Sm' indicating the volume (level) of the background noise, and a background noise level discrimination section 22, which compares the background noise component level 'Sm' from the background noise measuring section 21 with discrimination threshold value data 'Srf' and transmits a switching signal 'Sw' for selecting either the silence compression coding processing or the regular coding processing for the input speech signal 'Sd' (frame).
  • the silence compression coding selector 11 further comprises a silence compression correspondence coder 23 for transmitting the silence compression coding output speech signal 'Sno' obtained by compression coding processing in every frame of the input speech signal 'Sd', a coder 24 for transmitting the regular coding output speech signal 'So' obtained by regular coding processing, in which the silence compression coding processing is not implemented, in every frame of the input speech signal 'Sd', a continuous frame discrimination section 25 for transmitting a continuous frame discrimination signal 'Sp' indicating that the input speech signal 'Sd' is a continuous signal, a discrimination threshold value establishing section 26 for transmitting discrimination threshold value data 'Srf', which serves as the comparison reference for selecting either the silence compression coding processing or the regular coding processing according to the background noise component level 'Sm' from the background noise measuring section 21, and switches SW1, SW2 for selecting the input side and the output side of either the silence compression correspondence coder 23 or the coder 24 using the switching signal 'Sw'.
  • the discrimination threshold value establishing section 26 is provided with an establishment control section 32 for variably establishing level (threshold value) of the discrimination threshold value data 'Srf' according to a control signal 'Sc' from an outer section.
  • the input speech signal 'Sd' is supplied to the movable contact of the switch SW1, and the switch SW1 is switched by the switching signal 'Sw' from the background noise level discrimination section 22 so that the input speech signal 'Sd' is supplied to either the silence compression correspondence coder 23 or the coder 24, which are connected to its fixed contacts.
  • the movable contact of the switch SW2 is switched by the switching signal 'Sw' from the background noise level discrimination section 22 and is thereby connected to one of its fixed contacts.
  • the silence compression coding output speech signal 'Sno' or the regular coding output speech signal 'So' from the output terminal of the silence compression correspondence coder 23 or the coder 24, which are connected to the fixed contacts, is thus selected and transmitted to the transmission line 12.
  • the third embodiment operates similarly to the first embodiment shown in Fig. 2. The difference is that the discrimination threshold value establishing section 26 automatically establishes the discrimination threshold value data 'Srf' at a level for selecting either the silence compression coding processing or the speech compression coding processing such that the sound quality (speech) obtained on decoding does not become disagreeable. For instance, when this device is applied to the video telephone system or the video conference system, the discrimination threshold value data 'Srf' of the background noise level is automatically established in consideration of the background noise level of each installation place.
  • the device receives the control signal 'Sc', which is capable of establishing the discrimination threshold value data 'Srf' so that deterioration of the sound quality of the background noise on decoding is reduced and the speech is agreeable to listen to.
  • the third embodiment can also be applied to the second embodiment.
  • the above describes the coding and the decoding in speech transmission for the case where this constitution is applied to the video telephone system or the video conference system.
  • it can also be applied to CODEC processing for the coding and the decoding in a cellular mobile telephone system (PDC / PHS / DECT and so forth) that implements only digital speech transmission.
  • the device compares the background noise component level extracted in every frame of the input speech signal with the discrimination threshold value. When the background noise component level exceeds the discrimination threshold value, the device selects the regular coding processing; when the background noise component level does not exceed the discrimination threshold value, the device selects the silence compression coding processing. The device then transmits either the regular coding data or the silence compression coding data.
  • the transmission efficiency decreases slightly, but not greatly, in comparison with the case where the whole background noise undergoes the silence compression coding processing, and it is improved in comparison with the case where the background noise is coded and transmitted by the speech compression coding processing. Further, the sound quality of the background noise at the time of decoding is improved, so that speech of high quality becomes possible.
  • since the background noise of the surrounding environment changes little during speech, the device compares the background noise component level in the first frame of the input speech signal with the discrimination threshold value, and the processing is fixed to either the regular coding processing or the silence compression coding processing in the subsequent frames.
  • the transmission efficiency and the sound quality of the background noise are improved, the data processing is simplified because the regular coding processing or the silence compression coding processing is not selected in every frame, and the device size is reduced.
  • the discrimination threshold value is established fixedly, or the discrimination threshold value is variably established automatically by the control signal from the outer section, or the discrimination threshold value is established automatically by control from the outer section.
  • the optimum discrimination threshold value can be established variably corresponding to the background noise of the surrounding environment in which the device is used, and when the device is applied to the video telephone system, video conference system or digital telephone system, an optimum constitution (design) can be achieved in consideration of the background noise of each use environment.
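
As an illustration of the per-frame selection described above, the following minimal Python sketch compares a background noise component level with a discrimination threshold value and picks a coding path for each frame. The names `Mode`, `select_mode` and `select_modes`, and the use of plain numbers for the levels, are assumptions made for this sketch and do not appear in the patent; the measurement of 'Sm' and the coders themselves are not modelled.

```python
from enum import Enum
from typing import Iterable, List


class Mode(Enum):
    """Coding path chosen for a frame (hypothetical names)."""
    REGULAR = "regular"                          # coder 24, output 'So'
    SILENCE_COMPRESSION = "silence_compression"  # coder 23, output 'Sno'


def select_mode(noise_level: float, threshold: float) -> Mode:
    """Play the role of the switching signal 'Sw' for one frame.

    noise_level stands in for 'Sm' and threshold for 'Srf'.
    """
    if noise_level > threshold:
        # Background noise exceeds the threshold: use regular coding.
        return Mode.REGULAR
    # Level does not exceed the threshold: use silence compression coding.
    return Mode.SILENCE_COMPRESSION


def select_modes(noise_levels: Iterable[float], threshold: float) -> List[Mode]:
    """Apply the comparison independently to every frame."""
    return [select_mode(level, threshold) for level in noise_levels]
```

For example, `select_modes([0.2, 0.9], 0.5)` selects silence compression coding for the first frame and regular coding for the second. A level equal to the threshold does not exceed it, so in this sketch it falls on the silence compression side.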

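The first-frame variant of the second embodiment, in which the comparison is made once and the selection then continues in the subsequent frames, can be sketched in the same hedged way. The class name `FirstFrameSelector`, its `select` method and the string labels are hypothetical and chosen only for this example; the patent describes hardware sections and switches, not this interface.

```python
from typing import Optional


class FirstFrameSelector:
    """Decide the coding path on the first frame and keep it afterwards."""

    def __init__(self, threshold: float) -> None:
        self.threshold = threshold               # stands in for 'Srf'
        self.use_regular: Optional[bool] = None  # undecided before frame 1

    def select(self, noise_level: float) -> str:
        """Return "regular" or "silence_compression" for the current frame."""
        if self.use_regular is None:
            # Only the first frame's level ('Sm') is compared with the threshold.
            self.use_regular = noise_level > self.threshold
        # Later frames reuse the stored decision; their levels are ignored.
        return "regular" if self.use_regular else "silence_compression"
```

Because the decision is stored after the first call, no comparison is made for later frames, which corresponds to the simplified data processing that the text attributes to this embodiment.
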
Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Analogue/Digital Conversion (AREA)
EP99120614A 1998-10-20 1999-10-18 Silence compression coding/decoding method and device Withdrawn EP0996112A3 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP29815298 1998-10-20
JP10298152A JP3061039B2 (ja) 1998-10-20 1998-10-20 Silence compression coding/decoding method and device therefor

Publications (2)

Publication Number Publication Date
EP0996112A2 (fr) 2000-04-26
EP0996112A3 (fr) 2002-06-26

Family

ID=17855879

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99120614A Withdrawn EP0996112A3 (fr) Silence compression coding/decoding method and device 1998-10-20 1999-10-18

Country Status (2)

Country Link
EP (1) EP0996112A3 (fr)
JP (1) JP3061039B2 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
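JP2013068532A (ja) * 2011-09-22 2013-04-18 Clarion Co Ltd Information terminal, server device, search system, and search method thereof
CN110325470B (zh) * 2017-03-03 2021-11-23 Mitsubishi Electric Building Techno-Service Co., Ltd. Center-side communication device and communication system for elevator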
US20200184991A1 (en) * 2018-12-05 2020-06-11 Pascal Cleve Sound class identification using a neural network

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5737695A (en) * 1996-12-21 1998-04-07 Telefonaktiebolaget Lm Ericsson Method and apparatus for controlling the use of discontinuous transmission in a cellular telephone

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BENYASSINE A ET AL: "ITU-T RECOMMENDATION G.729 ANNEX B: A SILENCE COMPRESSION SCHEME FOR USE WITH G.729 OPTIMIZED FOR V.70 DIGITAL SIMULTANEOUS VOICE AND DATA APPLICATIONS" IEEE COMMUNICATIONS MAGAZINE, IEEE SERVICE CENTER. PISCATAWAY, N.J, US, vol. 35, no. 9, 1 September 1997 (1997-09-01), pages 64-73, XP000704425 ISSN: 0163-6804 *
GAN X D ET AL: "Implementation of silence compression scheme for G.723.1 speech coder using TI TMS320C51 DSP chip" INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, 1997. ICICS., PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON SINGAPORE 9-12 SEPT. 1997, NEW YORK, NY, USA,IEEE, US, 9 September 1997 (1997-09-09), pages 1284-1287, XP010264023 ISBN: 0-7803-3676-3 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6954745B2 (en) 2000-06-02 2005-10-11 Canon Kabushiki Kaisha Signal processing system
US7010483B2 (en) 2000-06-02 2006-03-07 Canon Kabushiki Kaisha Speech processing system
US7035790B2 (en) 2000-06-02 2006-04-25 Canon Kabushiki Kaisha Speech processing system
US7072833B2 (en) 2000-06-02 2006-07-04 Canon Kabushiki Kaisha Speech processing system
US7653539B2 (en) 2004-02-24 2010-01-26 Panasonic Corporation Communication device, signal encoding/decoding method
WO2017039422A3 (fr) * 2015-09-04 2017-04-20 Samsung Electronics Co., Ltd. Signal processing methods and apparatuses for enhancing sound quality
US10803877B2 (en) 2015-09-04 2020-10-13 Samsung Electronics Co., Ltd. Signal processing methods and apparatuses for enhancing sound quality
US11380338B2 (en) 2015-09-04 2022-07-05 Samsung Electronics Co., Ltd. Signal processing methods and apparatuses for enhancing sound quality

Also Published As

Publication number Publication date
EP0996112A3 (fr) 2002-06-26
JP3061039B2 (ja) 2000-07-10
JP2000124915A (ja) 2000-04-28

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

17P Request for examination filed

Effective date: 20020620

AKX Designation fees paid

Designated state(s): DE FR GB IT

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NEC ELECTRONICS CORPORATION

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN

18W Application withdrawn

Effective date: 20030929