GB2348342A - Reducing the data rate of a speech signal by replacing portions of encoded speech with code-words representing recognised words or phrases - Google Patents

Reducing the data rate of a speech signal by replacing portions of encoded speech with code-words representing recognised words or phrases Download PDF

Info

Publication number
GB2348342A
GB2348342A GB9906763A GB9906763A GB2348342A GB 2348342 A GB2348342 A GB 2348342A GB 9906763 A GB9906763 A GB 9906763A GB 9906763 A GB9906763 A GB 9906763A GB 2348342 A GB2348342 A GB 2348342A
Authority
GB
United Kingdom
Prior art keywords
signal
speech
coding device
digital
analogue
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB9906763A
Other versions
GB9906763D0 (en
GB2348342B (en
Inventor
John Joseph Spicer
Peter Chambers
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Roke Manor Research Ltd
Original Assignee
Roke Manor Research Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Roke Manor Research Ltd filed Critical Roke Manor Research Ltd
Priority to GB9906763A priority Critical patent/GB2348342B/en
Publication of GB9906763D0 publication Critical patent/GB9906763D0/en
Priority to US09/535,989 priority patent/US6519560B1/en
Publication of GB2348342A publication Critical patent/GB2348342A/en
Application granted granted Critical
Publication of GB2348342B publication Critical patent/GB2348342B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0018Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

Described herein is a method and apparatus for reducing the bit rate of transmitted signals in a telecommunications system. This is achieved by operating a speech recognition device (212) in parallel with a speech coding device (210) in a transmitter terminal (200). An analogue input signal (204) from a microphone (202) is digitised in analogue-to-digital converter 206 to provide a digital signal (208) which is applied to both the speech coding device (210) and the speech recognition device (212). Output signal (220) for transmission by antenna (222) is formed by combining output signals (214, 216) from the speech coding device (210) and the speech recognition device (212) using switch (218), recognised speech being converted to codewords which replace packets or sequences of packets of the coded signal corresponding thereto. In a receiver terminal (250), antenna 252 receives the transmitted signal and passes it to both a speech coding device (256) and a speech generator (258) prior to being converted to an analogue signal (270) for speaker (272). Speech generator (258) recognises codewords present in the signal and converts them back to digital signals (262) and then overwrites digital signal (260) from speech coding device (256) via switch (264) to reconstruct the complete signal prior to being passed to digital-to-analogue converter (268). In alternative embodiments (Figs. 3, 4) the encoding and recognition devices operate in sequence.

Description

IMPROVEMENTS IN OR RELATING TO TELECOMMUNICATION SYSTEMS The present invention relates to improvements in or relating to telecommunication systems and is more particularly concerned with lowering the bit rate required for speech transmission in such systems.
Telecommunications systems include both'wired'systems, for example, standard telephony land lines, and'wireless'systems, for example, mobile or cellular systems, and each system requires that the bit rate for speech transmission be lowered to optimise its transmission carrying capacity. For example, a mobile system comprises a plurality of cells, each cell being defined by at least one base station and a plurality of mobile terminals and having a predetermined maximum capacity. When a connection is made between a pair of terminals, namely, a transmitting terminal and a receiving terminal, speech to be transmitted is input at a microphone in the transmitting terminal as an analogue signal. The analogue signal is digitised to provide a digital signal which has a predetermined bit rate, typically, 64kbit/s. The digital signal is then passed to a speech coding device which analyses the sounds in the signal and codes it for transmission on a predetermined channel to a receiver in the receiving terminal. Typically, the bit rate for the transmission is 8kbit/s. The receiver decodes the transmitted signal and converts it to an analogue signal which can then be broadcast through a speaker in the receiving terminal. The coded digital signal is normally transmitted from terminal to terminal via a base station.
When a plurality of terminals are transmitting at the same time in a cell, the cell may approach its predetermined maximum capacity and, as a result, the number of connections in that cell may be restricted. Furthermore, for a transmission bit rate of 8kbit/s, relatively high bandwidth channels may be required.
It is therefore an object of the present invention to provide a method of reducing the bandwidth requirements for speech transmission.
In accordance with one aspect of the present invention, there is provided a method for reducing transmission bit rate in a telecommunications system, the method comprising the steps of :- a) receiving a signal at a microphone; b) converting the received signal to a coded signal for transmission; c) recognising speech; and d) replacing parts of the coded signal with codewords representative of the recognised speech.
In one embodiment, steps b) and c) are carried out simultaneously. In another embodiment, step c) is carried out prior to step b). In a further embodiment, step b) is carried out prior to step c).
Advantageously, increased battery life may be achieved if a discontinuous transmit mode is used, lower bandwidth channels may be utilised, and higher cell capacity may be achieved. This is particularly true for code division multiple access (CDMA) systems.
The term'recognised speech'is intended to encompass not only words, phrases or sentences, but sounds which make up parts of words.
For a better understanding of the present invention, reference will now be made, by way of example only, to the accompanying drawings in which: Figure 1 is a block diagram of a conventional connection for speech transmission ; Figure 2 is a block diagram of one embodiment of a connection for speech transmission in accordance with the present invention; and Figure 3 is a block diagram of a second embodiment of a connection for speech transmission in accordance with the present invention.
The present invention is to be described with reference to a mobile or cellular telecommunications system, but it will readily be appreciated that it is not limited thereto and is equally applicable to all telecommunications systems in which speech transmission bit rates require to be lowered.
It will readily be appreciated that a speech coding device operates by analysing the sounds of a voice and sending that voice in a coded form which can be recreated at a receiver. A conventional speech coding device arrangement is shown in Figure 1.
In Figure 1, a transmitting mobile terminal 100 is shown. A voice message to be transmitted is input to a microphone 102 which produces an analogue signal 104 which corresponds to the input voice message. The analogue signal 104 is fed to an analogue-to-digital converter (ADC) 106 where it is digitised to produce digital signal 108. The digital signal 108 is pulse code modulated (PCM) and has a bit rate of 64kbit/s. Signal 108 is fed to a speech coding device 110 where it is coded to provide a coded signal 112 for transmission by antenna 114 on a predetermined broadcast channel.
Also shown in Figure 1 is a receiving mobile terminal 150 which receives the coded signal 112 at receiving antenna 152. Received signal 154 from antenna 152 is passed to a speech decoding device 156 where it is decoded to produce decoded signal 158. Decoded signal 158 is fed to a digital-to-analogue converter (DAC) 160 to produce an analogue signal 162 which is then fed to a speaker 164.
It will readily be appreciated that for one mobile terminal, the transmitting antenna and receiving antenna may be the same. Moreover, a mobile terminal may transmit and receive on different channels.
A normal speech coding device operates by analysing the sounds of a voice and transmitting a coded form thereof by which the sounds can be recreated at a receiver. The coded signal may comprise a fixed number of bits or a packet which represent a fixed duration, for example, the coded signal may comprise digital packets in which 80 bits are required to transmit 10ms of speech, that is, a bit rate of 8kbit/s.
In one embodiment of the present invention, a speech recognition device is arranged to operate in parallel with a speech coding device. This is illustrated in Figure 2.
A transmitting mobile terminal 200 and a receiving mobile terminal 250 are shown in Figure 2. Transmitting terminal 200 comprises a microphone 202, an ADC 206, and coding device 210 which are identical to microphone 102, ADC 106 and coding device 110 of Figure 1. Microphone 202 provides an analogue signal 204 to ADC 206 which converts it into a digital signal 208. Digital signal 208 comprises a PCM signal having a bit rate of 64kbit/s. Signal 208 is input to coding device 210 and also to a speech recognition device 212 as shown. Respective output signals 214,216 from the coding device 210 and the speech recognition device 212 are passed to a switch 218. When the speech recognition device 212 recognises a word or phrase, it produces signal 216 which comprises a codeword corresponding to the recognised word or phrase. Switch 218 is operated by speech recognition device 212 to switch between signals 214 and 216 to form output signal 220 for transmission by transmitting antenna 222. Output signal 220 comprises a coded signal which is interspersed with codewords corresponding to words or phrases which have been recognised by the speech recognition device 212.
It will be appreciated that, as an alternative to the speech recognition device 212 controlling the switch 218, both may be connected to control means (not shown) which receives signals from the speech recognition device 212 to control the switch 218 in accordance with a recognised word or phrase. The control means may then operates the switch 218 so that the codewords are added into the output signal 220 to replace the code produced by the coding device 210 relating to the recognised word or phrase, prior to transmission by antenna 222.
Digital signal 208 has a bit rate of 64kbits/s as described above with reference to signal 108 in Figure 1. As mentioned above, signal 220 to be transmitted comprises digital packets and codewords, the codewords having 32 bits, for example. This means that, when codewords are used to replace some of the digital packets which are being transmitted at a bit rate of 8kbit/s, a bit rate of less than 8kbit/s can be achieved. Naturally, a further reduction in bit rate can be achieved if the codewords have less than 32 bits.
Receiving terminal 250 comprises an antenna 252 which receives the coded signal corresponding to a voice signal and provides signal 254 for coding device 256 and speech generator 258. Coding device 256 is identical to coding device 156 (Figure 1) and decodes signal 254 to provide a digital signal 260 corresponding thereto. However, coding device 256 cannot decode any codewords present in signal 254. Speech generator 258 recognises codewords which have been included in signal 254 and provides an output digital signal 262 corresponding thereto. Both digital signals 260, 262 are input to switch 264 which is controlled by the speech generator 258 to intersperse signals corresponding to recognised codewords with the standard coding. Digital signal 266 is fed to a DAC 268 where it is converted into analogue signal 270 for supply to speaker 272, DAC 268 and speaker 272 corresponding to DAC 160 and speaker 164 in Figure 1.
Speech generator 258 and switch 264 may be connected to control means (not shown) in a similar way to that described above in relation to the speech recognition device and the switch in the transmitting terminal. Here, the control means receives signals indicating that codewords have been recognised from the speech generator and uses these signals to control the operation of switch 264 to provide a reconstructed digital signal 266.
As mentioned above with reference to Figure 1, the transmitting antenna and receiving antenna may be the same for any one mobile terminal, and that terminal may transmit and receive on different channels.
In another embodiment of the present invention, a speech coder recognition device is utilised at the output of the speech coding device. This is illustrated in Figure 3.
A transmitting terminal 300 and a receiving terminal 350 are shown in Figure 3. Transmitting terminal 300 comprises a microphone 302, a ADC 306, and a speech coding device 310 which are identical to microphone 202, ADC 206 and speech coding device 210 of Figure 2. Microphone 302 produces analogue signal 304 which is converted to digital signal 308 by ADC 306. Speech coding device 310 converts the digital signal 308 into a coded signal 312 which is similar to coded signal 214 of Figure 2. Coded signal 312 is then passed to a speech coder recognition device 314 which produces an output signal 316 for transmission by transmitting antenna 318.
Output signal 316 comprises signal 312 which has been overwritten with codewords representative of recognised packets or sequences of packets present therein so that the bits corresponding to the recognised packets or sequences of packets are replaced with bits corresponding to the codewords to reduce the overall bit rate.
Receiving terminal 350 comprises an antenna 352 for receiving the transmitted output signal 316. Antenna 352 produces signal 354 which is representative of the received signal. Signal 354 passes to a speech coder recognition device 356 which replaces recognised codewords with packets or sequences of packets corresponding thereto to form signal 358. Signal 358 is then passed to a speech coding device 360 where the packets or sequences of packets are converted into a digital signal 362 which is then converted in DAC 364 to an analogue signal 366 for supply to speaker 368.
The codewords may be generated by a Lempel-Ziv type algorithm or by some other feed-forward compression algorithm. Alternatively, the codewords may be derived from a fixed set. The speech coder recognition device 356 in receiving terminal 350 has copies of codewords and their corresponding packets or sequences of packets and re-inserts the packets or sequences thereof when it receives the appropriate codewords. In recognising a packet or sequence of packets, some form of"near matching"is allowed. This method is language independent.
In another embodiment of the present invention, a word/sound recognition device is operated at the input to the speech coding device in the transmitting terminal. This is shown in Figure 4.
A transmitting terminal 400 and receiving terminal 450 are shown in Figure 4. Transmitting terminal 400 comprises a microphone 402, an ADC 406, and coding device 410 which are identical to microphones 102,202, ADCs 106,206 and coding devices 110, 210 of Figures 1 and 2 respectively.
Microphone 402 provides an analogue signal 404 to ADC 406 which converts it into a digital signal 408. Digital signal 408 is input to coding device 410 and also to a word/sound recognition device 412 as shown.
Respective output signals 414, 416 from the coding device 410 and the word/sound recognition device 412 are passed to a switch 418. When the word/sound recognition device 412 recognises a word or sound, it produces signal 416 which comprises a codeword corresponding to the recognised word or sound. Switch 418 is operated by word/sound recognition device 412 to switch between signals 414 and 416 to form output signal 420 for transmission by transmitting antenna 422. Output signal 420 comprises a coded signal which is interspersed with codewords corresponding to recognised words or sounds.
Receiving terminal 450 comprises an antenna 452 which receives the coded signal and provides signal 254 for coding device 456 and speech generator 458. Coding device 456 is identical to coding devices 156,256 (Figures 1 and 2 respectively) and decodes signal 454 to provide a digital signal 460 corresponding thereto. However, coding device 456 cannot decode any codewords present in signal 454. Speech generator 458 recognises codewords which have been included in signal 454 and provides an output digital signal 462 corresponding thereto. Both digital signals 460, 462 are input to switch 464 which is controlled by the speech generator 458 to intersperse signals corresponding to recognised codewords with the standard coding. Digital signal 466 is fed to a DAC 468 where it is converted into analogue signal 470 for supply to speaker 472, DAC 468 and speaker 472 corresponding to DACs 160,268 and speakers 164,372 in Figures 1 and 2 respectively.
In this embodiment of the present invention, frequently used words or sounds can easily be replaced, for example, the words'and'and'the'and the sounds'sh''th'. This embodiment could be particularly useful for trained operators with a limited vocabulary. In recognising the word or sound, some form of"near matching"is allowed. This method reduces the bit rate when words or sounds are recognised and is of particular interest when the processing requirements of the speech coding device is high.
In all the embodiments described above, a lower bit rate results when words, phrases or sounds are recognised. However, if there is no recognition, nothing is lost as normal bit rates will apply.
Although the embodiments of Figures 2 to 4 illustrate the devices which provide speech coding and decoding as being separate to the ADCs and DACs respectively, it will readily be appreciated that the ADC may form an integral part of the speech coding device, and the DAC may form an integral part of the speech decoding device.
Moreover, bit rates different to those described above may be implemented in the telecommunications system according to the requirements of the particular application.

Claims (7)

  1. CLAIMS: 1. A method for reducing transmission bit rate in a telecommunications system, the method comprising the steps of : a) receiving a signal at a microphone; b) converting the received signal to a coded signal for transmission;
    c) recognising speech; and d) replacing parts of the coded signal with codewords representative of the recognised speech.
  2. 2. A method according to claim 1, wherein steps b) and c) are carried out simultaneously.
  3. 3. A method according to claim 1, wherein step c) is carried out prior to step b).
  4. 4. A method according to claim 1, wherein step b) is carried out prior to step c).
  5. 5. A method substantially as hereinbefore described with reference to Figure 2 of the accompanying drawings.
  6. 6. A method substantially as hereinbefore described with reference to Figure 3 of the accompanying drawings.
  7. 7. A method substantially as hereinbefore described with reference to Figure 4 of the accompanying drawings.
GB9906763A 1999-03-25 1999-03-25 Improvements in or relating to telecommunication systems Expired - Fee Related GB2348342B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB9906763A GB2348342B (en) 1999-03-25 1999-03-25 Improvements in or relating to telecommunication systems
US09/535,989 US6519560B1 (en) 1999-03-25 2000-03-27 Method for reducing transmission bit rate in a telecommunication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB9906763A GB2348342B (en) 1999-03-25 1999-03-25 Improvements in or relating to telecommunication systems

Publications (3)

Publication Number Publication Date
GB9906763D0 GB9906763D0 (en) 1999-05-19
GB2348342A true GB2348342A (en) 2000-09-27
GB2348342B GB2348342B (en) 2004-01-21

Family

ID=10850252

Family Applications (1)

Application Number Title Priority Date Filing Date
GB9906763A Expired - Fee Related GB2348342B (en) 1999-03-25 1999-03-25 Improvements in or relating to telecommunication systems

Country Status (2)

Country Link
US (1) US6519560B1 (en)
GB (1) GB2348342B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9344802B2 (en) * 2005-06-28 2016-05-17 Field System, Inc. Information providing system
CN103828423B (en) * 2011-09-30 2018-04-27 Sca艾普拉控股有限公司 Mobile communication system, infrastructure equipment, base station and method
US11367445B2 (en) * 2020-02-05 2022-06-21 Citrix Systems, Inc. Virtualized speech in a distributed network environment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4661915A (en) * 1981-08-03 1987-04-28 Texas Instruments Incorporated Allophone vocoder
US4707858A (en) * 1983-05-02 1987-11-17 Motorola, Inc. Utilizing word-to-digital conversion
US4799261A (en) * 1983-11-03 1989-01-17 Texas Instruments Incorporated Low data rate speech encoding employing syllable duration patterns
US5006849A (en) * 1989-07-26 1991-04-09 Astro, Inc. Apparatus and method for effecting data compression
US5305421A (en) * 1991-08-28 1994-04-19 Itt Corporation Low bit rate speech coding system and compression
EP0706172A1 (en) * 1994-10-04 1996-04-10 Hughes Aircraft Company Low bit rate speech encoder and decoder

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2021514C (en) * 1989-09-01 1998-12-15 Yair Shoham Constrained-stochastic-excitation coding
CA2110090C (en) * 1992-11-27 1998-09-15 Toshihiro Hayata Voice encoder
JPH0743522A (en) * 1993-05-28 1995-02-14 Nippon Oil Co Ltd Method for forming, substrate having light-shading layer, substrate having light-shading layer, counter electrode substrate for black-and-white display thin film transistor(tft) array substrate, and black-and-white liquid crystal display device
FR2734389B1 (en) * 1995-05-17 1997-07-18 Proust Stephane METHOD FOR ADAPTING THE NOISE MASKING LEVEL IN A SYNTHESIS-ANALYZED SPEECH ENCODER USING A SHORT-TERM PERCEPTUAL WEIGHTING FILTER
US5649051A (en) * 1995-06-01 1997-07-15 Rothweiler; Joseph Harvey Constant data rate speech encoder for limited bandwidth path
US5706392A (en) * 1995-06-01 1998-01-06 Rutgers, The State University Of New Jersey Perceptual speech coder and method
US5668925A (en) * 1995-06-01 1997-09-16 Martin Marietta Corporation Low data rate speech encoder with mixed excitation
US5839098A (en) * 1996-12-19 1998-11-17 Lucent Technologies Inc. Speech coder methods and systems

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4661915A (en) * 1981-08-03 1987-04-28 Texas Instruments Incorporated Allophone vocoder
US4707858A (en) * 1983-05-02 1987-11-17 Motorola, Inc. Utilizing word-to-digital conversion
US4799261A (en) * 1983-11-03 1989-01-17 Texas Instruments Incorporated Low data rate speech encoding employing syllable duration patterns
US5006849A (en) * 1989-07-26 1991-04-09 Astro, Inc. Apparatus and method for effecting data compression
US5305421A (en) * 1991-08-28 1994-04-19 Itt Corporation Low bit rate speech coding system and compression
EP0706172A1 (en) * 1994-10-04 1996-04-10 Hughes Aircraft Company Low bit rate speech encoder and decoder

Also Published As

Publication number Publication date
GB9906763D0 (en) 1999-05-19
GB2348342B (en) 2004-01-21
US6519560B1 (en) 2003-02-11

Similar Documents

Publication Publication Date Title
CN1168332C (en) Method and apparatus for detection and bypass of tandem vocoding
CN1723724B (en) Tandem-free intersystem voice communication
JP4024767B2 (en) Method and apparatus for compressing and transmitting high speed data
Jayant Subsampling of a DPCM Speech Channel to Provide Two “Self‐Contained” Half‐Rate Channels
EP1299879B1 (en) Encoding and decoding of a digital signal
US6272123B1 (en) Variable rate CDMA transmitter-receiver and transmission method
US7009935B2 (en) Transmission over packet switched networks
WO2002099986A1 (en) System and method for reducing data quality degradation due to encoding/decoding
EP0984570B1 (en) Method and apparatus for improving the quality of digital signals transmitted over wireless communication facilities
US6047007A (en) Transmission of data on multirate networks
EP0098630B1 (en) System for the transmission of speech through a disturbed transmission path
US6633840B1 (en) Method and system for transmitting data on a speech channel
US6519560B1 (en) Method for reducing transmission bit rate in a telecommunication system
CA2359756C (en) Methods and apparatus for wireless transmission using multiple description coding
Virupaksha et al. Entropy-coded adaptive differential pulse-code modulation (DPCM) for speech
JP2000004171A (en) Mobile communication method
JP3130834B2 (en) Mobile phone
JP2001127697A (en) Codec and mobile remote communication terminal provided with additional decoder
JPH0525207B2 (en)
Jayant et al. Adaptive aperture coding for speech waveforms—I
JP3102673B2 (en) Voice quality improvement device
US7076260B1 (en) Unbalanced coding for cordless telephony
JP4299436B2 (en) Speech coding system and speech transmission system using Hamming distance
KR0183821B1 (en) Pulse code modulation decode/code device for digital keyphone terminal
KR20010078552A (en) Method for transmitting data from switching system to base station

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee

Effective date: 20080325