GB2229068A - Playing back recorded speech at faster rate with pitch reduction - Google Patents

Playing back recorded speech at faster rate with pitch reduction

Info

Publication number
GB2229068A
Authority
GB
United Kingdom
Prior art keywords
signal
pitch
audio
rate
recorded
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB8904471A
Other versions
GB8904471D0 (en)
Inventor
David Jones
Christopher Hartly
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Open University
Original Assignee
Open University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1989-02-28
Filing date
1989-02-28
Publication date
1990-09-12
Application filed by Open University
Priority to GB8904471A
Publication of GB8904471D0
Publication of GB2229068A
Current legal status: Withdrawn

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04B TRANSMISSION
    • H04B1/00 Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/66 Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
    • H04B1/662 Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using a time/frequency relationship, e.g. time compression or expansion
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04 Time compression or expansion

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)

Abstract

Audio information, in particular speech, is enabled to be played back intelligibly at a faster rate than the rate at which it was recorded by reducing the pitch of the audio information. This involves speeding up the original signal, sampling it, converting it to digital values, storing the digital signal, removing components to achieve a reduction in pitch, and reconstituting an audio signal. This is particularly intended for use by the partially sighted.

Description

AUDIO SYSTEMS SPECIFICATION
This invention relates generally to audio systems, and is more particularly concerned with methods of and apparatus for playing back recorded audio information at a faster rate than the rate at which it was recorded.
The present invention is more particularly concerned with methods of and apparatus for enabling people to listen to recorded speech at a faster rate than the rate at which the speech was originally recorded. The present invention has particular application for people with sight deficiencies who cannot read normally and who have to listen to specially recorded tapes. The problem with listening to recorded words is not that the listener has a slower rate of understanding by ear, but that the speaker has a limited rate at which he or she can talk, hence limiting the speed at which information can be assimilated by the listener.
Broadly in accordance with the present invention there is provided a method of and apparatus for varying the pitch of recorded audio information, especially speech information, and in particular reducing the pitch of such audio information, so that the audio information can be played back at a speed faster than the rate at which the audio information was originally recorded.
It is well known that if recorded speech is played back at a faster rate than the original recording rate, then the words will become distorted and "squeaky". In order to overcome this practical problem the pitch of the recorded audio information is lowered.
Digital systems are known which can effect a frequency or pitch shift, but these are expensive and not appropriate for the application of listening to recorded speech. Such digital systems are particularly used for creating special sound effects in the music industry as part of studio type setups. The method and apparatus of the present invention on the other hand are particularly designed and appropriate for speech recordings and for assisting the listener.
With the present invention, in its preferred form, recorded speech is played back at a faster rate than the rate at which it was recorded. The audio information, such as speech, can be recorded on tape or in digital memory. The playback device may have a fixed or a variable rate of playback speed. The user will play the speech back at any speed at which the user can still comprehend the subject matter. In practice, this can be as much as three times that at which the recording took place, which means that a tape that would normally take an hour to listen to can be understood by the listener in only twenty minutes.
The apparatus of the present invention requires an analogue input, but will work with recordings made either with digital or analogue techniques.
One presently preferred embodiment of a speech pitch reducing apparatus is shown in the accompanying block schematic diagram. This is based upon a microcomputer system design, coupled with analogue interfaces. The heart of the system is a microprocessor, with analogue-to-digital and digital-to-analogue converters for interfacing, and with a memory for storage of the signals. As shown in the drawing, a speeded-up audio signal is sampled at a certain rate and is then converted to digital values, which are stored in a digital store. A selection is made from these stored digital values, i.e. some of the samples are removed to achieve the change in pitch, and an audio signal is then reconstituted. This reconstituted audio signal is then transmitted at the appropriate rate to an output feeding an audio amplifier.
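The selection step can be illustrated with a minimal Python sketch. The specification does not say which samples are removed or what the output rate is, so the block-wise scheme here is an assumption made only for illustration: the stored samples are treated in fixed-length blocks, the leading fraction of each block (equal to the pitch setting) is kept, and the kept samples are replayed at the input sampling rate multiplied by the same setting. The names `select_samples`, `output_rate` and `block_len` are hypothetical.

```python
from fractions import Fraction


def select_samples(stored, pitch_setting, block_len=300):
    """Keep the leading part of each block of stored samples, drop the rest.

    Assumption (not stated in the specification): samples are removed
    block by block, keeping a fraction of each block equal to the
    pitch setting.
    """
    setting = Fraction(pitch_setting)
    if not 0 < setting <= 1:
        raise ValueError("pitch setting must be in the range (0, 1]")
    keep = int(block_len * setting)  # samples retained from each block
    selected = []
    for start in range(0, len(stored), block_len):
        block = stored[start:start + block_len]
        selected.extend(block[:keep])
    return selected


def output_rate(input_rate_hz, pitch_setting):
    """Replay rate that lowers the pitch by the chosen setting."""
    return input_rate_hz * Fraction(pitch_setting)


# Example: 1200 samples of speeded-up audio taken at an assumed 8000 Hz.
stored = list(range(1200))
kept = select_samples(stored, Fraction(1, 2))
rate = output_rate(8000, Fraction(1, 2))
# Duration is unchanged: 1200 / 8000 s in, len(kept) / rate s out.
same_duration = Fraction(len(stored), 8000) == Fraction(len(kept)) / rate
```

Under these assumptions the kept samples occupy the same time as the speeded-up input, so the listening time stays short while every frequency is scaled down by the pitch setting.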
The sound is also filtered to remove the unwanted high frequencies present due to the digital processing. The audio output can be built into the speech pitch reducer, or alternatively one can use the amplifier from the audio source device.
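As an illustration of the filtering step, the following Python sketch applies a one-pole low-pass filter to a list of samples. The specification does not describe the filter (it may well be an analogue circuit after the digital-to-analogue converter), so this digital form, the function name `one_pole_lowpass` and the cutoff and sampling-rate figures are all assumptions.

```python
import math


def one_pole_lowpass(samples, cutoff_hz, sample_rate_hz):
    """Smooth samples with a simple one-pole (RC-style) low-pass filter."""
    dt = 1.0 / sample_rate_hz                # time between samples
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)   # equivalent RC time constant
    alpha = dt / (rc + dt)                   # smoothing coefficient, 0..1
    filtered = []
    y = 0.0
    for x in samples:
        y += alpha * (x - y)                 # move output towards input
        filtered.append(y)
    return filtered


# A steady level passes through, while a signal alternating at half the
# sampling rate (the highest representable frequency) is attenuated.
steady = one_pole_lowpass([1.0] * 2000, 1000, 8000)
alternating = one_pole_lowpass(
    [1.0 if n % 2 == 0 else -1.0 for n in range(2000)], 1000, 8000
)
```

A practical design would probably use a steeper filter; the single pole is chosen here only to keep the sketch short.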
In the presently preferred embodiment of apparatus there are four settings of pitch reduction, 1 (no change), 2/3, 1/2 and 1/3, which correspond respectively to no change, 1.5, 2 and 3 times increase from normal listening speeds. The listening time is reduced in proportion to the pitch setting: at the 2/3 setting, for example, the listening time is 2/3 of the original recording time.
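The relationship between the four settings, the playback speed and the listening time can be checked with a few lines of Python. This is only a worked illustration of the arithmetic; the names `PITCH_SETTINGS`, `speed_factor` and `listening_minutes` are hypothetical and do not appear in the specification.

```python
from fractions import Fraction

# The four pitch-reduction settings of the preferred embodiment.
PITCH_SETTINGS = [Fraction(1), Fraction(2, 3), Fraction(1, 2), Fraction(1, 3)]


def speed_factor(setting):
    """Playback speed relative to the recording speed (reciprocal of the setting)."""
    return 1 / Fraction(setting)


def listening_minutes(recorded_minutes, setting):
    """Listening time: the recorded time multiplied by the pitch setting."""
    return recorded_minutes * Fraction(setting)


for setting in PITCH_SETTINGS:
    print(setting, speed_factor(setting), listening_minutes(60, setting))
```

For a one-hour tape this gives listening times of 60, 40, 30 and 20 minutes, the last matching the threefold speed-up described earlier in the description.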
More settings could be provided within this range, thus providing smaller incremental changes of pitch.
However, increased speed of listening outside this range will be of reduced value due to the natural limit of human comprehension.
Although the method and apparatus of the present invention are particularly suitable for use by the disabled, there is no restriction on who may benefit from use of the method and apparatus. It is appropriate for use by anyone who has a need to listen to recorded material in a shorter period of time than the original recording.
Also, although the present invention is particularly applicable to the recording of and listening to speech, it will also provide a low-cost way of producing variable pitch of voice or music for special sound effects on any audio output device.

Claims (10)

CLAIMS:
1. A method of enabling recorded audio information, particularly speech, to be played back intelligibly at a faster rate than the rate at which it was recorded, which comprises processing an audio signal carrying said information including effecting a reduction in pitch of said audio information.
2. A method according to claim 1, which comprises the steps of speeding-up an audio signal carrying said information, sampling the speeded-up signal and converting the sample to a digital signal, storing the digital signal, selecting from the stored signal to effect a reduction in pitch, and reconstituting an audio signal from the selected information.
3. A method according to claim 2, in which the selection is effected by removing some components of the stored signal.
4. A method according to claim 2 or 3, which includes filtering the selected signal to remove unwanted high frequencies introduced by the digital processing.
5. A method according to claim 1, substantially as hereinbefore described with reference to the accompanying drawing.
6. Apparatus for enabling recorded audio information, particularly speech, to be played back intelligibly at a faster rate than the rate at which it was recorded, comprising means to effect a reduction in the pitch of a signal carrying said audio information.
7. Apparatus according to claim 6, which includes means for speeding up a signal carrying said audio information, means for sampling said speeded-up signal and converting it to a digital signal, storage means for the digital signal, means to effect a selection from the stored signal to effect a reduction in pitch, and means to reconstitute an audio signal from the selected information.
8. Apparatus according to claim 7, which includes filter means to remove unwanted high frequencies introduced by the digital processing.
9. Apparatus according to any of claims 6 to 8, which has means to vary the rate of playback speed.
10. Apparatus according to claim 6, substantially as hereinbefore described with reference to the accompanying drawing.

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB8904471A GB2229068A (en) 1989-02-28 1989-02-28 Playing back recorded speech at faster rate with pitch reduction

Publications (2)

Publication Number Publication Date
GB8904471D0 GB8904471D0 (en) 1989-04-12
GB2229068A true GB2229068A (en) 1990-09-12

Family

ID=10652409

Family Applications (1)

Application Number Title Priority Date Filing Date
GB8904471A Withdrawn GB2229068A (en) 1989-02-28 1989-02-28 Playing back recorded speech at faster rate with pitch reduction

Country Status (1)

Country Link
GB (1) GB2229068A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3816664A (en) * 1971-09-28 1974-06-11 R Koch Signal compression and expansion apparatus with means for preserving or varying pitch
GB1356645A (en) * 1971-12-16 1974-06-12 Standard Telephones Cables Ltd Speech processor
GB1407196A (en) * 1971-11-16 1975-09-24 British Broadcasting Corp Apparatus for changing signal pitch
GB1411859A (en) * 1972-02-15 1975-10-29 Philips Electronic Associated Circuit arrangement for reproducing information at an output thereof at an instantaneous rate which is different from its instantaneous rate at an input thereof
US4406001A (en) * 1980-08-18 1983-09-20 The Variable Speech Control Company ("Vsc") Time compression/expansion with synchronized individual pitch correction of separate components
EP0127892A1 (en) * 1983-06-03 1984-12-12 The Variable Speech Control Company ("VSC") Method and apparatus for pitch period controlled voice signal processing

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2689291A1 (en) * 1992-03-27 1993-10-01 Sorba Antoine Signal processing technique for altering speed of voice signal e.g. for VTR or dictation recorder - separating carrier and modulation signals, and subsequently processing and recombining to form altered output signal
WO1996012353A1 (en) * 1994-10-17 1996-04-25 Ronald Wynand Spies Signal transmission apparatus with time compression
GB2305830A (en) * 1995-09-30 1997-04-16 Ibm Voice processing system and method
GB2305830B (en) * 1995-09-30 1999-09-22 Ibm Voice processing system and method

Also Published As

Publication number Publication date
GB8904471D0 (en) 1989-04-12

Similar Documents

Publication Publication Date Title
JP3151459B2 (en) Public address clarity enhancement system
EP0674467A4 (en) Audio reproducing device
JP3137995B2 (en) PCM digital audio signal playback device
JP3630609B2 (en) Audio information reproducing method and apparatus
JPH06112743A (en) Correction device of acoustic signal distortion by making use of audible frequency band division
KR100202207B1 (en) Recording and reproducing method for tape recorder
GB2229068A (en) Playing back recorded speech at faster rate with pitch reduction
Scott Time adjustment in speech synthesis
JPH06289898A (en) Speech signal processor
JPH0955634A (en) Harmonic addition circuit
JPS63138809A (en) Signal processing circuit
JPH0481279B2 (en)
JPH02279163A (en) Acoustic device
US3838218A (en) Bifrequency controlled analog shift register speech processor
Moftah et al. Language recognition from distorted speech: Comparison of techniques
KR100372576B1 (en) Method of Processing Audio Signal
JPH01267700A (en) Speech processor
JPH05252594A (en) Digital voice processing device
JPH11234788A (en) Audio equipment
JP2762250B2 (en) Recording and playback device
KR920008356Y1 (en) Audio monitoring device for special reproducing of vcr
JPS6260399A (en) Audio signal transmission system
JPS60239129A (en) Method for compressing sound information quantity
JPS62239198A (en) Voice reproduction circuit
JPS60117450A (en) Circuit for separating reproduced signal into left and right channel signals in digital audio reproducing device

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)