US5694521A - Variable speed playback system - Google Patents
Variable speed playback system Download PDFInfo
- Publication number
- US5694521A US5694521A US08/371,258 US37125895A US5694521A US 5694521 A US5694521 A US 5694521A US 37125895 A US37125895 A US 37125895A US 5694521 A US5694521 A US 5694521A
- Authority
- US
- United States
- Prior art keywords
- templates
- excitation signal
- lpc
- determining
- ratios
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000005284 excitation Effects 0.000 claims abstract description 61
- 238000007906 compression Methods 0.000 claims abstract description 20
- 230000006835 compression Effects 0.000 claims abstract description 20
- 230000000295 complement effect Effects 0.000 claims description 30
- 238000000034 method Methods 0.000 claims description 8
- 230000015572 biosynthetic process Effects 0.000 claims description 7
- 238000003786 synthesis reaction Methods 0.000 claims description 7
- 230000000737 periodic effect Effects 0.000 claims 6
- 238000001914 filtration Methods 0.000 claims 5
- 230000001419 dependent effect Effects 0.000 claims 2
- 230000003044 adaptive effect Effects 0.000 abstract description 11
- 230000002596 correlated effect Effects 0.000 abstract description 3
- 206010071299 Slow speech Diseases 0.000 abstract description 2
- 238000011524 similarity measure Methods 0.000 abstract description 2
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 238000013144 data compression Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 235000014676 Phragmites communis Nutrition 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 230000006837 decompression Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012886 linear function Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
Definitions
- the present invention relates to a combined speech coding and speech modification system. More particularly, the present invention relates to the manipulation of the periodical structure of speech signals.
- voice compression allows electronic devices to store and playback digital incoming messages and outgoing messages. Enhanced features, such as slow and fast playback are desirable to control and vary the recorded speech playback.
- LPC linear predictive coding
- LPC techniques may be reed for speech coding involving code excited linear prediction (CELP) speech coders.
- CELP code excited linear prediction
- These conventional speech coders generally utilize at least two excitation codebooks.
- the outputs of the codebooks provide the input to the LPC synthesis filter.
- the output of the LPC synthesis filter can then be processed by an additional postfilter to produce decoded speech, or may circumvent the postfilter and be output directly.
- CELP Code Excited Linear Prediction
- speech modification such as fast and slow playback
- speech modification has been achieved using a variety of time domain and frequency domain estimation and modification techniques, where several speech parameters are estimated, e.g., pitch frequency or lag, and the speech signal is accordingly modified.
- speech parameters e.g., pitch frequency or lag
- greater modified speech quality can be obtained by incorporating the speech modification device or scheme into a decoder, rather than external to the decoder.
- template matching instead of pitch estimation, simpler and more robust speech modification is achieved.
- energy-based adaptive windowing provides smoother modified speech.
- the present invention is directed to a variable speed playback system incorporating multiple-period template matching to alter the LPC excitation periodical structure, and thereby increase or decrease the rate of speech playback, while retaining the natural quality of the speech.
- Embodiments of the present invention enable accurate fast or slow speech playback for store and forward applications.
- a multiple-period similarity measure is determined for a decoded LPC excitation signal.
- a multiple-period similarity i.e., a normalized cross-correlation, is determined.
- Expansion or compression of the time domain LPC excitation signal may then be performed according to a rational factor, e.g., 1:2, 2:3, 3:4, 4:3, 3:2, and 2:1.
- the expansion and compression are performed on the LPC excitation signal, such that the periodicity is not obscured by the formant structure.
- fast playback is achieved by combining N templates to M templates (N>M), and slow playback is obtained by expanding N templates to M templates (N ⁇ M).
- a; least two templates of the LPC excitation signal are determined according to a maximal normalized cross-correlation.
- the templates are defined by one or more segments within the LPC excitation signal. Based on the energy ratios of these segments, two complementary windows are constructed. The templates are then multiplied by the windows, overlapped, and summed.
- the resultant excitation signal represents modified excitation signal, which is input into an LPC synthesis filter, to be later output as modified speech.
- FIG. 1 is a block diagram of a decoder incorporating an embodiment of a speech modification and playback system of the present invention.
- FIG. 2 illustrates speech compression and expansion according to the embodiment of FIG. 1.
- FIG. 3 is a flow diagram of an embodiment of the speech modification scheme shown in FIGS. 1 and 2.
- FIG. 4 shows an embodiment of window-overlap-and-add scheme of the present invention.
- an adaptive window-overlap-and-add technique for maximally correlated LPC excitation templates is utilized.
- the preferred template matching scheme results in high quality fast or slow playback of digitally-stored signals, such as speech signals.
- a decoded excitation signal 102 is sequentially processed from the beginning of a stored message to its end by a multiple-period compressor/expander 106.
- the compressor/expander two templates X ML and y ML are identified within the excitation signal 102 (step 200 in FIG. 2).
- the templates are formed of M segments. Accordingly, fast or slow playback is achieved by compressing or expanding, respectively, the excitation signal 302 in rational ratios of values N-to-M, e.g., 2-to-1, 3-to-2, 2-to-3, where M represents the resultant number of segments.
- Tstart indicates a dividing marker between the past, previously-processed portion of an excitation signal 302 (indicated as 102 in FIG. 1) and the remaining unprocessed portion.
- Tstart marks the beginning of the X ML template.
- properly aligned templates X ML and y ML of the excitation signal 302 are correlated (step 202 in FIG. 2) for each possible integer value L between a minimum number Lmin to a maximum Lmax.
- the normalized correlation is given by: ##EQU1##
- a maximum C ML can then be determined for a particular value of L, indicated as L * (step 202 in FIG. 2).
- L * represents the periodical structure of the excitation signal, and in most cases coincides with the pitch period. It will be recognized, however, that the normalized correlation is not confined to the usual frame structure used in LPC/CELP coding, and L * is not necessarily limited to the pitch period.
- two complementary adaptive windows of the size ML * are determined (step 204), W x ML* for x ML* and W 6 ML* for y ML ⁇ .
- the sum of the two windows equals 1 at every point.
- the adaptation is performed according to the energy ratio of each L * segment of x ML* and y ML* .
- the templates x ML* and y ML* are multiplied by the complementary adaptive windows of length ML * , overlapped, and then summed to yield the modified (fast or slow) excitation signal.
- Step 206 The indicator Tstart is then moved to the right of Y ML* (step 208), and points to the next part of the unprocessed excitation signal to be modified.
- the excitation signal can then be filtered by the LPC synthesis filter 104 (FIG. 1) to produce the decoded output speech 108.
- the general formulation of the adaptive windows is given.
- the windows are adapted according to the ratios of the energies between x ML* and y ML* on each L * segment.
- a window structure variable t can be defined as: ##EQU6## Accordingly, the windows are determined as: ##EQU7##
- Template x L 312 is defined by the L samples starting from Tstart, and y L is defined by the next segment of L samples.
- x L* is multiplied by W x L* (402) and y L* is multiplied by W Y L* (404).
- the resulting signals are then overlapped (406) and summed (408), yielding the compressed excitation signal (410).
- Tstart can then be shifted to the end of y L* (point 304 in FIG. 3(a)).
- the next template matching and combining loop can then be performed.
- data compression at a 3-to-2 ratio is achieved by combining templates x 2L 320 and y 2L 322 into one template of length 2L.
- Template x 2L 320 is defined by a segment of 2 L samples starting at Tstart
- y 2L is defined by 2L samples starting L samples subsequent to Tstart (i.e., to the right of Tstart in the figure).
- the normalized correlation C 2L is calculated for each L in the range Lmin to Lmax.
- x 2L* is multiplied by W x 2L* (402) and y 2L* is multiplied by W y 2L* (404).
- the resultant signals are overlapped (406) and summed (408) to yield a 3-to-2 compressed excitation signal (410).
- the trailing end of the first segment x 2L 320 is overlapped by the leading end of the next segment y 2L 322, each having lengths of 2 L * samples, such that the overlapped amount is L samples long.
- Tstart can be moved to the end of y 2L* for the next template matching and combining loop.
- data expansion at a 2-to-3 ratio is achieved by combining templates x 3L 330 and y 3L 332 into one template of length 3 L.
- the template x 3L 330 is defined by 3 L samples starting from Tstart, and yes is defined by 3 L samples beginning at point 334, L samples before Tstart, representing previous excitation signals in time (i.e., to the left of Tstart).
- the normalized correlation C 3L is calculated for each L in the range Lmin to Lmax.
- x 3L* is multiplied by W x 3L* (402) and y 3L* is multiplied by W y 3L* (404).
- the resultant signals are then overlapped (406) and summed (408), yielding the expanded excitation signal (410).
- 2-to-3 expansion is achieved by overlapping in a reverse fashion. That is, the leading end of the x ML template is overlapped with the trig end of the y ML template such that the two segments, each of 3 L * samples, are overlapped by 2 L * samples, and combined into one segment of 3 L * samples. Tstart is then moved to the right end of y 3L* , ready for the next template matching and combining loop.
- the excitation signal is expanded by selecting the particular placement of the y ML segment, and shifting the start point Tstart.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuit For Audible Band Transducer (AREA)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/371,258 US5694521A (en) | 1995-01-11 | 1995-01-11 | Variable speed playback system |
JP7320765A JPH08251030A (ja) | 1995-01-11 | 1995-12-08 | 高速および低速再生能力を与えるためのシステム、記憶および検索システム、ならびに高速および低速再生能力を与えるための方法 |
DE69521405T DE69521405T2 (de) | 1995-01-11 | 1995-12-21 | System zum Abspielen mit veränderbarer Geschwindigkeit |
EP95120294A EP0726560B1 (en) | 1995-01-11 | 1995-12-21 | Variable speed playback system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/371,258 US5694521A (en) | 1995-01-11 | 1995-01-11 | Variable speed playback system |
Publications (1)
Publication Number | Publication Date |
---|---|
US5694521A true US5694521A (en) | 1997-12-02 |
Family
ID=23463194
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/371,258 Expired - Lifetime US5694521A (en) | 1995-01-11 | 1995-01-11 | Variable speed playback system |
Country Status (4)
Country | Link |
---|---|
US (1) | US5694521A (ja) |
EP (1) | EP0726560B1 (ja) |
JP (1) | JPH08251030A (ja) |
DE (1) | DE69521405T2 (ja) |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000022611A1 (en) * | 1998-10-09 | 2000-04-20 | Hejna Donald J Jr | Method and apparatus to prepare listener-interest-filtered works |
US6266643B1 (en) * | 1999-03-03 | 2001-07-24 | Kenneth Canfield | Speeding up audio without changing pitch by comparing dominant frequencies |
US6377931B1 (en) | 1999-09-28 | 2002-04-23 | Mindspeed Technologies | Speech manipulation for continuous speech playback over a packet network |
US6625656B2 (en) * | 1999-05-04 | 2003-09-23 | Enounce, Incorporated | Method and apparatus for continuous playback or distribution of information including audio-visual streamed multimedia |
US20030212559A1 (en) * | 2002-05-09 | 2003-11-13 | Jianlei Xie | Text-to-speech (TTS) for hand-held devices |
US6718309B1 (en) * | 2000-07-26 | 2004-04-06 | Ssi Corporation | Continuously variable time scale modification of digital audio signals |
US20040267540A1 (en) * | 2003-06-27 | 2004-12-30 | Motorola, Inc. | Synchronization and overlap method and system for single buffer speech compression and expansion |
US20040267524A1 (en) * | 2003-06-27 | 2004-12-30 | Motorola, Inc. | Psychoacoustic method and system to impose a preferred talking rate through auditory feedback rate adjustment |
US6873954B1 (en) * | 1999-09-09 | 2005-03-29 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus in a telecommunications system |
US20060075347A1 (en) * | 2004-10-05 | 2006-04-06 | Rehm Peter H | Computerized notetaking system and method |
US20060149532A1 (en) * | 2004-12-31 | 2006-07-06 | Boillot Marc A | Method and apparatus for enhancing loudness of a speech signal |
US20060190809A1 (en) * | 1998-10-09 | 2006-08-24 | Enounce, Inc. A California Corporation | Method and apparatus to determine and use audience affinity and aptitude |
EP1750397A1 (en) * | 2004-05-26 | 2007-02-07 | Nippon Telegraph and Telephone Corporation | Sound packet reproducing method, sound packet reproducing apparatus, sound packet reproducing program, and recording medium |
US7302396B1 (en) | 1999-04-27 | 2007-11-27 | Realnetworks, Inc. | System and method for cross-fading between audio streams |
US20080133251A1 (en) * | 2002-10-03 | 2008-06-05 | Chu Wai C | Energy-based nonuniform time-scale modification of audio signals |
US7426221B1 (en) | 2003-02-04 | 2008-09-16 | Cisco Technology, Inc. | Pitch invariant synchronization of audio playout rates |
CN100464578C (zh) * | 2004-05-13 | 2009-02-25 | 美国博通公司 | 用于以高质量的可变速度播放视听媒体的系统和方法 |
US20110224990A1 (en) * | 2007-08-22 | 2011-09-15 | Satoshi Hosokawa | Speaker Speed Conversion System, Method for Same, and Speed Conversion Device |
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5717823A (en) * | 1994-04-14 | 1998-02-10 | Lucent Technologies Inc. | Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders |
DE19710545C1 (de) * | 1997-03-14 | 1997-12-04 | Grundig Ag | Effizientes Verfahren zur Geschwindigkeitsmodifikation von Sprachsignalen |
JP4096915B2 (ja) * | 2004-06-01 | 2008-06-04 | 株式会社日立製作所 | デジタル情報再生装置及び方法 |
JP4940888B2 (ja) * | 2006-10-23 | 2012-05-30 | ソニー株式会社 | オーディオ信号伸張圧縮装置及び方法 |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4022974A (en) * | 1976-06-03 | 1977-05-10 | Bell Telephone Laboratories, Incorporated | Adaptive linear prediction speech synthesizer |
US4631746A (en) * | 1983-02-14 | 1986-12-23 | Wang Laboratories, Inc. | Compression and expansion of digitized voice signals |
US4852168A (en) * | 1986-11-18 | 1989-07-25 | Sprague Richard P | Compression of stored waveforms for artificial speech |
US4864620A (en) * | 1987-12-21 | 1989-09-05 | The Dsp Group, Inc. | Method for performing time-scale modification of speech information or speech signals |
US4890325A (en) * | 1987-02-20 | 1989-12-26 | Fujitsu Limited | Speech coding transmission equipment |
US4935963A (en) * | 1986-01-24 | 1990-06-19 | Racal Data Communications Inc. | Method and apparatus for processing speech signals |
US4991213A (en) * | 1988-05-26 | 1991-02-05 | Pacific Communication Sciences, Inc. | Speech specific adaptive transform coder |
US5175769A (en) * | 1991-07-23 | 1992-12-29 | Rolm Systems | Method for time-scale modification of signals |
US5327498A (en) * | 1988-09-02 | 1994-07-05 | Ministry Of Posts, Tele-French State Communications & Space | Processing device for speech synthesis by addition overlapping of wave forms |
US5341432A (en) * | 1989-10-06 | 1994-08-23 | Matsushita Electric Industrial Co., Ltd. | Apparatus and method for performing speech rate modification and improved fidelity |
US5386493A (en) * | 1992-09-25 | 1995-01-31 | Apple Computer, Inc. | Apparatus and method for playing back audio at faster or slower rates without pitch distortion |
US5479564A (en) * | 1991-08-09 | 1995-12-26 | U.S. Philips Corporation | Method and apparatus for manipulating pitch and/or duration of a signal |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2692070B1 (fr) * | 1992-06-05 | 1996-10-25 | Thomson Csf | Procede et dispositif de synthese vocale a vitesse variable. |
US5717823A (en) * | 1994-04-14 | 1998-02-10 | Lucent Technologies Inc. | Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders |
-
1995
- 1995-01-11 US US08/371,258 patent/US5694521A/en not_active Expired - Lifetime
- 1995-12-08 JP JP7320765A patent/JPH08251030A/ja not_active Withdrawn
- 1995-12-21 EP EP95120294A patent/EP0726560B1/en not_active Expired - Lifetime
- 1995-12-21 DE DE69521405T patent/DE69521405T2/de not_active Expired - Lifetime
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4022974A (en) * | 1976-06-03 | 1977-05-10 | Bell Telephone Laboratories, Incorporated | Adaptive linear prediction speech synthesizer |
US4631746A (en) * | 1983-02-14 | 1986-12-23 | Wang Laboratories, Inc. | Compression and expansion of digitized voice signals |
US4935963A (en) * | 1986-01-24 | 1990-06-19 | Racal Data Communications Inc. | Method and apparatus for processing speech signals |
US4852168A (en) * | 1986-11-18 | 1989-07-25 | Sprague Richard P | Compression of stored waveforms for artificial speech |
US4890325A (en) * | 1987-02-20 | 1989-12-26 | Fujitsu Limited | Speech coding transmission equipment |
US4864620A (en) * | 1987-12-21 | 1989-09-05 | The Dsp Group, Inc. | Method for performing time-scale modification of speech information or speech signals |
US4991213A (en) * | 1988-05-26 | 1991-02-05 | Pacific Communication Sciences, Inc. | Speech specific adaptive transform coder |
US5327498A (en) * | 1988-09-02 | 1994-07-05 | Ministry Of Posts, Tele-French State Communications & Space | Processing device for speech synthesis by addition overlapping of wave forms |
US5341432A (en) * | 1989-10-06 | 1994-08-23 | Matsushita Electric Industrial Co., Ltd. | Apparatus and method for performing speech rate modification and improved fidelity |
US5175769A (en) * | 1991-07-23 | 1992-12-29 | Rolm Systems | Method for time-scale modification of signals |
US5479564A (en) * | 1991-08-09 | 1995-12-26 | U.S. Philips Corporation | Method and apparatus for manipulating pitch and/or duration of a signal |
US5386493A (en) * | 1992-09-25 | 1995-01-31 | Apple Computer, Inc. | Apparatus and method for playing back audio at faster or slower rates without pitch distortion |
Non-Patent Citations (18)
Title |
---|
"Full-Rate Speech Codec Compatibility Standard PN-2972", TR45 Electronic Industries Association, 1990, pp. 1-64. |
"Methode de Modification de l'Echelle Temps of d' Enregistrements Audio, pour la Reecoute a Vitesse Variabel en Temps Reel," IEEE, 1993 Canadian Conference on Electrical and Computer Engineering, pp. 277-280, Sep. 1993. |
David Malah, Ronald E. Crochiere and Richard V. Cox, "Performance of Transform and Subband Coding Systems Combined with Harmonic Scaling of Speech", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-29, No. 2, Apr. 1981, pp. 273-283. |
David Malah, Ronald E. Crochiere and Richard V. Cox, Performance of Transform and Subband Coding Systems Combined with Harmonic Scaling of Speech , IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP 29, No. 2, Apr. 1981, pp. 273 283. * |
Full Rate Speech Codec Compatibility Standard PN 2972 , TR45 Electronic Industries Association, 1990, pp. 1 64. * |
Jianping, "Effective Time-Domain Method for Speech Rate-Change," IEEE Trans. on Consumer Electronics, pp. 339-346, May 1988. |
Jianping, Effective Time Domain Method for Speech Rate Change, IEEE Trans. on Consumer Electronics, pp. 339 346, May 1988. * |
Methode de Modification de l Echelle Temps of d Enregistrements Audio, pour la Reecoute a Vitesse Variabel en Temps Reel, IEEE, 1993 Canadian Conference on Electrical and Computer Engineering, pp. 277 280, Sep. 1993. * |
National Communications System Office of Technology & Standards, "Telecommunications: Analog to Digital Conversion of Radio Voice by 4.800 Bit/Second Code Excited Linear Prediction (CELP)", Federal Standard 1016, Feb. 14, 1991, pp. 1-12. |
National Communications System Office of Technology & Standards, Telecommunications: Analog to Digital Conversion of Radio Voice by 4.800 Bit/Second Code Excited Linear Prediction (CELP) , Federal Standard 1016, Feb. 14, 1991, pp. 1 12. * |
National Communications System, "Technical Information Bulletin 92-1 Details to Assist in Implementation of Federal Standard 1016 CELP", Jan. 1992, pp. 1-35. |
National Communications System, Technical Information Bulletin 92 1 Details to Assist in Implementation of Federal Standard 1016 CELP , Jan. 1992, pp. 1 35. * |
Roucos et al., "High Quality Time-Scale Modification for Speech," Proc. ICASSP '86, pp. 493-496, 1986. |
Roucos et al., High Quality Time Scale Modification for Speech, Proc. ICASSP 86, pp. 493 496, 1986. * |
Sadaoki Furui and Mohan Sondhi, "Advances in Speech Signal Processing", Marcel Dekker, Inc. |
Sadaoki Furui and Mohan Sondhi, Advances in Speech Signal Processing , Marcel Dekker, Inc. * |
Wayman et al., "Some Improvements on the Synchronized-Overlap-Add Method of Time Scale Modification for Use in Real-Time Speech Compression and Noise Filtering," IEEE Transactions on ASSP, pp. 139-140, Jan. 1988. |
Wayman et al., Some Improvements on the Synchronized Overlap Add Method of Time Scale Modification for Use in Real Time Speech Compression and Noise Filtering, IEEE Transactions on ASSP, pp. 139 140, Jan. 1988. * |
Cited By (44)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8452589B2 (en) | 1998-10-09 | 2013-05-28 | Enounce Incorporated | Method and apparatus to prepare listener-interest-filtered works |
US10614829B2 (en) | 1998-10-09 | 2020-04-07 | Virentem Ventures, Llc | Method and apparatus to determine and use audience affinity and aptitude |
US20060190809A1 (en) * | 1998-10-09 | 2006-08-24 | Enounce, Inc. A California Corporation | Method and apparatus to determine and use audience affinity and aptitude |
US6374225B1 (en) * | 1998-10-09 | 2002-04-16 | Enounce, Incorporated | Method and apparatus to prepare listener-interest-filtered works |
US6801888B2 (en) | 1998-10-09 | 2004-10-05 | Enounce Incorporated | Method and apparatus to prepare listener-interest-filtered works |
US9343080B2 (en) | 1998-10-09 | 2016-05-17 | Virentem Ventures, Llc | Method and apparatus to prepare listener-interest-filtered works |
US9185380B2 (en) | 1998-10-09 | 2015-11-10 | Virentem Ventures, Llc | Method and apparatus to determine and use audience affinity and aptitude |
US20110153319A1 (en) * | 1998-10-09 | 2011-06-23 | Enounce Incorporated | Method and Apparatus to Prepare Listener-Interest-Filtered Works |
WO2000022611A1 (en) * | 1998-10-09 | 2000-04-20 | Hejna Donald J Jr | Method and apparatus to prepare listener-interest-filtered works |
WO2001020596A1 (en) * | 1998-10-09 | 2001-03-22 | Enounce, Incorporated | Method and apparatus to determine and use audience affinity and aptitude |
US8478599B2 (en) | 1998-10-09 | 2013-07-02 | Enounce, Inc. | Method and apparatus to determine and use audience affinity and aptitude |
US7899668B2 (en) | 1998-10-09 | 2011-03-01 | Enounce Incorporated | Method and apparatus to prepare listener-interest-filtered works |
US20090306966A1 (en) * | 1998-10-09 | 2009-12-10 | Enounce, Inc. | Method and apparatus to determine and use audience affinity and aptitude |
US7536300B2 (en) | 1998-10-09 | 2009-05-19 | Enounce, Inc. | Method and apparatus to determine and use audience affinity and aptitude |
US20080140414A1 (en) * | 1998-10-09 | 2008-06-12 | Enounce Incorporated | Method and apparatus to prepare listener-interest-filtered works |
US7043433B2 (en) | 1998-10-09 | 2006-05-09 | Enounce, Inc. | Method and apparatus to determine and use audience affinity and aptitude |
US6266643B1 (en) * | 1999-03-03 | 2001-07-24 | Kenneth Canfield | Speeding up audio without changing pitch by comparing dominant frequencies |
US7302396B1 (en) | 1999-04-27 | 2007-11-27 | Realnetworks, Inc. | System and method for cross-fading between audio streams |
US6625656B2 (en) * | 1999-05-04 | 2003-09-23 | Enounce, Incorporated | Method and apparatus for continuous playback or distribution of information including audio-visual streamed multimedia |
US20040064576A1 (en) * | 1999-05-04 | 2004-04-01 | Enounce Incorporated | Method and apparatus for continuous playback of media |
US6873954B1 (en) * | 1999-09-09 | 2005-03-29 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus in a telecommunications system |
US6377931B1 (en) | 1999-09-28 | 2002-04-23 | Mindspeed Technologies | Speech manipulation for continuous speech playback over a packet network |
US6718309B1 (en) * | 2000-07-26 | 2004-04-06 | Ssi Corporation | Continuously variable time scale modification of digital audio signals |
US7299182B2 (en) * | 2002-05-09 | 2007-11-20 | Thomson Licensing | Text-to-speech (TTS) for hand-held devices |
US20030212559A1 (en) * | 2002-05-09 | 2003-11-13 | Jianlei Xie | Text-to-speech (TTS) for hand-held devices |
US20080133251A1 (en) * | 2002-10-03 | 2008-06-05 | Chu Wai C | Energy-based nonuniform time-scale modification of audio signals |
US20080133252A1 (en) * | 2002-10-03 | 2008-06-05 | Chu Wai C | Energy-based nonuniform time-scale modification of audio signals |
US7426221B1 (en) | 2003-02-04 | 2008-09-16 | Cisco Technology, Inc. | Pitch invariant synchronization of audio playout rates |
US8340972B2 (en) * | 2003-06-27 | 2012-12-25 | Motorola Mobility Llc | Psychoacoustic method and system to impose a preferred talking rate through auditory feedback rate adjustment |
US20040267524A1 (en) * | 2003-06-27 | 2004-12-30 | Motorola, Inc. | Psychoacoustic method and system to impose a preferred talking rate through auditory feedback rate adjustment |
US20040267540A1 (en) * | 2003-06-27 | 2004-12-30 | Motorola, Inc. | Synchronization and overlap method and system for single buffer speech compression and expansion |
US6999922B2 (en) | 2003-06-27 | 2006-02-14 | Motorola, Inc. | Synchronization and overlap method and system for single buffer speech compression and expansion |
CN100464578C (zh) * | 2004-05-13 | 2009-02-25 | 美国博通公司 | 用于以高质量的可变速度播放视听媒体的系统和方法 |
US7710982B2 (en) | 2004-05-26 | 2010-05-04 | Nippon Telegraph And Telephone Corporation | Sound packet reproducing method, sound packet reproducing apparatus, sound packet reproducing program, and recording medium |
US20070177620A1 (en) * | 2004-05-26 | 2007-08-02 | Nippon Telegraph And Telephone Corporation | Sound packet reproducing method, sound packet reproducing apparatus, sound packet reproducing program, and recording medium |
EP1750397A1 (en) * | 2004-05-26 | 2007-02-07 | Nippon Telegraph and Telephone Corporation | Sound packet reproducing method, sound packet reproducing apparatus, sound packet reproducing program, and recording medium |
EP1750397A4 (en) * | 2004-05-26 | 2007-10-31 | Nippon Telegraph & Telephone | SOUND PACKET PLAY PROCESS, SOUND PACKET PLAYER, SOUNDPACK PLAYBACK PROGRAM AND RECORDING MEDIUM |
US20060075347A1 (en) * | 2004-10-05 | 2006-04-06 | Rehm Peter H | Computerized notetaking system and method |
US7676362B2 (en) | 2004-12-31 | 2010-03-09 | Motorola, Inc. | Method and apparatus for enhancing loudness of a speech signal |
US20060149532A1 (en) * | 2004-12-31 | 2006-07-06 | Boillot Marc A | Method and apparatus for enhancing loudness of a speech signal |
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
US8364477B2 (en) | 2005-05-25 | 2013-01-29 | Motorola Mobility Llc | Method and apparatus for increasing speech intelligibility in noisy environments |
US8392197B2 (en) * | 2007-08-22 | 2013-03-05 | Nec Corporation | Speaker speed conversion system, method for same, and speed conversion device |
US20110224990A1 (en) * | 2007-08-22 | 2011-09-15 | Satoshi Hosokawa | Speaker Speed Conversion System, Method for Same, and Speed Conversion Device |
Also Published As
Publication number | Publication date |
---|---|
EP0726560A2 (en) | 1996-08-14 |
DE69521405D1 (de) | 2001-07-26 |
JPH08251030A (ja) | 1996-09-27 |
EP0726560B1 (en) | 2001-06-20 |
DE69521405T2 (de) | 2002-05-02 |
EP0726560A3 (en) | 1998-01-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5694521A (en) | Variable speed playback system | |
US5752223A (en) | Code-excited linear predictive coder and decoder with conversion filter for converting stochastic and impulsive excitation signals | |
US7412379B2 (en) | Time-scale modification of signals | |
US4625286A (en) | Time encoding of LPC roots | |
US4220819A (en) | Residual excited predictive speech coding system | |
EP1202251B1 (en) | Transcoder for prevention of tandem coding of speech | |
KR100615113B1 (ko) | 주기적 음성 코딩 | |
CN100369112C (zh) | 可变速率语音编码 | |
EP1243090B1 (en) | Method and arrangement in a communication system | |
US4821324A (en) | Low bit-rate pattern encoding and decoding capable of reducing an information transmission rate | |
CA2430111C (en) | Speech parameter coding and decoding methods, coder and decoder, and programs, and speech coding and decoding methods, coder and decoder, and programs | |
US5682502A (en) | Syllable-beat-point synchronized rule-based speech synthesis from coded utterance-speed-independent phoneme combination parameters | |
EP1793370A2 (en) | Apparatus and method for creating pitch wave signals and apparatus and method for compressing, expanding and synthesizing speech signals using these pitch wave signals | |
JPH0683400A (ja) | 音声メッセージ処理方法 | |
US4724535A (en) | Low bit-rate pattern coding with recursive orthogonal decision of parameters | |
WO2003010752A1 (en) | Speech bandwidth extension apparatus and speech bandwidth extension method | |
JP2707564B2 (ja) | 音声符号化方式 | |
US4945565A (en) | Low bit-rate pattern encoding and decoding with a reduced number of excitation pulses | |
JPS63142399A (ja) | 音声分析合成方法及び装置 | |
JP3070955B2 (ja) | 音声符号器において使用するためのスペクトルノイズ重み付けフィルタを発生する方法 | |
JP3891309B2 (ja) | 音声再生速度変換装置 | |
US5202953A (en) | Multi-pulse type coding system with correlation calculation by backward-filtering operation for multi-pulse searching | |
US20020143541A1 (en) | Voice rule-synthesizer and compressed voice-element data generator for the same | |
JPH0738116B2 (ja) | マルチパルス符号化装置 | |
JP3515216B2 (ja) | 音声符号化装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: ROCKWELL INTERNATIONAL CORPORATION, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SHLOMOT, EYAL;HSUEH, ALBERT A.;REEL/FRAME:007463/0696;SIGNING DATES FROM 19950104 TO 19950404 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: CREDIT SUISSE FIRST BOSTON, NEW YORK Free format text: SECURITY INTEREST;ASSIGNORS:CONEXANT SYSTEMS, INC.;BROOKTREE CORPORATION;BROOKTREE WORLDWIDE SALES CORPORATION;AND OTHERS;REEL/FRAME:009719/0537 Effective date: 19981221 |
|
AS | Assignment |
Owner name: CONEXANT SYSTEMS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ROCKWELL SCIENCE CENTER, LLC;REEL/FRAME:010415/0761 Effective date: 19981210 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
REMI | Maintenance fee reminder mailed | ||
FPAY | Fee payment |
Year of fee payment: 4 |
|
SULP | Surcharge for late payment | ||
AS | Assignment |
Owner name: CONEXANT SYSTEMS, INC., CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 Owner name: BROOKTREE CORPORATION, CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 Owner name: BROOKTREE WORLDWIDE SALES CORPORATION, CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 Owner name: CONEXANT SYSTEMS WORLDWIDE, INC., CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: MINDSPEED TECHNOLOGIES, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:CONEXANT SYSTEMS, INC.;REEL/FRAME:014468/0137 Effective date: 20030627 |
|
AS | Assignment |
Owner name: CONEXANT SYSTEMS, INC., CALIFORNIA Free format text: SECURITY AGREEMENT;ASSIGNOR:MINDSPEED TECHNOLOGIES, INC.;REEL/FRAME:014546/0305 Effective date: 20030930 |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
AS | Assignment |
Owner name: SKYWORKS SOLUTIONS, INC., MASSACHUSETTS Free format text: EXCLUSIVE LICENSE;ASSIGNOR:CONEXANT SYSTEMS, INC.;REEL/FRAME:019649/0544 Effective date: 20030108 Owner name: SKYWORKS SOLUTIONS, INC.,MASSACHUSETTS Free format text: EXCLUSIVE LICENSE;ASSIGNOR:CONEXANT SYSTEMS, INC.;REEL/FRAME:019649/0544 Effective date: 20030108 |
|
AS | Assignment |
Owner name: ROCKWELL SCIENCE CENTER, LLC, CALIFORNIA Free format text: MERGER;ASSIGNOR:ROCKWELL SCIENCE CENTER, INC.;REEL/FRAME:019767/0211 Effective date: 19970827 Owner name: ROCKWELL SCIENCE CENTER, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ROCKWELL INTERNATIONAL CORPORATION;REEL/FRAME:019767/0161 Effective date: 19961115 |
|
AS | Assignment |
Owner name: WIAV SOLUTIONS LLC, VIRGINIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SKYWORKS SOLUTIONS INC.;REEL/FRAME:019899/0305 Effective date: 20070926 |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
REMI | Maintenance fee reminder mailed | ||
FPAY | Fee payment |
Year of fee payment: 12 |
|
SULP | Surcharge for late payment |
Year of fee payment: 11 |
|
AS | Assignment |
Owner name: MINDSPEED TECHNOLOGIES, INC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:WIAV SOLUTIONS LLC;REEL/FRAME:025599/0472 Effective date: 20100928 |
|
AS | Assignment |
Owner name: MINDSPEED TECHNOLOGIES, INC., CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CONEXANT SYSTEMS, INC;REEL/FRAME:029286/0458 Effective date: 20041208 |
|
AS | Assignment |
Owner name: O'HEARN AUDIO LLC, DELAWARE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MINDSPEED TECHNOLOGIES, INC.;REEL/FRAME:029343/0322 Effective date: 20121030 |
|
FEPP | Fee payment procedure |
Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
AS | Assignment |
Owner name: MINDSPEED TECHNOLOGIES, INC., CALIFORNIA Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE GRANT LANGUAGE WITHIN THE ASSIGNMENT DOCUMENT PREVIOUSLY RECORDED ON REEL 014468 FRAME 0137. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT DOCUMENT;ASSIGNOR:CONEXANT SYSTEMS, INC.;REEL/FRAME:029405/0728 Effective date: 20030627 |