JP4523257B2 - 音声データ処理方法、プログラム及び音声信号処理システム - Google Patents
音声データ処理方法、プログラム及び音声信号処理システム Download PDFInfo
- Publication number
- JP4523257B2 JP4523257B2 JP2003345865A JP2003345865A JP4523257B2 JP 4523257 B2 JP4523257 B2 JP 4523257B2 JP 2003345865 A JP2003345865 A JP 2003345865A JP 2003345865 A JP2003345865 A JP 2003345865A JP 4523257 B2 JP4523257 B2 JP 4523257B2
- Authority
- JP
- Japan
- Prior art keywords
- segment length
- energy
- input segment
- data
- threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US10/264,042 US7426470B2 (en) | 2002-10-03 | 2002-10-03 | Energy-based nonuniform time-scale modification of audio signals |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2004126595A JP2004126595A (ja) | 2004-04-22 |
| JP2004126595A5 JP2004126595A5 (enExample) | 2006-11-16 |
| JP4523257B2 true JP4523257B2 (ja) | 2010-08-11 |
Family
ID=32042136
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2003345865A Expired - Fee Related JP4523257B2 (ja) | 2002-10-03 | 2003-10-03 | 音声データ処理方法、プログラム及び音声信号処理システム |
Country Status (2)
| Country | Link |
|---|---|
| US (3) | US7426470B2 (enExample) |
| JP (1) | JP4523257B2 (enExample) |
Families Citing this family (24)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6889383B1 (en) | 2000-10-23 | 2005-05-03 | Clearplay, Inc. | Delivery of navigation data for playback of audio and video content |
| US7975021B2 (en) | 2000-10-23 | 2011-07-05 | Clearplay, Inc. | Method and user interface for downloading audio and video content filters to a media player |
| US7426470B2 (en) * | 2002-10-03 | 2008-09-16 | Ntt Docomo, Inc. | Energy-based nonuniform time-scale modification of audio signals |
| US8086448B1 (en) * | 2003-06-24 | 2011-12-27 | Creative Technology Ltd | Dynamic modification of a high-order perceptual attribute of an audio signal |
| US20050086705A1 (en) * | 2003-08-26 | 2005-04-21 | Jarman Matthew T. | Method and apparatus for controlling play of an audio signal |
| US7596488B2 (en) * | 2003-09-15 | 2009-09-29 | Microsoft Corporation | System and method for real-time jitter control and packet-loss concealment in an audio signal |
| US8117282B2 (en) | 2004-10-20 | 2012-02-14 | Clearplay, Inc. | Media player configured to receive playback filters from alternative storage mediums |
| US20060109983A1 (en) * | 2004-11-19 | 2006-05-25 | Young Randall K | Signal masking and method thereof |
| AU2006236335A1 (en) | 2005-04-18 | 2006-10-26 | Clearplay, Inc. | Apparatus, system and method for associating one or more filter files with a particular multimedia presentation |
| US20070276657A1 (en) * | 2006-04-27 | 2007-11-29 | Technologies Humanware Canada, Inc. | Method for the time scaling of an audio signal |
| US7961851B2 (en) * | 2006-07-26 | 2011-06-14 | Cisco Technology, Inc. | Method and system to select messages using voice commands and a telephone user interface |
| US20080221876A1 (en) * | 2007-03-08 | 2008-09-11 | Universitat Fur Musik Und Darstellende Kunst | Method for processing audio data into a condensed version |
| US8285241B2 (en) * | 2009-07-30 | 2012-10-09 | Broadcom Corporation | Receiver apparatus having filters implemented using frequency translation techniques |
| US8670990B2 (en) * | 2009-08-03 | 2014-03-11 | Broadcom Corporation | Dynamic time scale modification for reduced bit rate audio coding |
| BR112015032174B1 (pt) | 2013-06-21 | 2021-02-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V | escalador de tempo, descodificador de áudio, método e um programa de computador utilizando um controle de qualidade |
| CA2916121C (en) | 2013-06-21 | 2019-01-29 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Jitter buffer control, audio decoder, method and computer program |
| US10629223B2 (en) * | 2017-05-31 | 2020-04-21 | International Business Machines Corporation | Fast playback in media files with reduced impact to speech quality |
| US10878835B1 (en) * | 2018-11-16 | 2020-12-29 | Amazon Technologies, Inc | System for shortening audio playback times |
| US11039177B2 (en) * | 2019-03-19 | 2021-06-15 | Rovi Guides, Inc. | Systems and methods for varied audio segment compression for accelerated playback of media assets |
| US11102523B2 (en) | 2019-03-19 | 2021-08-24 | Rovi Guides, Inc. | Systems and methods for selective audio segment compression for accelerated playback of media assets by service providers |
| US10708633B1 (en) | 2019-03-19 | 2020-07-07 | Rovi Guides, Inc. | Systems and methods for selective audio segment compression for accelerated playback of media assets |
| CN110311424B (zh) * | 2019-05-21 | 2023-01-20 | 沈阳工业大学 | 一种基于双时间尺度净负荷预测的储能调峰控制方法 |
| US11227579B2 (en) * | 2019-08-08 | 2022-01-18 | International Business Machines Corporation | Data augmentation by frame insertion for speech data |
| US20240013792A1 (en) * | 2022-07-08 | 2024-01-11 | Mstream Technologies., Inc. | Audio compression method for improving compression ratio |
Family Cites Families (33)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US671309A (en) * | 1900-07-26 | 1901-04-02 | William J Cunningham | Bottle-stopper. |
| US4052568A (en) * | 1976-04-23 | 1977-10-04 | Communications Satellite Corporation | Digital voice switch |
| US4665548A (en) * | 1983-10-07 | 1987-05-12 | American Telephone And Telegraph Company At&T Bell Laboratories | Speech analysis syllabic segmenter |
| US4998280A (en) * | 1986-12-12 | 1991-03-05 | Hitachi, Ltd. | Speech recognition apparatus capable of discriminating between similar acoustic features of speech |
| EP0427953B1 (en) * | 1989-10-06 | 1996-01-17 | Matsushita Electric Industrial Co., Ltd. | Apparatus and method for speech rate modification |
| US5195138A (en) * | 1990-01-18 | 1993-03-16 | Matsushita Electric Industrial Co., Ltd. | Voice signal processing device |
| US5349645A (en) * | 1991-12-31 | 1994-09-20 | Matsushita Electric Industrial Co., Ltd. | Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches |
| JPH06202692A (ja) * | 1993-01-06 | 1994-07-22 | Nippon Telegr & Teleph Corp <Ntt> | 音声再生速度制御システム |
| DE69428612T2 (de) * | 1993-01-25 | 2002-07-11 | Matsushita Electric Industrial Co., Ltd. | Verfahren und Vorrichtung zur Durchführung einer Zeitskalenmodifikation von Sprachsignalen |
| US5675705A (en) * | 1993-09-27 | 1997-10-07 | Singhal; Tara Chand | Spectrogram-feature-based speech syllable and word recognition using syllabic language dictionary |
| US5717823A (en) * | 1994-04-14 | 1998-02-10 | Lucent Technologies Inc. | Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders |
| US5694521A (en) * | 1995-01-11 | 1997-12-02 | Rockwell International Corporation | Variable speed playback system |
| US5920840A (en) * | 1995-02-28 | 1999-07-06 | Motorola, Inc. | Communication system and method using a speaker dependent time-scaling technique |
| US5828955A (en) * | 1995-08-30 | 1998-10-27 | Rockwell Semiconductor Systems, Inc. | Near direct conversion receiver and method for equalizing amplitude and phase therein |
| WO1997017692A1 (en) * | 1995-11-07 | 1997-05-15 | Euphonics, Incorporated | Parametric signal modeling musical synthesizer |
| US5828994A (en) * | 1996-06-05 | 1998-10-27 | Interval Research Corporation | Non-uniform time scale modification of recorded audio |
| US5893062A (en) * | 1996-12-05 | 1999-04-06 | Interval Research Corporation | Variable rate video playback with synchronized audio |
| JP3619946B2 (ja) * | 1997-03-19 | 2005-02-16 | 富士通株式会社 | 話速変換装置、話速変換方法及び記録媒体 |
| JP3017715B2 (ja) * | 1997-10-31 | 2000-03-13 | 松下電器産業株式会社 | 音声再生装置 |
| US6226608B1 (en) * | 1999-01-28 | 2001-05-01 | Dolby Laboratories Licensing Corporation | Data framing for adaptive-block-length coding system |
| US6625655B2 (en) * | 1999-05-04 | 2003-09-23 | Enounce, Incorporated | Method and apparatus for providing continuous playback or distribution of audio and audio-visual streamed multimedia reveived over networks having non-deterministic delays |
| JP3430968B2 (ja) * | 1999-05-06 | 2003-07-28 | ヤマハ株式会社 | ディジタル信号の時間軸圧伸方法及び装置 |
| GB9911737D0 (en) * | 1999-05-21 | 1999-07-21 | Philips Electronics Nv | Audio signal time scale modification |
| US6377931B1 (en) * | 1999-09-28 | 2002-04-23 | Mindspeed Technologies | Speech manipulation for continuous speech playback over a packet network |
| AU2001242520A1 (en) * | 2000-04-06 | 2001-10-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech rate conversion |
| US6505153B1 (en) * | 2000-05-22 | 2003-01-07 | Compaq Information Technologies Group, L.P. | Efficient method for producing off-line closed captions |
| US6718309B1 (en) * | 2000-07-26 | 2004-04-06 | Ssi Corporation | Continuously variable time scale modification of digital audio signals |
| MXPA03001198A (es) * | 2000-08-09 | 2003-06-30 | Thomson Licensing Sa | Metodo y sistema para habilitar la conversion de velocidad de audio. |
| JP2002258900A (ja) * | 2001-02-28 | 2002-09-11 | Toshiba Corp | 音声再生装置及び音声再生方法 |
| US7171367B2 (en) * | 2001-12-05 | 2007-01-30 | Ssi Corporation | Digital audio with parameters for real-time time scaling |
| US7065485B1 (en) * | 2002-01-09 | 2006-06-20 | At&T Corp | Enhancing speech intelligibility using variable-rate time-scale modification |
| US6844510B2 (en) * | 2002-08-09 | 2005-01-18 | Stonebridge Control Devices, Inc. | Stalk switch |
| US7426470B2 (en) * | 2002-10-03 | 2008-09-16 | Ntt Docomo, Inc. | Energy-based nonuniform time-scale modification of audio signals |
-
2002
- 2002-10-03 US US10/264,042 patent/US7426470B2/en not_active Expired - Fee Related
-
2003
- 2003-10-03 JP JP2003345865A patent/JP4523257B2/ja not_active Expired - Fee Related
-
2008
- 2008-01-09 US US11/971,623 patent/US20080133251A1/en not_active Abandoned
- 2008-01-09 US US11/971,625 patent/US20080133252A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| US20040068412A1 (en) | 2004-04-08 |
| US20080133251A1 (en) | 2008-06-05 |
| US20080133252A1 (en) | 2008-06-05 |
| JP2004126595A (ja) | 2004-04-22 |
| US7426470B2 (en) | 2008-09-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP4523257B2 (ja) | 音声データ処理方法、プログラム及び音声信号処理システム | |
| US6205420B1 (en) | Method and device for instantly changing the speed of a speech | |
| JP3017715B2 (ja) | 音声再生装置 | |
| JP2000137494A (ja) | 音響データ・動画データの同期再構築方法及び装置 | |
| JP2001344905A (ja) | データ再生装置、その方法及び記録媒体 | |
| US7143029B2 (en) | Apparatus and method for changing the playback rate of recorded speech | |
| JP4965371B2 (ja) | 音声再生装置 | |
| JP3249567B2 (ja) | 話速変換方法および装置 | |
| JP3553828B2 (ja) | 音声蓄積再生方法および音声蓄積再生装置 | |
| US6678650B2 (en) | Apparatus and method for converting reproducing speed | |
| JP3803302B2 (ja) | 映像要約装置 | |
| WO2006106466A1 (en) | Method and signal processor for modification of audio signals | |
| JP2009075280A (ja) | コンテンツ再生装置 | |
| JP2965788B2 (ja) | 音声用利得制御装置および音声記録再生装置 | |
| JP3373933B2 (ja) | 話速変換装置 | |
| JP3187242B2 (ja) | 話速変換装置 | |
| JP2001222300A (ja) | 音声再生装置および記録媒体 | |
| JP2867744B2 (ja) | 音声再生装置 | |
| JPH07210192A (ja) | 出力データ制御方法及び装置 | |
| JP7725436B2 (ja) | 調波音・背景音を用いた音声補償プログラム、装置及び方法 | |
| JP3187241B2 (ja) | 話速変換装置 | |
| JPH10224898A (ja) | 補聴器 | |
| JPH1078798A (ja) | 音声信号処理装置 | |
| JP2003271198A (ja) | 圧縮データ処理装置、方法および圧縮データ処理プログラム | |
| JP2861005B2 (ja) | 音声蓄積再生装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A711 | Notification of change in applicant |
Free format text: JAPANESE INTERMEDIATE CODE: A711 Effective date: 20051130 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20061003 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20061003 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20090813 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20090825 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20091023 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20100525 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20100527 |
|
| R150 | Certificate of patent or registration of utility model |
Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130604 Year of fee payment: 3 |
|
| LAPS | Cancellation because of no payment of annual fees |