JP2004126595A5 - - Google Patents

Download PDF

Info

Publication number
JP2004126595A5
JP2004126595A5 JP2003345865A JP2003345865A JP2004126595A5 JP 2004126595 A5 JP2004126595 A5 JP 2004126595A5 JP 2003345865 A JP2003345865 A JP 2003345865A JP 2003345865 A JP2003345865 A JP 2003345865A JP 2004126595 A5 JP2004126595 A5 JP 2004126595A5
Authority
JP
Japan
Prior art keywords
energy
input
segment length
data
audio signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2003345865A
Other languages
English (en)
Other versions
JP2004126595A (ja
JP4523257B2 (ja
Filing date
Publication date
Priority claimed from US10/264,042 external-priority patent/US7426470B2/en
Application filed filed Critical
Publication of JP2004126595A publication Critical patent/JP2004126595A/ja
Publication of JP2004126595A5 publication Critical patent/JP2004126595A5/ja
Application granted granted Critical
Publication of JP4523257B2 publication Critical patent/JP4523257B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Claims (4)

  1. 入力音声信号に対応するデータを受信するステップと、
    該データを複数のセグメントに分割するステップと、
    所定セグメントのエネルギーに基づいて該データに対する入力セグメント長を変化させることにより、該入力音声信号と出力される圧縮音声信号との時間スケール比を補正するステップと、
    該出力された圧縮音声信号を提供するステップと
    を有する音声データ処理方法。
  2. 入力音声信号に対応する音声データのフレームを受信するステップと、
    前記音声データを複数のセグメントに分割するステップと、
    前記フレームのエネルギーに関連する値であるエネルギー関連値を算出するステップと、
    前記フレームの予測ピークエネルギーを決定するステップと、
    該予測ピークエネルギーに基づいて、前記フレームのエネルギー閾値を決定するステップと、
    該エネルギー関連値と該エネルギー閾値とを比較することにより、前記音声データの時間スケール圧縮を制御する比較ステップと
    前記比較ステップにて得られた比較結果に基づいて、前記フレームに対する入力セグメント長を決定するステップと
    を有する音声データ処理方法。
  3. コンピュータ装置を、
    入力音声データを受信する手段と、
    該入力音声データに対応するエネルギーを決定する手段と、
    該エネルギーまたは参照セグメント長に対する残余セグメント長の累積のうち少なくともいずれか一に基づいて、該入力音声データの入力セグメント長を変化させる手段と
    して機能させるためのプログラム。
  4. 受信した入力音声信号のエネルギーを決定し、該エネルギーまたは参照セグメント長に対する残余セグメント長の累積のうち少なくともいずれか一に基づいて、該入力音声データの入力セグメント長を変化させるようにプログラムされたプロセッサと、
    プログラムおよびデータのいずれか一が記憶された、前記プロセッサがアクセス可能な記憶部と
    を有する音声信号処理システム。
JP2003345865A 2002-10-03 2003-10-03 音声データ処理方法、プログラム及び音声信号処理システム Expired - Fee Related JP4523257B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/264,042 US7426470B2 (en) 2002-10-03 2002-10-03 Energy-based nonuniform time-scale modification of audio signals

Publications (3)

Publication Number Publication Date
JP2004126595A JP2004126595A (ja) 2004-04-22
JP2004126595A5 true JP2004126595A5 (ja) 2006-11-16
JP4523257B2 JP4523257B2 (ja) 2010-08-11

Family

ID=32042136

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2003345865A Expired - Fee Related JP4523257B2 (ja) 2002-10-03 2003-10-03 音声データ処理方法、プログラム及び音声信号処理システム

Country Status (2)

Country Link
US (3) US7426470B2 (ja)
JP (1) JP4523257B2 (ja)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6889383B1 (en) 2000-10-23 2005-05-03 Clearplay, Inc. Delivery of navigation data for playback of audio and video content
US7975021B2 (en) 2000-10-23 2011-07-05 Clearplay, Inc. Method and user interface for downloading audio and video content filters to a media player
US7426470B2 (en) * 2002-10-03 2008-09-16 Ntt Docomo, Inc. Energy-based nonuniform time-scale modification of audio signals
US8086448B1 (en) * 2003-06-24 2011-12-27 Creative Technology Ltd Dynamic modification of a high-order perceptual attribute of an audio signal
BRPI0413407A (pt) * 2003-08-26 2006-10-10 Clearplay Inc método e processador de controle da reprodução de um sinal de áudio
US7596488B2 (en) * 2003-09-15 2009-09-29 Microsoft Corporation System and method for real-time jitter control and packet-loss concealment in an audio signal
US8117282B2 (en) 2004-10-20 2012-02-14 Clearplay, Inc. Media player configured to receive playback filters from alternative storage mediums
US20060109983A1 (en) * 2004-11-19 2006-05-25 Young Randall K Signal masking and method thereof
BRPI0612974A2 (pt) 2005-04-18 2010-12-14 Clearplay Inc produto de programa de computador, sinal de dados de computador incorporado em uma mÍdia de transmissço, mÉtodo para associar uma apresentaÇço de multimÍdia com informaÇÕes de filtro de conteédo e reprodutor de multimÍdia
WO2007124582A1 (en) * 2006-04-27 2007-11-08 Technologies Humanware Canada Inc. Method for the time scaling of an audio signal
US7961851B2 (en) * 2006-07-26 2011-06-14 Cisco Technology, Inc. Method and system to select messages using voice commands and a telephone user interface
US20080221876A1 (en) * 2007-03-08 2008-09-11 Universitat Fur Musik Und Darstellende Kunst Method for processing audio data into a condensed version
US8285241B2 (en) * 2009-07-30 2012-10-09 Broadcom Corporation Receiver apparatus having filters implemented using frequency translation techniques
US8670990B2 (en) * 2009-08-03 2014-03-11 Broadcom Corporation Dynamic time scale modification for reduced bit rate audio coding
BR112015032174B1 (pt) 2013-06-21 2021-02-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V escalador de tempo, descodificador de áudio, método e um programa de computador utilizando um controle de qualidade
EP3011692B1 (en) 2013-06-21 2017-06-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Jitter buffer control, audio decoder, method and computer program
US10629223B2 (en) 2017-05-31 2020-04-21 International Business Machines Corporation Fast playback in media files with reduced impact to speech quality
US10878835B1 (en) * 2018-11-16 2020-12-29 Amazon Technologies, Inc System for shortening audio playback times
US11039177B2 (en) * 2019-03-19 2021-06-15 Rovi Guides, Inc. Systems and methods for varied audio segment compression for accelerated playback of media assets
US10708633B1 (en) 2019-03-19 2020-07-07 Rovi Guides, Inc. Systems and methods for selective audio segment compression for accelerated playback of media assets
US11102523B2 (en) 2019-03-19 2021-08-24 Rovi Guides, Inc. Systems and methods for selective audio segment compression for accelerated playback of media assets by service providers
CN110311424B (zh) * 2019-05-21 2023-01-20 沈阳工业大学 一种基于双时间尺度净负荷预测的储能调峰控制方法
US11227579B2 (en) * 2019-08-08 2022-01-18 International Business Machines Corporation Data augmentation by frame insertion for speech data

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US671309A (en) * 1900-07-26 1901-04-02 William J Cunningham Bottle-stopper.
US4052568A (en) * 1976-04-23 1977-10-04 Communications Satellite Corporation Digital voice switch
US4665548A (en) * 1983-10-07 1987-05-12 American Telephone And Telegraph Company At&T Bell Laboratories Speech analysis syllabic segmenter
US4998280A (en) * 1986-12-12 1991-03-05 Hitachi, Ltd. Speech recognition apparatus capable of discriminating between similar acoustic features of speech
EP0427953B1 (en) * 1989-10-06 1996-01-17 Matsushita Electric Industrial Co., Ltd. Apparatus and method for speech rate modification
US5195138A (en) * 1990-01-18 1993-03-16 Matsushita Electric Industrial Co., Ltd. Voice signal processing device
US5349645A (en) * 1991-12-31 1994-09-20 Matsushita Electric Industrial Co., Ltd. Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches
JPH06202692A (ja) * 1993-01-06 1994-07-22 Nippon Telegr & Teleph Corp <Ntt> 音声再生速度制御システム
US5630013A (en) * 1993-01-25 1997-05-13 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for performing time-scale modification of speech signals
US5675705A (en) * 1993-09-27 1997-10-07 Singhal; Tara Chand Spectrogram-feature-based speech syllable and word recognition using syllabic language dictionary
US5717823A (en) * 1994-04-14 1998-02-10 Lucent Technologies Inc. Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders
US5694521A (en) * 1995-01-11 1997-12-02 Rockwell International Corporation Variable speed playback system
US5920840A (en) * 1995-02-28 1999-07-06 Motorola, Inc. Communication system and method using a speaker dependent time-scaling technique
US5828955A (en) * 1995-08-30 1998-10-27 Rockwell Semiconductor Systems, Inc. Near direct conversion receiver and method for equalizing amplitude and phase therein
WO1997017692A1 (en) * 1995-11-07 1997-05-15 Euphonics, Incorporated Parametric signal modeling musical synthesizer
US5828994A (en) * 1996-06-05 1998-10-27 Interval Research Corporation Non-uniform time scale modification of recorded audio
US5893062A (en) * 1996-12-05 1999-04-06 Interval Research Corporation Variable rate video playback with synchronized audio
JP3619946B2 (ja) * 1997-03-19 2005-02-16 富士通株式会社 話速変換装置、話速変換方法及び記録媒体
JP3017715B2 (ja) * 1997-10-31 2000-03-13 松下電器産業株式会社 音声再生装置
US6226608B1 (en) * 1999-01-28 2001-05-01 Dolby Laboratories Licensing Corporation Data framing for adaptive-block-length coding system
US6625655B2 (en) * 1999-05-04 2003-09-23 Enounce, Incorporated Method and apparatus for providing continuous playback or distribution of audio and audio-visual streamed multimedia reveived over networks having non-deterministic delays
JP3430968B2 (ja) * 1999-05-06 2003-07-28 ヤマハ株式会社 ディジタル信号の時間軸圧伸方法及び装置
GB9911737D0 (en) * 1999-05-21 1999-07-21 Philips Electronics Nv Audio signal time scale modification
US6377931B1 (en) * 1999-09-28 2002-04-23 Mindspeed Technologies Speech manipulation for continuous speech playback over a packet network
AU2001242520A1 (en) * 2000-04-06 2001-10-23 Telefonaktiebolaget Lm Ericsson (Publ) Speech rate conversion
US6505153B1 (en) * 2000-05-22 2003-01-07 Compaq Information Technologies Group, L.P. Efficient method for producing off-line closed captions
US6718309B1 (en) * 2000-07-26 2004-04-06 Ssi Corporation Continuously variable time scale modification of digital audio signals
WO2002013185A1 (en) * 2000-08-09 2002-02-14 Thomson Licensing S.A. Method and system for enabling audio speed conversion
JP2002258900A (ja) * 2001-02-28 2002-09-11 Toshiba Corp 音声再生装置及び音声再生方法
US7171367B2 (en) * 2001-12-05 2007-01-30 Ssi Corporation Digital audio with parameters for real-time time scaling
US7065485B1 (en) * 2002-01-09 2006-06-20 At&T Corp Enhancing speech intelligibility using variable-rate time-scale modification
US6844510B2 (en) * 2002-08-09 2005-01-18 Stonebridge Control Devices, Inc. Stalk switch
US7426470B2 (en) * 2002-10-03 2008-09-16 Ntt Docomo, Inc. Energy-based nonuniform time-scale modification of audio signals

Similar Documents

Publication Publication Date Title
JP2004126595A5 (ja)
JP6752255B2 (ja) オーディオ信号分類方法及び装置
RU2417456C2 (ru) Системы, способы и устройства для обнаружения изменения сигналов
JP2009503615A5 (ja)
US20060111901A1 (en) Method and apparatus for detecting speech segments in speech signal processing
US11417353B2 (en) Method for detecting audio signal and apparatus
CN102113050B (zh) 音频信号的瞬态检测方法及设备
US20200285933A1 (en) Deep neural network-based method and device for quantifying activation amount
WO2005066868A3 (en) Sleep and environment control method and system
AU2017204235A1 (en) Signal encoding method and device
JP2006508559A5 (ja)
EP2037412A3 (de) Verfahren und Anordnung zur Arithmetischen Enkodierung und Dekodierung von binaeren Zustaenden sowie ein entsprechendes Computerprogramm und ein entsprechendes computerlesbares Speichermedium
WO2006050145A3 (en) Methods and apparatus for parallel execution of a process
CN100444106C (zh) 在可变比特率格式的mp3文件中实现定位的方法
JP6141443B2 (ja) 符号化方法、復号化方法、符号化装置及び復号化装置
JP2005080123A5 (ja)
CN112331188A (zh) 一种语音数据处理方法、系统及终端设备
AU2002363894A1 (en) Method of optimising the execution of a neural network in a speech recognition system through conditionally skipping a variable number of frames
US10896021B2 (en) Dynamically preventing audio underrun using machine learning
WO2004072848A3 (en) Method and apparatus for hazard detection and management in a pipelined digital processor
JPWO2003107326A1 (ja) 音声認識方法及びその装置
KR20190042770A (ko) 오디오 코딩 방법 및 관련 장치
CN104038611A (zh) 依据环境调整音量的装置与方法
US9165561B2 (en) Apparatus and method for processing voice signal
EP4350694A2 (en) Method for processing lost frame, and decoder