CN1181468C - 数字音频信号的连续可变时间标度改变技术 - Google Patents
数字音频信号的连续可变时间标度改变技术 Download PDFInfo
- Publication number
- CN1181468C CN1181468C CNB018122051A CN01812205A CN1181468C CN 1181468 C CN1181468 C CN 1181468C CN B018122051 A CNB018122051 A CN B018122051A CN 01812205 A CN01812205 A CN 01812205A CN 1181468 C CN1181468 C CN 1181468C
- Authority
- CN
- China
- Prior art keywords
- sample
- input
- described method
- output
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 21
- 230000004048 modification Effects 0.000 title abstract description 4
- 238000012986 modification Methods 0.000 title abstract description 4
- 238000000034 method Methods 0.000 claims abstract description 128
- 230000008859 change Effects 0.000 claims description 39
- 238000006243 chemical reaction Methods 0.000 claims description 21
- 230000009466 transformation Effects 0.000 claims description 15
- 230000015572 biosynthetic process Effects 0.000 claims description 2
- 238000012886 linear function Methods 0.000 claims description 2
- 238000005314 correlation function Methods 0.000 abstract description 4
- 230000008569 process Effects 0.000 abstract description 2
- 239000000872 buffer Substances 0.000 description 83
- 230000006870 function Effects 0.000 description 36
- 230000006835 compression Effects 0.000 description 25
- 238000007906 compression Methods 0.000 description 25
- 230000000875 corresponding effect Effects 0.000 description 15
- 238000005070 sampling Methods 0.000 description 13
- 238000012545 processing Methods 0.000 description 12
- 238000005516 engineering process Methods 0.000 description 8
- 230000008447 perception Effects 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 101000822695 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C1 Proteins 0.000 description 1
- 101000655262 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C2 Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 101000655256 Paraclostridium bifermentans Small, acid-soluble spore protein alpha Proteins 0.000 description 1
- 101000655264 Paraclostridium bifermentans Small, acid-soluble spore protein beta Proteins 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000003708 edge detection Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/01—Correction of time axis
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (24)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/626,046 US6718309B1 (en) | 2000-07-26 | 2000-07-26 | Continuously variable time scale modification of digital audio signals |
US09/626,046 | 2000-07-26 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1440549A CN1440549A (zh) | 2003-09-03 |
CN1181468C true CN1181468C (zh) | 2004-12-22 |
Family
ID=24508730
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB018122051A Expired - Fee Related CN1181468C (zh) | 2000-07-26 | 2001-07-17 | 数字音频信号的连续可变时间标度改变技术 |
Country Status (7)
Country | Link |
---|---|
US (1) | US6718309B1 (zh) |
EP (1) | EP1303855A2 (zh) |
JP (1) | JP2004505304A (zh) |
KR (1) | KR20030024784A (zh) |
CN (1) | CN1181468C (zh) |
TW (1) | TW518557B (zh) |
WO (1) | WO2002009090A2 (zh) |
Families Citing this family (71)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE338333T1 (de) * | 2001-04-05 | 2006-09-15 | Koninkl Philips Electronics Nv | Zeitskalenmodifikation von signalen mit spezifischem verfahren je nach ermitteltem signaltyp |
US7711123B2 (en) * | 2001-04-13 | 2010-05-04 | Dolby Laboratories Licensing Corporation | Segmenting audio signals into auditory events |
US7610205B2 (en) * | 2002-02-12 | 2009-10-27 | Dolby Laboratories Licensing Corporation | High quality time-scaling and pitch-scaling of audio signals |
US7131007B1 (en) * | 2001-06-04 | 2006-10-31 | At & T Corp. | System and method of retrieving a watermark within a signal |
US7146503B1 (en) * | 2001-06-04 | 2006-12-05 | At&T Corp. | System and method of watermarking signal |
US7171367B2 (en) * | 2001-12-05 | 2007-01-30 | Ssi Corporation | Digital audio with parameters for real-time time scaling |
KR100547444B1 (ko) * | 2002-08-08 | 2006-01-31 | 주식회사 코스모탄 | 가변길이합성과 상관도계산 감축 기법을 이용한오디오신호의 시간스케일 수정방법 |
US7941037B1 (en) * | 2002-08-27 | 2011-05-10 | Nvidia Corporation | Audio/video timescale compression system and method |
US7426470B2 (en) * | 2002-10-03 | 2008-09-16 | Ntt Docomo, Inc. | Energy-based nonuniform time-scale modification of audio signals |
US7426221B1 (en) | 2003-02-04 | 2008-09-16 | Cisco Technology, Inc. | Pitch invariant synchronization of audio playout rates |
US20040186709A1 (en) * | 2003-03-17 | 2004-09-23 | Chao-Wen Chi | System and method of synthesizing a plurality of voices |
JP3871657B2 (ja) * | 2003-05-27 | 2007-01-24 | 株式会社東芝 | 話速変換装置、方法、及びそのプログラム |
US8340972B2 (en) * | 2003-06-27 | 2012-12-25 | Motorola Mobility Llc | Psychoacoustic method and system to impose a preferred talking rate through auditory feedback rate adjustment |
US6999922B2 (en) * | 2003-06-27 | 2006-02-14 | Motorola, Inc. | Synchronization and overlap method and system for single buffer speech compression and expansion |
US7337108B2 (en) * | 2003-09-10 | 2008-02-26 | Microsoft Corporation | System and method for providing high-quality stretching and compression of a digital audio signal |
US20050137730A1 (en) * | 2003-12-18 | 2005-06-23 | Steven Trautmann | Time-scale modification of audio using separated frequency bands |
US6982377B2 (en) * | 2003-12-18 | 2006-01-03 | Texas Instruments Incorporated | Time-scale modification of music signals based on polyphase filterbanks and constrained time-domain processing |
US20050137729A1 (en) * | 2003-12-18 | 2005-06-23 | Atsuhiro Sakurai | Time-scale modification stereo audio signals |
US20050166135A1 (en) * | 2004-01-05 | 2005-07-28 | Burke David G. | Apparatus, system and method for synchronized playback of data transmitted over an asynchronous network |
US8423372B2 (en) * | 2004-08-26 | 2013-04-16 | Sisvel International S.A. | Processing of encoded signals |
US20060075347A1 (en) * | 2004-10-05 | 2006-04-06 | Rehm Peter H | Computerized notetaking system and method |
US20060149535A1 (en) * | 2004-12-30 | 2006-07-06 | Lg Electronics Inc. | Method for controlling speed of audio signals |
US7983922B2 (en) * | 2005-04-15 | 2011-07-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing |
US10438690B2 (en) * | 2005-05-16 | 2019-10-08 | Panvia Future Technologies, Inc. | Associative memory and data searching system and method |
US11561951B2 (en) | 2005-05-16 | 2023-01-24 | Panvia Future Technologies, Inc. | Multidimensional associative memory and data searching |
US20060269057A1 (en) * | 2005-05-26 | 2006-11-30 | Groove Mobile, Inc. | Systems and methods for high resolution signal analysis and chaotic data compression |
TW200709035A (en) * | 2005-08-30 | 2007-03-01 | Realtek Semiconductor Corp | Audio processing device and method thereof |
US8155972B2 (en) * | 2005-10-05 | 2012-04-10 | Texas Instruments Incorporated | Seamless audio speed change based on time scale modification |
US20070081663A1 (en) * | 2005-10-12 | 2007-04-12 | Atsuhiro Sakurai | Time scale modification of audio based on power-complementary IIR filter decomposition |
US8345890B2 (en) * | 2006-01-05 | 2013-01-01 | Audience, Inc. | System and method for utilizing inter-microphone level differences for speech enhancement |
JP5096932B2 (ja) * | 2006-01-24 | 2012-12-12 | パナソニック株式会社 | 変換装置 |
US8194880B2 (en) | 2006-01-30 | 2012-06-05 | Audience, Inc. | System and method for utilizing omni-directional microphones for speech enhancement |
US9185487B2 (en) | 2006-01-30 | 2015-11-10 | Audience, Inc. | System and method for providing noise suppression utilizing null processing noise subtraction |
US8744844B2 (en) | 2007-07-06 | 2014-06-03 | Audience, Inc. | System and method for adaptive intelligent noise suppression |
US8204252B1 (en) | 2006-10-10 | 2012-06-19 | Audience, Inc. | System and method for providing close microphone adaptive array processing |
WO2007124582A1 (en) * | 2006-04-27 | 2007-11-08 | Technologies Humanware Canada Inc. | Method for the time scaling of an audio signal |
US8150065B2 (en) | 2006-05-25 | 2012-04-03 | Audience, Inc. | System and method for processing an audio signal |
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
US8204253B1 (en) | 2008-06-30 | 2012-06-19 | Audience, Inc. | Self calibration of audio device |
US8849231B1 (en) | 2007-08-08 | 2014-09-30 | Audience, Inc. | System and method for adaptive power control |
US8934641B2 (en) * | 2006-05-25 | 2015-01-13 | Audience, Inc. | Systems and methods for reconstructing decomposed audio signals |
US7752038B2 (en) * | 2006-10-13 | 2010-07-06 | Nokia Corporation | Pitch lag estimation |
TWI312500B (en) * | 2006-12-08 | 2009-07-21 | Micro Star Int Co Ltd | Method of varying speech speed |
US8259926B1 (en) | 2007-02-23 | 2012-09-04 | Audience, Inc. | System and method for 2-channel and 3-channel acoustic echo cancellation |
US20080221876A1 (en) * | 2007-03-08 | 2008-09-11 | Universitat Fur Musik Und Darstellende Kunst | Method for processing audio data into a condensed version |
US8189766B1 (en) | 2007-07-26 | 2012-05-29 | Audience, Inc. | System and method for blind subband acoustic echo cancellation postfiltering |
US8050934B2 (en) * | 2007-11-29 | 2011-11-01 | Texas Instruments Incorporated | Local pitch control based on seamless time scale modification and synchronized sampling rate conversion |
US8180064B1 (en) | 2007-12-21 | 2012-05-15 | Audience, Inc. | System and method for providing voice equalization |
US8143620B1 (en) | 2007-12-21 | 2012-03-27 | Audience, Inc. | System and method for adaptive classification of audio sources |
EP2077671B1 (en) * | 2008-01-07 | 2019-06-19 | Vestel Elektronik Sanayi ve Ticaret A.S. | Streaming media player and method |
US8194882B2 (en) | 2008-02-29 | 2012-06-05 | Audience, Inc. | System and method for providing single microphone noise suppression fallback |
US8355511B2 (en) | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
US8521530B1 (en) | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
US8774423B1 (en) | 2008-06-30 | 2014-07-08 | Audience, Inc. | System and method for controlling adaptivity of signal modification using a phantom coefficient |
EP2141696A1 (en) * | 2008-07-03 | 2010-01-06 | Deutsche Thomson OHG | Method for time scaling of a sequence of input signal values |
ES2379761T3 (es) * | 2008-07-11 | 2012-05-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Proporcinar una señal de activación de distorsión de tiempo y codificar una señal de audio con la misma |
US20100063825A1 (en) * | 2008-09-05 | 2010-03-11 | Apple Inc. | Systems and Methods for Memory Management and Crossfading in an Electronic Device |
US8379794B2 (en) * | 2008-09-05 | 2013-02-19 | The Board Of Trustees Of The Leland Stanford Junior University | Method to estimate position, motion and trajectory of a target with a single x-ray imager |
US8655466B2 (en) * | 2009-02-27 | 2014-02-18 | Apple Inc. | Correlating changes in audio |
US9031850B2 (en) * | 2009-08-20 | 2015-05-12 | Gvbb Holdings S.A.R.L. | Audio stream combining apparatus, method and program |
CN102117613B (zh) * | 2009-12-31 | 2012-12-12 | 展讯通信(上海)有限公司 | 数字音频变速处理方法及其设备 |
US9008329B1 (en) | 2010-01-26 | 2015-04-14 | Audience, Inc. | Noise reduction using multi-feature cluster tracker |
US20120035922A1 (en) * | 2010-08-05 | 2012-02-09 | Carroll Martin D | Method and apparatus for controlling word-separation during audio playout |
US8473084B2 (en) | 2010-09-01 | 2013-06-25 | Apple Inc. | Audio crossfading |
US8996389B2 (en) * | 2011-06-14 | 2015-03-31 | Polycom, Inc. | Artifact reduction in time compression |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
CN104123943B (zh) * | 2013-04-28 | 2017-05-31 | 安凯(广州)微电子技术有限公司 | 一种音频信号重采样的方法和装置 |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
EP2881944B1 (en) * | 2013-12-05 | 2016-04-13 | Nxp B.V. | Audio signal processing apparatus |
WO2016033364A1 (en) | 2014-08-28 | 2016-03-03 | Audience, Inc. | Multi-sourced noise suppression |
US11418879B2 (en) * | 2020-05-13 | 2022-08-16 | Nxp B.V. | Audio signal blending with beat alignment |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4417103A (en) | 1981-05-11 | 1983-11-22 | The Variable Speech Control Company ("Vsc") | Stereo reproduction with gapless splicing of pitch altered waveforms |
IL84902A (en) | 1987-12-21 | 1991-12-15 | D S P Group Israel Ltd | Digital autocorrelation system for detecting speech in noisy audio signal |
DE69024919T2 (de) | 1989-10-06 | 1996-10-17 | Matsushita Electric Ind Co Ltd | Einrichtung und Methode zur Veränderung von Sprechgeschwindigkeit |
US5175769A (en) | 1991-07-23 | 1992-12-29 | Rolm Systems | Method for time-scale modification of signals |
DE69228211T2 (de) | 1991-08-09 | 1999-07-08 | Koninkl Philips Electronics Nv | Verfahren und Apparat zur Handhabung von Höhe und Dauer eines physikalischen Audiosignals |
EP0608833B1 (en) | 1993-01-25 | 2001-10-17 | Matsushita Electric Industrial Co., Ltd. | Method of and apparatus for performing time-scale modification of speech signals |
US5694521A (en) * | 1995-01-11 | 1997-12-02 | Rockwell International Corporation | Variable speed playback system |
US5828995A (en) | 1995-02-28 | 1998-10-27 | Motorola, Inc. | Method and apparatus for intelligible fast forward and reverse playback of time-scale compressed voice messages |
US5832442A (en) | 1995-06-23 | 1998-11-03 | Electronics Research & Service Organization | High-effeciency algorithms using minimum mean absolute error splicing for pitch and rate modification of audio signals |
US5806023A (en) | 1996-02-23 | 1998-09-08 | Motorola, Inc. | Method and apparatus for time-scale modification of a signal |
US5893062A (en) * | 1996-12-05 | 1999-04-06 | Interval Research Corporation | Variable rate video playback with synchronized audio |
US6622171B2 (en) * | 1998-09-15 | 2003-09-16 | Microsoft Corporation | Multimedia timeline modification in networked client/server systems |
US6665751B1 (en) * | 1999-04-17 | 2003-12-16 | International Business Machines Corporation | Streaming media player varying a play speed from an original to a maximum allowable slowdown proportionally in accordance with a buffer state |
US6625655B2 (en) * | 1999-05-04 | 2003-09-23 | Enounce, Incorporated | Method and apparatus for providing continuous playback or distribution of audio and audio-visual streamed multimedia reveived over networks having non-deterministic delays |
US6278387B1 (en) * | 1999-09-28 | 2001-08-21 | Conexant Systems, Inc. | Audio encoder and decoder utilizing time scaling for variable playback |
-
2000
- 2000-07-26 US US09/626,046 patent/US6718309B1/en not_active Expired - Fee Related
-
2001
- 2001-07-17 EP EP01955854A patent/EP1303855A2/en not_active Withdrawn
- 2001-07-17 WO PCT/US2001/022540 patent/WO2002009090A2/en not_active Application Discontinuation
- 2001-07-17 JP JP2002514712A patent/JP2004505304A/ja active Pending
- 2001-07-17 CN CNB018122051A patent/CN1181468C/zh not_active Expired - Fee Related
- 2001-07-17 KR KR10-2003-7000621A patent/KR20030024784A/ko not_active Application Discontinuation
- 2001-07-25 TW TW090118180A patent/TW518557B/zh not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
KR20030024784A (ko) | 2003-03-26 |
JP2004505304A (ja) | 2004-02-19 |
US6718309B1 (en) | 2004-04-06 |
EP1303855A2 (en) | 2003-04-23 |
CN1440549A (zh) | 2003-09-03 |
TW518557B (en) | 2003-01-21 |
WO2002009090A3 (en) | 2002-07-18 |
WO2002009090A2 (en) | 2002-01-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1181468C (zh) | 数字音频信号的连续可变时间标度改变技术 | |
US8473298B2 (en) | Pre-resampling to achieve continuously variable analysis time/frequency resolution | |
EP1735779B1 (en) | Encoder apparatus, decoder apparatus, methods thereof and associated audio system | |
EP1595247B1 (en) | Audio coding | |
KR101016982B1 (ko) | 디코딩 장치 | |
CN101189661B (zh) | 用于产生数据流和产生多通道表示的设备和方法 | |
CN101542597B (zh) | 用于编码和解码基于对象的音频信号的方法和装置 | |
US7917358B2 (en) | Transient detection by power weighted average | |
CN101385075B (zh) | 用于编码/解码信号的装置和方法 | |
US20050169482A1 (en) | Audio spatial environment engine | |
CN1144369A (zh) | 音乐伴奏演奏装置的自动音调调整 | |
CN105190747A (zh) | 用于空间音频对象编码中时间/频率分辨率的反向兼容动态适应的编码器、解码器及方法 | |
CN1669358A (zh) | 音频编码 | |
CN104681030A (zh) | 用于编码/解码信号的装置和方法 | |
CN1761998A (zh) | 多信道信号的处理 | |
CN1781338A (zh) | 基于复指数调制的滤波器组的高级处理和自适应时间信号传送方法 | |
WO2007102675A1 (en) | Method, medium, and system generating a stereo signal | |
CN102483921A (zh) | 用于对多声道音频信号进行编码的方法和设备以及用于对多声道音频信号进行解码的方法和设备 | |
CN1848691A (zh) | 声信号处理装置和方法 | |
US7580833B2 (en) | Constant pitch variable speed audio decoding | |
CN1573920A (zh) | 使用独立分量分析算法分离音乐与语音的装置与方法 | |
EP3916725B1 (en) | Stereo signal processing apparatus | |
CN1062365C (zh) | 发送和接收编码话音的方法 | |
US20230306943A1 (en) | Vocal track removal by convolutional neural network embedded voice finger printing on standard arm embedded platform | |
Turner | Linear predictive modelling and efficient speech encoding. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Free format text: FORMER OWNER: R. SELLY Effective date: 20040709 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20040709 Address after: Tokyo, Japan, Japan Applicant after: SSI Corp. Address before: Tokyo, Japan, Japan Applicant before: SSI Corp. Co-applicant before: R. Selly |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: WD Ref document number: 1056252 Country of ref document: HK |