MXPA03010749A - Comparing audio using characterizations based on auditory events - Google Patents
Comparing audio using characterizations based on auditory events
- Publication number
- MXPA03010749A
- Authority
- MX
- Mexico
- Prior art keywords
- characterizations
- delay
- measure
- similarity
- effect
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/60—Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/04—Synchronising
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Auxiliary Devices For Music (AREA)
- Television Systems (AREA)
- Stereophonic System (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
Abstract
A method for determining whether one audio signal is derived from another audio signal, or whether two audio signals are derived from the same audio signal, compares reduced-information characterizations of those audio signals, where the characterizations are based on auditory scene analysis. The comparison removes from the characterizations, or minimizes in them, the effect of temporal shift or delay in the audio signals (5-1), calculates a measure of similarity (5-2), and compares the measure of similarity against a threshold. In one alternative, the effect of temporal shift or delay is removed or minimized by cross-correlating the two characterizations. In another alternative, the effect of temporal shift or delay is removed or minimized by transforming the characterizations into a domain that is independent of temporal delay effects, such as the frequency domain. In both cases, a measure of similarity is calculated by computing a coefficient of correlation. The most representative figure of the invention is Figure 5.
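The two alternatives named in the abstract can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: it assumes each characterization is a binary sequence with one value per audio block (1 where an auditory event boundary falls, 0 otherwise), which the abstract does not specify. All function names, the brute-force lag search, the naive DFT, and the threshold value of 0.8 are hypothetical choices made for the example.

```python
import cmath
import math


def pearson(a, b):
    """Coefficient of correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant sequence carries no pattern to correlate
    return cov / math.sqrt(var_a * var_b)


def overlap(a, b, lag):
    """Pairs (a[i], b[i + lag]) for every index where both samples exist."""
    return [(a[i], b[i + lag]) for i in range(len(a)) if 0 <= i + lag < len(b)]


def best_lag(a, b, max_lag):
    """Lag in [-max_lag, max_lag] that maximizes the cross-correlation."""
    return max(range(-max_lag, max_lag + 1),
               key=lambda lag: sum(x * y for x, y in overlap(a, b, lag)))


def similarity_by_cross_correlation(a, b, max_lag):
    """Alternative 1: align by cross-correlation, then correlate the overlap."""
    lag = best_lag(a, b, max_lag)
    xs, ys = zip(*overlap(a, b, lag))
    return lag, pearson(xs, ys)


def dft_magnitudes(x):
    """Magnitudes of DFT bins 0..n/2; unchanged by a circular shift of x."""
    n = len(x)
    return [abs(sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t in range(n)))
            for k in range(n // 2 + 1)]


def similarity_by_frequency_domain(a, b):
    """Alternative 2: compare shift-independent magnitude spectra."""
    return pearson(dft_magnitudes(a), dft_magnitudes(b))


THRESHOLD = 0.8  # hypothetical decision threshold, not taken from the patent

# Made-up characterizations: 1 marks a block containing an event boundary.
reference = [0, 1, 0, 0, 1, 0, 0, 0, 1, 0, 1, 0, 0, 1, 0, 0]
delayed = [0, 0] + reference[:-2]  # same pattern, two blocks later
unrelated = [1, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1]

lag, rho = similarity_by_cross_correlation(reference, delayed, max_lag=4)
_, rho_other = similarity_by_cross_correlation(reference, unrelated, max_lag=4)
print("delayed copy:", lag, rho >= THRESHOLD)
print("unrelated:", rho_other >= THRESHOLD)
print("frequency domain:",
      similarity_by_frequency_domain(reference, delayed) >= THRESHOLD)
```

In this sketch the magnitude spectrum is exactly shift-independent only for a circular shift. The delayed example happens to be one because the two trailing blocks of the reference are zero. For a general linear delay the invariance is only approximate.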
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US29382501P | 2001-05-25 | 2001-05-25 | |
US4564402A | 2002-01-11 | 2002-01-11 | |
US35149802P | 2002-01-23 | 2002-01-23 | |
PCT/US2002/004317 WO2002084645A2 (en) | 2001-04-13 | 2002-02-12 | High quality time-scaling and pitch-scaling of audio signals |
PCT/US2002/005329 WO2002097790A1 (en) | 2001-05-25 | 2002-02-22 | Comparing audio using characterizations based on auditory events |
Publications (1)
Publication Number | Publication Date |
---|---|
MXPA03010749A (es) | 2004-07-01 |
Family
ID=27485955
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MXPA03010749A (es) | Comparing audio using characterizations based on auditory events | 2001-05-25 | 2002-02-22 |
Country Status (7)
Country | Link |
---|---|
EP (3) | EP1393298B1 (es) |
JP (1) | JP4272050B2 (es) |
CN (1) | CN1524258B (es) |
AU (3) | AU2002240461B2 (es) |
CA (3) | CA2447911C (es) |
MX (1) | MXPA03010749A (es) |
WO (2) | WO2002097790A1 (es) |
Families Citing this family (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7711123B2 (en) | 2001-04-13 | 2010-05-04 | Dolby Laboratories Licensing Corporation | Segmenting audio signals into auditory events |
US7283954B2 (en) | 2001-04-13 | 2007-10-16 | Dolby Laboratories Licensing Corporation | Comparing audio using characterizations based on auditory events |
US7610205B2 (en) | 2002-02-12 | 2009-10-27 | Dolby Laboratories Licensing Corporation | High quality time-scaling and pitch-scaling of audio signals |
US7461002B2 (en) | 2001-04-13 | 2008-12-02 | Dolby Laboratories Licensing Corporation | Method for time aligning audio signals using characterizations based on auditory events |
WO2002093560A1 (en) | 2001-05-10 | 2002-11-21 | Dolby Laboratories Licensing Corporation | Improving transient performance of low bit rate audio coding systems by reducing pre-noise |
US6934677B2 (en) | 2001-12-14 | 2005-08-23 | Microsoft Corporation | Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands |
US7240001B2 (en) | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
US7502743B2 (en) * | 2002-09-04 | 2009-03-10 | Microsoft Corporation | Multi-channel audio encoding and decoding with multi-channel transform selection |
ATE527654T1 (de) | 2004-03-01 | 2011-10-15 | Dolby Lab Licensing Corp | Multi-channel audio decoding |
US7508947B2 (en) | 2004-08-03 | 2009-03-24 | Dolby Laboratories Licensing Corporation | Method for combining audio signals using auditory scene analysis |
WO2006030754A1 (ja) * | 2004-09-17 | 2006-03-23 | Matsushita Electric Industrial Co., Ltd. | Audio encoding device, decoding device, method, and program |
MX2007006164A (es) | 2004-11-22 | 2007-09-19 | Nielsen Media Res Inc | Methods and apparatus for media source identification and time-shifted media consumption measurements. |
EP1729173A3 (en) * | 2005-05-27 | 2007-01-03 | Telegraf ApS | System for generating synchronized add-on information |
AU2006255662B2 (en) | 2005-06-03 | 2012-08-23 | Dolby Laboratories Licensing Corporation | Apparatus and method for encoding audio signals with decoding instructions |
DE102005045573B3 (de) * | 2005-06-22 | 2006-11-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for determining a location in a film |
DE102005045627A1 (de) | 2005-06-22 | 2007-01-25 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for performing a correlation between a test audio signal, which is playable at variable speed, and a reference audio signal |
US7948557B2 (en) | 2005-06-22 | 2011-05-24 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for generating a control signal for a film event system |
DE102005045628B3 (de) * | 2005-06-22 | 2007-01-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for determining a location in a film that has film information applied in a temporal sequence |
TWI396188B (zh) * | 2005-08-02 | 2013-05-11 | Dolby Lab Licensing Corp | Technique for controlling spatial audio coding parameters as a function of auditory events |
CN1937032B (zh) * | 2005-09-22 | 2011-06-15 | 财团法人工业技术研究院 | Method for segmenting a speech data sequence |
US7831434B2 (en) | 2006-01-20 | 2010-11-09 | Microsoft Corporation | Complex-transform channel coding with extended-band frequency coding |
US8378964B2 (en) | 2006-04-13 | 2013-02-19 | Immersion Corporation | System and method for automatically producing haptic events from a digital audio signal |
US8000825B2 (en) | 2006-04-13 | 2011-08-16 | Immersion Corporation | System and method for automatically producing haptic events from a digital audio file |
US7979146B2 (en) * | 2006-04-13 | 2011-07-12 | Immersion Corporation | System and method for automatically producing haptic events from a digital audio signal |
ATE493794T1 (de) | 2006-04-27 | 2011-01-15 | Dolby Lab Licensing Corp | Audio gain control using specific-loudness-based auditory event detection |
US7885819B2 (en) | 2007-06-29 | 2011-02-08 | Microsoft Corporation | Bitstream syntax for multi-process audio decoding |
GB2457694B (en) | 2008-02-21 | 2012-09-26 | Snell Ltd | Method of Deriving an Audio-Visual Signature |
US8860883B2 (en) | 2009-11-30 | 2014-10-14 | Miranda Technologies Partnership | Method and apparatus for providing signatures of audio/video signals and for making use thereof |
GB2511655B (en) * | 2009-11-30 | 2014-10-15 | Miranda Technologies Inc | Method and apparatus for providing signatures of audio/video signals and for making use thereof |
KR101841313B1 (ko) * | 2010-09-22 | 2018-03-22 | 톰슨 라이센싱 | Method for processing multimedia flows and corresponding device |
WO2014151813A1 (en) | 2013-03-15 | 2014-09-25 | Dolby Laboratories Licensing Corporation | Normalization of soundfield orientations based on auditory scene analysis |
CN106792346A (zh) * | 2016-11-14 | 2017-05-31 | 广东小天才科技有限公司 | Audio adjustment method and device for instructional videos |
CN106653029A (zh) * | 2016-12-02 | 2017-05-10 | 广东小天才科技有限公司 | Method and device for batch audio segmentation |
EP3646323B1 (en) | 2017-06-27 | 2021-07-07 | Dolby International AB | Hybrid audio signal synchronization based on cross-correlation and attack analysis |
CN107481739B (zh) * | 2017-08-16 | 2021-04-02 | 成都品果科技有限公司 | Audio cutting method and device |
CN112927710B (zh) * | 2021-01-21 | 2021-10-26 | 安徽南瑞继远电网技术有限公司 | Unsupervised method for separating operating-condition noise of a power transformer |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4624009A (en) * | 1980-05-02 | 1986-11-18 | Figgie International, Inc. | Signal pattern encoder and classifier |
US5040081A (en) * | 1986-09-23 | 1991-08-13 | Mccutchen David | Audiovisual synchronization signal generator using audio signature comparison |
US5055939A (en) * | 1987-12-15 | 1991-10-08 | Karamon John J | Method system & apparatus for synchronizing an auxiliary sound source containing multiple language channels with motion picture film video tape or other picture source containing a sound track |
WO1991019989A1 (en) * | 1990-06-21 | 1991-12-26 | Reynolds Software, Inc. | Method and apparatus for wave analysis and event recognition |
US5175769A (en) * | 1991-07-23 | 1992-12-29 | Rolm Systems | Method for time-scale modification of signals |
US6211919B1 (en) * | 1997-03-28 | 2001-04-03 | Tektronix, Inc. | Transparent embedment of data in a video signal |
US7457422B2 (en) * | 2000-11-29 | 2008-11-25 | Ford Global Technologies, Llc | Method and implementation for detecting and characterizing audible transients in noise |
-
2002
- 2002-02-22 MX MXPA03010749A patent/MXPA03010749A/es active IP Right Grant
- 2002-02-22 CA CA2447911A patent/CA2447911C/en not_active Expired - Fee Related
- 2002-02-22 JP JP2003500891A patent/JP4272050B2/ja not_active Expired - Fee Related
- 2002-02-22 WO PCT/US2002/005329 patent/WO2002097790A1/en active IP Right Grant
- 2002-02-22 EP EP02706372A patent/EP1393298B1/en not_active Expired - Lifetime
- 2002-02-22 AU AU2002240461A patent/AU2002240461B2/en not_active Ceased
- 2002-02-25 AU AU2002242265A patent/AU2002242265B2/en not_active Ceased
- 2002-02-25 EP EP04029183.3A patent/EP1519363B1/en not_active Expired - Lifetime
- 2002-02-25 CA CA2448178A patent/CA2448178C/en not_active Expired - Fee Related
- 2002-02-26 AU AU2002252143A patent/AU2002252143B2/en not_active Expired
- 2002-02-26 CA CA2448182A patent/CA2448182C/en not_active Expired - Lifetime
- 2002-02-26 WO PCT/US2002/005999 patent/WO2002097792A1/en active Application Filing
- 2002-02-26 EP EP02721201A patent/EP1393300B1/en not_active Expired - Lifetime
- 2002-02-26 CN CN028106717A patent/CN1524258B/zh not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
AU2002252143B2 (en) | 2008-05-29 |
JP2004528599A (ja) | 2004-09-16 |
WO2002097790A1 (en) | 2002-12-05 |
EP1519363A1 (en) | 2005-03-30 |
CN1524258B (zh) | 2012-03-21 |
CA2448182A1 (en) | 2002-12-05 |
EP1519363B1 (en) | 2013-07-24 |
CA2448178A1 (en) | 2002-12-05 |
CA2448182C (en) | 2011-06-28 |
WO2002097790A8 (en) | 2003-07-31 |
EP1393300A1 (en) | 2004-03-03 |
CA2447911C (en) | 2011-07-05 |
EP1393298B1 (en) | 2010-06-09 |
EP1393298A1 (en) | 2004-03-03 |
CA2448178C (en) | 2011-05-10 |
WO2002097792A1 (en) | 2002-12-05 |
AU2002242265B2 (en) | 2007-05-17 |
AU2002240461B2 (en) | 2007-05-17 |
CN1524258A (zh) | 2004-08-25 |
EP1393300B1 (en) | 2012-11-28 |
AU2002242265B8 (en) | 2002-12-09 |
JP4272050B2 (ja) | 2009-06-03 |
CA2447911A1 (en) | 2002-12-05 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
MXPA03010749A (es) | Comparing audio using characterizations based on auditory events. | |
US7505823B1 (en) | Acoustic communication system | |
US5867581A (en) | Hearing aid | |
AU2003268002A8 (en) | Frequency domain equalization of communication signals | |
WO2003001684A3 (en) | A timing estimation method and apparatus for a location system | |
DE69617744D1 (de) | System for position determination | |
EP1167993A3 (en) | A method for measuring distance and position using spread spectrum signals, and an equipment using the method | |
WO2003023440A3 (en) | System and method to estimate the location of a receiver in a multi-path environment | |
WO2006041735A3 (en) | Reverberation removal | |
AU2003212592A1 (en) | Coding of stereo signals | |
WO2001034264A1 (en) | Acoustic location system | |
WO2001052188A3 (en) | Method and apparatus for edge detection | |
WO2001073751A8 (en) | Speech presence measurement detection techniques | |
MY124731A (en) | Method and apparatus for estimating a frequency offset by combining pilot symbols and data symbols | |
WO2002103890A3 (en) | Time alignment of signals | |
MXPA05003922A (es) | Method for calculating a parameter of local maxima or minima of a correlation function derived from a received signal. | |
WO2004102813A3 (en) | Estimation of multipath channel with sub-chip resolution | |
WO2003030588A3 (de) | Method and device for selecting a sound algorithm | |
Christensen et al. | Integrating pitch and localisation cues at a speech fragment level | |
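KR101943535B1 (ko) | Digital switching signal sequence for switching purposes, apparatus for including the digital switching signal sequence in a digital audio information signal, and apparatus for receiving the information signal provided with the switching signal sequence | |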
NO20010313L (no) | Estimering av kanalimpulsrespons ved bruk av mottatt signalvarians | |
EP0871299A3 (en) | A synchronization circuit with correlated signal in the direct spread spectrum telecommunication system | |
EP0725492A3 (en) | Perceptual stereo audio encoder | |
US5926553A (en) | Method for measuring the conservation of stereophonic audio signals and method for identifying jointly coded stereophonic audio signals | |
ATE297338T1 (de) | Method for increasing the signal-to-noise ratio at counting points of an axle counting system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |