MXPA03010749A - Comparing audio using characterizations based on auditory events - Google Patents

Comparing audio using characterizations based on auditory events

Info

Publication number
MXPA03010749A
Authority
MX
Mexico
Prior art keywords
characterizations
delay
measure
similarity
effect
Prior art date
Application number
MXPA03010749A
Other languages
English (en)
Inventor
Brett G. Crockett
Original Assignee
Dolby Lab Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2001-05-25
Filing date
2002-02-22
Publication date
2004-07-01
Priority claimed from PCT/US2002/004317 (WO2002084645A2)
Application filed by Dolby Lab Licensing Corp
Publication of MXPA03010749A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/26 Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04 Time compression or expansion
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/44 Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/60 Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/04 Synchronising

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Auxiliary Devices For Music (AREA)
  • Television Systems (AREA)
  • Stereophonic System (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Abstract

A method for determining whether one audio signal is derived from another audio signal, or whether two audio signals are derived from the same audio signal, compares reduced-information characterizations of those audio signals, where the characterizations are based on auditory scene analysis. The comparison removes from the characterizations, or minimizes in them, the effect of temporal shift or delay on the audio signals (5-1), calculates a measure of similarity (5-2), and compares the measure of similarity against a threshold. In one alternative, the effect of temporal shift or delay is removed or minimized by cross-correlating the two characterizations. In another alternative, the effect of temporal shift or delay is removed or minimized by transforming the characterizations into a domain that is independent of temporal delay effects, such as the frequency domain. In both cases, a measure of similarity is calculated by computing a coefficient of correlation. The most representative figure of the invention is Figure 5.
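The two alternatives named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each characterization is a one-dimensional numeric array (for example, a vector with a 1 wherever an auditory event boundary falls), which the abstract does not specify. The function names, the use of NumPy, and the threshold value of 0.8 are likewise assumptions chosen for illustration.

```python
import numpy as np


def correlation_coefficient(a, b):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt(np.sum(a * a) * np.sum(b * b))
    return float(np.sum(a * b) / denom) if denom > 0 else 0.0


def similarity_by_cross_correlation(sig1, sig2):
    """Alternative 1: estimate the delay by cross-correlation, align, then correlate."""
    s1 = np.asarray(sig1, dtype=float)
    s2 = np.asarray(sig2, dtype=float)
    xcorr = np.correlate(s1, s2, mode="full")
    # Output index i of the full cross-correlation corresponds to lag i - (len(s2) - 1).
    lag = int(np.argmax(xcorr)) - (len(s2) - 1)
    # A positive lag means s1 is delayed relative to s2, so drop the start of s1.
    if lag >= 0:
        a, b = s1[lag:], s2
    else:
        a, b = s1, s2[-lag:]
    n = min(len(a), len(b))
    return correlation_coefficient(a[:n], b[:n]), lag


def similarity_by_magnitude_spectrum(sig1, sig2):
    """Alternative 2: compare FFT magnitudes, which do not change under a circular shift."""
    n = max(len(sig1), len(sig2))
    m1 = np.abs(np.fft.rfft(np.asarray(sig1, dtype=float), n=n))
    m2 = np.abs(np.fft.rfft(np.asarray(sig2, dtype=float), n=n))
    return correlation_coefficient(m1, m2)


def likely_same_source(sig1, sig2, threshold=0.8):
    """Compare the similarity measure against a threshold (0.8 is an arbitrary assumption)."""
    coefficient, _ = similarity_by_cross_correlation(sig1, sig2)
    return coefficient >= threshold


# Hypothetical characterization: 1.0 marks a block containing an event boundary.
base = np.zeros(40)
base[[3, 7, 12, 20, 21, 33]] = 1.0
delayed = np.concatenate([np.zeros(5), base])  # same events, five blocks later

coefficient, lag = similarity_by_cross_correlation(delayed, base)
print(lag, round(coefficient, 3))
print(likely_same_source(delayed, base))
```

In this sketch the cross-correlation peak gives the offset between the two characterizations, so the correlation coefficient is computed only over the aligned overlap. The magnitude-spectrum variant skips the alignment step, at the cost of discarding phase information that could otherwise help distinguish unrelated signals.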
MXPA03010749A 2001-05-25 2002-02-22 Comparing audio using characterizations based on auditory events MXPA03010749A (es)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US29382501P 2001-05-25 2001-05-25
US4564402A 2002-01-11 2002-01-11
US35149802P 2002-01-23 2002-01-23
PCT/US2002/004317 WO2002084645A2 (en) 2001-04-13 2002-02-12 High quality time-scaling and pitch-scaling of audio signals
PCT/US2002/005329 WO2002097790A1 (en) 2001-05-25 2002-02-22 Comparing audio using characterizations based on auditory events

Publications (1)

Publication Number Publication Date
MXPA03010749A (es) 2004-07-01

Family

ID=27485955

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
MXPA03010749A MXPA03010749A (es) 2001-05-25 2002-02-22 Comparing audio using characterizations based on auditory events

Country Status (7)

Country Link
EP (3) EP1393298B1 (es)
JP (1) JP4272050B2 (es)
CN (1) CN1524258B (es)
AU (3) AU2002240461B2 (es)
CA (3) CA2447911C (es)
MX (1) MXPA03010749A (es)
WO (2) WO2002097790A1 (es)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7711123B2 (en) 2001-04-13 2010-05-04 Dolby Laboratories Licensing Corporation Segmenting audio signals into auditory events
US7283954B2 (en) 2001-04-13 2007-10-16 Dolby Laboratories Licensing Corporation Comparing audio using characterizations based on auditory events
US7610205B2 (en) 2002-02-12 2009-10-27 Dolby Laboratories Licensing Corporation High quality time-scaling and pitch-scaling of audio signals
US7461002B2 (en) 2001-04-13 2008-12-02 Dolby Laboratories Licensing Corporation Method for time aligning audio signals using characterizations based on auditory events
WO2002093560A1 (en) 2001-05-10 2002-11-21 Dolby Laboratories Licensing Corporation Improving transient performance of low bit rate audio coding systems by reducing pre-noise
US6934677B2 (en) 2001-12-14 2005-08-23 Microsoft Corporation Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands
US7240001B2 (en) 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7502743B2 (en) * 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
ATE527654T1 (de) 2004-03-01 2011-10-15 Dolby Lab Licensing Corp Mehrkanal-audiodecodierung
US7508947B2 (en) 2004-08-03 2009-03-24 Dolby Laboratories Licensing Corporation Method for combining audio signals using auditory scene analysis
WO2006030754A1 (ja) * 2004-09-17 2006-03-23 Matsushita Electric Industrial Co., Ltd. オーディオ符号化装置、復号化装置、方法、及びプログラム
MX2007006164A (es) 2004-11-22 2007-09-19 Nielsen Media Res Inc Metodos y aparatos para identificación de fuentes de medios y mediciones de consumo de medios con desplazamiento de tiempo.
EP1729173A3 (en) * 2005-05-27 2007-01-03 Telegraf ApS System for generating synchronized add-on information
AU2006255662B2 (en) 2005-06-03 2012-08-23 Dolby Laboratories Licensing Corporation Apparatus and method for encoding audio signals with decoding instructions
DE102005045573B3 (de) * 2005-06-22 2006-11-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Ermitteln einer Stelle in einem Film
DE102005045627A1 (de) 2005-06-22 2007-01-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Durchführen einer Korrelation zwischen einem Testtonsignal, das mit variabler Geschwindigkeit abspielbar ist, und einem Referenztonsignal
US7948557B2 (en) 2005-06-22 2011-05-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating a control signal for a film event system
DE102005045628B3 (de) * 2005-06-22 2007-01-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Ermitteln einer Stelle in einem Film, der in einer zeitlichen Folge aufgebrachte Filminformationen aufweist
TWI396188B (zh) * 2005-08-02 2013-05-11 Dolby Lab Licensing Corp 依聆聽事件之函數控制空間音訊編碼參數的技術
CN1937032B (zh) * 2005-09-22 2011-06-15 财团法人工业技术研究院 切割语音数据序列的方法
US7831434B2 (en) 2006-01-20 2010-11-09 Microsoft Corporation Complex-transform channel coding with extended-band frequency coding
US8378964B2 (en) 2006-04-13 2013-02-19 Immersion Corporation System and method for automatically producing haptic events from a digital audio signal
US8000825B2 (en) 2006-04-13 2011-08-16 Immersion Corporation System and method for automatically producing haptic events from a digital audio file
US7979146B2 (en) * 2006-04-13 2011-07-12 Immersion Corporation System and method for automatically producing haptic events from a digital audio signal
ATE493794T1 (de) 2006-04-27 2011-01-15 Dolby Lab Licensing Corp Tonverstärkungsregelung mit erfassung von publikumsereignissen auf der basis von spezifischer lautstärke
US7885819B2 (en) 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
GB2457694B (en) 2008-02-21 2012-09-26 Snell Ltd Method of Deriving an Audio-Visual Signature
US8860883B2 (en) 2009-11-30 2014-10-14 Miranda Technologies Partnership Method and apparatus for providing signatures of audio/video signals and for making use thereof
GB2511655B (en) * 2009-11-30 2014-10-15 Miranda Technologies Inc Method and apparatus for providing signatures of audio/video signals and for making use thereof
KR101841313B1 (ko) * 2010-09-22 2018-03-22 톰슨 라이센싱 멀티미디어 흐름 처리 방법 및 대응하는 장치
WO2014151813A1 (en) 2013-03-15 2014-09-25 Dolby Laboratories Licensing Corporation Normalization of soundfield orientations based on auditory scene analysis
CN106792346A (zh) * 2016-11-14 2017-05-31 广东小天才科技有限公司 一种教学视频中的音频调整方法及装置
CN106653029A (zh) * 2016-12-02 2017-05-10 广东小天才科技有限公司 一种音频批量分割方法及装置
EP3646323B1 (en) 2017-06-27 2021-07-07 Dolby International AB Hybrid audio signal synchronization based on cross-correlation and attack analysis
CN107481739B (zh) * 2017-08-16 2021-04-02 成都品果科技有限公司 音频切割方法及装置
CN112927710B (zh) * 2021-01-21 2021-10-26 安徽南瑞继远电网技术有限公司 一种基于无监督方式的电力变压器工况噪声分离方法

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4624009A (en) * 1980-05-02 1986-11-18 Figgie International, Inc. Signal pattern encoder and classifier
US5040081A (en) * 1986-09-23 1991-08-13 Mccutchen David Audiovisual synchronization signal generator using audio signature comparison
US5055939A (en) * 1987-12-15 1991-10-08 Karamon John J Method system & apparatus for synchronizing an auxiliary sound source containing multiple language channels with motion picture film video tape or other picture source containing a sound track
WO1991019989A1 (en) * 1990-06-21 1991-12-26 Reynolds Software, Inc. Method and apparatus for wave analysis and event recognition
US5175769A (en) * 1991-07-23 1992-12-29 Rolm Systems Method for time-scale modification of signals
US6211919B1 (en) * 1997-03-28 2001-04-03 Tektronix, Inc. Transparent embedment of data in a video signal
US7457422B2 (en) * 2000-11-29 2008-11-25 Ford Global Technologies, Llc Method and implementation for detecting and characterizing audible transients in noise

Also Published As

Publication number Publication date
AU2002252143B2 (en) 2008-05-29
JP2004528599A (ja) 2004-09-16
WO2002097790A1 (en) 2002-12-05
EP1519363A1 (en) 2005-03-30
CN1524258B (zh) 2012-03-21
CA2448182A1 (en) 2002-12-05
EP1519363B1 (en) 2013-07-24
CA2448178A1 (en) 2002-12-05
CA2448182C (en) 2011-06-28
WO2002097790A8 (en) 2003-07-31
EP1393300A1 (en) 2004-03-03
CA2447911C (en) 2011-07-05
EP1393298B1 (en) 2010-06-09
EP1393298A1 (en) 2004-03-03
CA2448178C (en) 2011-05-10
WO2002097792A1 (en) 2002-12-05
AU2002242265B2 (en) 2007-05-17
AU2002240461B2 (en) 2007-05-17
CN1524258A (zh) 2004-08-25
EP1393300B1 (en) 2012-11-28
AU2002242265B8 (en) 2002-12-09
JP4272050B2 (ja) 2009-06-03
CA2447911A1 (en) 2002-12-05

Similar Documents

Publication Publication Date Title
MXPA03010749A (es) Comparing audio using characterizations based on auditory events
US7505823B1 (en) Acoustic communication system
US5867581A (en) Hearing aid
AU2003268002A8 (en) Frequency domain equalization of communication signals
WO2003001684A3 (en) A timing estimation method and apparatus for a location system
DE69617744D1 (de) System zur positionsbestimmung
EP1167993A3 (en) A method for measuring distance and position using spread spectrum signals, and an equipment using the method
WO2003023440A3 (en) System and method to estimate the location of a receiver in a multi-path environment
WO2006041735A3 (en) Reverberation removal
AU2003212592A1 (en) Coding of stereo signals
WO2001034264A1 (en) Acoustic location system
WO2001052188A3 (en) Method and apparatus for edge detection
WO2001073751A8 (en) Speech presence measurement detection techniques
MY124731A (en) Method and apparatus for estimating a frequency offset by combining pilot symbols and data symbols
WO2002103890A3 (en) Time alignment of signals
MXPA05003922A (es) Procedimiento para calcular un parametro de maximos o minimos locales de una funcion de correlacion derivada de una senal recibida.
WO2004102813A3 (en) Estimation of multipath channel with sub-chip resolution
WO2003030588A3 (de) Verfahren und vorrichtung zur auswahl eines klangalgorithmus
Christensen et al. Integrating pitch and localisation cues at a speech fragment level
KR101943535B1 (ko) 스위칭 목적들을 위한 디지털 스위칭 신호 시퀀스,상기 디지털 스위칭 신호 시퀀스를 디지털 오디오 정보 신호 내에 포함시키기 위한 장치,및 상기 스위칭 신호 시퀀스가 제공된 상기 정보 신호를 수신하기 위한 장치
NO20010313L (no) Estimering av kanalimpulsrespons ved bruk av mottatt signalvarians
EP0871299A3 (en) A synchronization circuit with correlated signal in the direct spread spectrum telecommunication system
EP0725492A3 (en) Perceptual stereo audio encoder
US5926553A (en) Method for measuring the conservation of stereophonic audio signals and method for identifying jointly coded stereophonic audio signals
ATE297338T1 (de) Verfahren zur erhöhung des störabstands bei zählpunkten eines achszählsystems

Legal Events

Date Code Title Description
FG Grant or registration