ATE386999T1 - Merkmal-basierte audio-inhaltsidentifikation - Google Patents

Merkmal-basierte audio-inhaltsidentifikation

Info

Publication number
ATE386999T1
ATE386999T1 AT02723802T AT02723802T ATE386999T1 AT E386999 T1 ATE386999 T1 AT E386999T1 AT 02723802 T AT02723802 T AT 02723802T AT 02723802 T AT02723802 T AT 02723802T AT E386999 T1 ATE386999 T1 AT E386999T1
Authority
AT
Austria
Prior art keywords
feature
audio content
content identification
based audio
semitone
Prior art date
Application number
AT02723802T
Other languages
English (en)
Inventor
Michael Pitman
Blake Fitch
Steven Abrams
Robert Germain
Original Assignee
Ibm
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ibm filed Critical Ibm
Application granted granted Critical
Publication of ATE386999T1 publication Critical patent/ATE386999T1/de

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63Querying
    • G06F16/632Query formulation
    • G06F16/634Query by example, e.g. query by humming
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/56Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H60/58Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Stereophonic System (AREA)
  • Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
AT02723802T 2002-04-05 2002-04-05 Merkmal-basierte audio-inhaltsidentifikation ATE386999T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2002/011091 WO2003088534A1 (en) 2002-04-05 2002-04-05 Feature-based audio content identification

Publications (1)

Publication Number Publication Date
ATE386999T1 true ATE386999T1 (de) 2008-03-15

Family

ID=29247966

Family Applications (1)

Application Number Title Priority Date Filing Date
AT02723802T ATE386999T1 (de) 2002-04-05 2002-04-05 Merkmal-basierte audio-inhaltsidentifikation

Country Status (8)

Country Link
EP (1) EP1497935B1 (de)
JP (1) JP4267463B2 (de)
KR (1) KR100754294B1 (de)
CN (1) CN100545834C (de)
AT (1) ATE386999T1 (de)
AU (1) AU2002254568A1 (de)
DE (1) DE60225190T2 (de)
WO (1) WO2003088534A1 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7711123B2 (en) * 2001-04-13 2010-05-04 Dolby Laboratories Licensing Corporation Segmenting audio signals into auditory events
US7598447B2 (en) * 2004-10-29 2009-10-06 Zenph Studios, Inc. Methods, systems and computer program products for detecting musical notes in an audio signal
JP2007171772A (ja) * 2005-12-26 2007-07-05 Clarion Co Ltd 音楽情報処理装置、音楽情報処理方法および制御プログラム
US7539616B2 (en) * 2006-02-20 2009-05-26 Microsoft Corporation Speaker authentication using adapted background models
JP2009192725A (ja) * 2008-02-13 2009-08-27 Sanyo Electric Co Ltd 楽曲記録装置
CN104252480B (zh) * 2013-06-27 2018-09-07 深圳市腾讯计算机系统有限公司 一种音频信息检索的方法和装置
CN104900238B (zh) * 2015-05-14 2018-08-21 电子科技大学 一种基于感知滤波的音频实时比对方法
CN104900239B (zh) * 2015-05-14 2018-08-21 电子科技大学 一种基于沃尔什-哈达码变换的音频实时比对方法
CN105653596A (zh) * 2015-12-22 2016-06-08 惠州Tcl移动通信有限公司 一种基于音频对比的特定功能快速启动方法及其装置
CN105976828A (zh) * 2016-04-19 2016-09-28 乐视控股(北京)有限公司 一种声音区分方法和终端
US11294954B2 (en) * 2018-01-04 2022-04-05 Audible Magic Corporation Music cover identification for search, compliance, and licensing
KR102097534B1 (ko) * 2018-07-25 2020-04-06 주식회사 키네틱랩 사용자의 모션 인식 기반 댄스 게임을 제공하는 방법 및 장치
CN113112993B (zh) * 2020-01-10 2024-04-02 阿里巴巴集团控股有限公司 一种音频信息处理方法、装置、电子设备以及存储介质
US11816151B2 (en) 2020-05-15 2023-11-14 Audible Magic Corporation Music cover identification with lyrics for search, compliance, and licensing
CN111724824B (zh) * 2020-06-11 2021-12-03 北京凯视达信息技术有限公司 一种音频的储存和检索方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4450531A (en) * 1982-09-10 1984-05-22 Ensco, Inc. Broadcast signal recognition system and method
US4843562A (en) * 1987-06-24 1989-06-27 Broadcast Data Systems Limited Partnership Broadcast information classification system and method
DE3720882A1 (de) * 1987-06-24 1989-01-05 Media Control Musik Medien Verfahren und schaltungsanordnung zum automatischen wiedererkennen von signalfolgen
US5437050A (en) * 1992-11-09 1995-07-25 Lamb; Robert G. Method and apparatus for recognizing broadcast information using multi-frequency magnitude detection

Also Published As

Publication number Publication date
KR100754294B1 (ko) 2007-09-03
JP2005522744A (ja) 2005-07-28
DE60225190T2 (de) 2009-09-10
CN100545834C (zh) 2009-09-30
EP1497935B1 (de) 2008-02-20
EP1497935A4 (de) 2006-12-06
CN1623289A (zh) 2005-06-01
WO2003088534A1 (en) 2003-10-23
DE60225190D1 (de) 2008-04-03
EP1497935A1 (de) 2005-01-19
JP4267463B2 (ja) 2009-05-27
AU2002254568A1 (en) 2003-10-27
KR20040101299A (ko) 2004-12-02

Similar Documents

Publication Publication Date Title
ATE386999T1 (de) Merkmal-basierte audio-inhaltsidentifikation
DE60225348D1 (de) Auswahl eines Musikstücks anhand von Metadaten und einer externen Tempo-Eingabe
CA2538568A1 (en) Data profiling
WO2002097658A3 (en) Multidimensional data entry in a spread sheet
AU2003264774A8 (en) Improved audio data fingerprint searching
WO2001084374A3 (en) Information access method
DE60332770D1 (de) Geldautomat, der Banknoten und andere Finanzinstrumentblätter ausgibt, annimmt und speichert
WO2004081750A3 (en) Verified personal information database
WO2003104928A3 (en) METHOD AND SYSTEM FOR DYNAMICALLY MODIFYING ADVERTISEMENTS
AU2002343175A1 (en) Method and device for determining and outputting the similarity between two data strings
WO2005045725A3 (en) Determining a location for placing data in a spreadsheet based on a location of the data source
FR2697661B1 (fr) Pupitre informatique interactif, notamment pupitre musical.
WO2002095611A3 (en) Selection of an item of music based on access statistics
DE60104658D1 (de) Datenwiederherstellung in einem verteilten system
ATE402544T1 (de) Vorwählen der daten-pakete
WO2005010861A3 (en) Relative chord keyboard instructional method
ATE223100T1 (de) Spielhilfe zum greifen von akkorden
GB9919922D0 (en) Acoustic device
Iliopoulos et al. String Matching with Gaps for Musical Melodic Recognition.
WO2002078340A3 (en) Monitoring apparatus, computer program and network for secure data storage
WO2003094146A8 (en) String instrument with sound enhancing channel extending in the neck
KR980000939U (ko) 기타줄을 튕기면 수록되어 있는 멜로디가 발음토록 되는 기타완구
ITBZ950035A0 (it) Palco a dimensioni e configurazione variabili,in particolare per spettacoli musicali.
WO2004021022A3 (en) Integrated circuit with embedded identification code
Girosi et al. YourCast

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties