ATE386999T1 - Feature-based audio content identification - Google Patents
Feature-based audio content identification
- Publication number
- ATE386999T1
- Authority
- AT
- Austria
- Prior art keywords
- feature
- audio content
- content identification
- based audio
- semitone
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/63—Querying
- G06F16/632—Query formulation
- G06F16/634—Query by example, e.g. query by humming
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/56—Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
- H04H60/58—Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Library & Information Science (AREA)
- Mathematical Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Stereophonic System (AREA)
- Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2002/011091 WO2003088534A1 (en) | 2002-04-05 | 2002-04-05 | Feature-based audio content identification |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE386999T1 (de) | 2008-03-15 |
Family
ID=29247966
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT02723802T ATE386999T1 (de) | 2002-04-05 | 2002-04-05 | Feature-based audio content identification |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP1497935B1 (de) |
JP (1) | JP4267463B2 (de) |
KR (1) | KR100754294B1 (de) |
CN (1) | CN100545834C (de) |
AT (1) | ATE386999T1 (de) |
AU (1) | AU2002254568A1 (de) |
DE (1) | DE60225190T2 (de) |
WO (1) | WO2003088534A1 (de) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7711123B2 (en) * | 2001-04-13 | 2010-05-04 | Dolby Laboratories Licensing Corporation | Segmenting audio signals into auditory events |
US7598447B2 (en) * | 2004-10-29 | 2009-10-06 | Zenph Studios, Inc. | Methods, systems and computer program products for detecting musical notes in an audio signal |
JP2007171772A (ja) * | 2005-12-26 | 2007-07-05 | Clarion Co Ltd | Music information processing device, music information processing method, and control program |
US7539616B2 (en) * | 2006-02-20 | 2009-05-26 | Microsoft Corporation | Speaker authentication using adapted background models |
JP2009192725A (ja) * | 2008-02-13 | 2009-08-27 | Sanyo Electric Co Ltd | Music recording device |
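CN104252480B (zh) * | 2013-06-27 | 2018-09-07 | 深圳市腾讯计算机系统有限公司 | Method and device for audio information retrieval |
CN104900238B (zh) * | 2015-05-14 | 2018-08-21 | 电子科技大学 | Real-time audio comparison method based on perceptual filtering |
CN104900239B (zh) * | 2015-05-14 | 2018-08-21 | 电子科技大学 | Real-time audio comparison method based on the Walsh-Hadamard transform |
CN105653596A (zh) * | 2015-12-22 | 2016-06-08 | 惠州Tcl移动通信有限公司 | Method and device for quickly launching a specific function based on audio comparison |
CN105976828A (zh) * | 2016-04-19 | 2016-09-28 | 乐视控股(北京)有限公司 | Sound distinguishing method and terminal |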
US11294954B2 (en) * | 2018-01-04 | 2022-04-05 | Audible Magic Corporation | Music cover identification for search, compliance, and licensing |
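KR102097534B1 (ko) * | 2018-07-25 | 2020-04-06 | 주식회사 키네틱랩 | Method and apparatus for providing a dance game based on recognition of a user's motion |
CN113112993B (zh) * | 2020-01-10 | 2024-04-02 | 阿里巴巴集团控股有限公司 | Audio information processing method and apparatus, electronic device, and storage medium |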
US11816151B2 (en) | 2020-05-15 | 2023-11-14 | Audible Magic Corporation | Music cover identification with lyrics for search, compliance, and licensing |
CN111724824B (zh) * | 2020-06-11 | 2021-12-03 | 北京凯视达信息技术有限公司 | Audio storage and retrieval method |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4450531A (en) * | 1982-09-10 | 1984-05-22 | Ensco, Inc. | Broadcast signal recognition system and method |
US4843562A (en) * | 1987-06-24 | 1989-06-27 | Broadcast Data Systems Limited Partnership | Broadcast information classification system and method |
DE3720882A1 (de) * | 1987-06-24 | 1989-01-05 | Media Control Musik Medien | Method and circuit arrangement for the automatic recognition of signal sequences |
US5437050A (en) * | 1992-11-09 | 1995-07-25 | Lamb; Robert G. | Method and apparatus for recognizing broadcast information using multi-frequency magnitude detection |
2002
- 2002-04-05 AT AT02723802T patent ATE386999T1 (de), not active: IP Right Cessation
- 2002-04-05 WO PCT/US2002/011091 patent WO2003088534A1 (en), active: IP Right Grant
- 2002-04-05 JP JP2003585328A patent JP4267463B2 (ja), not active: Expired - Fee Related
- 2002-04-05 KR KR1020047014248A patent KR100754294B1 (ko), not active: IP Right Cessation
- 2002-04-05 CN CNB028286847A patent CN100545834C (zh), not active: Expired - Lifetime
- 2002-04-05 DE DE60225190T patent DE60225190T2 (de), not active: Expired - Lifetime
- 2002-04-05 AU AU2002254568A patent AU2002254568A1 (en), not active: Abandoned
- 2002-04-05 EP EP02723802A patent EP1497935B1 (de), not active: Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
KR100754294B1 (ko) | 2007-09-03 |
JP2005522744A (ja) | 2005-07-28 |
DE60225190T2 (de) | 2009-09-10 |
CN100545834C (zh) | 2009-09-30 |
EP1497935B1 (de) | 2008-02-20 |
EP1497935A4 (de) | 2006-12-06 |
CN1623289A (zh) | 2005-06-01 |
WO2003088534A1 (en) | 2003-10-23 |
DE60225190D1 (de) | 2008-04-03 |
EP1497935A1 (de) | 2005-01-19 |
JP4267463B2 (ja) | 2009-05-27 |
AU2002254568A1 (en) | 2003-10-27 |
KR20040101299A (ko) | 2004-12-02 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
ATE386999T1 (de) | Feature-based audio content identification | |
DE60225348D1 (de) | Selection of a piece of music based on metadata and an external tempo input | |
CA2538568A1 (en) | Data profiling | |
WO2002097658A3 (en) | Multidimensional data entry in a spread sheet | |
AU2003264774A8 (en) | Improved audio data fingerprint searching | |
WO2001084374A3 (en) | Information access method | |
DE60332770D1 (de) | Automated teller machine that dispenses, accepts, and stores banknotes and other financial instrument sheets | |
WO2004081750A3 (en) | Verified personal information database | |
WO2003104928A3 (en) | Method and system for dynamically modifying advertisements | |
AU2002343175A1 (en) | Method and device for determining and outputting the similarity between two data strings | |
WO2005045725A3 (en) | Determining a location for placing data in a spreadsheet based on a location of the data source | |
FR2697661B1 (fr) | Interactive computerized stand, in particular a music stand | |
WO2002095611A3 (en) | Selection of an item of music based on access statistics | |
DE60104658D1 (de) | Data recovery in a distributed system | |
ATE402544T1 (de) | Preselection of data packets | |
WO2005010861A3 (en) | Relative chord keyboard instructional method | |
ATE223100T1 (de) | Playing aid for fingering chords | |
GB9919922D0 (en) | Acoustic device | |
Iliopoulos et al. | String Matching with Gaps for Musical Melodic Recognition | |
WO2002078340A3 (en) | Monitoring apparatus, computer program and network for secure data storage | |
WO2003094146A8 (en) | String instrument with sound enhancing channel extending in the neck | |
KR980000939U (ko) | Toy guitar that plays a stored melody when a string is plucked | |
ITBZ950035A0 (it) | Stage of variable dimensions and configuration, in particular for musical performances | |
WO2004021022A3 (en) | Integrated circuit with embedded identification code | |
Girosi et al. | YourCast | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |