JP4425126B2 - Robust and invariant audio pattern matching - Google Patents
- Publication number
- JP4425126B2 (application JP2004500283A)
- Authority
- JP
- Japan
- Prior art keywords
- fingerprint
- value
- relative
- determined
- peak
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/12—Classification; Matching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/121—Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
- G10H2240/131—Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
- G10H2240/135—Library retrieval index, i.e. using an indexing scheme to efficiently retrieve a music piece
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/121—Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
- G10H2240/131—Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
- G10H2240/141—Library retrieval matching, i.e. any of the steps of matching an inputted segment or phrase with musical database contents, e.g. query by humming, singing or playing; the steps may include, e.g. musical analysis of the input, musical feature extraction, query formulation, or details of the retrieval process
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/131—Mathematical functions for musical analysis, processing, synthesis or composition
- G10H2250/215—Transforms, i.e. mathematical transforms into domains appropriate for musical signal processing, coding or compression
- G10H2250/235—Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Signal Processing (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Stereo-Broadcasting Methods (AREA)
- Collating Specific Patterns (AREA)
- Auxiliary Devices For Music (AREA)
Description
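After a spectrogram of each audio sample has been generated, it is scanned for local features, for example local energy peaks as shown in FIG. 2. The matching process starts by extracting a set of fingerprint objects from the corresponding local features for each audio sample. In an exemplary embodiment, one audio sample is an unknown sound sample to be recognized and the other audio sample is a known recording stored in a database. Each fingerprint object occurs at a particular location within the respective audio sample. In some embodiments, each fingerprint object is located at a time offset within an audio file and contains descriptive information about the audio file near its respective time coordinate. That is, the descriptive information contained in each fingerprint object is computed in dependence on the audio sample near the respective time offset. This information is encoded into a small data structure. Preferably, the location and descriptive information are determined in a way that is generally reproducible even in the presence of noise, distortion, and other transformations such as varying playback speed. In this case, each location is determined in dependence on the content of the respective audio sample, and each fingerprint object characterizes one or more local features of the respective audio sample at or near the respective particular location, e.g., location (t1,f1) or (t2,f2) as shown in FIG. 1.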
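As an illustration of such a small data structure, the following Python sketch builds a fingerprint object from two spectrogram peaks. The class name `Fingerprint`, the helper `make_fingerprint`, the field names, and the log-ratio quantization are all assumptions made for this example and are not taken from the specification; the split into an invariant and a variant component follows the terminology of the claims.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Fingerprint:
    t: float        # location: time offset of the anchor peak, in seconds
    invariant: int  # quantized value unchanged by stretching; used for matching
    variant: float  # value that scales with stretching (here: anchor frequency)


def make_fingerprint(p1, p2, bins=64):
    """Build a fingerprint from two spectrogram peaks (t, f); p1 is the anchor."""
    (t1, f1), (_t2, f2) = p1, p2
    ratio = f2 / f1  # unchanged when both frequencies are scaled together
    # Hypothetical quantization: map ratios in [0.5, 2.0) onto log-spaced integers.
    q = int(math.log2(ratio * 2.0) / 2.0 * bins)
    return Fingerprint(t=t1, invariant=max(0, min(bins - 1, q)), variant=f1)
```

Two peaks at 440 Hz and 660 Hz, and the same two peaks played back twice as fast (880 Hz and 1320 Hz), yield the same `invariant` in this sketch, while the `variant` values differ by the factor of two.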
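In the matching operation, the two audio samples are compared by their respective fingerprint objects. As described above with reference to FIG. 3, pairs of matched fingerprint objects are generated, each pair containing substantially matching components. One way of preparing the data to allow fast searching is to encode the fingerprint objects into numeric tokens, such as 32-bit unsigned integers, and to use the numeric tokens as keys for sorting and searching. Techniques for efficient data manipulation are well known in the art; see, for example, Donald Ervin Knuth, "Art of Computer Programming, Volume 3: Sorting and Searching (2nd Edition)" (April 1998), Addison-Wesley, which is incorporated herein by reference.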
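The token-based lookup can be sketched as follows. This is a minimal illustration, assuming that each fingerprint has already been reduced to two 16-bit quantized values; the function names `to_token` and `match_pairs` and the 16/16 bit layout are hypothetical. Sorting one list by token and binary-searching it with the standard `bisect` module stands in for whichever sorting and searching scheme an implementation actually uses.

```python
import bisect


def to_token(a: int, b: int) -> int:
    """Pack two 16-bit quantized values into one 32-bit unsigned integer."""
    return ((a & 0xFFFF) << 16) | (b & 0xFFFF)


def match_pairs(fp1, fp2):
    """fp1, fp2: lists of (token, payload). Returns payload pairs with equal tokens."""
    fp2_sorted = sorted(fp2, key=lambda item: item[0])  # sort by token only
    keys = [token for token, _ in fp2_sorted]
    pairs = []
    for token, payload1 in fp1:
        lo = bisect.bisect_left(keys, token)   # first entry with this token
        hi = bisect.bisect_right(keys, token)  # one past the last such entry
        for _, payload2 in fp2_sorted[lo:hi]:
            pairs.append((payload1, payload2))
    return pairs
```

The payload would typically carry the location and the variant component of each fingerprint object, so that relative values can be computed for every matched pair.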
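As described herein, a histogram is generated for the set of relative values calculated from the list of matched pairs of fingerprint objects. The histogram is then searched for a peak. The presence of a statistically significant peak in the histogram indicates that a possible match has occurred. In particular, this method searches for a cluster in the histogram of relative values rather than in differences of time offsets such as (t1'−t1). According to a principle of the present invention, a histogram serves to form bins of count values, each bin corresponding to a particular value along the independent axis of the histogram. For the purposes of this invention, generating a histogram may be accomplished by simply sorting the list of relative values. Therefore, a fast and efficient way of detecting the peak of a histogram of a list of values is to sort the list in ascending order and then scan for the largest clump of items having the same or similar values.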
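The sort-and-scan peak search described above can be sketched with a sliding window over the sorted list. The function name `densest_cluster` and the `width` parameter (how close two values must be to count as "similar") are assumptions for this example.

```python
def densest_cluster(values, width):
    """Return (count, start) for the window [start, start + width] holding most values."""
    xs = sorted(values)  # ascending order, as in the text
    best_count, best_start = 0, None
    j = 0  # index one past the last value inside the current window
    for i, lo in enumerate(xs):
        while j < len(xs) and xs[j] <= lo + width:
            j += 1
        if j - i > best_count:
            best_count, best_start = j - i, lo
    return best_count, best_start
```

Because both indices only move forward, the scan after sorting is linear in the number of values, so the sort dominates the cost.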
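As described herein, with the present invention two audio samples can be correctly matched even if as few as 2% of the fingerprint objects survive all the distortions and are correctly matched. This is possible by scoring the comparison between the two audio samples. Specifically, a neighborhood is chosen around the peak of the histogram and all the matching pairs falling into the neighborhood are counted, giving the score. Additionally, a weighted score may be computed that discounts the contribution of pairs that lie farther from the center of the peak.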
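Both scores can be sketched in a few lines. The specification does not state a particular weighting function; the triangular weight below, along with the names `match_score`, `weighted_score`, and `radius`, is an assumption chosen only for illustration.

```python
def match_score(values, peak, radius):
    """Count the relative values that fall within `radius` of the histogram peak."""
    return sum(1 for v in values if abs(v - peak) <= radius)


def weighted_score(values, peak, radius):
    """Triangular weighting (an assumption): weight 1 at the peak, 0 at `radius`."""
    return sum(1.0 - abs(v - peak) / radius
               for v in values if abs(v - peak) < radius)
```

A pair exactly at the peak contributes 1 to either score, while a pair halfway to the edge of the neighborhood contributes 1 to the plain count but only 0.5 to the weighted score.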
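Once a statistically significant histogram peak is found, a high-resolution "hyperfine" estimate of the global relative value (such as relative playback speed) may be computed. This is accomplished by choosing a neighborhood around the peak, e.g., an interval about 3 or 5 bins wide centered on the peak histogram bin, and calculating the average of the relative values in the neighborhood. Using this technique, relative playback speed can be found to within 0.05% accuracy. With the offset (relative position) derivation disclosed herein, the global time offset may be estimated with better than 1 millisecond accuracy, which is finer than the time resolution of the spectrogram frames discussed above.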
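The hyperfine estimate can be sketched as follows: bin the relative values, locate the fullest bin, then average the raw (unbinned) values in a neighborhood centered on it. The function name `hyperfine_estimate`, the fixed-width binning, and the `half_span` parameter are assumptions for this example; `half_span=1` gives a 3-bin neighborhood and `half_span=2` a 5-bin one.

```python
import math
from collections import Counter


def hyperfine_estimate(values, bin_width, half_span=1):
    """Average the raw values within `half_span` bins of the peak histogram bin."""
    bins = Counter(math.floor(v / bin_width) for v in values)
    peak_bin, _count = bins.most_common(1)[0]
    lo = (peak_bin - half_span) * bin_width
    hi = (peak_bin + half_span + 1) * bin_width
    near = [v for v in values if lo <= v < hi]
    return sum(near) / len(near)
```

Averaging the raw values rather than reporting the bin center is what allows the estimate to be finer than the bin width.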
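As discussed in the above-referenced U.S. patent application, in the case that the samples actually match, a diagonal line appears in a scatterplot in which the corresponding time coordinates (t',t) of matching fingerprint objects are plotted against each other, as shown in FIG. 6A. The challenge is to find the equation of the regressor, which is determined by the slope and offset of the line, in the presence of a large amount of noise. The slope indicates the relative playback speed, and the offset is the relative offset from the beginning of one audio sample to the beginning of the second. Conventional regression techniques, such as least-squares fitting, are available; see, for example, William H. Press, Brian P. Flannery, Saul A. Teukolsky, and William T. Vetterling, "Numerical Recipes in C: The Art of Scientific Computing (2nd Edition)" (January 1993), Cambridge University Press, which is incorporated herein by reference. Unfortunately, these conventional techniques suffer from disproportionate sensitivity, in that a single far outlier can drastically skew the estimated regression parameters. In practice, the points are often dominated by outliers, making it very difficult to detect the correct diagonal line. Other techniques for robust regression can be used to overcome the outlier problem and find a linear relationship among points in the presence of noise, but these tend to be slow and iterative and may get stuck in a local optimum. A wide variety of techniques exist in the literature for finding an unknown linear regressor. The MATLAB toolkit, available from The Mathworks and incorporated herein by reference, contains a variety of software routines for regression analysis.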
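The outlier sensitivity of a least-squares fit is easy to reproduce. The sketch below uses made-up data: ten points lying exactly on a line of slope 1 and offset 5, plus one spurious match. The helper `least_squares` is an ordinary closed-form fit written for this example, not a routine from either cited reference.

```python
def least_squares(points):
    """Ordinary least-squares line fit; returns (slope, intercept)."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    slope = sxy / sxx
    return slope, my - slope * mx


# Ten clean matches on the line t' = 1.0 * t + 5.0 (illustrative data only).
clean = [(float(t), 1.0 * t + 5.0) for t in range(10)]
slope_clean, offset_clean = least_squares(clean)

# A single spurious pair is enough to pull the fitted slope far from 1.0.
slope_dirty, offset_dirty = least_squares(clean + [(9.0, 100.0)])
```

With the clean points alone the fitted slope is 1.0; adding the single outlier raises it to roughly 4.5, which illustrates why a histogram-based estimate is attractive when most candidate pairs are spurious.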
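offset = t1' − Rt*t1
is assumed to hold, where Rt is obtained as described above. This is the compensated time offset, which serves to normalize the time coordinate systems between the two audio samples. It can also be seen as a shear transformation on the time-time scatterplot that makes the diagonal line of unknown slope in FIG. 7A vertical in FIG. 7C. Histogram 720 of FIG. 7B shows a peak of accumulated relative playback speed ratios, indicating the global relative playback speed ratio R. New relative values are then given by the offset formula, and a new histogram 740 is generated as shown in FIG. 7D. The peak of the new histogram 740 gives an estimate of the global offset, which can be sharpened by averaging the values in the neighborhood of the peak, as described above.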
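The second histogram stage can be sketched as follows, assuming the global ratio R has already been estimated from the first histogram. The function name `global_offset`, the bin width, and the 3-bin averaging neighborhood are assumptions for this example; each pair holds the time coordinates (t1, t1') of one matched pair of fingerprint objects.

```python
from collections import Counter


def global_offset(pairs, R, bin_width=0.1):
    """Estimate the global offset from compensated offsets t1' - R * t1.

    Returns (offset_estimate, count_in_peak_bin).
    """
    offsets = [t_prime - R * t for t, t_prime in pairs]
    bins = Counter(round(o / bin_width) for o in offsets)
    peak_bin, count = bins.most_common(1)[0]
    # Sharpen the estimate by averaging raw offsets in a 3-bin neighborhood.
    near = [o for o in offsets if abs(o / bin_width - peak_bin) <= 1.5]
    return sum(near) / len(near), count
```

After compensation, true matches share nearly the same offset regardless of where in the recording they occur, so they pile up in one bin while spurious pairs scatter.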
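The present invention may be implemented for cueing and time alignment of unsynchronized audio recordings. For example, suppose a DAT recorder and a cassette recorder were operated independently using different microphones at slightly different locations or in slightly different environments. If it is later desired to combine the two recordings from the respective recorders into one mix, the two tracks may be synchronized using the robust regression technique described herein to obtain the time offset. As such, even if the unsynchronized recorders operate at slightly different speeds, the relative speed can be determined with a high degree of accuracy, allowing one recording to be compensated with reference to the other. This is especially useful if it is found that one of the recordings has become corrupted and needs to be supplemented from another source. The time alignment and synchronization described herein thus allow for transparent mixing.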
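Since the comparison method is extremely fast, it is possible to pre-process a large database of audio samples into respective lists of fingerprint objects. As one skilled in the art will appreciate, an unknown audio sample may likewise be pre-processed into its own list of fingerprint objects using currently available data processing techniques. The above-described matching, histogramming, and peak detection techniques can then be carried out using the pre-processed fingerprint objects in the database to find a match.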
Claims (16)
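- A method of characterizing a relationship between a first and a second audio sample, comprising:
generating a first set of fingerprint objects for the first audio sample, each fingerprint object occurring at a respective location within the first audio sample, the respective location being determined in dependence upon the content of the first audio sample, and each fingerprint object characterizing one or more features of the first audio sample at or near each respective location;
generating a second set of fingerprint objects for the second audio sample, each fingerprint object occurring at a respective location within the second audio sample, the respective location being determined in dependence upon the content of the second audio sample, and each fingerprint object characterizing one or more features of the second audio sample at or near each respective location;
pairing fingerprint objects by matching a first fingerprint object from the first audio sample with a second fingerprint object from the second audio sample that is substantially similar to the first fingerprint object, wherein each fingerprint object has a location, an invariant component, and a variant component, and the first and second fingerprint objects in each matched pair of fingerprint objects have matching invariant components;
generating, based on the pairing, a list of matched pairs of fingerprint objects;
determining a relative value for each matched pair of fingerprint objects using the invariant component;
generating a histogram of the relative values; and
searching the histogram for a statistically significant peak that characterizes the relationship between the first and second audio samples, the relationship between the first and second audio samples including a time stretch ratio.
- The method of claim 1, wherein the relationship between the first and second audio samples is characterized as substantially matching if a statistically significant peak is found.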
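- The method of claim 1 or 2, further comprising estimating a global relative value, which further characterizes the relationship between the first and second audio samples, using the location of the peak on the axis of the histogram.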
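- The method of claim 3, further comprising determining a hyperfine estimate of the global relative value, the determining comprising:
selecting a neighborhood around the peak; and
calculating an average of the relative values in the neighborhood.
- The method of claim 1, wherein the invariant component is generated using at least one of:
(i) a ratio between a first and a second frequency value, each frequency value being determined respectively from a first and a second local feature near the respective location of each fingerprint object;
(ii) a product of a frequency value and a delta time value, the frequency value being determined from a first local feature and the delta time value being determined between the first local feature and a second local feature near the respective location of each fingerprint object; and
(iii) a ratio between a first and a second delta time value, the first delta time value being determined from a first and a second local feature, the second delta time value being determined from the first and a third local feature, each local feature being near the respective location of each fingerprint object.
- The method of claim 5, wherein each local feature is a spectrogram peak and each frequency value is determined from a frequency coordinate of the corresponding spectrogram peak.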
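- The method of claim 1, wherein the variant component is a frequency value determined from a local feature near the respective location of each fingerprint object, such that the relative value of a matched pair of fingerprint objects is characterized as a ratio of the respective frequency values of the first and second fingerprint objects, and the peak in the histogram characterizing the relationship between the first and second audio samples is characterized as a relative pitch or, in the case of linear stretch, as a relative playback speed.
- The method of claim 7, wherein the ratio of the respective frequency values is characterized as either a division or a difference of logarithms.
- The method of claim 7, wherein each local feature is a spectrogram peak and each frequency value is determined from a frequency coordinate of the corresponding spectrogram peak.
- The method of claim 1, wherein the variant component is a delta time value determined from a first and a second local feature near the respective location of each fingerprint object, such that the relative value of a matched pair of fingerprint objects is characterized as a ratio of the respective variant delta time values, and the peak in the histogram characterizing the relationship between the first and second audio samples is characterized as a relative playback speed or, in the case of linear stretch, as a relative pitch.
- The method of claim 10, wherein the ratio of the respective variant delta time values is characterized as either a division or a difference of logarithms.
- The method of claim 10, wherein each local feature is a spectrogram peak and each frequency value is determined from a frequency coordinate of the corresponding spectrogram peak.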
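- The method of claim 1, further comprising:
determining a relative pitch for the first and second audio samples using the respective variant components, wherein each variant component is a frequency value determined from a local feature near the respective location of each fingerprint object;
determining a relative playback speed for the first and second audio samples using the respective variant components, wherein each variant component is a delta time value determined from a first and a second local feature near the respective location of each fingerprint object; and
detecting whether the relative pitch and the reciprocal of the relative playback speed are substantially different, in which case the relationship between the first and second audio samples is characterized as nonlinear.
- The method of claim 1, further comprising, where R is a relative playback speed value determined from the peak of the histogram of the relative values:
determining, for each matched pair of fingerprint objects in the list, a compensated relative time offset value t−R*t', where t and t' are locations in time associated with the first and second fingerprint objects;
generating a second histogram of the compensated relative time offset values; and
searching the second histogram of the compensated relative time offset values for a statistically significant peak that characterizes the relationship between the first and second audio samples.
- A computer program for causing a computer to execute the method of any one of claims 1 to 14.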
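- A computer system for performing the method of any one of claims 1 to 14, the computer system comprising:
a server that includes a computer program for causing a computer to execute each step of the method and that executes each step on the basis of that program; and
a client for sending, to the server, the information necessary for characterizing the relationship between the first and second audio samples.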
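The three invariant components recited in claim 5 can be checked numerically under a linear stretch. The sketch below is illustrative only: the function names `invariants` and `linear_stretch` and the peak coordinates are made up for the example, and the stretch model (times divided by R, frequencies multiplied by R) is the assumption of a simple playback-speed change.

```python
def invariants(p1, p2, p3):
    """Return the quantities (i), (ii), (iii) for three spectrogram peaks (t, f)."""
    (t1, f1), (t2, f2), (t3, _f3) = p1, p2, p3
    return (
        f2 / f1,                # (i) ratio of two frequency values
        f1 * (t2 - t1),         # (ii) frequency value times delta time value
        (t2 - t1) / (t3 - t1),  # (iii) ratio of two delta time values
    )


def linear_stretch(peak, R):
    """Playback R times faster: times shrink by R, frequencies grow by R."""
    t, f = peak
    return (t / R, f * R)
```

Applying `linear_stretch` with the same R to all three peaks leaves each of the three quantities unchanged, which is what lets them serve as matching keys when the playback speed is unknown.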
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US37605502P | 2002-04-25 | 2002-04-25 | |
PCT/US2003/012126 WO2003091990A1 (en) | 2002-04-25 | 2003-04-18 | Robust and invariant audio pattern matching |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2005524108A (ja) | 2005-08-11 |
JP4425126B2 (ja) | 2010-03-03 |
Family
ID=29270756
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004500283A Expired - Fee Related JP4425126B2 (ja) | 2002-04-25 | 2003-04-18 | Robust and invariant audio pattern matching |
Country Status (16)
Country | Link |
---|---|
US (1) | US7627477B2 (ja) |
EP (1) | EP1504445B1 (ja) |
JP (1) | JP4425126B2 (ja) |
KR (1) | KR100820385B1 (ja) |
CN (1) | CN1315110C (ja) |
AT (1) | ATE405924T1 (ja) |
AU (1) | AU2003230993A1 (ja) |
BR (1) | BR0309598A (ja) |
CA (1) | CA2483104C (ja) |
DE (1) | DE60323086D1 (ja) |
DK (1) | DK1504445T3 (ja) |
ES (1) | ES2312772T3 (ja) |
HK (1) | HK1073382A1 (ja) |
PT (1) | PT1504445E (ja) |
TW (1) | TWI269196B (ja) |
WO (1) | WO2003091990A1 (ja) |
Families Citing this family (288)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6834308B1 (en) | 2000-02-17 | 2004-12-21 | Audible Magic Corporation | Method and apparatus for identifying media content presented on a media playing device |
US7853664B1 (en) * | 2000-07-31 | 2010-12-14 | Landmark Digital Services Llc | Method and system for purchasing pre-recorded music |
US6990453B2 (en) | 2000-07-31 | 2006-01-24 | Landmark Digital Services Llc | System and methods for recognizing sound and music signals in high noise and distortion |
US7562012B1 (en) | 2000-11-03 | 2009-07-14 | Audible Magic Corporation | Method and apparatus for creating a unique audio signature |
US7363278B2 (en) | 2001-04-05 | 2008-04-22 | Audible Magic Corporation | Copyright detection and protection system and method |
US7529659B2 (en) | 2005-09-28 | 2009-05-05 | Audible Magic Corporation | Method and apparatus for identifying an unknown work |
US7877438B2 (en) | 2001-07-20 | 2011-01-25 | Audible Magic Corporation | Method and apparatus for identifying new media content |
US8972481B2 (en) | 2001-07-20 | 2015-03-03 | Audible Magic, Inc. | Playlist generation method and apparatus |
US7239981B2 (en) | 2002-07-26 | 2007-07-03 | Arbitron Inc. | Systems and methods for gathering audience measurement data |
US8959016B2 (en) | 2002-09-27 | 2015-02-17 | The Nielsen Company (Us), Llc | Activating functions in processing devices using start codes embedded in audio |
US9711153B2 (en) | 2002-09-27 | 2017-07-18 | The Nielsen Company (Us), Llc | Activating functions in processing devices using encoded audio and detecting audio signatures |
EP1586045A1 (en) | 2002-12-27 | 2005-10-19 | Nielsen Media Research, Inc. | Methods and apparatus for transcoding metadata |
US8332326B2 (en) | 2003-02-01 | 2012-12-11 | Audible Magic Corporation | Method and apparatus to identify a work received by a processing system |
EP1647144A1 (en) | 2003-07-11 | 2006-04-19 | Koninklijke Philips Electronics N.V. | Method and device for generating and detecting a fingerprint functioning as a trigger marker in a multimedia signal |
ATE373389T1 (de) * | 2003-07-25 | 2007-09-15 | Koninkl Philips Electronics Nv | Method and device for generating and detecting fingerprints for synchronizing audio and video |
US7884274B1 (en) | 2003-11-03 | 2011-02-08 | Wieder James W | Adaptive personalized music and entertainment |
US20150128039A1 (en) | 2003-11-03 | 2015-05-07 | James W. Wieder | Newness Control of a Personalized Music and/or Entertainment Sequence |
US11165999B1 (en) | 2003-11-03 | 2021-11-02 | Synergyze Technologies Llc | Identifying and providing compositions and digital-works |
US9098681B2 (en) | 2003-11-03 | 2015-08-04 | James W. Wieder | Adaptive personalized playback or presentation using cumulative time |
US8001612B1 (en) | 2003-11-03 | 2011-08-16 | Wieder James W | Distributing digital-works and usage-rights to user-devices |
US9053299B2 (en) | 2003-11-03 | 2015-06-09 | James W. Wieder | Adaptive personalized playback or presentation using rating |
US9053181B2 (en) | 2003-11-03 | 2015-06-09 | James W. Wieder | Adaptive personalized playback or presentation using count |
US8554681B1 (en) * | 2003-11-03 | 2013-10-08 | James W. Wieder | Providing “identified” compositions and digital-works |
US8396800B1 (en) | 2003-11-03 | 2013-03-12 | James W. Wieder | Adaptive personalized music and entertainment |
EP1704695B1 (fr) * | 2003-11-27 | 2008-02-27 | Advestigo | System for intercepting multimedia documents |
US7986913B2 (en) | 2004-02-19 | 2011-07-26 | Landmark Digital Services, Llc | Method and apparatus for identificaton of broadcast source |
CN101142591A (zh) | 2004-04-19 | 2008-03-12 | 兰德马克数字服务有限责任公司 | Content sampling and identification |
US20050267750A1 (en) | 2004-05-27 | 2005-12-01 | Anonymous Media, Llc | Media usage monitoring and measurement system and method |
US20150051967A1 (en) | 2004-05-27 | 2015-02-19 | Anonymous Media Research, Llc | Media usage monitoring and measurment system and method |
CN100485399C (zh) * | 2004-06-24 | 2009-05-06 | 兰德马克数字服务有限责任公司 | Method of characterizing the overlap of two media segments |
US8130746B2 (en) | 2004-07-28 | 2012-03-06 | Audible Magic Corporation | System for distributing decoy content in a peer to peer network |
US7623823B2 (en) | 2004-08-31 | 2009-11-24 | Integrated Media Measurement, Inc. | Detecting and measuring exposure to media content items |
DE102004046746B4 (de) * | 2004-09-27 | 2007-03-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method for synchronizing additional data and base data |
CA2595634C (en) | 2005-02-08 | 2014-12-30 | Landmark Digital Services Llc | Automatic identification of repeated material in audio signals |
DE102005014477A1 (de) | 2005-03-30 | 2006-10-12 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating a data stream and for generating a multi-channel representation |
US20070016918A1 (en) * | 2005-05-20 | 2007-01-18 | Alcorn Allan E | Detecting and tracking advertisements |
US20160321253A1 (en) | 2005-10-26 | 2016-11-03 | Cortica, Ltd. | System and method for providing recommendations based on user profiles |
US8312031B2 (en) * | 2005-10-26 | 2012-11-13 | Cortica Ltd. | System and method for generation of complex signatures for multimedia data content |
US9477658B2 (en) | 2005-10-26 | 2016-10-25 | Cortica, Ltd. | Systems and method for speech to speech translation using cores of a natural liquid architecture system |
US10380623B2 (en) | 2005-10-26 | 2019-08-13 | Cortica, Ltd. | System and method for generating an advertisement effectiveness performance score |
US9031999B2 (en) | 2005-10-26 | 2015-05-12 | Cortica, Ltd. | System and methods for generation of a concept based database |
US10191976B2 (en) | 2005-10-26 | 2019-01-29 | Cortica, Ltd. | System and method of detecting common patterns within unstructured data elements retrieved from big data sources |
US9646005B2 (en) | 2005-10-26 | 2017-05-09 | Cortica, Ltd. | System and method for creating a database of multimedia content elements assigned to users |
US10742340B2 (en) | 2005-10-26 | 2020-08-11 | Cortica Ltd. | System and method for identifying the context of multimedia content elements displayed in a web-page and providing contextual filters respective thereto |
US10535192B2 (en) | 2005-10-26 | 2020-01-14 | Cortica Ltd. | System and method for generating a customized augmented reality environment to a user |
US9087049B2 (en) | 2005-10-26 | 2015-07-21 | Cortica, Ltd. | System and method for context translation of natural language |
US11361014B2 (en) | 2005-10-26 | 2022-06-14 | Cortica Ltd. | System and method for completing a user profile |
US11003706B2 (en) | 2005-10-26 | 2021-05-11 | Cortica Ltd | System and methods for determining access permissions on personalized clusters of multimedia content elements |
US10380267B2 (en) | 2005-10-26 | 2019-08-13 | Cortica, Ltd. | System and method for tagging multimedia content elements |
US9953032B2 (en) | 2005-10-26 | 2018-04-24 | Cortica, Ltd. | System and method for characterization of multimedia content signals using cores of a natural liquid architecture system |
US11403336B2 (en) | 2005-10-26 | 2022-08-02 | Cortica Ltd. | System and method for removing contextually identical multimedia content elements |
US10372746B2 (en) | 2005-10-26 | 2019-08-06 | Cortica, Ltd. | System and method for searching applications using multimedia content elements |
US11604847B2 (en) | 2005-10-26 | 2023-03-14 | Cortica Ltd. | System and method for overlaying content on a multimedia content element based on user interest |
US9529984B2 (en) | 2005-10-26 | 2016-12-27 | Cortica, Ltd. | System and method for verification of user identification based on multimedia content elements |
US11216498B2 (en) | 2005-10-26 | 2022-01-04 | Cortica, Ltd. | System and method for generating signatures to three-dimensional multimedia data elements |
US8326775B2 (en) | 2005-10-26 | 2012-12-04 | Cortica Ltd. | Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof |
US11032017B2 (en) | 2005-10-26 | 2021-06-08 | Cortica, Ltd. | System and method for identifying the context of multimedia content elements |
US9396435B2 (en) | 2005-10-26 | 2016-07-19 | Cortica, Ltd. | System and method for identification of deviations from periodic behavior patterns in multimedia content |
US10698939B2 (en) | 2005-10-26 | 2020-06-30 | Cortica Ltd | System and method for customizing images |
US10180942B2 (en) | 2005-10-26 | 2019-01-15 | Cortica Ltd. | System and method for generation of concept structures based on sub-concepts |
US10635640B2 (en) | 2005-10-26 | 2020-04-28 | Cortica, Ltd. | System and method for enriching a concept database |
US9256668B2 (en) | 2005-10-26 | 2016-02-09 | Cortica, Ltd. | System and method of detecting common patterns within unstructured data elements retrieved from big data sources |
US9466068B2 (en) | 2005-10-26 | 2016-10-11 | Cortica, Ltd. | System and method for determining a pupillary response to a multimedia data element |
US9191626B2 (en) | 2005-10-26 | 2015-11-17 | Cortica, Ltd. | System and methods thereof for visual analysis of an image on a web-page and matching an advertisement thereto |
US9218606B2 (en) | 2005-10-26 | 2015-12-22 | Cortica, Ltd. | System and method for brand monitoring and trend analysis based on deep-content-classification |
US9330189B2 (en) | 2005-10-26 | 2016-05-03 | Cortica, Ltd. | System and method for capturing a multimedia content item by a mobile device and matching sequentially relevant content to the multimedia content item |
US8818916B2 (en) * | 2005-10-26 | 2014-08-26 | Cortica, Ltd. | System and method for linking multimedia data elements to web pages |
US10360253B2 (en) | 2005-10-26 | 2019-07-23 | Cortica, Ltd. | Systems and methods for generation of searchable structures respective of multimedia data content |
US10848590B2 (en) | 2005-10-26 | 2020-11-24 | Cortica Ltd | System and method for determining a contextual insight and providing recommendations based thereon |
US9558449B2 (en) | 2005-10-26 | 2017-01-31 | Cortica, Ltd. | System and method for identifying a target area in a multimedia content element |
US11019161B2 (en) | 2005-10-26 | 2021-05-25 | Cortica, Ltd. | System and method for profiling users interest based on multimedia content analysis |
US9384196B2 (en) | 2005-10-26 | 2016-07-05 | Cortica, Ltd. | Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof |
US10387914B2 (en) | 2005-10-26 | 2019-08-20 | Cortica, Ltd. | Method for identification of multimedia content elements and adding advertising content respective thereof |
US10691642B2 (en) | 2005-10-26 | 2020-06-23 | Cortica Ltd | System and method for enriching a concept database with homogenous concepts |
US9372940B2 (en) | 2005-10-26 | 2016-06-21 | Cortica, Ltd. | Apparatus and method for determining user attention using a deep-content-classification (DCC) system |
US9235557B2 (en) | 2005-10-26 | 2016-01-12 | Cortica, Ltd. | System and method thereof for dynamically associating a link to an information resource with a multimedia content displayed in a web-page |
US9286623B2 (en) | 2005-10-26 | 2016-03-15 | Cortica, Ltd. | Method for determining an area within a multimedia content element over which an advertisement can be displayed |
IL185414A0 (en) * | 2005-10-26 | 2008-01-06 | Igal Raichelgauz | Large-scale matching system and method for multimedia deep-content-classification |
US11386139B2 (en) | 2005-10-26 | 2022-07-12 | Cortica Ltd. | System and method for generating analytics for entities depicted in multimedia content |
US10193990B2 (en) | 2005-10-26 | 2019-01-29 | Cortica Ltd. | System and method for creating user profiles based on multimedia content |
US10949773B2 (en) | 2005-10-26 | 2021-03-16 | Cortica, Ltd. | System and methods thereof for recommending tags for multimedia content elements based on context |
US10380164B2 (en) | 2005-10-26 | 2019-08-13 | Cortica, Ltd. | System and method for using on-image gestures and multimedia content elements as search queries |
US10585934B2 (en) | 2005-10-26 | 2020-03-10 | Cortica Ltd. | Method and system for populating a concept database with respect to user identifiers |
US9747420B2 (en) | 2005-10-26 | 2017-08-29 | Cortica, Ltd. | System and method for diagnosing a patient based on an analysis of multimedia content |
US9767143B2 (en) | 2005-10-26 | 2017-09-19 | Cortica, Ltd. | System and method for caching of concept structures |
US10621988B2 (en) | 2005-10-26 | 2020-04-14 | Cortica Ltd | System and method for speech to text translation using cores of a natural liquid architecture system |
US8266185B2 (en) | 2005-10-26 | 2012-09-11 | Cortica Ltd. | System and methods thereof for generation of searchable structures respective of multimedia data content |
US9639532B2 (en) | 2005-10-26 | 2017-05-02 | Cortica, Ltd. | Context-based analysis of multimedia content items using signatures of multimedia elements and matching concepts |
US9489431B2 (en) | 2005-10-26 | 2016-11-08 | Cortica, Ltd. | System and method for distributed search-by-content |
US10776585B2 (en) | 2005-10-26 | 2020-09-15 | Cortica, Ltd. | System and method for recognizing characters in multimedia content |
US10607355B2 (en) | 2005-10-26 | 2020-03-31 | Cortica, Ltd. | Method and system for determining the dimensions of an object shown in a multimedia content item |
US10614626B2 (en) | 2005-10-26 | 2020-04-07 | Cortica Ltd. | System and method for providing augmented reality challenges |
US7688686B2 (en) | 2005-10-27 | 2010-03-30 | Microsoft Corporation | Enhanced table of contents (TOC) identifiers |
GB2431839B (en) | 2005-10-28 | 2010-05-19 | Sony Uk Ltd | Audio processing |
KR100803206B1 (ko) | 2005-11-11 | 2008-02-14 | 삼성전자주식회사 | Apparatus and method for audio fingerprint generation and audio data retrieval |
EP2070231B1 (en) | 2006-10-03 | 2013-07-03 | Shazam Entertainment, Ltd. | Method for high throughput of identification of distributed broadcast content |
EP3493074A1 (en) | 2006-10-05 | 2019-06-05 | Splunk Inc. | Time series search engine |
US10733326B2 (en) | 2006-10-26 | 2020-08-04 | Cortica Ltd. | System and method for identification of inappropriate multimedia content |
US8077839B2 (en) * | 2007-01-09 | 2011-12-13 | Freescale Semiconductor, Inc. | Handheld device for dialing of phone numbers extracted from a voicemail |
US20080317226A1 (en) * | 2007-01-09 | 2008-12-25 | Freescale Semiconductor, Inc. | Handheld device for transmitting a visual format message |
US10489795B2 (en) | 2007-04-23 | 2019-11-26 | The Nielsen Company (Us), Llc | Determining relative effectiveness of media content items |
US8849432B2 (en) * | 2007-05-31 | 2014-09-30 | Adobe Systems Incorporated | Acoustic pattern identification using spectral characteristics to synchronize audio and/or video |
US8140331B2 (en) * | 2007-07-06 | 2012-03-20 | Xia Lou | Feature extraction for identification and classification of audio signals |
US8006314B2 (en) | 2007-07-27 | 2011-08-23 | Audible Magic Corporation | System for identifying content of digital data |
US8213521B2 (en) * | 2007-08-15 | 2012-07-03 | The Nielsen Company (Us), Llc | Methods and apparatus for audience measurement using global signature representation and matching |
US8473283B2 (en) * | 2007-11-02 | 2013-06-25 | Soundhound, Inc. | Pitch selection modules in a system for automatic transcription of sung or hummed melodies |
CN101226741B (zh) * | 2007-12-28 | 2011-06-15 | 无敌科技(西安)有限公司 | Method for detecting active voice endpoints |
DE102008009025A1 (de) * | 2008-02-14 | 2009-08-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for calculating a fingerprint of an audio signal, apparatus and method for synchronizing, and apparatus and method for characterizing a test audio signal |
DE102008009024A1 (de) * | 2008-02-14 | 2009-08-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for synchronizing multichannel extension data with an audio signal and for processing the audio signal |
GB2457694B (en) * | 2008-02-21 | 2012-09-26 | Snell Ltd | Method of Deriving an Audio-Visual Signature |
ES2739667T3 (es) * | 2008-03-10 | 2020-02-03 | Fraunhofer Ges Forschung | Device and method for manipulating an audio signal having a transient event |
GB2458471A (en) * | 2008-03-17 | 2009-09-23 | Taylor Nelson Sofres Plc | A signature generating device for an audio signal and associated methods |
EP2114079B2 (en) | 2008-05-02 | 2018-01-24 | Psytechnics Ltd | Method and apparatus for aligning signals |
JP2010033265A (ja) | 2008-07-28 | 2010-02-12 | Nec Corp | Content distribution method and system |
US8359205B2 (en) | 2008-10-24 | 2013-01-22 | The Nielsen Company (Us), Llc | Methods and apparatus to perform audio watermarking and watermark detection and extraction |
US9667365B2 (en) | 2008-10-24 | 2017-05-30 | The Nielsen Company (Us), Llc | Methods and apparatus to perform audio watermarking and watermark detection and extraction |
US8121830B2 (en) | 2008-10-24 | 2012-02-21 | The Nielsen Company (Us), Llc | Methods and apparatus to extract data encoded in media content |
US8508357B2 (en) | 2008-11-26 | 2013-08-13 | The Nielsen Company (Us), Llc | Methods and apparatus to encode and decode audio for shopper location and advertisement presentation tracking |
US8199651B1 (en) | 2009-03-16 | 2012-06-12 | Audible Magic Corporation | Method and system for modifying communication flows at a port level |
US8738367B2 (en) * | 2009-03-18 | 2014-05-27 | Nec Corporation | Speech signal processing device |
US8351712B2 (en) | 2009-04-27 | 2013-01-08 | The Nielsen Company (US), LLC | Methods and apparatus to perform image classification based on pseudorandom features |
CA2760677C (en) | 2009-05-01 | 2018-07-24 | David Henry Harkness | Methods, apparatus and articles of manufacture to provide secondary content in association with primary broadcast media content |
GB2470201A (en) * | 2009-05-12 | 2010-11-17 | Nokia Corp | Synchronising audio and image data |
WO2010135623A1 (en) | 2009-05-21 | 2010-11-25 | Digimarc Corporation | Robust signatures derived from local nonlinear filters |
WO2010138777A1 (en) * | 2009-05-27 | 2010-12-02 | Arsh Technologies, Llc | Automatic resource retrieval and use |
US8489774B2 (en) * | 2009-05-27 | 2013-07-16 | Spot411 Technologies, Inc. | Synchronized delivery of interactive content |
US9071868B2 (en) | 2009-05-29 | 2015-06-30 | Cognitive Networks, Inc. | Systems and methods for improving server and client performance in fingerprint ACR systems |
US10949458B2 (en) | 2009-05-29 | 2021-03-16 | Inscape Data, Inc. | System and method for improving work load management in ACR television monitoring system |
US9449090B2 (en) | 2009-05-29 | 2016-09-20 | Vizio Inscape Technologies, Llc | Systems and methods for addressing a media database using distance associative hashing |
US8595781B2 (en) | 2009-05-29 | 2013-11-26 | Cognitive Media Networks, Inc. | Methods for identifying video segments and displaying contextual targeted content on a connected television |
US8190663B2 (en) * | 2009-07-06 | 2012-05-29 | Osterreichisches Forschungsinstitut Fur Artificial Intelligence Der Osterreichischen Studiengesellschaft Fur Kybernetik Of Freyung | Method and a system for identifying similar audio tracks |
WO2011009946A1 (en) | 2009-07-24 | 2011-01-27 | Johannes Kepler Universität Linz | A method and an apparatus for deriving information from an audio track and determining similarity between audio tracks |
US20110041154A1 (en) * | 2009-08-14 | 2011-02-17 | All Media Guide, Llc | Content Recognition and Synchronization on a Television or Consumer Electronics Device |
US8161071B2 (en) | 2009-09-30 | 2012-04-17 | United Video Properties, Inc. | Systems and methods for audio asset storage and management |
US8677400B2 (en) | 2009-09-30 | 2014-03-18 | United Video Properties, Inc. | Systems and methods for identifying audio content using an interactive media guidance application |
US8706276B2 (en) | 2009-10-09 | 2014-04-22 | The Trustees Of Columbia University In The City Of New York | Systems, methods, and media for identifying matching audio |
US8521779B2 (en) | 2009-10-09 | 2013-08-27 | Adelphoi Limited | Metadata record generation |
US9197736B2 (en) * | 2009-12-31 | 2015-11-24 | Digimarc Corporation | Intuitive computing methods and systems |
US8121618B2 (en) | 2009-10-28 | 2012-02-21 | Digimarc Corporation | Intuitive computing methods and systems |
US8860883B2 (en) * | 2009-11-30 | 2014-10-14 | Miranda Technologies Partnership | Method and apparatus for providing signatures of audio/video signals and for making use thereof |
US8682145B2 (en) | 2009-12-04 | 2014-03-25 | Tivo Inc. | Recording system based on multimedia content fingerprints |
US8886531B2 (en) * | 2010-01-13 | 2014-11-11 | Rovi Technologies Corporation | Apparatus and method for generating an audio fingerprint and using a two-stage query |
KR20150095957A (ko) | 2010-05-04 | 2015-08-21 | 샤잠 엔터테인먼트 리미티드 | Method and system for processing a sample of a media stream |
CA2943957C (en) | 2010-05-04 | 2017-10-03 | Avery Li-Chun Wang | Methods and systems for synchronizing media |
US9159338B2 (en) | 2010-05-04 | 2015-10-13 | Shazam Entertainment Ltd. | Systems and methods of rendering a textual animation |
JP5907511B2 (ja) | 2010-06-09 | 2016-04-26 | アデルフォイ リミテッド | System and method for audio media recognition |
US9876905B2 (en) | 2010-09-29 | 2018-01-23 | Genesys Telecommunications Laboratories, Inc. | System for initiating interactive communication in response to audio codes |
WO2012071442A1 (en) * | 2010-11-22 | 2012-05-31 | Listening Methods, Llc | System and method for pattern recognition and analysis |
WO2012112573A1 (en) | 2011-02-18 | 2012-08-23 | Shazam Entertainment Ltd. | Methods and systems for identifying content in a data stream by a client device |
US8589171B2 (en) | 2011-03-17 | 2013-11-19 | Remote Media, Llc | System and method for custom marking a media file for file matching |
US8688631B2 (en) | 2011-03-17 | 2014-04-01 | Alexander Savenok | System and method for media file synchronization |
US8478719B2 (en) | 2011-03-17 | 2013-07-02 | Remote Media LLC | System and method for media file synchronization |
US9380356B2 (en) | 2011-04-12 | 2016-06-28 | The Nielsen Company (Us), Llc | Methods and apparatus to generate a tag for media content |
US8996557B2 (en) | 2011-05-18 | 2015-03-31 | Microsoft Technology Licensing, Llc | Query and matching for content recognition |
EP2507790B1 (en) | 2011-06-06 | 2014-01-22 | Bridge Mediatech, S.L. | Method and system for robust audio hashing. |
KR20150113991A (ko) | 2011-06-08 | 2015-10-08 | 샤잠 엔터테인먼트 리미티드 | Method and system for performing a comparison of received data and providing a follow-up service based on the comparison |
US9256673B2 (en) | 2011-06-10 | 2016-02-09 | Shazam Entertainment Ltd. | Methods and systems for identifying content in a data stream |
US9209978B2 (en) | 2012-05-15 | 2015-12-08 | The Nielsen Company (Us), Llc | Methods and apparatus to measure exposure to streaming media |
US9515904B2 (en) | 2011-06-21 | 2016-12-06 | The Nielsen Company (Us), Llc | Monitoring streaming media content |
US8620646B2 (en) * | 2011-08-08 | 2013-12-31 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal using harmonic envelope |
US8639178B2 (en) | 2011-08-30 | 2014-01-28 | Clear Channel Management Services, Inc. | Broadcast source identification based on matching broadcast signal fingerprints |
US9461759B2 (en) | 2011-08-30 | 2016-10-04 | Iheartmedia Management Services, Inc. | Identification of changed broadcast media items |
US9374183B2 (en) | 2011-08-30 | 2016-06-21 | Iheartmedia Management Services, Inc. | Broadcast source identification based on matching via bit count |
US9049496B2 (en) * | 2011-09-01 | 2015-06-02 | Gracenote, Inc. | Media source identification |
US9113202B1 (en) * | 2011-09-21 | 2015-08-18 | Google Inc. | Inverted client-side fingerprinting and matching |
US9460465B2 (en) | 2011-09-21 | 2016-10-04 | Genesys Telecommunications Laboratories, Inc. | Graphical menu builder for encoding applications in an image |
US9384272B2 (en) | 2011-10-05 | 2016-07-05 | The Trustees Of Columbia University In The City Of New York | Methods, systems, and media for identifying similar songs using jumpcodes |
US8831763B1 (en) * | 2011-10-18 | 2014-09-09 | Google Inc. | Intelligent interest point pruning for audio matching |
US8538333B2 (en) | 2011-12-16 | 2013-09-17 | Arbitron Inc. | Media exposure linking utilizing bluetooth signal characteristics |
US8977194B2 (en) | 2011-12-16 | 2015-03-10 | The Nielsen Company (Us), Llc | Media exposure and verification utilizing inductive coupling |
US9268845B1 (en) * | 2012-03-08 | 2016-02-23 | Google Inc. | Audio matching using time alignment, frequency alignment, and interest point overlap to filter false positives |
JP2013205830A (ja) * | 2012-03-29 | 2013-10-07 | Sony Corp | Tone component detection method, tone component detection device, and program |
EP2648418A1 (en) * | 2012-04-05 | 2013-10-09 | Thomson Licensing | Synchronization of multimedia streams |
US9235867B2 (en) * | 2012-06-04 | 2016-01-12 | Microsoft Technology Licensing, Llc | Concurrent media delivery |
US9129015B1 (en) * | 2012-06-26 | 2015-09-08 | Google Inc. | Min/max filter for audio matching |
US9282366B2 (en) | 2012-08-13 | 2016-03-08 | The Nielsen Company (Us), Llc | Methods and apparatus to communicate audience measurement information |
US20140074466A1 (en) * | 2012-09-10 | 2014-03-13 | Google Inc. | Answering questions using environmental context |
US9081778B2 (en) | 2012-09-25 | 2015-07-14 | Audible Magic Corporation | Using digital fingerprints to associate data with a work |
US9390719B1 (en) * | 2012-10-09 | 2016-07-12 | Google Inc. | Interest points density control for audio matching |
US9069849B1 (en) * | 2012-10-10 | 2015-06-30 | Google Inc. | Methods for enforcing time alignment for speed resistant audio matching |
EP2731030A1 (en) * | 2012-11-13 | 2014-05-14 | Samsung Electronics Co., Ltd | Music information searching method and apparatus thereof |
US9158760B2 (en) | 2012-12-21 | 2015-10-13 | The Nielsen Company (Us), Llc | Audio decoding with supplemental semantic audio recognition and report generation |
US9195649B2 (en) | 2012-12-21 | 2015-11-24 | The Nielsen Company (Us), Llc | Audio processing techniques for semantic audio recognition and report generation |
US9183849B2 (en) | 2012-12-21 | 2015-11-10 | The Nielsen Company (Us), Llc | Audio matching with semantic audio recognition and report generation |
US9706252B2 (en) | 2013-02-04 | 2017-07-11 | Universal Electronics Inc. | System and method for user monitoring and intent determination |
CN103971689B (zh) * | 2013-02-04 | 2016-01-27 | 腾讯科技(深圳)有限公司 | Audio recognition method and device |
US9313544B2 (en) | 2013-02-14 | 2016-04-12 | The Nielsen Company (Us), Llc | Methods and apparatus to measure exposure to streaming media |
US9311640B2 (en) | 2014-02-11 | 2016-04-12 | Digimarc Corporation | Methods and arrangements for smartphone payments and transactions |
FR3002713B1 (fr) * | 2013-02-27 | 2015-02-27 | Inst Mines Telecom | Generation of a signature of a musical audio signal |
US9451048B2 (en) | 2013-03-12 | 2016-09-20 | Shazam Investments Ltd. | Methods and systems for identifying information of a broadcast station and information of broadcasted content |
US9390170B2 (en) | 2013-03-15 | 2016-07-12 | Shazam Investments Ltd. | Methods and systems for arranging and searching a database of media content recordings |
US9773058B2 (en) | 2013-03-15 | 2017-09-26 | Shazam Investments Ltd. | Methods and systems for arranging and searching a database of media content recordings |
US20140278845A1 (en) | 2013-03-15 | 2014-09-18 | Shazam Investments Limited | Methods and Systems for Identifying Target Media Content and Determining Supplemental Information about the Target Media Content |
WO2014169238A1 (en) | 2013-04-11 | 2014-10-16 | Digimarc Corporation | Methods for object recognition and related arrangements |
US10614132B2 (en) | 2013-04-30 | 2020-04-07 | Splunk Inc. | GUI-triggered processing of performance data and log data from an information technology environment |
US10019496B2 (en) | 2013-04-30 | 2018-07-10 | Splunk Inc. | Processing of performance data and log data from an information technology environment by using diverse data stores |
US10997191B2 (en) | 2013-04-30 | 2021-05-04 | Splunk Inc. | Query-triggered processing of performance data and log data from an information technology environment |
US10318541B2 (en) | 2013-04-30 | 2019-06-11 | Splunk Inc. | Correlating log data with performance measurements having a specified relationship to a threshold value |
US10353957B2 (en) | 2013-04-30 | 2019-07-16 | Splunk Inc. | Processing of performance data and raw log data from an information technology environment |
US10225136B2 (en) | 2013-04-30 | 2019-03-05 | Splunk Inc. | Processing of log data and performance data obtained via an application programming interface (API) |
US10346357B2 (en) | 2013-04-30 | 2019-07-09 | Splunk Inc. | Processing of performance data and structure data from an information technology environment |
US9460201B2 (en) | 2013-05-06 | 2016-10-04 | Iheartmedia Management Services, Inc. | Unordered matching of audio fingerprints |
CN103402118B (zh) * | 2013-07-05 | 2017-12-01 | Tcl集团股份有限公司 | Media program interaction method and system |
US20150039321A1 (en) | 2013-07-31 | 2015-02-05 | Arbitron Inc. | Apparatus, System and Method for Reading Codes From Digital Audio on a Processing Device |
US9711152B2 (en) | 2013-07-31 | 2017-07-18 | The Nielsen Company (Us), Llc | Systems apparatus and methods for encoding/decoding persistent universal media codes to encoded audio |
US9275427B1 (en) * | 2013-09-05 | 2016-03-01 | Google Inc. | Multi-channel audio video fingerprinting |
US9898086B2 (en) * | 2013-09-06 | 2018-02-20 | Immersion Corporation | Systems and methods for visual processing of spectrograms to generate haptic effects |
US9053711B1 (en) | 2013-09-10 | 2015-06-09 | Ampersand, Inc. | Method of matching a digitized stream of audio signals to a known audio recording |
US10014006B1 (en) | 2013-09-10 | 2018-07-03 | Ampersand, Inc. | Method of determining whether a phone call is answered by a human or by an automated device |
TWI527025B (zh) * | 2013-11-11 | 2016-03-21 | 財團法人資訊工業策進會 | Computer system, audio matching method, and computer-readable recording medium thereof |
NL2011893C2 (en) * | 2013-12-04 | 2015-06-08 | Stichting Incas3 | Method and system for predicting human activity. |
US9426525B2 (en) | 2013-12-31 | 2016-08-23 | The Nielsen Company (Us), Llc. | Methods and apparatus to count people in an audience |
WO2015118431A1 (en) | 2014-02-05 | 2015-08-13 | Edge Innovation, Lda. | Method for capture and analysis of multimedia content |
US10430985B2 (en) | 2014-03-14 | 2019-10-01 | Magic Leap, Inc. | Augmented reality systems and methods utilizing reflections |
US9699499B2 (en) | 2014-04-30 | 2017-07-04 | The Nielsen Company (Us), Llc | Methods and apparatus to measure exposure to streaming media |
CN104093079B (zh) | 2014-05-29 | 2015-10-07 | 腾讯科技(深圳)有限公司 | Multimedia-program-based interaction method, terminal, server, and system |
EP3023884A1 (en) * | 2014-11-21 | 2016-05-25 | Thomson Licensing | Method and apparatus for generating fingerprint of an audio signal |
CN111757189B (zh) * | 2014-12-01 | 2022-07-15 | 构造数据有限责任公司 | System and method for continuous media segment identification |
WO2016086905A1 (es) * | 2014-12-05 | 2016-06-09 | Monitoreo Tecnológico, S.A | Audience measurement method |
CA2973740C (en) | 2015-01-30 | 2021-06-08 | Inscape Data, Inc. | Methods for identifying video segments and displaying option to view from an alternative source and/or on an alternative device |
US10360583B2 (en) | 2015-02-05 | 2019-07-23 | Direct Path, Llc | System and method for direct response advertising |
EP4375952A3 (en) | 2015-04-17 | 2024-06-19 | Inscape Data, Inc. | Systems and methods for reducing data density in large datasets |
CN106294331B (zh) * | 2015-05-11 | 2020-01-21 | 阿里巴巴集团控股有限公司 | Audio information retrieval method and device |
US9762965B2 (en) | 2015-05-29 | 2017-09-12 | The Nielsen Company (Us), Llc | Methods and apparatus to measure exposure to streaming media |
BR112018000801A2 (pt) | 2015-07-16 | 2018-09-04 | Inscape Data Inc | System and method |
US10080062B2 (en) | 2015-07-16 | 2018-09-18 | Inscape Data, Inc. | Optimizing media fingerprint retention to improve system resource utilization |
CA3216076A1 (en) | 2015-07-16 | 2017-01-19 | Inscape Data, Inc. | Detection of common media segments |
CN106558318B (zh) * | 2015-09-24 | 2020-04-28 | 阿里巴巴集团控股有限公司 | Audio recognition method and system |
WO2017105641A1 (en) | 2015-12-15 | 2017-06-22 | Cortica, Ltd. | Identification of key points in multimedia data elements |
US11195043B2 (en) | 2015-12-15 | 2021-12-07 | Cortica, Ltd. | System and method for determining common patterns in multimedia content elements based on key points |
US9596502B1 (en) | 2015-12-21 | 2017-03-14 | Max Abecassis | Integration of multiple synchronization methodologies |
US9516373B1 (en) | 2015-12-21 | 2016-12-06 | Max Abecassis | Presets of synchronized second screen functions |
JP6952713B2 (ja) | 2016-01-19 | 2021-10-20 | マジック リープ, インコーポレイテッド (Magic Leap, Inc.) | Augmented reality systems and methods utilizing reflections |
US9786298B1 (en) | 2016-04-08 | 2017-10-10 | Source Digital, Inc. | Audio fingerprinting based on audio energy characteristics |
US10397663B2 (en) | 2016-04-08 | 2019-08-27 | Source Digital, Inc. | Synchronizing ancillary data to content including audio |
US10951935B2 (en) | 2016-04-08 | 2021-03-16 | Source Digital, Inc. | Media environment driven content distribution platform |
US10311918B1 (en) | 2016-04-19 | 2019-06-04 | Space Projects Ltd. | System, media, and method for synchronization of independent sensors and recording devices |
AU2017257549B2 (en) | 2016-04-26 | 2021-09-09 | Magic Leap, Inc. | Electromagnetic tracking with augmented reality systems |
US10015612B2 (en) | 2016-05-25 | 2018-07-03 | Dolby Laboratories Licensing Corporation | Measurement, verification and correction of time alignment of multiple audio channels and associated metadata |
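CN106910494B (zh) | 2016-06-28 | 2020-11-13 | 创新先进技术有限公司 | Audio recognition method and device |
JPWO2018047805A1 (ja) * | 2016-09-09 | 2019-06-24 | 日本電気株式会社 | Moving sound source speed estimation device, speed monitoring system, moving sound source speed estimation method, and moving sound source speed estimation program |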
EP3312724B1 (en) | 2016-10-21 | 2019-10-30 | Fujitsu Limited | Microservice-based data processing apparatus, method, and program |
EP3312722A1 (en) | 2016-10-21 | 2018-04-25 | Fujitsu Limited | Data processing apparatus, method, and program |
US10776170B2 (en) | 2016-10-21 | 2020-09-15 | Fujitsu Limited | Software service execution apparatus, system, and method |
JP7100422B2 (ja) | 2016-10-21 | 2022-07-13 | 富士通株式会社 | Apparatus, program, and method for data property recognition |
JP6805765B2 (ja) | 2016-10-21 | 2020-12-23 | 富士通株式会社 | System, method, and program for execution of software services |
US10922720B2 (en) | 2017-01-11 | 2021-02-16 | Adobe Inc. | Managing content delivery via audio cues |
US10166472B2 (en) | 2017-05-04 | 2019-01-01 | Shazam Investments Ltd. | Methods and systems for determining a reaction time for a response and synchronizing user interface(s) with content being rendered |
US10860786B2 (en) | 2017-06-01 | 2020-12-08 | Global Tel*Link Corporation | System and method for analyzing and investigating communication data from a controlled environment |
WO2019008581A1 (en) | 2017-07-05 | 2019-01-10 | Cortica Ltd. | DETERMINATION OF DRIVING POLICIES |
GB2564495A (en) * | 2017-07-07 | 2019-01-16 | Cirrus Logic Int Semiconductor Ltd | Audio data transfer |
US11899707B2 (en) | 2017-07-09 | 2024-02-13 | Cortica Ltd. | Driving policies determination |
US10129392B1 (en) * | 2017-08-25 | 2018-11-13 | Global Tel*Link Corporation | Systems and methods for detecting inmate to inmate conference calls |
FR3071994A1 (fr) * | 2017-09-29 | 2019-04-05 | Theater Ears, LLC | Method and program for audio recognition and synchronization |
US20190104335A1 (en) * | 2017-09-29 | 2019-04-04 | Theater Ears, LLC | Theater ears audio recognition & synchronization algorithm |
US10158907B1 (en) * | 2017-10-10 | 2018-12-18 | Shazam Investments Ltd. | Systems and methods for performing playout of multiple media recordings based on a matching segment among the recordings |
US20190109804A1 (en) * | 2017-10-10 | 2019-04-11 | Microsoft Technology Licensing, Llc | Audio processing for voice simulated noise effects |
US10129575B1 (en) | 2017-10-25 | 2018-11-13 | Shazam Entertainment Limited | Methods and systems for determining a latency between a source and an alternative feed of the source |
US10846544B2 (en) | 2018-07-16 | 2020-11-24 | Cartica Ai Ltd. | Transportation prediction system and method |
US11418655B2 (en) * | 2018-07-18 | 2022-08-16 | Google Llc | Echo detection |
US11443724B2 (en) * | 2018-07-31 | 2022-09-13 | Mediawave Intelligent Communication | Method of synchronizing electronic interactive device |
US10839694B2 (en) | 2018-10-18 | 2020-11-17 | Cartica Ai Ltd | Blind spot alert |
US11181911B2 (en) | 2018-10-18 | 2021-11-23 | Cartica Ai Ltd | Control transfer of a vehicle |
US11126870B2 (en) | 2018-10-18 | 2021-09-21 | Cartica Ai Ltd. | Method and system for obstacle detection |
US20200133308A1 (en) | 2018-10-18 | 2020-04-30 | Cartica Ai Ltd | Vehicle to vehicle (v2v) communication less truck platooning |
US11270132B2 (en) | 2018-10-26 | 2022-03-08 | Cartica Ai Ltd | Vehicle to vehicle communication and signatures |
US10789535B2 (en) | 2018-11-26 | 2020-09-29 | Cartica Ai Ltd | Detection of road elements |
US11643005B2 (en) | 2019-02-27 | 2023-05-09 | Autobrains Technologies Ltd | Adjusting adjustable headlights of a vehicle |
US11285963B2 (en) | 2019-03-10 | 2022-03-29 | Cartica Ai Ltd. | Driver-based prediction of dangerous events |
US11694088B2 (en) | 2019-03-13 | 2023-07-04 | Cortica Ltd. | Method for object detection using knowledge distillation |
US11132548B2 (en) | 2019-03-20 | 2021-09-28 | Cortica Ltd. | Determining object information that does not explicitly appear in a media unit signature |
US12055408B2 (en) | 2019-03-28 | 2024-08-06 | Autobrains Technologies Ltd | Estimating a movement of a hybrid-behavior vehicle |
US11488290B2 (en) | 2019-03-31 | 2022-11-01 | Cortica Ltd. | Hybrid representation of a media unit |
US11222069B2 (en) | 2019-03-31 | 2022-01-11 | Cortica Ltd. | Low-power calculation of a signature of a media unit |
US10776669B1 (en) | 2019-03-31 | 2020-09-15 | Cortica Ltd. | Signature generation and object detection that refer to rare scenes |
US10789527B1 (en) | 2019-03-31 | 2020-09-29 | Cortica Ltd. | Method for object detection using shallow neural networks |
US10796444B1 (en) | 2019-03-31 | 2020-10-06 | Cortica Ltd | Configuring spanning elements of a signature generator |
US11245959B2 (en) | 2019-06-20 | 2022-02-08 | Source Digital, Inc. | Continuous dual authentication to access media content |
US10748022B1 (en) | 2019-12-12 | 2020-08-18 | Cartica Ai Ltd | Crowd separation |
US11593662B2 (en) | 2019-12-12 | 2023-02-28 | Autobrains Technologies Ltd | Unsupervised cluster generation |
DE102020202400A1 (de) | 2020-02-25 | 2021-08-26 | OSRAM Opto Semiconductors Gesellschaft mit beschränkter Haftung | Optoelectronic sensor device, detector and electronic device, and method for operating such a sensor device or such a detector |
US11590988B2 (en) | 2020-03-19 | 2023-02-28 | Autobrains Technologies Ltd | Predictive turning assistant |
US11827215B2 (en) | 2020-03-31 | 2023-11-28 | AutoBrains Technologies Ltd. | Method for training a driving related object detector |
US11756424B2 (en) | 2020-07-24 | 2023-09-12 | AutoBrains Technologies Ltd. | Parking assist |
US12049116B2 (en) | 2020-09-30 | 2024-07-30 | Autobrains Technologies Ltd | Configuring an active suspension |
US11694692B2 (en) | 2020-11-11 | 2023-07-04 | Bank Of America Corporation | Systems and methods for audio enhancement and conversion |
EP4194300A1 (en) | 2021-08-05 | 2023-06-14 | Autobrains Technologies LTD. | Providing a prediction of a radius of a motorcycle turn |
WO2023023628A1 (en) * | 2021-08-18 | 2023-02-23 | Advanced Neuromodulation Systems, Inc. | Systems and methods for providing digital health services |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4415767A (en) * | 1981-10-19 | 1983-11-15 | Votan | Method and apparatus for speech recognition and reproduction |
US4450531A (en) * | 1982-09-10 | 1984-05-22 | Ensco, Inc. | Broadcast signal recognition system and method |
US4843562A (en) * | 1987-06-24 | 1989-06-27 | Broadcast Data Systems Limited Partnership | Broadcast information classification system and method |
US5210820A (en) * | 1990-05-02 | 1993-05-11 | Broadcast Data Systems Limited Partnership | Signal recognition system and method |
GB9424429D0 (en) * | 1994-12-02 | 1995-01-18 | Philips Electronics Uk Ltd | Audio/video timing discrepancy management |
US5918223A (en) * | 1996-07-22 | 1999-06-29 | Muscle Fish | Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information |
US6088455A (en) * | 1997-01-07 | 2000-07-11 | Logan; James D. | Methods and apparatus for selectively reproducing segments of broadcast programming |
EP0896712A4 (en) * | 1997-01-31 | 2000-01-26 | T Netix Inc | SYSTEM AND METHOD FOR DISCOVERING RECORDED LANGUAGE |
US5940799A (en) | 1997-09-15 | 1999-08-17 | Motorola, Inc. | System and method for securing speech transactions |
US5913196A (en) | 1997-11-17 | 1999-06-15 | Talmor; Rita | System and method for establishing identity of a speaker |
CN1219810A (zh) * | 1997-12-12 | 1999-06-16 | 上海金陵股份有限公司 | Remote public computer system |
US6434520B1 (en) * | 1999-04-16 | 2002-08-13 | International Business Machines Corporation | System and method for indexing and querying audio archives |
US20010044719A1 (en) * | 1999-07-02 | 2001-11-22 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for recognizing, indexing, and searching acoustic signals |
GR1003625B (el) * | 1999-07-08 | 2001-08-31 | Method for the chemical deposition of composite coatings of conductive polymers on aluminium alloy surfaces | |
US7194752B1 (en) * | 1999-10-19 | 2007-03-20 | Iceberg Industries, Llc | Method and apparatus for automatically recognizing input audio and/or video streams |
US7174293B2 (en) * | 1999-09-21 | 2007-02-06 | Iceberg Industries Llc | Audio identification system and method |
US6453252B1 (en) * | 2000-05-15 | 2002-09-17 | Creative Technology Ltd. | Process for identifying audio content |
US6990453B2 (en) * | 2000-07-31 | 2006-01-24 | Landmark Digital Services Llc | System and methods for recognizing sound and music signals in high noise and distortion |
US7853664B1 (en) * | 2000-07-31 | 2010-12-14 | Landmark Digital Services Llc | Method and system for purchasing pre-recorded music |
US20020072982A1 (en) * | 2000-12-12 | 2002-06-13 | Shazam Entertainment Ltd. | Method and system for interacting with a user in an experiential environment |
US6483927B2 (en) | 2000-12-18 | 2002-11-19 | Digimarc Corporation | Synchronizing readers of hidden auxiliary data in quantization-based data hiding schemes |
ATE405101T1 (de) * | 2001-02-12 | 2008-08-15 | Gracenote Inc | Method for generating an identification hash from the content of a multimedia file |
WO2003009277A2 (en) * | 2001-07-20 | 2003-01-30 | Gracenote, Inc. | Automatic identification of sound recordings |
US7082394B2 (en) * | 2002-06-25 | 2006-07-25 | Microsoft Corporation | Noise-robust feature extraction using multi-layer principal component analysis |
CN1708758A (zh) * | 2002-11-01 | 2005-12-14 | 皇家飞利浦电子股份有限公司 | Improved audio data fingerprint searching |
KR100456408B1 (ko) * | 2004-02-06 | 2004-11-10 | (주)뮤레카 | Audio gene generation method and audio data search method |
CA2595634C (en) * | 2005-02-08 | 2014-12-30 | Landmark Digital Services Llc | Automatic identification of repeated material in audio signals |
2003
- 2003-04-18 DE DE60323086T patent/DE60323086D1/de not_active Expired - Lifetime
- 2003-04-18 CA CA2483104A patent/CA2483104C/en not_active Expired - Fee Related
- 2003-04-18 DK DK03724113T patent/DK1504445T3/da active
- 2003-04-18 CN CNB038089386A patent/CN1315110C/zh not_active Expired - Fee Related
- 2003-04-18 PT PT03724113T patent/PT1504445E/pt unknown
- 2003-04-18 WO PCT/US2003/012126 patent/WO2003091990A1/en active Application Filing
- 2003-04-18 AT AT03724113T patent/ATE405924T1/de active
- 2003-04-18 AU AU2003230993A patent/AU2003230993A1/en not_active Abandoned
- 2003-04-18 ES ES03724113T patent/ES2312772T3/es not_active Expired - Lifetime
- 2003-04-18 BR BR0309598-3A patent/BR0309598A/pt active Pending
- 2003-04-18 EP EP03724113A patent/EP1504445B1/en not_active Expired - Lifetime
- 2003-04-18 KR KR1020047016919A patent/KR100820385B1/ko not_active IP Right Cessation
- 2003-04-18 JP JP2004500283A patent/JP4425126B2/ja not_active Expired - Fee Related
- 2003-04-24 TW TW092109632A patent/TWI269196B/zh not_active IP Right Cessation
2004
- 2004-10-21 US US10/978,313 patent/US7627477B2/en active Active
2005
- 2005-07-14 HK HK05105991A patent/HK1073382A1/xx unknown
Also Published As
Publication number | Publication date |
---|---|
TW200307205A (en) | 2003-12-01 |
US20050177372A1 (en) | 2005-08-11 |
CA2483104A1 (en) | 2003-11-06 |
DK1504445T3 (da) | 2008-12-01 |
US7627477B2 (en) | 2009-12-01 |
ES2312772T3 (es) | 2009-03-01 |
EP1504445A1 (en) | 2005-02-09 |
EP1504445B1 (en) | 2008-08-20 |
ATE405924T1 (de) | 2008-09-15 |
CA2483104C (en) | 2011-06-21 |
US20090265174A9 (en) | 2009-10-22 |
JP2005524108A (ja) | 2005-08-11 |
CN1315110C (zh) | 2007-05-09 |
PT1504445E (pt) | 2008-11-24 |
CN1647160A (zh) | 2005-07-27 |
EP1504445A4 (en) | 2005-08-17 |
WO2003091990A1 (en) | 2003-11-06 |
DE60323086D1 (de) | 2008-10-02 |
TWI269196B (en) | 2006-12-21 |
KR20050010763A (ko) | 2005-01-28 |
AU2003230993A1 (en) | 2003-11-10 |
KR100820385B1 (ko) | 2008-04-10 |
BR0309598A (pt) | 2005-02-09 |
HK1073382A1 (en) | 2005-09-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4425126B2 (ja) | Robust and invariant audio pattern matching | |
US9313593B2 (en) | Ranking representative segments in media data | |
KR100725018B1 (ko) | Method and apparatus for automatic summarization of music content | |
Gillet et al. | Transcription and separation of drum signals from polyphonic music | |
US7342167B2 (en) | Apparatus and method for generating an encoded rhythmic pattern | |
US7626111B2 (en) | Similar music search method and apparatus using music content summary | |
KR100880480B1 (ko) | Method and system for real-time music/speech discrimination in digital audio signals | |
US20150134329A1 (en) | Content identification system | |
US20140330556A1 (en) | Low complexity repetition detection in media data | |
US20060155399A1 (en) | Method and system for generating acoustic fingerprints | |
Arzt et al. | Fast Identification of Piece and Score Position via Symbolic Fingerprinting. | |
WO2005122141A1 (en) | Effective audio segmentation and classification | |
Tsipas et al. | Efficient audio-driven multimedia indexing through similarity-based speech/music discrimination | |
Verma et al. | Structural segmentation of Hindustani concert audio with posterior features | |
US7680654B2 (en) | Apparatus and method for segmentation of audio data into meta patterns | |
Zhang et al. | Audio segmentation based on multi-scale audio classification | |
JP2010038943A (ja) | Acoustic signal processing apparatus and method | |
Gillet et al. | Comparing audio and video segmentations for music videos indexing | |
Struharová | Performance of the Dejavu audio fingerprinting framework in music identification in movies | |
Gruhne et al. | Extraction of Drum Patterns and their Description within the MPEG-7 High-Level-Framework. | |
Ghaemmaghami et al. | BIRTH-DEATH FREQUENCIES VARIANCE OF SINUSOIDAL MODEL |
Legal Events
Code | Title | Description |
---|---|---|
A711 | Notification of change in applicant | Free format text: JAPANESE INTERMEDIATE CODE: A711 Effective date: 20051222 |
A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A821 Effective date: 20051222 |
A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20060407 |
A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20090714 |
A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20091013 |
TRDD | Decision of grant or rejection written | |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20091110 |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20091208 |
R150 | Certificate of patent or registration of utility model | Ref document number: 4425126 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20121218 Year of fee payment: 3 |
FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20121218 Year of fee payment: 3 |
FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20131218 Year of fee payment: 4 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
S111 | Request for change of ownership or part of ownership | Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
R350 | Written notification of registration of transfer | Free format text: JAPANESE INTERMEDIATE CODE: R350 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
S531 | Written request for registration of change of domicile | Free format text: JAPANESE INTERMEDIATE CODE: R313531 |
S533 | Written request for registration of change of name | Free format text: JAPANESE INTERMEDIATE CODE: R313533 |
R350 | Written notification of registration of transfer | Free format text: JAPANESE INTERMEDIATE CODE: R350 |
RD02 | Notification of acceptance of power of attorney | Free format text: JAPANESE INTERMEDIATE CODE: R3D02 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
S111 | Request for change of ownership or part of ownership | Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
S531 | Written request for registration of change of domicile | Free format text: JAPANESE INTERMEDIATE CODE: R313531 |
R350 | Written notification of registration of transfer | Free format text: JAPANESE INTERMEDIATE CODE: R350 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
R250 | Receipt of annual fees | Free format text: JAPANESE INTERMEDIATE CODE: R250 |
LAPS | Cancellation because of no payment of annual fees | |