CN1315110C - 坚固而且不变的音频图样匹配 - Google Patents

坚固而且不变的音频图样匹配 Download PDF

Info

Publication number
CN1315110C
CN1315110C CNB038089386A CN03808938A CN1315110C CN 1315110 C CN1315110 C CN 1315110C CN B038089386 A CNB038089386 A CN B038089386A CN 03808938 A CN03808938 A CN 03808938A CN 1315110 C CN1315110 C CN 1315110C
Authority
CN
China
Prior art keywords
audio frequency
ordinary
fingerprint
relative
fingerprint objects
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB038089386A
Other languages
English (en)
Other versions
CN1647160A (zh
Inventor
A·礼俊·王
丹尼尔·库伯特
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shazam Investments Ltd
Original Assignee
Landmark Digital Services LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Landmark Digital Services LLC filed Critical Landmark Digital Services LLC
Publication of CN1647160A publication Critical patent/CN1647160A/zh
Application granted granted Critical
Publication of CN1315110C publication Critical patent/CN1315110C/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/10Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2240/00Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/121Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
    • G10H2240/131Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
    • G10H2240/135Library retrieval index, i.e. using an indexing scheme to efficiently retrieve a music piece
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2240/00Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/121Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
    • G10H2240/131Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
    • G10H2240/141Library retrieval matching, i.e. any of the steps of matching an inputted segment or phrase with musical database contents, e.g. query by humming, singing or playing; the steps may include, e.g. musical analysis of the input, musical feature extraction, query formulation, or details of the retrieval process
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2250/00Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/131Mathematical functions for musical analysis, processing, synthesis or composition
    • G10H2250/215Transforms, i.e. mathematical transforms into domains appropriate for musical signal processing, coding or compression
    • G10H2250/235Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech

Abstract

本发明提供一种用以快速并准确决定两个音频试样是否匹配、以及是否免于如为播放速度变动的各种变换的发明技术。两音频试样的间的关系的特征是首先匹配得自各别试样的某些指纹物件。对每个音频试样(210)产生一组(230)指纹物件(231、232),它们中的每一个发生在一特别位置(242)上。各位置(242)的决定依各音频试样(210)的内容而定,而且各指纹物件(232)在或接近各别特殊位置(242)处具备一或更多局部特性(222)。接着为每对匹配指纹物件决定相对值。然后产生一相对值的直方图。如发现一统计上的明显峰值,则两音频试样具备实质上匹配的特征。

Description

坚固而且不变的音频图样匹配
技术领域
本发明通常是有关于在一大音频档案资料库上处理音频讯号。特别是关于一种用以快速并准确地决定两个音频试样是否匹配,以及是否免于含播放速度变动的各种变换的发明技术。本发明技术更能准确估算变换。
背景技术
用以快速且准确地自动辨识音乐及其它音频讯号的需求持续成长。先前可用的音频辨识技术常因准确性、或为了减少噪讯而牺牲速度。在某些申请案中,要在极度噪讯存在时对于估算时间对时间的分布图的斜率时需要计算回归分析,这导引出许多难度及低效能的速度和准确性。先前既存的音频辨识技术在明显的播放速度变动存在时,因此无法实施快速及准确的辨识,例如,无法辨识以高于正常速度播放时的录音。
增加问题复杂性的是DJ在无线电台、俱乐部和其它场所所使用的渐增,受到欢迎种类的速度变动,修正音调的节奏变动。目前,不管播放速度变动及/或修正音调节奏变动的话,没有能实施快速且准确音频辨识的稳健而且可靠的技术。
发明内容
本发明提供一种说明两音频档案之间关系特征的快速且不变动的方法,完成音频辨识技术的要求。本发明方法克服既有技术的以上所提及缺点,甚至在极度噪讯存在时亦为准确。
根据本发明一观点,两音频试样间的关系具有首先匹配得自各别试样的某种指纹物件的特征。对各音频试样产生一组指纹物件。各指纹物件是发生在各别音频试样内的一特别位置。各位置的决定依各别音频试样内容而定,且各指纹物件在或接近各特别位置处具备各别音频试样的一或更多局部特性。在一实施例中,各指纹物件进一步的特征是具备一变动成份和不变动成份。接着每对匹配指纹物件确定一相对值。然后产生相对值的直方图。如在直方图中发现统计上的明显峰值,则两音频试样具备这样的特征,例如实质上的匹配。
根据本发明的另一方面,由直方图轴上的一峰值位置提供一全面相对值的估算,使上述技术可以进一步被提高。接着,通过选取一个在感兴趣的峰值邻近区域并计算一在所选取的邻近区域的相对平均值可改进该全面相对值。
还有,在另一实施例中,从直方图的峰值决定一相对播放速度值,对每对匹配的指纹物件计算一补偿的相对时间偏置值。根据补偿的相对时间偏置值产生另一直方图。如在第二直方图中发现统计上的明显峰值时,两音频试样间的关系则进而具备峰值特征,进而加强本发明的准确性。
附图说明
第1图代表一分析音频试样的频谱图。
第2图为一表示根据本发明一观点,产生自一音频试样指纹物件的范例图。
第3图说明根据本发明原理所比较的两音频试样。
第4A-B图表示具有及不具一统计上明显峰值的典范直方图。
第5A-B图说明当播放速度变动时,时间-频率点的运动。
第6A-B图表示匹配混合标识的第一音频试样(试样声音)和第二音频试样(资料库声音)的对应时间。当试样声音的播放速度与资料库声音相同时,斜率为1。
第7A-D图说明找到并绘制本发明直方图技术的快速及有效斜率。
元件对照表
210:音频试样
220:频谱
221,222:能量区
230:清单
231:,232:指纹物件
242:位置栏位
252:变动成份
262:不变动成份
310,320:指纹物件清单
1,2:音频试样
311,322:指纹物件
具体实施方式
本发明能在一大音频档案资料库上作快速、强力、不变动、及在一个大的音频档案数据库里可扩缩的索引及搜寻,并对音频图样辨识应用特别有用。在某些实施例中,此处所发表的技术改进并增强了在以上所参考的美国专利申请案中所发表的音频辨识系统和方法。
两音频试样档案间的非常快速与有效的比较运算在建立一商业上可行的音频辨识系统中是重要的。根据本发明一个方面,两音频试样间关系具备这样的特征,即,如第1图中所示,首先匹配得自各个音频试样频谱的某种指纹物件。频谱为一时间、频率代表/分析,它是以滑动窗框中一次取样2*K并计算傅立叶(Fourier)变换产生的,因此在各音框中产生K频箱。音框可重叠加以改进时间的解析分析。使用的特别参数依处理的音频试样种类而定。最好使用取样率8KHZ,K=512的音框,和跨步为64试样的离散时间音频档案。
指纹物件
产生各音频试样的音谱后,被扫描求得局部特性,例如局部能量峰值(如第2图中所示)。匹配程序通过一个音频试样的对应局部特性抽取一组指纹物件而开始。在一典范实施例中,一音频试样为一要加以辨识的未知声音试样而另一音频试样为一储存在资料数据库中的已知录音。每一指纹物件发生在各音频试样内的一特别位置。在某些实施例中,每个指纹物件被定位在一音频档案内的某些时间偏置位置,并在接近其各别时间座标位置,包含有关音频档案的一组叙述资讯。那就是,依接近各别时间偏置的音频试样而定加以计算各指纹物件中所包含的叙述资讯。这被编码成一小资料结构。最好,以通常可再生,甚至存在噪讯,失真,及如变动播放速度的其它变换的方式,决定位置和叙述资讯。在这情况中,依各别音频试样的内容而定,决定各位置,且每个指纹物件具备这样的特性,如第1图中所示,在或接近例如,位置(t1,f1)或(t2,f2)的各别特别位置处各指纹物件具各别音频试样的一或更多局部特性。
在一典范实施例中,各指纹物件具备其位置,变动成份、和不变动成份的特征。各局部特性为一音谱峰值并从一对应音谱峰值的频率座标决定各频率值。峰值的决定是藉在各时间-频率座标附近加以搜寻并选取比其邻近具较大值的点。更明确地说,如第2图中所示,将一音频试样210分析成在区域221和222表示高能量的频谱代表220。抽取与局部能量区221和222有关的资讯并将其摘要成一指纹物件231,232等的清单230。各指纹物件选择性地包含一位置栏242,一变动成份252,及一不变动成份262。最好,选取邻近区,使得各选取点在以其为中心的一21×21单位区内为最大。读者可参考以上所参考的美国专利申请案,更加讨论邻近区及点的选取。接着,对各对匹配的指纹物件,决定一相对值。在某些实施例中,相对值为各别音频试样参数值的对数商或差。然后产生一相对值的直方图。如果在直方图中发现一统计上的明显峰值,则两音频试样具实质上匹配的特性。
参考第三图,分别如音频试样1和2的以上说明,分别备制指纹物件清单310和320。从各清单比较各指纹物件311和322。在步骤351中,例如使用各不变动成份1NV和1NV′将匹配指纹物件配成对,并在步骤352中将其放在一清单中。在步骤353中,计算各匹配对的相对值。接着,在步骤354中,产生一相对值的直方图。在步骤355中,在直方图中搜寻一统计上的明显峰值。在步骤356中,如找不到,则音频试样1和2不匹配,例如为第4A图的直方图410。另外,如检测到一统计上的明显峰值,则音频试样1和2匹配,例如为第4B图的直方图420。
如第361步骤中的说明,通过直方图一轴上的一峰值位置提供一个全面相对值R的估算可进而加强上述技术。在某些实施例中,首先选取所关注峰值邻近区能将R细调。在第1图中,这以一特殊位置(t1,f1)附近的一关注区110表示。接着,计算所选取邻近区中的平均相对值。这平均值可为在所选取邻近区中以数点各相对值计算加权的平均值。在某些实施例中,能进而将R细调,对各匹配配对产生相对时间偏置值t′-R*t。以这些相对时间偏置值,步骤362-364表示产生一第二直方图,允许计算一补偿时间偏置。
例如,,为抽取指纹物件,例如为Wigner-Ville分布或子波,可实施其它种的时间-频率分析。而且,不用频谱图峰值,亦能使用例如为倒频谱系数的其它特性。而且,可使用超解析技术,得到由频谱峰值所提供的时间-频率座标的更细微频率和时间估算。例如,可使用有关频率箱的抛物线内插法增加频率解析度。在朱利亚斯(史密斯三世(JuliusO.Smith III)和萨比亚西拉(Xavier Serra)的″PARSHL:根据正弦波代表,对非谐和声音的分析/合成程式″,国际电脑音乐会议录(ICMC-87,东京),电脑音乐协会,1987,及Prentice Hall公司所出版由史提芬凱(Steren M.kay)(1988年元月)所著的″现代频谱估算:理论与应用″中可发现相关的典范教义,此处将后两者纳入参考。
匹配处理
在一匹配运算中,经由其各别指纹物件比较两音频试样。如以前参考第3图的讨论,产生匹配指纹物件配对,各配对实质上包含匹配成份。备置资料,允许快速搜寻的一种方式为将指纹物件编码成数值标识,如32位元无符号的整数,并使用数值标识作为储存和搜寻的关键。例如在艾迪生卫斯理(Addison Wesley)公司所出版,由唐纳欧文努斯(Donald Ervin Kmuth)(1998年4月7所著的″计算机程式规划技术,第3册:储存和搜寻(第2版)″中熟知有效资料处理技术,此处将其纳入参考。
在一典范实施例中,各指纹物件包含一不变动成份和一变动成份。不变动成份指的是对应于频谱峰值的频率值比率,而且在时间延长下,频谱峰值间的时间差(即,时间差距)比率不变动。例如,参考第5A和5B图,如音频试样频谱在座标(t1,f1),(t2,f2),和(t3,f3)是某些局部频谱峰值,则对于两点的不变动量为f1/f2,即f2′/f1′=f2/f1。额外3点的不变动量指定为f3/f1,(t3-t1)/(t2-t1),或(t3-t2)/(t2-t1),或藉变更这些点及/或计算这些数量或其组合的函数加以产生任何其它组合。例如,f2/f1除以f3/f1可以产生f2/f3。而且,如使音频试样线性延长,如只是快速播放,则频率和时间差额外地享受交互关系,故如f1*(t2-t1)的数量亦为不变动量。可使用这些数量的对数,以加减取代先进乘除。为探求频率和时间延长的比,假设他们无相依性,故具有一频率变动量和一时间变动量是必要的。为使匹配运算有效率,我们使用不变动部位编列指纹索引并使用近似或正确值加以搜寻。使用近似匹配加以搜寻允许某些特别强韧性,对抗失真及圆弧化误差,但如果搜寻不变动成份变成多维范围的搜寻则产生更多成本。在较佳实施例中,需要正确匹配各指纹物件的不变动成份,因此产生一非常快速的系统,为了噪讯存在的辨识而对敏感度有一些妥协。重要的是要注意甚至在对应的音频试样中,只有少数指纹物件正确匹配,则这方法亦运作良好。在直方图峰值侦测步骤中,甚至如果正确匹配并残存少如1-2%的指纹物件则在统计上明显有一峰值。
除了,或不用不变动成份外,亦能使用变动成份,减小匹配指纹物件的个数。例如,我们可能需要来自第一音频试样的一变动成份V在+/-20%内匹配来自第二音频试样的一对应成份V′。在那样情况中,我们可形成一数值标识代表,使得上部位(例如,最高有效位元)包含不变动成份,而下部位(例如,最低有效位元)包含变动成份。然后,搜寻一近似匹配变成在使用变动成份的最低和最高值组成的标识上作范围搜寻。如使用一变动成份完成搜寻,则因此未必严格需要在匹配运算时使用不变动成份。然后,建议在匹配程序中使用不变动成份,因它有助降低疑似匹配的个数,因此使直方图编程程序有效率并降低处理一般开销量。
另一方面,新变动成份本身可能是或不是两指纹物件间匹配准则的一部分。变动成份的代表值可因从一原始录音至一取样录音的某些参数变换而失真。例如,可选取如f1,f2,f3的频率变动成份以及如(t2-t1),(t3-t1)或(t3-t2)的时间变动成份作为播放速度的变动成份。假设第二音频试样,例如,授引自资料库的匹配试样有一座标为(t1′,f1′),(t2′,f2′)和(t3′,f3′)的频谱,这些座标对应于以第一音频试样所列的相同点。然后,频率成份f1′可能有一比例化的值f1=Rf*f1,其中,Rf为一线性延长参数,说明多快或多慢会将第一试样录音与第二试样录音比较。可使用各两匹配音频试样的变动成份藉两频率值Rf=f1′/f1间的比率加以计算说明一宏观参数的全面延长值的估算。这指定两匹配时间-频率点的相对音调比;例如,Rf=2意为第一音频试样为第二音频试样音调(频率)的半。另一可能性为使用Rt=(t2′-t1′)/(t2-t1).在这情况中,相对值R为相对播放速度比,即,Rt=2意为第一音频试样播放速度为第二音频试样的两倍。
如Rf=1/Rt,即,f′/f=(t2-t1)/(t2′-t1′),则由于这种音频试样的交互时间-频率关系,两音频试样有一线性时间延长关系。在这情况下,我们可使用此处所发表的直方图编程法,形成估算利用对应变动频率成份的相对频率比Rf,且再次开成估算相对播放速度Rt,然后实施比较加以侦测播放关系是否为线性或非线性。
通常,利用来自第一和第二音频试样的对应变动成份,从所匹配的指纹物件加以计算一相对值。相对值可为频率的简单比或时间差,或造成估算用以说明第一与第二音频试样间映射的全面参数的某些其它函数。但通常可使用例如为R=F(v1,v1′)的任何两个输入的函数F(),其中,v1和v1′各为变动量。最佳者为F()为一连续函数,使得测量v1和v1′时的小误差在输出R形成小误差。
直方图编程
如此处的说明,对从指纹物件的匹配配对清单所计算的相对值组产生一直方图。然后在直方图中搜寻一峰值。直方图中,统计上存在的明显峰值表示已发生可能的匹配。这种方法不用如(t1′-t′)的时间偏置差,而在直方图中特别搜寻相对值的集业。
根据本发明的原理,直方图的作用在形成计数值箱,各箱相当于沿着直方图独立轴的一特定值。为达本发明的目的,直方图的产生可就对相对值清单的分类加以完成。因此,侦测相对值清单的直方图峰值的一种快速和有效方式为将清单由小至大分类,然后筛检找出具相同或类似值的最大块项目。
统计意义
如本发明此处的讨论,甚至假如只有少至2%的指纹物件幸免于所有失真并匹配无误时,两音频试样亦能匹配无误。通过记下两音频试样间的比较刻痕,这是可能的。明确地说,在直方图峰值附近选取一邻近区并计数落在邻近区中的所有匹配配对,记下刻痕。此外,可计算权重点数,扣减离峰值中心较远的配对的贡献。
估算截止准则的一种方式为假设非匹配音轨刻痕的概率分布以指数末尾往下掉。将这图样套用在实际所测量的非匹配音轨刻痕分布。接着,对于一N音轨资料库,计标最高刻痕的累积概率分布(例如,取一单一非匹配刻痕的累积概率分布的第N阶指数)。一旦知道概率曲线并选取为正量的一最大位准时(例如,0.5%),即可选取一数字临界值并用以决定直方图峰值的匹配配对是否有一统计上明显的个数。
超精细估算
一旦找到一统计上明显的直方图峰值,则可计算全面相对值(如相对播放速度)的高解析″超精细″估算。这种计算的完成是通过在峰值附近选取一邻近区,例如,包含离峰值直方图箱中心约3或5箱宽的间隔,并计算邻近区中的平均相对值。使用这种技术,我们可发现准确性达0.05%内的相对播放速度。以此处所发表的偏置衍生,可以优于1ms的准确性估算全面的时间偏置,该准确性比以上所讨论的频谱音框的时间解析更精细。
强力回归分析
如以上所参考的美国专利申请案中的讨论,在试样真正匹配的情况中,如第6A图中所示,在匹配试样的匹配指纹物件的对应时间座标(t′,t)彼此相对所划的分布图中可看到一斜线。难题是在高噪讯量存在中找寻回归方程式,它是由斜线的斜率和偏置所决定的。斜率表示相对播放速度,而偏置为一音频试样一开始对第二音频试样一开始的相对偏置。习知上有如最低均方调和的回归技术,例如,为威廉培斯(WilliamHo Press),布莱恩佛莱纳利(Brian P.Flannery),沙乌提可夫斯基(Saul A.Tenkolsky),及威廉维特宁(William T.VeHerling)(1993年元月)在剑桥大学校刊所著的″C写成的数值秘笈:科学计算的技术(第二版)″,此处将该文纳入参考。不幸地,习知技术苦于不相称的敏感度,其中,单一的远局外物可使所估算的回归参数急剧倾斜。实际上,相对点常由局外物主导,使其非常难以检测正确斜线。强力回归分析的其它技术可用以克服局外问题,在噪讯存在的相对点的间找到线性关系,但这些倾向于缓慢与反复且在局部最佳化中可能卡住。在找寻一未知线性回归变数的文献中存在广大各种技术。从数学作品(Mathworks)及此处所纳入参考的Matlab工具帮手包含回归分析用的各种软体常规。
本发明提供估算相对播放速度(或,在线性播放关系情况下,对等地为相对音调的例数)的发明方法,该方法解决问题,甚至假如匹配的斜率不等于1时,如第6B图,在时间-时间分布图中找到一回归线。如此处的发表,使用局部相对播放速度的直方图,利用先前未考虑的资讯并提供快速且有效解决回归分析问题的未预期优点。
为找寻偏置,假设对应的时间点具下列关系
                        偏置=t1-Rt*t1,
其中,Rt由先前的讨论求得。这为补偿的时间偏置且作用在使两音频试样间的时间座标系统正常化。这在如构成第7A图中未知斜率的斜线及第7C图中垂直线的时间-时间分布图的剪切变换亦可看到。第7B图的直方图720说明表示全面相对播放速度比R的累积相对播放速度比峰值。然后由偏置公式指定新相对值,如第7D图中所见到的,产生一新的直方图740。新直方图740的峰值指定全面偏置的估算值,如上述,利用峰值邻近区的平均值,该估算值可会是陡峭的。
简言之,第一直方图编程阶段提供一种方式加以估算相对播放速度,以及决定是否存在匹配。第二直方图编程阶段确信候选匹配音频试样有明显个数的亦暂时对齐的指纹物件。第二直方图编程阶段亦作为一第二独立筛检准则并有助降低伪正量的概率,因此提供较有力准则加以决定两音频试样是否匹配。只在第一直方图中如有一统计上的明显峰值时可选择实施第二直方图的编程阶段,因此节省计算资源和努力。可选择实施进一步的最佳化,例如,降低计算上的混乱,不用对清单上匹配指纹物件的所有配对计算第二直方图,第二直方图可只使用对应于第一直方图峰值的匹配对加以产生。
多重录间的同步处理
本发明的执行可用以对非同步的音频录音加入旁白及时间校准。例如,假设在稍微不同位置或环境,以不同麦克风独立操作一DAT录音机和一卡带录音机。如稍后预期要从各别录音机将两段录音组合成一段混音,则可使用此处说明的强力回归分析技术两音轨同步化,得到时间偏置。照这样,甚至假如非同步化的录音机以稍微不同速度操作时可以高度准确性决定相对速度,允许参考另一段录音补偿一段录音。如发现其中一段录音已损毁且需从另一音源加以补遗时,这尤其有用。如此处说明的时间校准和同步化因此允许透通性混音。
资料库搜寻
因比较方法极快速,可能要将一大资料库的变频试样预先处理成各别的指纹物件清单。因一娴熟此技术者会认知到,使用目前可用的资料处理技术因此可将一未知音频试样预先处理成指纹物件的其本身各别清单。使用资料库中预先处理的指纹物件,然后可实施上述的匹配,直方图编程,及峰值检测技术加以找寻匹配。
虽然已详细说明本发明及其优点,应了解的是本发明并不限于或被界定成此处所表示者或所讨论者。尤其是,此处所发表的图示和说明以图例解释有关本发明的技术,表示本发明的实例,并提供利用本发明的实例且不可推断为使本发明受到限制。已知的方法,技术,或系统可不详细加以讨论,故能避免模糊本发明的原理。因以技术中的其中一项平常技能将会认知到,只要不偏离本发明的原理和精神,对本发明可加以实施,修饰,或另外改变。例如,可以在电脑可读取媒体中具体的电脑可执行指令的形式加以实施或另外实现此处所说明的方法,技术,和步骤。另外,本发明可在一具有客户终端和伺服器的电脑系统中加以实施。客户终端传送第一和第二音频试样间关系特征所需的,例如,为指纹物件的资讯至表现特征的伺服器处。因此,应以下列请求项目及其法律上的等效请求项加以决定发明范围。

Claims (16)

1.一种具备一个第一和一个第二音频试样之间关系特征的音频图样匹配的方法,它包含以下步骤:
产生第一音频试样的第一组指纹物件,各指纹物件发生在第一音频试样内的一各别位置,各别位置的决定依第一音频试样内容而定,且各指纹物件在或接近各别位置处具备第一音频试样的一个或更多特性;
产生第二音频试样的第二组指纹物件,各指纹物件发生在第二音频试样内的一各别位置,各别位置的决定依第二音频试样内容而定,且各指纹物件在或接近各别位置处具备第二音频试样的一个或更多特性;
通过使来自第一音频试样的第一指纹物件和来自第二音频试样实质上类似于第一指纹物件的第二指纹物件相匹配而使指纹物件配成对;
根据配对步骤产生所匹配指纹物件的配对清单;
决定各对匹配指纹物件的相对值;
产生一幅相对值的直方图;以及
在直方图中搜寻一个统计上的明显峰值,该峰值具备第一和第二音频试样间关系的特征。
2.如权利要求1所述的方法,其特征是:如发现一个统计上的明显峰值时则第一和第二音频试样之间关系具备实质上匹配的特征。
3.如权利要求1或2所述的方法,其特征是:它进一步包含有以直方图轴上一峰值位置加以估算一全面相对值的步骤,全面相对值更具备第一和第二音频试样间关系的特征。
4.如权利要求3所述的方法,它进一步包含有超精细估算全面相对值的决定步骤,其特征是,其中的决定步骤包含有:
在峰值附近选取一邻近区域,以及
在邻近区域中计算一平均相对值。
5.如权利要求1所述的方法,其特征是:各指纹物件具有一不变成份,而各对匹配的指纹物件中的第一和第二指纹物件具有匹配的不变成份。
6.如权利要求5所述的方法,其特征是:使用至少以下的一种方法来产生不变动成份:
①一个第一和一个第二频率值之间的比率,从接近各指纹物件各别位置的第一和第二局部特性分别决定各频率值;
②一个频率值和一个时间差值间的乘积,从第一局部特性决定频率值,并在接近各指纹物件各别位置的第一局部特性和第二局部特性之间决定时间差值;以及
③一个第一和一个第二时间差值之间的比率从第一和第二局部特性决定第一时间差值,从第一和第三局部特性决定第二时间差值,各局部特性接近各指纹物件的各别位置。
7.如权利要求6所述的方法,其特征是:各局部特性为一频谱峰值,并从一对应频谱峰值的一频率坐标来决定各频率值。
8.如权利要求1或5所述的方法,其特征是:各指纹物件具有一变动成份,并利用第一与第二指纹物件的各别变动成份加以决定各对匹配指纹物件的相对值。
9.如权利要求8所述的方法,其特征是:变动成份为从接近各指纹物件的各别位置的一局部特性所决定的频率值,使得一对匹配指纹物件的相对值具备第一和第二指纹物件各别频率值比率的特征,且直方图中的峰值具备第一和第二音频试样间关系的特征而第一和第二音频试样具备相对音调的特征,或在线性延长的情况中为相对播放速度的特征。
10.如权利要求9所述的方法,其特征是:各别频率值的比率既可以是对数除法也可以是对数减法的演算。
11.如权利要求9所述的方法,其特征是:各局部特性为一频谱峰值,并从一对应
频谱峰值的一频率坐标决定各频率值。
12.如权利要求8所述的方法,其特征是:变动成份为从接近各指纹物件的各别位置的第一和第二局部特性所决定的时间差值,使得一对匹配指纹物件的相对值具备各别变动时间差值比率的特征,且直方图中的峰值具备第一和第二音频试样间关系的特征,而第一和第二音频试样具备相对播放速度的特征,或在线性延长的情况中为相对音调的特征。
13.如权利要求12所述的方法,其特征是:各别变动的时间差值的比率既可以是对数除法也可以是对数减法的演算。
14.如权利要求12所述的方法,其特征是:各局部特性为一频谱峰值,并从一个对应频谱峰值的一个频率坐标确定各频率值。
15.如权利要求8所述的方法,它进一步包含以下步骤:
利用各别变动成份决定第一和第二音频试样的一相对音调,其特征是:各变动成分为从接近各指纹物件各别位置的一局部特征所决定的频率值;
利用各别变动成份决定第一和第二音频试样的一相对速度,其特征是:各变动成分为从接近各指纹物件各别位置的一第二局部特性所决定的时间差值;以及
检测相对播放速度的相对音调和倒数实质上是否相异,在这情况下,第一和第二音频试样间的关系具备非线性的特征。
16.如权利要求1所述的方法,其特征是:R为一从相对值直方图的峰值所决定的相对播放速度值,它进一步包含以下步骤:
对于清单中的各对匹配指纹物件,决定一所补偿相对时间偏置值,t-R*t′,其特征是,t和t′为有关第一和第二指纹物件的时间位置;
产生所补偿相对时间偏置值的第二直方图;以及
在所补偿相对时间偏置值的第二直方图中搜寻一统计上的明显峰值,该峰值更具备第一和第二音频试样间关系的特征。
CNB038089386A 2002-04-25 2003-04-18 坚固而且不变的音频图样匹配 Expired - Fee Related CN1315110C (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US37605502P 2002-04-25 2002-04-25
US60/376,055 2002-04-25

Publications (2)

Publication Number Publication Date
CN1647160A CN1647160A (zh) 2005-07-27
CN1315110C true CN1315110C (zh) 2007-05-09

Family

ID=29270756

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB038089386A Expired - Fee Related CN1315110C (zh) 2002-04-25 2003-04-18 坚固而且不变的音频图样匹配

Country Status (16)

Country Link
US (1) US7627477B2 (zh)
EP (1) EP1504445B1 (zh)
JP (1) JP4425126B2 (zh)
KR (1) KR100820385B1 (zh)
CN (1) CN1315110C (zh)
AT (1) ATE405924T1 (zh)
AU (1) AU2003230993A1 (zh)
BR (1) BR0309598A (zh)
CA (1) CA2483104C (zh)
DE (1) DE60323086D1 (zh)
DK (1) DK1504445T3 (zh)
ES (1) ES2312772T3 (zh)
HK (1) HK1073382A1 (zh)
PT (1) PT1504445E (zh)
TW (1) TWI269196B (zh)
WO (1) WO2003091990A1 (zh)

Families Citing this family (284)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6834308B1 (en) 2000-02-17 2004-12-21 Audible Magic Corporation Method and apparatus for identifying media content presented on a media playing device
US7853664B1 (en) * 2000-07-31 2010-12-14 Landmark Digital Services Llc Method and system for purchasing pre-recorded music
US6990453B2 (en) 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
US7562012B1 (en) 2000-11-03 2009-07-14 Audible Magic Corporation Method and apparatus for creating a unique audio signature
US7363278B2 (en) 2001-04-05 2008-04-22 Audible Magic Corporation Copyright detection and protection system and method
US7529659B2 (en) 2005-09-28 2009-05-05 Audible Magic Corporation Method and apparatus for identifying an unknown work
US7877438B2 (en) 2001-07-20 2011-01-25 Audible Magic Corporation Method and apparatus for identifying new media content
US8972481B2 (en) 2001-07-20 2015-03-03 Audible Magic, Inc. Playlist generation method and apparatus
US7239981B2 (en) 2002-07-26 2007-07-03 Arbitron Inc. Systems and methods for gathering audience measurement data
US8959016B2 (en) 2002-09-27 2015-02-17 The Nielsen Company (Us), Llc Activating functions in processing devices using start codes embedded in audio
US9711153B2 (en) 2002-09-27 2017-07-18 The Nielsen Company (Us), Llc Activating functions in processing devices using encoded audio and detecting audio signatures
MXPA05007001A (es) 2002-12-27 2005-11-23 Nielsen Media Res Inc Metodos y aparatos para transcodificar metadatos.
US8332326B2 (en) 2003-02-01 2012-12-11 Audible Magic Corporation Method and apparatus to identify a work received by a processing system
WO2005006758A1 (en) 2003-07-11 2005-01-20 Koninklijke Philips Electronics N.V. Method and device for generating and detecting a fingerprint functioning as a trigger marker in a multimedia signal
JP2006528859A (ja) * 2003-07-25 2006-12-21 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ オーディオとビデオを同期させるための指紋生成及び検出の方法及び装置
US7884274B1 (en) 2003-11-03 2011-02-08 Wieder James W Adaptive personalized music and entertainment
US8396800B1 (en) 2003-11-03 2013-03-12 James W. Wieder Adaptive personalized music and entertainment
US11165999B1 (en) 2003-11-03 2021-11-02 Synergyze Technologies Llc Identifying and providing compositions and digital-works
US8001612B1 (en) 2003-11-03 2011-08-16 Wieder James W Distributing digital-works and usage-rights to user-devices
US20150128039A1 (en) 2003-11-03 2015-05-07 James W. Wieder Newness Control of a Personalized Music and/or Entertainment Sequence
US9053181B2 (en) 2003-11-03 2015-06-09 James W. Wieder Adaptive personalized playback or presentation using count
US9053299B2 (en) 2003-11-03 2015-06-09 James W. Wieder Adaptive personalized playback or presentation using rating
US9098681B2 (en) 2003-11-03 2015-08-04 James W. Wieder Adaptive personalized playback or presentation using cumulative time
US8554681B1 (en) * 2003-11-03 2013-10-08 James W. Wieder Providing “identified” compositions and digital-works
WO2005064885A1 (fr) * 2003-11-27 2005-07-14 Advestigo Systeme d'interception de documents multimedias
CA2556552C (en) 2004-02-19 2015-02-17 Landmark Digital Services Llc Method and apparatus for identification of broadcast source
WO2005101998A2 (en) 2004-04-19 2005-11-03 Landmark Digital Services Llc Content sampling and identification
US20150051967A1 (en) 2004-05-27 2015-02-19 Anonymous Media Research, Llc Media usage monitoring and measurment system and method
US20050267750A1 (en) 2004-05-27 2005-12-01 Anonymous Media, Llc Media usage monitoring and measurement system and method
US7739062B2 (en) 2004-06-24 2010-06-15 Landmark Digital Services Llc Method of characterizing the overlap of two media segments
US8130746B2 (en) 2004-07-28 2012-03-06 Audible Magic Corporation System for distributing decoy content in a peer to peer network
US7623823B2 (en) 2004-08-31 2009-11-24 Integrated Media Measurement, Inc. Detecting and measuring exposure to media content items
DE102004046746B4 (de) 2004-09-27 2007-03-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Verfahren zum Synchronisieren von Zusatzdaten und Basisdaten
JP5150266B2 (ja) 2005-02-08 2013-02-20 ランドマーク、ディジタル、サーヴィセズ、エルエルシー オーディオ信号において繰り返されるマテリアルの自動識別
DE102005014477A1 (de) * 2005-03-30 2006-10-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Erzeugen eines Datenstroms und zum Erzeugen einer Multikanal-Darstellung
US20070016918A1 (en) * 2005-05-20 2007-01-18 Alcorn Allan E Detecting and tracking advertisements
US11386139B2 (en) 2005-10-26 2022-07-12 Cortica Ltd. System and method for generating analytics for entities depicted in multimedia content
US10380267B2 (en) 2005-10-26 2019-08-13 Cortica, Ltd. System and method for tagging multimedia content elements
US11032017B2 (en) 2005-10-26 2021-06-08 Cortica, Ltd. System and method for identifying the context of multimedia content elements
US8266185B2 (en) 2005-10-26 2012-09-11 Cortica Ltd. System and methods thereof for generation of searchable structures respective of multimedia data content
US9747420B2 (en) 2005-10-26 2017-08-29 Cortica, Ltd. System and method for diagnosing a patient based on an analysis of multimedia content
US9384196B2 (en) 2005-10-26 2016-07-05 Cortica, Ltd. Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof
US9646005B2 (en) 2005-10-26 2017-05-09 Cortica, Ltd. System and method for creating a database of multimedia content elements assigned to users
US9256668B2 (en) 2005-10-26 2016-02-09 Cortica, Ltd. System and method of detecting common patterns within unstructured data elements retrieved from big data sources
US10607355B2 (en) 2005-10-26 2020-03-31 Cortica, Ltd. Method and system for determining the dimensions of an object shown in a multimedia content item
US10372746B2 (en) 2005-10-26 2019-08-06 Cortica, Ltd. System and method for searching applications using multimedia content elements
US10949773B2 (en) 2005-10-26 2021-03-16 Cortica, Ltd. System and methods thereof for recommending tags for multimedia content elements based on context
US10380164B2 (en) 2005-10-26 2019-08-13 Cortica, Ltd. System and method for using on-image gestures and multimedia content elements as search queries
US9031999B2 (en) 2005-10-26 2015-05-12 Cortica, Ltd. System and methods for generation of a concept based database
US11403336B2 (en) 2005-10-26 2022-08-02 Cortica Ltd. System and method for removing contextually identical multimedia content elements
US10180942B2 (en) 2005-10-26 2019-01-15 Cortica Ltd. System and method for generation of concept structures based on sub-concepts
US9235557B2 (en) 2005-10-26 2016-01-12 Cortica, Ltd. System and method thereof for dynamically associating a link to an information resource with a multimedia content displayed in a web-page
US10621988B2 (en) 2005-10-26 2020-04-14 Cortica Ltd System and method for speech to text translation using cores of a natural liquid architecture system
US8312031B2 (en) 2005-10-26 2012-11-13 Cortica Ltd. System and method for generation of complex signatures for multimedia data content
US10691642B2 (en) 2005-10-26 2020-06-23 Cortica Ltd System and method for enriching a concept database with homogenous concepts
US10742340B2 (en) 2005-10-26 2020-08-11 Cortica Ltd. System and method for identifying the context of multimedia content elements displayed in a web-page and providing contextual filters respective thereto
US20160321253A1 (en) 2005-10-26 2016-11-03 Cortica, Ltd. System and method for providing recommendations based on user profiles
US9639532B2 (en) 2005-10-26 2017-05-02 Cortica, Ltd. Context-based analysis of multimedia content items using signatures of multimedia elements and matching concepts
US11003706B2 (en) 2005-10-26 2021-05-11 Cortica Ltd System and methods for determining access permissions on personalized clusters of multimedia content elements
US9087049B2 (en) 2005-10-26 2015-07-21 Cortica, Ltd. System and method for context translation of natural language
US11361014B2 (en) 2005-10-26 2022-06-14 Cortica Ltd. System and method for completing a user profile
US9396435B2 (en) 2005-10-26 2016-07-19 Cortica, Ltd. System and method for identification of deviations from periodic behavior patterns in multimedia content
US10193990B2 (en) 2005-10-26 2019-01-29 Cortica Ltd. System and method for creating user profiles based on multimedia content
US9372940B2 (en) 2005-10-26 2016-06-21 Cortica, Ltd. Apparatus and method for determining user attention using a deep-content-classification (DCC) system
US9330189B2 (en) 2005-10-26 2016-05-03 Cortica, Ltd. System and method for capturing a multimedia content item by a mobile device and matching sequentially relevant content to the multimedia content item
US9558449B2 (en) 2005-10-26 2017-01-31 Cortica, Ltd. System and method for identifying a target area in a multimedia content element
US9191626B2 (en) 2005-10-26 2015-11-17 Cortica, Ltd. System and methods thereof for visual analysis of an image on a web-page and matching an advertisement thereto
US9466068B2 (en) 2005-10-26 2016-10-11 Cortica, Ltd. System and method for determining a pupillary response to a multimedia data element
US9477658B2 (en) 2005-10-26 2016-10-25 Cortica, Ltd. Systems and method for speech to speech translation using cores of a natural liquid architecture system
US8818916B2 (en) * 2005-10-26 2014-08-26 Cortica, Ltd. System and method for linking multimedia data elements to web pages
US9218606B2 (en) 2005-10-26 2015-12-22 Cortica, Ltd. System and method for brand monitoring and trend analysis based on deep-content-classification
US10191976B2 (en) 2005-10-26 2019-01-29 Cortica, Ltd. System and method of detecting common patterns within unstructured data elements retrieved from big data sources
US10848590B2 (en) 2005-10-26 2020-11-24 Cortica Ltd System and method for determining a contextual insight and providing recommendations based thereon
US10360253B2 (en) 2005-10-26 2019-07-23 Cortica, Ltd. Systems and methods for generation of searchable structures respective of multimedia data content
US10380623B2 (en) 2005-10-26 2019-08-13 Cortica, Ltd. System and method for generating an advertisement effectiveness performance score
US11019161B2 (en) 2005-10-26 2021-05-25 Cortica, Ltd. System and method for profiling users interest based on multimedia content analysis
US10535192B2 (en) 2005-10-26 2020-01-14 Cortica Ltd. System and method for generating a customized augmented reality environment to a user
US9529984B2 (en) 2005-10-26 2016-12-27 Cortica, Ltd. System and method for verification of user identification based on multimedia content elements
US9489431B2 (en) 2005-10-26 2016-11-08 Cortica, Ltd. System and method for distributed search-by-content
US9953032B2 (en) 2005-10-26 2018-04-24 Cortica, Ltd. System and method for characterization of multimedia content signals using cores of a natural liquid architecture system
IL185414A0 (en) * 2005-10-26 2008-01-06 Igal Raichelgauz Large-scale matching system and method for multimedia deep-content-classification
US10585934B2 (en) 2005-10-26 2020-03-10 Cortica Ltd. Method and system for populating a concept database with respect to user identifiers
US11216498B2 (en) 2005-10-26 2022-01-04 Cortica, Ltd. System and method for generating signatures to three-dimensional multimedia data elements
US10614626B2 (en) 2005-10-26 2020-04-07 Cortica Ltd. System and method for providing augmented reality challenges
US11604847B2 (en) 2005-10-26 2023-03-14 Cortica Ltd. System and method for overlaying content on a multimedia content element based on user interest
US10698939B2 (en) 2005-10-26 2020-06-30 Cortica Ltd System and method for customizing images
US8326775B2 (en) 2005-10-26 2012-12-04 Cortica Ltd. Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof
US9767143B2 (en) 2005-10-26 2017-09-19 Cortica, Ltd. System and method for caching of concept structures
US10635640B2 (en) 2005-10-26 2020-04-28 Cortica, Ltd. System and method for enriching a concept database
US10387914B2 (en) 2005-10-26 2019-08-20 Cortica, Ltd. Method for identification of multimedia content elements and adding advertising content respective thereof
US9286623B2 (en) 2005-10-26 2016-03-15 Cortica, Ltd. Method for determining an area within a multimedia content element over which an advertisement can be displayed
US10776585B2 (en) 2005-10-26 2020-09-15 Cortica, Ltd. System and method for recognizing characters in multimedia content
US7688686B2 (en) 2005-10-27 2010-03-30 Microsoft Corporation Enhanced table of contents (TOC) identifiers
GB2431839B (en) 2005-10-28 2010-05-19 Sony Uk Ltd Audio processing
KR100803206B1 (ko) 2005-11-11 2008-02-14 삼성전자주식회사 오디오 지문 생성과 오디오 데이터 검색 장치 및 방법
EP2070231B1 (en) 2006-10-03 2013-07-03 Shazam Entertainment, Ltd. Method for high throughput of identification of distributed broadcast content
CN101641674B (zh) 2006-10-05 2012-10-10 斯普兰克公司 时间序列搜索引擎
US10733326B2 (en) 2006-10-26 2020-08-04 Cortica Ltd. System and method for identification of inappropriate multimedia content
US20080317226A1 (en) * 2007-01-09 2008-12-25 Freescale Semiconductor, Inc. Handheld device for transmitting a visual format message
US8077839B2 (en) * 2007-01-09 2011-12-13 Freescale Semiconductor, Inc. Handheld device for dialing of phone numbers extracted from a voicemail
US10489795B2 (en) 2007-04-23 2019-11-26 The Nielsen Company (Us), Llc Determining relative effectiveness of media content items
US8849432B2 (en) * 2007-05-31 2014-09-30 Adobe Systems Incorporated Acoustic pattern identification using spectral characteristics to synchronize audio and/or video
US8140331B2 (en) * 2007-07-06 2012-03-20 Xia Lou Feature extraction for identification and classification of audio signals
US8006314B2 (en) 2007-07-27 2011-08-23 Audible Magic Corporation System for identifying content of digital data
US8213521B2 (en) * 2007-08-15 2012-07-03 The Nielsen Company (Us), Llc Methods and apparatus for audience measurement using global signature representation and matching
US8468014B2 (en) * 2007-11-02 2013-06-18 Soundhound, Inc. Voicing detection modules in a system for automatic transcription of sung or hummed melodies
CN101226741B (zh) * 2007-12-28 2011-06-15 无敌科技(西安)有限公司 一种活动语音端点的侦测方法
DE102008009025A1 (de) * 2008-02-14 2009-08-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Berechnen eines Fingerabdrucks eines Audiosignals, Vorrichtung und Verfahren zum Synchronisieren und Vorrichtung und Verfahren zum Charakterisieren eines Testaudiosignals
DE102008009024A1 (de) * 2008-02-14 2009-08-27 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum synchronisieren von Mehrkanalerweiterungsdaten mit einem Audiosignal und zum Verarbeiten des Audiosignals
GB2457694B (en) * 2008-02-21 2012-09-26 Snell Ltd Method of Deriving an Audio-Visual Signature
CA2897276C (en) 2008-03-10 2017-11-28 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Device and method for manipulating an audio signal having a transient event
GB2458471A (en) * 2008-03-17 2009-09-23 Taylor Nelson Sofres Plc A signature generating device for an audio signal and associated methods
EP2114079B2 (en) 2008-05-02 2018-01-24 Psytechnics Ltd Method and apparatus for aligning signals
JP2010033265A (ja) 2008-07-28 2010-02-12 Nec Corp コンテンツ配信方法およびシステム
US8121830B2 (en) 2008-10-24 2012-02-21 The Nielsen Company (Us), Llc Methods and apparatus to extract data encoded in media content
US9667365B2 (en) 2008-10-24 2017-05-30 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US8359205B2 (en) 2008-10-24 2013-01-22 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US8508357B2 (en) 2008-11-26 2013-08-13 The Nielsen Company (Us), Llc Methods and apparatus to encode and decode audio for shopper location and advertisement presentation tracking
US8199651B1 (en) 2009-03-16 2012-06-12 Audible Magic Corporation Method and system for modifying communication flows at a port level
US8738367B2 (en) * 2009-03-18 2014-05-27 Nec Corporation Speech signal processing device
US8351712B2 (en) 2009-04-27 2013-01-08 The Neilsen Company (US), LLC Methods and apparatus to perform image classification based on pseudorandom features
JP2012525655A (ja) 2009-05-01 2012-10-22 ザ ニールセン カンパニー (ユー エス) エルエルシー 一次ブロードキャストメディアコンテンツに関連する二次コンテンツを提供するための方法、機器、及び製造品
GB2470201A (en) * 2009-05-12 2010-11-17 Nokia Corp Synchronising audio and image data
US8687839B2 (en) 2009-05-21 2014-04-01 Digimarc Corporation Robust signatures derived from local nonlinear filters
US8489774B2 (en) 2009-05-27 2013-07-16 Spot411 Technologies, Inc. Synchronized delivery of interactive content
US8718805B2 (en) * 2009-05-27 2014-05-06 Spot411 Technologies, Inc. Audio-based synchronization to media
US9449090B2 (en) 2009-05-29 2016-09-20 Vizio Inscape Technologies, Llc Systems and methods for addressing a media database using distance associative hashing
US10949458B2 (en) 2009-05-29 2021-03-16 Inscape Data, Inc. System and method for improving work load management in ACR television monitoring system
US9071868B2 (en) 2009-05-29 2015-06-30 Cognitive Networks, Inc. Systems and methods for improving server and client performance in fingerprint ACR systems
US8595781B2 (en) 2009-05-29 2013-11-26 Cognitive Media Networks, Inc. Methods for identifying video segments and displaying contextual targeted content on a connected television
US8190663B2 (en) * 2009-07-06 2012-05-29 Osterreichisches Forschungsinstitut Fur Artificial Intelligence Der Osterreichischen Studiengesellschaft Fur Kybernetik Of Freyung Method and a system for identifying similar audio tracks
WO2011009946A1 (en) 2009-07-24 2011-01-27 Johannes Kepler Universität Linz A method and an apparatus for deriving information from an audio track and determining similarity between audio tracks
US20110041154A1 (en) * 2009-08-14 2011-02-17 All Media Guide, Llc Content Recognition and Synchronization on a Television or Consumer Electronics Device
US8677400B2 (en) 2009-09-30 2014-03-18 United Video Properties, Inc. Systems and methods for identifying audio content using an interactive media guidance application
US8161071B2 (en) 2009-09-30 2012-04-17 United Video Properties, Inc. Systems and methods for audio asset storage and management
US8706276B2 (en) 2009-10-09 2014-04-22 The Trustees Of Columbia University In The City Of New York Systems, methods, and media for identifying matching audio
US8521779B2 (en) 2009-10-09 2013-08-27 Adelphoi Limited Metadata record generation
US9197736B2 (en) * 2009-12-31 2015-11-24 Digimarc Corporation Intuitive computing methods and systems
US8121618B2 (en) 2009-10-28 2012-02-21 Digimarc Corporation Intuitive computing methods and systems
US8860883B2 (en) * 2009-11-30 2014-10-14 Miranda Technologies Partnership Method and apparatus for providing signatures of audio/video signals and for making use thereof
US8682145B2 (en) 2009-12-04 2014-03-25 Tivo Inc. Recording system based on multimedia content fingerprints
US8886531B2 (en) * 2010-01-13 2014-11-11 Rovi Technologies Corporation Apparatus and method for generating an audio fingerprint and using a two-stage query
US9159338B2 (en) 2010-05-04 2015-10-13 Shazam Entertainment Ltd. Systems and methods of rendering a textual animation
KR101490576B1 (ko) 2010-05-04 2015-02-11 샤잠 엔터테인먼트 리미티드 미디어의 동기화 방법 및 시스템
WO2011140269A1 (en) 2010-05-04 2011-11-10 Shazam Entertainment Ltd. Methods and systems for processing a sample of a media stream
ES2488719T3 (es) 2010-06-09 2014-08-28 Adelphoi Limited Sistema y método para el reconocimiento de medios de audio
US9876905B2 (en) 2010-09-29 2018-01-23 Genesys Telecommunications Laboratories, Inc. System for initiating interactive communication in response to audio codes
CA2856496A1 (en) * 2010-11-22 2012-05-31 Listening Methods, Llc System and method for pattern recognition and analysis
KR20140038374A (ko) 2011-02-18 2014-03-28 샤잠 엔터테인먼트 리미티드 클라이언트 장치에 의해 데이터 스트림 내 콘텐트를 식별하는 방법 및 시스템
US8589171B2 (en) 2011-03-17 2013-11-19 Remote Media, Llc System and method for custom marking a media file for file matching
US8688631B2 (en) 2011-03-17 2014-04-01 Alexander Savenok System and method for media file synchronization
US8478719B2 (en) 2011-03-17 2013-07-02 Remote Media LLC System and method for media file synchronization
US9380356B2 (en) 2011-04-12 2016-06-28 The Nielsen Company (Us), Llc Methods and apparatus to generate a tag for media content
US8996557B2 (en) 2011-05-18 2015-03-31 Microsoft Technology Licensing, Llc Query and matching for content recognition
US9286909B2 (en) 2011-06-06 2016-03-15 Bridge Mediatech, S.L. Method and system for robust audio hashing
US20120317241A1 (en) 2011-06-08 2012-12-13 Shazam Entertainment Ltd. Methods and Systems for Performing Comparisons of Received Data and Providing a Follow-On Service Based on the Comparisons
EP2718849A1 (en) 2011-06-10 2014-04-16 Shazam Entertainment Ltd. Methods and systems for identifying content in a data stream
US9210208B2 (en) 2011-06-21 2015-12-08 The Nielsen Company (Us), Llc Monitoring streaming media content
US9209978B2 (en) 2012-05-15 2015-12-08 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
US8620646B2 (en) * 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US8639178B2 (en) 2011-08-30 2014-01-28 Clear Channel Management Sevices, Inc. Broadcast source identification based on matching broadcast signal fingerprints
US9461759B2 (en) 2011-08-30 2016-10-04 Iheartmedia Management Services, Inc. Identification of changed broadcast media items
US9374183B2 (en) 2011-08-30 2016-06-21 Iheartmedia Management Services, Inc. Broadcast source identification based on matching via bit count
US9049496B2 (en) * 2011-09-01 2015-06-02 Gracenote, Inc. Media source identification
US9113202B1 (en) * 2011-09-21 2015-08-18 Google Inc. Inverted client-side fingerprinting and matching
US9460465B2 (en) 2011-09-21 2016-10-04 Genesys Telecommunications Laboratories, Inc. Graphical menu builder for encoding applications in an image
US9384272B2 (en) 2011-10-05 2016-07-05 The Trustees Of Columbia University In The City Of New York Methods, systems, and media for identifying similar songs using jumpcodes
US8831763B1 (en) * 2011-10-18 2014-09-09 Google Inc. Intelligent interest point pruning for audio matching
US8977194B2 (en) 2011-12-16 2015-03-10 The Nielsen Company (Us), Llc Media exposure and verification utilizing inductive coupling
US8538333B2 (en) 2011-12-16 2013-09-17 Arbitron Inc. Media exposure linking utilizing bluetooth signal characteristics
US9268845B1 (en) * 2012-03-08 2016-02-23 Google Inc. Audio matching using time alignment, frequency alignment, and interest point overlap to filter false positives
JP2013205830A (ja) * 2012-03-29 2013-10-07 Sony Corp トーン成分検出方法、トーン成分検出装置およびプログラム
EP2648418A1 (en) * 2012-04-05 2013-10-09 Thomson Licensing Synchronization of multimedia streams
US9235867B2 (en) * 2012-06-04 2016-01-12 Microsoft Technology Licensing, Llc Concurrent media delivery
US9129015B1 (en) * 2012-06-26 2015-09-08 Google Inc. Min/max filter for audio matching
US9282366B2 (en) 2012-08-13 2016-03-08 The Nielsen Company (Us), Llc Methods and apparatus to communicate audience measurement information
US20140074466A1 (en) * 2012-09-10 2014-03-13 Google Inc. Answering questions using environmental context
US9081778B2 (en) 2012-09-25 2015-07-14 Audible Magic Corporation Using digital fingerprints to associate data with a work
US9390719B1 (en) * 2012-10-09 2016-07-12 Google Inc. Interest points density control for audio matching
US9069849B1 (en) * 2012-10-10 2015-06-30 Google Inc. Methods for enforcing time alignment for speed resistant audio matching
EP2731030A1 (en) * 2012-11-13 2014-05-14 Samsung Electronics Co., Ltd Music information searching method and apparatus thereof
US9195649B2 (en) 2012-12-21 2015-11-24 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US9158760B2 (en) 2012-12-21 2015-10-13 The Nielsen Company (Us), Llc Audio decoding with supplemental semantic audio recognition and report generation
US9183849B2 (en) 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US9706252B2 (en) 2013-02-04 2017-07-11 Universal Electronics Inc. System and method for user monitoring and intent determination
CN103971689B (zh) * 2013-02-04 2016-01-27 腾讯科技(深圳)有限公司 一种音频识别方法及装置
US9313544B2 (en) 2013-02-14 2016-04-12 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
US9311640B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods and arrangements for smartphone payments and transactions
FR3002713B1 (fr) * 2013-02-27 2015-02-27 Inst Mines Telecom Generation d'une signature d'un signal audio musical
US9451048B2 (en) 2013-03-12 2016-09-20 Shazam Investments Ltd. Methods and systems for identifying information of a broadcast station and information of broadcasted content
US9390170B2 (en) 2013-03-15 2016-07-12 Shazam Investments Ltd. Methods and systems for arranging and searching a database of media content recordings
US20140278845A1 (en) 2013-03-15 2014-09-18 Shazam Investments Limited Methods and Systems for Identifying Target Media Content and Determining Supplemental Information about the Target Media Content
US9773058B2 (en) 2013-03-15 2017-09-26 Shazam Investments Ltd. Methods and systems for arranging and searching a database of media content recordings
US9269022B2 (en) 2013-04-11 2016-02-23 Digimarc Corporation Methods for object recognition and related arrangements
US10346357B2 (en) 2013-04-30 2019-07-09 Splunk Inc. Processing of performance data and structure data from an information technology environment
US10318541B2 (en) 2013-04-30 2019-06-11 Splunk Inc. Correlating log data with performance measurements having a specified relationship to a threshold value
US10225136B2 (en) 2013-04-30 2019-03-05 Splunk Inc. Processing of log data and performance data obtained via an application programming interface (API)
US10997191B2 (en) 2013-04-30 2021-05-04 Splunk Inc. Query-triggered processing of performance data and log data from an information technology environment
US10614132B2 (en) 2013-04-30 2020-04-07 Splunk Inc. GUI-triggered processing of performance data and log data from an information technology environment
US10019496B2 (en) 2013-04-30 2018-07-10 Splunk Inc. Processing of performance data and log data from an information technology environment by using diverse data stores
US10353957B2 (en) 2013-04-30 2019-07-16 Splunk Inc. Processing of performance data and raw log data from an information technology environment
US9460201B2 (en) 2013-05-06 2016-10-04 Iheartmedia Management Services, Inc. Unordered matching of audio fingerprints
CN103402118B (zh) * 2013-07-05 2017-12-01 Tcl集团股份有限公司 一种媒体节目互动方法及系统
US9711152B2 (en) 2013-07-31 2017-07-18 The Nielsen Company (Us), Llc Systems apparatus and methods for encoding/decoding persistent universal media codes to encoded audio
US20150039321A1 (en) 2013-07-31 2015-02-05 Arbitron Inc. Apparatus, System and Method for Reading Codes From Digital Audio on a Processing Device
US9275427B1 (en) * 2013-09-05 2016-03-01 Google Inc. Multi-channel audio video fingerprinting
US9898086B2 (en) * 2013-09-06 2018-02-20 Immersion Corporation Systems and methods for visual processing of spectrograms to generate haptic effects
US10014006B1 (en) 2013-09-10 2018-07-03 Ampersand, Inc. Method of determining whether a phone call is answered by a human or by an automated device
US9053711B1 (en) 2013-09-10 2015-06-09 Ampersand, Inc. Method of matching a digitized stream of audio signals to a known audio recording
TWI527025B (zh) * 2013-11-11 2016-03-21 財團法人資訊工業策進會 電腦系統、音訊比對方法及其電腦可讀取記錄媒體
NL2011893C2 (en) * 2013-12-04 2015-06-08 Stichting Incas3 Method and system for predicting human activity.
US9426525B2 (en) 2013-12-31 2016-08-23 The Nielsen Company (Us), Llc. Methods and apparatus to count people in an audience
WO2015118431A1 (en) 2014-02-05 2015-08-13 Edge Innovation, Lda. Method for capture and analysis of multimedia content
US10430985B2 (en) 2014-03-14 2019-10-01 Magic Leap, Inc. Augmented reality systems and methods utilizing reflections
US9699499B2 (en) 2014-04-30 2017-07-04 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
CN104093079B (zh) 2014-05-29 2015-10-07 腾讯科技(深圳)有限公司 基于多媒体节目的交互方法、终端、服务器和系统
EP3023884A1 (en) * 2014-11-21 2016-05-25 Thomson Licensing Method and apparatus for generating fingerprint of an audio signal
EP3228084A4 (en) * 2014-12-01 2018-04-25 Inscape Data, Inc. System and method for continuous media segment identification
WO2016086905A1 (es) * 2014-12-05 2016-06-09 Monitoreo Tecnológico, S.A Método de medición de audiencias
WO2016123495A1 (en) 2015-01-30 2016-08-04 Vizio Inscape Technologies, Llc Methods for identifying video segments and displaying option to view from an alternative source and/or on an alternative device
US10360583B2 (en) 2015-02-05 2019-07-23 Direct Path, Llc System and method for direct response advertising
MX2017013128A (es) 2015-04-17 2018-01-26 Inscape Data Inc Sistemas y metodos para reducir densidad de los datos en grandes conjuntos de datos.
CN106294331B (zh) * 2015-05-11 2020-01-21 阿里巴巴集团控股有限公司 音频信息检索方法及装置
US9762965B2 (en) 2015-05-29 2017-09-12 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
CA2992319C (en) 2015-07-16 2023-11-21 Inscape Data, Inc. Detection of common media segments
WO2017011768A1 (en) 2015-07-16 2017-01-19 Vizio Inscape Technologies, Llc Systems and methods for partitioning search indexes for improved efficiency in identifying media segments
US10080062B2 (en) 2015-07-16 2018-09-18 Inscape Data, Inc. Optimizing media fingerprint retention to improve system resource utilization
CN106558318B (zh) 2015-09-24 2020-04-28 阿里巴巴集团控股有限公司 音频识别方法和系统
US11195043B2 (en) 2015-12-15 2021-12-07 Cortica, Ltd. System and method for determining common patterns in multimedia content elements based on key points
US11037015B2 (en) 2015-12-15 2021-06-15 Cortica Ltd. Identification of key points in multimedia data elements
US9516373B1 (en) 2015-12-21 2016-12-06 Max Abecassis Presets of synchronized second screen functions
US9596502B1 (en) 2015-12-21 2017-03-14 Max Abecassis Integration of multiple synchronization methodologies
JP6952713B2 (ja) 2016-01-19 2021-10-20 マジック リープ, インコーポレイテッドMagic Leap,Inc. 反射を利用する拡張現実システムおよび方法
US10951935B2 (en) 2016-04-08 2021-03-16 Source Digital, Inc. Media environment driven content distribution platform
US9786298B1 (en) 2016-04-08 2017-10-10 Source Digital, Inc. Audio fingerprinting based on audio energy characteristics
US10397663B2 (en) * 2016-04-08 2019-08-27 Source Digital, Inc. Synchronizing ancillary data to content including audio
US10311918B1 (en) 2016-04-19 2019-06-04 Space Projects Ltd. System, media, and method for synchronization of independent sensors and recording devices
NZ787464A (en) 2016-04-26 2023-06-30 Magic Leap Inc Electromagnetic tracking with augmented reality systems
US10015612B2 (en) 2016-05-25 2018-07-03 Dolby Laboratories Licensing Corporation Measurement, verification and correction of time alignment of multiple audio channels and associated metadata
CN106910494B (zh) 2016-06-28 2020-11-13 创新先进技术有限公司 一种音频识别方法和装置
WO2018047805A1 (ja) * 2016-09-09 2018-03-15 日本電気株式会社 移動音源速度推定装置、速度監視システム、移動音源速度推定方法、および移動音源速度推定用プログラムが記憶された記憶媒体
EP3312722A1 (en) 2016-10-21 2018-04-25 Fujitsu Limited Data processing apparatus, method, and program
US10776170B2 (en) 2016-10-21 2020-09-15 Fujitsu Limited Software service execution apparatus, system, and method
JP7100422B2 (ja) 2016-10-21 2022-07-13 富士通株式会社 データプロパティ認識のための装置、プログラム、及び方法
EP3312724B1 (en) 2016-10-21 2019-10-30 Fujitsu Limited Microservice-based data processing apparatus, method, and program
JP6805765B2 (ja) 2016-10-21 2020-12-23 富士通株式会社 ソフトウェアサービスの実行のためのシステム、方法、及びプログラム
US10922720B2 (en) 2017-01-11 2021-02-16 Adobe Inc. Managing content delivery via audio cues
US10166472B2 (en) 2017-05-04 2019-01-01 Shazam Investments Ltd. Methods and systems for determining a reaction time for a response and synchronizing user interface(s) with content being rendered
US10860786B2 (en) * 2017-06-01 2020-12-08 Global Tel*Link Corporation System and method for analyzing and investigating communication data from a controlled environment
WO2019008581A1 (en) 2017-07-05 2019-01-10 Cortica Ltd. DETERMINATION OF DRIVING POLICIES
GB2564495A (en) * 2017-07-07 2019-01-16 Cirrus Logic Int Semiconductor Ltd Audio data transfer
US11899707B2 (en) 2017-07-09 2024-02-13 Cortica Ltd. Driving policies determination
US10129392B1 (en) * 2017-08-25 2018-11-13 Global Tel*Link Corporation Systems and methods for detecting inmate to inmate conference calls
FR3071994A1 (fr) * 2017-09-29 2019-04-05 Theater Ears, LLC Procede et programme de reconnaissance et synchronisation audio
US20190104335A1 (en) * 2017-09-29 2019-04-04 Theater Ears, LLC Theater ears audio recognition & synchronization algorithm
US20190109804A1 (en) * 2017-10-10 2019-04-11 Microsoft Technology Licensing, Llc Audio processing for voice simulated noise effects
US10158907B1 (en) * 2017-10-10 2018-12-18 Shazam Investments Ltd. Systems and methods for performing playout of multiple media recordings based on a matching segment among the recordings
US10129575B1 (en) 2017-10-25 2018-11-13 Shazam Entertainment Limited Methods and systems for determining a latency between a source and an alternative feed of the source
US10846544B2 (en) 2018-07-16 2020-11-24 Cartica Ai Ltd. Transportation prediction system and method
CN112534800B (zh) * 2018-07-18 2021-10-15 谷歌有限责任公司 一种回波检测的方法和系统
US11443724B2 (en) * 2018-07-31 2022-09-13 Mediawave Intelligent Communication Method of synchronizing electronic interactive device
US20200133308A1 (en) 2018-10-18 2020-04-30 Cartica Ai Ltd Vehicle to vehicle (v2v) communication less truck platooning
US10839694B2 (en) 2018-10-18 2020-11-17 Cartica Ai Ltd Blind spot alert
US11126870B2 (en) 2018-10-18 2021-09-21 Cartica Ai Ltd. Method and system for obstacle detection
US11181911B2 (en) 2018-10-18 2021-11-23 Cartica Ai Ltd Control transfer of a vehicle
US11700356B2 (en) 2018-10-26 2023-07-11 AutoBrains Technologies Ltd. Control transfer of a vehicle
US10789535B2 (en) 2018-11-26 2020-09-29 Cartica Ai Ltd Detection of road elements
US11643005B2 (en) 2019-02-27 2023-05-09 Autobrains Technologies Ltd Adjusting adjustable headlights of a vehicle
US11285963B2 (en) 2019-03-10 2022-03-29 Cartica Ai Ltd. Driver-based prediction of dangerous events
US11694088B2 (en) 2019-03-13 2023-07-04 Cortica Ltd. Method for object detection using knowledge distillation
US11132548B2 (en) 2019-03-20 2021-09-28 Cortica Ltd. Determining object information that does not explicitly appear in a media unit signature
US11488290B2 (en) 2019-03-31 2022-11-01 Cortica Ltd. Hybrid representation of a media unit
US10776669B1 (en) 2019-03-31 2020-09-15 Cortica Ltd. Signature generation and object detection that refer to rare scenes
US10796444B1 (en) 2019-03-31 2020-10-06 Cortica Ltd Configuring spanning elements of a signature generator
US11222069B2 (en) 2019-03-31 2022-01-11 Cortica Ltd. Low-power calculation of a signature of a media unit
US10789527B1 (en) 2019-03-31 2020-09-29 Cortica Ltd. Method for object detection using shallow neural networks
US11245959B2 (en) 2019-06-20 2022-02-08 Source Digital, Inc. Continuous dual authentication to access media content
US11593662B2 (en) 2019-12-12 2023-02-28 Autobrains Technologies Ltd Unsupervised cluster generation
US10748022B1 (en) 2019-12-12 2020-08-18 Cartica Ai Ltd Crowd separation
US11590988B2 (en) 2020-03-19 2023-02-28 Autobrains Technologies Ltd Predictive turning assistant
US11827215B2 (en) 2020-03-31 2023-11-28 AutoBrains Technologies Ltd. Method for training a driving related object detector
US11756424B2 (en) 2020-07-24 2023-09-12 AutoBrains Technologies Ltd. Parking assist
US11694692B2 (en) 2020-11-11 2023-07-04 Bank Of America Corporation Systems and methods for audio enhancement and conversion
US20230388562A1 (en) * 2022-05-27 2023-11-30 Sling TV L.L.C. Media signature recognition with resource constrained devices

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1144030A (zh) * 1994-12-02 1997-02-26 菲利浦电子有限公司 音频/视频定时差异的处理
CN1219810A (zh) * 1997-12-12 1999-06-16 上海金陵股份有限公司 远程公共电脑系统

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4415767A (en) * 1981-10-19 1983-11-15 Votan Method and apparatus for speech recognition and reproduction
US4450531A (en) 1982-09-10 1984-05-22 Ensco, Inc. Broadcast signal recognition system and method
US4843562A (en) 1987-06-24 1989-06-27 Broadcast Data Systems Limited Partnership Broadcast information classification system and method
US5210820A (en) 1990-05-02 1993-05-11 Broadcast Data Systems Limited Partnership Signal recognition system and method
US5918223A (en) * 1996-07-22 1999-06-29 Muscle Fish Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information
US6088455A (en) * 1997-01-07 2000-07-11 Logan; James D. Methods and apparatus for selectively reproducing segments of broadcast programming
JP2002514318A (ja) 1997-01-31 2002-05-14 ティ―ネティックス,インコーポレイテッド 録音された音声を検出するシステムおよび方法
US5940799A (en) 1997-09-15 1999-08-17 Motorola, Inc. System and method for securing speech transactions
US5913196A (en) 1997-11-17 1999-06-15 Talmor; Rita System and method for establishing identity of a speaker
US6434520B1 (en) * 1999-04-16 2002-08-13 International Business Machines Corporation System and method for indexing and querying audio archives
US20010044719A1 (en) * 1999-07-02 2001-11-22 Mitsubishi Electric Research Laboratories, Inc. Method and system for recognizing, indexing, and searching acoustic signals
GR1003625B (el) * 1999-07-08 2001-08-31 Μεθοδος χημικης αποθεσης συνθετων επικαλυψεων αγωγιμων πολυμερων σε επιφανειες κραματων αλουμινιου
US7174293B2 (en) * 1999-09-21 2007-02-06 Iceberg Industries Llc Audio identification system and method
US7194752B1 (en) 1999-10-19 2007-03-20 Iceberg Industries, Llc Method and apparatus for automatically recognizing input audio and/or video streams
US6453252B1 (en) * 2000-05-15 2002-09-17 Creative Technology Ltd. Process for identifying audio content
US6990453B2 (en) * 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
US7853664B1 (en) * 2000-07-31 2010-12-14 Landmark Digital Services Llc Method and system for purchasing pre-recorded music
US20020072982A1 (en) * 2000-12-12 2002-06-13 Shazam Entertainment Ltd. Method and system for interacting with a user in an experiential environment
US6483927B2 (en) 2000-12-18 2002-11-19 Digimarc Corporation Synchronizing readers of hidden auxiliary data in quantization-based data hiding schemes
JP4723171B2 (ja) * 2001-02-12 2011-07-13 グレースノート インク マルチメディア・コンテンツのハッシュの生成および突合せ
AU2002346116A1 (en) * 2001-07-20 2003-03-03 Gracenote, Inc. Automatic identification of sound recordings
US7082394B2 (en) * 2002-06-25 2006-07-25 Microsoft Corporation Noise-robust feature extraction using multi-layer principal component analysis
AU2003264774A1 (en) * 2002-11-01 2004-05-25 Koninklijke Philips Electronics N.V. Improved audio data fingerprint searching
KR100456408B1 (ko) * 2004-02-06 2004-11-10 (주)뮤레카 오디오유전자 생성방법 및 오디오데이터 검색방법
JP5150266B2 (ja) * 2005-02-08 2013-02-20 ランドマーク、ディジタル、サーヴィセズ、エルエルシー オーディオ信号において繰り返されるマテリアルの自動識別

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1144030A (zh) * 1994-12-02 1997-02-26 菲利浦电子有限公司 音频/视频定时差异的处理
CN1219810A (zh) * 1997-12-12 1999-06-16 上海金陵股份有限公司 远程公共电脑系统

Also Published As

Publication number Publication date
KR20050010763A (ko) 2005-01-28
KR100820385B1 (ko) 2008-04-10
PT1504445E (pt) 2008-11-24
HK1073382A1 (en) 2005-09-30
EP1504445A4 (en) 2005-08-17
CA2483104A1 (en) 2003-11-06
JP4425126B2 (ja) 2010-03-03
US20090265174A9 (en) 2009-10-22
CA2483104C (en) 2011-06-21
EP1504445B1 (en) 2008-08-20
AU2003230993A1 (en) 2003-11-10
EP1504445A1 (en) 2005-02-09
DE60323086D1 (de) 2008-10-02
CN1647160A (zh) 2005-07-27
JP2005524108A (ja) 2005-08-11
DK1504445T3 (da) 2008-12-01
TWI269196B (en) 2006-12-21
ES2312772T3 (es) 2009-03-01
US7627477B2 (en) 2009-12-01
ATE405924T1 (de) 2008-09-15
BR0309598A (pt) 2005-02-09
TW200307205A (en) 2003-12-01
US20050177372A1 (en) 2005-08-11
WO2003091990A1 (en) 2003-11-06

Similar Documents

Publication Publication Date Title
CN1315110C (zh) 坚固而且不变的音频图样匹配
CN102959624B (zh) 用于音频媒体识别的系统和方法
Bello Measuring structural similarity in music
CN100437572C (zh) 音频指纹识别系统和方法
Futrelle et al. Interdisciplinary communities and research issues in Music Information Retrieval.
Bertin-Mahieux et al. Large-scale cover song recognition using hashed chroma landmarks
US20020133499A1 (en) System and method for acoustic fingerprinting
CN100472515C (zh) 用于管理声频信息的系统
US6990453B2 (en) System and methods for recognizing sound and music signals in high noise and distortion
KR100659672B1 (ko) 핑거프린트를 생성하는 방법과 장치 및 오디오 신호를 식별하는 방법과 장치
Casey et al. Song Intersection by Approximate Nearest Neighbor Search.
US8589171B2 (en) System and method for custom marking a media file for file matching
CN1890665A (zh) 旋律数据库搜索
US20060155399A1 (en) Method and system for generating acoustic fingerprints
CN1997989A (zh) 用于自动检测和标识广播音频或视频节目信号的方法和装置
Arzt et al. Fast Identification of Piece and Score Position via Symbolic Fingerprinting.
Liu et al. Content-based retrieval of MP3 music objects
Burges et al. Using audio fingerprinting for duplicate detection and thumbnail generation
Gupta et al. CRIM’s content-based audio copy detection system for TRECVID 2009
Yu et al. Combining multi-probe histogram and order-statistics based lsh for scalable audio content retrieval
Osmalsky Combining features for cover song identification
WO2012120531A2 (en) A method for fast and accurate audio content match detection
Six et al. A robust audio fingerprinter based on pitch class histograms applications for ethnic music archives
Grosche et al. Toward musically-motivated audio fingerprints
Wieczorkowska et al. Audio content description in sound databases

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: LANDLER MARK DIGITAL SERVICE CO., LTD.

Free format text: FORMER OWNER: SHAZAM ENTERTAINMENT LTD.

Effective date: 20060210

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20060210

Address after: Tennessee

Applicant after: Landmark Digital Services LLC

Address before: London kafin Ducie Plaza No. 2

Applicant before: Shazam Entertainment Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: SHAZAM INVESTMENT CO., LTD.

Free format text: FORMER OWNER: LANDMARK DIGITAL SERVICES LLC

Effective date: 20140108

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20140108

Address after: London City

Patentee after: Shazam Investments Ltd

Address before: Tennessee

Patentee before: Landmark Digital Services LLC

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070509

Termination date: 20160418

CF01 Termination of patent right due to non-payment of annual fee