SG11202008533VA - Audio fingerprint extraction method and device

Audio fingerprint extraction method and device

Info

Publication number
SG11202008533VA
Authority
SG
Singapore
Prior art keywords
extraction method
audio fingerprint
fingerprint extraction
audio
fingerprint
Application number
SG11202008533VA
Other languages
English (en)
Inventor
Gen Li
Lei Li
Yi He
Original Assignee
Beijing Bytedance Network Technology Co Ltd
Application filed by Beijing Bytedance Network Technology Co Ltd
Publication of SG11202008533VA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/02 Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Circuit For Audible Band Transducer (AREA)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810273669.6A CN110322886A (zh) 2018-03-29 2018-03-29 Audio fingerprint extraction method and device
PCT/CN2018/125491 WO2019184517A1 (zh) 2018-03-29 2018-12-29 Audio fingerprint extraction method and device

Publications (1)

Publication Number Publication Date
SG11202008533VA (en) 2020-10-29

Family

ID=68062543

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202008533VA SG11202008533VA (en) 2018-03-29 2018-12-29 Audio fingerprint extraction method and device

Country Status (5)

Country Link
US (1) US10950255B2 (ja)
JP (1) JP6908774B2 (ja)
CN (1) CN110322886A (ja)
SG (1) SG11202008533VA (ja)
WO (1) WO2019184517A1 (ja)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11138471B2 (en) * 2018-05-18 2021-10-05 Google Llc Augmentation of audiographic images for improved machine learning
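CN111581430B (zh) * 2020-04-30 2022-05-17 厦门快商通科技股份有限公司 Audio fingerprint generation method, apparatus, and device
CN111862989B (zh) * 2020-06-01 2024-03-08 北京捷通华声科技股份有限公司 Acoustic feature processing method and apparatus
CN112104892B (zh) * 2020-09-11 2021-12-10 腾讯科技(深圳)有限公司 Multimedia information processing method and apparatus, electronic device, and storage medium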
US11798577B2 (en) * 2021-03-04 2023-10-24 Gracenote, Inc. Methods and apparatus to fingerprint an audio signal
CN113889119A (zh) * 2021-09-15 2022-01-04 北京市农林科学院信息技术研究中心 Livestock and poultry audio fingerprint extraction method and device

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6990453B2 (en) * 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
CN1672155A (zh) 2002-07-24 2005-09-21 皇家飞利浦电子股份有限公司 Method and device for regulating file sharing
KR20050046815A (ko) * 2002-09-30 2005-05-18 코닌클리케 필립스 일렉트로닉스 엔.브이. Fingerprint extraction
US20050249080A1 (en) * 2004-05-07 2005-11-10 Fuji Xerox Co., Ltd. Method and system for harvesting a media stream
US7516074B2 (en) * 2005-09-01 2009-04-07 Auditude, Inc. Extraction and matching of characteristic fingerprints from audio signals
KR100862616B1 (ko) * 2007-04-17 2008-10-09 한국전자통신연구원 Audio fingerprint search system and method using index information
US9299364B1 (en) * 2008-06-18 2016-03-29 Gracenote, Inc. Audio content fingerprinting based on two-dimensional constant Q-factor transform representation and robust audio identification for time-aligned applications
US20130152767A1 (en) * 2010-04-22 2013-06-20 Jamrt Ltd Generating pitched musical events corresponding to musical content
US9275141B2 (en) * 2010-05-04 2016-03-01 Shazam Entertainment Ltd. Methods and systems for processing a sample of a media stream
US8584197B2 (en) * 2010-11-12 2013-11-12 Google Inc. Media rights management using melody identification
US9093120B2 (en) * 2011-02-10 2015-07-28 Yahoo! Inc. Audio fingerprint extraction by scaling in time and resampling
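JP2012185195A (ja) 2011-03-03 2012-09-27 Jvc Kenwood Corp Audio data feature extraction method, audio data matching method, audio data feature extraction program, audio data matching program, audio data feature extraction device, audio data matching device, and audio data matching system
ES2459391T3 (es) * 2011-06-06 2014-05-09 Bridge Mediatech, S.L. Method and system for achieving channel-invariant audio hashing
JP5772957B2 (ja) * 2011-07-14 2015-09-02 日本電気株式会社 Sound processing device, sound processing system, video processing system, control method, and control program
CN102324232A (zh) * 2011-09-12 2012-01-18 辽宁工业大学 Voiceprint recognition method and system based on a Gaussian mixture model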
US9384272B2 (en) * 2011-10-05 2016-07-05 The Trustees Of Columbia University In The City Of New York Methods, systems, and media for identifying similar songs using jumpcodes
CN103999150B (zh) * 2011-12-12 2016-10-19 杜比实验室特许公司 Low-complexity repetition detection in media data
US8949872B2 (en) * 2011-12-20 2015-02-03 Yahoo! Inc. Audio fingerprint for content identification
US10986399B2 (en) * 2012-02-21 2021-04-20 Gracenote, Inc. Media content identification on mobile devices
US8681950B2 (en) * 2012-03-28 2014-03-25 Interactive Intelligence, Inc. System and method for fingerprinting datasets
CN102820033B (zh) * 2012-08-17 2013-12-04 南京大学 Voiceprint recognition method
US9305559B2 (en) * 2012-10-15 2016-04-05 Digimarc Corporation Audio watermark encoding with reversing polarity and pairwise embedding
US9183849B2 (en) * 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US9451048B2 (en) * 2013-03-12 2016-09-20 Shazam Investments Ltd. Methods and systems for identifying information of a broadcast station and information of broadcasted content
CN104050259A (zh) * 2014-06-16 2014-09-17 上海大学 Audio fingerprint extraction method based on the SOM algorithm
JP6257537B2 (ja) 2015-01-19 2018-01-10 日本電信電話株式会社 Saliency estimation method, saliency estimation device, and program
US9971928B2 (en) * 2015-02-27 2018-05-15 Qualcomm Incorporated Fingerprint verification system
CN104865313B (zh) * 2015-05-12 2017-11-17 福建星网锐捷通讯股份有限公司 Method and device for detecting glass breakage based on sound spectrum fringes
US20170097992A1 (en) * 2015-10-02 2017-04-06 Evergig Music S.A.S.U. Systems and methods for searching, comparing and/or matching digital audio files
US10318813B1 (en) * 2016-03-11 2019-06-11 Gracenote, Inc. Digital video fingerprinting using motion segmentation
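CN106296890B (zh) * 2016-07-22 2019-06-04 北京小米移动软件有限公司 Mobile terminal unlocking method and apparatus, and mobile terminal
CN106250742A (zh) * 2016-07-22 2016-12-21 北京小米移动软件有限公司 Mobile terminal unlocking method and apparatus, and mobile terminal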
US10236006B1 (en) * 2016-08-05 2019-03-19 Digimarc Corporation Digital watermarks adapted to compensate for time scaling, pitch shifting and mixing
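CN106782568A (zh) * 2016-11-22 2017-05-31 合肥星服信息科技有限责任公司 Voiceprint filtering method combining frequency extrema and mean values
CN107610708B (zh) * 2017-06-09 2018-06-19 平安科技(深圳)有限公司 Method and device for recognizing voiceprints
CN107622773B (zh) * 2017-09-08 2021-04-06 科大讯飞股份有限公司 Audio feature extraction method and apparatus, and electronic device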
EP3701528B1 (en) * 2017-11-02 2023-03-15 Huawei Technologies Co., Ltd. Segmentation-based feature extraction for acoustic scene classification

Also Published As

Publication number Publication date
JP6908774B2 (ja) 2021-07-28
US20200273483A1 (en) 2020-08-27
JP2020527255A (ja) 2020-09-03
CN110322886A (zh) 2019-10-11
WO2019184517A1 (zh) 2019-10-03
US10950255B2 (en) 2021-03-16

Similar Documents

Publication Publication Date Title
EP3536158A4 (en) EXTRACTION PROCESS AND EXTRACTION DEVICE
SG11202008533VA (en) Audio fingerprint extraction method and device
SG11202006204TA (en) Identity verification method and device and electronic device
ZA201902460B (en) Identity recognition method and device
EP3892210A4 (en) THROMBUS EXTRACTION DEVICE AND PROCEDURE
SG11202008548VA (en) Audio Retrieval And Recognition Method And Device
SG11202107218TA (en) Separation device and separation method
EP3895586C0 (en) EXTRACTION DEVICE AND EXTRACTION METHOD
HK1251870A1 (zh) Fingerprint collection method and terminal
EP3547186A4 (en) FINGERPRINTER RECOGNITION PROCESS AND DEVICE DEVICE
EP3750092A4 (en) DIGITAL DEVICE AND BIOMETRIC AUTHENTICATION PROCESS
EP3682355A4 (en) DIGITAL DEVICE AND BIOMETRIC AUTHENTICATION PROCESS
SG11202008272RA (en) Video feature extraction method and device
SG11202101538TA (en) Information processing method and device
EP3842990A4 (en) FACIAL RECOGNITION PROCESS AND DEVICE
EP3734568A4 (en) METHOD AND DEVICE FOR DATA EXTRACTION
EP3474223A4 (en) DEVICE FOR READING DIGITAL IMPRESSIONS AND METHOD FOR READING DIGITAL IMPRESSIONS
GB2567519B (en) Fingerprint recognition method and electronic device using the same
GB201818884D0 (en) Forthing device and method thereof
EP3722175A4 (en) DEVICE AND METHOD FOR PERIPHERAL RECOGNITION
EP3745297A4 (en) DEVICE FOR COLLECTING FINGERPRINTS AND METHOD FOR COLLECTING FINGERPRINTS
EP3860148A4 (en) ACOUSTIC OBJECT EXTRACTION DEVICE AND ACOUSTIC OBJECT EXTRACTION PROCESS
ZA201908003B (en) Extraction system and apparatus and method thereof
GB201821084D0 (en) Device and method
SG11202004317RA (en) Information processing device and information processing method