SG11202008533VA - Audio fingerprint extraction method and device - Google Patents

Audio fingerprint extraction method and device

Info

Publication number
SG11202008533VA
SG11202008533VA SG11202008533VA SG11202008533VA SG11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA
Authority
SG
Singapore
Prior art keywords
extraction method
audio fingerprint
fingerprint extraction
audio
fingerprint
Prior art date
Application number
SG11202008533VA
Inventor
Gen Li
Lei Li
Yi He
Original Assignee
Beijing Bytedance Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Bytedance Network Technology Co Ltd filed Critical Beijing Bytedance Network Technology Co Ltd
Publication of SG11202008533VA publication Critical patent/SG11202008533VA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Library & Information Science (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Circuit For Audible Band Transducer (AREA)
SG11202008533VA 2018-03-29 2018-12-29 Audio fingerprint extraction method and device SG11202008533VA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810273669.6A CN110322886A (en) 2018-03-29 2018-03-29 A kind of audio-frequency fingerprint extracting method and device
PCT/CN2018/125491 WO2019184517A1 (en) 2018-03-29 2018-12-29 Audio fingerprint extraction method and device

Publications (1)

Publication Number Publication Date
SG11202008533VA true SG11202008533VA (en) 2020-10-29

Family

ID=68062543

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202008533VA SG11202008533VA (en) 2018-03-29 2018-12-29 Audio fingerprint extraction method and device

Country Status (5)

Country Link
US (1) US10950255B2 (en)
JP (1) JP6908774B2 (en)
CN (1) CN110322886A (en)
SG (1) SG11202008533VA (en)
WO (1) WO2019184517A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11138471B2 (en) * 2018-05-18 2021-10-05 Google Llc Augmentation of audiographic images for improved machine learning
CN111581430B (en) * 2020-04-30 2022-05-17 厦门快商通科技股份有限公司 Audio fingerprint generation method and device and equipment
CN111862989B (en) * 2020-06-01 2024-03-08 北京捷通华声科技股份有限公司 Acoustic feature processing method and device
CN112104892B (en) * 2020-09-11 2021-12-10 腾讯科技(深圳)有限公司 Multimedia information processing method and device, electronic equipment and storage medium
US11798577B2 (en) * 2021-03-04 2023-10-24 Gracenote, Inc. Methods and apparatus to fingerprint an audio signal

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6990453B2 (en) * 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
KR20050029723A (en) * 2002-07-24 2005-03-28 코닌클리케 필립스 일렉트로닉스 엔.브이. Method and device for regulating file sharing
DE60326743D1 (en) * 2002-09-30 2009-04-30 Gracenote Inc FINGERPRINT EXTRACTION
US20050249080A1 (en) * 2004-05-07 2005-11-10 Fuji Xerox Co., Ltd. Method and system for harvesting a media stream
US7516074B2 (en) * 2005-09-01 2009-04-07 Auditude, Inc. Extraction and matching of characteristic fingerprints from audio signals
KR100862616B1 (en) * 2007-04-17 2008-10-09 한국전자통신연구원 Searching system and method of audio fingerprint by index information
US9299364B1 (en) * 2008-06-18 2016-03-29 Gracenote, Inc. Audio content fingerprinting based on two-dimensional constant Q-factor transform representation and robust audio identification for time-aligned applications
WO2011132184A1 (en) * 2010-04-22 2011-10-27 Jamrt Ltd. Generating pitched musical events corresponding to musical content
CN102959543B (en) * 2010-05-04 2016-05-25 沙扎姆娱乐有限公司 For the treatment of the method and system of the sample of Media Stream
US8584197B2 (en) * 2010-11-12 2013-11-12 Google Inc. Media rights management using melody identification
US9093120B2 (en) * 2011-02-10 2015-07-28 Yahoo! Inc. Audio fingerprint extraction by scaling in time and resampling
JP2012185195A (en) * 2011-03-03 2012-09-27 Jvc Kenwood Corp Audio data feature extraction method, audio data collation method, audio data feature extraction program, audio data collation program, audio data feature extraction device, audio data collation device, and audio data collation system
US9286909B2 (en) * 2011-06-06 2016-03-15 Bridge Mediatech, S.L. Method and system for robust audio hashing
WO2013008956A1 (en) * 2011-07-14 2013-01-17 日本電気株式会社 Sound processing method, sound processing system, video processing method, video processing system, sound processing device, and method and program for controlling same
CN102324232A (en) * 2011-09-12 2012-01-18 辽宁工业大学 Method for recognizing sound-groove and system based on gauss hybrid models
US9384272B2 (en) * 2011-10-05 2016-07-05 The Trustees Of Columbia University In The City Of New York Methods, systems, and media for identifying similar songs using jumpcodes
CN103999150B (en) * 2011-12-12 2016-10-19 杜比实验室特许公司 Low complex degree duplicate detection in media data
US8949872B2 (en) * 2011-12-20 2015-02-03 Yahoo! Inc. Audio fingerprint for content identification
US11445242B2 (en) * 2012-02-21 2022-09-13 Roku, Inc. Media content identification on mobile devices
US8681950B2 (en) * 2012-03-28 2014-03-25 Interactive Intelligence, Inc. System and method for fingerprinting datasets
CN102820033B (en) * 2012-08-17 2013-12-04 南京大学 Voiceprint identification method
US9305559B2 (en) * 2012-10-15 2016-04-05 Digimarc Corporation Audio watermark encoding with reversing polarity and pairwise embedding
US9183849B2 (en) * 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US9451048B2 (en) * 2013-03-12 2016-09-20 Shazam Investments Ltd. Methods and systems for identifying information of a broadcast station and information of broadcasted content
CN104050259A (en) * 2014-06-16 2014-09-17 上海大学 Audio fingerprint extracting method based on SOM (Self Organized Mapping) algorithm
JP6257537B2 (en) * 2015-01-19 2018-01-10 日本電信電話株式会社 Saliency estimation method, saliency estimation device, and program
US9971928B2 (en) * 2015-02-27 2018-05-15 Qualcomm Incorporated Fingerprint verification system
CN104865313B (en) * 2015-05-12 2017-11-17 福建星网锐捷通讯股份有限公司 A kind of detection method and device based on sound spectrum bar detection glass breaking
US20170097992A1 (en) * 2015-10-02 2017-04-06 Evergig Music S.A.S.U. Systems and methods for searching, comparing and/or matching digital audio files
US10318813B1 (en) * 2016-03-11 2019-06-11 Gracenote, Inc. Digital video fingerprinting using motion segmentation
CN106296890B (en) * 2016-07-22 2019-06-04 北京小米移动软件有限公司 Unlocking method, device and the mobile terminal of mobile terminal
CN106250742A (en) * 2016-07-22 2016-12-21 北京小米移动软件有限公司 The unlocking method of mobile terminal, device and mobile terminal
US10236006B1 (en) * 2016-08-05 2019-03-19 Digimarc Corporation Digital watermarks adapted to compensate for time scaling, pitch shifting and mixing
CN106782568A (en) * 2016-11-22 2017-05-31 合肥星服信息科技有限责任公司 The vocal print filter method that a kind of frequency extremes and average are combined
CN107610708B (en) * 2017-06-09 2018-06-19 平安科技(深圳)有限公司 Identify the method and apparatus of vocal print
CN107622773B (en) * 2017-09-08 2021-04-06 科大讯飞股份有限公司 Audio feature extraction method and device and electronic equipment
CN111279414B (en) * 2017-11-02 2022-12-06 华为技术有限公司 Segmentation-based feature extraction for sound scene classification

Also Published As

Publication number Publication date
CN110322886A (en) 2019-10-11
JP6908774B2 (en) 2021-07-28
JP2020527255A (en) 2020-09-03
US10950255B2 (en) 2021-03-16
WO2019184517A1 (en) 2019-10-03
US20200273483A1 (en) 2020-08-27

Similar Documents

Publication Publication Date Title
SG11202011791SA (en) Pedestrian recognition method and device
EP3536158A4 (en) Extraction method and extraction device
SG11202006204TA (en) Identity verification method and device and electronic device
EP3892210A4 (en) Thrombus extraction device and method
SG11202008533VA (en) Audio fingerprint extraction method and device
ZA201902460B (en) Identity recognition method and device
SG11202008548VA (en) Audio Retrieval And Recognition Method And Device
SG11202107218TA (en) Separation device and separation method
EP3895586C0 (en) Extraction apparatus and extraction method
EP3547186A4 (en) Fingerprint recognition method and terminal device
HK1251870A1 (en) Fingerprint collection method and terminal
EP3682355A4 (en) Digital device and biometric authentication method therein
EP3750092A4 (en) Digital device and biometric authentication method therein
SG11202101538TA (en) Information processing method and device
EP3734568A4 (en) Data extraction method and device
SG11202008272RA (en) Video feature extraction method and device
EP3842990A4 (en) Face recognition method and device
GB201818884D0 (en) Forthing device and method thereof
EP3722175A4 (en) Periphery recognition device and periphery recognition method
EP3474223A4 (en) Fingerprint reading device and fingerprint reading method
EP3745297A4 (en) Fingerprint collecting device and fingerprint collecting method
EP3860148A4 (en) Acoustic object extraction device and acoustic object extraction method
ZA201908003B (en) Extraction system and apparatus and method thereof
GB2567519B (en) Fingerprint recognition method and electronic device using the same
SG11202004317RA (en) Information processing device and information processing method