SG11202008533VA - Audio fingerprint extraction method and device - Google Patents
Audio fingerprint extraction method and deviceInfo
- Publication number
- SG11202008533VA SG11202008533VA SG11202008533VA SG11202008533VA SG11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA SG 11202008533V A SG11202008533V A SG 11202008533VA
- Authority
- SG
- Singapore
- Prior art keywords
- extraction method
- audio fingerprint
- fingerprint extraction
- audio
- fingerprint
- Prior art date
Links
- 238000000605 extraction Methods 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/54—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Library & Information Science (AREA)
- Theoretical Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810273669.6A CN110322886A (en) | 2018-03-29 | 2018-03-29 | A kind of audio-frequency fingerprint extracting method and device |
PCT/CN2018/125491 WO2019184517A1 (en) | 2018-03-29 | 2018-12-29 | Audio fingerprint extraction method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202008533VA true SG11202008533VA (en) | 2020-10-29 |
Family
ID=68062543
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202008533VA SG11202008533VA (en) | 2018-03-29 | 2018-12-29 | Audio fingerprint extraction method and device |
Country Status (5)
Country | Link |
---|---|
US (1) | US10950255B2 (en) |
JP (1) | JP6908774B2 (en) |
CN (1) | CN110322886A (en) |
SG (1) | SG11202008533VA (en) |
WO (1) | WO2019184517A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11138471B2 (en) * | 2018-05-18 | 2021-10-05 | Google Llc | Augmentation of audiographic images for improved machine learning |
CN111581430B (en) * | 2020-04-30 | 2022-05-17 | 厦门快商通科技股份有限公司 | Audio fingerprint generation method and device and equipment |
CN111862989B (en) * | 2020-06-01 | 2024-03-08 | 北京捷通华声科技股份有限公司 | Acoustic feature processing method and device |
CN112104892B (en) * | 2020-09-11 | 2021-12-10 | 腾讯科技(深圳)有限公司 | Multimedia information processing method and device, electronic equipment and storage medium |
US11798577B2 (en) * | 2021-03-04 | 2023-10-24 | Gracenote, Inc. | Methods and apparatus to fingerprint an audio signal |
Family Cites Families (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6990453B2 (en) * | 2000-07-31 | 2006-01-24 | Landmark Digital Services Llc | System and methods for recognizing sound and music signals in high noise and distortion |
KR20050029723A (en) * | 2002-07-24 | 2005-03-28 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Method and device for regulating file sharing |
DE60326743D1 (en) * | 2002-09-30 | 2009-04-30 | Gracenote Inc | FINGERPRINT EXTRACTION |
US20050249080A1 (en) * | 2004-05-07 | 2005-11-10 | Fuji Xerox Co., Ltd. | Method and system for harvesting a media stream |
US7516074B2 (en) * | 2005-09-01 | 2009-04-07 | Auditude, Inc. | Extraction and matching of characteristic fingerprints from audio signals |
KR100862616B1 (en) * | 2007-04-17 | 2008-10-09 | 한국전자통신연구원 | Searching system and method of audio fingerprint by index information |
US9299364B1 (en) * | 2008-06-18 | 2016-03-29 | Gracenote, Inc. | Audio content fingerprinting based on two-dimensional constant Q-factor transform representation and robust audio identification for time-aligned applications |
WO2011132184A1 (en) * | 2010-04-22 | 2011-10-27 | Jamrt Ltd. | Generating pitched musical events corresponding to musical content |
CN102959543B (en) * | 2010-05-04 | 2016-05-25 | 沙扎姆娱乐有限公司 | For the treatment of the method and system of the sample of Media Stream |
US8584197B2 (en) * | 2010-11-12 | 2013-11-12 | Google Inc. | Media rights management using melody identification |
US9093120B2 (en) * | 2011-02-10 | 2015-07-28 | Yahoo! Inc. | Audio fingerprint extraction by scaling in time and resampling |
JP2012185195A (en) * | 2011-03-03 | 2012-09-27 | Jvc Kenwood Corp | Audio data feature extraction method, audio data collation method, audio data feature extraction program, audio data collation program, audio data feature extraction device, audio data collation device, and audio data collation system |
US9286909B2 (en) * | 2011-06-06 | 2016-03-15 | Bridge Mediatech, S.L. | Method and system for robust audio hashing |
WO2013008956A1 (en) * | 2011-07-14 | 2013-01-17 | 日本電気株式会社 | Sound processing method, sound processing system, video processing method, video processing system, sound processing device, and method and program for controlling same |
CN102324232A (en) * | 2011-09-12 | 2012-01-18 | 辽宁工业大学 | Method for recognizing sound-groove and system based on gauss hybrid models |
US9384272B2 (en) * | 2011-10-05 | 2016-07-05 | The Trustees Of Columbia University In The City Of New York | Methods, systems, and media for identifying similar songs using jumpcodes |
CN103999150B (en) * | 2011-12-12 | 2016-10-19 | 杜比实验室特许公司 | Low complex degree duplicate detection in media data |
US8949872B2 (en) * | 2011-12-20 | 2015-02-03 | Yahoo! Inc. | Audio fingerprint for content identification |
US11445242B2 (en) * | 2012-02-21 | 2022-09-13 | Roku, Inc. | Media content identification on mobile devices |
US8681950B2 (en) * | 2012-03-28 | 2014-03-25 | Interactive Intelligence, Inc. | System and method for fingerprinting datasets |
CN102820033B (en) * | 2012-08-17 | 2013-12-04 | 南京大学 | Voiceprint identification method |
US9305559B2 (en) * | 2012-10-15 | 2016-04-05 | Digimarc Corporation | Audio watermark encoding with reversing polarity and pairwise embedding |
US9183849B2 (en) * | 2012-12-21 | 2015-11-10 | The Nielsen Company (Us), Llc | Audio matching with semantic audio recognition and report generation |
US9451048B2 (en) * | 2013-03-12 | 2016-09-20 | Shazam Investments Ltd. | Methods and systems for identifying information of a broadcast station and information of broadcasted content |
CN104050259A (en) * | 2014-06-16 | 2014-09-17 | 上海大学 | Audio fingerprint extracting method based on SOM (Self Organized Mapping) algorithm |
JP6257537B2 (en) * | 2015-01-19 | 2018-01-10 | 日本電信電話株式会社 | Saliency estimation method, saliency estimation device, and program |
US9971928B2 (en) * | 2015-02-27 | 2018-05-15 | Qualcomm Incorporated | Fingerprint verification system |
CN104865313B (en) * | 2015-05-12 | 2017-11-17 | 福建星网锐捷通讯股份有限公司 | A kind of detection method and device based on sound spectrum bar detection glass breaking |
US20170097992A1 (en) * | 2015-10-02 | 2017-04-06 | Evergig Music S.A.S.U. | Systems and methods for searching, comparing and/or matching digital audio files |
US10318813B1 (en) * | 2016-03-11 | 2019-06-11 | Gracenote, Inc. | Digital video fingerprinting using motion segmentation |
CN106296890B (en) * | 2016-07-22 | 2019-06-04 | 北京小米移动软件有限公司 | Unlocking method, device and the mobile terminal of mobile terminal |
CN106250742A (en) * | 2016-07-22 | 2016-12-21 | 北京小米移动软件有限公司 | The unlocking method of mobile terminal, device and mobile terminal |
US10236006B1 (en) * | 2016-08-05 | 2019-03-19 | Digimarc Corporation | Digital watermarks adapted to compensate for time scaling, pitch shifting and mixing |
CN106782568A (en) * | 2016-11-22 | 2017-05-31 | 合肥星服信息科技有限责任公司 | The vocal print filter method that a kind of frequency extremes and average are combined |
CN107610708B (en) * | 2017-06-09 | 2018-06-19 | 平安科技(深圳)有限公司 | Identify the method and apparatus of vocal print |
CN107622773B (en) * | 2017-09-08 | 2021-04-06 | 科大讯飞股份有限公司 | Audio feature extraction method and device and electronic equipment |
CN111279414B (en) * | 2017-11-02 | 2022-12-06 | 华为技术有限公司 | Segmentation-based feature extraction for sound scene classification |
-
2018
- 2018-03-29 CN CN201810273669.6A patent/CN110322886A/en active Pending
- 2018-12-29 US US16/652,028 patent/US10950255B2/en active Active
- 2018-12-29 SG SG11202008533VA patent/SG11202008533VA/en unknown
- 2018-12-29 WO PCT/CN2018/125491 patent/WO2019184517A1/en active Application Filing
- 2018-12-29 JP JP2020502951A patent/JP6908774B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN110322886A (en) | 2019-10-11 |
JP6908774B2 (en) | 2021-07-28 |
JP2020527255A (en) | 2020-09-03 |
US10950255B2 (en) | 2021-03-16 |
WO2019184517A1 (en) | 2019-10-03 |
US20200273483A1 (en) | 2020-08-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11202011791SA (en) | Pedestrian recognition method and device | |
EP3536158A4 (en) | Extraction method and extraction device | |
SG11202006204TA (en) | Identity verification method and device and electronic device | |
EP3892210A4 (en) | Thrombus extraction device and method | |
SG11202008533VA (en) | Audio fingerprint extraction method and device | |
ZA201902460B (en) | Identity recognition method and device | |
SG11202008548VA (en) | Audio Retrieval And Recognition Method And Device | |
SG11202107218TA (en) | Separation device and separation method | |
EP3895586C0 (en) | Extraction apparatus and extraction method | |
EP3547186A4 (en) | Fingerprint recognition method and terminal device | |
HK1251870A1 (en) | Fingerprint collection method and terminal | |
EP3682355A4 (en) | Digital device and biometric authentication method therein | |
EP3750092A4 (en) | Digital device and biometric authentication method therein | |
SG11202101538TA (en) | Information processing method and device | |
EP3734568A4 (en) | Data extraction method and device | |
SG11202008272RA (en) | Video feature extraction method and device | |
EP3842990A4 (en) | Face recognition method and device | |
GB201818884D0 (en) | Forthing device and method thereof | |
EP3722175A4 (en) | Periphery recognition device and periphery recognition method | |
EP3474223A4 (en) | Fingerprint reading device and fingerprint reading method | |
EP3745297A4 (en) | Fingerprint collecting device and fingerprint collecting method | |
EP3860148A4 (en) | Acoustic object extraction device and acoustic object extraction method | |
ZA201908003B (en) | Extraction system and apparatus and method thereof | |
GB2567519B (en) | Fingerprint recognition method and electronic device using the same | |
SG11202004317RA (en) | Information processing device and information processing method |