SG11201801808RA - Audio recognition method and system - Google Patents

Audio recognition method and system

Info

Publication number
SG11201801808RA
SG11201801808RA SG11201801808RA SG11201801808RA SG11201801808RA SG 11201801808R A SG11201801808R A SG 11201801808RA SG 11201801808R A SG11201801808R A SG 11201801808RA SG 11201801808R A SG11201801808R A SG 11201801808RA SG 11201801808R A SG11201801808R A SG 11201801808RA
Authority
SG
Singapore
Prior art keywords
recognition method
audio recognition
audio
recognition
Prior art date
Application number
SG11201801808RA
Inventor
Zhijun Du
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Publication of SG11201801808RA publication Critical patent/SG11201801808RA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
SG11201801808RA 2015-09-24 2016-09-14 Audio recognition method and system SG11201801808RA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510618550.4A CN106558318B (en) 2015-09-24 2015-09-24 Audio recognition method and system
PCT/CN2016/099053 WO2017050175A1 (en) 2015-09-24 2016-09-14 Audio recognition method and system

Publications (1)

Publication Number Publication Date
SG11201801808RA true SG11201801808RA (en) 2018-04-27

Family

ID=58385690

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201801808RA SG11201801808RA (en) 2015-09-24 2016-09-14 Audio recognition method and system

Country Status (7)

Country Link
US (1) US10679647B2 (en)
EP (1) EP3355302B1 (en)
JP (1) JP6585835B2 (en)
KR (1) KR102077411B1 (en)
CN (1) CN106558318B (en)
SG (1) SG11201801808RA (en)
WO (1) WO2017050175A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10397663B2 (en) * 2016-04-08 2019-08-27 Source Digital, Inc. Synchronizing ancillary data to content including audio
CN108364661B (en) * 2017-12-15 2020-11-24 海尔优家智能科技(北京)有限公司 Visual voice performance evaluation method and device, computer equipment and storage medium
CN108615006B (en) * 2018-04-23 2020-04-17 百度在线网络技术(北京)有限公司 Method and apparatus for outputting information
CN109035419A (en) * 2018-08-06 2018-12-18 深圳市果壳文化科技有限公司 A kind of social contact method and system based on AR technology
WO2020102979A1 (en) * 2018-11-20 2020-05-28 深圳市欢太科技有限公司 Method and apparatus for processing voice information, storage medium and electronic device
KR20210037229A (en) 2019-09-27 2021-04-06 주식회사 케이티 User device, server and method for transmitting multimedia data through multi channels
CN111444384B (en) * 2020-03-31 2023-10-13 北京字节跳动网络技术有限公司 Audio key point determining method, device, equipment and storage medium
CN111640421B (en) * 2020-05-13 2023-06-16 广州国音智能科技有限公司 Speech comparison method, device, equipment and computer readable storage medium
CN112101301B (en) * 2020-11-03 2021-02-26 武汉工程大学 Good sound stability early warning method and device for screw water cooling unit and storage medium
US11929078B2 (en) * 2021-02-23 2024-03-12 Intuit, Inc. Method and system for user voice identification using ensembled deep learning algorithms
CN114255741B (en) * 2022-02-28 2022-06-10 腾讯科技(深圳)有限公司 Repetitive audio detection method, device and storage medium

Family Cites Families (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2969862B2 (en) * 1989-10-04 1999-11-02 松下電器産業株式会社 Voice recognition device
US6990453B2 (en) * 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
BR0309598A (en) 2002-04-25 2005-02-09 Shazam Entertainment Ltd Method for characterizing a relationship between first and second audio samples, computer program product, and computer system
US20070195963A1 (en) 2006-02-21 2007-08-23 Nokia Corporation Measuring ear biometrics for sound optimization
KR20090083098A (en) 2008-01-29 2009-08-03 삼성전자주식회사 Method for recognition music using harmonic characteristic and method for producing action of mobile robot using music recognition
US8706276B2 (en) * 2009-10-09 2014-04-22 The Trustees Of Columbia University In The City Of New York Systems, methods, and media for identifying matching audio
CN101720048B (en) * 2009-12-04 2011-06-01 山东大学 Audience rating information searching method for audience rating survey system based on audio frequency characteristics
US8886531B2 (en) 2010-01-13 2014-11-11 Rovi Technologies Corporation Apparatus and method for generating an audio fingerprint and using a two-stage query
JP5728888B2 (en) * 2010-10-29 2015-06-03 ソニー株式会社 Signal processing apparatus and method, and program
US20120296458A1 (en) 2011-05-18 2012-11-22 Microsoft Corporation Background Audio Listening for Content Recognition
US9461759B2 (en) 2011-08-30 2016-10-04 Iheartmedia Management Services, Inc. Identification of changed broadcast media items
US8586847B2 (en) 2011-12-02 2013-11-19 The Echo Nest Corporation Musical fingerprinting based on onset intervals
US8949872B2 (en) * 2011-12-20 2015-02-03 Yahoo! Inc. Audio fingerprint for content identification
US9292894B2 (en) 2012-03-14 2016-03-22 Digimarc Corporation Content recognition and synchronization using local caching
US9113203B2 (en) 2012-06-28 2015-08-18 Google Inc. Generating a sequence of audio fingerprints at a set top box
US9661361B2 (en) 2012-09-19 2017-05-23 Google Inc. Systems and methods for live media content matching
CN103729368B (en) * 2012-10-13 2016-12-21 复旦大学 A kind of robust audio recognition methods based on local spectrum iamge description
US8867028B2 (en) * 2012-10-19 2014-10-21 Interfiber Analysis, LLC System and/or method for measuring waveguide modes
US9373336B2 (en) 2013-02-04 2016-06-21 Tencent Technology (Shenzhen) Company Limited Method and device for audio recognition
CN103971689B (en) * 2013-02-04 2016-01-27 腾讯科技(深圳)有限公司 A kind of audio identification methods and device
US9269022B2 (en) * 2013-04-11 2016-02-23 Digimarc Corporation Methods for object recognition and related arrangements
CN104125509B (en) * 2013-04-28 2015-09-30 腾讯科技(深圳)有限公司 program identification method, device and server
GB2518663A (en) * 2013-09-27 2015-04-01 Nokia Corp Audio analysis apparatus
JP2015103088A (en) * 2013-11-26 2015-06-04 キヤノン株式会社 Image processing apparatus, image processing method, and program
US10321842B2 (en) * 2014-04-22 2019-06-18 Interaxon Inc. System and method for associating music with brain-state data
CN103971676B (en) * 2014-04-23 2017-07-14 上海师范大学 A kind of Rapid Speech isolated word recognition algorithm and application thereof, speech recognition system
US9894413B2 (en) 2014-06-12 2018-02-13 Google Llc Systems and methods for locally detecting consumed video content
US9946769B2 (en) 2014-06-20 2018-04-17 Google Llc Displaying information related to spoken dialogue in content playing on a device
US9838759B2 (en) 2014-06-20 2017-12-05 Google Inc. Displaying information related to content playing on a device
US9805125B2 (en) 2014-06-20 2017-10-31 Google Inc. Displaying a summary of media content items
US9905233B1 (en) 2014-08-07 2018-02-27 Digimarc Corporation Methods and apparatus for facilitating ambient content recognition using digital watermarks, and related arrangements
JP6464650B2 (en) 2014-10-03 2019-02-06 日本電気株式会社 Audio processing apparatus, audio processing method, and program
US10750236B2 (en) 2015-04-23 2020-08-18 The Nielsen Company (Us), Llc Automatic content recognition with local matching
US9743138B2 (en) 2015-07-31 2017-08-22 Mutr Llc Method for sound recognition task trigger
US9913056B2 (en) 2015-08-06 2018-03-06 Dolby Laboratories Licensing Corporation System and method to enhance speakers connected to devices with microphones

Also Published As

Publication number Publication date
WO2017050175A1 (en) 2017-03-30
JP6585835B2 (en) 2019-10-02
EP3355302B1 (en) 2022-02-09
EP3355302A4 (en) 2019-06-05
JP2018534609A (en) 2018-11-22
KR20180044957A (en) 2018-05-03
US20180174599A1 (en) 2018-06-21
EP3355302A1 (en) 2018-08-01
US10679647B2 (en) 2020-06-09
KR102077411B1 (en) 2020-02-13
CN106558318A (en) 2017-04-05
CN106558318B (en) 2020-04-28

Similar Documents

Publication Publication Date Title
ZA201900535B (en) Blockchain implemented method and system
ZA201900536B (en) Blockchain-implemented method and system
ZA201900509B (en) Blockchain-implemented method and system
ZA201705418B (en) Image recognition system and method
PT3371808T (en) Speech processing system and method
GB201719944D0 (en) Parking-lot-navigation system and method
HUE040549T2 (en) Method and system for recognizing physiological sound
EP3311309A4 (en) Methods and systems for object recognition
HK1248018A1 (en) Method and system for facial recognition
EP3319010A4 (en) Face recognition system and face recognition method
PT3260813T (en) Ranging system and ranging method
SG11201801808RA (en) Audio recognition method and system
GB2546504B (en) Audio system and method
GB201508963D0 (en) Audio identification method
GB2542548B (en) System and method
GB201510480D0 (en) System and method
GB201604012D0 (en) Refridgeration system and method
GB201515115D0 (en) System and method
GB201803529D0 (en) Radio-station-recommendation system and method
GB2536729B (en) A speech processing system and speech processing method
GB201620926D0 (en) Method and system
GB201616123D0 (en) System and method
IL248237A0 (en) Signature method and system
EP3267146A4 (en) Recognition device and recognition method
GB201619958D0 (en) Object recognition system and object recognition method