SG11202103561TA - Audio detection method and apparatus, and device and storage medium

Info

Publication number
SG11202103561TA
Authority
SG
Singapore
Prior art keywords
storage medium
detection method
audio detection
audio
medium
Application number
SG11202103561TA
Inventor
Zhen Li
Zhenchuan Huang
Yu Zou
Original Assignee
Bigo Tech Pte Ltd
Priority date (assumed; not a legal conclusion)
2018-10-10
Filing date
2019-08-23
Publication date
2021-05-28
Application filed by Bigo Tech Pte Ltd
Publication of SG11202103561TA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/16 Speech classification or search using artificial neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/06 Decision making techniques; Pattern matching strategies
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/18 Artificial neural networks; Connectionist approaches
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/26 Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
SG11202103561TA 2018-10-10 2019-08-23 Audio detection method and apparatus, and device and storage medium SG11202103561TA (en)

Applications Claiming Priority (2)

Application Number Publication Number Priority Date Filing Date Title
CN201811178750.2A CN109065069B (en) 2018-10-10 2018-10-10 Audio detection method, device, equipment and storage medium
PCT/CN2019/102172 WO2020073743A1 (en) - 2019-08-23 Audio detection method and apparatus, and device and storage medium

Publications (1)

Publication Number Publication Date
SG11202103561TA (en) 2021-05-28

Family

ID=64763727

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
SG11202103561TA SG11202103561TA (en) 2018-10-10 2019-08-23 Audio detection method and apparatus, and device and storage medium

Country Status (4)

Country Link
US (1) US11948595B2 (en)
CN (1) CN109065069B (en)
SG (1) SG11202103561TA (en)
WO (1) WO2020073743A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109065069B (en) * 2018-10-10 2020-09-04 广州市百果园信息技术有限公司 Audio detection method, device, equipment and storage medium
CN109949827A (en) * 2019-03-15 2019-06-28 上海师范大学 A kind of room acoustics Activity recognition method based on deep learning and intensified learning
CN112182441A (en) * 2019-07-02 2021-01-05 中国移动通信集团贵州有限公司 Method and device for detecting violation data
JP7290507B2 (en) * 2019-08-06 2023-06-13 本田技研工業株式会社 Information processing device, information processing method, recognition model and program
CN111883139A (en) * 2020-07-24 2020-11-03 北京字节跳动网络技术有限公司 Method, apparatus, device and medium for screening target voices
CN114125506B (en) * 2020-08-28 2024-03-19 上海哔哩哔哩科技有限公司 Voice auditing method and device
CN113782036A (en) * 2021-09-10 2021-12-10 北京声智科技有限公司 Audio quality evaluation method and device, electronic equipment and storage medium
US11948599B2 (en) * 2022-01-06 2024-04-02 Microsoft Technology Licensing, Llc Audio event detection with window-based prediction

Family Cites Families (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9020114B2 (en) * 2002-04-29 2015-04-28 Securus Technologies, Inc. Systems and methods for detecting a call anomaly using biometric identification
US8930261B2 (en) * 2005-04-21 2015-01-06 Verint Americas Inc. Method and system for generating a fraud risk score using telephony channel based audio and non-audio data
US20060248019A1 (en) * 2005-04-21 2006-11-02 Anthony Rajakumar Method and system to detect fraud using voice data
US9300790B2 (en) * 2005-06-24 2016-03-29 Securus Technologies, Inc. Multi-party conversation analyzer and logger
EP2122610B1 (en) * 2007-01-31 2018-12-26 Telecom Italia S.p.A. Customizable method and system for emotional recognition
CN101226743A (en) * 2007-12-05 2008-07-23 浙江大学 Method for recognizing speaker based on conversion of neutral and affection sound-groove model
US8886663B2 (en) * 2008-09-20 2014-11-11 Securus Technologies, Inc. Multi-party conversation analyzer and logger
CN101770774B (en) * 2009-12-31 2011-12-07 吉林大学 Embedded-based open set speaker recognition method and system thereof
CN201698746U (en) * 2010-06-25 2011-01-05 北京安慧音通科技有限责任公司 Portable multi-functional audio detector
CN102572839B (en) * 2010-12-14 2016-03-02 中国移动通信集团四川有限公司 A kind of method and system controlling voice communication
CN102436806A (en) * 2011-09-29 2012-05-02 复旦大学 Audio frequency copy detection method based on similarity
CN102820033B (en) * 2012-08-17 2013-12-04 南京大学 Voiceprint identification method
US20140123166A1 (en) * 2012-10-26 2014-05-01 Tektronix, Inc. Loudness log for recovery of gated loudness measurements and associated analyzer
CN103796183B (en) * 2012-10-26 2017-08-04 中国移动通信集团上海有限公司 A kind of refuse messages recognition methods and device
CN104282303B (en) * 2013-07-09 2019-03-29 威盛电子股份有限公司 The method and its electronic device of speech recognition are carried out using Application on Voiceprint Recognition
CN103731832A (en) * 2013-12-26 2014-04-16 黄伟 System and method for preventing phone and short message frauds
CN105827787B (en) * 2015-01-04 2019-12-17 中国移动通信集团公司 number marking method and device
US10142471B2 (en) * 2015-03-02 2018-11-27 Genesys Telecommunications Laboratories, Inc. System and method for call progress detection
CN104616666B (en) * 2015-03-03 2018-05-25 广东小天才科技有限公司 A kind of method and device for improving dialogue communication effectiveness based on speech analysis
US10008209B1 (en) 2015-09-25 2018-06-26 Educational Testing Service Computer-implemented systems and methods for speaker recognition using a neural network
WO2017096473A1 (en) * 2015-12-07 2017-06-15 Syngrafii Inc. Systems and methods for an advanced moderated online event
CN107492382B (en) * 2016-06-13 2020-12-18 阿里巴巴集团控股有限公司 Voiceprint information extraction method and device based on neural network
CN105869630B (en) * 2016-06-27 2019-08-02 上海交通大学 Speaker's voice spoofing attack detection method and system based on deep learning
CN106791024A (en) * 2016-11-30 2017-05-31 广东欧珀移动通信有限公司 Voice messaging player method, device and terminal
CN107610707B (en) * 2016-12-15 2018-08-31 平安科技(深圳)有限公司 A kind of method for recognizing sound-groove and device
US20190052471A1 (en) * 2017-08-10 2019-02-14 Microsoft Technology Licensing, Llc Personalized toxicity shield for multiuser virtual environments
US10574597B2 (en) * 2017-09-18 2020-02-25 Microsoft Technology Licensing, Llc Conversational log replay with voice and debugging information
CN107527617A (en) * 2017-09-30 2017-12-29 上海应用技术大学 Monitoring method, apparatus and system based on voice recognition
CN107919137A (en) * 2017-10-25 2018-04-17 平安普惠企业管理有限公司 The long-range measures and procedures for the examination and approval, device, equipment and readable storage medium storing program for executing
CN108053840A (en) 2017-12-29 2018-05-18 广州势必可赢网络科技有限公司 A kind of Emotion identification method and system based on PCA-BP
CN108269574B (en) * 2017-12-29 2021-05-25 安徽科大讯飞医疗信息技术有限公司 Method and device for processing voice signal to represent vocal cord state of user, storage medium and electronic equipment
GB2571548A (en) * 2018-03-01 2019-09-04 Sony Interactive Entertainment Inc User interaction monitoring
CN108419091A (en) 2018-03-02 2018-08-17 北京未来媒体科技股份有限公司 A kind of verifying video content method and device based on machine learning
CN108428447B (en) 2018-06-19 2021-02-02 科大讯飞股份有限公司 Voice intention recognition method and device
CN109065069B (en) * 2018-10-10 2020-09-04 广州市百果园信息技术有限公司 Audio detection method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN109065069B (en) 2020-09-04
US11948595B2 (en) 2024-04-02
CN109065069A (en) 2018-12-21
WO2020073743A1 (en) 2020-04-16
US20220005493A1 (en) 2022-01-06

Similar Documents

Publication Publication Date Title
EP3955158A4 (en) Object detection method and apparatus, electronic device, and storage medium
SG11202003818YA (en) Key point detection method and apparatus, electronic device, and storage medium
EP3819903A4 (en) Audio data processing method and apparatus, device and storage medium
SG11202010705QA (en) Short video synthesis method and apparatus, and device and storage medium
SG11202103561TA (en) Audio detection method and apparatus, and device and storage medium
EP3910551A4 (en) Face detection method, apparatus, device, and storage medium
SG11202007036XA (en) Method and apparatus for liveness detection, device, and storage medium
EP3605388A4 (en) Face detection method and apparatus, computer device, and storage medium
EP3605537A4 (en) Speech emotion detection method and apparatus, computer device, and storage medium
SG11202009794RA (en) Key point detection method and apparatus, electronic device and storage medium
SG11202013074SA (en) Method, apparatus and device for detecting lesion, and storage medium
EP3447769A4 (en) Speech detection method and apparatus, and storage medium
SG11202009691TA (en) Method and device for liveness detection, and storage medium
SG11202012861PA (en) Target detection method and apparatus, device, and storage medium
EP3846120A4 (en) Line segment detection method and apparatus, device, and computer-readable storage medium
SG11202106429VA (en) Botnet domain name family detecting method, apparatus, device, and storage medium
SG11202005080VA (en) Method, apparatus and system for liveness detection, electronic device, and storage medium
EP3627810A4 (en) Proximity detection method and apparatus, storage medium, and electronic device
SG11202106539QA (en) Audio signal transformation method, device, apparatus, and storage medium
SG11202004541WA (en) Chatbot configuration method and apparatus, computer device, and storage medium
SG11202108783WA (en) Pickup reminding method and apparatus, device, and storage medium
SG11202103654YA (en) Audio playing and collection method, apparatus, and device and readable storage medium
SG11202101118SA (en) Blockchain-based content processing method and apparatus, device, and storage medium
EP3661096A4 (en) Signal processing method and device, apparatus, and computer-readable storage medium
GB202100237D0 (en) Video publishing method and apparatus, device, and storage medium