SG11202103561TA - Audio detection method and apparatus, and device and storage medium - Google Patents
Audio detection method and apparatus, and device and storage mediumInfo
- Publication number
- SG11202103561TA SG11202103561TA SG11202103561TA SG11202103561TA SG11202103561TA SG 11202103561T A SG11202103561T A SG 11202103561TA SG 11202103561T A SG11202103561T A SG 11202103561TA SG 11202103561T A SG11202103561T A SG 11202103561TA SG 11202103561T A SG11202103561T A SG 11202103561TA
- Authority
- SG
- Singapore
- Prior art keywords
- storage medium
- detection method
- audio detection
- audio
- medium
- Prior art date
Links
- 238000001514 detection method Methods 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/18—Artificial neural networks; Connectionist approaches
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811178750.2A CN109065069B (en) | 2018-10-10 | 2018-10-10 | Audio detection method, device, equipment and storage medium |
PCT/CN2019/102172 WO2020073743A1 (en) | 2018-10-10 | 2019-08-23 | Audio detection method and apparatus, and device and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202103561TA true SG11202103561TA (en) | 2021-05-28 |
Family
ID=64763727
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202103561TA SG11202103561TA (en) | 2018-10-10 | 2019-08-23 | Audio detection method and apparatus, and device and storage medium |
Country Status (4)
Country | Link |
---|---|
US (1) | US11948595B2 (en) |
CN (1) | CN109065069B (en) |
SG (1) | SG11202103561TA (en) |
WO (1) | WO2020073743A1 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109065069B (en) * | 2018-10-10 | 2020-09-04 | 广州市百果园信息技术有限公司 | Audio detection method, device, equipment and storage medium |
CN109949827A (en) * | 2019-03-15 | 2019-06-28 | 上海师范大学 | A kind of room acoustics Activity recognition method based on deep learning and intensified learning |
CN112182441A (en) * | 2019-07-02 | 2021-01-05 | 中国移动通信集团贵州有限公司 | Method and device for detecting violation data |
JP7290507B2 (en) * | 2019-08-06 | 2023-06-13 | 本田技研工業株式会社 | Information processing device, information processing method, recognition model and program |
CN111883139A (en) * | 2020-07-24 | 2020-11-03 | 北京字节跳动网络技术有限公司 | Method, apparatus, device and medium for screening target voices |
CN114125506B (en) * | 2020-08-28 | 2024-03-19 | 上海哔哩哔哩科技有限公司 | Voice auditing method and device |
CN113782036A (en) * | 2021-09-10 | 2021-12-10 | 北京声智科技有限公司 | Audio quality evaluation method and device, electronic equipment and storage medium |
US11948599B2 (en) * | 2022-01-06 | 2024-04-02 | Microsoft Technology Licensing, Llc | Audio event detection with window-based prediction |
Family Cites Families (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9020114B2 (en) * | 2002-04-29 | 2015-04-28 | Securus Technologies, Inc. | Systems and methods for detecting a call anomaly using biometric identification |
US8930261B2 (en) * | 2005-04-21 | 2015-01-06 | Verint Americas Inc. | Method and system for generating a fraud risk score using telephony channel based audio and non-audio data |
US20060248019A1 (en) * | 2005-04-21 | 2006-11-02 | Anthony Rajakumar | Method and system to detect fraud using voice data |
US9300790B2 (en) * | 2005-06-24 | 2016-03-29 | Securus Technologies, Inc. | Multi-party conversation analyzer and logger |
EP2122610B1 (en) * | 2007-01-31 | 2018-12-26 | Telecom Italia S.p.A. | Customizable method and system for emotional recognition |
CN101226743A (en) * | 2007-12-05 | 2008-07-23 | 浙江大学 | Method for recognizing speaker based on conversion of neutral and affection sound-groove model |
US8886663B2 (en) * | 2008-09-20 | 2014-11-11 | Securus Technologies, Inc. | Multi-party conversation analyzer and logger |
CN101770774B (en) * | 2009-12-31 | 2011-12-07 | 吉林大学 | Embedded-based open set speaker recognition method and system thereof |
CN201698746U (en) * | 2010-06-25 | 2011-01-05 | 北京安慧音通科技有限责任公司 | Portable multi-functional audio detector |
CN102572839B (en) * | 2010-12-14 | 2016-03-02 | 中国移动通信集团四川有限公司 | A kind of method and system controlling voice communication |
CN102436806A (en) * | 2011-09-29 | 2012-05-02 | 复旦大学 | Audio frequency copy detection method based on similarity |
CN102820033B (en) * | 2012-08-17 | 2013-12-04 | 南京大学 | Voiceprint identification method |
US20140123166A1 (en) * | 2012-10-26 | 2014-05-01 | Tektronix, Inc. | Loudness log for recovery of gated loudness measurements and associated analyzer |
CN103796183B (en) * | 2012-10-26 | 2017-08-04 | 中国移动通信集团上海有限公司 | A kind of refuse messages recognition methods and device |
CN104282303B (en) * | 2013-07-09 | 2019-03-29 | 威盛电子股份有限公司 | The method and its electronic device of speech recognition are carried out using Application on Voiceprint Recognition |
CN103731832A (en) * | 2013-12-26 | 2014-04-16 | 黄伟 | System and method for preventing phone and short message frauds |
CN105827787B (en) * | 2015-01-04 | 2019-12-17 | 中国移动通信集团公司 | number marking method and device |
US10142471B2 (en) * | 2015-03-02 | 2018-11-27 | Genesys Telecommunications Laboratories, Inc. | System and method for call progress detection |
CN104616666B (en) * | 2015-03-03 | 2018-05-25 | 广东小天才科技有限公司 | A kind of method and device for improving dialogue communication effectiveness based on speech analysis |
US10008209B1 (en) | 2015-09-25 | 2018-06-26 | Educational Testing Service | Computer-implemented systems and methods for speaker recognition using a neural network |
WO2017096473A1 (en) * | 2015-12-07 | 2017-06-15 | Syngrafii Inc. | Systems and methods for an advanced moderated online event |
CN107492382B (en) * | 2016-06-13 | 2020-12-18 | 阿里巴巴集团控股有限公司 | Voiceprint information extraction method and device based on neural network |
CN105869630B (en) * | 2016-06-27 | 2019-08-02 | 上海交通大学 | Speaker's voice spoofing attack detection method and system based on deep learning |
CN106791024A (en) * | 2016-11-30 | 2017-05-31 | 广东欧珀移动通信有限公司 | Voice messaging player method, device and terminal |
CN107610707B (en) * | 2016-12-15 | 2018-08-31 | 平安科技(深圳)有限公司 | A kind of method for recognizing sound-groove and device |
US20190052471A1 (en) * | 2017-08-10 | 2019-02-14 | Microsoft Technology Licensing, Llc | Personalized toxicity shield for multiuser virtual environments |
US10574597B2 (en) * | 2017-09-18 | 2020-02-25 | Microsoft Technology Licensing, Llc | Conversational log replay with voice and debugging information |
CN107527617A (en) * | 2017-09-30 | 2017-12-29 | 上海应用技术大学 | Monitoring method, apparatus and system based on voice recognition |
CN107919137A (en) * | 2017-10-25 | 2018-04-17 | 平安普惠企业管理有限公司 | The long-range measures and procedures for the examination and approval, device, equipment and readable storage medium storing program for executing |
CN108053840A (en) | 2017-12-29 | 2018-05-18 | 广州势必可赢网络科技有限公司 | A kind of Emotion identification method and system based on PCA-BP |
CN108269574B (en) * | 2017-12-29 | 2021-05-25 | 安徽科大讯飞医疗信息技术有限公司 | Method and device for processing voice signal to represent vocal cord state of user, storage medium and electronic equipment |
GB2571548A (en) * | 2018-03-01 | 2019-09-04 | Sony Interactive Entertainment Inc | User interaction monitoring |
CN108419091A (en) | 2018-03-02 | 2018-08-17 | 北京未来媒体科技股份有限公司 | A kind of verifying video content method and device based on machine learning |
CN108428447B (en) | 2018-06-19 | 2021-02-02 | 科大讯飞股份有限公司 | Voice intention recognition method and device |
CN109065069B (en) * | 2018-10-10 | 2020-09-04 | 广州市百果园信息技术有限公司 | Audio detection method, device, equipment and storage medium |
-
2018
- 2018-10-10 CN CN201811178750.2A patent/CN109065069B/en active Active
-
2019
- 2019-08-23 SG SG11202103561TA patent/SG11202103561TA/en unknown
- 2019-08-23 WO PCT/CN2019/102172 patent/WO2020073743A1/en active Application Filing
- 2019-08-23 US US17/282,732 patent/US11948595B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN109065069B (en) | 2020-09-04 |
US11948595B2 (en) | 2024-04-02 |
CN109065069A (en) | 2018-12-21 |
WO2020073743A1 (en) | 2020-04-16 |
US20220005493A1 (en) | 2022-01-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3955158A4 (en) | Object detection method and apparatus, electronic device, and storage medium | |
SG11202003818YA (en) | Key point detection method and apparatus, electronic device, and storage medium | |
EP3819903A4 (en) | Audio data processing method and apparatus, device and storage medium | |
SG11202010705QA (en) | Short video synthesis method and apparatus, and device and storage medium | |
SG11202103561TA (en) | Audio detection method and apparatus, and device and storage medium | |
EP3910551A4 (en) | Face detection method, apparatus, device, and storage medium | |
SG11202007036XA (en) | Method and apparatus for liveness detection, device, and storage medium | |
EP3605388A4 (en) | Face detection method and apparatus, computer device, and storage medium | |
EP3605537A4 (en) | Speech emotion detection method and apparatus, computer device, and storage medium | |
SG11202009794RA (en) | Key point detection method and apparatus, electronic device and storage medium | |
SG11202013074SA (en) | Method, apparatus and device for detecting lesion, and storage medium | |
EP3447769A4 (en) | Speech detection method and apparatus, and storage medium | |
SG11202009691TA (en) | Method and device for liveness detection, and storage medium | |
SG11202012861PA (en) | Target detection method and apparatus, device, and storage medium | |
EP3846120A4 (en) | Line segment detection method and apparatus, device, and computer-readable storage medium | |
SG11202106429VA (en) | Botnet domain name family detecting method, apparatus, device, and storage medium | |
SG11202005080VA (en) | Method, apparatus and system for liveness detection, electronic device, and storage medium | |
EP3627810A4 (en) | Proximity detection method and apparatus, storage medium, and electronic device | |
SG11202106539QA (en) | Audio signal transformation method, device, apparatus, and storage medium | |
SG11202004541WA (en) | Chatbot configuration method and apparatus, computer device, and storage medium | |
SG11202108783WA (en) | Pickup reminding method and apparatus, device, and storage medium | |
SG11202103654YA (en) | Audio playing and collection method, apparatus, and device and readable storage medium | |
SG11202101118SA (en) | Blockchain-based content processing method and apparatus, device, and storage medium | |
EP3661096A4 (en) | Signal processing method and device, apparatus, and computer-readable storage medium | |
GB202100237D0 (en) | Video publishing method and apparatus, device, and storage medium |