CN117409818A - Speech emotion recognition method and apparatus - Google Patents
Speech emotion recognition method and apparatus
- Publication number
- CN117409818A
- Authority
- CN
- China
- Prior art keywords
- audio frame
- feature
- emotion recognition
- historical
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Concepts
Concept | Type | Sections | Count |
---|---|---|---|
emotion recognition | Effects | title, claims, abstract, description | 123 |
method | Methods | title, claims, abstract, description | 59 |
vector | Substances | claims, abstract, description | 122 |
fusion | Effects | claims, abstract, description | 66 |
emotion | Effects | claims, description | 25 |
memory | Effects | claims, description | 21 |
extraction | Methods | claims, description | 9 |
computer program | Methods | claims, description | 5 |
training | Methods | description | 26 |
process | Effects | description | 9 |
function | Effects | description | 8 |
processing | Methods | description | 8 |
spectrum | Methods | description | 6 |
artificial neural network | Methods | description | 5 |
diagram | Methods | description | 5 |
convolutional neural network | Methods | description | 4 |
framing | Methods | description | 4 |
mechanism | Effects | description | 4 |
classification model | Methods | description | 3 |
detection method | Methods | description | 3 |
recurrent | Effects | description | 3 |
vocal cord | Anatomy | description | 3 |
autocorrelation function | Methods | description | 2 |
change | Effects | description | 2 |
chemical reaction | Methods | description | 2 |
communication | Methods | description | 2 |
neural network model | Methods | description | 2 |
optical | Effects | description | 2 |
short-term memory | Effects | description | 2 |
Averaging | Methods | description | 1 |
Nausea | Diseases | description | 1 |
analytical method | Methods | description | 1 |
array | Methods | description | 1 |
artificial intelligence | Methods | description | 1 |
attenuated | Effects | description | 1 |
calculation algorithm | Methods | description | 1 |
calculation method | Methods | description | 1 |
construction | Methods | description | 1 |
coupling | Effects | description | 1 |
coupling process | Methods | description | 1 |
coupling reaction | Methods | description | 1 |
defect | Effects | description | 1 |
design | Methods | description | 1 |
discharging | Methods | description | 1 |
filtration | Methods | description | 1 |
frameshift | Effects | description | 1 |
inspection | Methods | description | 1 |
migration | Effects | description | 1 |
migration | Methods | description | 1 |
mining | Methods | description | 1 |
mood | Effects | description | 1 |
natural language processing | Methods | description | 1 |
nausea | Effects | description | 1 |
response | Effects | description | 1 |
retained | Effects | description | 1 |
statistical model | Methods | description | 1 |
transformation | Effects | description | 1 |
vocal | Effects | description | 1 |
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/63—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/50—Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers; Centralised arrangements for recording messages
- H04M3/527—Centralised call answering arrangements not requiring operator intervention
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- General Health & Medical Sciences (AREA)
- Hospice & Palliative Care (AREA)
- Psychiatry (AREA)
- Child & Adolescent Psychology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210806418.6A CN117409818A (zh) | 2022-07-08 | 2022-07-08 | Speech emotion recognition method and apparatus |
PCT/CN2023/117475 WO2024008215A2 (fr) | 2022-07-08 | 2023-09-07 | Speech emotion recognition method and apparatus |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210806418.6A CN117409818A (zh) | 2022-07-08 | 2022-07-08 | Speech emotion recognition method and apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117409818A (zh) | 2024-01-16 |
Family
ID=89454303
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210806418.6A Pending CN117409818A (zh) | Speech emotion recognition method and apparatus | 2022-07-08 | 2022-07-08 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN117409818A (fr) |
WO (1) | WO2024008215A2 (fr) |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
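CN108305642B (zh) * | 2017-06-30 | 2019-07-19 | 腾讯科技(深圳)有限公司 | Method and apparatus for determining emotion information |
US11205444B2 (en) * | 2019-08-16 | 2021-12-21 | Adobe Inc. | Utilizing bi-directional recurrent encoders with multi-hop attention for speech emotion recognition |
CN110570879A (zh) * | 2019-09-11 | 2019-12-13 | 深圳壹账通智能科技有限公司 | Intelligent conversation method and apparatus based on emotion recognition, and computer device |
CN111028827B (zh) * | 2019-12-10 | 2023-01-24 | 深圳追一科技有限公司 | Interaction processing method, apparatus, device, and storage medium based on emotion recognition |
CN111524534B (zh) * | 2020-03-20 | 2021-04-09 | 北京捷通华声科技股份有限公司 | Speech analysis method, system, device, and storage medium |
CN113506586B (zh) * | 2021-06-18 | 2023-06-20 | 杭州摸象大数据科技有限公司 | Method and system for user emotion recognition |
CN114022192A (zh) * | 2021-10-20 | 2022-02-08 | 百融云创科技股份有限公司 | Data modeling method and system based on an intelligent marketing scenario |
CN114492579A (zh) * | 2021-12-25 | 2022-05-13 | 浙江大华技术股份有限公司 | Emotion recognition method, camera apparatus, emotion recognition apparatus, and storage apparatus |
CN114639150A (zh) * | 2022-03-16 | 2022-06-17 | 平安科技(深圳)有限公司 | Emotion recognition method and apparatus, computer device, and storage medium |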
2022
- 2022-07-08: CN application CN202210806418.6A, published as CN117409818A (zh), status: active, pending
2023
- 2023-09-07: WO application PCT/CN2023/117475, published as WO2024008215A2 (fr), status: unknown
Also Published As
Publication number | Publication date |
---|---|
WO2024008215A2 (fr) | 2024-01-11 |
WO2024008215A3 (fr) | 2024-02-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
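WO2021093449A1 (fr) | | Wake-up word detection method and apparatus employing artificial intelligence, device, and medium |
CN111312245B (zh) | | Voice response method, apparatus, and storage medium |
CN108428447B (zh) | | Speech intent recognition method and apparatus |
US20240021202A1 (en) | | Method and apparatus for recognizing voice, electronic device and medium |
CN113987179B (zh) | | Dialogue emotion recognition network model based on knowledge enhancement and backtracking loss, construction method, electronic device, and storage medium |
CN111276131A (zh) | | Method and system for integrating multiple types of acoustic features based on a deep neural network |
CN111832308B (zh) | | Method and apparatus for coherence processing of speech recognition text |
CN108735201A (zh) | | Continuous speech recognition method, apparatus, device, and storage medium |
CN110619871B (zh) | | Voice wake-up detection method, apparatus, device, and storage medium |
CN112259089B (zh) | | Speech recognition method and apparatus |
CN111081230A (zh) | | Speech recognition method and device |
CN110930975B (zh) | | Method and apparatus for outputting information |
CN113314119A (zh) | | Smart home control method and apparatus using speech recognition |
Gupta et al. | | Speech emotion recognition using SVM with thresholding fusion |
CN115497465A (zh) | | Voice interaction method, apparatus, electronic device, and storage medium |
CN115687934A (zh) | | Intent recognition method, apparatus, computer device, and storage medium |
CN113360683B (zh) | | Method for training a cross-modal retrieval model, and cross-modal retrieval method and apparatus |
CN113468857B (zh) | | Training method and apparatus for a style transfer model, electronic device, and storage medium |
CN114022192A (zh) | | Data modeling method and system based on an intelligent marketing scenario |
CN111554270B (zh) | | Training sample screening method and electronic device |
CN113516964B (zh) | | Speech synthesis method and readable storage medium |
CN117409818A (zh) | | Speech emotion recognition method and apparatus |
CN114373443A (zh) | | Speech synthesis method and apparatus, computing device, storage medium, and program product |
CN111414468A (zh) | | Dialogue script selection method, apparatus, and electronic device |
US12033618B1 (en) | | Relevant context determination |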
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |