CN103531198B - 一种基于伪说话人聚类的语音情感特征规整化方法 - Google Patents
一种基于伪说话人聚类的语音情感特征规整化方法 Download PDFInfo
- Publication number
- CN103531198B CN103531198B CN201310534319.8A CN201310534319A CN103531198B CN 103531198 B CN103531198 B CN 103531198B CN 201310534319 A CN201310534319 A CN 201310534319A CN 103531198 B CN103531198 B CN 103531198B
- Authority
- CN
- China
- Prior art keywords
- speaker
- pseudo
- sample
- clustering
- feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000008451 emotion Effects 0.000 title claims abstract description 31
- 238000010606 normalization Methods 0.000 title claims abstract description 21
- 230000008909 emotion recognition Effects 0.000 claims abstract description 14
- 239000000463 material Substances 0.000 claims abstract description 9
- 239000000284 extract Substances 0.000 claims abstract description 4
- 230000002996 emotional effect Effects 0.000 claims description 10
- 230000035945 sensitivity Effects 0.000 claims description 8
- 238000000605 extraction Methods 0.000 claims description 5
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 238000006116 polymerization reaction Methods 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 238000000034 method Methods 0.000 abstract description 8
- 238000012360 testing method Methods 0.000 description 13
- 238000005516 engineering process Methods 0.000 description 12
- 238000002474 experimental method Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 6
- 238000012549 training Methods 0.000 description 5
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 229910052749 magnesium Inorganic materials 0.000 description 4
- 239000011777 magnesium Substances 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 3
- 230000003542 behavioural effect Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000002790 cross-validation Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 235000007926 Craterellus fallax Nutrition 0.000 description 1
- 240000007175 Datura inoxia Species 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 101150036841 minJ gene Proteins 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Landscapes
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310534319.8A CN103531198B (zh) | 2013-11-01 | 2013-11-01 | 一种基于伪说话人聚类的语音情感特征规整化方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310534319.8A CN103531198B (zh) | 2013-11-01 | 2013-11-01 | 一种基于伪说话人聚类的语音情感特征规整化方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103531198A CN103531198A (zh) | 2014-01-22 |
CN103531198B true CN103531198B (zh) | 2016-03-23 |
Family
ID=49933151
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310534319.8A Expired - Fee Related CN103531198B (zh) | 2013-11-01 | 2013-11-01 | 一种基于伪说话人聚类的语音情感特征规整化方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103531198B (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106782505A (zh) * | 2017-02-21 | 2017-05-31 | 南京工程学院 | 一种基于放电声音识别高压开关柜状态的方法 |
TWI654600B (zh) * | 2017-11-29 | 2019-03-21 | 隆宸星股份有限公司 | 語音情緒辨識系統與方法以及使用其之智慧型機器人 |
CN108122552B (zh) * | 2017-12-15 | 2021-10-15 | 上海智臻智能网络科技股份有限公司 | 语音情绪识别方法和装置 |
CN109961803A (zh) * | 2017-12-18 | 2019-07-02 | 上海智臻智能网络科技股份有限公司 | 语音情绪识别系统 |
CN109935241A (zh) * | 2017-12-18 | 2019-06-25 | 上海智臻智能网络科技股份有限公司 | 语音信息处理方法 |
CN109961776A (zh) * | 2017-12-18 | 2019-07-02 | 上海智臻智能网络科技股份有限公司 | 语音信息处理装置 |
CN109935240A (zh) * | 2017-12-18 | 2019-06-25 | 上海智臻智能网络科技股份有限公司 | 通过语音识别情绪的方法 |
CN110085220A (zh) * | 2018-01-26 | 2019-08-02 | 上海智臻智能网络科技股份有限公司 | 智能交互装置 |
CN108197115B (zh) * | 2018-01-26 | 2022-04-22 | 上海智臻智能网络科技股份有限公司 | 智能交互方法、装置、计算机设备和计算机可读存储介质 |
CN110085221A (zh) * | 2018-01-26 | 2019-08-02 | 上海智臻智能网络科技股份有限公司 | 语音情感交互方法、计算机设备和计算机可读存储介质 |
CN110085262A (zh) * | 2018-01-26 | 2019-08-02 | 上海智臻智能网络科技股份有限公司 | 语音情绪交互方法、计算机设备和计算机可读存储介质 |
CN108831450A (zh) * | 2018-03-30 | 2018-11-16 | 杭州鸟瞰智能科技股份有限公司 | 一种基于用户情绪识别的虚拟机器人人机交互方法 |
CN112204657B (zh) * | 2019-03-29 | 2023-12-22 | 微软技术许可有限责任公司 | 利用提前停止聚类的讲话者分离 |
CN113555038B (zh) * | 2021-07-05 | 2023-12-29 | 东南大学 | 基于无监督领域对抗学习的说话人无关语音情感识别方法及系统 |
CN117171693B (zh) * | 2023-10-30 | 2024-01-26 | 山东交通学院 | 一种木工打磨过程中的切割异常检测方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0146434A1 (en) * | 1983-11-08 | 1985-06-26 | TEXAS INSTRUMENTS FRANCE Société dite: | A speaker independent speech recognition process |
JP2003099084A (ja) * | 2001-07-13 | 2003-04-04 | Sony France Sa | 音声による感情合成方法及び装置 |
CN102663432A (zh) * | 2012-04-18 | 2012-09-12 | 电子科技大学 | 结合支持向量机二次识别的模糊核聚类语音情感识别方法 |
CN102779510A (zh) * | 2012-07-19 | 2012-11-14 | 东南大学 | 基于特征空间自适应投影的语音情感识别方法 |
-
2013
- 2013-11-01 CN CN201310534319.8A patent/CN103531198B/zh not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0146434A1 (en) * | 1983-11-08 | 1985-06-26 | TEXAS INSTRUMENTS FRANCE Société dite: | A speaker independent speech recognition process |
JP2003099084A (ja) * | 2001-07-13 | 2003-04-04 | Sony France Sa | 音声による感情合成方法及び装置 |
CN102663432A (zh) * | 2012-04-18 | 2012-09-12 | 电子科技大学 | 结合支持向量机二次识别的模糊核聚类语音情感识别方法 |
CN102779510A (zh) * | 2012-07-19 | 2012-11-14 | 东南大学 | 基于特征空间自适应投影的语音情感识别方法 |
Also Published As
Publication number | Publication date |
---|---|
CN103531198A (zh) | 2014-01-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103531198B (zh) | 一种基于伪说话人聚类的语音情感特征规整化方法 | |
CN110491416B (zh) | 一种基于lstm和sae的电话语音情感分析与识别方法 | |
Doddington et al. | The NIST speaker recognition evaluation–overview, methodology, systems, results, perspective | |
Heck et al. | Robustness to telephone handset distortion in speaker recognition by discriminative feature design | |
CN108231067A (zh) | 基于卷积神经网络与随机森林分类的声音场景识别方法 | |
CN110111797A (zh) | 基于高斯超矢量和深度神经网络的说话人识别方法 | |
Yücesoy et al. | A new approach with score-level fusion for the classification of a speaker age and gender | |
CN106531174A (zh) | 基于小波包分解和声谱图特征的动物声音识别方法 | |
Nassif et al. | Emotional speaker identification using a novel capsule nets model | |
Chenchah et al. | A bio-inspired emotion recognition system under real-life conditions | |
CN112562725A (zh) | 基于语谱图和胶囊网络的混合语音情感分类方法 | |
Revathi et al. | Text independent speaker recognition and speaker independent speech recognition using iterative clustering approach | |
Ganchev | Speaker recognition | |
Sinha et al. | Acoustic-phonetic feature based dialect identification in Hindi Speech | |
CN105845143A (zh) | 基于支持向量机的说话人确认方法及其系统 | |
Nawas et al. | Speaker recognition using random forest | |
Pao et al. | A study on the search of the most discriminative speech features in the speaker dependent speech emotion recognition | |
Jalil et al. | Speaker identification using convolutional neural network for clean and noisy speech samples | |
Hafen et al. | Speech information retrieval: a review | |
CN105976819A (zh) | 基于Rnorm得分归一化的说话人确认方法 | |
Koolagudi et al. | Speaker recognition in the case of emotional environment using transformation of speech features | |
CN116682463A (zh) | 一种多模态情感识别方法及系统 | |
Akinrinmade et al. | Creation of a Nigerian voice corpus for indigenous speaker recognition | |
Dwijayanti et al. | Speaker identification using a convolutional neural network | |
Karjigi et al. | Speech intelligibility assessment of dysarthria using Fisher vector encoding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20180608 Address after: 210037 Qixia district and Yanlu No. 408, Nanjing, Jiangsu Patentee after: Nanjing Boke Electronic Technology Co.,Ltd. Address before: 210096 No. four archway, 2, Jiangsu, Nanjing Patentee before: Southeast University |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20180709 Address after: 211103 No. 1009 Tianyuan East Road, Jiangning District, Nanjing, Jiangsu. Patentee after: LIXIN WIRELESS ELECTRONIC TECHNOLOGY Co.,Ltd. Address before: 210037 Qixia district and Yanlu No. 408, Nanjing, Jiangsu Patentee before: Nanjing Boke Electronic Technology Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20201123 Address after: 212, 2 / F, building 9, xingzhihui business garden, No.19 Xinghuo Road, Jiangbei new district, Nanjing, Jiangsu Province, 210046 Patentee after: Nanjing Lizhi psychological big data Industry Research Institute Co.,Ltd. Address before: 211103 No. 1009 Tianyuan East Road, Jiangning District, Nanjing, Jiangsu. Patentee before: LIXIN WIRELESS ELECTRONIC TECHNOLOGY Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220112 Address after: 211513 Room 204, No. 38, Donghong Road, Donggou central community, Longpao street, Liuhe District, Nanjing, Jiangsu Province Patentee after: Nanjing lingluniao Internet of things Technology Co.,Ltd. Address before: 210046 212, 2nd floor, building 9, xingzhihui business garden, 19 Xinghuo Road, Jiangbei new district, Nanjing City, Jiangsu Province Patentee before: Nanjing Lizhi psychological big data Industry Research Institute Co.,Ltd. |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160323 |