CN117409818A - Speech emotion recognition method and apparatus - Google Patents

Speech emotion recognition method and apparatus

Info

Publication number
CN117409818A
Authority
CN
China
Prior art keywords
audio frame
feature
emotion recognition
historical
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210806418.6A
Other languages
English (en)
Chinese (zh)
Inventor
刘汝洲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SF Technology Co Ltd
Original Assignee
SF Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2022-07-08
Filing date
2022-07-08
Publication date
2024-01-16
Application filed by SF Technology Co Ltd
Priority to CN202210806418.6A (CN117409818A)
Priority to PCT/CN2023/117475 (WO2024008215A2)
Publication of CN117409818A
Current legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/63 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/50 Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers; Centralised arrangements for recording messages
    • H04M3/527 Centralised call answering arrangements not requiring operator intervention

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychiatry (AREA)
  • Child & Adolescent Psychology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN202210806418.6A 2022-07-08 2022-07-08 Speech emotion recognition method and apparatus Pending CN117409818A (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202210806418.6A CN117409818A (zh) 2022-07-08 2022-07-08 Speech emotion recognition method and apparatus
PCT/CN2023/117475 WO2024008215A2 (fr) 2022-07-08 2023-09-07 Speech emotion recognition method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210806418.6A CN117409818A (zh) 2022-07-08 2022-07-08 Speech emotion recognition method and apparatus

Publications (1)

Publication Number Publication Date
CN117409818A (zh) 2024-01-16

Family

ID=89454303

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210806418.6A Pending CN117409818A (zh) 2022-07-08 2022-07-08 Speech emotion recognition method and apparatus

Country Status (2)

Country Link
CN (1) CN117409818A (fr)
WO (1) WO2024008215A2 (fr)

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108305642B (zh) * 2017-06-30 2019-07-19 腾讯科技(深圳)有限公司 Method and apparatus for determining emotion information
US11205444B2 (en) * 2019-08-16 2021-12-21 Adobe Inc. Utilizing bi-directional recurrent encoders with multi-hop attention for speech emotion recognition
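CN110570879A (zh) * 2019-09-11 2019-12-13 深圳壹账通智能科技有限公司 Intelligent conversation method, apparatus, and computer device based on emotion recognition
CN111028827B (zh) * 2019-12-10 2023-01-24 深圳追一科技有限公司 Interaction processing method, apparatus, device, and storage medium based on emotion recognition
CN111524534B (zh) * 2020-03-20 2021-04-09 北京捷通华声科技股份有限公司 Speech analysis method, system, device, and storage medium
CN113506586B (zh) * 2021-06-18 2023-06-20 杭州摸象大数据科技有限公司 Method and system for user emotion recognition
CN114022192A (zh) * 2021-10-20 2022-02-08 百融云创科技股份有限公司 Data modeling method and system based on an intelligent marketing scenario
CN114492579A (zh) * 2021-12-25 2022-05-13 浙江大华技术股份有限公司 Emotion recognition method, camera apparatus, emotion recognition apparatus, and storage apparatus
CN114639150A (zh) * 2022-03-16 2022-06-17 平安科技(深圳)有限公司 Emotion recognition method, apparatus, computer device, and storage medium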

Also Published As

Publication number Publication date
WO2024008215A2 (fr) 2024-01-11
WO2024008215A3 (fr) 2024-02-29

Similar Documents

Publication Publication Date Title
WO2021093449A1 (fr) Wake-up word detection method and apparatus employing artificial intelligence, device, and medium
CN111312245B (zh) Voice response method, apparatus, and storage medium
CN108428447B (zh) Speech intent recognition method and apparatus
US20240021202A1 (en) Method and apparatus for recognizing voice, electronic device and medium
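CN113987179B (zh) Conversational emotion recognition network model based on knowledge enhancement and backtracking loss, construction method, electronic device, and storage medium
CN111276131A (zh) Method and system for integrating multiple types of acoustic features based on a deep neural network
CN111832308B (zh) Method and apparatus for coherence processing of speech recognition text
CN108735201A (zh) Continuous speech recognition method, apparatus, device, and storage medium
CN110619871B (zh) Voice wake-up detection method, apparatus, device, and storage medium
CN112259089B (zh) Speech recognition method and apparatus
CN111081230A (zh) Speech recognition method and device
CN110930975B (zh) Method and apparatus for outputting information
CN113314119A (zh) Speech recognition smart home control method and apparatus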
Gupta et al. Speech emotion recognition using SVM with thresholding fusion
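CN115497465A (zh) Voice interaction method, apparatus, electronic device, and storage medium
CN115687934A (zh) Intent recognition method, apparatus, computer device, and storage medium
CN113360683B (zh) Method for training a cross-modal retrieval model, and cross-modal retrieval method and apparatus
CN113468857B (zh) Training method for a style transfer model, apparatus, electronic device, and storage medium
CN114022192A (zh) Data modeling method and system based on an intelligent marketing scenario
CN111554270B (zh) Training sample screening method and electronic device
CN113516964B (zh) Speech synthesis method and readable storage medium
CN117409818A (zh) Speech emotion recognition method and apparatus
CN114373443A (zh) Speech synthesis method and apparatus, computing device, storage medium, and program product
CN111414468A (zh) Dialogue script selection method, apparatus, and electronic device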
US12033618B1 (en) Relevant context determination

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination