KR20170073113A - 음성의 톤, 템포 정보를 이용한 감정인식 방법 및 그 장치 - Google Patents
음성의 톤, 템포 정보를 이용한 감정인식 방법 및 그 장치 Download PDFInfo
- Publication number
- KR20170073113A KR20170073113A KR1020150181619A KR20150181619A KR20170073113A KR 20170073113 A KR20170073113 A KR 20170073113A KR 1020150181619 A KR1020150181619 A KR 1020150181619A KR 20150181619 A KR20150181619 A KR 20150181619A KR 20170073113 A KR20170073113 A KR 20170073113A
- Authority
- KR
- South Korea
- Prior art keywords
- value
- emotion
- voice
- interval
- information
- Prior art date
Links
- 230000008451 emotion Effects 0.000 title claims abstract description 67
- 238000000034 method Methods 0.000 title claims abstract description 37
- 238000013528 artificial neural network Methods 0.000 claims abstract description 35
- 230000008909 emotion recognition Effects 0.000 claims abstract description 31
- 239000000284 extract Substances 0.000 claims abstract description 5
- 230000002996 emotional effect Effects 0.000 claims description 4
- 238000005311 autocorrelation function Methods 0.000 claims description 3
- 241000282414 Homo sapiens Species 0.000 description 7
- 238000004891 communication Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 208000037656 Respiratory Sounds Diseases 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 241000282412 Homo Species 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000008921 facial expression Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000002243 primary neuron Anatomy 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 210000003900 secondary neuron Anatomy 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/63—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for estimating an emotional state
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Hospice & Palliative Care (AREA)
- Psychiatry (AREA)
- General Health & Medical Sciences (AREA)
- Child & Adolescent Psychology (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150181619A KR20170073113A (ko) | 2015-12-18 | 2015-12-18 | 음성의 톤, 템포 정보를 이용한 감정인식 방법 및 그 장치 |
PCT/KR2015/013968 WO2017104875A1 (fr) | 2015-12-18 | 2015-12-18 | Procédé de reconnaissance d'émotion utilisant des informations de ton et de rythme vocal, et appareil associé |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020150181619A KR20170073113A (ko) | 2015-12-18 | 2015-12-18 | 음성의 톤, 템포 정보를 이용한 감정인식 방법 및 그 장치 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20170073113A true KR20170073113A (ko) | 2017-06-28 |
Family
ID=59056830
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020150181619A KR20170073113A (ko) | 2015-12-18 | 2015-12-18 | 음성의 톤, 템포 정보를 이용한 감정인식 방법 및 그 장치 |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR20170073113A (fr) |
WO (1) | WO2017104875A1 (fr) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108806667B (zh) * | 2018-05-29 | 2020-04-17 | 重庆大学 | 基于神经网络的语音与情绪的同步识别方法 |
CN109147826B (zh) * | 2018-08-22 | 2022-12-27 | 平安科技(深圳)有限公司 | 音乐情感识别方法、装置、计算机设备及计算机存储介质 |
US10810382B2 (en) * | 2018-10-09 | 2020-10-20 | Disney Enterprises, Inc. | Automated conversion of vocabulary and narrative tone |
CN109243491B (zh) * | 2018-10-11 | 2023-06-02 | 平安科技(深圳)有限公司 | 在频谱上对语音进行情绪识别的方法、系统及存储介质 |
CN111627462B (zh) * | 2020-05-22 | 2023-12-19 | 上海师范大学 | 一种基于语义分析的情绪识别方法和设备 |
CN113327630B (zh) * | 2021-05-27 | 2023-05-09 | 平安科技(深圳)有限公司 | 语音情绪识别方法、装置、设备及存储介质 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI221574B (en) * | 2000-09-13 | 2004-10-01 | Agi Inc | Sentiment sensing method, perception generation method and device thereof and software |
US8788270B2 (en) * | 2009-06-16 | 2014-07-22 | University Of Florida Research Foundation, Inc. | Apparatus and method for determining an emotion state of a speaker |
US9020822B2 (en) * | 2012-10-19 | 2015-04-28 | Sony Computer Entertainment Inc. | Emotion recognition using auditory attention cues extracted from users voice |
-
2015
- 2015-12-18 WO PCT/KR2015/013968 patent/WO2017104875A1/fr active Application Filing
- 2015-12-18 KR KR1020150181619A patent/KR20170073113A/ko not_active Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
WO2017104875A1 (fr) | 2017-06-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Boles et al. | Voice biometrics: Deep learning-based voiceprint authentication system | |
KR20170073113A (ko) | 음성의 톤, 템포 정보를 이용한 감정인식 방법 및 그 장치 | |
KR101988222B1 (ko) | 대어휘 연속 음성 인식 장치 및 방법 | |
US8145486B2 (en) | Indexing apparatus, indexing method, and computer program product | |
JPH0352640B2 (fr) | ||
WO2011046474A2 (fr) | Procédé d'identification d'un locuteur sur la base de phonogrammes de parole aléatoire, basé sur l'égalisation des formants | |
CN108899033B (zh) | 一种确定说话人特征的方法及装置 | |
KR101616112B1 (ko) | 음성 특징 벡터를 이용한 화자 분리 시스템 및 방법 | |
KR101893789B1 (ko) | 정규화를 이용한 음성 구간 판단 방법 및 이를 위한 음성 구간 판단 장치 | |
KR101943381B1 (ko) | 심층 신경망을 이용한 음성 끝점 검출 방법 및 이를 위한 끝점 검출 장치 | |
CN112102850A (zh) | 情绪识别的处理方法、装置、介质及电子设备 | |
JP2018180334A (ja) | 感情認識装置、方法およびプログラム | |
CN110827853A (zh) | 语音特征信息提取方法、终端及可读存储介质 | |
Pao et al. | Combining acoustic features for improved emotion recognition in mandarin speech | |
KR101992955B1 (ko) | 정규화를 이용한 음성 구간 판단 방법 및 이를 위한 음성 구간 판단 장치 | |
JP2015055653A (ja) | 音声認識装置及び方法、並びに、電子機器 | |
KR102098956B1 (ko) | 음성인식장치 및 음성인식방법 | |
Hasija et al. | Recognition of children Punjabi speech using tonal non-tonal classifier | |
CN114822502A (zh) | 一种报警方法、报警装置、计算机设备、以及存储介质 | |
KR100391123B1 (ko) | 피치 단위 데이터 분석을 이용한 음성인식 방법 및 시스템 | |
Mishra et al. | Speaker identification, differentiation and verification using deep learning for human machine interface | |
Jamil et al. | Influences of age in emotion recognition of spontaneous speech: A case of an under-resourced language | |
Lertwongkhanakool et al. | An automatic real-time synchronization of live speech with its transcription approach | |
Laleye et al. | Automatic boundary detection based on entropy measures for text-independent syllable segmentation | |
JP2019101285A (ja) | 音声処理装置、音声処理方法及びプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E90F | Notification of reason for final refusal | ||
E601 | Decision to refuse application |