TWI307493B - Speech analyzer detecting pitch frequency, speech analyzing method, and speech analyzing program - Google Patents

Publication number
TWI307493B
Authority
TW
Taiwan
Prior art keywords
frequency
pitch
sound
autocorrelation waveform
peak
Prior art date
Application number
TW095120450A
Other languages
English (en)
Chinese (zh)
Other versions
TW200707409A (en)
Inventor
Shunji Mitsuyoshi
Kaoru Ogata
Fumiaki Monma
Original Assignee
Agi Inc
Shunji Mitsuyoshi
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agi Inc, Shunji Mitsuyoshi
Publication of TW200707409A
Application granted
Publication of TWI307493B

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2005169414 2005-06-09
JP2005181581 2005-06-22

Publications (2)

Publication Number Publication Date
TW200707409A (en) 2007-02-16
TWI307493B 2009-03-11

Family

ID=37498359

Family Applications (1)

Application Number Title Priority Date Filing Date
TW095120450A TW200707409A (en) 2005-06-09 2006-06-08 Speech analyzer detecting pitch frequency, speech analyzing method, and speech analyzing program

Country Status (9)

Country Link
US (1) US8738370B2
EP (1) EP1901281B1
JP (1) JP4851447B2
KR (1) KR101248353B1
CN (1) CN101199002B
CA (1) CA2611259C
RU (1) RU2403626C2
TW (1) TW200707409A
WO (1) WO2006132159A1

Families Citing this family (70)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006006366A1 (ja) * 2004-07-13 2006-01-19 Matsushita Electric Industrial Co., Ltd. Pitch frequency estimation device and pitch frequency estimation method
US8204747B2 (en) * 2006-06-23 2012-06-19 Panasonic Corporation Emotion recognition apparatus
JP2009047831A (ja) * 2007-08-17 2009-03-05 Toshiba Corp Feature extraction device, program, and feature extraction method
KR100970446B1 (ko) 2007-11-21 2010-07-16 한국전자통신연구원 Apparatus and method for determining variable noise level for frequency extension
US8148621B2 (en) * 2009-02-05 2012-04-03 Brian Bright Scoring of free-form vocals for video game
JP5278952B2 (ja) * 2009-03-09 2013-09-04 国立大学法人福井大学 Infant emotion diagnosis device and method
US8666734B2 (en) * 2009-09-23 2014-03-04 University Of Maryland, College Park Systems and methods for multiple pitch tracking using a multidimensional function and strength values
JP5696828B2 (ja) * 2010-01-12 2015-04-08 ヤマハ株式会社 Signal processing device
JP5834449B2 (ja) * 2010-04-22 2015-12-24 富士通株式会社 Utterance state detection device, utterance state detection program, and utterance state detection method
JP5494813B2 (ja) * 2010-09-29 2014-05-21 富士通株式会社 Respiration detection device and respiration detection method
RU2454735C1 (ru) * 2010-12-09 2012-06-27 Учреждение Российской академии наук Институт проблем управления им. В.А. Трапезникова РАН Method of processing a speech signal in the frequency domain
JP5803125B2 (ja) * 2011-02-10 2015-11-04 富士通株式会社 Device and program for detecting a suppressed state from voice
US8756061B2 (en) 2011-04-01 2014-06-17 Sony Computer Entertainment Inc. Speech syllable/vowel/phone boundary detection using auditory attention cues
JP5664480B2 (ja) * 2011-06-30 2015-02-04 富士通株式会社 Abnormal state detection device, telephone, abnormal state detection method, and program
US20130166042A1 (en) * 2011-12-26 2013-06-27 Hewlett-Packard Development Company, L.P. Media content-based control of ambient environment
KR101471741B1 (ko) * 2012-01-27 2014-12-11 이승우 Vocal practice system
RU2510955C2 (ru) * 2012-03-12 2014-04-10 Государственное казенное образовательное учреждение высшего профессионального образования Академия Федеральной службы охраны Российской Федерации (Академия ФСО России) Method of detecting emotions from voice
US20130297297A1 (en) * 2012-05-07 2013-11-07 Erhan Guven System and method for classification of emotion in human speech
CN103390409A (zh) * 2012-05-11 2013-11-13 鸿富锦精密工业(深圳)有限公司 Electronic device and method for detecting pornographic audio
RU2553413C2 (ru) * 2012-08-29 2015-06-10 Федеральное государственное бюджетное образовательное учреждение высшего профессионального образования "Воронежский государственный университет" (ФГБУ ВПО "ВГУ") Method of identifying a person's emotional state from voice
RU2546311C2 (ru) * 2012-09-06 2015-04-10 Федеральное государственное бюджетное образовательное учреждение высшего профессионального образования "Воронежский государственный университет" (ФГБУ ВПО "ВГУ") Method of estimating the fundamental frequency of a speech signal
US9020822B2 (en) 2012-10-19 2015-04-28 Sony Computer Entertainment Inc. Emotion recognition using auditory attention cues extracted from users voice
US9031293B2 (en) 2012-10-19 2015-05-12 Sony Computer Entertainment Inc. Multi-modal sensor based emotion recognition and emotional interface
US9672811B2 (en) 2012-11-29 2017-06-06 Sony Interactive Entertainment Inc. Combining auditory attention cues with phoneme posterior scores for phone/vowel/syllable boundary detection
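KR101499606B1 (ko) * 2013-05-10 2015-03-09 서강대학교산학협력단 System and method for calculating an interest score using feature information of a speech signal, and recording medium recording the same
JP6085538B2 (ja) * 2013-09-02 2017-02-22 本田技研工業株式会社 Sound recognition device, sound recognition method, and sound recognition program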
US10431209B2 (en) * 2016-12-30 2019-10-01 Google Llc Feedback controller for data transmissions
WO2015083357A1 (ja) * 2013-12-05 2015-06-11 Pst株式会社 Estimation device, program, estimation method, and estimation system
US9363378B1 (en) 2014-03-19 2016-06-07 Noble Systems Corporation Processing stored voice messages to identify non-semantic message characteristics
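JP6262613B2 (ja) * 2014-07-18 2018-01-17 ヤフー株式会社 Presentation device, presentation method, and presentation program
JP6122816B2 (ja) 2014-08-07 2017-04-26 シャープ株式会社 Audio output device, network system, audio output method, and audio output program
CN105590629B (zh) * 2014-11-18 2018-09-21 华为终端(东莞)有限公司 Speech processing method and device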
US11120816B2 (en) 2015-02-01 2021-09-14 Board Of Regents, The University Of Texas System Natural ear
US9773426B2 (en) * 2015-02-01 2017-09-26 Board Of Regents, The University Of Texas System Apparatus and method to facilitate singing intended notes
US9830921B2 (en) * 2015-08-17 2017-11-28 Qualcomm Incorporated High-band target signal control
JP6531567B2 (ja) * 2015-08-28 2019-06-19 ブラザー工業株式会社 Karaoke device and karaoke program
US9865281B2 (en) 2015-09-02 2018-01-09 International Business Machines Corporation Conversational analytics
WO2016046421A1 (en) * 2015-11-19 2016-03-31 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for voiced speech detection
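JP6306071B2 (ja) 2016-02-09 2018-04-04 Pst株式会社 Estimation device, estimation program, method of operating estimation device, and estimation system
KR101777302B1 (ko) * 2016-04-18 2017-09-12 충남대학교산학협력단 Voice frequency analysis system and voice frequency analysis method, and voice recognition system and voice recognition method using the same
CN105852823A (zh) * 2016-04-20 2016-08-17 吕忠华 Intelligent anger-calming prompt device for medical use
CN105725996A (zh) * 2016-04-20 2016-07-06 吕忠华 Medical apparatus and method for intelligently controlling emotional changes of human organs
JP6345729B2 (ja) * 2016-04-22 2018-06-20 Cocoro Sb株式会社 Response data collection system, customer response system, and program
JP6219448B1 (ja) * 2016-05-16 2017-10-25 Cocoro Sb株式会社 Customer response control system, customer response system, and program
CN106024015A (zh) * 2016-06-14 2016-10-12 上海航动科技有限公司 Method and system for monitoring call center agents
CN106132040B (zh) * 2016-06-20 2019-03-19 科大讯飞股份有限公司 Lighting control method and device for a singing environment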
US11351680B1 (en) * 2017-03-01 2022-06-07 Knowledge Initiatives LLC Systems and methods for enhancing robot/human cooperation and shared responsibility
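JP2018183474A (ja) * 2017-04-27 2018-11-22 ファミリーイナダ株式会社 Massage device and massage system
CN107368724A (zh) * 2017-06-14 2017-11-21 广东数相智能科技有限公司 Anti-cheating online survey method based on voiceprint recognition, electronic device, and storage medium
JP7103769B2 (ja) * 2017-09-05 2022-07-20 京セラ株式会社 Electronic device, mobile terminal, communication system, monitoring method, and program
JP6904198B2 (ja) 2017-09-25 2021-07-14 富士通株式会社 Speech processing program, speech processing method, and speech processing device
JP6907859B2 (ja) 2017-09-25 2021-07-21 富士通株式会社 Speech processing program, speech processing method, and speech processing device
CN108447470A (zh) * 2017-12-28 2018-08-24 中南大学 Emotional voice conversion method based on vocal tract and prosodic features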
US11538455B2 (en) 2018-02-16 2022-12-27 Dolby Laboratories Licensing Corporation Speech style transfer
JP6911208B2 (ja) * 2018-02-16 2021-07-28 ドルビー ラボラトリーズ ライセンシング コーポレイション Speech style transfer
US20190385711A1 (en) 2018-06-19 2019-12-19 Ellipsis Health, Inc. Systems and methods for mental health assessment
JP7608171B2 (ja) 2018-06-19 2025-01-06 エリプシス・ヘルス・インコーポレイテッド Systems and methods for mental health assessment
EP3821815A4 (en) 2018-07-13 2021-12-29 Life Science Institute, Inc. Mental/nervous system disorder estimation system, estimation program, and estimation method
US12029579B2 (en) 2018-07-13 2024-07-09 Pst Inc. Apparatus for estimating mental/neurological disease
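KR20200064539A (ko) 2018-11-29 2020-06-08 주식회사 위드마인드 Emotion analysis method based on an emotion map classified by features of pitch and volume information
JP7402396B2 (ja) * 2020-01-07 2023-12-21 株式会社鉄人化計画 Emotion analysis device, emotion analysis method, and emotion analysis program
JP7265293B2 (ja) 2020-01-09 2023-04-26 Pst株式会社 Device for estimating mental and neurological disorders using voice
TWI752551B (zh) * 2020-07-13 2022-01-11 國立屏東大學 Cluttering detection method, cluttering detection device, and computer program product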
US20220189444A1 (en) * 2020-12-14 2022-06-16 Slate Digital France Note stabilization and transition boost in automatic pitch correction system
IT202100003821A1 (it) * 2021-02-19 2022-08-19 Univ Pisa Method of interacting with objects
CN113707180A (zh) * 2021-08-10 2021-11-26 漳州立达信光电子科技有限公司 Method and device for detecting crying sounds
US12527931B2 (en) 2021-11-01 2026-01-20 Unitedhealth Group Incorporated Machine learning techniques for optimized breathing therapy
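TWI902280B (zh) * 2024-05-31 2025-10-21 瑞昱半導體股份有限公司 Karaoke device and singing scoring system thereof
CN118588064B (zh) * 2024-07-31 2024-10-22 金纪科技有限公司 Method and system for detecting fake audio in non-contact detention interviews
CN119296565B (zh) * 2024-12-10 2025-04-01 北京国旺盛源智能终端科技有限公司 Split, detachable operating device with audio data acquisition function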

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4064363A (en) * 1974-07-25 1977-12-20 Northrop Corporation Vocoder systems providing wave form analysis and synthesis using fourier transform representative signals
RU2007763C1 (ru) * 1991-04-04 1994-02-15 Завод "Калугаприбор" Method of extracting the fundamental tone from a speech signal
DE69233794D1 (de) * 1991-06-11 2010-09-23 Qualcomm Inc Variable bit rate vocoder
JPH0519793A (ja) * 1991-07-11 1993-01-29 Hitachi Ltd Pitch extraction method
JP2812184B2 (ja) * 1994-02-23 1998-10-22 日本電気株式会社 Complex cepstrum analysis device for speech
KR0155798B1 (ko) * 1995-01-27 1998-12-15 김광호 Speech signal encoding and decoding method
JP3840684B2 (ja) * 1996-02-01 2006-11-01 ソニー株式会社 Pitch extraction device and pitch extraction method
JPH10187178A (ja) 1996-10-28 1998-07-14 Omron Corp Singing emotion analysis device and scoring device
US5973252A (en) * 1997-10-27 1999-10-26 Auburn Audio Technologies, Inc. Pitch detection and intonation correction apparatus and method
KR100269216B1 (ko) * 1998-04-16 2000-10-16 윤종용 Pitch determination system and method using spectro-temporal autocorrelation
JP3251555B2 (ja) 1998-12-10 2002-01-28 科学技術振興事業団 Signal analysis device
US6463415B2 (en) * 1999-08-31 2002-10-08 Accenture Llp 69voice authentication system and method for regulating border crossing
US6151571A (en) 1999-08-31 2000-11-21 Andersen Consulting System, method and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters
US7043430B1 (en) * 1999-11-23 2006-05-09 Infotalk Corporation Limitied System and method for speech recognition using tonal modeling
JP2001154681A (ja) * 1999-11-30 2001-06-08 Sony Corp Speech processing device, speech processing method, and recording medium
US7139699B2 (en) * 2000-10-06 2006-11-21 Silverman Stephen E Method for analysis of vocal jitter for near-term suicidal risk assessment
EP1256937B1 (en) 2001-05-11 2006-11-02 Sony France S.A. Emotion recognition method and device
EP1262844A1 (en) * 2001-06-01 2002-12-04 Sony International (Europe) GmbH Method for controlling a man-machine-interface unit
DE60230856D1 (de) 2001-07-13 2009-03-05 Panasonic Corp Audio signal decoding device and audio signal encoding device
JP2003108197A (ja) 2001-07-13 2003-04-11 Matsushita Electric Ind Co Ltd Audio signal decoding device and audio signal encoding device
KR100393899B1 (ko) * 2001-07-27 2003-08-09 어뮤즈텍(주) Two-step pitch determination method and apparatus
IL144818A (en) * 2001-08-09 2006-08-20 Voicesense Ltd Method and apparatus for speech analysis
JP3841705B2 (ja) 2001-09-28 2006-11-01 日本電信電話株式会社 Occupancy extraction device and fundamental frequency extraction device, methods therefor, programs therefor, and recording medium recording the programs
US7124075B2 (en) * 2001-10-26 2006-10-17 Dmitry Edward Terez Methods and apparatus for pitch determination
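JP3806030B2 (ja) * 2001-12-28 2006-08-09 キヤノン電子株式会社 Information processing apparatus and method
JP3960834B2 (ja) * 2002-03-19 2007-08-15 松下電器産業株式会社 Speech enhancement device and speech enhancement method
JP2004240214A (ja) * 2003-02-06 2004-08-26 Nippon Telegr & Teleph Corp <Ntt> Acoustic signal discrimination method, acoustic signal discrimination device, and acoustic signal discrimination program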
SG120121A1 (en) * 2003-09-26 2006-03-28 St Microelectronics Asia Pitch detection of speech signals
US20050144002A1 (en) * 2003-12-09 2005-06-30 Hewlett-Packard Development Company, L.P. Text-to-speech conversion with associated mood tag
JP4965265B2 (ja) 2004-01-09 2012-07-04 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Distributed power generation system
JP4643640B2 (ja) 2005-04-13 2011-03-02 株式会社日立製作所 Atmosphere control device

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI401061B (zh) * 2009-12-16 2013-07-11 Ind Tech Res Inst Activity monitoring method and system
TWI660160B (zh) * 2015-04-27 2019-05-21 Otohear Consultants Inc. System and method for detecting a mobile noise source
US10311894B2 (en) 2015-04-27 2019-06-04 Otocon Inc. System and method for locating mobile noise source
US10726863B2 (en) 2015-04-27 2020-07-28 Otocon Inc. System and method for locating mobile noise source

Also Published As

Publication number Publication date
WO2006132159A1 (ja) 2006-12-14
US20090210220A1 (en) 2009-08-20
JP4851447B2 (ja) 2012-01-11
JPWO2006132159A1 (ja) 2009-01-08
US8738370B2 (en) 2014-05-27
EP1901281A4 (en) 2011-04-13
RU2007149237A (ru) 2009-07-20
EP1901281A1 (en) 2008-03-19
KR20080019278A (ko) 2008-03-03
CA2611259A1 (en) 2006-12-14
CA2611259C (en) 2016-03-22
TW200707409A (en) 2007-02-16
CN101199002B (zh) 2011-09-07
EP1901281B1 (en) 2013-03-20
CN101199002A (zh) 2008-06-11
RU2403626C2 (ru) 2010-11-10
KR101248353B1 (ko) 2013-04-02

Similar Documents

Publication Publication Date Title
TWI307493B
Dar et al. Speech databases, speech features, and classifiers in speech emotion recognition: A review
Pernet et al. The role of pitch and timbre in voice gender categorization
Wanderley et al. The musical significance of clarinetists' ancillary gestures: An exploration of the field
Carron et al. Speaking about sounds: a tool for communication on sound features
JP4495907B2 (ja) 音声の分析の方法及び装置
Johar Emotion, affect and personality in speech: The Bias of language and paralanguage
Polzehl Personality in speech
Gygi Factors in the identification of environmental sounds
Reed et al. Shifting ambiguity, collapsing indeterminacy: Designing with data as Baradian apparatus
Lech et al. Stress and emotion recognition using acoustic speech analysis
Chanda et al. A deep audiovisual approach for human confidence classification
Grill Perceptually informed organization of textural sounds
Ma Emotion-aware voice interfaces based on speech signal processing
Grigorev et al. An Electroglottographic Method for Assessing the Emotional State of the Speaker
CN117690456A (zh) Neural-network-based intelligent spoken-language training method, system, and device for minority languages
Wen et al. What a deep song: The role of music features in perceived depth
Telembici et al. Emotion Recognition Audio Database for Service Robots
Talkar et al. Brief Report: Quantifying Speech Production Coordination from Non-and Minimally-Speaking Individuals
Qiu et al. Machine Learning in Human Emotion Detection from the Speech
Handa et al. An experimental and statistical analysis to assess impact of regional accent on distress non-linguistic scream of young women
Korade et al. Induced Mood Shift and Cognitive Adaptation During free Play in a Changing Tonic Context
Hosain Enhancing Speech Emotion Recognition through Bone-Conducted Speech: Development, Dataset Creation, and Cross-Cultural Analysis
WO2016039465A1 (ja) Acoustic analysis device
JP2016057570A (ja) Acoustic analysis device

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees