JP7407190B2 - Utterance analysis device, utterance analysis method, and program - Google Patents

Info

Publication number
JP7407190B2
Authority
JP
Japan
Prior art keywords
data
utterance
category
likelihood
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021529930A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2021002137A5
JPWO2021002137A1
Inventor
夏樹 佐伯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Intellectual Property Management Co Ltd
Original Assignee
Panasonic Intellectual Property Management Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-07-04
Filing date
2020-06-02
Publication date
2023-12-28
Application filed by Panasonic Intellectual Property Management Co Ltd
Publication of JPWO2021002137A1
Publication of JPWO2021002137A5
Application granted
Publication of JP7407190B2
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G06F16/353 Clustering; Classification into predefined classes
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3343 Query execution using phonetics
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3344 Query execution using natural language analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295 Named entity recognition
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G06F40/35 Discourse or dialogue representation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/42 Data-driven translation
    • G06F40/44 Statistical methods, e.g. probability models
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
JP2021529930A 2019-07-04 2020-06-02 Utterance analysis device, utterance analysis method, and program Active JP7407190B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
JP2019125454 2019-07-04
JP2019125454 2019-07-04
JP2019134559 2019-07-22
JP2019134559 2019-07-22
PCT/JP2020/021811 WO2021002137A1 (ja) 2019-07-04 2020-06-02 Utterance analysis device, utterance analysis method, and program

Publications (3)

Publication Number Publication Date
JPWO2021002137A1 2021-01-07
JPWO2021002137A5 2022-06-02
JP7407190B2 (ja) 2023-12-28

Family

ID=74100168

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2021529930A Active JP7407190B2 (ja) 2019-07-04 2020-06-02 Utterance analysis device, utterance analysis method, and program
JP2021529929A Active JP7531164B2 (ja) 2019-07-04 2020-06-02 Utterance analysis device, utterance analysis method, and program

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2021529929A Active JP7531164B2 (ja) 2019-07-04 2020-06-02 Utterance analysis device, utterance analysis method, and program

Country Status (4)

Country Link
US (2) US12094464B2
JP (2) JP7407190B2
CN (2) CN114026557A
WO (2) WO2021002136A1

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4027247A4 (en) * 2019-09-02 2023-05-10 Imatrix Holdings Corp. TEXT ANALYSIS SYSTEM AND EVALUATION SYSTEM OF THE CHARACTERISTICS FOR MESSAGE EXCHANGE WITH THIS SYSTEM
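JP7341111B2 (ja) * 2020-09-30 2023-09-08 本田技研工業株式会社 Conversation support device, conversation support system, conversation support method, and program
JP7524784B2 (ja) * 2021-02-01 2024-07-30 オムロン株式会社 Information processing device, control system, and report output method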
US11893990B2 (en) * 2021-09-27 2024-02-06 Sap Se Audio file annotation

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
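JP2011123706A (ja) 2009-12-11 2011-06-23 Advanced Media Inc Text classification device and text classification method
JP2011221873A (ja) 2010-04-12 2011-11-04 Nippon Telegr & Teleph Corp <Ntt> Data classification method, device, and program
JP2013120547A (ja) 2011-12-08 2013-06-17 Nomura Research Institute Ltd Discourse summary template creation system and discourse summary template creation program
WO2016027364A1 (ja) 2014-08-22 2016-02-25 株式会社日立製作所 Topic cluster selection device and search method
WO2018110029A1 (ja) 2016-12-13 2018-06-21 株式会社東芝 Information processing device, information processing method, and information processing program
JP2018194980A (ja) 2017-05-15 2018-12-06 富士通株式会社 Determination program, determination method, and determination device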

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5329610U 1976-08-18 1978-03-14
JPS5468474U 1977-10-24 1979-05-15
US20080300872A1 (en) * 2007-05-31 2008-12-04 Microsoft Corporation Scalable summaries of audio or visual content
WO2009084554A1 (ja) * 2007-12-27 2009-07-09 Nec Corporation Text segmentation device, text segmentation method, and program
JP5468474B2 (ja) 2010-06-21 2014-04-09 株式会社野村総合研究所 Talk script usage status calculation system and talk script usage status calculation program
JP5329610B2 (ja) 2011-07-22 2013-10-30 みずほ情報総研株式会社 Explanation support system, explanation support method, and explanation support program
US8612211B1 (en) * 2012-09-10 2013-12-17 Google Inc. Speech recognition and summarization
US10057707B2 (en) * 2015-02-03 2018-08-21 Dolby Laboratories Licensing Corporation Optimized virtual scene layout for spatial meeting playback
JP2017016566A (ja) * 2015-07-06 2017-01-19 ソニー株式会社 Information processing device, information processing method, and program
JP6664072B2 (ja) * 2015-12-02 2020-03-13 パナソニックIpマネジメント株式会社 Search support method, search support device, and program
EP3809283A1 (en) * 2016-05-13 2021-04-21 Equals 3 LLC Searching structured and unstructured data sets
JP6718345B2 (ja) * 2016-09-21 2020-07-08 日本電信電話株式会社 Text analysis method, text analysis device, and program
JP6614589B2 (ja) 2018-05-09 2019-12-04 株式会社野村総合研究所 Compliance check system and compliance check program

Also Published As

Publication number Publication date
US20220108697A1 (en) 2022-04-07
JPWO2021002137A1 2021-01-07
JP7531164B2 (ja) 2024-08-09
WO2021002137A1 (ja) 2021-01-07
CN114072786A (zh) 2022-02-18
CN114026557A (zh) 2022-02-08
US12300226B2 (en) 2025-05-13
US20220114348A1 (en) 2022-04-14
US12094464B2 (en) 2024-09-17
WO2021002136A1 (ja) 2021-01-07
JPWO2021002136A1 2021-01-07

Similar Documents

Publication Publication Date Title
JP7407190B2 (ja) Utterance analysis device, utterance analysis method, and program
JP6755304B2 (ja) Information processing device
US11450311B2 (en) System and methods for accent and dialect modification
CN108630193B (zh) Speech recognition method and device
US10839788B2 (en) Systems and methods for selecting accent and dialect based on context
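JP6440967B2 (ja) Sentence-end symbol estimation device, method therefor, and program
CN107818798A (zh) Customer service quality evaluation method, apparatus, device, and storage medium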
US10592997B2 (en) Decision making support device and decision making support method
US20220392485A1 (en) System and Method For Identifying Sentiment (Emotions) In A Speech Audio Input
CN109313892A (zh) Robust language identification method and system
US11270691B2 (en) Voice interaction system, its processing method, and program therefor
JP2021009535A (ja) Sales talk navigation system, sales talk navigation method, and sales talk navigation program
EP4024395B1 (en) Speech analyser and related method
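JP2020034683A (ja) Speech recognition device, speech recognition program, and speech recognition method
WO2020196743A1 (ja) Evaluation system and evaluation method
JP2021124530A (ja) Information processing device, information processing method, and program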
Williamson et al. Estimating nonnegative matrix model activations with deep neural networks to increase perceptual speech quality
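CN113593523A (zh) Artificial-intelligence-based speech detection method, apparatus, and electronic device
CN116741143B (zh) Interaction method for personalized AI business cards based on digital avatars, and related components
CN115083412B (zh) Voice interaction method and related apparatus, electronic device, and storage medium
CN117219118A (zh) Audio quality inspection method and system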
Patel et al. Google duplex-a big leap in the evolution of artificial intelligence
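KR20230156482A (ko) Neural-network-based emotional state inference device and method for inferring emotional state from speech
JP2022082049A (ja) Utterance evaluation method and utterance evaluation device
JP2025138379A (ja) Speech analysis system and speech analysis method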

Legal Events

Date Code Title Description
A529 Written submission of copy of amendment under Article 34 PCT

Free format text: JAPANESE INTERMEDIATE CODE: A5211

Effective date: 20211227

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20230509

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230926

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20231120

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20231205

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20231218

R151 Written notification of patent or utility model registration

Ref document number: 7407190

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151