JP7407190B2 - 発話解析装置、発話解析方法及びプログラム - Google Patents
発話解析装置、発話解析方法及びプログラム Download PDFInfo
- Publication number
- JP7407190B2 JP7407190B2 JP2021529930A JP2021529930A JP7407190B2 JP 7407190 B2 JP7407190 B2 JP 7407190B2 JP 2021529930 A JP2021529930 A JP 2021529930A JP 2021529930 A JP2021529930 A JP 2021529930A JP 7407190 B2 JP7407190 B2 JP 7407190B2
- Authority
- JP
- Japan
- Prior art keywords
- data
- utterance
- category
- likelihood
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3343—Query execution using phonetics
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/44—Statistical methods, e.g. probability models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2019125454 | 2019-07-04 | ||
| JP2019125454 | 2019-07-04 | ||
| JP2019134559 | 2019-07-22 | ||
| JP2019134559 | 2019-07-22 | ||
| PCT/JP2020/021811 WO2021002137A1 (ja) | 2019-07-04 | 2020-06-02 | 発話解析装置、発話解析方法及びプログラム |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2021002137A1 JPWO2021002137A1 (https=) | 2021-01-07 |
| JPWO2021002137A5 JPWO2021002137A5 (https=) | 2022-06-02 |
| JP7407190B2 true JP7407190B2 (ja) | 2023-12-28 |
Family
ID=74100168
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021529930A Active JP7407190B2 (ja) | 2019-07-04 | 2020-06-02 | 発話解析装置、発話解析方法及びプログラム |
| JP2021529929A Active JP7531164B2 (ja) | 2019-07-04 | 2020-06-02 | 発話解析装置、発話解析方法及びプログラム |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021529929A Active JP7531164B2 (ja) | 2019-07-04 | 2020-06-02 | 発話解析装置、発話解析方法及びプログラム |
Country Status (4)
| Country | Link |
|---|---|
| US (2) | US12094464B2 (https=) |
| JP (2) | JP7407190B2 (https=) |
| CN (2) | CN114026557A (https=) |
| WO (2) | WO2021002136A1 (https=) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4027247A4 (en) * | 2019-09-02 | 2023-05-10 | Imatrix Holdings Corp. | TEXT ANALYSIS SYSTEM AND EVALUATION SYSTEM OF THE CHARACTERISTICS FOR MESSAGE EXCHANGE WITH THIS SYSTEM |
| JP7341111B2 (ja) * | 2020-09-30 | 2023-09-08 | 本田技研工業株式会社 | 会話支援装置、会話支援システム、会話支援方法およびプログラム |
| JP7524784B2 (ja) * | 2021-02-01 | 2024-07-30 | オムロン株式会社 | 情報処理装置、制御システムおよびレポート出力方法 |
| US11893990B2 (en) * | 2021-09-27 | 2024-02-06 | Sap Se | Audio file annotation |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2011123706A (ja) | 2009-12-11 | 2011-06-23 | Advanced Media Inc | 文章分類装置および文章分類方法 |
| JP2011221873A (ja) | 2010-04-12 | 2011-11-04 | Nippon Telegr & Teleph Corp <Ntt> | データ分類方法及び装置及びプログラム |
| JP2013120547A (ja) | 2011-12-08 | 2013-06-17 | Nomura Research Institute Ltd | 談話要約テンプレート作成システムおよび談話要約テンプレート作成プログラム |
| WO2016027364A1 (ja) | 2014-08-22 | 2016-02-25 | 株式会社日立製作所 | 話題クラスタ選択装置、及び検索方法 |
| WO2018110029A1 (ja) | 2016-12-13 | 2018-06-21 | 株式会社東芝 | 情報処理装置、情報処理方法、および情報処理プログラム |
| JP2018194980A (ja) | 2017-05-15 | 2018-12-06 | 富士通株式会社 | 判定プログラム、判定方法および判定装置 |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5329610U (https=) | 1976-08-18 | 1978-03-14 | ||
| JPS5468474U (https=) | 1977-10-24 | 1979-05-15 | ||
| US20080300872A1 (en) * | 2007-05-31 | 2008-12-04 | Microsoft Corporation | Scalable summaries of audio or visual content |
| WO2009084554A1 (ja) * | 2007-12-27 | 2009-07-09 | Nec Corporation | テキスト分割装置とテキスト分割方法およびプログラム |
| JP5468474B2 (ja) | 2010-06-21 | 2014-04-09 | 株式会社野村総合研究所 | トークスクリプト利用状況算出システムおよびトークスクリプト利用状況算出プログラム |
| JP5329610B2 (ja) | 2011-07-22 | 2013-10-30 | みずほ情報総研株式会社 | 説明支援システム、説明支援方法及び説明支援プログラム |
| US8612211B1 (en) * | 2012-09-10 | 2013-12-17 | Google Inc. | Speech recognition and summarization |
| US10057707B2 (en) * | 2015-02-03 | 2018-08-21 | Dolby Laboratories Licensing Corporation | Optimized virtual scene layout for spatial meeting playback |
| JP2017016566A (ja) * | 2015-07-06 | 2017-01-19 | ソニー株式会社 | 情報処理装置、情報処理方法及びプログラム |
| JP6664072B2 (ja) * | 2015-12-02 | 2020-03-13 | パナソニックIpマネジメント株式会社 | 探索支援方法、探索支援装置、及び、プログラム |
| EP3809283A1 (en) * | 2016-05-13 | 2021-04-21 | Equals 3 LLC | Searching structured and unstructured data sets |
| JP6718345B2 (ja) * | 2016-09-21 | 2020-07-08 | 日本電信電話株式会社 | テキスト分析方法、テキスト分析装置、及びプログラム |
| JP6614589B2 (ja) | 2018-05-09 | 2019-12-04 | 株式会社野村総合研究所 | コンプライアンスチェックシステムおよびコンプライアンスチェックプログラム |
-
2020
- 2020-06-02 CN CN202080046853.2A patent/CN114026557A/zh active Pending
- 2020-06-02 CN CN202080048836.2A patent/CN114072786A/zh active Pending
- 2020-06-02 WO PCT/JP2020/021809 patent/WO2021002136A1/ja not_active Ceased
- 2020-06-02 WO PCT/JP2020/021811 patent/WO2021002137A1/ja not_active Ceased
- 2020-06-02 JP JP2021529930A patent/JP7407190B2/ja active Active
- 2020-06-02 JP JP2021529929A patent/JP7531164B2/ja active Active
-
2021
- 2021-12-17 US US17/554,248 patent/US12094464B2/en active Active
- 2021-12-22 US US17/559,033 patent/US12300226B2/en active Active
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2011123706A (ja) | 2009-12-11 | 2011-06-23 | Advanced Media Inc | 文章分類装置および文章分類方法 |
| JP2011221873A (ja) | 2010-04-12 | 2011-11-04 | Nippon Telegr & Teleph Corp <Ntt> | データ分類方法及び装置及びプログラム |
| JP2013120547A (ja) | 2011-12-08 | 2013-06-17 | Nomura Research Institute Ltd | 談話要約テンプレート作成システムおよび談話要約テンプレート作成プログラム |
| WO2016027364A1 (ja) | 2014-08-22 | 2016-02-25 | 株式会社日立製作所 | 話題クラスタ選択装置、及び検索方法 |
| WO2018110029A1 (ja) | 2016-12-13 | 2018-06-21 | 株式会社東芝 | 情報処理装置、情報処理方法、および情報処理プログラム |
| JP2018194980A (ja) | 2017-05-15 | 2018-12-06 | 富士通株式会社 | 判定プログラム、判定方法および判定装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220108697A1 (en) | 2022-04-07 |
| JPWO2021002137A1 (https=) | 2021-01-07 |
| JP7531164B2 (ja) | 2024-08-09 |
| WO2021002137A1 (ja) | 2021-01-07 |
| CN114072786A (zh) | 2022-02-18 |
| CN114026557A (zh) | 2022-02-08 |
| US12300226B2 (en) | 2025-05-13 |
| US20220114348A1 (en) | 2022-04-14 |
| US12094464B2 (en) | 2024-09-17 |
| WO2021002136A1 (ja) | 2021-01-07 |
| JPWO2021002136A1 (https=) | 2021-01-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7407190B2 (ja) | 発話解析装置、発話解析方法及びプログラム | |
| JP6755304B2 (ja) | 情報処理装置 | |
| US11450311B2 (en) | System and methods for accent and dialect modification | |
| CN108630193B (zh) | 语音识别方法及装置 | |
| US10839788B2 (en) | Systems and methods for selecting accent and dialect based on context | |
| JP6440967B2 (ja) | 文末記号推定装置、この方法及びプログラム | |
| CN107818798A (zh) | 客服服务质量评价方法、装置、设备及存储介质 | |
| US10592997B2 (en) | Decision making support device and decision making support method | |
| US20220392485A1 (en) | System and Method For Identifying Sentiment (Emotions) In A Speech Audio Input | |
| CN109313892A (zh) | 稳健的语言识别方法和系统 | |
| US11270691B2 (en) | Voice interaction system, its processing method, and program therefor | |
| JP2021009535A (ja) | 営業トークナビゲーションシステム、営業トークナビゲーション方法および営業トークナビゲーション用プログラム | |
| EP4024395B1 (en) | Speech analyser and related method | |
| JP2020034683A (ja) | 音声認識装置、音声認識プログラムおよび音声認識方法 | |
| WO2020196743A1 (ja) | 評価システム及び評価方法 | |
| JP2021124530A (ja) | 情報処理装置、情報処理方法及びプログラム | |
| Williamson et al. | Estimating nonnegative matrix model activations with deep neural networks to increase perceptual speech quality | |
| CN113593523A (zh) | 基于人工智能的语音检测方法、装置及电子设备 | |
| CN116741143B (zh) | 基于数字分身的个性化ai名片的交互方法及相关组件 | |
| CN115083412B (zh) | 语音交互方法及相关装置、电子设备、存储介质 | |
| CN117219118A (zh) | 音频质检的方法及系统 | |
| Patel et al. | Google duplex-a big leap in the evolution of artificial intelligence | |
| KR20230156482A (ko) | 음성으로부터 감정 상태를 추론하는 신경망 기반의 감정 상태 추론 장치 및 방법 | |
| JP2022082049A (ja) | 発話評価方法および発話評価装置 | |
| JP2025138379A (ja) | 音声分析システム及び音声分析方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A529 | Written submission of copy of amendment under article 34 pct |
Free format text: JAPANESE INTERMEDIATE CODE: A5211 Effective date: 20211227 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20230509 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20230926 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20231120 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20231205 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20231218 |
|
| R151 | Written notification of patent or utility model registration |
Ref document number: 7407190 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R151 |