CN114026557A - Utterance analysis device, utterance analysis method, and program - Google Patents
Utterance analysis device, utterance analysis method, and program
- Publication number
- CN114026557A
- Authority
- CN
- China
- Prior art keywords
- data
- speech
- likelihood
- category
- unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3343—Query execution using phonetics
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/44—Statistical methods, e.g. probability models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2019-125454 | 2019-07-04 | | |
| JP2019125454 | 2019-07-04 | | |
| JP2019-134559 | 2019-07-22 | | |
| JP2019134559 | 2019-07-22 | | |
| PCT/JP2020/021811 WO2021002137A1 (ja) | 2019-07-04 | 2020-06-02 | Utterance analysis device, utterance analysis method, and program |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN114026557A (zh) | 2022-02-08 |
Family
ID=74100168
Family Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202080046853.2A Pending CN114026557A (zh) | 2019-07-04 | 2020-06-02 | Utterance analysis device, utterance analysis method, and program |
| CN202080048836.2A Pending CN114072786A (zh) | 2019-07-04 | 2020-06-02 | Utterance analysis device, utterance analysis method, and program |
Family Applications After (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202080048836.2A Pending CN114072786A (zh) | 2019-07-04 | 2020-06-02 | Utterance analysis device, utterance analysis method, and program |
Country Status (4)
| Country | Link |
|---|---|
| US (2) | US12094464B2 |
| JP (2) | JP7407190B2 |
| CN (2) | CN114026557A |
| WO (2) | WO2021002136A1 |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4027247A4 (en) * | 2019-09-02 | 2023-05-10 | Imatrix Holdings Corp. | TEXT ANALYSIS SYSTEM AND EVALUATION SYSTEM OF THE CHARACTERISTICS FOR MESSAGE EXCHANGE WITH THIS SYSTEM |
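| JP7341111B2 (ja) * | 2020-09-30 | 2023-09-08 | 本田技研工業株式会社 | Conversation support device, conversation support system, conversation support method, and program |
| JP7524784B2 (ja) * | 2021-02-01 | 2024-07-30 | オムロン株式会社 | Information processing device, control system, and report output method |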
| US11893990B2 (en) * | 2021-09-27 | 2024-02-06 | Sap Se | Audio file annotation |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5329610U | 1976-08-18 | 1978-03-14 | | |
| JPS5468474U | 1977-10-24 | 1979-05-15 | | |
| US20080300872A1 (en) * | 2007-05-31 | 2008-12-04 | Microsoft Corporation | Scalable summaries of audio or visual content |
| WO2009084554A1 (ja) * | 2007-12-27 | 2009-07-09 | Nec Corporation | Text segmentation device, text segmentation method, and program |
| JP5427581B2 (ja) * | 2009-12-11 | 2014-02-26 | 株式会社アドバンスト・メディア | Text classification device and text classification method |
| JP2011221873A (ja) * | 2010-04-12 | 2011-11-04 | Nippon Telegr & Teleph Corp <Ntt> | Data classification method, device, and program |
| JP5468474B2 (ja) | 2010-06-21 | 2014-04-09 | 株式会社野村総合研究所 | Talk script usage status calculation system and talk script usage status calculation program |
| JP5329610B2 (ja) | 2011-07-22 | 2013-10-30 | みずほ情報総研株式会社 | Explanation support system, explanation support method, and explanation support program |
| JP5774459B2 (ja) * | 2011-12-08 | 2015-09-09 | 株式会社野村総合研究所 | Discourse summary template creation system and discourse summary template creation program |
| US8612211B1 (en) * | 2012-09-10 | 2013-12-17 | Google Inc. | Speech recognition and summarization |
| WO2016027364A1 (ja) * | 2014-08-22 | 2016-02-25 | 株式会社日立製作所 | Topic cluster selection device and search method |
| US10057707B2 (en) * | 2015-02-03 | 2018-08-21 | Dolby Laboratories Licensing Corporation | Optimized virtual scene layout for spatial meeting playback |
| JP2017016566A (ja) * | 2015-07-06 | 2017-01-19 | ソニー株式会社 | Information processing device, information processing method, and program |
| JP6664072B2 (ja) * | 2015-12-02 | 2020-03-13 | パナソニックIpマネジメント株式会社 | Search support method, search support device, and program |
| EP3809283A1 (en) * | 2016-05-13 | 2021-04-21 | Equals 3 LLC | Searching structured and unstructured data sets |
| JP6718345B2 (ja) * | 2016-09-21 | 2020-07-08 | 日本電信電話株式会社 | Text analysis method, text analysis device, and program |
| JP6815184B2 (ja) * | 2016-12-13 | 2021-01-20 | 株式会社東芝 | Information processing device, information processing method, and information processing program |
| JP2018194980A (ja) * | 2017-05-15 | 2018-12-06 | 富士通株式会社 | Determination program, determination method, and determination device |
| JP6614589B2 (ja) | 2018-05-09 | 2019-12-04 | 株式会社野村総合研究所 | Compliance check system and compliance check program |
2020
- 2020-06-02: CN application CN202080046853.2A, published as CN114026557A (zh), status: Pending
- 2020-06-02: CN application CN202080048836.2A, published as CN114072786A (zh), status: Pending
- 2020-06-02: WO application PCT/JP2020/021809, published as WO2021002136A1 (ja), status: Ceased
- 2020-06-02: WO application PCT/JP2020/021811, published as WO2021002137A1 (ja), status: Ceased
- 2020-06-02: JP application JP2021529930A, patent JP7407190B2 (ja), status: Active
- 2020-06-02: JP application JP2021529929A, patent JP7531164B2 (ja), status: Active
2021
- 2021-12-17: US application US17/554,248, patent US12094464B2 (en), status: Active
- 2021-12-22: US application US17/559,033, patent US12300226B2 (en), status: Active
Also Published As
| Publication number | Publication date |
|---|---|
| JP7407190B2 (ja) | 2023-12-28 |
| US20220108697A1 (en) | 2022-04-07 |
| JPWO2021002137A1 | 2021-01-07 |
| JP7531164B2 (ja) | 2024-08-09 |
| WO2021002137A1 (ja) | 2021-01-07 |
| CN114072786A (zh) | 2022-02-18 |
| US12300226B2 (en) | 2025-05-13 |
| US20220114348A1 (en) | 2022-04-14 |
| US12094464B2 (en) | 2024-09-17 |
| WO2021002136A1 (ja) | 2021-01-07 |
| JPWO2021002136A1 | 2021-01-07 |
Similar Documents
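| Publication | Publication Date | Title |
|---|---|---|
| US11455985B2 (en) | | Information processing apparatus |
| CN114026557A (zh) | | Utterance analysis device, utterance analysis method, and program |
| US9293133B2 (en) | | Improving voice communication over a network |
| JP7255032B2 (ja) | | Speech recognition |
| US11183180B2 (en) | | Speech recognition apparatus, speech recognition method, and a recording medium performing a suppression process for categories of noise |
| JP7287006B2 (ja) | | Speaker determination device, speaker determination method, and control program for speaker determination device |
| JP2017161731A (ja) | | Conversation analysis device, conversation analysis method, and program |
| JP5196199B2 (ja) | | Keyword display system, keyword display method, and program |
| WO2020196743A1 (ja) | | Evaluation system and evaluation method |
| JP7394192B2 (ja) | | Speech processing device, speech processing method, and program |
| JP7040593B2 (ja) | | Customer service support device, customer service support method, and customer service support program |
| JP5803125B2 (ja) | | Voice-based suppressed state detection device and program |
| JP2020160425A (ja) | | Evaluation system, evaluation method, and computer program |
| JP4587854B2 (ja) | | Emotion analysis device, emotion analysis program, and program storage medium |
| JP6736225B2 (ja) | | Dialogue device, control method for dialogue device, and program |
| CN112101046B (zh) | | Conversation analysis method, device, and system based on call behavior |
| Lykartsis et al. | | Prediction of dialogue success with spectral and rhythm acoustic features using DNNs and SVMs |
| US10505879B2 (en) | | Communication support device, communication support method, and computer program product |
| JP2004021028A (ja) | | Spoken dialogue device and spoken dialogue program |
| JP7681214B1 (ja) | | Information processing method, program, and information processing system |
| CN119132319B (zh) | | Cloned voice generation method, and cloned voice application method and device |
| KR102960377B1 (ko) | | Selection among multiple automated assistants based on invocation attributes |
| JP7110057B2 (ja) | | Speech recognition system |
| JP2002268683A (ja) | | Information processing method and device |
| JP2025138379A (ja) | | Speech analysis system and speech analysis method |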
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20220208 |