JP7416078B2 - 音声認識装置、音声認識方法、およびプログラム - Google Patents
音声認識装置、音声認識方法、およびプログラム Download PDFInfo
- Publication number
- JP7416078B2 JP7416078B2 JP2021548767A JP2021548767A JP7416078B2 JP 7416078 B2 JP7416078 B2 JP 7416078B2 JP 2021548767 A JP2021548767 A JP 2021548767A JP 2021548767 A JP2021548767 A JP 2021548767A JP 7416078 B2 JP7416078 B2 JP 7416078B2
- Authority
- JP
- Japan
- Prior art keywords
- voice
- user
- text information
- recognition
- target
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/242—Dictionaries
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/221—Announcement of recognition results
Landscapes
- Engineering & Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2019176484 | 2019-09-27 | ||
| JP2019176484 | 2019-09-27 | ||
| PCT/JP2020/033974 WO2021059968A1 (ja) | 2019-09-27 | 2020-09-08 | 音声認識装置、音声認識方法、およびプログラム |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2021059968A1 JPWO2021059968A1 (https=) | 2021-04-01 |
| JPWO2021059968A5 JPWO2021059968A5 (https=) | 2022-06-01 |
| JP7416078B2 true JP7416078B2 (ja) | 2024-01-17 |
Family
ID=75166092
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021548767A Active JP7416078B2 (ja) | 2019-09-27 | 2020-09-08 | 音声認識装置、音声認識方法、およびプログラム |
Country Status (3)
| Country | Link |
|---|---|
| US (2) | US20220335951A1 (https=) |
| JP (1) | JP7416078B2 (https=) |
| WO (1) | WO2021059968A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7288530B1 (ja) | 2022-03-09 | 2023-06-07 | 陸 荒川 | システムおよびプログラム |
| WO2025191650A1 (ja) * | 2024-03-11 | 2025-09-18 | ファナック株式会社 | 音声コマンド作成装置、及びコンピュータが読み取り可能な記憶媒体 |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003345379A (ja) | 2002-03-20 | 2003-12-03 | Japan Science & Technology Corp | 音声映像変換装置及び方法、音声映像変換プログラム |
| JP2004170765A (ja) | 2002-11-21 | 2004-06-17 | Sony Corp | 音声処理装置および方法、記録媒体並びにプログラム |
| JP2010197669A (ja) | 2009-02-25 | 2010-09-09 | Kyocera Corp | 携帯端末、編集誘導プログラムおよび編集装置 |
| JP2013182261A (ja) | 2012-03-05 | 2013-09-12 | Nippon Hoso Kyokai <Nhk> | 適応化装置、音声認識装置、およびそのプログラム |
| JP2014240940A (ja) | 2013-06-12 | 2014-12-25 | 株式会社東芝 | 書き起こし支援装置、方法、及びプログラム |
| JP2015184564A (ja) | 2014-03-25 | 2015-10-22 | 株式会社アドバンスト・メディア | 音声書起支援システム、サーバ、装置、方法及びプログラム |
| WO2017068826A1 (ja) | 2015-10-23 | 2017-04-27 | ソニー株式会社 | 情報処理装置、情報処理方法、およびプログラム |
| JP2017161726A (ja) | 2016-03-09 | 2017-09-14 | 株式会社アドバンスト・メディア | 情報処理装置、情報処理システム、サーバ、端末装置、情報処理方法及びプログラム |
-
2020
- 2020-09-08 WO PCT/JP2020/033974 patent/WO2021059968A1/ja not_active Ceased
- 2020-09-08 US US17/760,847 patent/US20220335951A1/en not_active Abandoned
- 2020-09-08 JP JP2021548767A patent/JP7416078B2/ja active Active
-
2025
- 2025-09-10 US US19/324,688 patent/US20260011333A1/en active Pending
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003345379A (ja) | 2002-03-20 | 2003-12-03 | Japan Science & Technology Corp | 音声映像変換装置及び方法、音声映像変換プログラム |
| JP2004170765A (ja) | 2002-11-21 | 2004-06-17 | Sony Corp | 音声処理装置および方法、記録媒体並びにプログラム |
| JP2010197669A (ja) | 2009-02-25 | 2010-09-09 | Kyocera Corp | 携帯端末、編集誘導プログラムおよび編集装置 |
| JP2013182261A (ja) | 2012-03-05 | 2013-09-12 | Nippon Hoso Kyokai <Nhk> | 適応化装置、音声認識装置、およびそのプログラム |
| JP2014240940A (ja) | 2013-06-12 | 2014-12-25 | 株式会社東芝 | 書き起こし支援装置、方法、及びプログラム |
| JP2015184564A (ja) | 2014-03-25 | 2015-10-22 | 株式会社アドバンスト・メディア | 音声書起支援システム、サーバ、装置、方法及びプログラム |
| WO2017068826A1 (ja) | 2015-10-23 | 2017-04-27 | ソニー株式会社 | 情報処理装置、情報処理方法、およびプログラム |
| JP2017161726A (ja) | 2016-03-09 | 2017-09-14 | 株式会社アドバンスト・メディア | 情報処理装置、情報処理システム、サーバ、端末装置、情報処理方法及びプログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| JPWO2021059968A1 (https=) | 2021-04-01 |
| US20260011333A1 (en) | 2026-01-08 |
| WO2021059968A1 (ja) | 2021-04-01 |
| US20220335951A1 (en) | 2022-10-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12198675B2 (en) | Electronic apparatus and method for controlling thereof | |
| US8738375B2 (en) | System and method for optimizing speech recognition and natural language parameters with user feedback | |
| US9984679B2 (en) | System and method for optimizing speech recognition and natural language parameters with user feedback | |
| KR102725749B1 (ko) | 자동 음성 인식을 위한 컨텍스트 비정규화 | |
| JP5787780B2 (ja) | 書き起こし支援システムおよび書き起こし支援方法 | |
| US20260011333A1 (en) | Speech recognition device, speech recognition method, and program | |
| JP2016062357A (ja) | 音声翻訳装置、方法およびプログラム | |
| CN101253549A (zh) | 将声音和人工转录文本进行同步的系统和方法 | |
| CN115668358A (zh) | 用于文本到语音合成的用户接口适应的方法和系统 | |
| WO2014136534A1 (ja) | 理解支援システム、理解支援サーバ、理解支援方法、及びコンピュータ読み取り可能な記録媒体 | |
| JP2014240940A (ja) | 書き起こし支援装置、方法、及びプログラム | |
| KR20210043341A (ko) | 인공지능 대화 서비스 생성 방법 및 장치 | |
| JPWO2018043138A1 (ja) | 情報処理装置および情報処理方法、並びにプログラム | |
| JP2013025299A (ja) | 書き起こし支援システムおよび書き起こし支援方法 | |
| WO2024178262A1 (en) | Personalized aphasia communication assistant system | |
| JP2013025763A (ja) | 書き起こし支援システムおよび書き起こし支援方法 | |
| KR20250051049A (ko) | 상호작용형 음성 응답 시스템 내에서 사용자 상호작용 세션을 최적화하는 시스템 및 방법 | |
| JP4354299B2 (ja) | 事例検索プログラム、事例検索方法及び事例検索装置 | |
| JP2014134640A (ja) | 文字起こし装置およびプログラム | |
| WO2026041000A1 (zh) | 外语教学视频生成方法、生成装置 | |
| JP2021009253A (ja) | プログラム、情報処理装置、及び情報処理方法 | |
| JP2023007014A (ja) | 応答システム、応答方法、および応答プログラム | |
| KR101501705B1 (ko) | 음성 데이터를 이용한 문서 생성 장치, 방법 및 컴퓨터 판독 가능 기록 매체 | |
| KR101830210B1 (ko) | 적어도 하나의 의미론적 유닛의 집합을 개선하기 위한 방법, 장치 및 컴퓨터 판독 가능한 기록 매체 | |
| US20260120684A1 (en) | Personalized aphasia communication assistant system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20220324 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20220324 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20230214 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20230412 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20230725 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20230825 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20231205 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20231218 |
|
| R151 | Written notification of patent or utility model registration |
Ref document number: 7416078 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R151 |