JP7416078B2 - 音声認識装置、音声認識方法、およびプログラム - Google Patents

音声認識装置、音声認識方法、およびプログラム Download PDF

Info

Publication number
JP7416078B2
JP7416078B2 JP2021548767A JP2021548767A JP7416078B2 JP 7416078 B2 JP7416078 B2 JP 7416078B2 JP 2021548767 A JP2021548767 A JP 2021548767A JP 2021548767 A JP2021548767 A JP 2021548767A JP 7416078 B2 JP7416078 B2 JP 7416078B2
Authority
JP
Japan
Prior art keywords
voice
user
text information
recognition
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021548767A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2021059968A1 (https=
JPWO2021059968A5 (https=
Inventor
秀治 古明地
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of JPWO2021059968A1 publication Critical patent/JPWO2021059968A1/ja
Publication of JPWO2021059968A5 publication Critical patent/JPWO2021059968A5/ja
Application granted granted Critical
Publication of JP7416078B2 publication Critical patent/JP7416078B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
JP2021548767A 2019-09-27 2020-09-08 音声認識装置、音声認識方法、およびプログラム Active JP7416078B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2019176484 2019-09-27
JP2019176484 2019-09-27
PCT/JP2020/033974 WO2021059968A1 (ja) 2019-09-27 2020-09-08 音声認識装置、音声認識方法、およびプログラム

Publications (3)

Publication Number Publication Date
JPWO2021059968A1 JPWO2021059968A1 (https=) 2021-04-01
JPWO2021059968A5 JPWO2021059968A5 (https=) 2022-06-01
JP7416078B2 true JP7416078B2 (ja) 2024-01-17

Family

ID=75166092

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021548767A Active JP7416078B2 (ja) 2019-09-27 2020-09-08 音声認識装置、音声認識方法、およびプログラム

Country Status (3)

Country Link
US (2) US20220335951A1 (https=)
JP (1) JP7416078B2 (https=)
WO (1) WO2021059968A1 (https=)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7288530B1 (ja) 2022-03-09 2023-06-07 陸 荒川 システムおよびプログラム
WO2025191650A1 (ja) * 2024-03-11 2025-09-18 ファナック株式会社 音声コマンド作成装置、及びコンピュータが読み取り可能な記憶媒体

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003345379A (ja) 2002-03-20 2003-12-03 Japan Science & Technology Corp 音声映像変換装置及び方法、音声映像変換プログラム
JP2004170765A (ja) 2002-11-21 2004-06-17 Sony Corp 音声処理装置および方法、記録媒体並びにプログラム
JP2010197669A (ja) 2009-02-25 2010-09-09 Kyocera Corp 携帯端末、編集誘導プログラムおよび編集装置
JP2013182261A (ja) 2012-03-05 2013-09-12 Nippon Hoso Kyokai <Nhk> 適応化装置、音声認識装置、およびそのプログラム
JP2014240940A (ja) 2013-06-12 2014-12-25 株式会社東芝 書き起こし支援装置、方法、及びプログラム
JP2015184564A (ja) 2014-03-25 2015-10-22 株式会社アドバンスト・メディア 音声書起支援システム、サーバ、装置、方法及びプログラム
WO2017068826A1 (ja) 2015-10-23 2017-04-27 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム
JP2017161726A (ja) 2016-03-09 2017-09-14 株式会社アドバンスト・メディア 情報処理装置、情報処理システム、サーバ、端末装置、情報処理方法及びプログラム

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003345379A (ja) 2002-03-20 2003-12-03 Japan Science & Technology Corp 音声映像変換装置及び方法、音声映像変換プログラム
JP2004170765A (ja) 2002-11-21 2004-06-17 Sony Corp 音声処理装置および方法、記録媒体並びにプログラム
JP2010197669A (ja) 2009-02-25 2010-09-09 Kyocera Corp 携帯端末、編集誘導プログラムおよび編集装置
JP2013182261A (ja) 2012-03-05 2013-09-12 Nippon Hoso Kyokai <Nhk> 適応化装置、音声認識装置、およびそのプログラム
JP2014240940A (ja) 2013-06-12 2014-12-25 株式会社東芝 書き起こし支援装置、方法、及びプログラム
JP2015184564A (ja) 2014-03-25 2015-10-22 株式会社アドバンスト・メディア 音声書起支援システム、サーバ、装置、方法及びプログラム
WO2017068826A1 (ja) 2015-10-23 2017-04-27 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム
JP2017161726A (ja) 2016-03-09 2017-09-14 株式会社アドバンスト・メディア 情報処理装置、情報処理システム、サーバ、端末装置、情報処理方法及びプログラム

Also Published As

Publication number Publication date
JPWO2021059968A1 (https=) 2021-04-01
US20260011333A1 (en) 2026-01-08
WO2021059968A1 (ja) 2021-04-01
US20220335951A1 (en) 2022-10-20

Similar Documents

Publication Publication Date Title
US12198675B2 (en) Electronic apparatus and method for controlling thereof
US8738375B2 (en) System and method for optimizing speech recognition and natural language parameters with user feedback
US9984679B2 (en) System and method for optimizing speech recognition and natural language parameters with user feedback
KR102725749B1 (ko) 자동 음성 인식을 위한 컨텍스트 비정규화
JP5787780B2 (ja) 書き起こし支援システムおよび書き起こし支援方法
US20260011333A1 (en) Speech recognition device, speech recognition method, and program
JP2016062357A (ja) 音声翻訳装置、方法およびプログラム
CN101253549A (zh) 将声音和人工转录文本进行同步的系统和方法
CN115668358A (zh) 用于文本到语音合成的用户接口适应的方法和系统
WO2014136534A1 (ja) 理解支援システム、理解支援サーバ、理解支援方法、及びコンピュータ読み取り可能な記録媒体
JP2014240940A (ja) 書き起こし支援装置、方法、及びプログラム
KR20210043341A (ko) 인공지능 대화 서비스 생성 방법 및 장치
JPWO2018043138A1 (ja) 情報処理装置および情報処理方法、並びにプログラム
JP2013025299A (ja) 書き起こし支援システムおよび書き起こし支援方法
WO2024178262A1 (en) Personalized aphasia communication assistant system
JP2013025763A (ja) 書き起こし支援システムおよび書き起こし支援方法
KR20250051049A (ko) 상호작용형 음성 응답 시스템 내에서 사용자 상호작용 세션을 최적화하는 시스템 및 방법
JP4354299B2 (ja) 事例検索プログラム、事例検索方法及び事例検索装置
JP2014134640A (ja) 文字起こし装置およびプログラム
WO2026041000A1 (zh) 外语教学视频生成方法、生成装置
JP2021009253A (ja) プログラム、情報処理装置、及び情報処理方法
JP2023007014A (ja) 応答システム、応答方法、および応答プログラム
KR101501705B1 (ko) 음성 데이터를 이용한 문서 생성 장치, 방법 및 컴퓨터 판독 가능 기록 매체
KR101830210B1 (ko) 적어도 하나의 의미론적 유닛의 집합을 개선하기 위한 방법, 장치 및 컴퓨터 판독 가능한 기록 매체
US20260120684A1 (en) Personalized aphasia communication assistant system

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220324

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20220324

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230214

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230412

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230725

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230825

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20231205

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20231218

R151 Written notification of patent or utility model registration

Ref document number: 7416078

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151