JP7567940B2 - 学習方法、学習システム及び学習プログラム - Google Patents

学習方法、学習システム及び学習プログラム Download PDF

Info

Publication number
JP7567940B2
JP7567940B2 JP2022575008A JP2022575008A JP7567940B2 JP 7567940 B2 JP7567940 B2 JP 7567940B2 JP 2022575008 A JP2022575008 A JP 2022575008A JP 2022575008 A JP2022575008 A JP 2022575008A JP 7567940 B2 JP7567940 B2 JP 7567940B2
Authority
JP
Japan
Prior art keywords
learning
noise
data
unit
acoustic model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022575008A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2022153504A1 (https=
Inventor
歩相名 神山
義和 山口
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NTT Inc
NTT Inc USA
Original Assignee
Nippon Telegraph and Telephone Corp
NTT Inc USA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp, NTT Inc USA filed Critical Nippon Telegraph and Telephone Corp
Publication of JPWO2022153504A1 publication Critical patent/JPWO2022153504A1/ja
Application granted granted Critical
Publication of JP7567940B2 publication Critical patent/JP7567940B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrically Operated Instructional Devices (AREA)
JP2022575008A 2021-01-15 2021-01-15 学習方法、学習システム及び学習プログラム Active JP7567940B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2021/001354 WO2022153504A1 (ja) 2021-01-15 2021-01-15 学習方法、学習システム及び学習プログラム

Publications (2)

Publication Number Publication Date
JPWO2022153504A1 JPWO2022153504A1 (https=) 2022-07-21
JP7567940B2 true JP7567940B2 (ja) 2024-10-16

Family

ID=82448146

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022575008A Active JP7567940B2 (ja) 2021-01-15 2021-01-15 学習方法、学習システム及び学習プログラム

Country Status (3)

Country Link
US (1) US20240078999A1 (https=)
JP (1) JP7567940B2 (https=)
WO (1) WO2022153504A1 (https=)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004252167A (ja) 2003-02-20 2004-09-09 Nippon Telegr & Teleph Corp <Ntt> 音素モデル学習用文リスト生成方法、生成装置、および生成プログラム
WO2011052412A1 (ja) 2009-10-28 2011-05-05 日本電気株式会社 音声認識システム、音声認識要求装置、音声認識方法、音声認識用プログラムおよび記録媒体
JP2011248001A (ja) 2010-05-25 2011-12-08 Nippon Telegr & Teleph Corp <Ntt> 音響モデル学習用ラベル作成装置、その方法及びプログラム
JP2015225296A (ja) 2014-05-29 2015-12-14 富士通株式会社 音響モデル調整装置及びプログラム
JP2016212273A (ja) 2015-05-11 2016-12-15 国立研究開発法人情報通信研究機構 リカレント型ニューラルネットワークの学習方法及びそのためのコンピュータプログラム、並びに音声認識装置

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5461696A (en) * 1992-10-28 1995-10-24 Motorola, Inc. Decision directed adaptive neural network
US10403269B2 (en) * 2015-03-27 2019-09-03 Google Llc Processing audio waveforms
US10339958B2 (en) * 2015-09-09 2019-07-02 Arris Enterprises Llc In-home legacy device onboarding and privacy enhanced monitoring
JP2019078857A (ja) * 2017-10-24 2019-05-23 国立研究開発法人情報通信研究機構 音響モデルの学習方法及びコンピュータプログラム
JP7010905B2 (ja) * 2019-09-05 2022-01-26 Nttテクノクロス株式会社 情報処理装置、情報処理方法及びプログラム
US10621378B1 (en) * 2019-10-24 2020-04-14 Deeping Source Inc. Method for learning and testing user learning network to be used for recognizing obfuscated data created by concealing original data to protect personal information and learning device and testing device using the same

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004252167A (ja) 2003-02-20 2004-09-09 Nippon Telegr & Teleph Corp <Ntt> 音素モデル学習用文リスト生成方法、生成装置、および生成プログラム
WO2011052412A1 (ja) 2009-10-28 2011-05-05 日本電気株式会社 音声認識システム、音声認識要求装置、音声認識方法、音声認識用プログラムおよび記録媒体
JP2011248001A (ja) 2010-05-25 2011-12-08 Nippon Telegr & Teleph Corp <Ntt> 音響モデル学習用ラベル作成装置、その方法及びプログラム
JP2015225296A (ja) 2014-05-29 2015-12-14 富士通株式会社 音響モデル調整装置及びプログラム
JP2016212273A (ja) 2015-05-11 2016-12-15 国立研究開発法人情報通信研究機構 リカレント型ニューラルネットワークの学習方法及びそのためのコンピュータプログラム、並びに音声認識装置

Also Published As

Publication number Publication date
JPWO2022153504A1 (https=) 2022-07-21
US20240078999A1 (en) 2024-03-07
WO2022153504A1 (ja) 2022-07-21

Similar Documents

Publication Publication Date Title
EP3095113B1 (en) Digital personal assistant interaction with impersonations and rich multimedia in responses
JP6207733B2 (ja) 人工知的エージェントまたはシステムを作成および実装するためのシステムおよび方法
JP2020034998A (ja) 拡張装置、拡張方法及び拡張プログラム
CN114743539A (zh) 语音合成方法、装置、设备及存储介质
CN110929114A (zh) 利用动态记忆网络来跟踪数字对话状态并生成响应
US20210217418A1 (en) Methods and systems for facilitating accomplishing tasks based on a natural language conversation
CN111797220A (zh) 对话生成方法、装置、计算机设备和存储介质
CN105845130A (zh) 用于语音识别的声学模型训练方法及装置
EP4220475A1 (en) Dialog flow inference based on weighted finite state automata
CN116959408A (zh) 基于扩散过程的文本转语音模型的构建方法及应用
JP6325762B1 (ja) 情報処理装置、情報処理方法、および情報処理プログラム
JP7567940B2 (ja) 学習方法、学習システム及び学習プログラム
CN110582763A (zh) 用于利用个人的属性信息的集合的计算机系统、服务器装置及程序
WO2023084761A1 (ja) 情報処理装置、情報処理方法及び情報処理プログラム
CN118981537A (zh) 文本处理方法及装置
CN113850071B (zh) 一种文本规整方法、装置、设备及存储介质
WO2024023946A1 (ja) 音声処理装置、音声処理方法及び音声処理プログラム
CN115795028A (zh) 一种公文智能生成方法及系统
CN114969287A (zh) 文档搜索方法、装置、设备及计算机可读存储介质
JP7671003B2 (ja) 学習装置、学習方法およびプログラム
WO2023195105A1 (ja) 付与装置、付与方法および付与プログラム
JP2022113066A (ja) 支援装置、支援方法及び支援プログラム
WO2024241548A1 (ja) 情報処理装置、情報処理方法、および、情報処理プログラム
WO2023152914A1 (ja) 埋め込み装置、埋め込み方法、および、埋め込みプログラム
CN118917436A (zh) 基于pre-agi的多轮对话交互引擎训练方法及系统

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20230517

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20240604

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240801

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20240903

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20240916

R150 Certificate of patent or registration of utility model

Ref document number: 7567940

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

S533 Written request for registration of change of name

Free format text: JAPANESE INTERMEDIATE CODE: R313533

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350