JP2006113570A5 - - Google Patents

Download PDF

Info

Publication number
JP2006113570A5
JP2006113570A5 JP2005268550A JP2005268550A JP2006113570A5 JP 2006113570 A5 JP2006113570 A5 JP 2006113570A5 JP 2005268550 A JP2005268550 A JP 2005268550A JP 2005268550 A JP2005268550 A JP 2005268550A JP 2006113570 A5 JP2006113570 A5 JP 2006113570A5
Authority
JP
Japan
Prior art keywords
value
hidden
feature
hidden state
determining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2005268550A
Other languages
English (en)
Japanese (ja)
Other versions
JP2006113570A (ja
JP5072206B2 (ja
Filing date
Publication date
Priority claimed from US10/966,047 external-priority patent/US7627473B2/en
Application filed filed Critical
Publication of JP2006113570A publication Critical patent/JP2006113570A/ja
Publication of JP2006113570A5 publication Critical patent/JP2006113570A5/ja
Application granted granted Critical
Publication of JP5072206B2 publication Critical patent/JP5072206B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

JP2005268550A 2004-10-15 2005-09-15 音声分類および音声認識のための隠れ条件付確率場モデル Expired - Fee Related JP5072206B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/966,047 2004-10-15
US10/966,047 US7627473B2 (en) 2004-10-15 2004-10-15 Hidden conditional random field models for phonetic classification and speech recognition

Publications (3)

Publication Number Publication Date
JP2006113570A JP2006113570A (ja) 2006-04-27
JP2006113570A5 true JP2006113570A5 (enExample) 2008-10-30
JP5072206B2 JP5072206B2 (ja) 2012-11-14

Family

ID=35520793

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005268550A Expired - Fee Related JP5072206B2 (ja) 2004-10-15 2005-09-15 音声分類および音声認識のための隠れ条件付確率場モデル

Country Status (7)

Country Link
US (1) US7627473B2 (enExample)
EP (1) EP1647970B1 (enExample)
JP (1) JP5072206B2 (enExample)
KR (1) KR101153078B1 (enExample)
CN (1) CN1760974B (enExample)
AT (1) ATE487212T1 (enExample)
DE (1) DE602005024497D1 (enExample)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8751226B2 (en) 2006-06-29 2014-06-10 Nec Corporation Learning a verification model for speech recognition based on extracted recognition and language feature information
KR100774800B1 (ko) * 2006-09-06 2007-11-07 한국정보통신대학교 산학협력단 포아송 폴링 기법을 이용한 세그먼트 단위의 음성/비음성분류 방법 및 장치
WO2008105263A1 (ja) * 2007-02-28 2008-09-04 Nec Corporation 重み係数学習システム及び音声認識システム
US7509163B1 (en) * 2007-09-28 2009-03-24 International Business Machines Corporation Method and system for subject-adaptive real-time sleep stage classification
KR101230183B1 (ko) * 2008-07-14 2013-02-15 광운대학교 산학협력단 오디오 신호의 상태결정 장치
US20100076978A1 (en) * 2008-09-09 2010-03-25 Microsoft Corporation Summarizing online forums into question-context-answer triples
US8140328B2 (en) * 2008-12-01 2012-03-20 At&T Intellectual Property I, L.P. User intention based on N-best list of recognition hypotheses for utterances in a dialog
US8306806B2 (en) * 2008-12-02 2012-11-06 Microsoft Corporation Adaptive web mining of bilingual lexicon
US8473430B2 (en) * 2010-01-29 2013-06-25 Microsoft Corporation Deep-structured conditional random fields for sequential labeling and classification
US9355683B2 (en) 2010-07-30 2016-05-31 Samsung Electronics Co., Ltd. Audio playing method and apparatus
US9031844B2 (en) 2010-09-21 2015-05-12 Microsoft Technology Licensing, Llc Full-sequence training of deep structures for speech recognition
US9164983B2 (en) 2011-05-27 2015-10-20 Robert Bosch Gmbh Broad-coverage normalization system for social media language
CN104933048B (zh) * 2014-03-17 2018-08-31 联想(北京)有限公司 一种语音信息处理方法、装置和电子设备
US9785891B2 (en) * 2014-12-09 2017-10-10 Conduent Business Services, Llc Multi-task conditional random field models for sequence labeling
CN104700833A (zh) * 2014-12-29 2015-06-10 芜湖乐锐思信息咨询有限公司 一种大数据语音分类方法
US9875736B2 (en) 2015-02-19 2018-01-23 Microsoft Technology Licensing, Llc Pre-training and/or transfer learning for sequence taggers
US11030407B2 (en) * 2016-01-28 2021-06-08 Rakuten, Inc. Computer system, method and program for performing multilingual named entity recognition model transfer
US10311863B2 (en) * 2016-09-02 2019-06-04 Disney Enterprises, Inc. Classifying segments of speech based on acoustic features and context
CN109829164B (zh) * 2019-02-01 2020-05-22 北京字节跳动网络技术有限公司 用于生成文本的方法和装置
CN110826320B (zh) * 2019-11-28 2023-10-13 上海观安信息技术股份有限公司 一种基于文本识别的敏感数据发现方法及系统

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3285047B2 (ja) * 1992-09-04 2002-05-27 日本電信電話株式会社 不特定話者用音声認識装置
JPH06266389A (ja) * 1993-03-10 1994-09-22 N T T Data Tsushin Kk 音素ラベリング装置
JPH0990975A (ja) * 1995-09-22 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> パターン認識のためのモデル学習方法
US6505152B1 (en) * 1999-09-03 2003-01-07 Microsoft Corporation Method and apparatus for using formant models in speech systems

Similar Documents

Publication Publication Date Title
JP2006113570A5 (enExample)
Peng et al. A Study on Fine-Tuning wav2vec2. 0 Model for the Task of Mispronunciation Detection and Diagnosis.
CN109697974B (zh) 使用卷积序列学习的神经文本转语音的系统和方法
CN110556100B (zh) 端到端语音识别模型的训练方法及系统
CN110085261A (zh) 一种发音纠正方法、装置、设备以及计算机可读存储介质
CN108305643B (zh) 情感信息的确定方法和装置
US9818409B2 (en) Context-dependent modeling of phonemes
CN107871496B (zh) 语音识别方法和装置
US9600764B1 (en) Markov-based sequence tagging using neural networks
JP7055630B2 (ja) 音声認識のための学習方法、学習装置、コンピュータプログラム及び記憶媒体
CN107316638A (zh) 一种诗词背诵评测方法及系统、一种终端及存储介质
WO2020010338A1 (en) Hybrid audio synthesis using neural networks
US20170011736A1 (en) Method and device for recognizing voice
CN112397056B (zh) 语音评测方法及计算机存储介质
CN110223678A (zh) 语音识别方法及系统
CN108538285A (zh) 一种基于多任务神经网络的多样例关键词检测方法
Zaiem et al. Pretext tasks selection for multitask self-supervised audio representation learning
Munkhdalai et al. Nam+: Towards scalable end-to-end contextual biasing for adaptive asr
CN113053414A (zh) 一种发音评测方法及装置
CN108074562A (zh) 语音识别装置、语音识别方法以及存储介质
CN111613219A (zh) 语音数据识别方法、设备及介质
CN110708619B (zh) 一种智能设备的词向量训练方法及装置
US12233338B1 (en) Robust speech audio generation for video games
CN114566147B (zh) 语音评测方法、计算机设备、存储介质和计算机程序产品
CN109635302A (zh) 一种训练文本摘要生成模型的方法和装置