CN1760974B - 用于标识至少一个语音单元的方法 - Google Patents

用于标识至少一个语音单元的方法 Download PDF

Info

Publication number
CN1760974B
CN1760974B CN2005101099246A CN200510109924A CN1760974B CN 1760974 B CN1760974 B CN 1760974B CN 2005101099246 A CN2005101099246 A CN 2005101099246A CN 200510109924 A CN200510109924 A CN 200510109924A CN 1760974 B CN1760974 B CN 1760974B
Authority
CN
China
Prior art keywords
eigenwert
hidden state
voice
grating texture
confirm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2005101099246A
Other languages
English (en)
Chinese (zh)
Other versions
CN1760974A (zh
Inventor
A·阿赛罗
A·J·古纳瓦德纳
M·V·马哈詹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of CN1760974A publication Critical patent/CN1760974A/zh
Application granted granted Critical
Publication of CN1760974B publication Critical patent/CN1760974B/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Document Processing Apparatus (AREA)
CN2005101099246A 2004-10-15 2005-09-15 用于标识至少一个语音单元的方法 Expired - Fee Related CN1760974B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/966,047 2004-10-15
US10/966,047 US7627473B2 (en) 2004-10-15 2004-10-15 Hidden conditional random field models for phonetic classification and speech recognition

Publications (2)

Publication Number Publication Date
CN1760974A CN1760974A (zh) 2006-04-19
CN1760974B true CN1760974B (zh) 2012-04-18

Family

ID=35520793

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2005101099246A Expired - Fee Related CN1760974B (zh) 2004-10-15 2005-09-15 用于标识至少一个语音单元的方法

Country Status (7)

Country Link
US (1) US7627473B2 (enExample)
EP (1) EP1647970B1 (enExample)
JP (1) JP5072206B2 (enExample)
KR (1) KR101153078B1 (enExample)
CN (1) CN1760974B (enExample)
AT (1) ATE487212T1 (enExample)
DE (1) DE602005024497D1 (enExample)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8751226B2 (en) 2006-06-29 2014-06-10 Nec Corporation Learning a verification model for speech recognition based on extracted recognition and language feature information
KR100774800B1 (ko) * 2006-09-06 2007-11-07 한국정보통신대학교 산학협력단 포아송 폴링 기법을 이용한 세그먼트 단위의 음성/비음성분류 방법 및 장치
WO2008105263A1 (ja) * 2007-02-28 2008-09-04 Nec Corporation 重み係数学習システム及び音声認識システム
US7509163B1 (en) * 2007-09-28 2009-03-24 International Business Machines Corporation Method and system for subject-adaptive real-time sleep stage classification
KR101230183B1 (ko) * 2008-07-14 2013-02-15 광운대학교 산학협력단 오디오 신호의 상태결정 장치
US20100076978A1 (en) * 2008-09-09 2010-03-25 Microsoft Corporation Summarizing online forums into question-context-answer triples
US8140328B2 (en) * 2008-12-01 2012-03-20 At&T Intellectual Property I, L.P. User intention based on N-best list of recognition hypotheses for utterances in a dialog
US8306806B2 (en) * 2008-12-02 2012-11-06 Microsoft Corporation Adaptive web mining of bilingual lexicon
US8473430B2 (en) * 2010-01-29 2013-06-25 Microsoft Corporation Deep-structured conditional random fields for sequential labeling and classification
US9355683B2 (en) 2010-07-30 2016-05-31 Samsung Electronics Co., Ltd. Audio playing method and apparatus
US9031844B2 (en) 2010-09-21 2015-05-12 Microsoft Technology Licensing, Llc Full-sequence training of deep structures for speech recognition
US9164983B2 (en) 2011-05-27 2015-10-20 Robert Bosch Gmbh Broad-coverage normalization system for social media language
CN104933048B (zh) * 2014-03-17 2018-08-31 联想(北京)有限公司 一种语音信息处理方法、装置和电子设备
US9785891B2 (en) * 2014-12-09 2017-10-10 Conduent Business Services, Llc Multi-task conditional random field models for sequence labeling
CN104700833A (zh) * 2014-12-29 2015-06-10 芜湖乐锐思信息咨询有限公司 一种大数据语音分类方法
US9875736B2 (en) 2015-02-19 2018-01-23 Microsoft Technology Licensing, Llc Pre-training and/or transfer learning for sequence taggers
US11030407B2 (en) * 2016-01-28 2021-06-08 Rakuten, Inc. Computer system, method and program for performing multilingual named entity recognition model transfer
US10311863B2 (en) * 2016-09-02 2019-06-04 Disney Enterprises, Inc. Classifying segments of speech based on acoustic features and context
CN109829164B (zh) * 2019-02-01 2020-05-22 北京字节跳动网络技术有限公司 用于生成文本的方法和装置
CN110826320B (zh) * 2019-11-28 2023-10-13 上海观安信息技术股份有限公司 一种基于文本识别的敏感数据发现方法及系统

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030097266A1 (en) * 1999-09-03 2003-05-22 Alejandro Acero Method and apparatus for using formant models in speech systems

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3285047B2 (ja) * 1992-09-04 2002-05-27 日本電信電話株式会社 不特定話者用音声認識装置
JPH06266389A (ja) * 1993-03-10 1994-09-22 N T T Data Tsushin Kk 音素ラベリング装置
JPH0990975A (ja) * 1995-09-22 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> パターン認識のためのモデル学習方法

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030097266A1 (en) * 1999-09-03 2003-05-22 Alejandro Acero Method and apparatus for using formant models in speech systems

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Hanna M. Wallach.Conditional Random Fields: An Introduction.《Technical Reports (CIS) of University of Pennylvania》.2004,1-9. *
Steve Young.Acoustic Modelling for Large Vocabulary Continuous Speech Recognition.《Proc. NATO Advance Study Institute》.1999,1-23. *

Also Published As

Publication number Publication date
ATE487212T1 (de) 2010-11-15
EP1647970B1 (en) 2010-11-03
KR101153078B1 (ko) 2012-06-04
EP1647970A1 (en) 2006-04-19
KR20060050361A (ko) 2006-05-19
DE602005024497D1 (de) 2010-12-16
US20060085190A1 (en) 2006-04-20
JP2006113570A (ja) 2006-04-27
CN1760974A (zh) 2006-04-19
JP5072206B2 (ja) 2012-11-14
US7627473B2 (en) 2009-12-01

Similar Documents

Publication Publication Date Title
CN1760974B (zh) 用于标识至少一个语音单元的方法
CN112735373B (zh) 语音合成方法、装置、设备及存储介质
CN1667699B (zh) 为字母-声音转换生成有互信息标准的大文法音素单元
US10762305B2 (en) Method for generating chatting data based on artificial intelligence, computer device and computer-readable storage medium
US7603276B2 (en) Standard-model generation for speech recognition using a reference model
CN1667700B (zh) 把字的语音或声学描述、发音添加到语音识别词典的方法
EP3180785B1 (en) Systems and methods for speech transcription
CN113035231B (zh) 关键词检测方法及装置
CN100589179C (zh) 从文本中预测误词率的方法和设备
CN111081230B (zh) 语音识别方法和设备
CN112530408A (zh) 用于识别语音的方法、装置、电子设备和介质
US20080201147A1 (en) Distributed speech recognition system and method and terminal and server for distributed speech recognition
KR20170022445A (ko) 통합 모델 기반의 음성 인식 장치 및 방법
US20230178069A1 (en) Methods and systems for synthesising speech from text
US10553206B2 (en) Voice keyword detection apparatus and voice keyword detection method
CN114187921B (zh) 语音质量评价方法和装置
CN112164407B (zh) 音色转换方法及装置
CN116092485A (zh) 语音识别模型的训练方法及装置、语音识别方法及装置
CN115132196A (zh) 语音指令识别的方法、装置、电子设备及存储介质
CN114333772B (zh) 语音识别方法、装置、设备、可读存储介质及产品
CN115132195A (zh) 语音唤醒方法、装置、设备、存储介质及程序产品
CN100589180C (zh) 使用切换状态空间模型的多模变分推导的语音识别方法
CN112185340B (zh) 语音合成方法、语音合成装置、存储介质与电子设备
CN119234269A (zh) 检测语言模型融合asr系统中的无意记忆
US20070129946A1 (en) High quality speech reconstruction for a dialog method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MICROSOFT TECHNOLOGY LICENSING LLC

Free format text: FORMER OWNER: MICROSOFT CORP.

Effective date: 20150428

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20150428

Address after: Washington State

Patentee after: Micro soft technique license Co., Ltd

Address before: Washington State

Patentee before: Microsoft Corp.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120418

Termination date: 20190915