DE60200519D1 - Verfahren und Vorrichtung zur verteilten Spracherkennung - Google Patents

Verfahren und Vorrichtung zur verteilten Spracherkennung

Info

Publication number
DE60200519D1
DE60200519D1 DE60200519T DE60200519T DE60200519D1 DE 60200519 D1 DE60200519 D1 DE 60200519D1 DE 60200519 T DE60200519 T DE 60200519T DE 60200519 T DE60200519 T DE 60200519T DE 60200519 D1 DE60200519 D1 DE 60200519D1
Authority
DE
Germany
Prior art keywords
acoustic
terminal
server
encoding
processing condition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60200519T
Other languages
English (en)
Other versions
DE60200519T2 (de
Inventor
Tetsuo Kosaka
Hiroki Yamamoto
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Publication of DE60200519D1 publication Critical patent/DE60200519D1/de
Application granted granted Critical
Publication of DE60200519T2 publication Critical patent/DE60200519T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Exchange Systems With Centralized Control (AREA)
DE60200519T 2001-03-08 2002-03-06 Verfahren und Vorrichtung zur verteilten Spracherkennung Expired - Lifetime DE60200519T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001065383 2001-03-08
JP2001065383A JP2002268681A (ja) 2001-03-08 2001-03-08 音声認識システム及び方法及び該システムに用いる情報処理装置とその方法

Publications (2)

Publication Number Publication Date
DE60200519D1 true DE60200519D1 (de) 2004-07-01
DE60200519T2 DE60200519T2 (de) 2005-06-02

Family

ID=18924045

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60200519T Expired - Lifetime DE60200519T2 (de) 2001-03-08 2002-03-06 Verfahren und Vorrichtung zur verteilten Spracherkennung

Country Status (5)

Country Link
US (1) US20020128826A1 (de)
EP (1) EP1239462B1 (de)
JP (1) JP2002268681A (de)
AT (1) ATE268044T1 (de)
DE (1) DE60200519T2 (de)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3542578B2 (ja) * 2001-11-22 2004-07-14 キヤノン株式会社 音声認識装置及びその方法、プログラム
JP4217495B2 (ja) 2003-01-29 2009-02-04 キヤノン株式会社 音声認識辞書作成方法、音声認識辞書作成装置及びプログラム、記録媒体
KR100672355B1 (ko) 2004-07-16 2007-01-24 엘지전자 주식회사 음성 코딩/디코딩 방법 및 그를 위한 장치
JP4603429B2 (ja) * 2005-06-17 2010-12-22 日本電信電話株式会社 クライアント・サーバ音声認識方法、サーバ計算機での音声認識方法、音声特徴量抽出・送信方法、これらの方法を用いたシステム、装置、プログラムおよび記録媒体
JP4769121B2 (ja) * 2006-05-15 2011-09-07 日本電信電話株式会社 サーバ・クライアント型音声認識方法、装置およびサーバ・クライアント型音声認識プログラム、記録媒体
KR100861653B1 (ko) * 2007-05-25 2008-10-02 주식회사 케이티 음성 특징을 이용한 네트워크 기반 분산형 음성 인식단말기, 서버, 및 그 시스템 및 그 방법
EP2721607A1 (de) * 2011-06-15 2014-04-23 Bone Tone Communications (Israel) Ltd. Spracherkennungssystem, -vorrichtung und -verfahren
US10032036B2 (en) * 2011-09-14 2018-07-24 Shahab Khan Systems and methods of multidimensional encrypted data transfer
US9251723B2 (en) * 2011-09-14 2016-02-02 Jonas Moses Systems and methods of multidimensional encrypted data transfer
US9460729B2 (en) 2012-09-21 2016-10-04 Dolby Laboratories Licensing Corporation Layered approach to spatial audio coding
EP3975000A1 (de) 2015-06-01 2022-03-30 Sinclair Broadcast Group, Inc. Unterbrechungsstatusdetektion in inhaltsverwaltungssystemen
US10068568B2 (en) 2015-06-01 2018-09-04 Sinclair Broadcast Group, Inc. Content segmentation and time reconciliation
CA2988105C (en) * 2015-06-01 2024-06-18 Benjamin Aaron Miller Content segmentation and time reconciliation
US10855765B2 (en) 2016-05-20 2020-12-01 Sinclair Broadcast Group, Inc. Content atomization

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69028072T2 (de) * 1989-11-06 1997-01-09 Canon Kk Verfahren und Einrichtung zur Sprachsynthese
JPH03150599A (ja) * 1989-11-07 1991-06-26 Canon Inc 日本語音節の符号化方式
US6236964B1 (en) * 1990-02-01 2001-05-22 Canon Kabushiki Kaisha Speech recognition apparatus and method for matching inputted speech and a word generated from stored referenced phoneme data
JPH04362698A (ja) * 1991-06-11 1992-12-15 Canon Inc 音声認識方法及び装置
JP3066920B2 (ja) * 1991-06-11 2000-07-17 キヤノン株式会社 音声認識方法及び装置
US5627939A (en) * 1993-09-03 1997-05-06 Microsoft Corporation Speech recognition system and method employing data compression
US5680506A (en) * 1994-12-29 1997-10-21 Lucent Technologies Inc. Apparatus and method for speech signal analysis
JPH09258771A (ja) * 1996-03-25 1997-10-03 Canon Inc 音声処理方法及び装置
JP3397568B2 (ja) * 1996-03-25 2003-04-14 キヤノン株式会社 音声認識方法及び装置
JPH1097276A (ja) * 1996-09-20 1998-04-14 Canon Inc 音声認識方法及び装置並びに記憶媒体
JPH10161692A (ja) * 1996-12-03 1998-06-19 Canon Inc 音声認識装置及び音声認識方法
JP3962445B2 (ja) * 1997-03-13 2007-08-22 キヤノン株式会社 音声処理方法及び装置
JPH10254486A (ja) * 1997-03-13 1998-09-25 Canon Inc 音声認識装置および方法
US6009387A (en) * 1997-03-20 1999-12-28 International Business Machines Corporation System and method of compression/decompressing a speech signal by using split vector quantization and scalar quantization
US6223157B1 (en) * 1998-05-07 2001-04-24 Dsc Telecom, L.P. Method for direct recognition of encoded speech data
JP2000047696A (ja) * 1998-07-29 2000-02-18 Canon Inc 情報処理方法及び装置、その記憶媒体
US20020116180A1 (en) * 2001-02-20 2002-08-22 Grinblat Zinovy D. Method for transmission and storage of speech

Also Published As

Publication number Publication date
US20020128826A1 (en) 2002-09-12
DE60200519T2 (de) 2005-06-02
EP1239462A1 (de) 2002-09-11
JP2002268681A (ja) 2002-09-20
ATE268044T1 (de) 2004-06-15
EP1239462B1 (de) 2004-05-26

Similar Documents

Publication Publication Date Title
DE60200519D1 (de) Verfahren und Vorrichtung zur verteilten Spracherkennung
EP1701340A3 (de) Kodiervorrichtung und Dekodiervorrichtung
DE60220959D1 (de) Verfahren und Vorrichtung zur Bereitstellung einer Liste von öffentlichen Schlüsseln in einem Public-Key-System
DE60007620D1 (de) Spracherkennungsverfahren
CN101903947A (zh) 使用接收器进行上下文抑制的系统、方法及设备
ATE305655T1 (de) Vorrichtung und verfahren zum codieren eines zeitdiskreten audiosignals und vorrichtung und verfahren zum decodieren von codierten audiodaten
ATE292524T1 (de) Vorrichtung und verfahren zur telefonie-basierten spracherkennung für das bereitstellen von informationen zum sortieren von poststücken und paketen.
ATE372572T1 (de) Vorrichtung und verfahren zur konfiguration von sprachlesern unter verwendung semantischer analyse
DE60103424D1 (de) Verbessern der leistung von kodierungssystemen, die hochfrequenz-rekonstruktionsverfahren verwenden
ATE354156T1 (de) Verfahren zum training oder zur adaption eines spracherkenners
WO2003005340A3 (en) Method and apparatus for improving voice recognition performance in a voice application distribution system
DE60111329D1 (de) Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
EP1447792A3 (de) Verfahren und Vorrichtung zur Modellierung eines Spracherkennungssystems und zur Schätzung einer Wort-Fehlerrate basierend auf einem Text
CN1650349A (zh) 用于抗噪声语音识别的在线参数直方图正态化
ATE319160T1 (de) Verfahren zur rauschrobusten klassifikation in der sprachkodierung
ATE218260T1 (de) Vorrichtung und verfahren zur videosignalkodierung
DE60322433D1 (de) Vorrichtung und verfahren zur mehrfachbeschreibungscodierung
ATE340399T1 (de) Für eine benutzergruppe spezifisches musterverarbeitungssystem
WO2002103675A8 (en) Client-server based distributed speech recognition system architecture
CN110211610A (zh) 评估音频信号损失的方法、装置及存储介质
ATE352078T1 (de) Verfahren, system und vorrichtung zur authentifierung von durch einen benutzer übertragener und/oder empfangener daten
ATE316283T1 (de) Vorrichtung zur verbesserung der spracherkennung
ATE527596T1 (de) Erhalten von konfigurationsdaten für ein datenverarbeitungsgerät
DE60206619D1 (de) Verfahren und vorrichtung zur erzeugung und verteilung von interaktiven echtzeit-medieninhalten uber drahtlose kommunikationsnetze und das internet
SE0201366L (sv) Framställning av en frekvenskod ur en transformerad bild som representerar ett fingeravtryck att användas vid kontroll av en persons identitet

Legal Events

Date Code Title Description
8364 No opposition during term of opposition