WO2005077098A8 - Saisie manuscrite et vocale a correction automatique - Google Patents

Saisie manuscrite et vocale a correction automatique

Info

Publication number
WO2005077098A8
WO2005077098A8 PCT/US2005/004359 US2005004359W WO2005077098A8 WO 2005077098 A8 WO2005077098 A8 WO 2005077098A8 US 2005004359 W US2005004359 W US 2005004359W WO 2005077098 A8 WO2005077098 A8 WO 2005077098A8
Authority
WO
WIPO (PCT)
Prior art keywords
words
entered
language
handwriting
voice input
Prior art date
Application number
PCT/US2005/004359
Other languages
English (en)
Other versions
WO2005077098A3 (fr
WO2005077098B1 (fr
WO2005077098A2 (fr
Inventor
Alex Robinson
Ethan R Bradford
David Kay
Meurs Pim Van
James Stephanick
Original Assignee
America Online Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/043,506 external-priority patent/US7319957B2/en
Priority claimed from US11/043,525 external-priority patent/US20050192802A1/en
Priority to BRPI0507577-7A priority Critical patent/BRPI0507577A/pt
Priority to AU2005211782A priority patent/AU2005211782B2/en
Priority to JP2006553258A priority patent/JP2007524949A/ja
Priority to EP05722955A priority patent/EP1714234A4/fr
Application filed by America Online Inc filed Critical America Online Inc
Priority to CN2005800046235A priority patent/CN1918578B/zh
Priority to CA2556065A priority patent/CA2556065C/fr
Publication of WO2005077098A2 publication Critical patent/WO2005077098A2/fr
Publication of WO2005077098A3 publication Critical patent/WO2005077098A3/fr
Publication of WO2005077098B1 publication Critical patent/WO2005077098B1/fr
Publication of WO2005077098A8 publication Critical patent/WO2005077098A8/fr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268Lexical context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/224Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Character Discrimination (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

La présente invention concerne une approche hybride destinée à améliorer la reconnaissance de l'écriture manuscrite et la reconnaissance vocale dans les systèmes informatiques. Dans un mode de réalisation, on utilise un module de premier plan pour reconnaître les traits, les caractères et/ou les phonèmes. Le module de premier plan renvoie des candidats affectés de probabilités relatives ou absolues de correspondre à l'entrée. Partant de caractéristiques linguistiques de la langue, par exemple langue alphabétique ou à idéogrammes pour les mots en cours de saisie, par exemple de la fréquence des mots et locutions en cours d'utilisation, de parties vraisemblables d'élocution du mot saisi, de la morphologie de la langue, ou du contexte dans lequel le mot est saisi, un module de second plan combine les candidats déterminé par le module de premier plan des entrées pour que les mots correspondent à des mots connus et aux probabilités d'utilisation de tels mots dans le contexte en cours.
PCT/US2005/004359 2004-02-11 2005-02-08 Saisie manuscrite et vocale a correction automatique WO2005077098A2 (fr)

Priority Applications (6)

Application Number Priority Date Filing Date Title
CA2556065A CA2556065C (fr) 2004-02-11 2005-02-08 Saisie manuscrite et vocale a correction automatique
CN2005800046235A CN1918578B (zh) 2004-02-11 2005-02-08 具有自动校正的手写及语音输入
AU2005211782A AU2005211782B2 (en) 2004-02-11 2005-02-08 Handwriting and voice input with automatic correction
JP2006553258A JP2007524949A (ja) 2004-02-11 2005-02-08 自動訂正機能を備えた手書き文字入力およびボイス入力
EP05722955A EP1714234A4 (fr) 2004-02-11 2005-02-08 Saisie manuscrite et vocale a correction automatique
BRPI0507577-7A BRPI0507577A (pt) 2004-02-11 2005-02-08 entrada de caligrafia e voz com correção automática

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US54417004P 2004-02-11 2004-02-11
US60/544,170 2004-02-11
US11/043,506 US7319957B2 (en) 2004-02-11 2005-01-25 Handwriting and voice input with automatic correction
US11/043,506 2005-01-25
US11/043,525 2005-01-25
US11/043,525 US20050192802A1 (en) 2004-02-11 2005-01-25 Handwriting and voice input with automatic correction

Publications (4)

Publication Number Publication Date
WO2005077098A2 WO2005077098A2 (fr) 2005-08-25
WO2005077098A3 WO2005077098A3 (fr) 2005-11-03
WO2005077098B1 WO2005077098B1 (fr) 2005-12-08
WO2005077098A8 true WO2005077098A8 (fr) 2007-05-10

Family

ID=34865026

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/004359 WO2005077098A2 (fr) 2004-02-11 2005-02-08 Saisie manuscrite et vocale a correction automatique

Country Status (9)

Country Link
EP (1) EP1714234A4 (fr)
JP (1) JP2007524949A (fr)
KR (1) KR100912753B1 (fr)
CN (1) CN1918578B (fr)
AU (1) AU2005211782B2 (fr)
BR (1) BRPI0507577A (fr)
CA (1) CA2556065C (fr)
TW (1) TW200538969A (fr)
WO (1) WO2005077098A2 (fr)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008076812A (ja) * 2006-09-22 2008-04-03 Honda Motor Co Ltd 音声認識装置、音声認識方法、及び音声認識プログラム
KR100908444B1 (ko) * 2006-12-05 2009-07-21 한국전자통신연구원 음소 인식 기반의 탐색공간 제한을 이용한 연속음성인식장치 및 방법
US8032374B2 (en) 2006-12-05 2011-10-04 Electronics And Telecommunications Research Institute Method and apparatus for recognizing continuous speech using search space restriction based on phoneme recognition
US8237665B2 (en) * 2008-03-11 2012-08-07 Microsoft Corporation Interpreting ambiguous inputs on a touch-screen
JP5541166B2 (ja) 2009-01-20 2014-07-09 日本電気株式会社 入力装置、情報処理装置、入力方法およびプログラム
JP2011065322A (ja) * 2009-09-16 2011-03-31 Konica Minolta Holdings Inc 文字認識システム及び文字認識プログラム、並びに音声認識システム及び音声認識プログラム
US8543382B2 (en) * 2010-10-27 2013-09-24 King Abdulaziz City for Science and Technology (KACST) Method and system for diacritizing arabic language text
CN103631802B (zh) * 2012-08-24 2015-05-20 腾讯科技(深圳)有限公司 歌曲信息检索方法、装置及相应的服务器
DE102013009375A1 (de) * 2012-12-28 2014-07-03 Volkswagen Aktiengesellschaft Verfahren zum Eingeben und Erkennen einer Zeichenkette
GB201321927D0 (en) * 2013-12-11 2014-01-22 Touchtype Ltd System and method for inputting text into electronic devices
TWI587281B (zh) * 2014-11-07 2017-06-11 Papago Inc Voice control system and its method
TWI616868B (zh) * 2014-12-30 2018-03-01 鴻海精密工業股份有限公司 會議記錄裝置及其自動生成會議記錄的方法
TWI619115B (zh) * 2014-12-30 2018-03-21 鴻海精密工業股份有限公司 會議記錄裝置及其自動生成會議記錄的方法
CN105810197B (zh) * 2014-12-30 2019-07-26 联想(北京)有限公司 语音处理方法、语音处理装置和电子设备
JP6310155B2 (ja) * 2015-07-17 2018-04-11 楽天株式会社 文字認識装置、文字認識方法及び文字認識プログラム
KR101636823B1 (ko) * 2015-11-27 2016-07-07 (주)인키움 자기소개서 자동 제공 서버 및 제공 방법
CN106406807A (zh) * 2016-09-19 2017-02-15 北京云知声信息技术有限公司 一种语音修改文字的方法及装置
JP7143665B2 (ja) 2018-07-27 2022-09-29 富士通株式会社 音声認識装置、音声認識プログラムおよび音声認識方法
DE102018213602B3 (de) * 2018-08-13 2019-10-31 Audi Ag Verfahren zum Erzeugen einer Sprachansage als Rückmeldung zu einer handschriftlichen Nutzereingabe sowie entsprechende Bedienvorrichtung und Kraftfahrzeug
CN109584882B (zh) * 2018-11-30 2022-12-27 南京天溯自动化控制系统有限公司 一种针对特定场景的语音转文字的优化方法及系统
KR102577589B1 (ko) * 2019-10-22 2023-09-12 삼성전자주식회사 음성 인식 방법 및 음성 인식 장치
TWI771720B (zh) 2020-07-24 2022-07-21 華碩電腦股份有限公司 具有多型態輸入之辨識方法及使用其之電子裝置
CN116097347A (zh) * 2022-09-16 2023-05-09 英华达(上海)科技有限公司 语音实时翻译方法、系统、设备以及存储介质
US11726657B1 (en) 2023-03-01 2023-08-15 Daniel Pohoryles Keyboard input method, system, and techniques

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4003025A (en) * 1975-12-24 1977-01-11 International Business Machines Corporation Alphabetic character word upper/lower case print convention apparatus and method
US5244802A (en) * 1987-11-18 1993-09-14 Phytogen Regeneration of cotton
US5828991A (en) * 1995-06-30 1998-10-27 The Research Foundation Of The State University Of New York Sentence reconstruction using word ambiguity resolution
US5917941A (en) * 1995-08-08 1999-06-29 Apple Computer, Inc. Character segmentation technique with integrated word search for handwriting recognition
US5950160A (en) * 1996-10-31 1999-09-07 Microsoft Corporation Method and system for displaying a variable number of alternative words during speech recognition
US5926566A (en) * 1996-11-15 1999-07-20 Synaptics, Inc. Incremental ideographic character input method
US5896321A (en) * 1997-11-14 1999-04-20 Microsoft Corporation Text completion system for a miniature computer
US6393395B1 (en) * 1999-01-07 2002-05-21 Microsoft Corporation Handwriting and speech recognizer using neural network with separate start and continuation output scores
US20020152075A1 (en) * 2001-04-16 2002-10-17 Shao-Tsu Kung Composite input method
US7444286B2 (en) * 2001-09-05 2008-10-28 Roth Daniel L Speech recognition using re-utterance recognition
US7225130B2 (en) * 2001-09-05 2007-05-29 Voice Signal Technologies, Inc. Methods, systems, and programming for performing speech recognition

Also Published As

Publication number Publication date
JP2007524949A (ja) 2007-08-30
CA2556065C (fr) 2012-07-03
CN1918578A (zh) 2007-02-21
WO2005077098A3 (fr) 2005-11-03
BRPI0507577A (pt) 2007-07-03
WO2005077098B1 (fr) 2005-12-08
KR100912753B1 (ko) 2009-08-18
EP1714234A4 (fr) 2012-03-21
TW200538969A (en) 2005-12-01
CN1918578B (zh) 2012-05-02
CA2556065A1 (fr) 2005-08-25
AU2005211782A1 (en) 2005-08-25
KR20070090075A (ko) 2007-09-05
AU2005211782B2 (en) 2009-01-22
WO2005077098A2 (fr) 2005-08-25
EP1714234A2 (fr) 2006-10-25

Similar Documents

Publication Publication Date Title
WO2005077098A8 (fr) Saisie manuscrite et vocale a correction automatique
US9786273B2 (en) Multimodal disambiguation of speech recognition
EP2466450B1 (fr) Procédé et appareil de correction d'erreurs de reconnaissance de la parole
WO2006086511A8 (fr) Procede et appareil utilisant la saisie vocale pour resoudre une saisie de texte manuelle ambigue
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
WO2003058603A3 (fr) Systeme et procede relatifs a une reconnaissance vocale par reconnaissance multipassage utilisant des grammaires specifiques de contexte
CN111415656B (zh) 语音语义识别方法、装置及车辆
KR100825690B1 (ko) 음성 인식 시스템에서의 인식 오류 수정 방법
EP1205908A3 (fr) Prononciation de nouveaux mots pour le traitement de la parole
US7676364B2 (en) System and method for speech-to-text conversion using constrained dictation in a speak-and-spell mode
US20150081272A1 (en) Simultaneous speech processing apparatus and method
CN110942767B (zh) 一种asr语言模型识别标注与优化方法及其装置
CA2596126A1 (fr) Reconnaissance de la parole par langage statistique faisant appel a une actualisation quadratique
Yeung et al. Improving automatic forced alignment for dysarthric speech transcription.
Hatmi et al. Incorporating named entity recognition into the speech transcription process
JP6001944B2 (ja) 音声コマンド制御装置、音声コマンド制御方法及び音声コマンド制御プログラム
WO2004008433A3 (fr) Systeme et procede de reconnaissance vocale pour le mandarin utilisant un appareil telephonique optimise
Kirchhoff et al. Cross-dialectal acoustic data sharing for Arabic speech recognition
JP2009031328A (ja) 音声認識装置
KR20050101695A (ko) 인식 결과를 이용한 통계적인 음성 인식 시스템 및 그 방법
Geutner et al. Selection criteria for hypothesis driven lexical adaptation
Zhou et al. A two-level schema for detecting recognition errors.
KR20110017600A (ko) 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법
Arısoy et al. Discriminative n-gram language modeling for Turkish
Arısoy et al. Feature combination approaches for discriminative language models

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

B Later publication of amended claims

Effective date: 20051005

WWE Wipo information: entry into national phase

Ref document number: 2556065

Country of ref document: CA

Ref document number: 2200/KOLNP/2006

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2005211782

Country of ref document: AU

Ref document number: 2006553258

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2005722955

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 200580004623.5

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2005211782

Country of ref document: AU

Date of ref document: 20050208

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 2005211782

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 1020067018544

Country of ref document: KR

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 2005722955

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0507577

Country of ref document: BR