DE69829389D1 - Text normalization using a context-free grammar - Google Patents

Text normalization using a context-free grammar

Info

Publication number
DE69829389D1
Authority
DE
Germany
Prior art keywords
free grammar
context free
normalizing
text
text normalizing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69829389T
Other languages
English (en)
Other versions
DE69829389T2 (de)
Inventor
A Alleva
J Rozak
J Israel
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1997-04-03
Filing date
1998-04-03
Publication date
2005-04-21
Application filed by Microsoft Corp
Application granted
Publication of DE69829389D1
Publication of DE69829389T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/211 Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1815 Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
DE69829389T 1997-04-03 1998-04-03 Text normalization using a context-free grammar Expired - Lifetime DE69829389T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US840117 1997-04-03
US08/840,117 US5970449A (en) 1997-04-03 1997-04-03 Text normalization using a context-free grammar
PCT/US1998/006852 WO1998044484A1 (en) 1997-04-03 1998-04-03 Text normalization using a context-free grammar

Publications (2)

Publication Number Publication Date
DE69829389D1 (de) 2005-04-21
DE69829389T2 (de) 2006-02-09

Family

ID=25281495

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69829389T Expired - Lifetime DE69829389T2 (de) 1997-04-03 1998-04-03 Text normalization using a context-free grammar

Country Status (6)

Country Link
US (1) US5970449A (de)
EP (1) EP1016074B1 (de)
JP (1) JP2001519043A (de)
CN (1) CN1285068C (de)
DE (1) DE69829389T2 (de)
WO (1) WO1998044484A1 (de)

Families Citing this family (66)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2945887B2 (ja) * 1997-10-09 1999-09-06 オリンパス光学工業株式会社 Code image recording device
US6523031B1 (en) * 1997-11-21 2003-02-18 International Business Machines Corporation Method for obtaining structured information exists in special data format from a natural language text by aggregation
JP2000163418A (ja) * 1997-12-26 2000-06-16 Canon Inc Natural language processing apparatus and method, and storage medium storing the program
US6513002B1 (en) * 1998-02-11 2003-01-28 International Business Machines Corporation Rule-based number formatter
US6493662B1 (en) * 1998-02-11 2002-12-10 International Business Machines Corporation Rule-based number parser
US7181399B1 (en) * 1999-05-19 2007-02-20 At&T Corp. Recognizing the numeric language in natural spoken dialogue
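JP3709305B2 (ja) * 1999-07-01 2005-10-26 日立オムロンターミナルソリューションズ株式会社 Place name character string matching method, place name character string matching device, place name character string recognition device, and mail sorting system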
US6762699B1 (en) 1999-12-17 2004-07-13 The Directv Group, Inc. Method for lossless data compression using greedy sequential grammar transform and sequential encoding
US6640098B1 (en) * 2000-02-14 2003-10-28 Action Engine Corporation System for obtaining service-related information for local interactive wireless devices
US6704728B1 (en) 2000-05-02 2004-03-09 Iphase.Com, Inc. Accessing information from a collection of data
US8478732B1 (en) * 2000-05-02 2013-07-02 International Business Machines Corporation Database aliasing in information access system
US8290768B1 (en) 2000-06-21 2012-10-16 International Business Machines Corporation System and method for determining a set of attributes based on content of communications
US6408277B1 (en) 2000-06-21 2002-06-18 Banter Limited System and method for automatic task prioritization
US9699129B1 (en) 2000-06-21 2017-07-04 International Business Machines Corporation System and method for increasing email productivity
US20020099734A1 (en) * 2000-11-29 2002-07-25 Philips Electronics North America Corp. Scalable parser for extensible mark-up language
US7644057B2 (en) 2001-01-03 2010-01-05 International Business Machines Corporation System and method for electronic communication management
US7136846B2 (en) * 2001-04-06 2006-11-14 2005 Keel Company, Inc. Wireless information retrieval
US7152029B2 (en) 2001-07-18 2006-12-19 At&T Corp. Spoken language understanding that incorporates prior knowledge into boosting
US20030115066A1 (en) * 2001-12-17 2003-06-19 Seeley Albert R. Method of using automated speech recognition (ASR) for web-based voice applications
US7343372B2 (en) 2002-02-22 2008-03-11 International Business Machines Corporation Direct navigation for information retrieval
US7257531B2 (en) * 2002-04-19 2007-08-14 Medcom Information Systems, Inc. Speech to text system using controlled vocabulary indices
US7146320B2 (en) * 2002-05-29 2006-12-05 Microsoft Corporation Electronic mail replies with speech recognition
US7328146B1 (en) 2002-05-31 2008-02-05 At&T Corp. Spoken language understanding that incorporates prior knowledge into boosting
US20050187913A1 (en) 2003-05-06 2005-08-25 Yoram Nelken Web-based customer service interface
US8495002B2 (en) 2003-05-06 2013-07-23 International Business Machines Corporation Software tool for training and testing a knowledge base
WO2004109658A1 (ja) * 2003-06-02 2004-12-16 International Business Machines Corporation Voice response system, voice response method, voice server, voice file processing method, program, and recording medium
US7343604B2 (en) 2003-07-25 2008-03-11 International Business Machines Corporation Methods and apparatus for creation of parsing rules
US7672436B1 (en) 2004-01-23 2010-03-02 Sprint Spectrum L.P. Voice rendering of E-mail with tags for improved user experience
US20050216256A1 (en) * 2004-03-29 2005-09-29 Mitra Imaging Inc. Configurable formatting system and method
US20050240408A1 (en) * 2004-04-22 2005-10-27 Redin Jaime H Method and apparatus for entering verbal numerals in electronic devices
DE102004028724A1 (de) * 2004-06-14 2005-12-29 T-Mobile Deutschland Gmbh Method for natural-language recognition of numbers
US8335688B2 (en) * 2004-08-20 2012-12-18 Multimodal Technologies, Llc Document transcription system training
US8412521B2 (en) * 2004-08-20 2013-04-02 Multimodal Technologies, Llc Discriminative training of document transcription system
US7584103B2 (en) * 2004-08-20 2009-09-01 Multimodal Technologies, Inc. Automated extraction of semantic content and generation of a structured document from speech
US7630892B2 (en) * 2004-09-10 2009-12-08 Microsoft Corporation Method and apparatus for transducer-based text normalization and inverse text normalization
CN100462966C (zh) * 2004-09-14 2009-02-18 株式会社Ipb Device for creating a document correlation diagram that arranges documents in a time series
US8977953B1 (en) * 2006-01-27 2015-03-10 Linguastat, Inc. Customizing information by combining pair of annotations from at least two different documents
EP2030197A4 (de) * 2006-06-22 2012-04-04 Multimodal Technologies Llc Automatic decision support
WO2008066981A2 (en) * 2006-08-21 2008-06-05 Western Slope Utilities, Inc. Systems and methods for pipeline rehabilitation installation
US8671341B1 (en) 2007-01-05 2014-03-11 Linguastat, Inc. Systems and methods for identifying claims associated with electronic text
US7813929B2 (en) * 2007-03-30 2010-10-12 Nuance Communications, Inc. Automatic editing using probabilistic word substitution models
US20080312928A1 (en) * 2007-06-12 2008-12-18 Robert Patrick Goebel Natural language speech recognition calculator
US20090157385A1 (en) * 2007-12-14 2009-06-18 Nokia Corporation Inverse Text Normalization
JP2009244639A (ja) * 2008-03-31 2009-10-22 Sanyo Electric Co Ltd Utterance device, utterance control program, and utterance control method
US9460708B2 (en) * 2008-09-19 2016-10-04 Microsoft Technology Licensing, Llc Automated data cleanup by substitution of words of the same pronunciation and different spelling in speech recognition
US8364487B2 (en) * 2008-10-21 2013-01-29 Microsoft Corporation Speech recognition system with display information
US8990088B2 (en) * 2009-01-28 2015-03-24 Microsoft Corporation Tool and framework for creating consistent normalization maps and grammars
US8370155B2 (en) * 2009-04-23 2013-02-05 International Business Machines Corporation System and method for real time support for agents in contact center environments
CN102339228B (zh) * 2010-07-22 2017-05-10 上海果壳电子有限公司 Method for parsing a context-free grammar
US8959102B2 (en) 2010-10-08 2015-02-17 Mmodal Ip Llc Structured searching of dynamic structured document corpuses
US9110852B1 (en) * 2012-07-20 2015-08-18 Google Inc. Methods and systems for extracting information from text
US9146919B2 (en) * 2013-01-16 2015-09-29 Google Inc. Bootstrapping named entity canonicalizers from English using alignment models
US9471561B2 (en) * 2013-12-26 2016-10-18 International Business Machines Corporation Adaptive parser-centric text normalization
US9535904B2 (en) * 2014-03-26 2017-01-03 Microsoft Technology Licensing, Llc Temporal translation grammar for language translation
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
CN104360897B (zh) * 2014-10-29 2017-09-22 百度在线网络技术(北京)有限公司 Dialogue processing method and dialogue management system
EP3369002A4 (de) * 2015-10-26 2019-06-12 24/7 Customer, Inc. Method and apparatus for facilitating prediction of customer intentions
US20170154029A1 (en) * 2015-11-30 2017-06-01 Robert Martin Kane System, method, and apparatus to normalize grammar of textual data
US11316865B2 (en) 2017-08-10 2022-04-26 Nuance Communications, Inc. Ambient cooperative intelligence system and method
US11482308B2 (en) 2017-08-10 2022-10-25 Nuance Communications, Inc. Automated clinical documentation system and method
US10496382B2 (en) * 2018-02-22 2019-12-03 Midea Group Co., Ltd. Machine generation of context-free grammar for intent deduction
US11250382B2 (en) 2018-03-05 2022-02-15 Nuance Communications, Inc. Automated clinical documentation system and method
US10789955B2 (en) * 2018-11-16 2020-09-29 Google Llc Contextual denormalization for automatic speech recognition
CN111370083B (zh) * 2018-12-26 2023-04-25 阿里巴巴集团控股有限公司 Text structuring method and device
US11182504B2 (en) * 2019-04-29 2021-11-23 Microsoft Technology Licensing, Llc System and method for speaker role determination and scrubbing identifying information
US11482214B1 (en) * 2019-12-12 2022-10-25 Amazon Technologies, Inc. Hypothesis generation and selection for inverse text normalization for search

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4914704A (en) * 1984-10-30 1990-04-03 International Business Machines Corporation Text editor for speech input
US4829576A (en) * 1986-10-21 1989-05-09 Dragon Systems, Inc. Voice recognition system
US5231670A (en) * 1987-06-01 1993-07-27 Kurzweil Applied Intelligence, Inc. Voice controlled system and method for generating text from a voice controlled input
US5349526A (en) * 1991-08-07 1994-09-20 Occam Research Corporation System and method for converting sentence elements unrecognizable by a computer system into base language elements recognizable by the computer system
DE69232407T2 (de) * 1991-11-18 2002-09-12 Toshiba Kawasaki Kk Speech dialogue system for facilitating computer-human interaction
US5371807A (en) * 1992-03-20 1994-12-06 Digital Equipment Corporation Method and apparatus for text classification
DE69327446T2 (de) * 1992-11-18 2000-05-11 Canon Information Syst Inc Method and apparatus for extracting text from a structured file and converting it into speech
DE69326431T2 (de) * 1992-12-28 2000-02-03 Toshiba Kawasaki Kk Speech recognition interface system usable as a window system and a voice mail system
JPH0736882A (ja) * 1993-07-19 1995-02-07 Fujitsu Ltd Dictionary search device
US5651096A (en) * 1995-03-14 1997-07-22 Apple Computer, Inc. Merging of language models from two or more application programs for a speech recognition system

Also Published As

Publication number Publication date
CN1285068C (zh) 2006-11-15
DE69829389T2 (de) 2006-02-09
JP2001519043A (ja) 2001-10-16
US5970449A (en) 1999-10-19
CN1255224A (zh) 2000-05-31
EP1016074B1 (de) 2005-03-16
EP1016074A1 (de) 2000-07-05
WO1998044484A1 (en) 1998-10-08

Similar Documents

Publication Publication Date Title
DE69829389D1 (de) Text normalization using a context-free grammar
DE69922104D1 (de) Speech recognizer with a vocabulary adaptable by spelled word input
DE69421077D1 (de) Word string recognition
DE69919842D1 (de) Language model based on the speech recognition history
FI971822A (fi) Speech recognition
DE69712927D1 (de) CELP codec
DE69629667D1 (de) Speech segmentation
DE69423588D1 (de) Speech recognition device
DE69933623D1 (de) Speech recognition
DE69718234T2 (de) Speech coder
DE69524321D1 (de) Speech recognizer
DE69524890T2 (de) Parametric speech coding
HK1040312A1 (zh) Voice recognition device
DE69920714D1 (de) Speech recognition
DE69942288D1 (de) Speech decoding
DE60000244D1 (de) Recognition device
NL1000245C1 (nl) Pile.
DE29718665U1 (de) Connecting device for a framework construction
NL194481B (nl) Speech synthesis device.
FI3346U1 (fi) Carbonator
DE69811323D1 (de) Image recognition by localized interpretation
FI102344B1 (fi) Integrated speech coding
DE9404569U1 (de) Platform designed as a roof step or the like
DE59813611D1 (de) Carriage for a guide
DE29710253U1 (de) Delineator post

Legal Events

Date Code Title Description
8364 No opposition during term of opposition