ATE496342T1 - Representation of a "deleted interpolation" N-gram language model in ARPA standard format

Info

Publication number
ATE496342T1
Authority
AT
Austria
Prior art keywords
language model
deleted
interpolation
representation
standard format
Prior art date
Application number
AT05102283T
Other languages
English (en)
Inventor
Alejandro Acero
Ciprian Chelba
Milind Mahajan
Original Assignee
Microsoft Corp
Priority date
2004-03-26
Filing date
2005-03-22
Publication date
2011-02-15
Application filed by Microsoft Corp
Application granted
Publication of ATE496342T1

Classifications

    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00: Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00: Handling natural language data
    • G06F40/20: Natural language analysis
    • G06F40/279: Recognition of textual entities
    • G06F40/284: Lexical analysis, e.g. tokenisation or collocates
    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00: Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10: Complex mathematical operations
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/18: Speech classification or search using natural language modelling
    • G10L15/183: Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19: Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197: Probabilistic grammars, e.g. word n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Algebra (AREA)
  • Pure & Applied Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Machine Translation (AREA)
  • Devices For Executing Special Programs (AREA)
AT05102283T 2004-03-26 2005-03-22 Representation of a "deleted interpolation" N-gram language model in ARPA standard format ATE496342T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/810,254 US7406416B2 (en) 2004-03-26 2004-03-26 Representation of a deleted interpolation N-gram language model in ARPA standard format

Publications (1)

Publication Number Publication Date
ATE496342T1 (de) 2011-02-15

Family

ID=34862105

Family Applications (1)

Application Number Title Priority Date Filing Date
AT05102283T ATE496342T1 (de) 2004-03-26 2005-03-22 Representation of a "deleted interpolation" N-gram language model in ARPA standard format

Country Status (7)

Country Link
US (1) US7406416B2 (de)
EP (1) EP1580667B1 (de)
JP (1) JP4974470B2 (de)
KR (1) KR101120773B1 (de)
CN (1) CN100535890C (de)
AT (1) ATE496342T1 (de)
DE (1) DE602005025955D1 (de)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8700404B1 (en) 2005-08-27 2014-04-15 At&T Intellectual Property Ii, L.P. System and method for using semantic and syntactic graphs for utterance classification
US20070078653A1 (en) * 2005-10-03 2007-04-05 Nokia Corporation Language model compression
US20080282154A1 (en) * 2006-09-11 2008-11-13 Nurmi Mikko A Method and apparatus for improved text input
US7774197B1 (en) 2006-09-27 2010-08-10 Raytheon Bbn Technologies Corp. Modular approach to building large language models
US8332207B2 (en) * 2007-03-26 2012-12-11 Google Inc. Large language models in machine translation
WO2010051654A1 (en) * 2008-11-05 2010-05-14 Google Inc. Custom language models
US8798983B2 (en) * 2009-03-30 2014-08-05 Microsoft Corporation Adaptation for statistical language model
US8655647B2 (en) * 2010-03-11 2014-02-18 Microsoft Corporation N-gram selection for practical-sized language models
US9367526B1 (en) * 2011-07-26 2016-06-14 Nuance Communications, Inc. Word classing for language modeling
CN102982024B (zh) * 2011-09-02 2016-03-23 北京百度网讯科技有限公司 Search requirement identification method and device
CN102509549B (zh) * 2011-09-28 2013-08-14 盛乐信息技术(上海)有限公司 Language model training method and system
US9224386B1 (en) 2012-06-22 2015-12-29 Amazon Technologies, Inc. Discriminative language model training using a confusion matrix
US9292487B1 (en) * 2012-08-16 2016-03-22 Amazon Technologies, Inc. Discriminative language model pruning
US20150088511A1 (en) * 2013-09-24 2015-03-26 Verizon Patent And Licensing Inc. Named-entity based speech recognition
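KR101509727B1 (ko) 2013-10-02 2015-04-07 주식회사 시스트란인터내셔널 Apparatus and method for generating an alignment corpus based on self-learning alignment, and apparatus and method for morphological analysis of corrupted expressions using the alignment corpus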
US9400783B2 (en) * 2013-11-26 2016-07-26 Xerox Corporation Procedure for building a max-ARPA table in order to compute optimistic back-offs in a language model
US10311046B2 (en) * 2016-09-12 2019-06-04 Conduent Business Services, Llc System and method for pruning a set of symbol-based sequences by relaxing an independence assumption of the sequences

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US1940720A (en) * 1931-03-16 1933-12-26 Madsen Jens A Windfeld Water softener
US4096017A (en) * 1977-02-18 1978-06-20 H. C. Price Co. Method and article for forming field joints on pipe coated with thermoplastic material
US4111017A (en) * 1977-06-21 1978-09-05 The United States Of America As Represented By The United States Department Of Energy Manually operated coded switch
US5258909A (en) * 1989-08-31 1993-11-02 International Business Machines Corporation Method and apparatus for "wrong word" spelling error detection and correction
US5199464A (en) * 1989-12-28 1993-04-06 Interprovincial Pipe Line, Inc. Pipeline repair sleeve assembly having heat sink groove
US5267345A (en) * 1992-02-10 1993-11-30 International Business Machines Corporation Speech recognition apparatus which predicts word classes from context and words from word classes
IT1254723B (it) * 1992-03-18 1995-10-09 Snam Spa Improved process for repairing localized damage to pipelines by applying armor shells with an interposed protective sheath
EP0602296A1 (de) * 1992-12-17 1994-06-22 International Business Machines Corporation Adaptive method for generating field-dependent models for intelligent systems
US5467425A (en) * 1993-02-26 1995-11-14 International Business Machines Corporation Building scalable N-gram language models using maximum likelihood maximum entropy N-gram models
JP2886121B2 (ja) * 1995-11-10 1999-04-26 株式会社エイ・ティ・アール音声翻訳通信研究所 Statistical language model generation device and speech recognition device
US5937384A (en) 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US5722463A (en) * 1996-11-25 1998-03-03 Petro-Line Upgrading Services Ltd. External pipe reinforcing sleeve
CA2192620C (en) * 1996-12-11 2000-08-29 Gerald Henderson Pipe repair assembly
US6188976B1 (en) 1998-10-23 2001-02-13 International Business Machines Corporation Apparatus and method for building domain-specific language models
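JP2000250583A (ja) * 1999-03-02 2000-09-14 Atr Interpreting Telecommunications Res Lab Statistical language model generation device and speech recognition device
JP2000356997A (ja) 1999-06-15 2000-12-26 Atr Interpreting Telecommunications Res Lab Statistical language model generation device and speech recognition device
JP2001142881A (ja) 1999-11-16 2001-05-25 Nippon Telegr & Teleph Corp <Ntt> Statistical language model and probability calculation method using the same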

Also Published As

Publication number Publication date
KR20060044753A (ko) 2006-05-16
CN100535890C (zh) 2009-09-02
EP1580667A3 (de) 2007-10-10
US7406416B2 (en) 2008-07-29
EP1580667B1 (de) 2011-01-19
DE602005025955D1 (de) 2011-03-03
KR101120773B1 (ko) 2012-03-23
JP4974470B2 (ja) 2012-07-11
US20050216265A1 (en) 2005-09-29
CN1673997A (zh) 2005-09-28
EP1580667A2 (de) 2005-09-28
JP2005293580A (ja) 2005-10-20

Similar Documents

Publication Publication Date Title
ATE496342T1 (de) Representation of a "deleted interpolation" N-gram language model in ARPA standard format
AT8676U9 (de) Copy milling device for producing workpieces, in particular dental workpieces
WO2006017493A3 (en) Approach for creating a tag or attribute in a markup language document
DE602005018542D1 (de) Method for producing a bottle-shaped can
WO2009076383A3 (en) Automatically generating formulas based on parameters of a model
ATE439665T1 (de) Method for personalizing a service
WO2006092800A3 (en) System and method for scanning an intraoral cavity
WO2007062658A3 (en) Impression scanning for manufacturing of dental restorations
DE60325188D1 (de) Method for producing sterile, stabilized nanodispersions
DE602005019848D1 (de) Method for producing TAD-dried ti
TW200701017A (en) Method and apparatus for efficient indexed storage for unstructured content
EP1758290A4 (de) Storage medium conversion method, program and device
DE102004010312B8 (de) Method for calibrating an operating point
ATE449096T1 (de) Method for producing 3(R)-(2-hydroxy-2,2-dithien-2-ylacetoxy)-1-(3-phenoxypropyl)-1-azoniabicyclo[2.2.2]octane bromide
WO2006130862A3 (en) Generating a volume of interest using a dose isocontour
DE602005016091D1 (de) Method for producing a molded wood part
GB2426612C (en) Method and apparatus for generating configuration.
ZA200803046B (en) Enantioselective epoxide hydrolase and method for preparing an enantiopure epoxide using the same
GB0704478D0 (en) Method for estimating absorption parameter Q(T)
GB0606235D0 (en) Apparatus and method for model adaptation for spoken language understanding
WO2006039498A3 (en) Method and apparatus for a handle
WO2007034425A3 (en) A method of and a system for adapting a geometric model using multiple partial transformations
ATE311949T1 (de) Method for producing a high-strength container, in particular an aerosol container
DE602004003383D1 (de) Method for producing a power tool
WO2007016391A3 (en) An automated method and tool for documenting a transformer design

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties