CA2473172A1 - Information retrieval and speech recognition based on language models - Google Patents

Information retrieval and speech recognition based on language models

Info

Publication number
CA2473172A1
Authority
CA
Canada
Prior art keywords
data store
information
language model
speech recognition
language models
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002473172A
Other languages
French (fr)
Other versions
CA2473172C (en)
Inventor
Milind V. Mahajan
Xuedong D. Huang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1998-03-30
Filing date
1999-02-09
Publication date
1999-10-07
Priority claimed from US09/050,286 (US6418431B1)
Application filed by Individual
Publication of CA2473172A1
Application granted
Publication of CA2473172C
Anticipated expiration
Expired - Fee Related

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A language model is used in a speech recognition system which has access to a first, smaller data store and a second, larger data store. The language model is adapted by formulating an information retrieval query based on information contained in the first data store and querying the second data store. Information retrieved from the second data store is used in adapting the language model. Also, language models are used in retrieving information from the second data store. Language models are built based on information in the first data store, and based on information in the second data store. The perplexity of a document in the second data store is determined, given the first language model, and given the second language model. Relevancy of the document is determined based upon the first and second perplexities. Documents are retrieved which have a relevancy measure that exceeds a threshold level.
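The retrieval step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the add-one-smoothed unigram models, the function names (`train_unigram`, `perplexity`, `retrieve`), and in particular the way the two perplexities are combined (the ratio of second-model perplexity to first-model perplexity) are assumptions made here for illustration, since the abstract only states that relevancy is determined from both perplexities.

```python
import math
from collections import Counter


def train_unigram(docs):
    """Return (word counts, total token count) for a list of documents."""
    counts = Counter(word for doc in docs for word in doc.lower().split())
    return counts, sum(counts.values())


def perplexity(doc, model, vocab_size):
    """Perplexity of a non-empty doc under an add-one-smoothed unigram model."""
    counts, total = model
    words = doc.lower().split()
    log_prob = sum(
        math.log((counts[w] + 1) / (total + vocab_size)) for w in words
    )
    return math.exp(-log_prob / len(words))


def retrieve(small_store, large_store, threshold):
    """Return (relevance, doc) pairs from large_store scoring above threshold."""
    first_lm = train_unigram(small_store)   # model of the first, smaller store
    second_lm = train_unigram(large_store)  # model of the second, larger store
    vocab = set(first_lm[0]) | set(second_lm[0])
    vocab_size = len(vocab) + 1  # one extra slot reserves mass for unseen words

    results = []
    for doc in large_store:
        if not doc.split():
            continue  # skip empty documents; perplexity is undefined for them
        first_ppl = perplexity(doc, first_lm, vocab_size)
        second_ppl = perplexity(doc, second_lm, vocab_size)
        # Assumed combination: a document that is much less "surprising" to the
        # first-store model than to the second-store model scores above 1.
        relevance = second_ppl / first_ppl
        if relevance > threshold:
            results.append((relevance, doc))
    return sorted(results, reverse=True)
```

For example, with a small store of speech-related notes and a large store of mixed documents, `retrieve(small_store, large_store, 1.0)` returns only those documents that the first-store model predicts better than the second-store model does. A real system would likely use higher-order n-gram models and a tuned threshold.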
CA002473172A 1998-03-30 1999-02-09 Information retrieval and speech recognition based on language models Expired - Fee Related CA2473172C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/050,286 US6418431B1 (en) 1998-03-30 1998-03-30 Information retrieval and speech recognition based on language models
US09/050,286 1998-03-30
CA002321112A CA2321112C (en) 1998-03-30 1999-02-09 Information retrieval and speech recognition based on language models

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CA002321112A Division CA2321112C (en) 1998-03-30 1999-02-09 Information retrieval and speech recognition based on language models

Publications (2)

Publication Number Publication Date
CA2473172A1 (en) 1999-10-07
CA2473172C (en) 2005-10-18

Family

ID=32909193

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002473172A Expired - Fee Related CA2473172C (en) 1998-03-30 1999-02-09 Information retrieval and speech recognition based on language models

Country Status (1)

Country Link
CA (1) CA2473172C (en)

Also Published As

Publication number Publication date
CA2473172C (en) 2005-10-18

Similar Documents

Publication Publication Date Title
CA2321112A1 (en) Information retrieval and speech recognition based on language models
Schmitz Inducing ontology from flickr tags
US8849787B2 (en) Two stage search
US5937422A (en) Automatically generating a topic description for text and searching and sorting text by topic using the same
US6345275B2 (en) Apparatus and method for retrieving image information in computer
EP1209582A3 (en) Document retrieval method and system and computer readable storage medium
WO2001084373A3 (en) Information retrieval
WO2001084374A3 (en) Information access method
EP1624386A3 (en) Searching for data objects
WO2000003315A3 (en) A search system and method for retrieval of data, and the use thereof in a search engine
WO1995012173A3 (en) Database search summary with user determined characteristics
EP1696437A3 (en) Storage medium storing search information and reproducing apparatus and method
EP1560130A3 (en) Image retrieval system and image retrieval method
US20090157656A1 (en) Automatic, computer-based similarity calculation system for quantifying the similarity of text expressions
WO2002027524A3 (en) A method and system for describing and identifying concepts in natural language text for information retrieval and processing
CA2152971A1 (en) Collapsible Keyboard Structure for a Notebook Computer
WO2003100662A3 (en) Associative database searching using fpga devices
WO2000054168A3 (en) Database annotation and retrieval
EP1128282A3 (en) Method and system for search and retrieval of similar patterns
WO1996021901A3 (en) User interface for full-text document retrieval
EP1211616A3 (en) Data storage and retrieval system
EP1024440A3 (en) Method for data retrieval
WO2002021339A3 (en) Method and apparatus for xml data storage, query rewrites, visualization, mapping and references
EP0822503A1 (en) Document retrieval system
Besançon et al. Textual similarities based on a distributional approach

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed

Effective date: 2019-02-11