DE69602444D1 - System und verfahren zum einschränken des suchumfangs in einem lexikon - Google Patents

System und verfahren zum einschränken des suchumfangs in einem lexikon

Info

Publication number
DE69602444D1
DE69602444D1 DE69602444T DE69602444T DE69602444D1 DE 69602444 D1 DE69602444 D1 DE 69602444D1 DE 69602444 T DE69602444 T DE 69602444T DE 69602444 T DE69602444 T DE 69602444T DE 69602444 D1 DE69602444 D1 DE 69602444D1
Authority
DE
Germany
Prior art keywords
lexicon
unverified
string
partitioning
entries
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69602444T
Other languages
English (en)
Other versions
DE69602444T2 (de
Inventor
Liang Li
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
United Parcel Service of America Inc
United Parcel Service Inc
Original Assignee
United Parcel Service of America Inc
United Parcel Service Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by United Parcel Service of America Inc, United Parcel Service Inc filed Critical United Parcel Service of America Inc
Publication of DE69602444D1 publication Critical patent/DE69602444D1/de
Application granted granted Critical
Publication of DE69602444T2 publication Critical patent/DE69602444T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/232Orthographic correction, e.g. spell checking or vowelisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268Lexical context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Machine Translation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
DE69602444T 1995-06-07 1996-06-05 System und verfahren zum einschränken des suchumfangs in einem lexikon Expired - Lifetime DE69602444T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/477,481 US5774588A (en) 1995-06-07 1995-06-07 Method and system for comparing strings with entries of a lexicon
PCT/US1996/009333 WO1996041280A1 (en) 1995-06-07 1996-06-05 System and method for reducing the search scope in a lexicon

Publications (2)

Publication Number Publication Date
DE69602444D1 true DE69602444D1 (de) 1999-06-17
DE69602444T2 DE69602444T2 (de) 2000-01-05

Family

ID=23896088

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69602444T Expired - Lifetime DE69602444T2 (de) 1995-06-07 1996-06-05 System und verfahren zum einschränken des suchumfangs in einem lexikon

Country Status (7)

Country Link
US (1) US5774588A (de)
EP (1) EP0834138B1 (de)
JP (1) JP3077765B2 (de)
AT (1) ATE180072T1 (de)
CA (1) CA2222590C (de)
DE (1) DE69602444T2 (de)
WO (1) WO1996041280A1 (de)

Families Citing this family (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5963666A (en) * 1995-08-18 1999-10-05 International Business Machines Corporation Confusion matrix mediated word prediction
US6295524B1 (en) * 1996-10-29 2001-09-25 Nec Research Institute, Inc. Learning edit distance costs
US6157905A (en) * 1997-12-11 2000-12-05 Microsoft Corporation Identifying language and character set of data representing text
US7043426B2 (en) * 1998-04-01 2006-05-09 Cyberpulse, L.L.C. Structured speech recognition
JP3541930B2 (ja) * 1998-08-13 2004-07-14 富士通株式会社 符号化装置及び復号化装置
US6249605B1 (en) * 1998-09-14 2001-06-19 International Business Machines Corporation Key character extraction and lexicon reduction for cursive text recognition
US7110998B1 (en) * 1998-10-13 2006-09-19 Virtual Gold, Inc. Method and apparatus for finding hidden patterns in the context of querying applications
US7031985B1 (en) * 1999-03-08 2006-04-18 Oracle International Corporation Lexical cache
WO2000057350A1 (en) * 1999-03-19 2000-09-28 Raf Technology, Inc. Rollup functions for efficient storage, presentation, and analysis of data
AU755267B2 (en) 1999-05-12 2002-12-05 Siemens Aktiengesellschaft Address reading method
DE19933984C2 (de) * 1999-07-20 2001-05-31 Siemens Ag Verfahren zur Bildung und/oder Aktualisierung von Wörterbüchern zum automatischen Adreßlesen
US6671407B1 (en) * 1999-10-19 2003-12-30 Microsoft Corporation System and method for hashing digital images
US6647395B1 (en) * 1999-11-01 2003-11-11 Kurzweil Cyberart Technologies, Inc. Poet personalities
US6848080B1 (en) 1999-11-05 2005-01-25 Microsoft Corporation Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors
US7403888B1 (en) * 1999-11-05 2008-07-22 Microsoft Corporation Language input user interface
US6778683B1 (en) * 1999-12-08 2004-08-17 Federal Express Corporation Method and apparatus for reading and decoding information
US7047493B1 (en) * 2000-03-31 2006-05-16 Brill Eric D Spell checker with arbitrary length string-to-string transformations to improve noisy channel spelling correction
US6668085B1 (en) * 2000-08-01 2003-12-23 Xerox Corporation Character matching process for text converted from images
DE10054124A1 (de) * 2000-10-31 2002-05-08 Peter Linssen Verfahren zur Ermittlung von Ähnlichkeiten zwischen Ereignisfolgen
US6735560B1 (en) * 2001-01-31 2004-05-11 International Business Machines Corporation Method of identifying members of classes in a natural language understanding system
US7020775B2 (en) 2001-04-24 2006-03-28 Microsoft Corporation Derivation and quantization of robust non-local characteristics for blind watermarking
US6975743B2 (en) * 2001-04-24 2005-12-13 Microsoft Corporation Robust and stealthy video watermarking into regions of successive frames
US7356188B2 (en) * 2001-04-24 2008-04-08 Microsoft Corporation Recognizer of text-based work
US6973574B2 (en) * 2001-04-24 2005-12-06 Microsoft Corp. Recognizer of audio-content in digital signals
US6996273B2 (en) 2001-04-24 2006-02-07 Microsoft Corporation Robust recognizer of perceptually similar content
US7095873B2 (en) * 2002-06-28 2006-08-22 Microsoft Corporation Watermarking via quantization of statistics of overlapping regions
US7006703B2 (en) 2002-06-28 2006-02-28 Microsoft Corporation Content recognizer via probabilistic mirror distribution
DE10245834A1 (de) * 2002-10-01 2004-04-15 Siemens Ag Verfahren zum Erzeugen von Lern- und/oder Teststichproben
US20040139072A1 (en) * 2003-01-13 2004-07-15 Broder Andrei Z. System and method for locating similar records in a database
JP4486324B2 (ja) * 2003-06-19 2010-06-23 ヤフー株式会社 類似単語検索装置、この方法、このプログラム、および情報検索システム
US7451168B1 (en) 2003-06-30 2008-11-11 Data Domain, Inc. Incremental garbage collection of data in a secondary storage
US7424498B1 (en) 2003-06-30 2008-09-09 Data Domain, Inc. Probabilistic summary data structure based encoding for garbage collection
US20050086234A1 (en) * 2003-10-15 2005-04-21 Sierra Wireless, Inc., A Canadian Corporation Incremental search of keyword strings
US7831832B2 (en) * 2004-01-06 2010-11-09 Microsoft Corporation Digital goods representation based upon matrix invariances
US20050165690A1 (en) * 2004-01-23 2005-07-28 Microsoft Corporation Watermarking via quantization of rational statistics of regions
US7770014B2 (en) * 2004-04-30 2010-08-03 Microsoft Corporation Randomized signal transforms and their applications
US7529668B2 (en) * 2004-08-03 2009-05-05 Sony Corporation System and method for implementing a refined dictionary for speech recognition
US7895218B2 (en) * 2004-11-09 2011-02-22 Veveo, Inc. Method and system for performing searches for television content using reduced text input
US7634810B2 (en) * 2004-12-02 2009-12-15 Microsoft Corporation Phishing detection, prevention, and notification
US8291065B2 (en) * 2004-12-02 2012-10-16 Microsoft Corporation Phishing detection, prevention, and notification
US7779011B2 (en) 2005-08-26 2010-08-17 Veveo, Inc. Method and system for dynamically processing ambiguous, reduced text search queries and highlighting results thereof
US7788266B2 (en) 2005-08-26 2010-08-31 Veveo, Inc. Method and system for processing ambiguous, multi-term search queries
US7644054B2 (en) 2005-11-23 2010-01-05 Veveo, Inc. System and method for finding desired results by incremental search using an ambiguous keypad with the input containing orthographic and typographic errors
US7792815B2 (en) 2006-03-06 2010-09-07 Veveo, Inc. Methods and systems for selecting and presenting content based on context sensitive user preferences
US8073860B2 (en) 2006-03-30 2011-12-06 Veveo, Inc. Method and system for incrementally selecting and providing relevant search engines in response to a user query
US7461061B2 (en) 2006-04-20 2008-12-02 Veveo, Inc. User interface methods and systems for selecting and presenting content based on user navigation and selection actions associated with the content
US7558725B2 (en) * 2006-05-23 2009-07-07 Lexisnexis, A Division Of Reed Elsevier Inc. Method and apparatus for multilingual spelling corrections
CA2989780C (en) 2006-09-14 2022-08-09 Veveo, Inc. Methods and systems for dynamically rearranging search results into hierarchically organized concept clusters
WO2008045690A2 (en) 2006-10-06 2008-04-17 Veveo, Inc. Linear character selection display interface for ambiguous text input
US8078884B2 (en) 2006-11-13 2011-12-13 Veveo, Inc. Method of and system for selecting and presenting content based on user identification
WO2008148012A1 (en) 2007-05-25 2008-12-04 Veveo, Inc. System and method for text disambiguation and context designation in incremental search
US8077983B2 (en) 2007-10-04 2011-12-13 Zi Corporation Of Canada, Inc. Systems and methods for character correction in communication devices
US7962507B2 (en) * 2007-11-19 2011-06-14 Microsoft Corporation Web content mining of pair-based data
KR100946145B1 (ko) * 2008-05-13 2010-03-08 성균관대학교산학협력단 서열 유사도 측정 장치 및 그 제어방법
US9124431B2 (en) * 2009-05-14 2015-09-01 Microsoft Technology Licensing, Llc Evidence-based dynamic scoring to limit guesses in knowledge-based authentication
US8856879B2 (en) 2009-05-14 2014-10-07 Microsoft Corporation Social authentication for account recovery
US8644622B2 (en) * 2009-07-30 2014-02-04 Xerox Corporation Compact signature for unordered vector sets with application to image retrieval
US9166714B2 (en) 2009-09-11 2015-10-20 Veveo, Inc. Method of and system for presenting enriched video viewing analytics
EP2341467B1 (de) * 2009-09-24 2019-12-18 Nec Corporation Worterkennungsvorrichtung, programm zur übergangslosen speicherung auf einem computerlesbaren medium und vorrichtung zur klassifikation gelieferter artikel
US9703779B2 (en) 2010-02-04 2017-07-11 Veveo, Inc. Method of and system for enhanced local-device content discovery
GB201010545D0 (en) * 2010-06-23 2010-08-11 Rolls Royce Plc Entity recognition
US9384423B2 (en) * 2013-05-28 2016-07-05 Xerox Corporation System and method for OCR output verification
US10043009B2 (en) * 2014-09-24 2018-08-07 Intel Corporation Technologies for software basic block similarity analysis
JP2016099662A (ja) * 2014-11-18 2016-05-30 富士通株式会社 符号化プログラム、符号化装置、符号化方法および検索プログラム
US9928436B2 (en) 2015-07-08 2018-03-27 Conduent Business Services, Llc Lexicon-free, matching-based word-image recognition
WO2017009958A1 (ja) 2015-07-14 2017-01-19 富士通株式会社 圧縮プログラム、圧縮方法および圧縮装置
CN107102998A (zh) * 2016-02-22 2017-08-29 阿里巴巴集团控股有限公司 一种字符串距离计算方法和装置
US10635693B2 (en) * 2016-11-11 2020-04-28 International Business Machines Corporation Efficiently finding potential duplicate values in data
US10127219B2 (en) * 2016-12-09 2018-11-13 Hong Kong Applied Science and Technoloy Research Institute Company Limited System and method for organizing and processing feature based data structures

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4058795A (en) * 1972-10-03 1977-11-15 International Business Machines Corporation Method and apparatus for context-aided recognition
JPS5729745B2 (de) * 1974-09-25 1982-06-24
US3969698A (en) * 1974-10-08 1976-07-13 International Business Machines Corporation Cluster storage apparatus for post processing error correction of a character recognition machine
US3995254A (en) * 1975-07-16 1976-11-30 International Business Machines Corporation Digital reference matrix for word verification
US4771385A (en) * 1984-11-21 1988-09-13 Nec Corporation Word recognition processing time reduction system using word length and hash technique involving head letters
US5261009A (en) * 1985-10-15 1993-11-09 Palantir Corporation Means for resolving ambiguities in text passed upon character context
US5133023A (en) * 1985-10-15 1992-07-21 The Palantir Corporation Means for resolving ambiguities in text based upon character context
US4754489A (en) * 1985-10-15 1988-06-28 The Palantir Corporation Means for resolving ambiguities in text based upon character context
JPH0682403B2 (ja) * 1986-03-24 1994-10-19 沖電気工業株式会社 光学式文字読取装置
US5050218A (en) * 1986-08-26 1991-09-17 Nec Corporation Apparatus for recognizing address appearing on mail article
JPS63198154A (ja) * 1987-02-05 1988-08-16 インタ−ナショナル・ビジネス・マシ−ンズ・コ−ポレ−ション つづり誤り訂正装置
EP0312905B1 (de) * 1987-10-16 1992-04-29 Computer Gesellschaft Konstanz Mbh Verfahren zur automatischen Zeichenerkennung
US5062143A (en) * 1990-02-23 1991-10-29 Harris Corporation Trigram-based method of language identification
US5329609A (en) * 1990-07-31 1994-07-12 Fujitsu Limited Recognition apparatus with function of displaying plural recognition candidates
EP0470798B1 (de) * 1990-08-06 1997-10-29 Fujitsu Limited Wörterbuch-Suchsystem
US5276741A (en) * 1991-05-16 1994-01-04 Trw Financial Systems & Services, Inc. Fuzzy string matcher
CA2077604C (en) * 1991-11-19 1999-07-06 Todd A. Cass Method and apparatus for determining the frequency of words in a document without document image decoding

Also Published As

Publication number Publication date
CA2222590C (en) 2000-11-07
EP0834138B1 (de) 1999-05-12
JP3077765B2 (ja) 2000-08-14
EP0834138A1 (de) 1998-04-08
JPH11505052A (ja) 1999-05-11
DE69602444T2 (de) 2000-01-05
CA2222590A1 (en) 1996-12-19
US5774588A (en) 1998-06-30
WO1996041280A1 (en) 1996-12-19
ATE180072T1 (de) 1999-05-15

Similar Documents

Publication Publication Date Title
DE69602444T2 (de) System und verfahren zum einschränken des suchumfangs in einem lexikon
CN105869634A (zh) 一种基于领域的带反馈语音识别后文本纠错方法及系统
ATE426206T1 (de) Systeme und verfahren zum suchen durch verwendung von in einem anderen zeichensatz und/oder in einer anderen sprache als die zielseiten geschriebenen anfragen
CA2198306A1 (en) Method and apparatus for an improved language recognition system
CN103390004B (zh) 一种语义冗余的确定方法和装置、对应的搜索方法和装置
ATE398811T1 (de) Verfahren und system zum semantischen segmentieren von szenen einer videosequenz
DE10196212T1 (de) Verfahren und Vorrichtung zum Identifizieren von Verwandten suchen in einem Datenbanksuchsystem
WO2002077873A3 (en) System, method and apparatus for conducting a phrase search
EA199800623A1 (ru) Способ и система оценивания набора данных (варианты)
WO2005017682A3 (en) Product placement engine and method
CN102915299A (zh) 一种分词方法及装置
CN102027534B (zh) 语言模型得分前瞻值赋值方法及设备
Katsurada et al. Evaluation of fast spoken term detection using a suffix array
Müller et al. Improved modeling of out-of-vocabulary words using morphological classes
JPH06274546A (ja) 情報量一致度計算方式
JPH03116376A (ja) キーワード・マッチング装置
KR100385863B1 (ko) 상호정보를 이용한 한국어-영어 질의어 변환방법 및 장치
JP2003256448A5 (de)
KR101301534B1 (ko) 이형태 자동 구축 방법 및 장치
JPH03125264A (ja) キーワード抽出装置
JP2000348059A5 (ja) 文書検索装置
Tian et al. Fast QBE: Towards Real-Time Spoken Term Detection with Separable Model
JPS61122781A (ja) 音声ワ−ドプロセツサ
TW200519638A (en) Method for feature extraction and data decoding and method and system for searching piratic articles
KR20010097365A (ko) 영한기계번역 시스템 및 방법

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8328 Change in the person/name/address of the agent

Representative=s name: STIPPL PATENTANWAELTE, 90482 NUERNBERG