JP2001291060A - 単語列照合装置および単語列照合方法 - Google Patents

単語列照合装置および単語列照合方法

Info

Publication number
JP2001291060A
JP2001291060A JP2000102370A JP2000102370A JP2001291060A JP 2001291060 A JP2001291060 A JP 2001291060A JP 2000102370 A JP2000102370 A JP 2000102370A JP 2000102370 A JP2000102370 A JP 2000102370A JP 2001291060 A JP2001291060 A JP 2001291060A
Authority
JP
Japan
Prior art keywords
word
word string
words
string
distance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2000102370A
Other languages
English (en)
Japanese (ja)
Inventor
Naoki Natori
直毅 名取
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to JP2000102370A priority Critical patent/JP2001291060A/ja
Priority to KR10-2001-0017871A priority patent/KR100417306B1/ko
Priority to US09/824,876 priority patent/US6643647B2/en
Publication of JP2001291060A publication Critical patent/JP2001291060A/ja
Priority to US10/653,924 priority patent/US7124130B2/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268Lexical context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99936Pattern matching access
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99937Sorting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99951File or database maintenance
    • Y10S707/99952Coherency, e.g. same view to multiple users
    • Y10S707/99953Recoverability
JP2000102370A 2000-04-04 2000-04-04 単語列照合装置および単語列照合方法 Pending JP2001291060A (ja)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2000102370A JP2001291060A (ja) 2000-04-04 2000-04-04 単語列照合装置および単語列照合方法
KR10-2001-0017871A KR100417306B1 (ko) 2000-04-04 2001-04-04 단어열 대조장치, 단어열 대조방법 및 주소 인식장치
US09/824,876 US6643647B2 (en) 2000-04-04 2001-04-04 Word string collating apparatus, word string collating method and address recognition apparatus
US10/653,924 US7124130B2 (en) 2000-04-04 2003-09-04 Word string collating apparatus, word string collating method and address recognition apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2000102370A JP2001291060A (ja) 2000-04-04 2000-04-04 単語列照合装置および単語列照合方法

Publications (1)

Publication Number Publication Date
JP2001291060A true JP2001291060A (ja) 2001-10-19

Family

ID=18616268

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2000102370A Pending JP2001291060A (ja) 2000-04-04 2000-04-04 単語列照合装置および単語列照合方法

Country Status (3)

Country Link
US (2) US6643647B2 (US06643647-20031104-M00002.png)
JP (1) JP2001291060A (US06643647-20031104-M00002.png)
KR (1) KR100417306B1 (US06643647-20031104-M00002.png)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011036830A1 (ja) * 2009-09-24 2011-03-31 日本電気株式会社 単語認識装置、方法及びプログラムが格納された非一時的なコンピュータ可読媒体並びに発送物区分装置
JP4809477B2 (ja) * 2006-06-09 2011-11-09 ソニー エリクソン モバイル コミュニケーションズ, エービー 電子メールアドレスの検査

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7031002B1 (en) 1998-12-31 2006-04-18 International Business Machines Corporation System and method for using character set matching to enhance print quality
US7039637B2 (en) * 1998-12-31 2006-05-02 International Business Machines Corporation System and method for evaluating characters in an inputted search string against a character table bank comprising a predetermined number of columns that correspond to a plurality of pre-determined candidate character sets in order to provide enhanced full text search
US7286115B2 (en) 2000-05-26 2007-10-23 Tegic Communications, Inc. Directional input system with automatic correction
US7030863B2 (en) 2000-05-26 2006-04-18 America Online, Incorporated Virtual keyboard system with automatic correction
US7750891B2 (en) * 2003-04-09 2010-07-06 Tegic Communications, Inc. Selective input system based on tracking of motion parameters of an input device
US7821503B2 (en) 2003-04-09 2010-10-26 Tegic Communications, Inc. Touch screen and graphical user interface
US7191114B1 (en) 1999-08-27 2007-03-13 International Business Machines Corporation System and method for evaluating character sets to determine a best match encoding a message
US7899665B2 (en) * 2004-08-20 2011-03-01 International Business Machines Corporation Methods and systems for detecting the alphabetic order used by different languages
JP4855698B2 (ja) * 2005-03-22 2012-01-18 株式会社東芝 宛先認識装置
JP4740060B2 (ja) * 2006-07-31 2011-08-03 富士通株式会社 重複データ検出プログラム、重複データ検出方法および重複データ検出装置
US8255216B2 (en) 2006-10-30 2012-08-28 Nuance Communications, Inc. Speech recognition of character sequences
KR100835289B1 (ko) * 2006-11-20 2008-06-05 엔에이치엔(주) 키 배열 정보를 이용한 단어 추천 방법 및 그 시스템
US8201087B2 (en) * 2007-02-01 2012-06-12 Tegic Communications, Inc. Spell-check for a keyboard system with automatic correction
US8225203B2 (en) 2007-02-01 2012-07-17 Nuance Communications, Inc. Spell-check for a keyboard system with automatic correction
DE102007010259A1 (de) 2007-03-02 2008-09-04 Volkswagen Ag Sensor-Auswertevorrichtung und Verfahren zum Auswerten von Sensorsignalen
US8775931B2 (en) * 2007-03-30 2014-07-08 Blackberry Limited Spell check function that applies a preference to a spell check algorithm based upon extensive user selection of spell check results generated by the algorithm, and associated handheld electronic device
US8023719B2 (en) * 2007-08-15 2011-09-20 International Business Machines Corporation MICR reader using phase angle extracted from frequency domain analysis
KR101126406B1 (ko) * 2008-11-27 2012-04-20 엔에이치엔(주) 유사어 결정 방법 및 시스템
US20110106836A1 (en) * 2009-10-30 2011-05-05 International Business Machines Corporation Semantic Link Discovery
US20130007004A1 (en) * 2011-06-30 2013-01-03 Landon Ip, Inc. Method and apparatus for creating a search index for a composite document and searching same
US10146979B2 (en) * 2015-06-03 2018-12-04 Lenovo Enterprise Solutions (Singapore) Pte. Ltd. Processing visual cues to improve device understanding of user input
US9858385B2 (en) * 2015-07-23 2018-01-02 International Business Machines Corporation Identifying errors in medical data
CN105446957B (zh) * 2015-12-03 2018-07-20 小米科技有限责任公司 相似性确定方法、装置及终端
JP6690484B2 (ja) * 2016-09-15 2020-04-28 富士通株式会社 音声認識用コンピュータプログラム、音声認識装置及び音声認識方法
KR102132745B1 (ko) * 2016-10-21 2020-07-10 두나무 주식회사 Sms 메시지를 이용한 주식 매매일지 작성 장치
KR102322703B1 (ko) * 2016-10-21 2021-11-08 두나무 주식회사 주식 매매일지 자동 작성 방법
CN107133215A (zh) * 2017-05-20 2017-09-05 复旦大学 一种脱机手写中文规范地址识别方法

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2737173B2 (ja) * 1988-10-25 1998-04-08 日本電気株式会社 記号列照合装置とその制御方法
US5020112A (en) * 1989-10-31 1991-05-28 At&T Bell Laboratories Image recognition method using two-dimensional stochastic grammars
US5497488A (en) * 1990-06-12 1996-03-05 Hitachi, Ltd. System for parallel string search with a function-directed parallel collation of a first partition of each string followed by matching of second partitions
US5526444A (en) * 1991-12-10 1996-06-11 Xerox Corporation Document image decoding using modified branch-and-bound methods
US5321773A (en) * 1991-12-10 1994-06-14 Xerox Corporation Image recognition method using finite state networks
US5535119A (en) * 1992-06-11 1996-07-09 Hitachi, Ltd. Character inputting method allowing input of a plurality of different types of character species, and information processing equipment adopting the same
US5699456A (en) * 1994-01-21 1997-12-16 Lucent Technologies Inc. Large vocabulary connected speech recognition system and method of language representation using evolutional grammar to represent context free grammars
US5594809A (en) * 1995-04-28 1997-01-14 Xerox Corporation Automatic training of character templates using a text line image, a text line transcription and a line image source model
JP3040945B2 (ja) * 1995-11-29 2000-05-15 松下電器産業株式会社 文書検索装置
US5933525A (en) * 1996-04-10 1999-08-03 Bbn Corporation Language-independent and segmentation-free optical character recognition system and method
US5873111A (en) * 1996-05-10 1999-02-16 Apple Computer, Inc. Method and system for collation in a processing system of a variety of distinct sets of information
US5995963A (en) * 1996-06-27 1999-11-30 Fujitsu Limited Apparatus and method of multi-string matching based on sparse state transition list
JP3143079B2 (ja) * 1997-05-30 2001-03-07 松下電器産業株式会社 辞書索引作成装置と文書検索装置
JPH1153384A (ja) * 1997-08-05 1999-02-26 Mitsubishi Electric Corp キーワード抽出装置及びキーワード抽出方法並びにキーワード抽出プログラムを格納したコンピュータ読み取り可能な記録媒体
JP3275816B2 (ja) * 1998-01-14 2002-04-22 日本電気株式会社 記号列検索方法及び記号列検索装置並びに記号列検索プログラムを記録した記録媒体
US6507678B2 (en) * 1998-06-19 2003-01-14 Fujitsu Limited Apparatus and method for retrieving character string based on classification of character

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4809477B2 (ja) * 2006-06-09 2011-11-09 ソニー エリクソン モバイル コミュニケーションズ, エービー 電子メールアドレスの検査
WO2011036830A1 (ja) * 2009-09-24 2011-03-31 日本電気株式会社 単語認識装置、方法及びプログラムが格納された非一時的なコンピュータ可読媒体並びに発送物区分装置
JP5621777B2 (ja) * 2009-09-24 2014-11-12 日本電気株式会社 単語認識装置、方法及びプログラムが格納された非一時的なコンピュータ可読媒体並びに発送物区分装置
US9101961B2 (en) 2009-09-24 2015-08-11 Nec Corporation Word recognition apparatus, word recognition method, non-transitory computer readable medium storing word recognition program, and delivery item sorting apparatus

Also Published As

Publication number Publication date
US7124130B2 (en) 2006-10-17
US20010031088A1 (en) 2001-10-18
US20040044676A1 (en) 2004-03-04
US6643647B2 (en) 2003-11-04
KR100417306B1 (ko) 2004-02-05
KR20010095304A (ko) 2001-11-03

Similar Documents

Publication Publication Date Title
JP2001291060A (ja) 単語列照合装置および単語列照合方法
US8745077B2 (en) Searching and matching of data
JP3689455B2 (ja) 情報処理方法及び装置
US7092567B2 (en) Post-processing system and method for correcting machine recognized text
EP0844583B1 (en) Method and apparatus for character recognition
US8069033B2 (en) Document based character ambiguity resolution
US20090006394A1 (en) Systems and methods for validating an address
US20060004744A1 (en) Method and system for approximate string matching
CN111209447A (zh) 一种基于音形码的中文字符串相似度计算方法及装置
US20140229484A1 (en) Extraction method, computer product, extracting apparatus, and extracting system
JPH087033A (ja) 情報処理方法及び装置
JP4066507B2 (ja) 日本語文字認識誤り訂正方法及び装置、並びに、誤り訂正プログラムを記録した記録媒体
CN111782892B (zh) 基于前缀树的相似字符识别方法、设备、装置和存储介质
WO2016181470A1 (ja) 認識装置、認識方法およびプログラム
CN111814781A (zh) 用于对图像块识别结果进行校正的方法、设备和存储介质
KR20110044253A (ko) 근사조합장치, 근사조합방법, 프로그램 및 기록매체
JPH05257982A (ja) 文字列認識方法
JP3080066B2 (ja) 文字認識装置、方法及び記憶媒体
JP2003331214A (ja) 文字認識誤り訂正方法、装置及びプログラム
JP2015170129A (ja) 認識装置、認識方法およびプログラム
JP3548372B2 (ja) 文字認識装置
JP2894305B2 (ja) 認識装置の候補修正方式
US8019158B2 (en) Method and computer program product for recognition error correction data
KR100258923B1 (ko) 한글 및 영문 성명인식 및 오인식 교정방법
JP2986255B2 (ja) 文字認識装置