JPH0454261B2 - Google Patents
Info
- Publication number
- JPH0454261B2 (application JP62259880A, also written JP25988087A)
- Authority
- JP
- Japan
- Prior art keywords
- dictionary
- input
- keyword
- storage area
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Concept | Type | Sections | Count |
---|---|---|---|
method | Methods | claims, description | 29 |
morphologic effect | Effects | claims, description | 16 |
processing | Methods | claims, description | 13 |
mapping | Methods | description | 31 |
transposition | Effects | description | 9 |
change | Effects | description | 6 |
process | Effects | description | 5 |
deletion | Methods | description | 4 |
deletion | Effects | description | 4 |
insertion | Methods | description | 4 |
insertion | Effects | description | 4 |
substitution reaction | Methods | description | 4 |
transformation | Effects | description | 4 |
mechanism | Effects | description | 3 |
similarity measure | Methods | description | 3 |
transforming effect | Effects | description | 3 |
effects | Effects | description | 2 |
function | Effects | description | 2 |
measurement | Methods | description | 2 |
rearrangement | Effects | description | 2 |
transformations | Methods | description | 2 |
acute effect | Effects | description | 1 |
analytical method | Methods | description | 1 |
approach | Methods | description | 1 |
characterization method | Methods | description | 1 |
combined effect | Effects | description | 1 |
composite material | Substances | description | 1 |
computational method | Methods | description | 1 |
fragmentation | Methods | description | 1 |
fragmentation reaction | Methods | description | 1 |
statistical method | Methods | description | 1 |
synchronised effect | Effects | description | 1 |
testing method | Methods | description | 1 |
transition | Effects | description | 1 |
Classifications
- G—PHYSICS
  - G06—COMPUTING; CALCULATING OR COUNTING
    - G06F—ELECTRIC DIGITAL DATA PROCESSING
      - G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
        - G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
          - G06F16/33—Querying
            - G06F16/3331—Query processing
              - G06F16/334—Query execution
                - G06F16/3343—Query execution using phonetics
              - G06F16/3332—Query translation
                - G06F16/3335—Syntactic pre-processing, e.g. stopword elimination, stemming
      - G06F40/00—Handling natural language data
        - G06F40/20—Natural language analysis
          - G06F40/237—Lexical tools
            - G06F40/247—Thesauruses; Synonyms
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US94212386A | 1986-12-16 | 1986-12-16 | |
US942123 | 1986-12-16 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
JPS63157262A (ja) | 1988-06-30 |
JPH0454261B2 (fr) | 1992-08-28 |
Family
ID=25477608
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP62259880A (Granted), JPS63157262A (ja) | Method for ranking word similarities | 1986-12-16 | 1987-10-16 |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP0271664B1 (fr) |
JP (1) | JPS63157262A (fr) |
DE (1) | DE3751359D1 (fr) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0494573A1 (fr) * | 1991-01-08 | 1992-07-15 | International Business Machines Corporation | Method for automatically disambiguating synonym links in a dictionary for a natural language processing system |
DE4213533C2 (de) * | 1992-04-22 | 1996-01-25 | Ibm | Method and computer system for decomposing compound words |
EP0639814B1 (fr) * | 1993-08-20 | 2000-06-14 | Canon Kabushiki Kaisha | Apparatus and method for adaptive, non-literal text search |
US5606690A (en) * | 1993-08-20 | 1997-02-25 | Canon Inc. | Non-literal textual search using fuzzy finite non-deterministic automata |
JP3113814B2 (ja) * | 1996-04-17 | 2000-12-04 | インターナショナル・ビジネス・マシーンズ・コーポレ−ション | Information retrieval method and information retrieval apparatus |
KR100421530B1 (ko) * | 2001-03-06 | 2004-03-09 | 김시환 | Information retrieval method |
GB2391647A (en) * | 2002-08-07 | 2004-02-11 | Sharp Kk | Generating a List of Terms and a Thesaurus from Input Terms |
JP4333516B2 (ja) | 2004-08-05 | 2009-09-16 | ソニー株式会社 | Recording control apparatus and method, and program |
WO2007144199A1 (fr) | 2006-06-16 | 2007-12-21 | Omikron Data Quality Gmbh | Method for automatically evaluating the similarity of two character strings stored in a computer |
US8244521B2 (en) | 2007-01-11 | 2012-08-14 | Microsoft Corporation | Paraphrasing the web by search-based data collection |
US9300322B2 (en) | 2014-06-20 | 2016-03-29 | Oracle International Corporation | Encoding of plain ASCII data streams |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4580241A (en) * | 1983-02-18 | 1986-04-01 | Houghton Mifflin Company | Graphic word spelling correction using automated dictionary comparisons with phonetic skeletons |
1987
- 1987-10-16 JP JP62259880A (JPS63157262A): Granted
- 1987-10-16 EP EP87115183A (EP0271664B1): Expired - Lifetime
- 1987-10-16 DE DE3751359T (DE3751359D1): Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
EP0271664A3 (fr) | 1991-11-27 |
DE3751359D1 (de) | 1995-07-27 |
EP0271664A2 (fr) | 1988-06-22 |
EP0271664B1 (fr) | 1995-06-21 |
JPS63157262A (ja) | 1988-06-30 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US4833610A (en) | Morphological/phonetic method for ranking word similarities | |
Karimi et al. | Machine transliteration survey | |
KR100318762B1 (ko) | Method for calculating the phonetic distance of foreign-word transliterations | |
US9110980B2 (en) | Searching and matching of data | |
JPS6211932A (ja) | Information retrieval method | |
AU2007268059A1 (en) | Method and apparatus for multilingual spelling corrections | |
Jiampojamarn et al. | Transliteration generation and mining with limited training resources | |
EP2162838B1 (fr) | Recherche phonétique utilisant une chaîne normalisée | |
JP2010519655A (ja) | Name indexing for a name matching system | |
Naseem et al. | A novel approach for ranking spelling error corrections for Urdu | |
JPH0454261B2 (fr) | | |
Freihat et al. | Towards an optimal solution to lemmatization in Arabic | |
Medhat et al. | A hybrid cross-language name matching technique using novel modified Levenshtein Distance | |
Chaudhuri | Reversed word dictionary and phonetically similar word grouping based spell-checker to Bangla text | |
Robertson et al. | Searching for historical word-forms in a database of 17th-century English text using spelling-correction methods | |
JP4486324B2 (ja) | Similar word search device, method and program therefor, and information retrieval system | |
Vīksna et al. | Multilingual slavic named entity recognition | |
Yousef | Cross language duplicate record detection in big data | |
Ren et al. | A hybrid approach to automatic Chinese text checking and error correction | |
JP2002132789A (ja) | Document retrieval method | |
Sulaiman et al. | The effectiveness of a Jawi stemmer for retrieving relevant Malay documents in Jawi characters | |
Rani et al. | Post-processing methodology for word level Telugu character recognition systems using Unicode Approximation Models | |
CN1323004A (zh) | Method for automatic conversion of Chinese Braille to Chinese characters | |
Kiawkaew et al. | A Practical Technique for Thai-English Word Mapping Using Phonetic Rules: Person Name Matching Case Study | |
Graliński et al. | Mining historical texts for diachronic spelling variants |