CN1302030A - 词义消歧的机器翻译方法和系统 - Google Patents
词义消歧的机器翻译方法和系统 Download PDFInfo
- Publication number
- CN1302030A CN1302030A CN99126567A CN99126567A CN1302030A CN 1302030 A CN1302030 A CN 1302030A CN 99126567 A CN99126567 A CN 99126567A CN 99126567 A CN99126567 A CN 99126567A CN 1302030 A CN1302030 A CN 1302030A
- Authority
- CN
- China
- Prior art keywords
- word
- language
- speech
- meaning
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/49—Data-driven translation using very large corpora, e.g. the web
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/55—Rule-based translation
Abstract
Description
NATO | order | air strike |
subject | verb | object |
he | receive | his | doctoral | degree |
subject | verb | pron--> | adj--> | object |
词义 | 加权 |
罢工 | 40 |
袭击 | 33 |
打 | 21 |
好球 | 2 |
词义 | 加权值 |
罢工 | 40 |
袭击 | 33+10 |
打 | 21 |
好球 | 2 |
薄层 | 薄膜 | 胶片 | 影片 | 影片业 | |
成长 | 0 | 0 | 0 | 0 | 0 |
冲洗 | 1 | 2 | 7 | 0 | 0 |
发展 | 0 | 0 | 0 | 0 | 2 |
开发 | 0 | 1 | 2 | 0 | 0 |
使…成长 | 0 | 0 | 0 | 0 | 1 |
使…形成 | 0 | 0 | 0 | 0 | 0 |
Claims (12)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN99126567XA CN1302030B (zh) | 1999-12-24 | 1999-12-24 | 词义消歧的机器翻译方法和系统 |
US09/740,780 US7149681B2 (en) | 1999-12-24 | 2000-12-19 | Method, system and program product for resolving word ambiguity in text language translation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN99126567XA CN1302030B (zh) | 1999-12-24 | 1999-12-24 | 词义消歧的机器翻译方法和系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1302030A true CN1302030A (zh) | 2001-07-04 |
CN1302030B CN1302030B (zh) | 2010-04-21 |
Family
ID=5284460
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN99126567XA Expired - Lifetime CN1302030B (zh) | 1999-12-24 | 1999-12-24 | 词义消歧的机器翻译方法和系统 |
Country Status (2)
Country | Link |
---|---|
US (1) | US7149681B2 (zh) |
CN (1) | CN1302030B (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7688460B2 (en) | 2001-07-05 | 2010-03-30 | Canon Kabushiki Kaisha | Communication terminal for accessing and printing page data from links |
CN102650987A (zh) * | 2011-02-25 | 2012-08-29 | 北京百度网讯科技有限公司 | 一种基于源语言复述资源的机器翻译方法及装置 |
CN102650988A (zh) * | 2011-02-25 | 2012-08-29 | 北京百度网讯科技有限公司 | 一种基于目标语言复述资源的机器翻译方法及装置 |
CN101661462B (zh) * | 2009-07-17 | 2012-12-12 | 北京邮电大学 | 四层结构的中文文本正则化体系及实现 |
CN105159893A (zh) * | 2015-08-31 | 2015-12-16 | 小米科技有限责任公司 | 字符串保存方法及装置 |
Families Citing this family (44)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8806326B1 (en) * | 2000-09-25 | 2014-08-12 | Nokia Inc. | User preference based content linking |
US7321439B2 (en) * | 2001-07-05 | 2008-01-22 | Canon Kabushiki Kaisha | Information processing apparatus and method for processing externally transmitted data, and information processing program |
US20030074188A1 (en) * | 2001-10-12 | 2003-04-17 | Tohgo Murata | Method and apparatus for language instruction |
JP3906356B2 (ja) * | 2001-12-27 | 2007-04-18 | 独立行政法人情報通信研究機構 | 構文解析方法及び装置 |
US20040193557A1 (en) * | 2003-03-25 | 2004-09-30 | Olsen Jesse Dale | Systems and methods for reducing ambiguity of communications |
US8135575B1 (en) | 2003-08-21 | 2012-03-13 | Google Inc. | Cross-lingual indexing and information retrieval |
US20080306727A1 (en) * | 2005-03-07 | 2008-12-11 | Linguatec Sprachtechnologien Gmbh | Hybrid Machine Translation System |
JP4058057B2 (ja) * | 2005-04-26 | 2008-03-05 | 株式会社東芝 | 日中機械翻訳装置、日中機械翻訳方法および日中機械翻訳プログラム |
US9984071B2 (en) | 2006-10-10 | 2018-05-29 | Abbyy Production Llc | Language ambiguity detection of text |
US8145473B2 (en) | 2006-10-10 | 2012-03-27 | Abbyy Software Ltd. | Deep model statistics method for machine translation |
US9633005B2 (en) | 2006-10-10 | 2017-04-25 | Abbyy Infopoisk Llc | Exhaustive automatic processing of textual information |
US8214199B2 (en) * | 2006-10-10 | 2012-07-03 | Abbyy Software, Ltd. | Systems for translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US8195447B2 (en) | 2006-10-10 | 2012-06-05 | Abbyy Software Ltd. | Translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US8548795B2 (en) * | 2006-10-10 | 2013-10-01 | Abbyy Software Ltd. | Method for translating documents from one language into another using a database of translations, a terminology dictionary, a translation dictionary, and a machine translation system |
US9047275B2 (en) | 2006-10-10 | 2015-06-02 | Abbyy Infopoisk Llc | Methods and systems for alignment of parallel text corpora |
US20080086298A1 (en) * | 2006-10-10 | 2008-04-10 | Anisimovich Konstantin | Method and system for translating sentences between langauges |
US9645993B2 (en) | 2006-10-10 | 2017-05-09 | Abbyy Infopoisk Llc | Method and system for semantic searching |
US9235573B2 (en) | 2006-10-10 | 2016-01-12 | Abbyy Infopoisk Llc | Universal difference measure |
US20080091411A1 (en) * | 2006-10-12 | 2008-04-17 | Frank John Williams | Method for identifying a meaning of a word capable of identifying several meanings |
US8539349B1 (en) * | 2006-10-31 | 2013-09-17 | Hewlett-Packard Development Company, L.P. | Methods and systems for splitting a chinese character sequence into word segments |
US8244521B2 (en) * | 2007-01-11 | 2012-08-14 | Microsoft Corporation | Paraphrasing the web by search-based data collection |
US8959011B2 (en) | 2007-03-22 | 2015-02-17 | Abbyy Infopoisk Llc | Indicating and correcting errors in machine translation systems |
US20080288239A1 (en) * | 2007-05-15 | 2008-11-20 | Microsoft Corporation | Localization and internationalization of document resources |
US8812296B2 (en) | 2007-06-27 | 2014-08-19 | Abbyy Infopoisk Llc | Method and system for natural language dictionary generation |
US7890493B2 (en) * | 2007-07-20 | 2011-02-15 | Google Inc. | Translating a search query into multiple languages |
US8738353B2 (en) * | 2007-09-05 | 2014-05-27 | Modibo Soumare | Relational database method and systems for alphabet based language representation |
US20090083026A1 (en) * | 2007-09-24 | 2009-03-26 | Microsoft Corporation | Summarizing document with marked points |
US8521516B2 (en) * | 2008-03-26 | 2013-08-27 | Google Inc. | Linguistic key normalization |
US9262409B2 (en) * | 2008-08-06 | 2016-02-16 | Abbyy Infopoisk Llc | Translation of a selected text fragment of a screen |
US9323854B2 (en) * | 2008-12-19 | 2016-04-26 | Intel Corporation | Method, apparatus and system for location assisted translation |
US9588963B2 (en) * | 2009-03-18 | 2017-03-07 | Iqintell, Inc. | System and method of grouping and extracting information from data corpora |
US9063923B2 (en) * | 2009-03-18 | 2015-06-23 | Iqintell, Inc. | Method for identifying the integrity of information |
US9063931B2 (en) * | 2011-02-16 | 2015-06-23 | Ming-Yuan Wu | Multiple language translation system |
US8971630B2 (en) | 2012-04-27 | 2015-03-03 | Abbyy Development Llc | Fast CJK character recognition |
US8989485B2 (en) | 2012-04-27 | 2015-03-24 | Abbyy Development Llc | Detecting a junction in a text line of CJK characters |
US20140188456A1 (en) * | 2012-12-27 | 2014-07-03 | Abbyy Development Llc | Dictionary Markup System and Method |
RU2592395C2 (ru) | 2013-12-19 | 2016-07-20 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Разрешение семантической неоднозначности при помощи статистического анализа |
RU2586577C2 (ru) | 2014-01-15 | 2016-06-10 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Фильтрация дуг в синтаксическом графе |
RU2596600C2 (ru) | 2014-09-02 | 2016-09-10 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы обработки изображений математических выражений |
US9626358B2 (en) | 2014-11-26 | 2017-04-18 | Abbyy Infopoisk Llc | Creating ontologies by analyzing natural language texts |
US10586168B2 (en) * | 2015-10-08 | 2020-03-10 | Facebook, Inc. | Deep translations |
US10191899B2 (en) | 2016-06-06 | 2019-01-29 | Comigo Ltd. | System and method for understanding text using a translation of the text |
US11487938B2 (en) * | 2020-01-15 | 2022-11-01 | International Business Machines Corporation | Methods and systems for improving language processing for ambiguous instances |
US20230351172A1 (en) * | 2022-04-29 | 2023-11-02 | Intuit Inc. | Supervised machine learning method for matching unsupervised data |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5293584A (en) * | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
US5983169A (en) * | 1995-11-13 | 1999-11-09 | Japan Science And Technology Corporation | Method for automated translation of conjunctive phrases in natural languages |
US6470306B1 (en) * | 1996-04-23 | 2002-10-22 | Logovista Corporation | Automated translation of annotated text based on the determination of locations for inserting annotation tokens and linked ending, end-of-sentence or language tokens |
US6516321B1 (en) * | 1996-07-30 | 2003-02-04 | Carlos De La Huerga | Method for database address specification |
US6282511B1 (en) * | 1996-12-04 | 2001-08-28 | At&T | Voiced interface with hyperlinked information |
US6295542B1 (en) * | 1998-10-02 | 2001-09-25 | National Power Plc | Method and apparatus for cross-referencing text |
US6269189B1 (en) * | 1998-12-29 | 2001-07-31 | Xerox Corporation | Finding selected character strings in text and providing information relating to the selected character strings |
US6393389B1 (en) * | 1999-09-23 | 2002-05-21 | Xerox Corporation | Using ranked translation choices to obtain sequences indicating meaning of multi-token expressions |
-
1999
- 1999-12-24 CN CN99126567XA patent/CN1302030B/zh not_active Expired - Lifetime
-
2000
- 2000-12-19 US US09/740,780 patent/US7149681B2/en active Active
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7688460B2 (en) | 2001-07-05 | 2010-03-30 | Canon Kabushiki Kaisha | Communication terminal for accessing and printing page data from links |
CN101661462B (zh) * | 2009-07-17 | 2012-12-12 | 北京邮电大学 | 四层结构的中文文本正则化体系及实现 |
CN102650987A (zh) * | 2011-02-25 | 2012-08-29 | 北京百度网讯科技有限公司 | 一种基于源语言复述资源的机器翻译方法及装置 |
CN102650988A (zh) * | 2011-02-25 | 2012-08-29 | 北京百度网讯科技有限公司 | 一种基于目标语言复述资源的机器翻译方法及装置 |
CN105159893A (zh) * | 2015-08-31 | 2015-12-16 | 小米科技有限责任公司 | 字符串保存方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
US7149681B2 (en) | 2006-12-12 |
CN1302030B (zh) | 2010-04-21 |
US20010014902A1 (en) | 2001-08-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1302030B (zh) | 词义消歧的机器翻译方法和系统 | |
CN1135485C (zh) | 利用计算机系统的日文文本字的识别 | |
Xu et al. | TREC 2003 QA at BBN: Answering Definitional Questions. | |
US8285541B2 (en) | System and method for handling multiple languages in text | |
Kok et al. | Hitting the right paraphrases in good time | |
Mohammadi et al. | Building bilingual parallel corpora based on wikipedia | |
US8249855B2 (en) | Identifying parallel bilingual data over a network | |
Chen et al. | A fully unsupervised word sense disambiguation method using dependency knowledge | |
CN1102271C (zh) | 具有习惯用语处理功能的电子词典 | |
US20070011132A1 (en) | Named entity translation | |
Chen et al. | Description of the NTU System used for MET-2 | |
Gupta et al. | Punjabi language stemmer for nouns and proper names | |
US8670974B2 (en) | Acquisition of out-of-vocabulary translations by dynamically learning extraction rules | |
Jauhiainen et al. | Building web corpora for minority languages | |
CN1114165C (zh) | 中文文本中的字词分割方法 | |
Shekhar et al. | Linguistic structural framework for encoding transliteration variants for word origin detection using bilingual lexicon | |
Boulaknadel et al. | Multi-word term indexing for Arabic document retrieval | |
Bajpai et al. | Cross language information retrieval: In indian language perspective | |
Ahmed et al. | A web statistics based conflation approach to improve Arabic text retrieval | |
Awadallah et al. | Web-based multiple choice question answering for English and Arabic questions | |
Milić-Frayling | Text processing and information retrieval | |
JP5633844B2 (ja) | 言い換え関係集合取得装置、言い換え関係集合取得方法、及びプログラム | |
Debbarma et al. | Morphological Analyzer for Kokborok | |
Pimpalshende et al. | Pre-processing phase of Hindi language text summarization System | |
Malumba et al. | AfriWeb: a web search engine for a marginalized language |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
ASS | Succession or assignment of patent right |
Owner name: NEW ANST COMMUNICATION CO.,LTD. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINE CORP. Effective date: 20090911 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20090911 Address after: Massachusetts, USA Applicant after: Nuance Communications Inc Address before: American New York Applicant before: International Business Machines Corp. |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CX01 | Expiry of patent term | ||
CX01 | Expiry of patent term |
Granted publication date: 20100421 |