WO2001029697A1 - A method and system for reducing lexical ambiguity - Google Patents
A method and system for reducing lexical ambiguity Download PDFInfo
- Publication number
- WO2001029697A1 WO2001029697A1 PCT/US2000/041256 US0041256W WO0129697A1 WO 2001029697 A1 WO2001029697 A1 WO 2001029697A1 US 0041256 W US0041256 W US 0041256W WO 0129697 A1 WO0129697 A1 WO 0129697A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- lexical
- tokens
- cost
- paths
- graph
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/44—Statistical methods, e.g. probability models
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU24669/01A AU2466901A (en) | 1999-10-18 | 2000-10-17 | A method and system for reducing lexical ambiguity |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/420,517 US6721697B1 (en) | 1999-10-18 | 1999-10-18 | Method and system for reducing lexical ambiguity |
US09/420,517 | 1999-10-18 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2001029697A1 true WO2001029697A1 (en) | 2001-04-26 |
WO2001029697A9 WO2001029697A9 (en) | 2002-08-08 |
Family
ID=23666795
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2000/041256 WO2001029697A1 (en) | 1999-10-18 | 2000-10-17 | A method and system for reducing lexical ambiguity |
Country Status (3)
Country | Link |
---|---|
US (2) | US6721697B1 (US20040167771A1-20040826-M00002.png) |
AU (1) | AU2466901A (US20040167771A1-20040826-M00002.png) |
WO (1) | WO2001029697A1 (US20040167771A1-20040826-M00002.png) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004042641A2 (en) * | 2002-11-04 | 2004-05-21 | Matsushita Electric Industrial Co., Ltd. | Post-processing system and method for correcting machine recognized text |
EP1474757A1 (en) * | 2002-02-12 | 2004-11-10 | Sunflare Co. Ltd. | SYSTEM AND METHOD FOR ACCURATE GRAMMAR ANALYSIS USING A LEARNERS' MODEL AND PART-OF-SPEECH TAGGED (POST) PARSER |
EP1483686A1 (en) * | 2002-02-12 | 2004-12-08 | Sunflare Co. Ltd. | System and method for accurate grammar analysis using a part-of-speech tagged (post) parser and learners model |
WO2007082948A1 (fr) * | 2006-01-20 | 2007-07-26 | Thales | Procede et dispositif pour extraire des informations et les transformer en donnees qualitatives d'un document textuel |
CN104239294A (zh) * | 2014-09-10 | 2014-12-24 | 华建宇通科技(北京)有限责任公司 | 藏汉翻译系统的多策略藏语长句切分方法 |
Families Citing this family (97)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7379862B1 (en) * | 1999-11-19 | 2008-05-27 | Microsoft Corporation | Method and apparatus for analyzing and debugging natural language parses |
US7146349B2 (en) * | 2000-11-06 | 2006-12-05 | International Business Machines Corporation | Network for describing multimedia information |
US20020091509A1 (en) * | 2001-01-02 | 2002-07-11 | Yacov Zoarez | Method and system for translating text |
US6859771B2 (en) * | 2001-04-23 | 2005-02-22 | Microsoft Corporation | System and method for identifying base noun phrases |
JP4947861B2 (ja) * | 2001-09-25 | 2012-06-06 | キヤノン株式会社 | 自然言語処理装置およびその制御方法ならびにプログラム |
US6963832B2 (en) * | 2001-10-09 | 2005-11-08 | Hewlett-Packard Development Company, L.P. | Meaning token dictionary for automatic speech recognition |
US7080352B2 (en) * | 2002-01-30 | 2006-07-18 | Dloo, Incorporated | Method and system for creating programs using code having coupled syntactic and semantic relationships |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7493253B1 (en) | 2002-07-12 | 2009-02-17 | Language And Computing, Inc. | Conceptual world representation natural language understanding system and method |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
US20040193557A1 (en) * | 2003-03-25 | 2004-09-30 | Olsen Jesse Dale | Systems and methods for reducing ambiguity of communications |
JP3768205B2 (ja) * | 2003-05-30 | 2006-04-19 | 沖電気工業株式会社 | 形態素解析装置、形態素解析方法及び形態素解析プログラム |
US7328156B2 (en) * | 2003-07-17 | 2008-02-05 | International Business Machines Corporation | Computational linguistic statements for providing an autonomic computing environment |
US7430504B2 (en) * | 2004-03-02 | 2008-09-30 | Microsoft Corporation | Method and system for ranking words and concepts in a text using graph-based ranking |
US7908143B2 (en) * | 2004-04-28 | 2011-03-15 | International Business Machines Corporation | Dialog call-flow optimization |
KR100669241B1 (ko) * | 2004-12-15 | 2007-01-15 | 한국전자통신연구원 | 화행 정보를 이용한 대화체 음성합성 시스템 및 방법 |
FR2885712B1 (fr) * | 2005-05-12 | 2007-07-13 | Kabire Fidaali | Dispositif et procede d'analyse semantique de documents par constitution d'arbres n-aire et semantique |
US7689557B2 (en) * | 2005-06-07 | 2010-03-30 | Madan Pandit | System and method of textual information analytics |
US7991607B2 (en) * | 2005-06-27 | 2011-08-02 | Microsoft Corporation | Translation and capture architecture for output of conversational utterances |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7620549B2 (en) | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
WO2007027989A2 (en) | 2005-08-31 | 2007-03-08 | Voicebox Technologies, Inc. | Dynamic speech sharpening |
WO2007097208A1 (ja) * | 2006-02-27 | 2007-08-30 | Nec Corporation | 言語処理装置、言語処理方法および言語処理用プログラム |
US9047275B2 (en) | 2006-10-10 | 2015-06-02 | Abbyy Infopoisk Llc | Methods and systems for alignment of parallel text corpora |
US9984071B2 (en) | 2006-10-10 | 2018-05-29 | Abbyy Production Llc | Language ambiguity detection of text |
US9235573B2 (en) | 2006-10-10 | 2016-01-12 | Abbyy Infopoisk Llc | Universal difference measure |
US9633005B2 (en) | 2006-10-10 | 2017-04-25 | Abbyy Infopoisk Llc | Exhaustive automatic processing of textual information |
US8548795B2 (en) * | 2006-10-10 | 2013-10-01 | Abbyy Software Ltd. | Method for translating documents from one language into another using a database of translations, a terminology dictionary, a translation dictionary, and a machine translation system |
US9645993B2 (en) | 2006-10-10 | 2017-05-09 | Abbyy Infopoisk Llc | Method and system for semantic searching |
US8145473B2 (en) | 2006-10-10 | 2012-03-27 | Abbyy Software Ltd. | Deep model statistics method for machine translation |
US8195447B2 (en) | 2006-10-10 | 2012-06-05 | Abbyy Software Ltd. | Translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US8214199B2 (en) * | 2006-10-10 | 2012-07-03 | Abbyy Software, Ltd. | Systems for translating sentences between languages using language-independent semantic structures and ratings of syntactic constructions |
US8073681B2 (en) * | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
US11222185B2 (en) | 2006-10-26 | 2022-01-11 | Meta Platforms, Inc. | Lexicon development via shared translation database |
US9128926B2 (en) | 2006-10-26 | 2015-09-08 | Facebook, Inc. | Simultaneous translation of open domain lectures and speeches |
US8972268B2 (en) | 2008-04-15 | 2015-03-03 | Facebook, Inc. | Enhanced speech-to-speech translation system and methods for adding a new word |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US8959011B2 (en) | 2007-03-22 | 2015-02-17 | Abbyy Infopoisk Llc | Indicating and correcting errors in machine translation systems |
US9779079B2 (en) * | 2007-06-01 | 2017-10-03 | Xerox Corporation | Authoring system |
US8812296B2 (en) | 2007-06-27 | 2014-08-19 | Abbyy Infopoisk Llc | Method and system for natural language dictionary generation |
US8738353B2 (en) * | 2007-09-05 | 2014-05-27 | Modibo Soumare | Relational database method and systems for alphabet based language representation |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
JP5112116B2 (ja) * | 2008-03-07 | 2013-01-09 | 株式会社東芝 | 機械翻訳する装置、方法およびプログラム |
US9305548B2 (en) | 2008-05-27 | 2016-04-05 | Voicebox Technologies Corporation | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8738360B2 (en) | 2008-06-06 | 2014-05-27 | Apple Inc. | Data detection of a character sequence having multiple possible data types |
US9262409B2 (en) | 2008-08-06 | 2016-02-16 | Abbyy Infopoisk Llc | Translation of a selected text fragment of a screen |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US8321848B2 (en) * | 2009-04-16 | 2012-11-27 | The Mathworks, Inc. | Method and system for syntax error repair in programming languages |
US9547642B2 (en) * | 2009-06-17 | 2017-01-17 | Empire Technology Development Llc | Voice to text to voice processing |
US9201965B1 (en) | 2009-09-30 | 2015-12-01 | Cisco Technology, Inc. | System and method for providing speech recognition using personal vocabulary in a network environment |
US9171541B2 (en) | 2009-11-10 | 2015-10-27 | Voicebox Technologies Corporation | System and method for hybrid processing in a natural language voice services environment |
US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
US8935274B1 (en) * | 2010-05-12 | 2015-01-13 | Cisco Technology, Inc | System and method for deriving user expertise based on data propagating in a network environment |
US8977538B2 (en) * | 2010-09-13 | 2015-03-10 | Richard Salisbury | Constructing and analyzing a word graph |
US9465795B2 (en) | 2010-12-17 | 2016-10-11 | Cisco Technology, Inc. | System and method for providing feeds based on activity in a network environment |
US8909624B2 (en) | 2011-05-31 | 2014-12-09 | Cisco Technology, Inc. | System and method for evaluating results of a search query in a network environment |
US9721003B2 (en) * | 2011-06-20 | 2017-08-01 | Nokia Technologies Oy | Method and apparatus for providing contextual based searches |
JP5799733B2 (ja) * | 2011-10-12 | 2015-10-28 | 富士通株式会社 | 認識装置、認識プログラムおよび認識方法 |
US8909516B2 (en) * | 2011-10-27 | 2014-12-09 | Microsoft Corporation | Functionality for normalizing linguistic items |
US20150161109A1 (en) * | 2012-01-13 | 2015-06-11 | Google Inc. | Reordering words for machine translation |
CN103365834B (zh) * | 2012-03-29 | 2017-08-18 | 富泰华工业(深圳)有限公司 | 语言歧义消除系统及方法 |
US8971630B2 (en) | 2012-04-27 | 2015-03-03 | Abbyy Development Llc | Fast CJK character recognition |
US8989485B2 (en) | 2012-04-27 | 2015-03-24 | Abbyy Development Llc | Detecting a junction in a text line of CJK characters |
US9436681B1 (en) * | 2013-07-16 | 2016-09-06 | Amazon Technologies, Inc. | Natural language translation techniques |
RU2592395C2 (ru) | 2013-12-19 | 2016-07-20 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Разрешение семантической неоднозначности при помощи статистического анализа |
RU2586577C2 (ru) * | 2014-01-15 | 2016-06-10 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Фильтрация дуг в синтаксическом графе |
RU2596600C2 (ru) | 2014-09-02 | 2016-09-10 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы обработки изображений математических выражений |
WO2016044321A1 (en) | 2014-09-16 | 2016-03-24 | Min Tang | Integration of domain information into state transitions of a finite state transducer for natural language processing |
CN107003996A (zh) | 2014-09-16 | 2017-08-01 | 声钰科技 | 语音商务 |
US10810357B1 (en) * | 2014-10-15 | 2020-10-20 | Slickjump, Inc. | System and method for selection of meaningful page elements with imprecise coordinate selection for relevant information identification and browsing |
CN107003999B (zh) | 2014-10-15 | 2020-08-21 | 声钰科技 | 对用户的在先自然语言输入的后续响应的系统和方法 |
US10614799B2 (en) | 2014-11-26 | 2020-04-07 | Voicebox Technologies Corporation | System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance |
US9626358B2 (en) | 2014-11-26 | 2017-04-18 | Abbyy Infopoisk Llc | Creating ontologies by analyzing natural language texts |
US10431214B2 (en) | 2014-11-26 | 2019-10-01 | Voicebox Technologies Corporation | System and method of determining a domain and/or an action related to a natural language input |
US10586168B2 (en) | 2015-10-08 | 2020-03-10 | Facebook, Inc. | Deep translations |
US9990361B2 (en) * | 2015-10-08 | 2018-06-05 | Facebook, Inc. | Language independent representations |
WO2017176496A1 (en) * | 2016-04-08 | 2017-10-12 | Pearson Education, Inc. | System and method for automatic content aggregation generation |
US10642848B2 (en) | 2016-04-08 | 2020-05-05 | Pearson Education, Inc. | Personalized automatic content aggregation generation |
US10789316B2 (en) | 2016-04-08 | 2020-09-29 | Pearson Education, Inc. | Personalized automatic content aggregation generation |
US10325215B2 (en) | 2016-04-08 | 2019-06-18 | Pearson Education, Inc. | System and method for automatic content aggregation generation |
US20170330080A1 (en) * | 2016-05-13 | 2017-11-16 | Cognitive Scale, Inc. | Universal Cognitive Graph Architecture |
US10331788B2 (en) | 2016-06-22 | 2019-06-25 | International Business Machines Corporation | Latent ambiguity handling in natural language processing |
WO2018023106A1 (en) | 2016-07-29 | 2018-02-01 | Erik SWART | System and method of disambiguating natural language processing requests |
JP2019537103A (ja) * | 2016-09-28 | 2019-12-19 | シストラン インターナショナル カンパニー.,リミテッド.Systran International Co.,Ltd. | 文字を翻訳する方法及びその装置 |
US10169324B2 (en) * | 2016-12-08 | 2019-01-01 | Entit Software Llc | Universal lexical analyzers |
WO2018131048A1 (en) * | 2017-01-11 | 2018-07-19 | Satyanarayana Krishnamurthy | System and method for natural language generation |
CN107301170B (zh) * | 2017-06-19 | 2020-12-22 | 北京百度网讯科技有限公司 | 基于人工智能的切分语句的方法和装置 |
US20190034555A1 (en) * | 2017-07-31 | 2019-01-31 | Splunk Inc. | Translating a natural language request to a domain specific language request based on multiple interpretation algorithms |
US10901811B2 (en) | 2017-07-31 | 2021-01-26 | Splunk Inc. | Creating alerts associated with a data storage system based on natural language requests |
US11494395B2 (en) | 2017-07-31 | 2022-11-08 | Splunk Inc. | Creating dashboards for viewing data in a data storage system based on natural language requests |
US10956670B2 (en) | 2018-03-03 | 2021-03-23 | Samurai Labs Sp. Z O.O. | System and method for detecting undesirable and potentially harmful online behavior |
CN109325227A (zh) * | 2018-09-14 | 2019-02-12 | 北京字节跳动网络技术有限公司 | 用于生成修正语句的方法和装置 |
TWI665567B (zh) * | 2018-09-26 | 2019-07-11 | 華碩電腦股份有限公司 | 語意處理方法、電子裝置以及非暫態電腦可讀取記錄媒體 |
US11594213B2 (en) * | 2020-03-03 | 2023-02-28 | Rovi Guides, Inc. | Systems and methods for interpreting natural language search queries |
US11240266B1 (en) * | 2021-07-16 | 2022-02-01 | Social Safeguard, Inc. | System, device and method for detecting social engineering attacks in digital communications |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4974191A (en) * | 1987-07-31 | 1990-11-27 | Syntellect Software Inc. | Adaptive natural language computer interface system |
US5418717A (en) * | 1990-08-27 | 1995-05-23 | Su; Keh-Yih | Multiple score language processing system |
US5677835A (en) * | 1992-09-04 | 1997-10-14 | Caterpillar Inc. | Integrated authoring and translation system |
US5799268A (en) * | 1994-09-28 | 1998-08-25 | Apple Computer, Inc. | Method for extracting knowledge from online documentation and creating a glossary, index, help database or the like |
US5806021A (en) * | 1995-10-30 | 1998-09-08 | International Business Machines Corporation | Automatic segmentation of continuous text using statistical approaches |
US5873056A (en) * | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
Family Cites Families (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4905138A (en) * | 1985-10-17 | 1990-02-27 | Westinghouse Electric Corp. | Meta-interpreter |
US5083268A (en) * | 1986-10-15 | 1992-01-21 | Texas Instruments Incorporated | System and method for parsing natural language by unifying lexical features of words |
EP0361570B1 (en) * | 1988-09-15 | 1997-08-06 | Océ-Nederland B.V. | A system for grammatically processing a sentence composed in natural language |
US5111398A (en) * | 1988-11-21 | 1992-05-05 | Xerox Corporation | Processing natural language text using autonomous punctuational structure |
US5033087A (en) | 1989-03-14 | 1991-07-16 | International Business Machines Corp. | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system |
JPH02308370A (ja) | 1989-05-24 | 1990-12-21 | Toshiba Corp | 機械翻訳システム |
US5095432A (en) * | 1989-07-10 | 1992-03-10 | Harris Corporation | Data processing system implemented process and compiling technique for performing context-free parsing algorithm based on register vector grammar |
US5497319A (en) | 1990-12-31 | 1996-03-05 | Trans-Link International Corp. | Machine translation and telecommunications system |
US5477451A (en) | 1991-07-25 | 1995-12-19 | International Business Machines Corp. | Method and system for natural language translation |
US5268840A (en) * | 1992-04-30 | 1993-12-07 | Industrial Technology Research Institute | Method and system for morphologizing text |
US5528491A (en) * | 1992-08-31 | 1996-06-18 | Language Engineering Corporation | Apparatus and method for automated natural language translation |
JPH06195373A (ja) | 1992-12-24 | 1994-07-15 | Sharp Corp | 機械翻訳装置 |
ES2101613B1 (es) | 1993-02-02 | 1998-03-01 | Uribe Echebarria Diaz De Mendi | Metodo de traduccion automatica interlingual asistida por ordenador. |
US5510981A (en) | 1993-10-28 | 1996-04-23 | International Business Machines Corporation | Language translation apparatus and method using context-based translation models |
US5659765A (en) | 1994-03-15 | 1997-08-19 | Toppan Printing Co., Ltd. | Machine translation system |
US5642519A (en) * | 1994-04-29 | 1997-06-24 | Sun Microsystems, Inc. | Speech interpreter with a unified grammer compiler |
US5752052A (en) | 1994-06-24 | 1998-05-12 | Microsoft Corporation | Method and system for bootstrapping statistical processing into a rule-based natural language parser |
US5644775A (en) | 1994-08-11 | 1997-07-01 | International Business Machines Corporation | Method and system for facilitating language translation using string-formatting libraries |
US5644755A (en) * | 1995-02-24 | 1997-07-01 | Compaq Computer Corporation | Processor with virtual system mode |
EP0834139A4 (en) * | 1995-06-07 | 1998-08-05 | Int Language Engineering Corp | COMPUTER-ASSISTED TRANSLATION TOOLS |
US5680511A (en) * | 1995-06-07 | 1997-10-21 | Dragon Systems, Inc. | Systems and methods for word recognition |
JPH09128396A (ja) | 1995-11-06 | 1997-05-16 | Hitachi Ltd | 対訳辞書作成方法 |
US5983169A (en) | 1995-11-13 | 1999-11-09 | Japan Science And Technology Corporation | Method for automated translation of conjunctive phrases in natural languages |
US6161083A (en) | 1996-05-02 | 2000-12-12 | Sony Corporation | Example-based translation method and system which calculates word similarity degrees, a priori probability, and transformation probability to determine the best example for translation |
US5966686A (en) * | 1996-06-28 | 1999-10-12 | Microsoft Corporation | Method and system for computing semantic logical forms from syntax trees |
DE19636739C1 (de) | 1996-09-10 | 1997-07-03 | Siemens Ag | Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem |
US5884247A (en) | 1996-10-31 | 1999-03-16 | Dialect Corporation | Method and apparatus for automated language translation |
US6230153B1 (en) | 1998-06-18 | 2001-05-08 | International Business Machines Corporation | Association rule ranker for web site emulation |
US6285978B1 (en) * | 1998-09-24 | 2001-09-04 | International Business Machines Corporation | System and method for estimating accuracy of an automatic natural language translation |
US6173441B1 (en) * | 1998-10-16 | 2001-01-09 | Peter A. Klein | Method and system for compiling source code containing natural language instructions |
US6243669B1 (en) | 1999-01-29 | 2001-06-05 | Sony Corporation | Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation |
US6301554B1 (en) | 1999-09-23 | 2001-10-09 | Wordstream, Inc. | Language translation using a constrained grammar in the form of structured sentences formed according to pre-defined grammar templates |
US6330530B1 (en) | 1999-10-18 | 2001-12-11 | Sony Corporation | Method and system for transforming a source language linguistic structure into a target language linguistic structure based on example linguistic feature structures |
-
1999
- 1999-10-18 US US09/420,517 patent/US6721697B1/en not_active Expired - Fee Related
-
2000
- 2000-10-17 AU AU24669/01A patent/AU2466901A/en not_active Abandoned
- 2000-10-17 WO PCT/US2000/041256 patent/WO2001029697A1/en active Application Filing
-
2004
- 2004-02-17 US US10/781,223 patent/US20040167771A1/en not_active Abandoned
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4974191A (en) * | 1987-07-31 | 1990-11-27 | Syntellect Software Inc. | Adaptive natural language computer interface system |
US5418717A (en) * | 1990-08-27 | 1995-05-23 | Su; Keh-Yih | Multiple score language processing system |
US5677835A (en) * | 1992-09-04 | 1997-10-14 | Caterpillar Inc. | Integrated authoring and translation system |
US5873056A (en) * | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
US5799268A (en) * | 1994-09-28 | 1998-08-25 | Apple Computer, Inc. | Method for extracting knowledge from online documentation and creating a glossary, index, help database or the like |
US5806021A (en) * | 1995-10-30 | 1998-09-08 | International Business Machines Corporation | Automatic segmentation of continuous text using statistical approaches |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1474757A1 (en) * | 2002-02-12 | 2004-11-10 | Sunflare Co. Ltd. | SYSTEM AND METHOD FOR ACCURATE GRAMMAR ANALYSIS USING A LEARNERS' MODEL AND PART-OF-SPEECH TAGGED (POST) PARSER |
EP1483686A1 (en) * | 2002-02-12 | 2004-12-08 | Sunflare Co. Ltd. | System and method for accurate grammar analysis using a part-of-speech tagged (post) parser and learners model |
EP1474757A4 (en) * | 2002-02-12 | 2008-01-02 | Sunflare Co Ltd | SYSTEM AND METHOD FOR PRECISE GRAMMATICAL ANALYSIS USING A SYNTAXIC ANALYZER OF THE PART OF THE SPEECH MARK (POST) |
EP1483686A4 (en) * | 2002-02-12 | 2008-01-23 | Sunflare Co Ltd | SYSTEM AND METHOD FOR ACCURATE GRAMMATICAL ANALYSIS USING A PART-OF-SPEECH TAGGED (POST) ANALYZER AND A LEARNING MODEL |
WO2004042641A2 (en) * | 2002-11-04 | 2004-05-21 | Matsushita Electric Industrial Co., Ltd. | Post-processing system and method for correcting machine recognized text |
WO2004042641A3 (en) * | 2002-11-04 | 2004-08-05 | Matsushita Electric Ind Co Ltd | Post-processing system and method for correcting machine recognized text |
US7092567B2 (en) | 2002-11-04 | 2006-08-15 | Matsushita Electric Industrial Co., Ltd. | Post-processing system and method for correcting machine recognized text |
WO2007082948A1 (fr) * | 2006-01-20 | 2007-07-26 | Thales | Procede et dispositif pour extraire des informations et les transformer en donnees qualitatives d'un document textuel |
FR2896603A1 (fr) * | 2006-01-20 | 2007-07-27 | Thales Sa | Procede et dispositif pour extraire des informations et les transformer en donnees qualitatives d'un document textuel |
CN104239294A (zh) * | 2014-09-10 | 2014-12-24 | 华建宇通科技(北京)有限责任公司 | 藏汉翻译系统的多策略藏语长句切分方法 |
Also Published As
Publication number | Publication date |
---|---|
AU2466901A (en) | 2001-04-30 |
US6721697B1 (en) | 2004-04-13 |
WO2001029697A9 (en) | 2002-08-08 |
US20040167771A1 (en) | 2004-08-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6721697B1 (en) | Method and system for reducing lexical ambiguity | |
US6330530B1 (en) | Method and system for transforming a source language linguistic structure into a target language linguistic structure based on example linguistic feature structures | |
US6928448B1 (en) | System and method to match linguistic structures using thesaurus information | |
US6778949B2 (en) | Method and system to analyze, transfer and generate language expressions using compiled instructions to manipulate linguistic structures | |
US6760695B1 (en) | Automated natural language processing | |
US8374871B2 (en) | Methods for creating a phrase thesaurus | |
US6223150B1 (en) | Method and apparatus for parsing in a spoken language translation system | |
US6529865B1 (en) | System and method to compile instructions to manipulate linguistic structures into separate functions | |
US6243669B1 (en) | Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation | |
Dien et al. | Vietnamese Word Segmentation. | |
US11386269B2 (en) | Fault-tolerant information extraction | |
US20010029443A1 (en) | Machine translation system, machine translation method, and storage medium storing program for executing machine translation method | |
Kübler et al. | Part of speech tagging for Arabic | |
KR20040101678A (ko) | 복합 형태소 분석 장치 및 방법 | |
Marcińczuk et al. | Statistical proper name recognition in Polish economic texts | |
Vasiu et al. | Enhancing tokenization by embedding romanian language specific morphology | |
Đorđević et al. | Different approaches in serbian language parsing using context-free grammars | |
Okhovvat et al. | An Accurate Persian Part-of-Speech Tagger. | |
Loftsson | Tagging and parsing Icelandic text | |
Samir et al. | Training and evaluation of TreeTagger on Amazigh corpus | |
JP3698454B2 (ja) | 並列句解析装置および学習データ自動作成装置 | |
Loglo | A Lexical Dependency Probability Model for Mongolian Based on Integration of Morphological and Syntactic Features | |
EP1429257B1 (en) | Method and apparatus for recognizing multiword expressions | |
Gerbremedhin | Design and Development of Part of Speech Tagger for Ge’ ez Language Using Hybrid Approach | |
CHICHE et al. | A Hidden Markov Model-based Part of Speech Tagger for Shekki’noono |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
AK | Designated states |
Kind code of ref document: C2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: C2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
COP | Corrected version of pamphlet |
Free format text: PAGES 1/10-10/10, DRAWINGS, REPLACED BY NEW PAGES 1/11-11/11; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: JP |