JP2009500754A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2009500754A5 JP2009500754A5 JP2008520339A JP2008520339A JP2009500754A5 JP 2009500754 A5 JP2009500754 A5 JP 2009500754A5 JP 2008520339 A JP2008520339 A JP 2008520339A JP 2008520339 A JP2008520339 A JP 2008520339A JP 2009500754 A5 JP2009500754 A5 JP 2009500754A5
- Authority
- JP
- Japan
- Prior art keywords
- sentence
- word
- query
- text
- queries
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 claims 16
- 238000012937 correction Methods 0.000 claims 2
- 238000006467 substitution reaction Methods 0.000 claims 1
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11/177,136 US7574348B2 (en) | 2005-07-08 | 2005-07-08 | Processing collocation mistakes in documents |
| US11/177,136 | 2005-07-08 | ||
| PCT/US2006/026012 WO2007008492A2 (en) | 2005-07-08 | 2006-06-30 | Processing collocation mistakes in documents |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2009500754A JP2009500754A (ja) | 2009-01-08 |
| JP2009500754A5 true JP2009500754A5 (enExample) | 2009-08-06 |
| JP5362353B2 JP5362353B2 (ja) | 2013-12-11 |
Family
ID=37619276
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2008520339A Expired - Fee Related JP5362353B2 (ja) | 2005-07-08 | 2006-06-30 | 文書中のコロケーション誤りを処理すること |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US7574348B2 (enExample) |
| EP (1) | EP1899835B1 (enExample) |
| JP (1) | JP5362353B2 (enExample) |
| KR (1) | KR20080023341A (enExample) |
| CN (1) | CN101218573A (enExample) |
| AU (1) | AU2006269494A1 (enExample) |
| CA (1) | CA2614416C (enExample) |
| MX (1) | MX2008000176A (enExample) |
| NO (1) | NO20080112L (enExample) |
| WO (1) | WO2007008492A2 (enExample) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR100837750B1 (ko) * | 2006-08-25 | 2008-06-13 | 엔에이치엔(주) | 성조를 이용하여 중국어를 검색하는 방법 및 상기 방법을수행하는 시스템 |
| US7774193B2 (en) * | 2006-12-05 | 2010-08-10 | Microsoft Corporation | Proofing of word collocation errors based on a comparison with collocations in a corpus |
| US20110055209A1 (en) * | 2007-02-23 | 2011-03-03 | Anthony Novac | System and method for delivering content and advertisments |
| KR100978581B1 (ko) * | 2008-05-08 | 2010-08-27 | 엔에이치엔(주) | 웹 페이지 열람 중에 편리하게 사전 서비스를 제공하기위한 방법 및 시스템 |
| US8473278B2 (en) * | 2008-07-24 | 2013-06-25 | Educational Testing Service | Systems and methods for identifying collocation errors in text |
| US20100082324A1 (en) * | 2008-09-30 | 2010-04-01 | Microsoft Corporation | Replacing terms in machine translation |
| US8484014B2 (en) * | 2008-11-03 | 2013-07-09 | Microsoft Corporation | Retrieval using a generalized sentence collocation |
| TWI403911B (zh) * | 2008-11-28 | 2013-08-01 | Inst Information Industry | 中文辭典建置裝置和方法,以及儲存媒體 |
| US8250072B2 (en) * | 2009-03-06 | 2012-08-21 | Dmitri Asonov | Detecting real word typos |
| CN101930594B (zh) * | 2010-04-14 | 2012-05-23 | 山东山大鸥玛软件有限公司 | 一种扫描文档图像的快速纠偏方法 |
| US20110271232A1 (en) | 2010-04-30 | 2011-11-03 | Orbis Technologies, Inc. | Systems and methods for semantic search, content correlation and visualization |
| US10496714B2 (en) * | 2010-08-06 | 2019-12-03 | Google Llc | State-dependent query response |
| US9262397B2 (en) | 2010-10-08 | 2016-02-16 | Microsoft Technology Licensing, Llc | General purpose correction of grammatical and word usage errors |
| US8855997B2 (en) | 2011-07-28 | 2014-10-07 | Microsoft Corporation | Linguistic error detection |
| US9015080B2 (en) | 2012-03-16 | 2015-04-21 | Orbis Technologies, Inc. | Systems and methods for semantic inference and reasoning |
| US8484017B1 (en) | 2012-09-10 | 2013-07-09 | Google Inc. | Identifying media content |
| US20140074466A1 (en) | 2012-09-10 | 2014-03-13 | Google Inc. | Answering questions using environmental context |
| US9189531B2 (en) | 2012-11-30 | 2015-11-17 | Orbis Technologies, Inc. | Ontology harmonization and mediation systems and methods |
| CN103365838B (zh) * | 2013-07-24 | 2016-04-20 | 桂林电子科技大学 | 基于多元特征的英语作文语法错误自动纠正方法 |
| US9298695B2 (en) * | 2013-09-05 | 2016-03-29 | At&T Intellectual Property I, Lp | Method and apparatus for managing auto-correction in messaging |
| CN103678714B (zh) * | 2013-12-31 | 2017-05-10 | 北京百度网讯科技有限公司 | 实体知识库的构建方法和装置 |
| US20160087929A1 (en) * | 2014-09-24 | 2016-03-24 | Zoho Corporation Private Limited | Methods and apparatus for document creation via email |
| US10691709B2 (en) | 2015-10-28 | 2020-06-23 | Open Text Sa Ulc | System and method for subset searching and associated search operators |
| US10747815B2 (en) | 2017-05-11 | 2020-08-18 | Open Text Sa Ulc | System and method for searching chains of regions and associated search operators |
| US10241716B2 (en) | 2017-06-30 | 2019-03-26 | Microsoft Technology Licensing, Llc | Global occupancy aggregator for global garbage collection scheduling |
| WO2019006550A1 (en) | 2017-07-06 | 2019-01-10 | Open Text Sa Ulc | SYSTEM AND METHOD FOR VALUE-BASED REGION SEARCH AND RELATED SEARCH OPERATORS |
| US10824686B2 (en) | 2018-03-05 | 2020-11-03 | Open Text Sa Ulc | System and method for searching based on text blocks and associated search operators |
| US11551006B2 (en) * | 2019-09-09 | 2023-01-10 | International Business Machines Corporation | Removal of personality signatures |
| US20250086392A1 (en) * | 2023-09-08 | 2025-03-13 | Sap Se | Computer-implemented contract risk assessment platform leveraging transformers |
Family Cites Families (34)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH083815B2 (ja) * | 1985-10-25 | 1996-01-17 | 株式会社日立製作所 | 自然言語の共起関係辞書保守方法 |
| GB8625468D0 (en) | 1986-10-24 | 1987-04-15 | Smiths Industries Plc | Speech recognition apparatus |
| US4868750A (en) * | 1987-10-07 | 1989-09-19 | Houghton Mifflin Company | Collocational grammar system |
| US5251129A (en) * | 1990-08-21 | 1993-10-05 | General Electric Company | Method for automated morphological analysis of word structure |
| US5541836A (en) * | 1991-12-30 | 1996-07-30 | At&T Corp. | Word disambiguation apparatus and methods |
| US5383120A (en) * | 1992-03-02 | 1995-01-17 | General Electric Company | Method for tagging collocations in text |
| US5617488A (en) * | 1995-02-01 | 1997-04-01 | The Research Foundation Of State University Of New York | Relaxation word recognizer |
| US5887120A (en) * | 1995-05-31 | 1999-03-23 | Oracle Corporation | Method and apparatus for determining theme for discourse |
| US5680511A (en) * | 1995-06-07 | 1997-10-21 | Dragon Systems, Inc. | Systems and methods for word recognition |
| US5721938A (en) * | 1995-06-07 | 1998-02-24 | Stuckey; Barbara K. | Method and device for parsing and analyzing natural language sentences and text |
| US5907839A (en) * | 1996-07-03 | 1999-05-25 | Yeda Reseach And Development, Co., Ltd. | Algorithm for context sensitive spelling correction |
| US6173298B1 (en) * | 1996-09-17 | 2001-01-09 | Asap, Ltd. | Method and apparatus for implementing a dynamic collocation dictionary |
| CN1193779A (zh) * | 1997-03-13 | 1998-09-23 | 国际商业机器公司 | 中文语句分词方法及其在中文查错系统中的应用 |
| GB2329047A (en) * | 1997-09-05 | 1999-03-10 | Sharp Kk | A method of identifying collocates |
| KR980004126A (ko) * | 1997-12-16 | 1998-03-30 | 양승택 | 다국어 웹 문서 검색을 위한 질의어 변환 장치 및 방법 |
| GB2334115A (en) * | 1998-01-30 | 1999-08-11 | Sharp Kk | Processing text eg for approximate translation |
| US6216123B1 (en) * | 1998-06-24 | 2001-04-10 | Novell, Inc. | Method and system for rapid retrieval in a full text indexing system |
| GB9821787D0 (en) * | 1998-10-06 | 1998-12-02 | Data Limited | Apparatus for classifying or processing data |
| JP2001101186A (ja) * | 1999-09-30 | 2001-04-13 | Oki Electric Ind Co Ltd | 機械翻訳装置 |
| GB0006721D0 (en) * | 2000-03-20 | 2000-05-10 | Mitchell Thomas A | Assessment methods and systems |
| US7860706B2 (en) * | 2001-03-16 | 2010-12-28 | Eli Abir | Knowledge system method and appparatus |
| US20020152219A1 (en) * | 2001-04-16 | 2002-10-17 | Singh Monmohan L. | Data interexchange protocol |
| US7269546B2 (en) * | 2001-05-09 | 2007-09-11 | International Business Machines Corporation | System and method of finding documents related to other documents and of finding related words in response to a query to refine a search |
| US7003444B2 (en) * | 2001-07-12 | 2006-02-21 | Microsoft Corporation | Method and apparatus for improved grammar checking using a stochastic parser |
| US7246060B2 (en) * | 2001-11-06 | 2007-07-17 | Microsoft Corporation | Natural input recognition system and method using a contextual mapping engine and adaptive user bias |
| US20030154071A1 (en) * | 2002-02-11 | 2003-08-14 | Shreve Gregory M. | Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents |
| KR100530154B1 (ko) * | 2002-06-07 | 2005-11-21 | 인터내셔널 비지네스 머신즈 코포레이션 | 변환방식 기계번역시스템에서 사용되는 변환사전을생성하는 방법 및 장치 |
| US7031911B2 (en) * | 2002-06-28 | 2006-04-18 | Microsoft Corporation | System and method for automatic detection of collocation mistakes in documents |
| US7171351B2 (en) * | 2002-09-19 | 2007-01-30 | Microsoft Corporation | Method and system for retrieving hint sentences using expanded queries |
| US7249012B2 (en) * | 2002-11-20 | 2007-07-24 | Microsoft Corporation | Statistical method and apparatus for learning translation relationships among phrases |
| US7689412B2 (en) * | 2003-12-05 | 2010-03-30 | Microsoft Corporation | Synonymous collocation extraction using translation information |
| US7707039B2 (en) * | 2004-02-15 | 2010-04-27 | Exbiblio B.V. | Automatic modification of web pages |
| US20060282255A1 (en) * | 2005-06-14 | 2006-12-14 | Microsoft Corporation | Collocation translation from monolingual and available bilingual corpora |
| US20070016397A1 (en) * | 2005-07-18 | 2007-01-18 | Microsoft Corporation | Collocation translation using monolingual corpora |
-
2005
- 2005-07-08 US US11/177,136 patent/US7574348B2/en not_active Expired - Fee Related
-
2006
- 2006-06-30 KR KR1020087000528A patent/KR20080023341A/ko not_active Abandoned
- 2006-06-30 CA CA2614416A patent/CA2614416C/en not_active Expired - Fee Related
- 2006-06-30 JP JP2008520339A patent/JP5362353B2/ja not_active Expired - Fee Related
- 2006-06-30 EP EP06774479.7A patent/EP1899835B1/en not_active Not-in-force
- 2006-06-30 MX MX2008000176A patent/MX2008000176A/es not_active Application Discontinuation
- 2006-06-30 WO PCT/US2006/026012 patent/WO2007008492A2/en not_active Ceased
- 2006-06-30 AU AU2006269494A patent/AU2006269494A1/en not_active Abandoned
- 2006-06-30 CN CNA2006800248782A patent/CN101218573A/zh active Pending
-
2008
- 2008-01-08 NO NO20080112A patent/NO20080112L/no unknown
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2009500754A5 (enExample) | ||
| US8473278B2 (en) | Systems and methods for identifying collocation errors in text | |
| US8560297B2 (en) | Locating parallel word sequences in electronic documents | |
| US12141211B2 (en) | System, method, and computer program product for tokenizing document citations | |
| Bjarnadóttir | The database of modern Icelandic inflection (Beygingarlýsing íslensks nútímamáls) | |
| KR101500617B1 (ko) | 한국어 어휘 의미망을 이용한 문맥 철자오류 교정 장치 및 방법 | |
| CN103324609A (zh) | 文本校对装置和文本校对方法 | |
| Beck et al. | Representation problems in linguistic annotations: Ambiguity, variation, uncertainty, error and bias | |
| US8583415B2 (en) | Phonetic search using normalized string | |
| Näther | An in-depth comparison of 14 spelling correction tools on a common benchmark | |
| US20250363302A1 (en) | Mapping entities in unstructured text documents via entity correction and entity resolution | |
| Abdelmageed et al. | Results of semtab 2022 | |
| CN101369285B (zh) | 一种中文搜索引擎中查询词的拼写校正方法 | |
| Tang et al. | Overview of the NTCIR-9 Crosslink Task: Cross-lingual Link Discovery. | |
| Bhatti et al. | Phonetic-based sindhi spellchecker system using a hybrid model | |
| US11288451B2 (en) | Machine based expansion of contractions in text in digital media | |
| JP5285491B2 (ja) | 情報検索システム、方法及びプログラム、索引作成システム、方法及びプログラム、 | |
| Daðason | Post-correction of Icelandic OCR text | |
| Melero et al. | Holaaa!! writin like u talk is kewl but kinda hard 4 NLP | |
| Yousef et al. | Intra-language text alignment using ialigner | |
| US12399874B1 (en) | De-confliction system and method of querying a database including confusable characters | |
| JP6677158B2 (ja) | 文書データ処理装置、文書データ処理方法、及び文書データ処理プログラム | |
| Dastgheib et al. | Design and implementation of Persian spelling detection and correction system based on Semantic | |
| Pretkalniņa et al. | Making historical Latvian texts more intelligible to contemporary readers | |
| Verulkar et al. | Transliterated search of Hindi lyrics |