AU2000276398A1 - A method and apparatus for determining text passage similarity - Google Patents
A method and apparatus for determining text passage similarityInfo
- Publication number
- AU2000276398A1 AU2000276398A1 AU2000276398A AU7639800A AU2000276398A1 AU 2000276398 A1 AU2000276398 A1 AU 2000276398A1 AU 2000276398 A AU2000276398 A AU 2000276398A AU 7639800 A AU7639800 A AU 7639800A AU 2000276398 A1 AU2000276398 A1 AU 2000276398A1
- Authority
- AU
- Australia
- Prior art keywords
- text passage
- determining text
- passage similarity
- similarity
- determining
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2000/000300 WO2002029618A1 (en) | 2000-09-30 | 2000-09-30 | A method and apparatus for determining text passage similarity |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2000276398A1 true AU2000276398A1 (en) | 2002-04-15 |
Family
ID=4574713
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2000276398A Abandoned AU2000276398A1 (en) | 2000-09-30 | 2000-09-30 | A method and apparatus for determining text passage similarity |
Country Status (3)
Country | Link |
---|---|
US (3) | US7778817B1 (en) |
AU (1) | AU2000276398A1 (en) |
WO (1) | WO2002029618A1 (en) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002029618A1 (en) | 2000-09-30 | 2002-04-11 | Intel Corporation (A Corporation Of Delaware) | A method and apparatus for determining text passage similarity |
EP2071452A1 (en) * | 2007-12-07 | 2009-06-17 | Alcatel Lucent | Device and method for automatically building applications from specifications and from off-the-shelf components selected by semantic analysis |
US9092517B2 (en) * | 2008-09-23 | 2015-07-28 | Microsoft Technology Licensing, Llc | Generating synonyms based on query log data |
US9600566B2 (en) | 2010-05-14 | 2017-03-21 | Microsoft Technology Licensing, Llc | Identifying entity synonyms |
US8195458B2 (en) * | 2010-08-17 | 2012-06-05 | Xerox Corporation | Open class noun classification |
US8745019B2 (en) | 2012-03-05 | 2014-06-03 | Microsoft Corporation | Robust discovery of entity synonyms using query logs |
US10032131B2 (en) | 2012-06-20 | 2018-07-24 | Microsoft Technology Licensing, Llc | Data services for enterprises leveraging search system data assets |
US9594831B2 (en) | 2012-06-22 | 2017-03-14 | Microsoft Technology Licensing, Llc | Targeted disambiguation of named entities |
US9229924B2 (en) | 2012-08-24 | 2016-01-05 | Microsoft Technology Licensing, Llc | Word detection and domain dictionary recommendation |
US9754215B2 (en) | 2012-12-17 | 2017-09-05 | Sinoeast Concept Limited | Question classification and feature mapping in a deep question answering system |
US10706092B1 (en) | 2013-07-28 | 2020-07-07 | William S. Morriss | Error and manipulation resistant search technology |
US9514098B1 (en) * | 2013-12-09 | 2016-12-06 | Google Inc. | Iteratively learning coreference embeddings of noun phrases using feature representations that include distributed word representations of the noun phrases |
US9697099B2 (en) | 2014-06-04 | 2017-07-04 | International Business Machines Corporation | Real-time or frequent ingestion by running pipeline in order of effectiveness |
US9542496B2 (en) | 2014-06-04 | 2017-01-10 | International Business Machines Corporation | Effective ingesting data used for answering questions in a question and answer (QA) system |
US9619513B2 (en) | 2014-07-29 | 2017-04-11 | International Business Machines Corporation | Changed answer notification in a question and answer system |
JP6414967B2 (en) * | 2014-11-25 | 2018-10-31 | 日本放送協会 | Document processing apparatus and program |
US9424321B1 (en) * | 2015-04-27 | 2016-08-23 | Altep, Inc. | Conceptual document analysis and characterization |
US10169326B2 (en) | 2015-05-22 | 2019-01-01 | International Business Machines Corporation | Cognitive reminder notification mechanisms for answers to questions |
US9912736B2 (en) | 2015-05-22 | 2018-03-06 | International Business Machines Corporation | Cognitive reminder notification based on personal user profile and activity information |
US10152534B2 (en) | 2015-07-02 | 2018-12-11 | International Business Machines Corporation | Monitoring a corpus for changes to previously provided answers to questions |
US10769185B2 (en) | 2015-10-16 | 2020-09-08 | International Business Machines Corporation | Answer change notifications based on changes to user profile information |
CN109033212B (en) * | 2018-07-01 | 2021-09-07 | 上海新诤信知识产权服务股份有限公司 | Text classification method based on similarity matching |
US10831989B2 (en) | 2018-12-04 | 2020-11-10 | International Business Machines Corporation | Distributing updated communications to viewers of prior versions of the communications |
CN109658938B (en) * | 2018-12-07 | 2020-03-17 | 百度在线网络技术(北京)有限公司 | Method, device and equipment for matching voice and text and computer readable medium |
CN110674635B (en) * | 2019-09-27 | 2023-04-25 | 北京妙笔智能科技有限公司 | Method and device for dividing text paragraphs |
CA3096928A1 (en) * | 2019-10-25 | 2021-02-09 | Element Ai Inc. | Method and system for extracting information from a document |
US11392774B2 (en) | 2020-02-10 | 2022-07-19 | International Business Machines Corporation | Extracting relevant sentences from text corpus |
CN112380830B (en) * | 2020-06-18 | 2024-05-17 | 达观数据有限公司 | Matching method, system and computer readable storage medium for related sentences in different documents |
EP4202714A4 (en) * | 2020-09-27 | 2024-05-22 | Siemens Aktiengesellschaft | Text similarity determination method and apparatus and industrial diagnosis method and system |
CN115828931B (en) * | 2023-02-09 | 2023-05-02 | 中南大学 | Chinese and English semantic similarity calculation method for paragraph level text |
CN116188125B (en) * | 2023-03-10 | 2024-05-31 | 深圳市伙伴行网络科技有限公司 | Business invitation management method and device for office building, electronic equipment and storage medium |
CN116578673B (en) * | 2023-07-03 | 2024-02-09 | 北京凌霄文苑教育科技有限公司 | Text feature retrieval method based on linguistic logics in digital economy field |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2943447B2 (en) * | 1991-01-30 | 1999-08-30 | 三菱電機株式会社 | Text information extraction device, text similarity matching device, text search system, text information extraction method, text similarity matching method, and question analysis device |
ATE260486T1 (en) * | 1992-07-31 | 2004-03-15 | Ibm | FINDING CHARACTERS IN A DATABASE OF CHARACTERS |
EP0590173A1 (en) * | 1992-09-28 | 1994-04-06 | International Business Machines Corporation | Computer system for speech recognition |
JP3724847B2 (en) * | 1995-06-05 | 2005-12-07 | 株式会社日立製作所 | Structured document difference extraction method and apparatus |
KR19990087167A (en) * | 1996-12-24 | 1999-12-15 | 롤페스 요하네스 게라투스 알베르투스 | Methods of training a speech recognition system and devices implementing the method, in particular portable telephone devices |
WO2002029618A1 (en) | 2000-09-30 | 2002-04-11 | Intel Corporation (A Corporation Of Delaware) | A method and apparatus for determining text passage similarity |
US7295965B2 (en) * | 2001-06-29 | 2007-11-13 | Honeywell International Inc. | Method and apparatus for determining a measure of similarity between natural language sentences |
-
2000
- 2000-09-30 WO PCT/CN2000/000300 patent/WO2002029618A1/en active Application Filing
- 2000-09-30 US US10/130,858 patent/US7778817B1/en active Active
- 2000-09-30 AU AU2000276398A patent/AU2000276398A1/en not_active Abandoned
-
2010
- 2010-08-09 US US12/853,034 patent/US8117025B2/en not_active Expired - Fee Related
-
2012
- 2012-01-23 US US13/356,000 patent/US8650025B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
US20100332219A1 (en) | 2010-12-30 |
US20120123768A1 (en) | 2012-05-17 |
US8650025B2 (en) | 2014-02-11 |
US7778817B1 (en) | 2010-08-17 |
WO2002029618A1 (en) | 2002-04-11 |
US8117025B2 (en) | 2012-02-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2000276398A1 (en) | A method and apparatus for determining text passage similarity | |
AU2001238486A1 (en) | Method and apparatus for conducting or facilitating a promotion | |
AU2001255806A1 (en) | A method and device for forming a semantic description | |
AU2002232552A1 (en) | Method and apparatus for alphanumeric recognition | |
AU2001275422A1 (en) | Method and system for text analysis | |
GB0026353D0 (en) | Apparatus and a method for facilitating searching | |
AU2001224723A1 (en) | Method and apparatus for automatically selecting a rule | |
AU2002246524A1 (en) | Method and apparatus for providing location information | |
AU2001264698A1 (en) | Method and apparatus for providing customized information | |
AU2001284328A1 (en) | Information processing apparatus and method | |
AU2001243298A1 (en) | Apparatus and method for determining position | |
AU2002213933A1 (en) | Document search and analysing method and apparatus | |
AU2002236614A1 (en) | Method and apparatus for analyzing affect and emotion in text | |
AU2001261446A1 (en) | Method and apparatus for a portable information agent | |
AU2001291909A1 (en) | A method and a system for recognizing a melody | |
AU2001264771A1 (en) | Well reference apparatus and method | |
GB0127027D0 (en) | Method and apparatus for inputting information | |
AU2001261661A1 (en) | Method and apparatus for pressure sensing | |
AU7458301A (en) | Information processing device and processing method | |
AU2001278004A1 (en) | Internet information retrieval method and apparatus | |
AU2001266948A1 (en) | Apparatus and method for providing sequence database comparison | |
AU2001277270A1 (en) | Method and apparatus for a morphology-preserving smoothing | |
AU2000263270A1 (en) | Apparatus and a method for supplying information | |
AU2001256982A1 (en) | Apparatus and method for a vertically integrated construction business | |
AU2002214476A1 (en) | Method and device for speech analysis |