WO2001001289A8 - Semantic processor and method with knowledge analysis of and extraction from natural language documents

Semantic processor and method with knowledge analysis of and extraction from natural language documents

Info

Publication number
WO2001001289A8
Authority
WO
WIPO (PCT)
Prior art keywords
sao
storing
natural language
extractions
association
Prior art date
Application number
PCT/US2000/017444
Other languages
French (fr)
Other versions
WO2001001289A1 (en)
Inventor
Valery Tsourikov
Leonid Batchilo
Igor Sovpel
Original Assignee
Inv Machine Corp Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inv Machine Corp Inc
Priority to AU56370/00A (AU5637000A)
Priority to EP00941702A (EP1208457A1)
Publication of WO2001001289A1
Publication of WO2001001289A8

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/216 Parsing using statistical methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/232 Orthographic correction, e.g. spell checking or vowelisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method of semantically processing natural language representations in a general-purpose computer, including: retrieving from remote and local databases (12, 14) and storing representations of the texts of a plurality of natural language documents; formatting said representations and storing the formatted representation (18); identifying and extracting from the formatted representation subject-action-object (SAO) extractions and storing the SAO extractions (20); processing the SAO extractions into normalized SAO structures and storing the SAO structures (22); designating the AO portions as substantially the names of Folders of at least some of the SAO structures; and storing in association with each Folder name the identity of one or more subject portions (S1, S2, ... Sn) that are associated with the respective AO portion of stored SAO structures. The method further includes storing in association with each respective subject (S1, S2, ... Sn) the full sentence in which the respective SAO appears and highlighting each S-A-O portion that appears in each said full sentence. The list of subjects (S1, S2, ... Sn) stored in association with a respective AO portion is displayed in response to the user selecting the displayed AO portion or Folder name. If desired, the retrieved and processed documents can relate to a user-entered criterion.
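
The folder organisation described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `SAO` class, the `normalize`, `build_folders` and `highlight` helpers, and the sample sentences are all hypothetical names and data introduced here for illustration. The sketch assumes the SAO triples have already been extracted, uses lowercasing as a stand-in for normalization, and marks highlighted portions with square brackets.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class SAO:
    """One subject-action-object extraction plus its source sentence."""
    subject: str
    action: str
    obj: str
    sentence: str


def normalize(text: str) -> str:
    # Placeholder normalization (assumption): lowercase, collapse whitespace.
    return " ".join(text.lower().split())


def build_folders(saos):
    """Use each action-object (AO) pair as a folder name and store, per
    folder, the subjects and the full sentences they appeared in."""
    folders = defaultdict(dict)
    for sao in saos:
        folder_name = f"{normalize(sao.action)} {normalize(sao.obj)}"
        subjects = folders[folder_name]
        subjects.setdefault(normalize(sao.subject), []).append(sao.sentence)
    return dict(folders)


def highlight(sao: SAO) -> str:
    """Bracket the first occurrence of the S, A and O portions in the sentence."""
    marked = sao.sentence
    for part in (sao.subject, sao.action, sao.obj):
        marked = marked.replace(part, f"[{part}]", 1)
    return marked


# Hypothetical sample data, not taken from the patent.
SAMPLE = [
    SAO("laser beam", "heats", "metal surface",
        "A laser beam heats the metal surface."),
    SAO("induction coil", "heats", "metal surface",
        "The induction coil heats the metal surface quickly."),
    SAO("fan", "cools", "processor", "A fan cools the processor."),
]

folders = build_folders(SAMPLE)
# Selecting a folder name lists the subjects S1..Sn stored under it.
print(list(folders["heats metal surface"]))  # ['laser beam', 'induction coil']
print(highlight(SAMPLE[0]))  # A [laser beam] [heats] the [metal surface].
```

Keying the index on the AO pair reflects the abstract's idea that a user selects an action-object "Folder" and is shown every subject that performs it, together with the originating sentences.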
PCT/US2000/017444 1999-06-30 2000-06-23 Semantic processor and method with knowledge analysis of and extraction from natural language documents WO2001001289A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU56370/00A AU5637000A (en) 1999-06-30 2000-06-23 Semantic processor and method with knowledge analysis of and extraction from natural language documents
EP00941702A EP1208457A1 (en) 1999-06-30 2000-06-23 Semantic processor and method with knowledge analysis of and extraction from natural language documents

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US34554799A 1999-06-30 1999-06-30
US09/345,547 1999-06-30

Publications (2)

Publication Number Publication Date
WO2001001289A1 (en) 2001-01-04
WO2001001289A8 (en) 2001-06-21

Family

Family ID: 23355462

Family Applications (1)

Application Number Priority Date Filing Date Title
PCT/US2000/017444 WO2001001289A1 (en) 1999-06-30 2000-06-23 Semantic processor and method with knowledge analysis of and extraction from natural language documents

Country Status (3)

Country Link
EP (1) EP1208457A1 (en)
AU (1) AU5637000A (en)
WO (1) WO2001001289A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8874427B2 (en) 2004-03-05 2014-10-28 Sdl Enterprise Technologies, Inc. In-context exact (ICE) matching
US8935150B2 (en) 2009-03-02 2015-01-13 Sdl Plc Dynamic generation of auto-suggest dictionary for natural language translation
US9400786B2 (en) 2006-09-21 2016-07-26 Sdl Plc Computer-implemented method, computer software and apparatus for use in a translation system
US9600472B2 (en) 1999-09-17 2017-03-21 Sdl Inc. E-services translation utilizing machine translation and translation memory

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1351156A1 (en) * 2002-03-14 2003-10-08 Universita' Degli Studi di Firenze System and method for automatically performing functional analyses of technical texts
US20040001099A1 (en) * 2002-06-27 2004-01-01 Microsoft Corporation Method and system for associating actions with semantic labels in electronic documents
GB2417103A (en) * 2004-08-11 2006-02-15 Sdl Plc Natural language translation system
US9128929B2 (en) 2011-01-14 2015-09-08 Sdl Language Technologies Systems and methods for automatically estimating a translation time including preparation time in addition to the translation itself
ITTO20120303A1 (en) * 2012-04-05 2012-07-05 Wolf S R L Dr METHOD AND SYSTEM FOR CARRYING OUT ANALYSIS AND AUTOMATIC COMPARISON OF PATENTS AND TECHNICAL DESCRIPTIONS.
US9965461B2 (en) 2013-03-01 2018-05-08 The Software Shop, Inc. Systems and methods for improving the efficiency of syntactic and semantic analysis in automated processes for natural language understanding using argument ordering
US10318636B2 (en) * 2016-10-30 2019-06-11 Wipro Limited Method and system for determining action items using neural networks from knowledge base for execution of operations
EP3619700A4 (en) * 2017-05-05 2020-10-14 Midmore, Roger Interactive story system using four-valued logic
US10635863B2 (en) 2017-10-30 2020-04-28 Sdl Inc. Fragment recall and adaptive automated translation
US10817676B2 (en) 2017-12-27 2020-10-27 Sdl Inc. Intelligent routing services and systems
US11256867B2 (en) 2018-10-09 2022-02-22 Sdl Inc. Systems and methods of machine learning for digital assets and message creation
CN109918640B (en) * 2018-12-22 2023-05-02 浙江工商大学 Chinese text proofreading method based on knowledge graph

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4829423A (en) * 1983-01-28 1989-05-09 Texas Instruments Incorporated Menu-based natural language understanding system
US5696916A (en) * 1985-03-27 1997-12-09 Hitachi, Ltd. Information storage and retrieval system and display method therefor
US4864502A (en) * 1987-10-07 1989-09-05 Houghton Mifflin Company Sentence analyzer
US5559940A (en) * 1990-12-14 1996-09-24 Hutson; William H. Method and system for real-time information analysis of textual material
US5369575A (en) * 1992-05-15 1994-11-29 International Business Machines Corporation Constrained natural language interface for a computer system
US5331556A (en) * 1993-06-28 1994-07-19 General Electric Company Method for natural language data processing using morphological and part-of-speech information
US5873056A (en) * 1993-10-12 1999-02-16 The Syracuse University Natural language processing system for semantic vector representation which accounts for lexical ambiguity
US5692176A (en) * 1993-11-22 1997-11-25 Reed Elsevier Inc. Associative text search and retrieval system
US5799268A (en) * 1994-09-28 1998-08-25 Apple Computer, Inc. Method for extracting knowledge from online documentation and creating a glossary, index, help database or the like
US5873076A (en) * 1995-09-15 1999-02-16 Infonautics Corporation Architecture for processing search queries, retrieving documents identified thereby, and method for using same

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9600472B2 (en) 1999-09-17 2017-03-21 Sdl Inc. E-services translation utilizing machine translation and translation memory
US8874427B2 (en) 2004-03-05 2014-10-28 Sdl Enterprise Technologies, Inc. In-context exact (ICE) matching
US9342506B2 (en) 2004-03-05 2016-05-17 Sdl Inc. In-context exact (ICE) matching
US9400786B2 (en) 2006-09-21 2016-07-26 Sdl Plc Computer-implemented method, computer software and apparatus for use in a translation system
US8935150B2 (en) 2009-03-02 2015-01-13 Sdl Plc Dynamic generation of auto-suggest dictionary for natural language translation
US9262403B2 (en) 2009-03-02 2016-02-16 Sdl Plc Dynamic generation of auto-suggest dictionary for natural language translation

Also Published As

Publication number Publication date
EP1208457A1 (en) 2002-05-29
WO2001001289A1 (en) 2001-01-04
AU5637000A (en) 2001-01-31

Similar Documents

Publication Publication Date Title
WO2001001289A8 (en) Semantic processor and method with knowledge analysis of and extraction from natural language documents
Tangherlini et al. Trawling in the Sea of the Great Unread: Sub-corpus topic modeling and Humanities research
JP3254642B2 (en) How to display the index
US6128635A (en) Document display system and electronic dictionary
EP1124189A1 (en) Document sorting method, document sorter, and recorded medium on which document sorting program is recorded
US20070005649A1 (en) Contextual title extraction
JPH07160727A (en) Electronic manual display method
DE112005001314T5 (en) Portable electronic device with text disambiguation
DE112005002060T5 (en) Portable electronic device with text disambiguation
Sinclair Prospects for automatic lexicography
Blake Traditional African values and the right to communicate
CN101082910B (en) Sentence display control device and method
Minugh You people use such weird expressions: The Frequency of Idioms in Newspaper CDs as Corpora
JP3682535B2 (en) Document difference detection apparatus and program
JPH06348756A (en) Index preparing device and index utilizing device
Rokaya et al. Building a multi-lingual field association terms dictionary
JPH08314974A (en) Automatic key work extracting device and document retrieving device
Maniez The use of electronic corpora and lexical frequency data in solving translation problems
JPH022458A (en) Similar document retrieving device
De Vorsey et al. The development of a local thesaurus to improve access to the anthropological collections of the American Museum of Natural History
JP2005141630A (en) Translation support dictionary apparatus
Kibbee 16th-century bilingual dictionaries (French-English): Organization and access, then and now
JPH0944504A (en) Translating method and machine translating device
Boachie-Danquah Human resource capacity building in Ghana's local government
Myers Les Murray, The Peasant Mandarin: Prose Pieces

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 2004-01-01)
AK Designated states

Kind code of ref document: C1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

CFP Corrected version of a pamphlet front page

Free format text: REVISED ABSTRACT RECEIVED BY THE INTERNATIONAL BUREAU AFTER COMPLETION OF THE TECHNICAL PREPARATIONS FOR INTERNATIONAL PUBLICATION

WWE WIPO information: entry into national phase

Ref document number: 2000941702

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP WIPO information: published in national office

Ref document number: 2000941702

Country of ref document: EP

WWW WIPO information: withdrawn in national office

Ref document number: 2000941702

Country of ref document: EP

NENP Non-entry into the national phase in:

Ref country code: JP