WO2011051970A3 - Method and system for obtaining semantically valid chunks for natural language applications

Info

Publication number
WO2011051970A3
Authority
WO
WIPO (PCT)
Prior art keywords
predicates
natural language
objects
query
semantically valid
Prior art date
Application number
PCT/IN2010/000693
Other languages
French (fr)
Other versions
WO2011051970A2 (en)
Inventor
Shailly Goyal
Shefali Bhat
Shailja Gulati
Chandrasekhar Anantaram
Original Assignee
Tata Consultancy Services Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tata Consultancy Services Ltd.
Publication of WO2011051970A2
Publication of WO2011051970A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3344 Query execution using natural language analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis

Abstract

A method and system for obtaining semantically valid chunks for natural language applications are disclosed. The method includes the following steps: identifying predicates, objects and comparison operators in a natural language query; binding the identified predicates and objects using the identified comparison operators; replacing all occurrences of string comparator operators in the natural language query with the corresponding mathematical operators; binding predicates and objects of the same data type; checking the compatibility of the bound predicates and objects using a domain ontology; binding string objects to their compatible predicates using the domain ontology; forming constraint predicate sets from the remaining predicates of the query in order to find semantically valid chunk sets from the natural language query; syntactically parsing the natural language query to resolve ambiguities; and determining the depth between any two predicates using the domain ontology, thereby providing a syntactically and semantically valid chunk set adapted to be used as a query.
PCT/IN2010/000693 2009-10-28 2010-10-27 Method and system for obtaining semantically valid chunks for natural language applications WO2011051970A2 (en)
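
Three of the steps named in the abstract (replacing comparison phrases with mathematical operators, binding predicates and objects of the same data type, and determining the depth between two predicates in a domain ontology) can be illustrated with a minimal sketch. It is not the claimed implementation: the phrase table, the toy ontology, and the function names `replace_comparators`, `bind_numeric`, and `depth` are all assumptions made up for this example, and the "nearest preceding predicate" binding rule is a simplification.

```python
import re

# Hypothetical phrase-to-operator table (an assumption, not from the patent).
COMPARATORS = {
    "greater than": ">",
    "more than": ">",
    "less than": "<",
    "at least": ">=",
    "at most": "<=",
    "equal to": "=",
}

# Hypothetical toy ontology: predicate -> (data type, parent concept).
ONTOLOGY = {
    "employee": ("concept", None),
    "project": ("concept", None),
    "salary": ("number", "employee"),
    "age": ("number", "employee"),
    "name": ("string", "employee"),
    "budget": ("number", "project"),
}


def replace_comparators(query):
    """Replace textual comparison phrases with mathematical operators."""
    for phrase, symbol in COMPARATORS.items():
        pattern = rf"\b{re.escape(phrase)}\b"
        query = re.sub(pattern, symbol, query, flags=re.IGNORECASE)
    return query


def bind_numeric(query):
    """Bind each '<operator> <number>' pair to the nearest preceding
    predicate whose ontology data type is 'number'."""
    tokens = replace_comparators(query).lower().split()
    operators = set(COMPARATORS.values())
    bindings = []
    last_numeric_predicate = None
    for i, tok in enumerate(tokens):
        if tok in ONTOLOGY and ONTOLOGY[tok][0] == "number":
            last_numeric_predicate = tok
        elif tok in operators and i + 1 < len(tokens):
            try:
                value = float(tokens[i + 1])
            except ValueError:
                continue  # the object is not numeric, so no type match
            if last_numeric_predicate is not None:
                bindings.append((last_numeric_predicate, tok, value))
    return bindings


def ancestors(predicate):
    """Return the chain from a predicate up to its root concept."""
    chain = [predicate]
    while ONTOLOGY[chain[-1]][1] is not None:
        chain.append(ONTOLOGY[chain[-1]][1])
    return chain


def depth(a, b):
    """Count ontology edges between two predicates via their nearest
    common ancestor; return None if they share no ancestor."""
    chain_a, chain_b = ancestors(a), ancestors(b)
    for i, node in enumerate(chain_a):
        if node in chain_b:
            return i + chain_b.index(node)
    return None


if __name__ == "__main__":
    q = "employees with salary greater than 50000 and age less than 30"
    print(replace_comparators(q))
    print(bind_numeric(q))
    print(depth("salary", "age"), depth("salary", "budget"))
```

In this sketch a small depth suggests that two predicates belong to the same concept and could plausibly sit in the same chunk, while `None` marks predicates with no shared ancestor; how the patented method actually uses the depth value is not stated in the abstract.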

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN2501MU2009 2009-10-28
IN2501/MUM/2009 2009-10-28

Publications (2)

Publication Number Publication Date
WO2011051970A2 (en) 2011-05-05
WO2011051970A3 (en) 2011-07-07

Family

ID=43922729

Family Applications (1)

Application Number Publication Priority Date Filing Date Title
PCT/IN2010/000693 WO2011051970A2 (en) 2009-10-28 2010-10-27 Method and system for obtaining semantically valid chunks for natural language applications

Country Status (1)

Country Link
WO (1) WO2011051970A2 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9372924B2 (en) 2012-06-12 2016-06-21 International Business Machines Corporation Ontology driven dictionary generation and ambiguity resolution for natural language processing
US10303763B2 (en) 2017-01-06 2019-05-28 International Business Machines Corporation Process for identifying completion of domain adaptation dictionary activities
KR102209786B1 (en) * 2018-06-29 2021-01-29 김태정 Method and apparatus for constructing chunk based on natural language processing
CN112749548A (en) * 2020-11-02 2021-05-04 万齐智 Rule-based Chinese structured financial event default completion extraction method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998025217A1 (en) * 1996-12-04 1998-06-11 Quarterdeck Corporation Method and apparatus for natural language querying and semantic searching of an information database
CN1255213A (en) * 1997-03-04 2000-05-31 石仓博 Language analysis system and method
US6947923B2 (en) * 2000-12-08 2005-09-20 Electronics And Telecommunications Research Institute Information generation and retrieval method based on standardized format of sentence structure and semantic structure and system using the same
US20090070311A1 (en) * 2007-09-07 2009-03-12 At&T Corp. System and method using a discriminative learning approach for question answering
CN101398835A (en) * 2007-09-30 2009-04-01 日电(中国)有限公司 Service selecting system and method, and service enquiring system and method based on natural language

Also Published As

Publication number Publication date
WO2011051970A2 (en) 2011-05-05

Similar Documents

Publication Publication Date Title
WO2008042563A3 (en) Apparatus and method for searching reports
WO2007126996A3 (en) System and methods for enhanced metadata entry
WO2014015168A3 (en) Just-in-time distributed video cache
IN2013DE00589A (en)
WO2007106858A3 (en) System, method, and computer program product for data mining and automatically generating hypotheses from data repositories
WO2008070362A3 (en) System and method for converting a natural language query into a logical query
WO2012070840A3 (en) Apparatus and method for consensus search
WO2013006422A3 (en) Systems and methods for creating an annotation from a document
WO2012173886A3 (en) Method for parsing, searching and formatting of text input for visual mapping of knowledge information
WO2008088721A3 (en) Querying data and an associated ontology in a database management system
WO2007115078A3 (en) System and method for generating homogeneous metadata from pre-existing metadata
WO2011077300A3 (en) Processing of geological data
WO2007098320A3 (en) Apparatus and method for federated querying of unstructured data
WO2011156731A3 (en) Query pipeline
WO2011063036A3 (en) Systems and methods for accessing web pages using natural language
WO2013181588A3 (en) Defining and mapping application interface semantics
GB201209093D0 (en) Method of searching for document data files based on keywords, and computer system and computer program thereof
WO2013025624A3 (en) Searching encrypted electronic books
WO2012126015A3 (en) Xbrl database mapping system and method
BR112013016993A2 (en) Method and apparatus for updating a database on a receiving device
EP2506151A4 (en) Semantic syntax tree kernel-based processing system and method for automatically extracting semantic correlations between scientific and technological core entities
EP2688000A4 (en) Data deduplication method and device
WO2010062737A3 (en) Retrieval using a generalized sentence collocation
MX2012011904A (en) System and method for subject identification from free format data sources.
GB2482089A (en) System and method for storage and retrieval of electronic documents

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 10826238; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 EP: PCT application non-entry in European phase. Ref document number: 10826238; Country of ref document: EP; Kind code of ref document: A2