US20170068725A1 - Analogy Finder - Google Patents

Analogy Finder Download PDF

Info

Publication number
US20170068725A1
US20170068725A1 US15/355,418 US201615355418A US2017068725A1 US 20170068725 A1 US20170068725 A1 US 20170068725A1 US 201615355418 A US201615355418 A US 201615355418A US 2017068725 A1 US2017068725 A1 US 2017068725A1
Authority
US
United States
Prior art keywords
query
predicate
data
subject
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/355,418
Inventor
Anthony McCaffrey
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Massachusetts UMass
Original Assignee
University of Massachusetts UMass
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Massachusetts UMass filed Critical University of Massachusetts UMass
Priority to US15/355,418 priority Critical patent/US20170068725A1/en
Assigned to UNIVERSITY OF MASSACHUSETTS reassignment UNIVERSITY OF MASSACHUSETTS ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MCCAFFREY, ANTHONY
Assigned to INNOVATION ACCELERATOR, INC. reassignment INNOVATION ACCELERATOR, INC. LICENSE (SEE DOCUMENT FOR DETAILS). Assignors: UNIVERSITY OF MASSACHUSETTS
Publication of US20170068725A1 publication Critical patent/US20170068725A1/en
Priority to US15/717,521 priority patent/US20180025074A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06F17/30654
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/338Presentation of query results
    • G06F17/2775
    • G06F17/2795
    • G06F17/30696
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking

Definitions

  • Design fixation is the tendency to fixate on the features of known solutions when trying to create novel solutions (Jansson & Smith, 1991). For example, a subject who is shown an existing chair and then asked to design an improved chair is likely to fixate on features of the existing chair when attempting to design an improved chair. Such fixation can lead the subject to overlook features that would be useful to include in an improved chair, but which are lacking in the existing chair.
  • a user provides a query that includes at least two of a subject, a predicate, and an object.
  • a computer system identifies synonyms of one or more of the subject, predicate, and object, and forms new queries from the identified synonyms.
  • the system searches a dataset using the new queries, and possibly also using the user-provided query, to produce search results.
  • the system may process the search results, such as by filtering and/or sorting them.
  • the system provides output representing the search results to the user.
  • the user may use the search result output to identify answers that are analogous to answers to the query originally provided by the user.
  • one embodiment of the present invention is directed to a method performed by at least one computer processor.
  • the method includes: (A) obtaining predicate data representing a predicate; (B) obtaining object data representing an object of the predicate; (C) identifying a synonym of the predicate; (D) identifying a synonym of the object; (E) forming a query from the synonym of the predicate and the synonym of the object; and (F) searching a dataset using the query to identify a subset of the dataset.
  • FIG. 1 is a dataflow diagram of a system for assisting in overcoming design fixation by searching a dataset using queries that are analogous to a query specified by a user according to one embodiment of the present invention
  • FIG. 2 is a flowchart of a method performed by the system of FIG. 1 according to one embodiment of the present invention.
  • Embodiments of the present invention may be used to alleviate design fixation and to promote analogy transfer in a variety of ways.
  • embodiments of the present invention may be used to obtain a query taking the form of “subject predicate object” or “predicate object” (where the subject is a first noun phrase, the predicate is a verb phrase, and the object is a second noun phrase); obtain one or more synonyms (e.g., hypernyms and/or hyponyms) of one or more of the subject, predicate, and object; form modified queries using combinations of the subject, predicate, object and/or their synonyms; search a dataset using the modified queries; and provide the results of the search to the user.
  • synonyms e.g., hypernyms and/or hyponyms
  • Such techniques may be used to search for and obtain search results representing analogs to the search results that would have been found using the original query provided by the user.
  • Embodiments of the present invention may obtain a synonym of “reduce,” such as “minimize,” and form a first revised query from this synonym and the original object, e.g., “minimize vibrations.”
  • embodiments of the present invention may obtain a synonym of “vibrations,” such as “oscillations,” and form a second revised query from this synonym and the original predicate, e.g., “reduce oscillations.”
  • embodiments of the present invention may form a third revised query from the synonym of the predicate and the synonym of the object, e.g.
  • Embodiments of the present invention may then search a dataset using some or all of the first, second, and third revised queries, and possibly also using the original query submitted by the user. Embodiments of the present invention may then provide the results of some or all of such searches to the user. Similar techniques may be applied to queries of the form “subject predicate object,” such as “fluid reduces vibrations.”
  • any of the techniques disclosed herein which obtain synonyms of a particular term, phrase, or query as a whole, may be used to obtain antonyms of the particular term, phrase, or query as a whole.
  • the techniques disclosed herein may be used to obtain an antonym of the predicate (e.g., “reduce”) and an antonym of an object (e.g., “vibrations”) in a query to produce a first revised query from the two synonyms, e.g., “increase stability.”
  • the techniques disclosed herein may be used to obtain an antonym of the subject (e.g., “fluid”) and an antonym of a predicate (e.g., “reduces”) in a query to produce a second revised query from the two antonyms, e.g., “solids increase.”
  • the techniques disclosed herein may be used to obtain an antonym of the subject (e.g., “fluid”) and an antonym of an object (e
  • Search results produced using the techniques summarized above are likely to include data records (such as web pages, patents, or other documents) that contain information related not only to the result described explicitly by the user-submitted query (i.e., reducing vibrations), but also data records that contain information related to results that are analogous to the result described by the user-submitted query.
  • the techniques summarized above therefore, may be used to assist the user in identifying analogous solutions to the problem described by the user-submitted query.
  • FIG. 1 a dataflow diagram is shown of a system 100 for finding data that is analogous to the answer to a query provided by a user according to one embodiment of the present invention.
  • FIG. 2 a flowchart is shown of a method 200 that is performed by the system 100 of FIG. 1 according to one embodiment of the present invention.
  • a user 102 provides a query 104 to the system 100 ( FIG. 2 , operation 202 ).
  • the query 104 is referred to herein as the “original query” to distinguish it from other queries referred to herein.
  • the original query 104 may take a variety of forms.
  • the query 104 is shown as containing subject data 106 a representing a subject, predicate data 106 b representing a predicate, and object data 106 c representing an object.
  • the subject data 106 a may be the text “fluid”
  • the predicate data may be the text “reduces”
  • the object data 106 c may be the text “vibrations.”
  • the subject represented by the subject data 106 a is a subject of the predicate represented by the predicate data 106 b
  • the object represented by the object data 106 c is an object of the predicate represented by the predicate data 106 b.
  • the subject data 106 a may, for example, be or represent a first noun phrase containing one or more words (also signified herein as “nounPhrase1”).
  • the predicate data 106 b may, for example, be or represent a verb phrase containing one or more words (also signified herein as “verbPhrase”).
  • the term “predicate” as used herein is synonymous with “verb phrase” and is not limited to any particular type of verb phrase.
  • the object data 106 c may, for example, be or represent a second noun phrase containing one or more words (also signified herein as “nounPhrase2”).
  • Any of these three phrases may be expressed in any natural language, artificial language, or any combination thereof. Any of these three phrases may be singular or plural (e.g., “fluid” or “fluids”), be conjugated in any manner (e.g., “reduce” or “reduces”), and be expressed in any tense (e.g., “reduce,” “reduced,” or “reducing”).
  • the system 100 may perform any of a variety of pre-processing on any of these three phrases, such as any kind of stemming, to produce modified versions of one or more of the three phrases, which may then be used in addition to or instead of the original versions of the three phrases in any of the processing disclosed herein.
  • the query 104 need not include all three of the subject data 106 a , predicate data 106 b , and object data 106 c .
  • the query 104 may include the subject data 106 a and predicate data 106 b , but not the object data 106 c (e.g., “fluid reduces”).
  • the query 104 may include the predicate data 106 b and the object data 106 c , but not the subject data 106 a (e.g., “reduce vibrations”).
  • the query 104 may include any two of the subject data 106 a , the predicate data 106 b , and the object data 106 c .
  • Each of the subject data 106 a , the predicate data 106 b , and the object data 106 c may represent a corresponding part of a sentence. More specifically, the subject data 106 a may represent a subject of a sentence, the predicate data 106 b may represent a predicate of the sentence, and the object data 106 c may represent an object of the sentence.
  • the query 104 may include both: (1) first data (e.g., a first one of the subject data 106 a , the predicate data 106 b , and the object data 106 c ) representing a first part of a sentence, and (2) second data (e.g., a second one of the subject data 106 a , the predicate data 106 b , and the object data 106 c ) representing a second part of the sentence.
  • the query 104 may further include third data (e.g., a third one of the subject data 106 a , the predicate data 106 b , and the object data 106 c ) representing a third part of the sentence.
  • Techniques disclosed specifically herein in connection with the subject data 106 a (and the subject that it represents), the predicate data 106 b (and the predicate that it represents), and the object data 106 c (and the object that it represents) may be understood more generally to apply to any other data representing a part of a sentence.
  • techniques disclosed specifically herein in connection with the subject data 106 a (and the subject that it represents) may be understood more generally to apply to the predicate data 106 b (and the predicate that it represents) and to the object data 106 c (and the object that it represents).
  • the query 104 may include data in addition to that shown in FIG. 1 .
  • the query 104 may include additional terms, such as any one or more of the following: prepositions, adjectives, adverbs, additional noun phrases, and additional verb phrases.
  • additional terms may be used as further constraints to narrow the search performed by the search engine 116 .
  • the query “fluid reduces vibrations under low temperature” includes the additional phrase “under low temperature,” which consists of a preposition (“under”) and a noun phrase (“low temperature”).
  • the noun phrase consists of an adjective (“low”) and a noun (“temperature”).
  • This additional phrase may be used as an additional constraint to narrow the query “fluid reduces vibrations.”
  • the additional constraints need not be literal, but may be enforced after semantic manipulation. For example, if the phrase “under low temperature” is input as an additional term in the query 104 , the synonym identification module 108 may create one or more synonyms of this phrase, such as “beneath freezing,” “zero degree Celsius or below,” and “less than 32 degrees Fahrenheit.” The additional constraints may then be enforced using the original additional phrase (“under low temperature”) and/or any of the automatically-generated synonyms.
  • the user 102 may provide the query 104 in any of a variety of ways.
  • the user 102 may type, speak, select, or otherwise input text representing any one or more of the subject data 106 a , predicate data 106 b , and object data 106 c .
  • the user 102 may type, speak, or otherwise input a single contiguous text string containing some or all of the subject data 106 a , predicate data 106 b , and object data 106 c in sequence, possibly separated by spaces, tabs, line breaks, or other delimiters.
  • the system 100 may, in response, take any of a variety of actions, such as removing the verb phrase from the query 104 , or prompting the user 102 to provide an alternate verb phrase, receiving an alternate verb phrase from the user, and replacing the original verb phrase with the alternate verb phrase provided by the user 102 .
  • the system 100 may assist the user 102 in providing the alternate verb phrase in any of a variety of ways.
  • the system 100 may automatically identify one or more synonyms (e.g., hypernyms and/or hyponyms) of the original verb phrase and display those synonyms to the user 102 .
  • the system 100 may display the synonyms in any way, such as in a list, or in the form of a hierarchy in which the parent/child relationships among hypernyms and hyponyms is displayed graphically.
  • the user 102 may select one or more of the displayed synonyms, in response to which the system 100 may replace the original verb phrase in the query with the synonym(s) selected by the user 102 .
  • the system 100 may enable the user 102 to select such synonyms in any of a variety of ways, such as by clicking on them or otherwise selecting them with a pointing device (e.g., mouse, touchpad, or touchscreen), or by typing their initial letters.
  • a pointing device e.g., mouse, touchpad, or touchscreen
  • the system 100 may apply the same or similar techniques to assist the user 102 in removing and/or replacing noun phrases in the original version of the query 104 .
  • the system 100 includes a synonym identification module 108 , which receives the original query 104 as input and identifies one or more synonyms for each of one or more of the subject data 106 a , the predicate data 106 b , and the object data 106 c ( FIG. 2 , operation 204 ). More specifically, the synonym identification module 108 may identify, and produce as output, subject synonym data 110 a representing one or more synonyms of the subject represented by the subject data 106 a ( FIG. 2 , operation 204 a ). Similarly, the synonym identification module 108 may identify, and produce as output, predicate synonym data 110 b representing one or more synonyms of the predicate represented by the predicate data 106 b ( FIG. 2 , operation 204 b ). Similarly, the synonym identification module 108 may identify, and produce as output, object synonym data 110 c representing one or more synonyms of the object represented by the object data 106 c ( FIG. 2 , operation 204 c ).
  • the original query 104 need not include all three of the subject data 106 a , the predicate data 106 b , and the object data 106 c .
  • the synonym identification module 108 need not identify synonyms of any data not contained within the original query 104 . More specifically, the synonym identification module 108 need not identify synonyms of the subject data 106 a if the original query 104 does not include the subject data 106 a ; the synonym identification module 108 need not identify synonyms of the predicate data 106 b if the original query 104 does not include the predicate data 106 b ; and the synonym identification module 108 need not identify synonyms of the object data 106 c if the original query 104 does not include the object data 106 c.
  • the synonym identification module 108 need not identify synonyms of all data contained within the original query 104 . For example, if the original query contains only the subject data 106 a and the predicate data 106 b , then the synonym identification module 108 may, but need not, identify synonyms for both the subject data 106 a and the predicate data 106 b . Instead, the synonym identification module 108 may identify synonyms for only the subject data 106 a and not the predicate data 106 b , or for only the predicate data 106 b and not the subject data 106 a .
  • the synonym identification module 108 may, but need not, identify synonyms for both the predicate data 106 b and the object data 106 c . Instead, the synonym identification module 108 may identify synonyms for only the predicate data 106 b and not the object data 106 c , or for only the object data 106 c and not the predicate data 106 b .
  • the synonym identification module 108 may, but need not, identify synonyms for all three of the subject data 106 a , the predicate data 106 b , and the object data 106 c . Instead, the synonym identification module 108 may identify synonyms for only one but not the other two of the subject data 106 a , the predicate data 106 b , and the object data 106 c ; or for only two but not three of the subject data 106 a , predicate data 106 b , and object data 106 c.
  • the synonym identification module 108 may identify any number of synonyms for each of the subject data 106 a , predicate data 106 b , and object data 106 c .
  • the synonym identification module 108 may, for example, identify a different number of synonyms for the subject data 106 a than for the predicate data 106 b , and/or a different number of synonyms for the predicate data 106 b than the object data 106 c.
  • the synonym identification module 108 may initially identify an initial set of synonyms and then enable the user 102 to select a subset of the initial set of synonyms for inclusion in the final set of synonyms 110 .
  • the synonym identification module 108 may initially identify all available synonyms of a particular term in the query 104 (e.g., the subject 106 a ) and enable the user 102 to select a subset of those synonyms for inclusion in the final set of synonyms 110 .
  • the synonym identification module 108 may, for example, display or otherwise provide output representing the initial set of synonyms to the user 102 in any way, such as in a list, or in the form of a hierarchy in which the parent/child relationships among hypernyms and hyponyms is displayed graphically.
  • the user 102 may then provide input selecting one or more of the displayed synonyms, in response to which the system 100 may include the selected synonyms, but not the un-selected synonyms, in the final set of synonyms 110 .
  • the system 100 may initially select all displayed synonyms by default.
  • the user 102 may then provide input de-selecting one or more of the displayed synonyms, in response to which the system 100 may include the still-selected synonyms, but not the synonyms de-selected by the user 102 , in the final set of synonyms 110 .
  • the user 102 may also provide input to the system 100 to specify one or more synonyms to add to the synonyms 110 , such as one or more subject synonyms to add to the subject synonym data 110 a , one or more predicate synonyms 110 b to add to the predicate synonym data 110 b , one or more object synonyms to add to the object synonym data 110 c , or any combination thereof.
  • the user 102 may supplement the synonyms automatically generated by the synonym identification module 108 .
  • the synonym identification module 108 may store a record of any user-specified synonyms, so that the synonym identification module 108 may subsequently automatically identify such synonyms in the future.
  • the system 100 may also include a query formulation module 112 , which may form and produce as output one or more queries 114 based on some or all of the synonyms 110 , possibly in combination with some or all of the original query 104 ( FIG. 2 , operation 206 ).
  • the query formulation module 112 may produce any number of queries 114 . As one extreme example, the queries 114 may consist of a single query.
  • the user 102 may also provide input to the system 100 to specify one or more queries to add to the queries 114 .
  • the user 102 may provide such input in any of the ways described above in connection with the original query 104 .
  • the user 102 may supplement the queries 114 automatically generated by the query formulation module 112 .
  • the query formulation module 112 may store a record of any user-specified queries, so that the query formulation module 112 may subsequently automatically generate such queries based on the same original query 104 in the future.
  • Any particular query in the queries 114 may be of any of the following forms: “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject ⁇ verbPhrase>,” “ ⁇ nounPhrase> predicate,” “ ⁇ verbPhrase> object,” or “predicate ⁇ nounPhrase>.”
  • the queries 114 may include any number of queries (including zero) of each of these forms, so long as the queries 114 include at least one query.
  • the queries 114 may, therefore, include one or more queries of one form and one or more queries of another form.
  • the query formulation module 112 may form each query in the queries 114 by, for example: (1) selecting a form for the query (e.g., “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject ⁇ verbPhrase>,” “ ⁇ nounPhrase> predicate,” “ ⁇ verbPhrase> object,” or “predicate ⁇ nounPhrase>”); (2) selecting, for each term of the query, a term of the corresponding type from the original query 104 or the synonyms 110 ; and (3) forming the query from the terms selected in (2). Note that in certain embodiments of the present invention, the query formulation module 112 only selects terms from the synonyms 110 in (2). Note further that in certain embodiments of the present invention, the query formulation module 112 selects terms in (2) such that at least one term in the resulting query is selected from the synonyms 110 , so that the resulting query is not identical to the original query 104 .
  • a form for the query e.g.,
  • the queries 114 include at least one query that is not identical to the original query 104 , and which therefore includes at least one of the subject synonyms 110 a , predicate synonyms 110 b , or object synonyms 110 c .
  • the queries 114 may include a query that is identical to the original query 104 , so long as the queries 114 include at least one other query that is not identical to the original query 104 .
  • the query formulation module 112 selects the form “predicate object” for a query to generate.
  • the query formulation module 112 may then select a predicate for use in the query to generate by selecting either the original predicate 106 b or a predicate from the set of predicate synonyms 110 b .
  • the query formulation module 112 selects the predicate “minimize” from the predicate synonyms 110 b .
  • the query formulation module 112 may then select an object for use in the query to generate by selecting either the original object 106 c or an object from the set of object synonyms 110 c .
  • the query formulation module 112 selects the object “oscillations” from the object synonyms 110 c.
  • the query formulation module 112 may then form the resulting modified query by combining the selected terms (e.g., subject and predicate; predicate and object; subject, predicate, and object; subject and object; subject and verb phrase; noun phrase and predicate; verb phrase and object; or predicate and noun phrase) together in any of a variety of ways.
  • the query formulation module 112 may form the resulting modified query by forming a conjunction (i.e., logical AND) of all of the selected terms, by forming a disjunction (i.e., logical OR) of all of the selected terms, or by joining the selected terms using any Boolean operator(s) in any combination (e.g., AND, OR, XOR, NOT).
  • the query formulation module 112 may form the resulting modified query by concatenating the selected terms into a single text string, possibly separated by spaces or other delimiters, in the sequence “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject ⁇ verbPhrase>,” “ ⁇ nounPhrase> predicate,” “ ⁇ verbPhrase> object,” or “predicate ⁇ nounPhrase>.”
  • the query formulation module 112 may form any number of queries using any combination of the techniques described above to generate the set of queries 114 , which may include any number of queries that differ from the original query 104 , and which may also include the original query 104 .
  • the system 100 also includes a search engine 116 , which performs searches on a dataset 118 using the queries 114 to produce search results 120 ( FIG. 2 , operation 208 ).
  • the search engine 116 may be any kind of search engine, such as any commercially available search engine.
  • the dataset 118 may be any kind of dataset, such as a database of documents (e.g., patents) or a collection of Web pages on the Internet.
  • the search engine 116 may, for example, perform one search for each query in the queries 114 to produce one set of search results corresponding to each such query, and then combine the resulting sets of search results to produce the search results 120 .
  • the search engine 116 may use any technique(s) to search the dataset 118 using the queries 114 .
  • the search engine 116 may use one or more of the queries 114 as natural language queries to search the dataset 118 .
  • the search engine 116 may use one or more of the queries 114 as structured queries (e.g., in cases in which the queries 114 contain Boolean operators, such as AND, OR, and NOT) to search the dataset 118 .
  • the search engine 116 may treat a record in the dataset 118 as a record that matches a particular one of the queries 114 even if the record does not contain a verbatim copy of the query.
  • the search engine 116 may consider the query “reduce vibrations” to be matched by phrases such as “reduce high-frequency vibrations” and “reduce some of the vibrations.”
  • the extent to which a dataset record may differ from a query yet still be treated by the search engine 116 as matching the query e.g., the number of words that may occur between subject and predicate, predicate and object, or subject and object
  • the search engine 116 may use any search techniques to perform the functions disclosed herein.
  • the search engine 116 may search the entire dataset 118 , this is not a requirement of the present invention. Alternatively, for example, the search engine 116 may search only a subset of the dataset 118 , such as a subset of the records in the dataset 118 , or a portion of each of some or all of the records in the dataset 118 . For example, if the dataset 118 is a database of patents, then the search engine 116 may search only the claims section of each of the patents in the dataset 118 . The user 102 may, for example, specify, in the original query 104 , which portion of the dataset 118 is to be searched by the search engine 116 . Such subset-specifying information may be carried over to one or more of the queries 114 , which may therefore specify which portion of the dataset 118 is to be searched by the search engine 116 .
  • the search results 120 may, for example, be or be derived from a subset of the dataset 118 .
  • the search results 120 may be a subset of the dataset 118 that has properties that satisfy one or more of the queries 114 .
  • the dataset 118 may include a plurality of data records, such as a plurality of documents. If one of the queries 114 matches one of the plurality of documents, then the search engine 116 may copy the matching document into the search results 120 , include a reference to the matching document into the search results 120 , or include a summary of or other information derived from the matching document in the search results 120 .
  • the system 100 may also include a search result processor 122 , which may process the search results 120 in any of a variety of ways to produce processed search results 124 ( FIG. 2 , operation 210 ).
  • the search result processor 122 may remove (filter) certain data records from the search results 120 to produce the processed search results 124 .
  • This filtering may, for example, be performed based on input received from the user 102 , such as one or more of the following: prepositions, adjectives, adverbs, additional noun phrases (i.e., in addition to the subject and object in a query), and additional verb phrases (i.e., in addition to the predicate in a query).
  • the user 102 may input the additional phrase, “under low temperature.”
  • the prepositional phrase, “under low temperature” consists of a preposition (“under”) and a noun phrase (“low temperature”).
  • the noun phrase consists of an adjective (“low”) and a noun (“temperature”).
  • the search result processor 122 may sort the search results 120 to produce the processed search results 124 .
  • the search result processor may re-order the search results so that data records which resulted from performing the original query 104 on the dataset 118 appearing lower in the processed search results 124 than data records which resulted from performing queries other than the original query 104 on the dataset 118 , so that data records representing information analogous to the problem described by the original query 104 are emphasized to the user 102 .
  • the system 100 may enable the user 102 to manually change the order of the processed search results 124 and otherwise modify the processed search results 124 .
  • the system 100 may enable the user 102 to move individual search results up/down in the processed search results 124 , mark individual search results as relevant/irrelevant, mark individual search results as favorites, or otherwise tag individual search results with any kind of data (such as indicates of categories or colors).
  • the system 100 may provide output representing the processed search results 124 (or, if the system 100 does not include the search result processor 122 , output representing the original search results 120 ) to the user 102 ( FIG. 5 , operation 212 ).
  • the system 100 may provide the output representing the processed search results 124 to the user 102 in any of a variety of ways, such as by displaying output representing the processed search results 124 on a display screen, e.g., in the form of textual summaries of the processed search results 124 or in the form of a graph illustrating features of the processed search results 124 .
  • the user 102 may use the system 100 to perform additional searches on the processed search results 124 instead of the dataset 118 .
  • the user 102 may use the system 100 in the manner described above to perform an initial set of searches on the dataset 118 to produce the processed search results 124 , and then use the system 100 in the manner described above to perform one or more additional sets of searches on the processed search results 124 , thereby refining the initial set of processed search results 124 .
  • Such additional searches may be performed using queries that differ from the initial set of queries in any of a variety of ways.
  • the user 102 may perform any one or more of the following to produce a new query: (1) arbitrarily add terms to the original query 104 , such as conditions that restrict terms in the original query (e.g., by adding the condition “>1800 Hz” to the term “vibrations” to restrict the search to vibrations greater than 1800 Hz); (2) add/replace terms in the original query 104 with synonyms (e.g., hypernyms and/or hyponyms) of such terms; and (3) combine the query 104 with one or more other queries.
  • the system 100 may then use the new query to produce a modified set of queries using the techniques disclosed above and then to perform searches on the processed search results 124 using the modified set of queries and thereby to produce yet further processed search results.
  • the system 100 may assist the user 102 in producing such new queries in any of a variety of ways. For example, after performing the initial set of searches using the original query 104 , the system 100 may display the original query 104 to the user 102 and enable the user 102 to browse synonyms (e.g., hypernyms and/or hypernyms) of each term in the query 104 , such as by displaying such synonyms in a hierarchy in the manner described above. The user 102 may then select any such synonym and thereby generate a new query which is identical to the new query 104 except that the synonym selected by the user replaces the original term.
  • synonyms e.g., hypernyms and/or hypernyms
  • the system 100 may store a record of all queries used by the user 102 and enable the user 102 to view all such queries in the record.
  • the user 102 may select any query from the record and instruct the system to perform a search using the selected query, either on the same dataset 118 , an updated version of the dataset 118 , or a different dataset.
  • the user 102 may also select any query from the record, modify the selected query using any of the techniques disclosed herein, and then instruct the system to perform a search using the selected query, either on the same dataset 118 , an updated version of the dataset 118 , or a different dataset.
  • FIGS. 1 and 2 envisage the use of a single original query 104 , this is merely an example and does not constitute a limitation of the present invention.
  • the user 102 may provide a plurality of original queries (such as “reduce vibrations” and “maintain strength”). The techniques disclosed herein may then be applied to each such original query to provide a plurality of queries for each of the original queries. For example, assume that the user 102 provides a first original query and a second original query, which causes the system 100 to produce a first set of final queries and a second set of final queries, respectively.
  • the system 100 may then create a derived query by selecting one of the queries in the first set of final queries and one of the queries in the second set of final queries, and combine the selected queries using a logical operator (e.g., AND, OR, NOT, or XOR).
  • the system 100 may then search the dataset 118 using the derived query to produce search results 120 .
  • the system 100 may perform any number of such searches using any combinations of queries from the first and second sets of queries.
  • Embodiments of the present invention have a variety of advantages. For example, embodiments of the present invention may be used to find information that is analogous to solutions to the problem represented by the user's original query 104 .
  • Embodiments of the present invention may perform searches using queries that are synonyms of “reduce vibrations” to find data that may be used to answer the question, “What physical objects have the effect of reducing vibrations?”
  • Embodiments of the present invention may perform searches using queries that are synonyms of “fluid reduces” to find data that may be used to answer the question, “What does fluid reduce?” If the query is generalized even further to take the form “fluid ⁇ verbPhrase>,” where ⁇ verbPhrase> is a token that is used as a wildcard that is matched by any verb phrase in the dataset 118 (not merely verb phrases that are synonyms of “reduces”), then embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What are the effects of a fluid?” Similarly, if the query is generalized to take the form “ ⁇ nounPhrase> reduces,” where ⁇ nounPhrase> is a token that is used as a wildcard that is matched by any noun phrase in the dataset (not merely noun phrases that are synonyms of “flui
  • Embodiments of the present invention may perform searches using queries that include synonyms of “fluid” and “vibrations” to find data that may be used to answer the question, “What is the functional relationship between fluid and vibrations?” Such searches may, for example, be used to discover the verb phrase (e.g., “reduces”) that links “fluid” to “vibrations.”
  • ⁇ verbPhrase> object such as “ ⁇ verbPhrase> vibrations,” where ⁇ verbPhrase> is a token that is used as a wildcard that is matched by any verb phrase in the dataset.
  • Embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What things effect a change on vibrations?”
  • ⁇ nounPhrase> is a token that is used as a wildcard that is matched by any noun phrase in the dataset.
  • Embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What things have the effect of reducing another thing?”
  • Embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What things semantically synonymous to ‘fluid’ and ‘vibrations’ have a relationship that is semantically synonymous to ‘reduces’?”
  • Any of the functions disclosed herein may be implemented using means for performing those functions. Such means include, but are not limited to, any of the components disclosed herein, such as the computer-related components described below.
  • the techniques described above may be implemented, for example, in hardware, one or more computer programs tangibly stored on one or more computer-readable media, firmware, or any combination thereof.
  • the techniques described above may be implemented in one or more computer programs executing on (or executable by) a programmable computer including any combination of any number of the following: a processor, a storage medium readable and/or writable by the processor (including, for example, volatile and non-volatile memory and/or storage elements), an input device, and an output device.
  • Program code may be applied to input entered using the input device to perform the functions described and to generate output using the output device.
  • Each computer program within the scope of the claims below may be implemented in any programming language, such as assembly language, machine language, a high-level procedural programming language, or an object-oriented programming language.
  • the programming language may, for example, be a compiled or interpreted programming language.
  • Each such computer program may be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a computer processor.
  • Method steps of the invention may be performed by one or more computer processors executing a program tangibly embodied on a computer-readable medium to perform functions of the invention by operating on input and generating output.
  • Suitable processors include, by way of example, both general and special purpose microprocessors.
  • the processor receives (reads) instructions and data from a memory (such as a read-only memory and/or a random access memory) and writes (stores) instructions and data to the memory.
  • Storage devices suitable for tangibly embodying computer program instructions and data include, for example, all forms of non-volatile memory, such as semiconductor memory devices, including EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROMs. Any of the foregoing may be supplemented by, or incorporated in, specially-designed ASICs (application-specific integrated circuits) or FPGAs (Field-Programmable Gate Arrays).
  • a computer can generally also receive (read) programs and data from, and write (store) programs and data to, a non-transitory computer-readable storage medium such as an internal disk (not shown) or a removable disk.
  • Any data disclosed herein may be implemented, for example, in one or more data structures tangibly stored on a non-transitory computer-readable medium. Embodiments of the invention may store such data in such data structure(s) and read such data from such data structure(s).

Abstract

A user provides a query that includes at least two of a subject, a predicate, and an object. A computer system identifies synonyms of one or more of the subject, predicate, and object, and forms new queries from the identified synonyms. The system searches a dataset using the new queries, and possibly also using the user-provided query, to produce search results. The system may process the search results, such as by filtering and/or sorting them. The system provides output representing the search results to the user. The user may use the search result output to identify answers that are analogous to answers to the query originally provided by the user.

Description

    STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
  • This invention was made with government support under Grant Nos. IIP1261052 and IIP1127609 from the National Science Foundation. The government has certain rights in the invention.
  • BACKGROUND
  • “Design fixation” is the tendency to fixate on the features of known solutions when trying to create novel solutions (Jansson & Smith, 1991). For example, a subject who is shown an existing chair and then asked to design an improved chair is likely to fixate on features of the existing chair when attempting to design an improved chair. Such fixation can lead the subject to overlook features that would be useful to include in an improved chair, but which are lacking in the existing chair.
  • Furthermore, it is well known that people struggle with recognizing analogous solutions that are provided to them before working on a particular problem. In an experiment, a story can be read before a problem is provided. The story contains an analogous solution. Without explicit hints, very few subjects (e.g., only about 10%) recognize the analogy and transfer the solution to the problem on which they are assigned to work (Gick and Holyoak, 1980, 1983). As these experiments demonstrate, it is difficult for people to transfer analogies from one situation to another.
  • SUMMARY
  • A user provides a query that includes at least two of a subject, a predicate, and an object. A computer system identifies synonyms of one or more of the subject, predicate, and object, and forms new queries from the identified synonyms. The system searches a dataset using the new queries, and possibly also using the user-provided query, to produce search results. The system may process the search results, such as by filtering and/or sorting them. The system provides output representing the search results to the user. The user may use the search result output to identify answers that are analogous to answers to the query originally provided by the user.
  • For example, one embodiment of the present invention is directed to a method performed by at least one computer processor. The method includes: (A) obtaining predicate data representing a predicate; (B) obtaining object data representing an object of the predicate; (C) identifying a synonym of the predicate; (D) identifying a synonym of the object; (E) forming a query from the synonym of the predicate and the synonym of the object; and (F) searching a dataset using the query to identify a subset of the dataset.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a dataflow diagram of a system for assisting in overcoming design fixation by searching a dataset using queries that are analogous to a query specified by a user according to one embodiment of the present invention; and
  • FIG. 2 is a flowchart of a method performed by the system of FIG. 1 according to one embodiment of the present invention.
  • DETAILED DESCRIPTION
  • Embodiments of the present invention may be used to alleviate design fixation and to promote analogy transfer in a variety of ways. For example, embodiments of the present invention may be used to obtain a query taking the form of “subject predicate object” or “predicate object” (where the subject is a first noun phrase, the predicate is a verb phrase, and the object is a second noun phrase); obtain one or more synonyms (e.g., hypernyms and/or hyponyms) of one or more of the subject, predicate, and object; form modified queries using combinations of the subject, predicate, object and/or their synonyms; search a dataset using the modified queries; and provide the results of the search to the user. Such techniques may be used to search for and obtain search results representing analogs to the search results that would have been found using the original query provided by the user.
  • Consider an example in which the user enters a query describing a desired result to be achieved, such as “reduce vibrations,” in which “reduce” is the predicate and in which “vibrations” is the object. Embodiments of the present invention may obtain a synonym of “reduce,” such as “minimize,” and form a first revised query from this synonym and the original object, e.g., “minimize vibrations.” As another example, embodiments of the present invention may obtain a synonym of “vibrations,” such as “oscillations,” and form a second revised query from this synonym and the original predicate, e.g., “reduce oscillations.” As yet another example, embodiments of the present invention may form a third revised query from the synonym of the predicate and the synonym of the object, e.g. “minimize oscillations.” Embodiments of the present invention may then search a dataset using some or all of the first, second, and third revised queries, and possibly also using the original query submitted by the user. Embodiments of the present invention may then provide the results of some or all of such searches to the user. Similar techniques may be applied to queries of the form “subject predicate object,” such as “fluid reduces vibrations.”
  • Any of the techniques disclosed herein which obtain synonyms of a particular term, phrase, or query as a whole, may be used to obtain antonyms of the particular term, phrase, or query as a whole. For example, the techniques disclosed herein may be used to obtain an antonym of the predicate (e.g., “reduce”) and an antonym of an object (e.g., “vibrations”) in a query to produce a first revised query from the two synonyms, e.g., “increase stability.” As another example, the techniques disclosed herein may be used to obtain an antonym of the subject (e.g., “fluid”) and an antonym of a predicate (e.g., “reduces”) in a query to produce a second revised query from the two antonyms, e.g., “solids increase.” As another example, the techniques disclosed herein may be used to obtain an antonym of the subject (e.g., “fluid”) and an antonym of an object (e.g., “vibrations”) in a query to produce a third revised query from the two antonyms, e.g., “solid stability.” As described in more detail below, any computer-generated antonyms of a particular term (e.g., subject, predicate, or object), phrase, or query may be modified and/or supplemented by the input of a human user.
  • Search results produced using the techniques summarized above are likely to include data records (such as web pages, patents, or other documents) that contain information related not only to the result described explicitly by the user-submitted query (i.e., reducing vibrations), but also data records that contain information related to results that are analogous to the result described by the user-submitted query. The techniques summarized above, therefore, may be used to assist the user in identifying analogous solutions to the problem described by the user-submitted query.
  • Having described in general certain features of embodiments of the present invention, particular embodiments of the present invention will now be described in more detail. Referring to FIG. 1, a dataflow diagram is shown of a system 100 for finding data that is analogous to the answer to a query provided by a user according to one embodiment of the present invention. Referring to FIG. 2, a flowchart is shown of a method 200 that is performed by the system 100 of FIG. 1 according to one embodiment of the present invention.
  • A user 102 provides a query 104 to the system 100 (FIG. 2, operation 202). The query 104 is referred to herein as the “original query” to distinguish it from other queries referred to herein.
  • The original query 104 may take a variety of forms. For example, in FIG. 1 the query 104 is shown as containing subject data 106 a representing a subject, predicate data 106 b representing a predicate, and object data 106 c representing an object. For example, the subject data 106 a may be the text “fluid,” the predicate data may be the text “reduces,” and the object data 106 c may be the text “vibrations.” In general, the subject represented by the subject data 106 a is a subject of the predicate represented by the predicate data 106 b, and the object represented by the object data 106 c is an object of the predicate represented by the predicate data 106 b.
  • The subject data 106 a may, for example, be or represent a first noun phrase containing one or more words (also signified herein as “nounPhrase1”). The predicate data 106 b may, for example, be or represent a verb phrase containing one or more words (also signified herein as “verbPhrase”). The term “predicate” as used herein is synonymous with “verb phrase” and is not limited to any particular type of verb phrase. The object data 106 c may, for example, be or represent a second noun phrase containing one or more words (also signified herein as “nounPhrase2”).
  • Any of these three phrases may be expressed in any natural language, artificial language, or any combination thereof. Any of these three phrases may be singular or plural (e.g., “fluid” or “fluids”), be conjugated in any manner (e.g., “reduce” or “reduces”), and be expressed in any tense (e.g., “reduce,” “reduced,” or “reducing”). The system 100 may perform any of a variety of pre-processing on any of these three phrases, such as any kind of stemming, to produce modified versions of one or more of the three phrases, which may then be used in addition to or instead of the original versions of the three phrases in any of the processing disclosed herein.
  • The query 104 need not include all three of the subject data 106 a, predicate data 106 b, and object data 106 c. For example, the query 104 may include the subject data 106 a and predicate data 106 b, but not the object data 106 c (e.g., “fluid reduces”). As another example, the query 104 may include the predicate data 106 b and the object data 106 c, but not the subject data 106 a (e.g., “reduce vibrations”).
  • More generally, the query 104 may include any two of the subject data 106 a, the predicate data 106 b, and the object data 106 c. Each of the subject data 106 a, the predicate data 106 b, and the object data 106 c may represent a corresponding part of a sentence. More specifically, the subject data 106 a may represent a subject of a sentence, the predicate data 106 b may represent a predicate of the sentence, and the object data 106 c may represent an object of the sentence. Therefore, in general, the query 104 may include both: (1) first data (e.g., a first one of the subject data 106 a, the predicate data 106 b, and the object data 106 c) representing a first part of a sentence, and (2) second data (e.g., a second one of the subject data 106 a, the predicate data 106 b, and the object data 106 c) representing a second part of the sentence. The query 104 may further include third data (e.g., a third one of the subject data 106 a, the predicate data 106 b, and the object data 106 c) representing a third part of the sentence.
  • Techniques disclosed specifically herein in connection with the subject data 106 a (and the subject that it represents), the predicate data 106 b (and the predicate that it represents), and the object data 106 c (and the object that it represents) may be understood more generally to apply to any other data representing a part of a sentence. For example, techniques disclosed specifically herein in connection with the subject data 106 a (and the subject that it represents) may be understood more generally to apply to the predicate data 106 b (and the predicate that it represents) and to the object data 106 c (and the object that it represents).
  • The query 104 may include data in addition to that shown in FIG. 1. For example, the query 104 may include additional terms, such as any one or more of the following: prepositions, adjectives, adverbs, additional noun phrases, and additional verb phrases. Such additional terms may be used as further constraints to narrow the search performed by the search engine 116. For example, the query “fluid reduces vibrations under low temperature” includes the additional phrase “under low temperature,” which consists of a preposition (“under”) and a noun phrase (“low temperature”). In turn, the noun phrase consists of an adjective (“low”) and a noun (“temperature”). This additional phrase may be used as an additional constraint to narrow the query “fluid reduces vibrations.” The additional constraints need not be literal, but may be enforced after semantic manipulation. For example, if the phrase “under low temperature” is input as an additional term in the query 104, the synonym identification module 108 may create one or more synonyms of this phrase, such as “beneath freezing,” “zero degree Celsius or below,” and “less than 32 degrees Fahrenheit.” The additional constraints may then be enforced using the original additional phrase (“under low temperature”) and/or any of the automatically-generated synonyms.
  • The user 102 may provide the query 104 in any of a variety of ways. For example, the user 102 may type, speak, select, or otherwise input text representing any one or more of the subject data 106 a, predicate data 106 b, and object data 106 c. For example, the user 102 may type, speak, or otherwise input a single contiguous text string containing some or all of the subject data 106 a, predicate data 106 b, and object data 106 c in sequence, possibly separated by spaces, tabs, line breaks, or other delimiters. For example, the user 102 may type, speak, or otherwise input a single contiguous text string containing both the subject data 106 a and the predicate data 106 b in sequence (e.g., “fluid reduces”), a single contiguous text string containing both the predicate data 106 b and the object data 106 c in sequence (e.g., “reduces vibrations”), or a single contiguous text string containing the subject data 106 a, the predicate data 106 b, and the object data 106 c in sequence (e.g., “fluid reduces vibrations”).
  • The system 100 may assist the user 102 in providing the query 104 in any of a variety of ways. For example, the user 102 may provide an initial version of the query 104 as input to the system 100. In response, the system 100 may identify each verb phrase in the initial version of the query 104, or use conventional text analytic techniques to identify the primary verb in the initial version of the query 104. The system 100 may then provide output prompting the user 102 to indicate whether each identified verb phrase characterizes the essential process the user 102 is attempting to define. In response, the user 102 may provide input either confirming or disconfirming that a particular verb phrase characterizes the essential process the user 102 is attempting to define. If the input received from the user 102 confirms that a particular verb phrase characterizes the essential process the user 102 is attempting to define, then the system 100 may, in response, retain the verb phrase in the query 104.
  • If the input received from the user 102 does not confirm that a verb phrase characterizes the essential process the user 102 is attempting to define, then the system 100 may, in response, take any of a variety of actions, such as removing the verb phrase from the query 104, or prompting the user 102 to provide an alternate verb phrase, receiving an alternate verb phrase from the user, and replacing the original verb phrase with the alternate verb phrase provided by the user 102. In the latter case, the system 100 may assist the user 102 in providing the alternate verb phrase in any of a variety of ways. For example, the system 100 may automatically identify one or more synonyms (e.g., hypernyms and/or hyponyms) of the original verb phrase and display those synonyms to the user 102. The system 100 may display the synonyms in any way, such as in a list, or in the form of a hierarchy in which the parent/child relationships among hypernyms and hyponyms is displayed graphically.
  • Regardless of how the synonyms are selected and displayed to the user 102, the user 102 may select one or more of the displayed synonyms, in response to which the system 100 may replace the original verb phrase in the query with the synonym(s) selected by the user 102. The system 100 may enable the user 102 to select such synonyms in any of a variety of ways, such as by clicking on them or otherwise selecting them with a pointing device (e.g., mouse, touchpad, or touchscreen), or by typing their initial letters.
  • The system 100 may apply the same or similar techniques to assist the user 102 in removing and/or replacing noun phrases in the original version of the query 104.
  • The system 100 includes a synonym identification module 108, which receives the original query 104 as input and identifies one or more synonyms for each of one or more of the subject data 106 a, the predicate data 106 b, and the object data 106 c (FIG. 2, operation 204). More specifically, the synonym identification module 108 may identify, and produce as output, subject synonym data 110 a representing one or more synonyms of the subject represented by the subject data 106 a (FIG. 2, operation 204 a). Similarly, the synonym identification module 108 may identify, and produce as output, predicate synonym data 110 b representing one or more synonyms of the predicate represented by the predicate data 106 b (FIG. 2, operation 204 b). Similarly, the synonym identification module 108 may identify, and produce as output, object synonym data 110 c representing one or more synonyms of the object represented by the object data 106 c (FIG. 2, operation 204 c).
  • As mentioned above, the original query 104 need not include all three of the subject data 106 a, the predicate data 106 b, and the object data 106 c. The synonym identification module 108 need not identify synonyms of any data not contained within the original query 104. More specifically, the synonym identification module 108 need not identify synonyms of the subject data 106 a if the original query 104 does not include the subject data 106 a; the synonym identification module 108 need not identify synonyms of the predicate data 106 b if the original query 104 does not include the predicate data 106 b; and the synonym identification module 108 need not identify synonyms of the object data 106 c if the original query 104 does not include the object data 106 c.
  • Furthermore, the synonym identification module 108 need not identify synonyms of all data contained within the original query 104. For example, if the original query contains only the subject data 106 a and the predicate data 106 b, then the synonym identification module 108 may, but need not, identify synonyms for both the subject data 106 a and the predicate data 106 b. Instead, the synonym identification module 108 may identify synonyms for only the subject data 106 a and not the predicate data 106 b, or for only the predicate data 106 b and not the subject data 106 a. Similarly, if the original query contains only the predicate data 106 b and the object data 106 c, then the synonym identification module 108 may, but need not, identify synonyms for both the predicate data 106 b and the object data 106 c. Instead, the synonym identification module 108 may identify synonyms for only the predicate data 106 b and not the object data 106 c, or for only the object data 106 c and not the predicate data 106 b. Finally, if the original query contains the subject data 106 a, the predicate data 106 b, and the object data 106 c, then the synonym identification module 108 may, but need not, identify synonyms for all three of the subject data 106 a, the predicate data 106 b, and the object data 106 c. Instead, the synonym identification module 108 may identify synonyms for only one but not the other two of the subject data 106 a, the predicate data 106 b, and the object data 106 c; or for only two but not three of the subject data 106 a, predicate data 106 b, and object data 106 c.
  • The synonym identification module 108 may identify any type(s) of synonyms for the subject data 106 a, predicate data 106 b, and object data 106 c in any manner. For example, the synonym identification module 108 may identify the synonyms 110 using a publicly-available database, such as WordNet from Princeton University or Thesaurus.com. The synonyms 110 identified by the synonym identification module 108 may include, for example, hyponyms and/or hypernyms of the subject, predicate, and object.
  • The synonym identification module 108 may identify any number of synonyms for each of the subject data 106 a, predicate data 106 b, and object data 106 c. The synonym identification module 108 may, for example, identify a different number of synonyms for the subject data 106 a than for the predicate data 106 b, and/or a different number of synonyms for the predicate data 106 b than the object data 106 c.
  • The synonym identification module 108 may initially identify an initial set of synonyms and then enable the user 102 to select a subset of the initial set of synonyms for inclusion in the final set of synonyms 110. For example, the synonym identification module 108 may initially identify all available synonyms of a particular term in the query 104 (e.g., the subject 106 a) and enable the user 102 to select a subset of those synonyms for inclusion in the final set of synonyms 110. The synonym identification module 108 may, for example, display or otherwise provide output representing the initial set of synonyms to the user 102 in any way, such as in a list, or in the form of a hierarchy in which the parent/child relationships among hypernyms and hyponyms is displayed graphically. The user 102 may then provide input selecting one or more of the displayed synonyms, in response to which the system 100 may include the selected synonyms, but not the un-selected synonyms, in the final set of synonyms 110. Conversely, for example, the system 100 may initially select all displayed synonyms by default. The user 102 may then provide input de-selecting one or more of the displayed synonyms, in response to which the system 100 may include the still-selected synonyms, but not the synonyms de-selected by the user 102, in the final set of synonyms 110.
  • The user 102 may also provide input to the system 100 to specify one or more synonyms to add to the synonyms 110, such as one or more subject synonyms to add to the subject synonym data 110 a, one or more predicate synonyms 110 b to add to the predicate synonym data 110 b, one or more object synonyms to add to the object synonym data 110 c, or any combination thereof. In this way, the user 102 may supplement the synonyms automatically generated by the synonym identification module 108. The synonym identification module 108 may store a record of any user-specified synonyms, so that the synonym identification module 108 may subsequently automatically identify such synonyms in the future.
  • The system 100 may also include a query formulation module 112, which may form and produce as output one or more queries 114 based on some or all of the synonyms 110, possibly in combination with some or all of the original query 104 (FIG. 2, operation 206). The query formulation module 112 may produce any number of queries 114. As one extreme example, the queries 114 may consist of a single query. At the other extreme, the queries 114 may include all possible queries of the form “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject <verbPhrase>,” “<nounPhrase> predicate,” “<verbPhrase> object,” or “predicate <nounPhrase>,” where “subject,” “predicate,” and “object” are drawn from the subject synonym data 110 a (and possibly from the original subject data 106 a), the predicate synonym data 110 b (and possibly from the original predicate data 106 b), and the object synonym data 110 c (and possibly from the original object data 106 c), respectively. In these examples, <verbPhrase> and <nounPhrase> are variable names which represent any verb phrase or noun phrase, respectively. Therefore, for example, when the search engine 116 searches the dataset 118 for the phrase, “reduce <verbPhrase>,” the search engine 116 may consider such a phrase to match text in the dataset 118 containing the word “reduce” followed by any verb phrase.
  • The query formulation module 112 may initially identify an initial set of queries and then enable the user 102 to select a subset of the initial set of queries for inclusion in the final set of queries 114. The query formulation module 112 may, for example, display or otherwise provide output representing the initial set of queries to the user 102 in any way, such as in a list, or in the form of a hierarchy in which the parent/child relationships among the hypernyms and hyponyms in the initial set of queries is displayed graphically. The user 102 may then provide input selecting one or more of the displayed queries, in response to which the system 100 may include the selected queries, but not the un-selected queries, in the final set of queries 114. Conversely, for example, the system 100 may initially select all displayed queries by default. The user 102 may then provide input de-selecting one or more of the displayed queries, in response to which the system 100 may include the still-selected queries, but not the queries de-selected by the user 102, in the final set of queries 114.
  • The user 102 may also provide input to the system 100 to specify one or more queries to add to the queries 114. The user 102 may provide such input in any of the ways described above in connection with the original query 104. In this way, the user 102 may supplement the queries 114 automatically generated by the query formulation module 112. The query formulation module 112 may store a record of any user-specified queries, so that the query formulation module 112 may subsequently automatically generate such queries based on the same original query 104 in the future.
  • Any particular query in the queries 114 may be of any of the following forms: “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject <verbPhrase>,” “<nounPhrase> predicate,” “<verbPhrase> object,” or “predicate <nounPhrase>.” The queries 114 may include any number of queries (including zero) of each of these forms, so long as the queries 114 include at least one query. The queries 114 may, therefore, include one or more queries of one form and one or more queries of another form.
  • The query formulation module 112 may form each query in the queries 114 by, for example: (1) selecting a form for the query (e.g., “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject <verbPhrase>,” “<nounPhrase> predicate,” “<verbPhrase> object,” or “predicate <nounPhrase>”); (2) selecting, for each term of the query, a term of the corresponding type from the original query 104 or the synonyms 110; and (3) forming the query from the terms selected in (2). Note that in certain embodiments of the present invention, the query formulation module 112 only selects terms from the synonyms 110 in (2). Note further that in certain embodiments of the present invention, the query formulation module 112 selects terms in (2) such that at least one term in the resulting query is selected from the synonyms 110, so that the resulting query is not identical to the original query 104.
  • The only requirement is that the queries 114 include at least one query that is not identical to the original query 104, and which therefore includes at least one of the subject synonyms 110 a, predicate synonyms 110 b, or object synonyms 110 c. The queries 114 may include a query that is identical to the original query 104, so long as the queries 114 include at least one other query that is not identical to the original query 104.
  • For example, assume that the original query 104 contains the predicate “reduce” and the object “vibrations,” and that the query formulation module 112 selects the form “predicate object” for a query to generate. The query formulation module 112 may then select a predicate for use in the query to generate by selecting either the original predicate 106 b or a predicate from the set of predicate synonyms 110 b. Assume for purpose of example that the query formulation module 112 selects the predicate “minimize” from the predicate synonyms 110 b. The query formulation module 112 may then select an object for use in the query to generate by selecting either the original object 106 c or an object from the set of object synonyms 110 c. Assume for purpose of example that the query formulation module 112 selects the object “oscillations” from the object synonyms 110 c.
  • The query formulation module 112 may then form the resulting modified query by combining the selected terms (e.g., subject and predicate; predicate and object; subject, predicate, and object; subject and object; subject and verb phrase; noun phrase and predicate; verb phrase and object; or predicate and noun phrase) together in any of a variety of ways. For example, the query formulation module 112 may form the resulting modified query by forming a conjunction (i.e., logical AND) of all of the selected terms, by forming a disjunction (i.e., logical OR) of all of the selected terms, or by joining the selected terms using any Boolean operator(s) in any combination (e.g., AND, OR, XOR, NOT). As another example, the query formulation module 112 may form the resulting modified query by concatenating the selected terms into a single text string, possibly separated by spaces or other delimiters, in the sequence “subject predicate,” “predicate object,” “subject predicate object,” “subject object,” “subject <verbPhrase>,” “<nounPhrase> predicate,” “<verbPhrase> object,” or “predicate <nounPhrase>.”
  • The query formulation module 112 may form any number of queries using any combination of the techniques described above to generate the set of queries 114, which may include any number of queries that differ from the original query 104, and which may also include the original query 104.
  • The system 100 also includes a search engine 116, which performs searches on a dataset 118 using the queries 114 to produce search results 120 (FIG. 2, operation 208). The search engine 116 may be any kind of search engine, such as any commercially available search engine. The dataset 118 may be any kind of dataset, such as a database of documents (e.g., patents) or a collection of Web pages on the Internet.
  • The search engine 116 may, for example, perform one search for each query in the queries 114 to produce one set of search results corresponding to each such query, and then combine the resulting sets of search results to produce the search results 120.
  • The search engine 116 may use any technique(s) to search the dataset 118 using the queries 114. For example, the search engine 116 may use one or more of the queries 114 as natural language queries to search the dataset 118. As another example, the search engine 116 may use one or more of the queries 114 as structured queries (e.g., in cases in which the queries 114 contain Boolean operators, such as AND, OR, and NOT) to search the dataset 118. As these examples imply, the search engine 116 may treat a record in the dataset 118 as a record that matches a particular one of the queries 114 even if the record does not contain a verbatim copy of the query. For example, the search engine 116 may consider the query “reduce vibrations” to be matched by phrases such as “reduce high-frequency vibrations” and “reduce some of the vibrations.” The extent to which a dataset record may differ from a query yet still be treated by the search engine 116 as matching the query (e.g., the number of words that may occur between subject and predicate, predicate and object, or subject and object) may be adjustable by the user 102. These particular examples are provided merely as illustrations and not as limitations of the present invention. More generally, the search engine 116 may use any search techniques to perform the functions disclosed herein.
  • Although the search engine 116 may search the entire dataset 118, this is not a requirement of the present invention. Alternatively, for example, the search engine 116 may search only a subset of the dataset 118, such as a subset of the records in the dataset 118, or a portion of each of some or all of the records in the dataset 118. For example, if the dataset 118 is a database of patents, then the search engine 116 may search only the claims section of each of the patents in the dataset 118. The user 102 may, for example, specify, in the original query 104, which portion of the dataset 118 is to be searched by the search engine 116. Such subset-specifying information may be carried over to one or more of the queries 114, which may therefore specify which portion of the dataset 118 is to be searched by the search engine 116.
  • The search results 120 may, for example, be or be derived from a subset of the dataset 118. For example, the search results 120 may be a subset of the dataset 118 that has properties that satisfy one or more of the queries 114. For example, the dataset 118 may include a plurality of data records, such as a plurality of documents. If one of the queries 114 matches one of the plurality of documents, then the search engine 116 may copy the matching document into the search results 120, include a reference to the matching document into the search results 120, or include a summary of or other information derived from the matching document in the search results 120.
  • The system 100 may also include a search result processor 122, which may process the search results 120 in any of a variety of ways to produce processed search results 124 (FIG. 2, operation 210). For example, the search result processor 122 may remove (filter) certain data records from the search results 120 to produce the processed search results 124. This filtering may, for example, be performed based on input received from the user 102, such as one or more of the following: prepositions, adjectives, adverbs, additional noun phrases (i.e., in addition to the subject and object in a query), and additional verb phrases (i.e., in addition to the predicate in a query). For example, if one of the queries 114 is “fluid reduces vibrations,” then the user 102 may input the additional phrase, “under low temperature.” The prepositional phrase, “under low temperature” consists of a preposition (“under”) and a noun phrase (“low temperature”). In turn, the noun phrase consists of an adjective (“low”) and a noun (“temperature”). These terms may be used by the search result processor 122 to filter the search results 120 by removing from the search results 120 any results that do not satisfy the additional constraints specified by the phrase, “under low temperature,” and thereby to produce the processed search results 124.
  • As another example, the search result processor 122 may sort the search results 120 to produce the processed search results 124. For example, the search result processor may re-order the search results so that data records which resulted from performing the original query 104 on the dataset 118 appearing lower in the processed search results 124 than data records which resulted from performing queries other than the original query 104 on the dataset 118, so that data records representing information analogous to the problem described by the original query 104 are emphasized to the user 102.
  • Although the search result processor 122 may sort the search results 120 automatically to produce the processed search results 124, the system 100 may enable the user 102 to manually change the order of the processed search results 124 and otherwise modify the processed search results 124. For example, the system 100 may enable the user 102 to move individual search results up/down in the processed search results 124, mark individual search results as relevant/irrelevant, mark individual search results as favorites, or otherwise tag individual search results with any kind of data (such as indicates of categories or colors).
  • The system 100 may provide output representing the processed search results 124 (or, if the system 100 does not include the search result processor 122, output representing the original search results 120) to the user 102 (FIG. 5, operation 212). The system 100 may provide the output representing the processed search results 124 to the user 102 in any of a variety of ways, such as by displaying output representing the processed search results 124 on a display screen, e.g., in the form of textual summaries of the processed search results 124 or in the form of a graph illustrating features of the processed search results 124.
  • The techniques disclosed above may be repeated any number of times. For example, the user 102 may use the system 100 to perform additional searches on the processed search results 124 instead of the dataset 118. In other words, the user 102 may use the system 100 in the manner described above to perform an initial set of searches on the dataset 118 to produce the processed search results 124, and then use the system 100 in the manner described above to perform one or more additional sets of searches on the processed search results 124, thereby refining the initial set of processed search results 124. Such additional searches may be performed using queries that differ from the initial set of queries in any of a variety of ways. For example, the user 102 may perform any one or more of the following to produce a new query: (1) arbitrarily add terms to the original query 104, such as conditions that restrict terms in the original query (e.g., by adding the condition “>1800 Hz” to the term “vibrations” to restrict the search to vibrations greater than 1800 Hz); (2) add/replace terms in the original query 104 with synonyms (e.g., hypernyms and/or hyponyms) of such terms; and (3) combine the query 104 with one or more other queries. The system 100 may then use the new query to produce a modified set of queries using the techniques disclosed above and then to perform searches on the processed search results 124 using the modified set of queries and thereby to produce yet further processed search results.
  • The system 100 may assist the user 102 in producing such new queries in any of a variety of ways. For example, after performing the initial set of searches using the original query 104, the system 100 may display the original query 104 to the user 102 and enable the user 102 to browse synonyms (e.g., hypernyms and/or hypernyms) of each term in the query 104, such as by displaying such synonyms in a hierarchy in the manner described above. The user 102 may then select any such synonym and thereby generate a new query which is identical to the new query 104 except that the synonym selected by the user replaces the original term.
  • The system 100 may store a record of all queries used by the user 102 and enable the user 102 to view all such queries in the record. The user 102 may select any query from the record and instruct the system to perform a search using the selected query, either on the same dataset 118, an updated version of the dataset 118, or a different dataset. The user 102 may also select any query from the record, modify the selected query using any of the techniques disclosed herein, and then instruct the system to perform a search using the selected query, either on the same dataset 118, an updated version of the dataset 118, or a different dataset.
  • Although FIGS. 1 and 2, and the examples described above, envisage the use of a single original query 104, this is merely an example and does not constitute a limitation of the present invention. The user 102 may provide a plurality of original queries (such as “reduce vibrations” and “maintain strength”). The techniques disclosed herein may then be applied to each such original query to provide a plurality of queries for each of the original queries. For example, assume that the user 102 provides a first original query and a second original query, which causes the system 100 to produce a first set of final queries and a second set of final queries, respectively. The system 100 may then create a derived query by selecting one of the queries in the first set of final queries and one of the queries in the second set of final queries, and combine the selected queries using a logical operator (e.g., AND, OR, NOT, or XOR). The system 100 may then search the dataset 118 using the derived query to produce search results 120. The system 100 may perform any number of such searches using any combinations of queries from the first and second sets of queries.
  • Embodiments of the present invention have a variety of advantages. For example, embodiments of the present invention may be used to find information that is analogous to solutions to the problem represented by the user's original query 104. Consider, for example, a query of the form “predicate object,” such as “reduce vibrations.” Embodiments of the present invention may perform searches using queries that are synonyms of “reduce vibrations” to find data that may be used to answer the question, “What physical objects have the effect of reducing vibrations?”
  • Now consider, for example, a query of the form “subject predicate,” such as “fluid reduces.” Embodiments of the present invention may perform searches using queries that are synonyms of “fluid reduces” to find data that may be used to answer the question, “What does fluid reduce?” If the query is generalized even further to take the form “fluid <verbPhrase>,” where <verbPhrase> is a token that is used as a wildcard that is matched by any verb phrase in the dataset 118 (not merely verb phrases that are synonyms of “reduces”), then embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What are the effects of a fluid?” Similarly, if the query is generalized to take the form “<nounPhrase> reduces,” where <nounPhrase> is a token that is used as a wildcard that is matched by any noun phrase in the dataset (not merely noun phrases that are synonyms of “fluid”), then embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What are the known things that have the effect of reducing another thing?”
  • Now consider, for example, a query of the form “<nounPhrase1> <nounPhrase2>,” such as “fluid vibrations.” Embodiments of the present invention may perform searches using queries that include synonyms of “fluid” and “vibrations” to find data that may be used to answer the question, “What is the functional relationship between fluid and vibrations?” Such searches may, for example, be used to discover the verb phrase (e.g., “reduces”) that links “fluid” to “vibrations.”
  • Now consider, for example, a query of the form “<verbPhrase> object,” such as “<verbPhrase> vibrations,” where <verbPhrase> is a token that is used as a wildcard that is matched by any verb phrase in the dataset. Embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What things effect a change on vibrations?”
  • Now consider, for example, a query of the form “predicate <nounPhrase>,” such as “reduces <nounPhrase>,” where <nounPhrase> is a token that is used as a wildcard that is matched by any noun phrase in the dataset. Embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What things have the effect of reducing another thing?”
  • Now consider, for example, a query of the form “subject predicate object,” such as “fluid reduces vibrations.” Embodiments of the present invention may perform searches using such a query to find data that may be used to answer the question, “What things semantically synonymous to ‘fluid’ and ‘vibrations’ have a relationship that is semantically synonymous to ‘reduces’?”
  • It is to be understood that although the invention has been described above in terms of particular embodiments, the foregoing embodiments are provided as illustrative only, and do not limit or define the scope of the invention. Various other embodiments, including but not limited to the following, are also within the scope of the claims. For example, elements and components described herein may be further divided into additional components or joined together to form fewer components for performing the same functions.
  • Any of the functions disclosed herein may be implemented using means for performing those functions. Such means include, but are not limited to, any of the components disclosed herein, such as the computer-related components described below.
  • The techniques described above may be implemented, for example, in hardware, one or more computer programs tangibly stored on one or more computer-readable media, firmware, or any combination thereof. The techniques described above may be implemented in one or more computer programs executing on (or executable by) a programmable computer including any combination of any number of the following: a processor, a storage medium readable and/or writable by the processor (including, for example, volatile and non-volatile memory and/or storage elements), an input device, and an output device. Program code may be applied to input entered using the input device to perform the functions described and to generate output using the output device.
  • Each computer program within the scope of the claims below may be implemented in any programming language, such as assembly language, machine language, a high-level procedural programming language, or an object-oriented programming language. The programming language may, for example, be a compiled or interpreted programming language.
  • Each such computer program may be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a computer processor. Method steps of the invention may be performed by one or more computer processors executing a program tangibly embodied on a computer-readable medium to perform functions of the invention by operating on input and generating output. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, the processor receives (reads) instructions and data from a memory (such as a read-only memory and/or a random access memory) and writes (stores) instructions and data to the memory. Storage devices suitable for tangibly embodying computer program instructions and data include, for example, all forms of non-volatile memory, such as semiconductor memory devices, including EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROMs. Any of the foregoing may be supplemented by, or incorporated in, specially-designed ASICs (application-specific integrated circuits) or FPGAs (Field-Programmable Gate Arrays). A computer can generally also receive (read) programs and data from, and write (store) programs and data to, a non-transitory computer-readable storage medium such as an internal disk (not shown) or a removable disk. These elements will also be found in a conventional desktop or workstation computer as well as other computers suitable for executing computer programs implementing the methods described herein, which may be used in conjunction with any digital print engine or marking engine, display monitor, or other raster output device capable of producing color or gray scale pixels on paper, film, display screen, or other output medium.
  • Any data disclosed herein may be implemented, for example, in one or more data structures tangibly stored on a non-transitory computer-readable medium. Embodiments of the invention may store such data in such data structure(s) and read such data from such data structure(s).

Claims (1)

1. A method performed by at least one computer processor, the method comprising:
(A) obtaining, from a user, first data representing a first part of a sentence, wherein the first part of the sentence comprises a first one of a subject, a predicate, and an object;
(B) obtaining, from the user, second data representing a second part of the sentence, wherein the second part of the sentence comprises a second one of the subject, the predicate, and the object;
(C) identifying a synonym of the first part of the sentence;
(D) identifying a synonym of the second part of the sentence;
(E) forming a first query from the synonym of the first part of the sentence and the synonym of the second part of the sentence, comprising:
(E) (1) selecting a first form for the first query, wherein the first form specifies a first one of the following sequences:
subject, predicate, object;
subject, predicate;
subject, object;
predicate, object;
(E) (2) forming the first query in the first form the forming comprising:
(E) (2) (a) if the first form specifies the sequence subject, predicate, object, then forming the first query to include a subject followed by a predicate followed by an object, wherein the subject, predicate, and object are selected from the first part of the sentence, the second part of the sentence, the synonym of the first part of the sentence, and the synonym of the second part of the sentence;
(E) (2) (b) if the first form specifies the sequence subject, predicate, then forming the first query to include a subject followed by a predicate, wherein the subject and predicate are selected from the first part of the sentence, the second part of the sentence, the synonym of the first part of the sentence, and the synonym of the second part of the sentence;
(E) (2) (c) if the first form specifies the sequence subject, object, then forming the first query to include a subject followed by an object, wherein the subject and object are selected from the first part of the sentence, the second part of the sentence, the synonym of the first part of the sentence, and the synonym of the second part of the sentence; and
(E) (2) (d) if the first form specifies the sequence predicate, object, then forming the first query to include a predicate followed by an object, wherein the predicate and object are selected from the first part of the sentence, the second part of the sentence, the synonym of the first part of the sentence, and the synonym of the second part of the sentence; and
(F) searching a dataset in memory using the first query to identify a first subset of the dataset;
(G) providing, to the user, executed by the computer processor, output representing the subset of the dataset;
wherein the sentence differs from the first query.
US15/355,418 2012-11-21 2016-11-18 Analogy Finder Abandoned US20170068725A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US15/355,418 US20170068725A1 (en) 2012-11-21 2016-11-18 Analogy Finder
US15/717,521 US20180025074A1 (en) 2012-11-21 2017-09-27 Analogy Finder

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261728924P 2012-11-21 2012-11-21
US14/085,897 US9501469B2 (en) 2012-11-21 2013-11-21 Analogy finder
US15/355,418 US20170068725A1 (en) 2012-11-21 2016-11-18 Analogy Finder

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/085,897 Continuation US9501469B2 (en) 2012-11-21 2013-11-21 Analogy finder

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/717,521 Continuation US20180025074A1 (en) 2012-11-21 2017-09-27 Analogy Finder

Publications (1)

Publication Number Publication Date
US20170068725A1 true US20170068725A1 (en) 2017-03-09

Family

ID=50975664

Family Applications (3)

Application Number Title Priority Date Filing Date
US14/085,897 Active 2033-12-11 US9501469B2 (en) 2012-11-21 2013-11-21 Analogy finder
US15/355,418 Abandoned US20170068725A1 (en) 2012-11-21 2016-11-18 Analogy Finder
US15/717,521 Abandoned US20180025074A1 (en) 2012-11-21 2017-09-27 Analogy Finder

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US14/085,897 Active 2033-12-11 US9501469B2 (en) 2012-11-21 2013-11-21 Analogy finder

Family Applications After (1)

Application Number Title Priority Date Filing Date
US15/717,521 Abandoned US20180025074A1 (en) 2012-11-21 2017-09-27 Analogy Finder

Country Status (1)

Country Link
US (3) US9501469B2 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130157242A1 (en) * 2011-12-19 2013-06-20 Sanford, L.P. Generating and evaluating learning activities for an educational environment
US10157350B2 (en) * 2015-03-26 2018-12-18 Tata Consultancy Services Limited Context based conversation system
US9588965B2 (en) 2015-03-30 2017-03-07 Tata Consultancy Services Limited Identifying and characterizing an analogy in a document
US10055400B2 (en) * 2016-11-11 2018-08-21 International Business Machines Corporation Multilingual analogy detection and resolution
US10282603B2 (en) * 2017-01-09 2019-05-07 International Business Machines Corporation Analyzing technical documents against known art
US20180196870A1 (en) * 2017-01-12 2018-07-12 Microsoft Technology Licensing, Llc Systems and methods for a smart search of an electronic document
JP6791825B2 (en) * 2017-09-26 2020-11-25 株式会社日立製作所 Information processing device, dialogue processing method and dialogue system
WO2019140358A1 (en) * 2018-01-12 2019-07-18 Gamalon, Inc. Probabilistic modeling system and method
JP6584622B1 (en) * 2018-04-24 2019-10-02 株式会社Nttドコモ Sentence matching system
US10929602B2 (en) * 2018-08-06 2021-02-23 International Business Machines Corporation Training of cognitive systems
US11302330B2 (en) * 2019-06-03 2022-04-12 Microsoft Technology Licensing, Llc Clarifying questions for rewriting ambiguous user utterance
US11501753B2 (en) * 2019-06-26 2022-11-15 Samsung Electronics Co., Ltd. System and method for automating natural language understanding (NLU) in skill development
US11216473B2 (en) * 2019-09-30 2022-01-04 Lenovo (Singapore) Pte. Ltd. Qualitative modifier in query input
CN112052321A (en) * 2020-09-02 2020-12-08 平安科技(深圳)有限公司 Man-machine conversation method, device, computer equipment and storage medium

Citations (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070282811A1 (en) * 2006-01-03 2007-12-06 Musgrove Timothy A Search system with query refinement and search method
US20080104071A1 (en) * 2006-10-31 2008-05-01 Execue, Inc. System and method for converting a natural language query into a logical query
US20080154828A1 (en) * 2006-12-21 2008-06-26 Support Machines Ltd. Method and a Computer Program Product for Providing a Response to A Statement of a User
US20080195570A1 (en) * 2005-03-31 2008-08-14 Koninklijke Philips Electronics, N.V. System and Method for Collecting Evidence Pertaining to Relationships Between Biomolecules and Diseases
US20090070322A1 (en) * 2007-08-31 2009-03-12 Powerset, Inc. Browsing knowledge on the basis of semantic relations
US20090100053A1 (en) * 2007-10-10 2009-04-16 Bbn Technologies, Corp. Semantic matching using predicate-argument structure
US20090119095A1 (en) * 2007-11-05 2009-05-07 Enhanced Medical Decisions. Inc. Machine Learning Systems and Methods for Improved Natural Language Processing
US20100070517A1 (en) * 2008-09-17 2010-03-18 Oracle International Corporation System and Method for Semantic Search in an Enterprise Application
US20110022635A1 (en) * 2009-07-26 2011-01-27 Moris Michael Method and System to Formulate Queries With Minivisuals
US20110161070A1 (en) * 2009-12-31 2011-06-30 International Business Machines Corporation Pre-highlighting text in a semantic highlighting system
US20110173174A1 (en) * 2010-01-13 2011-07-14 Flitcroft Investments Ltd Linguistically enhanced search engine and meta-search engine
US20110295823A1 (en) * 2010-05-25 2011-12-01 Nokia Corporation Method and apparatus for modeling relations among data items
US20110301941A1 (en) * 2009-03-20 2011-12-08 Syl Research Limited Natural language processing method and system
US20120254143A1 (en) * 2011-03-31 2012-10-04 Infosys Technologies Ltd. Natural language querying with cascaded conditional random fields
US20130151236A1 (en) * 2011-12-09 2013-06-13 Igor Iofinov Computer implemented semantic search methodology, system and computer program product for determining information density in text
US8602793B1 (en) * 2006-07-11 2013-12-10 Erwin Ernest Sniedzins Real time learning and self improvement educational system and method
US20140059078A1 (en) * 2012-08-27 2014-02-27 Microsoft Corporation Semantic query language
US8700620B1 (en) * 2010-04-27 2014-04-15 Jeremy Lieberman Artificial intelligence method and apparatus
US8838659B2 (en) * 2007-10-04 2014-09-16 Amazon Technologies, Inc. Enhanced knowledge repository
US8880547B1 (en) * 2009-07-21 2014-11-04 Amazon Technologies, Inc. Implementation of a search for items
US8909516B2 (en) * 2011-10-27 2014-12-09 Microsoft Corporation Functionality for normalizing linguistic items
US8924198B2 (en) * 2010-07-26 2014-12-30 Radiant Logic, Inc. Searching and browsing of contextual information
US8943094B2 (en) * 2009-09-22 2015-01-27 Next It Corporation Apparatus, system, and method for natural language processing

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6167370A (en) 1998-09-09 2000-12-26 Invention Machine Corporation Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures
US6611825B1 (en) 1999-06-09 2003-08-26 The Boeing Company Method and system for text mining using multidimensional subspaces
US7120574B2 (en) 2000-04-03 2006-10-10 Invention Machine Corporation Synonym extension of search queries with validation
US7962326B2 (en) 2000-04-20 2011-06-14 Invention Machine Corporation Semantic answering system and method
US7398201B2 (en) 2001-08-14 2008-07-08 Evri Inc. Method and system for enhanced data searching
US7567917B2 (en) 2002-03-14 2009-07-28 Kenneth Miller Product design methodology
JP3940707B2 (en) 2003-06-23 2007-07-04 桂子 溝尾 Sentence analysis apparatus and sentence analysis program
US8856163B2 (en) 2003-07-28 2014-10-07 Google Inc. System and method for providing a user interface with search query broadening
US7516114B2 (en) 2004-10-22 2009-04-07 International Business Machines Corporation Visual structuring of multivariable data
EP1952280B8 (en) 2005-10-11 2016-11-30 Ureveal, Inc. System, method&computer program product for concept based searching&analysis
US7672831B2 (en) 2005-10-24 2010-03-02 Invention Machine Corporation System and method for cross-language knowledge searching
US20090112828A1 (en) 2006-03-13 2009-04-30 Answers Corporation Method and system for answer extraction
US8332883B2 (en) 2007-10-02 2012-12-11 The Nielsen Company (Us), Llc Providing actionable insights based on physiological responses from viewers of media
US8347231B2 (en) 2007-10-08 2013-01-01 At&T Intellectual Property I, L.P. Methods, systems, and computer program products for displaying tag words for selection by users engaged in social tagging of content
WO2009073634A1 (en) 2007-11-30 2009-06-11 Emsense Corporation Correlating media instance information with physiological responses from participating subjects
KR101040119B1 (en) 2008-10-14 2011-06-09 한국전자통신연구원 Apparatus and Method for Search of Contents
US20100281025A1 (en) 2009-05-04 2010-11-04 Motorola, Inc. Method and system for recommendation of content items
CN101901230A (en) 2009-05-31 2010-12-01 国际商业机器公司 Information retrieval method, user comment processing method and system thereof
US20110035211A1 (en) 2009-08-07 2011-02-10 Tal Eden Systems, methods and apparatus for relative frequency based phrase mining
US20110190035A1 (en) 2010-02-03 2011-08-04 Research In Motion Limited System and method of enhancing user interface interactions on a mobile device
US8892554B2 (en) 2011-05-23 2014-11-18 International Business Machines Corporation Automatic word-cloud generation
US20130212093A1 (en) 2012-02-15 2013-08-15 International Business Machines Corporation Generating visualizations of a display group of tags representing content instances in objects satisfying a search criteria
US9514461B2 (en) 2012-02-29 2016-12-06 Adobe Systems Incorporated Systems and methods for analysis of content items

Patent Citations (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080195570A1 (en) * 2005-03-31 2008-08-14 Koninklijke Philips Electronics, N.V. System and Method for Collecting Evidence Pertaining to Relationships Between Biomolecules and Diseases
US20070282811A1 (en) * 2006-01-03 2007-12-06 Musgrove Timothy A Search system with query refinement and search method
US8602793B1 (en) * 2006-07-11 2013-12-10 Erwin Ernest Sniedzins Real time learning and self improvement educational system and method
US20080104071A1 (en) * 2006-10-31 2008-05-01 Execue, Inc. System and method for converting a natural language query into a logical query
US20080154828A1 (en) * 2006-12-21 2008-06-26 Support Machines Ltd. Method and a Computer Program Product for Providing a Response to A Statement of a User
US20090070322A1 (en) * 2007-08-31 2009-03-12 Powerset, Inc. Browsing knowledge on the basis of semantic relations
US8838659B2 (en) * 2007-10-04 2014-09-16 Amazon Technologies, Inc. Enhanced knowledge repository
US20090100053A1 (en) * 2007-10-10 2009-04-16 Bbn Technologies, Corp. Semantic matching using predicate-argument structure
US20090119095A1 (en) * 2007-11-05 2009-05-07 Enhanced Medical Decisions. Inc. Machine Learning Systems and Methods for Improved Natural Language Processing
US20100070517A1 (en) * 2008-09-17 2010-03-18 Oracle International Corporation System and Method for Semantic Search in an Enterprise Application
US20110301941A1 (en) * 2009-03-20 2011-12-08 Syl Research Limited Natural language processing method and system
US8880547B1 (en) * 2009-07-21 2014-11-04 Amazon Technologies, Inc. Implementation of a search for items
US20110022635A1 (en) * 2009-07-26 2011-01-27 Moris Michael Method and System to Formulate Queries With Minivisuals
US8943094B2 (en) * 2009-09-22 2015-01-27 Next It Corporation Apparatus, system, and method for natural language processing
US20110161070A1 (en) * 2009-12-31 2011-06-30 International Business Machines Corporation Pre-highlighting text in a semantic highlighting system
US20110173174A1 (en) * 2010-01-13 2011-07-14 Flitcroft Investments Ltd Linguistically enhanced search engine and meta-search engine
US8700620B1 (en) * 2010-04-27 2014-04-15 Jeremy Lieberman Artificial intelligence method and apparatus
US20110295823A1 (en) * 2010-05-25 2011-12-01 Nokia Corporation Method and apparatus for modeling relations among data items
US8924198B2 (en) * 2010-07-26 2014-12-30 Radiant Logic, Inc. Searching and browsing of contextual information
US20120254143A1 (en) * 2011-03-31 2012-10-04 Infosys Technologies Ltd. Natural language querying with cascaded conditional random fields
US8909516B2 (en) * 2011-10-27 2014-12-09 Microsoft Corporation Functionality for normalizing linguistic items
US20130151236A1 (en) * 2011-12-09 2013-06-13 Igor Iofinov Computer implemented semantic search methodology, system and computer program product for determining information density in text
US20140059078A1 (en) * 2012-08-27 2014-02-27 Microsoft Corporation Semantic query language

Also Published As

Publication number Publication date
US20140180677A1 (en) 2014-06-26
US9501469B2 (en) 2016-11-22
US20180025074A1 (en) 2018-01-25

Similar Documents

Publication Publication Date Title
US20180025074A1 (en) Analogy Finder
US10324967B2 (en) Semantic text search
US11080295B2 (en) Collecting, organizing, and searching knowledge about a dataset
US9037529B2 (en) Method for generating visual mapping of knowledge information from parsing of text inputs for subjects and predicates
US7657546B2 (en) Knowledge management system, program product and method
Abbache et al. Arabic query expansion using wordnet and association rules
US20140149390A1 (en) Automatically Providing Relevant Search Results Based on User Behavior
Armentano et al. NLP-based faceted search: Experience in the development of a science and technology search engine
L'Homme et al. Combining the semantics of collocations with situation-driven search paths in specialized dictionaries
Fauzi et al. Image understanding and the web: a state-of-the-art review
US20090240687A1 (en) Method of Processing a Collection of Document Sources
Park et al. Towards ontologies on demand
Othman et al. A multi-lingual approach to improve passage retrieval for automatic question answering
Nadia Design and implementation of information retrieval system based ontology
Yi et al. An empirical examination of the associations between social tags and Web queries
Dinh et al. Sense-based biomedical indexing and retrieval
Qassimi et al. Towards an emergent semantic of web resources using collaborative tagging
Hinze et al. Low-cost semantic enhancement to digital library metadata and indexing: Simple yet effective strategies
Ting et al. Query refinement for ontology information extraction
Briscoe et al. Intelligent information access from scientific papers
Melo Mora et al. Automatic validation of terminology by means of formal concept analysis
Yilmaz et al. Snippet Generation Using Local Alignment for Information Retrieval (LAIR)
Nunes et al. Building Distant Supervised Relation Extractors
Ullah et al. Semantic Space Expansion and Refinement
Zhang Smart Image Search System Using Personalized Semantic Search Method

Legal Events

Date Code Title Description
AS Assignment

Owner name: UNIVERSITY OF MASSACHUSETTS, MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MCCAFFREY, ANTHONY;REEL/FRAME:040368/0414

Effective date: 20130717

Owner name: INNOVATION ACCELERATOR, INC., MASSACHUSETTS

Free format text: LICENSE;ASSIGNOR:UNIVERSITY OF MASSACHUSETTS;REEL/FRAME:040368/0449

Effective date: 20130717

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION