WO2019235131A1 - Search text generation system and search text generation method - Google Patents

Search text generation system and search text generation method Download PDF

Info

Publication number
WO2019235131A1
WO2019235131A1 PCT/JP2019/018693 JP2019018693W WO2019235131A1 WO 2019235131 A1 WO2019235131 A1 WO 2019235131A1 JP 2019018693 W JP2019018693 W JP 2019018693W WO 2019235131 A1 WO2019235131 A1 WO 2019235131A1
Authority
WO
WIPO (PCT)
Prior art keywords
search
sentence
word
input
determined
Prior art date
Application number
PCT/JP2019/018693
Other languages
French (fr)
Japanese (ja)
Inventor
拓生 逸見
佐藤 淳一
Original Assignee
株式会社ユニバーサルエンターテインメント
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社ユニバーサルエンターテインメント filed Critical 株式会社ユニバーサルエンターテインメント
Priority to CN201980037385.XA priority Critical patent/CN112236768A/en
Priority to US17/059,831 priority patent/US20210263915A1/en
Publication of WO2019235131A1 publication Critical patent/WO2019235131A1/en
Priority to PH12020552089A priority patent/PH12020552089A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/243Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/268Morphological analysis

Definitions

  • the present invention relates to a search sentence generation system that generates a search sentence based on a natural sentence input by a user.
  • a user of a commercial database or the like can create a search expression including keywords, logical operators, parentheses, etc., and obtain target information from the commercial database.
  • Patent Document 1 An information retrieval apparatus that automatically creates a retrieval formula based on a natural sentence input by a user has been proposed (see Patent Document 1).
  • This information search device creates a search formula based on a natural text search question entered by the user. More specifically, create a search expression using operators by referring to a search term dictionary that stores synonyms, related terms, and hierarchical relationships of terms included in the search question. Use to search the database to be searched. In addition, after obtaining the search results, the user is inquired whether the number of answers is appropriate, and the search expression is recreated using the search term dictionary according to the answer result of the inquiry.
  • a search expression is created based on a search sentence composed of natural sentences.
  • keywords and operators included in the search expression are appropriately changed according to the answer result of the inquiry by the user. It is configured to be.
  • the information search device refers to the search term dictionary (in parallel) and adds or replaces keywords to replace the operator. Is changed from “AND” to “OR” to re-create a search expression that can obtain a large number of responses.
  • Such an information search apparatus is effective in assisting a user who is unfamiliar with the creation of a search sentence, but on the other hand, the true intention or nuance of the search included in the natural sentence may not be expressed correctly.
  • search word such as “not gorgeous” or “not colorful”, “splendid” or “colored” respectively.
  • a search using a keyword is performed, and the negative meaning of a search word input by a user may be eliminated.
  • an object of the present invention is to provide a search sentence generation system capable of automatically generating an appropriate search sentence based on a natural sentence input by a user.
  • Another object of the present invention is to provide a search sentence generation system capable of generating a search sentence that reflects the true intention and nuance of the search included in the natural sentence based on the natural sentence input by the user. There is.
  • the present invention provides the following search sentence generation system and search sentence generation method.
  • the invention according to the first embodiment of the present invention has the following configuration.
  • An input sentence receiving means for receiving an input sentence of a natural sentence input from a user;
  • An input sentence analyzing means for analyzing the input sentence and dividing it into a plurality of words; Based on the plurality of words, a word (for example, “de” or “other than”) corresponding to an operator (for example, AND or NOT) is determined, and a target word related to the determined word is determined.
  • a determination means A search sentence generation system comprising: an operator corresponding to the determined word; and a search sentence generation means for generating a search sentence corresponding to the input sentence using a set of target words determined to be related to the word (For example, search sentence generation system 11).
  • Such a configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by the user, and the user is not accustomed to creating a search sentence, Even if there is no knowledge, an intended search result can be obtained without creating a complicated search sentence.
  • the invention according to the second embodiment of the present invention has the following configuration in the first embodiment.
  • the determination means determines a word corresponding to an operator of a negative logical operation based on a predetermined determination criterion (for example, a word such as “other than” or “not” is extracted in an input sentence after analysis) In this case, an operator “NOT” is associated with the word).
  • an intended search result can be obtained even when a user's input sentence includes a negative condition.
  • the search sentence generation means uses a predetermined operator based on a relationship between the operator corresponding to the determined word and the operator and the target word in each set of target words determined to be related to the word.
  • the search sentence is generated so that the priority of the logical operation is determined (for example, in FIG. 8B, “clothes NOT blue” of the search sentence 364 and “3” of the partial search sentence 363 (3) “ When “clothes AND” is generated, the target word “clothes” of both is common, and in the search sentence 364, “clothes” is in a relationship modified with the operator “NOT” and the target word “blue”. Therefore, both are collected by “clothes”, and “(clothes NOT blue) AND” of the search sentence 365 is generated).
  • This configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by a user.
  • the invention according to the fourth embodiment of the present invention has the following configuration in the first embodiment.
  • the determination means sequentially determines a word corresponding to an operator for the plurality of words (for example, in step S54 in FIG. 7, sequentially acquires words corresponding to the operator from the analysis result of the input sentence)
  • the search sentence generation unit is configured to generate a search sentence relating to the determined word each time a word corresponding to the operator is determined, and to integrate the search sentence with the generated search sentence. (For example, step S56 in FIG. 7).
  • This configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by a user.
  • the invention according to the fifth embodiment of the present invention has the following configuration.
  • a search statement generation method that is executed in a search statement generation server that generates a search statement based on an input sentence of a natural sentence input by a user, An input sentence receiving step of receiving the input sentence; An input sentence analysis step of analyzing the input sentence and dividing it into a plurality of words; A determination step of determining a word corresponding to an operator based on the plurality of words and determining a target word related to the determined word; A search sentence generation step of generating a search sentence corresponding to the input sentence by using an operator corresponding to the determined word and a set of target words determined to be related to the word. Search sentence generation method.
  • Such a configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by the user, and the user is not accustomed to creating a search sentence, Even if there is no knowledge, an intended search result can be obtained without creating a complicated search sentence.
  • the search sentence generation system makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by the user, and the user can create a normal search sentence without creating a complicated search sentence.
  • a desired search result can be obtained by input that asks in conversation.
  • the search sentence generation system makes it possible to generate a search sentence that reflects the true intention or nuance of the search included in the natural sentence based on the natural sentence input by the user. Even if the user is unfamiliar with the creation of the search sentence or does not have knowledge of the search formula, the user can obtain the intended search result without creating a complicated search sentence.
  • FIG. 1 is a diagram showing an overview of an information providing system 1 that realizes a conversation with a user by providing a response corresponding to the user's input.
  • the information providing system 1 is configured to include a search sentence generation system 11 according to the first embodiment of the present invention.
  • the user 10 inputs a character string (for example, using a keyboard or a touch panel of the user terminal 100) in an input area of a predetermined WEB page displayed by the WEB browser executed on the user terminal 100.
  • the character string input by the user 10 is generally a search sentence for performing an information search on the search target, and is a natural sentence expressed in a natural language.
  • this input sentence may not be in the form of a sentence as long as it can be used for the search. Nor does it need to be a question or question, such as “What is a non-bittering drug?”
  • various forms of input sentences such as an input sentence that forms part of a question sentence, such as “non-bitter medicine” can be accepted.
  • the user 10 interacts with the information providing system 1 in a conversation format, and the search result of the search executed based on the input sentence is displayed on the user terminal 100 as a response to the input sentence input by the user 10.
  • the WEB browser executed on the user terminal 100 is configured to display an input area for inputting an input sentence and a search result, but other applications executed on the user terminal 100 may be displayed. You may make it display said input area and a search result.
  • the input sentence input from the user terminal 100 by the user 10 is provided to the information providing system 1 via the network 500 by the WEB browser of the user terminal 100.
  • the network 500 is a network including the Internet, for example.
  • the information providing system 1 is configured to include an information providing server 200 and a search statement generation system 11.
  • the search sentence generation system 11 includes a search sentence generation server 300, and the search sentence generation server 300 includes analysis data 340 for analyzing an input sentence.
  • the information providing server 200 of the information providing system 1 When the information providing server 200 of the information providing system 1 receives the input sentence from the user terminal 100, the information providing server 200 transmits the input sentence to the search sentence generation system 11, and receives the search sentence generated there from the search sentence generation system 11.
  • the search sentence generation server 300 when an input sentence is received, the search sentence generation server 300 refers to the analysis data 340 and the like and generates a search sentence based on the received input sentence.
  • the information providing server 200 of the information providing system 1 receives the generated search sentence from the search sentence generating system 11, the information providing server 200 executes a search for the search target data 400 using the search sentence, and displays the search result as the user terminal 100. Send to.
  • the search target data 400 is a database or the like arranged outside the information providing system 1, but may be data included in the information providing system 1.
  • the search target data 400 may be a database that stores index information related to resources accessible via the Internet, for example. By searching such a database, it is possible to display on the user terminal 100 a list of WEB pages that match the search statement conditions, link information of WEB pages, contents of individual WEB pages, and the like.
  • the search target data 400 can be configured to include products provided by the company, inventory information of the products, and the like. Thus, the search target data 400 can be configured with various contents.
  • the user 10 may be a general user who browses a WEB page, or a company or organization that provides a WEB site (products or services of For example, a general user who asks questions etc. is assumed, and the user terminal 100 is normally assumed to be a smartphone or a PC (personal computer) owned by the user 10 that can be connected to the Internet or the like.
  • the user 10 inputs an input sentence (for example, using the keyboard or touch panel of the user terminal 100) into the input area of the WEB site and transmits the input sentence to the information providing system 1.
  • the system 1 may receive this input sentence by any method / route. For example, when the user 10 issues a query related to search by voice, the voice is subjected to voice recognition processing and converted into a character string, and the converted character string is provided to the information providing system 1 as an input sentence. can do. Further, it is also conceivable to convert the questionnaire related to the printed search into a character string by OCR (Optical Character Recognition), and provide the converted character string to the information providing system 1 as an input sentence.
  • OCR Optical Character Recognition
  • the response sentence from the information providing system 1 can be provided to the user terminal 100 not only by the character information indicating the search result but also by various methods / routes such as graphics, voice, and video.
  • the information providing server 200 is a WEB server that operates to perform data transmission / reception with a WEB browser executed on the user terminal 100 using a protocol such as http, and to display a predetermined WEB page on the WEB browser. It also has a function.
  • the information providing server 200 and the search statement generation server 300 are each shown here as one computer, but the same function can be distributed and executed by a plurality of computers. Further, the information providing server 200 and the search statement generation server 300 may be integrated and realized by one computer.
  • FIG. 1 shows an example in which the search sentence generation system 11 according to the first embodiment of the present invention is incorporated in the information providing system 1 as described above.
  • the search sentence generation system 11 is various other systems. It is possible to use it.
  • the search sentence generation system 11 is a system that receives an input sentence related to a search, at least a part of which is a natural sentence, and generates an appropriate search sentence based on the input sentence.
  • FIG. 2 is a flowchart showing the flow of response text output processing in the information providing system 1 according to the present embodiment. As described above, this process controls the information providing server 200 to search the content of the search target data 400 based on the input sentence input by the user 10 and to display the search result on the user terminal 100.
  • step S ⁇ b> 11 the information providing server 200 determines whether or not the input sentence of the user 10 is received from the user terminal 100. If the input sentence has not been received (NO in step S11), the determination process is repeated. If the input sentence has been received (YES in step S11), the process proceeds to step S12.
  • step S12 the information providing server 200 transmits the input sentence of the user 10 received from the user terminal 100 to the search sentence generation server 300, where a search sentence based on the input sentence is generated.
  • step S13 the generated search sentence is received from the search sentence generation server 300, and in step S14, the format of the received search sentence is adjusted according to the data management system of the search target data 400.
  • the search statement generation server 300 generates a search statement in a predetermined format, but the data management system of the search target data 400 has a specification that accepts a search statement in a format different from the format of the search statement generation server 300. If so, the format of the generated search statement is changed accordingly.
  • a commercial database there are cases where it is necessary to construct a search sentence in an SQL format including a SELECT sentence, and there are cases where it is necessary to construct it as a search sentence for use by an Internet search engine.
  • step S15 the content of the search target data 400 is searched using the adjusted search text, and the search result is acquired.
  • the search result is, for example, a list of items that satisfy the search statement condition and link information to data representing details of each item.
  • the search target data 400 is a database that stores an index related to a resource that can be accessed via the Internet
  • the search result includes, for example, a list of WEB pages that satisfy the search statement condition and links to each WEB page. Information.
  • step S16 the information providing server 200 edits the received search result.
  • the WEB page configured to display the contents of the search result is edited (in the html language or the like), and then in step S17, Control is performed to output the edited search result to the user terminal 100.
  • FIG. 3 is a flowchart showing the flow of search statement generation processing executed by the search statement generation server 300 according to the first embodiment.
  • a search sentence is generated according to a predetermined rule based on the input sentence, and the generated search sentence is provided as information. Transmit to server 200.
  • step S31 the search statement generation server 300 determines whether or not the input statement of the user 10 has been received from the information providing server 200. If the input sentence has not been received (NO in step S31), the determination process is repeated. If the input sentence has been received (YES in step S31), the process proceeds to step S32.
  • step S32 the search sentence generation server 300 analyzes the received input sentence and extracts a plurality of words from the input sentence.
  • morpheme analysis is performed using a morpheme database included in the analysis data 340.
  • the input sentence is divided into the smallest meaningful units (morphemes). For example, each word is classified and extracted (divided) into parts of speech such as nouns, verbs, particles, auxiliary verbs, and adjectives.
  • morphological analysis is performed.
  • there are various types of analysis such as analysis for matching with a predetermined keyword or character string pattern, and analysis for comparing with a word registered in a dictionary. It is possible to analyze the input sentence by the method and extract a plurality of words.
  • words that are not directly related to the structure of the generated search sentence are deleted or adjusted. For example, in the case of an input sentence such as “What is a non-bitter medicine?”, “Ha”, “what”, “?”, Etc. are not reflected in the search sentence, and are deleted at this stage.
  • nouns, adjectives, and the like are converted into a form (style) suitable for use in a search sentence as necessary.
  • step S33 the word corresponding to the operator included in the search sentence is determined from the analysis result (divided word) of the input sentence.
  • the target word related to the word corresponding to this operator is determined.
  • the target word is a word indicating an element necessary for the calculation by the operator, and is a noun, an adjective or the like (basically) adjacent to the word determined to correspond to the operator.
  • the target word can be defined as a search target item or search value of the search target data 400 in the search sentence.
  • the word “other” obtained by integrating these two words is a noun adjacent (for example, before and after). In consideration of a target word made up of, etc., it is determined to correspond to NOT of the logical operation. Furthermore, when a morpheme such as “de” is extracted from a part of the input sentence, the word “de” corresponds to the AND of the logical operation in consideration of the target word composed of adjacent nouns etc. Is determined.
  • the analysis data 340 is referred to, whether or not the adjective is a negative form, a word corresponding to the operator is determined, and as a result, It is possible to include NOT, and based on the type of the target word adjacent to the word (for example, the type of part of speech), it is possible to determine which logical operation operator the word corresponds to.
  • a specific character string in an input sentence is not simply extracted and associated with an operator, but the true intention or nuance of the input sentence is correctly extracted and reflected in the search sentence.
  • the operator “AND” is simply associated with the character string “to” of the input sentence, or the negative meaning in the search word “not gorgeous” is not reflected in the search sentence.
  • the word corresponding to the operator and the target word related to the word are determined from the words divided into a plurality by the input analysis. This is performed in consideration of related target words, and as a result, the true intention and nuance of the input sentence are correctly reflected in the search sentence.
  • step S34 the frame of the search sentence is determined from the analysis result of the input sentence in step S32 and the operator determined in step S33.
  • the search sentence frame defines a skeleton of the search sentence, and includes a parenthesis arrangement (search order evaluation order), an operator, and a variable (a target word is arranged later) in the search sentence.
  • step S35 based on the analysis result of the input sentence in step S32, the corresponding target word is placed at the position of the variable included in the frame of the search sentence, and the variable is replaced with the target word. Generate a statement.
  • step S36 the generated search text is transmitted to the information providing server 200.
  • the received input sentence is analyzed, the word corresponding to the operator is determined, the frame of the search sentence is determined, and then the search sentence of the search sentence is determined. Replace the variable part of the frame with the target word to complete the final search sentence.
  • a search sentence generation method it is possible to generate a search sentence that reflects the true intention or nuance of the search included in the natural sentence, based on the natural sentence input by the user.
  • FIG. 4 shows an example of a graphical user interface (GUI) of the conversation screen 120 displayed on the display of the user terminal 100.
  • the conversation screen 120 shows a character display unit 121 and an input response display unit 122.
  • the character display unit 121 displays, for example, a concierge character that changes facial expressions and movements in response to input text input by the user 10 and display of search results.
  • the input response display unit 122 displays a natural sentence input sentence input by the user 10 to the user terminal 100.
  • the input response display unit 122 as a concierge response, a message indicating that a search based on the input sentence is being performed and a search sentence generated based on the input sentence are shown. Thereafter, the search result corresponding to the input sentence transmitted from the information providing system 1 is displayed as a concierge response at the bottom.
  • a link button 122a for displaying the number of search hits and detailed information of the search result is shown.
  • the display of the user terminal 100 transitions to another WEB page that displays detailed information of the search result.
  • this WEB page for example, a list of items that satisfy the conditions of the search statement and link information to data representing details of each item are displayed.
  • FIG. 5 explains the search sentence generation process executed by the search sentence generation server 300 according to the first embodiment in more detail using an example input sentence 350.
  • the search statement generation server 300 receives an input sentence 350 of “non-blue clothes made by S company” from the user 10 and performs morphological analysis on the input sentence 350 (FIG. 5A). 3 step S32).
  • the result of the morphological analysis is shown as an analysis result 351.
  • the input sentence 350 is decomposed into, for example, the words “blue”, “other than”, “no”, “clothes”, “de”, “S company”, and “made”.
  • the words “blue”, “other than”, “clothes”, and “S company” are each determined as a noun, and the words “no” and “de” are each determined as a particle (case particle),
  • the word “made” is determined to be a suffix.
  • the word “Company S” is determined as a noun representing the company name by the proper noun dictionary included in the analysis data 340.
  • the word corresponding to the operator and the target word are determined (see step S33 in FIG. 3).
  • “no” as a noun and “no” as a particle are extracted, and the two words are connected (for example, by referring to the analysis data 340) to be an operator. It is determined that it corresponds to “NOT”.
  • the search expression is configured as “Q2 NOT P2” (reference numeral 353 in FIG. 5A).
  • Q2 and P2 are determined as target words related to the word “other than”.
  • Q2 is a noun
  • P2 is a noun or adjective that modifies Q2.
  • the particle “de” is extracted from the analysis result 351, and this word corresponds to “AND”.
  • the search expression is configured as “P1 AND Q1” (reference numeral 352 in FIG. 5A).
  • Q1 and P1 are determined as target words related to the word “de”.
  • Q1 is a noun
  • P1 is an adjective expression, and they are connected in parallel by “AND”.
  • search expressions “Q2 NOT P2” and “P1 AND Q1” are extracted from the analysis result 351 by determining the operator and the target word. These two search expressions are integrated and searched.
  • the sentence frame 354 is determined as “(Y NOT X) AND Z” (see step S34 in FIG. 3).
  • “P1 "And” Q2 "correspond to the same noun” clothes ", and” Q2 "is further modified with” P2 "and the operator” NOT ".
  • the frame 354 of the search sentence is first “(Q2 NOT P2) AND Q1”.
  • the search expression “Q2 NOT P2” is included in parentheses in accordance with the fact that “P1” is associated with “Q2 NOT P2”.
  • the noun “blue” decomposed in the analysis result 351 may be converted into a corresponding adjective “blue” (for example, obtained using a synonym dictionary included in the analysis data 340). it can.
  • the adjective expression “manufactured by S company”, which is a combination of the noun “S company” and the suffix “manufactured” decomposed in the analysis result 351 is associated with the above Z, but the noun “S company”. May be associated with Z.
  • a specific character string in an input sentence is not simply associated with an operator, and the true intention or nuance of the input sentence is correctly extracted and reflected in the search sentence. Can do.
  • FIG. 6 summarizes various cases of the search statement generation processing described with reference to FIG. An operator and a target word are determined based on a word determined in the input sentence and a word adjacent to the word, and a search sentence is finally generated.
  • the search statement generation processing of the present invention is not limited to the cases listed here.
  • Example of Case 1 relates to the operator of the logical operation “NOT”, and this operator represents “Negation”.
  • the word “bitter” of the target word is determined as an adjective (consecutive form) and the word “not” is determined as an adjective (auxiliary adjective) by morphological analysis.
  • the word “medicine” is determined as a noun.
  • the word corresponding to the operator is “none”.
  • the word “absent” is determined to have the meaning of negating the adjective “bitter”, and this word is determined to correspond to the operator “NOT”.
  • “drug” and the end of the adjective “bitter” “bitter” are selected as target words, and a search sentence “drug NOT bitter” is generated.
  • Example of Case 3 relates to an operator of logical operation “OR”, and this operator represents “logical sum”.
  • the word “red” of the target word is determined as an adjective
  • the word “or” is determined as a conjunction
  • the target word “blue” is analyzed by morphological analysis. Is determined as an adjective.
  • the word corresponding to the operator is “or”.
  • the word “or” is determined to mean a logical sum as described above, and this word is determined to correspond to the operator “OR”.
  • partial search sentences “red OR”, “OR blue”, “blue AND clothes” are generated, and these partial search sentences are integrated to generate a final search sentence (here, In the input sentence “blue clothes”, “blue” and “clothes” are the target words, and the word corresponding to the operator is “blue” that modifies “clothes”).
  • the partial search sentence “blue AND clothes” described above modifies the target word “clothes” with the operator “AND” and the target word “blue”.
  • the partial search sentences “red OR” and “OR blue” ” Is integrated with the search sentence“ red OR blue ”in parentheses so that“ red OR blue ”is prioritized in the logical operation.
  • Such use of parentheses in a search sentence is determined from the arrangement relationship and contents of a word corresponding to an operator and a target word adjacent to the word.
  • Cases 2, 4 to 6 also relate to operators of logical operations, and as in case 1, search sentences are assembled based on words corresponding to the operators included in the input sentence. Here, detailed description is omitted.
  • the word corresponding to the operator is “ga” as a particle (case particle).
  • the input sentence is divided into “price”, “ga”, “1”, “ten thousand”, “yen” by morphological analysis, and “price” is adjacent to the word “ga” corresponding to the operator.
  • Examples of cases 7 to 12 also relate to operators for comparison operations, and as in case 6, search sentences are assembled based on words corresponding to the operators included in the input sentence. Here, detailed description is omitted.
  • a search can be performed by combining a plurality of logical operations and comparison operations. For example, when the input sentence is “a non-bitter tablet medicine”, the search sentence can be generated as “drug AND tablet NOT bitter”.
  • FIG. 7 is a flowchart showing the flow of a search statement generation process executed by the search statement generation server 300 'according to the second embodiment of the present invention.
  • the search sentence generation server 300 ′ can be arranged in the information providing system 1 shown in FIG. 1 instead of the search sentence generation server 300 according to the first embodiment of the present invention.
  • the search statement generation process executed by the search statement generation server 300 ′ receives the input sentence of the user 10 from the information providing server 200. Based on the above, a search sentence is generated according to a predetermined rule, and the generated search sentence is transmitted to the information providing server 200.
  • step S51 the search statement generation server 300 'determines whether or not the input statement of the user 10 has been received from the information providing server 200.
  • the determination process is repeated.
  • the process proceeds to step S52.
  • the search sentence generation server 300 'analyzes the received input sentence and extracts a plurality of words from the input sentence.
  • morpheme analysis is performed using a morpheme database included in the analysis data 340.
  • the input sentence is divided into the smallest meaningful units (morphemes). For example, each word is classified and extracted (divided) into parts of speech such as nouns, verbs, particles, auxiliary verbs, and adjectives.
  • morphological analysis is performed.
  • the input sentence is analyzed by various methods such as analysis for matching with a predetermined keyword and analysis for comparing with a word registered in a dictionary. Can be extracted and a plurality of words can be extracted.
  • step S53 it is determined whether or not an unprocessed word (word corresponding to the operator) exists in the analysis result (divided word) of the input sentence. If there is no unprocessed word (NO in step S53), the generation of the search expression has been completed. Therefore, in step S60, the generated search sentence is transmitted to the information providing server 200, and the process of step S51 (information The process of determining whether or not the input sentence of the user 10 has been received from the providing server 200 is repeated.
  • step S54 the word corresponding to the operator included in the search sentence is sequentially determined from the analysis result (divided word) of the input sentence. For example, when continuous morphemes such as “other” and “no” are extracted from a part of the input sentence, the word “other” obtained by integrating these two words is the target word composed of adjacent nouns and the like. In consideration, it is determined that it corresponds to NOT of the logical operation. Furthermore, when a morpheme such as “de” is extracted from a part of the input sentence, the word “de” corresponds to the AND of the logical operation in consideration of the target word composed of adjacent nouns etc. Is determined.
  • the analysis data 340 is referred to, whether or not the adjective is a negative form, a word corresponding to the operator is determined, and as a result, Notation can be included, or based on the type of the target word adjacent to the word, it can be determined which logical operation operator the word corresponds to.
  • step S55 the target word corresponding to the operator is determined from the analysis result of the input sentence in step S52 and the one operator determined in step S54.
  • the target word is a noun, an adjective, or the like adjacent to the word determined to correspond to the operator, and corresponds to the search target item or the search value of the search target data 400 in the search sentence.
  • step S56 search sentences are generated and integrated based on the one operator determined in step S54 and the target word determined in step S55. Note that steps S54 to S56 are repeated while there are unprocessed words (words corresponding to operators) in the analysis result (divided words) of the input sentence. Each time a word corresponding to the word appears, it is sequentially integrated with the generated search sentence.
  • step S57 it is determined whether there is still a target word related to one operator determined in step S54.
  • the process returns to step S53, and whether or not there is a further unprocessed word (word corresponding to the operator) in the input sentence analysis result (divided word). Determine whether.
  • step S58 the target word corresponding to the operator is determined from the analysis result of the input sentence in step S52 and the one operator determined in step S54.
  • step S59 the search sentences are integrated based on the one operator determined in step S54 and the target word determined in step S58. After step S58, the process returns to step S57, and it is determined whether there is still a target word related to one operator determined in step S54.
  • the received input sentence is analyzed, the words corresponding to the operator are sequentially determined, and each time the target word of the operator is determined, A search sentence is assembled based on the operator and the target word, and this process is repeated for all the words corresponding to the operator to complete the final search sentence.
  • a search sentence generation method it is possible to generate a search sentence that reflects the true intention or nuance of the search included in the natural sentence, based on the natural sentence input by the user.
  • FIG. 8 illustrates the search sentence generation process executed by the search sentence generation server 300 ′ according to the second embodiment in more detail using an example input sentence 360.
  • the search statement generation server 300 ′ receives an input sentence 360 “from clothes other than blue and made by S company” from the user 10, and executes morphological analysis on the input sentence 360 ( (See step S52 in FIG. 7).
  • the result of the morphological analysis is shown as an analysis result 361.
  • the input sentence 360 is decomposed into, for example, the words “blue”, “other than”, “no”, “clothes”, “de”, “S company”, and “made”.
  • the words “blue”, “other than”, “clothes”, and “S company” are each determined as a noun, and the words “no” and “de” are each determined as a particle (case particle),
  • the word “made” is determined to be a suffix.
  • the word “Company S” is determined as a noun representing the company name by the proper noun dictionary included in the analysis data 340.
  • the first word that is the operator of the search sentence is determined and acquired (see step S54 in FIG. 7).
  • the noun “other” and the particle “no” are extracted, and the two words are connected (for example, by referring to the analysis data 340) so that the operator It is determined that it corresponds to “NOT”.
  • a target word corresponding to the operator “NOT” that is, a word adjacent to (for example, before and after) a word other than “not” is obtained from the analysis result 361 and determined as a target word (FIG. 7).
  • the word “other than” corresponding to the operator “NOT” and the target word “blue” corresponding to the word “other than” are determined.
  • the determination result 362 of FIG. A set of words other than "" is shown.
  • a search sentence “NOT blue” is generated based on the set of words other than “blue” (see step S56 in FIG. 7), and this search sentence is (1) of the partial search sentence 363 in FIG. 8A. It is expressed as
  • step S57 in FIG. 7 it is determined whether or not there is still a target word corresponding to the word “other than” (see step S57 in FIG. 7).
  • the target word “clothes” is determined, and the determination result in FIG. 362 shows a set of words other than “clothes”.
  • a search sentence “clothes NOT” is generated based on the set of words other than “clothes” (see step S56 of FIG. 7), and this search sentence is the partial search sentence 363 of FIG. 8A (2). It is expressed as
  • (1) and (2) of the partial search sentence 363 are further integrated as shown in FIG. 8B (see step S56 in FIG. 7), and “clothes NOT blue” is generated as the search sentence 364.
  • step S57 in FIG. 7 it is determined whether or not there is still a target word corresponding to the word “other than” (see step S57 in FIG. 7).
  • the target word since the target word does not exist, it becomes an operator of the search sentence.
  • the next word is determined and acquired (see step S54 in FIG. 7).
  • the particle “de” is extracted from the analysis result 361, and the word “de” corresponds to the operator “AND”.
  • an object word corresponding to the operator “AND”, that is, a word adjacent to the word “de” is obtained from the analysis result 361 and determined as an object word (see step S55 in FIG. 7).
  • the word “de” corresponding to the operator “AND” and the target word “clothes” corresponding to the word “de” are determined, and the determination result 362 of FIG. A set of words is represented. Further, a search sentence “clothes AND” is generated based on the set of words “clothes” (see step S56 in FIG. 7). It is represented.
  • (1), (2), and (3) of the partial search sentence 363 are integrated as shown in FIG. 8B (see step S56 in FIG. 7), and “(clothing NOT blue) AND is used as the search sentence 365. Is generated.
  • Such integration processing is performed by integrating the search sentence 364 shown in FIG. 8B and (3) of the partial search sentence 363 shown in FIG. 8A and grouping them together with the common target word “clothes”.
  • “clothes NOT blue” in the search sentence 364 modifies the subject word “clothes” with the operator “NOT” and the target word “blue”, and the search sentence “clothes” that is a partial search sentence.
  • parentheses When integrated with “AND”, as described above, “clothes NOT blue” is included in parentheses so that it is prioritized in the logical operation.
  • Such use of parentheses in a search sentence is determined from the arrangement relationship and contents of a word corresponding to an operator and a target word adjacent to the word.
  • the target word corresponding to the word “de” still exists (see step S57 in FIG. 7).
  • the target words “S company” and “manufactured” are determined.
  • a set of words “made by S company” is represented.
  • “Company S” and “Made” are extracted as morphemes in the analysis result 361, but here “S Company” and “Made” are used to make the target word an adjective expression. Are combined to make “S”. Of course, only “S company” which is a noun can be selected as the target word.
  • a search sentence “AND made by S company” is generated (see step S56 in FIG. 7), and this search sentence is converted to the partial search sentence 363 in FIG. 8A. It is expressed as (4).
  • (1) to (4) of the partial search sentences 363 are integrated as shown in FIG. 8B (see step S56 in FIG. 7), and “(clothing NOT blue) AND S company” is used as the search sentence 366. Is generated.
  • Such integration processing is performed by integrating the search statement 365 shown in FIG. 8B and (4) of the partial search statement 363 shown in FIG. 8A and grouping them together in a common operator “AND” portion.
  • step S57 in FIG. 7 it is determined whether or not the target word corresponding to the word “de” still exists.
  • the target word since the target word does not exist, the next as an operator of the search sentence.
  • the word is determined and acquired (see step S54 in FIG. 7).
  • the search statement 366 "(clothing NOT blue) AND S company" generated by the integration process so far is generated. It is transmitted to the information providing server 200 as a search sentence (see step S60 in FIG. 7).
  • the search statement generation process executed by the search statement generation server 300 ′ according to the second embodiment of the present invention as shown in FIGS. 7 and 8, the entire input sentence is analyzed at once (see FIG. 7 (step S52), and then the search sentence generation processing is performed in order for each word corresponding to the operator (step S53 to step S59 in FIG. 7), while partially analyzing the input sentence, When a word corresponding to the operator is extracted, it is possible to perform control so that a search sentence generation process related to the word is performed each time.
  • the user terminal 100 is a portable device such as a smartphone, for example.
  • the configuration of the user terminal 100 illustrated in FIG. 9 is merely an example of a typical configuration.
  • the user terminal 100 includes a CPU (Central Processing Unit) 101, a memory 102, a camera 103, a GPS control unit 104, a GPS antenna 105, a radio signal processing unit 106, a radio communication antenna 107, an audio control unit 108, a microphone 109, and a speaker. 110, a display control unit 111, an input device interface 112, a touch screen 113, a non-contact IC card reader / writer 114, a sensor 115, an auxiliary storage device 116, and an external recording medium interface 117.
  • a CPU Central Processing Unit
  • the CPU 101 controls the operation of each component of the user terminal 100 and executes each function under the control of the OS.
  • the memory 102 is usually composed of RAM (Random Access Memory).
  • RAM Random Access Memory
  • a program for realizing each function executed by the CPU 101 is loaded at the time of execution, and data necessary for the program is temporarily stored.
  • the camera 103 is a small imaging device built in the user terminal 100.
  • the GPS control unit 104 receives signals from a plurality of GPSs via the GPS antenna 105 and checks the position of the user terminal 100.
  • the wireless signal processing unit 106 communicates with the mobile phone base station via the wireless communication antenna 107, transmits / receives call data to / from other devices, and communicates with other terminals (via the Internet). Controls transmission / reception of WEB pages and mail data.
  • the wireless signal processing unit 106 implements wireless LAN communication with the wireless LAN access point using the wireless communication antenna 107, and performs data transmission / reception via the Internet (here, for convenience, the wireless signal processing unit 106 and the antenna 107 for wireless communication perform wireless communication with the mobile phone base station and wireless communication with the wireless LAN access point).
  • the audio control unit 108 controls the microphone 109 and the speaker 110 to realize a telephone call by wireless communication. On the other hand, the audio control unit 108 controls to output sound when playing a video or music in the application.
  • the touch screen 113 displays information on a display device configured by, for example, an LCD (Liquid Crystal Display), and the position where the user touches (presses) the screen surface with a finger or the like is displayed on the touch screen 113.
  • Detect with capacitive type touch sensor The display control unit 111 processes drawing data issued by the CPU 101 and outputs, for example, a WEB page, a moving image, or the like to the display device of the touch screen 113.
  • the touch sensor of the touch screen 113 detects an operation by the user, and transmits this to the CPU 101 as a predetermined signal.
  • the non-contact IC card reader / writer 114 stores data stored in the IC chip based on a command from the CPU 101 when the card in which the non-contact IC chip is embedded is held over a predetermined position of the user terminal 100. Or write predetermined data to the IC chip.
  • the sensor 115 is a sensor other than the touch sensor of the touch screen 113, and includes a motion sensor, a light sensor, a proximity sensor, and the like.
  • the auxiliary storage device 116 is composed of, for example, a semiconductor memory called a flash memory or a hard disk.
  • the auxiliary storage device 116 stores various data as well as programs for realizing the functions executed by the CPU 101.
  • the external recording medium interface 117 accesses the external recording medium 140 and reads data recorded therein.
  • the external recording medium 140 is, for example, a portable flash memory.
  • FIG. 10 an example of a hardware configuration of a computer constituting the search statement generation server 300 according to the first embodiment of the present invention will be described.
  • the configuration of the search statement generation server 300 illustrated in FIG. 10 is merely an example of a typical configuration.
  • Such a computer configuration may be employed in the search statement generation server 300 ′ according to the second embodiment of the present invention, the information provision server 200 of the information provision system 1, and the like.
  • the search statement generation server 300 includes a CPU 301, a RAM 302, a ROM (Read Only Memory) 303, a network interface 304, an audio control unit 305, a microphone 306, a speaker 307, a display controller 308, a display 309, an input device interface 310, a keyboard 311 and a mouse. 312, an external storage device 313, an external recording medium interface 314, and a bus 315 that connects these components to each other.
  • the CPU 301 controls the operation of each component of the search statement generation server 300, and controls the execution of each function according to the present invention under the control of the OS.
  • the RAM 302 temporarily stores programs for executing each process executed by the CPU 301 and data used during the execution of these programs.
  • the ROM 303 stores a program executed when the search statement generation server 300 is activated.
  • the network interface 304 is an interface for connecting to the network 320.
  • the network 320 is a network for connecting to the information providing server 200, for example.
  • the audio control unit 305 controls the microphone 306 and the speaker 307 to control audio input / output.
  • the display controller 308 is a dedicated controller for actually processing a drawing command issued by the CPU 301.
  • the display 309 is a display device configured by an LCD, for example.
  • the input device interface 310 receives a signal input from the keyboard 311 or the mouse 312 and transmits a predetermined command to the CPU 301 according to the signal pattern.
  • the external storage device 313 is, for example, a storage device such as a hard disk or a semiconductor memory, and the above-described program and data are recorded in this device.
  • the external recording medium interface 314 accesses the external recording medium 330 and reads data recorded therein.
  • the external recording medium 330 is, for example, a portable flash memory.
  • a program executed by the CPU 301 to realize each function of the present invention can be provided from the external recording medium 330 via the external recording medium interface 314.
  • As another distribution form of the program for realizing each function of the present invention a route that is stored in the external storage device 313 or the RAM 302 via a network 320 and a network interface 304 from a predetermined server on the network. Etc. are also conceivable.
  • search statement generation system 11 including the search statement generation server 300 according to the first embodiment of the present invention or the search statement generation server 300 ′ according to the second embodiment of the present invention, and the search statement generation system. 11 has been described with reference to specific examples in which the present invention can be implemented. However, these specific examples are merely examples for explaining the present invention, and the scope of rights of the present invention is limited. It is not limited to these specific examples. The technical idea of the present invention can be realized by various methods and configurations other than these.

Abstract

A problem addressed by the present invention is to provide a search text generation system capable of automatically generating an appropriate search text from natural language input by a user. The search text generation server 300: analyzes received input text; identifies a word corresponding to an operator and a word of interest which is adjacent to said word; and, using the words having thus been identified, automatically generates a search text which corresponds to the input text.

Description

検索文生成システム、及び検索文生成方法Search sentence generation system and search sentence generation method
 ユーザによって入力された自然文に基づいて検索文を生成する検索文生成システムに関する。 The present invention relates to a search sentence generation system that generates a search sentence based on a natural sentence input by a user.
 近年、インターネット上には多くの情報が公開されるようになっており、インターネットユーザは、検索エンジンにキーワードを入力して、インターネット上の膨大なWEBページのなかから所望のWEBページを特定し、必要な情報を閲覧することができる。 In recent years, a lot of information has been released on the Internet, and an Internet user can input a keyword into a search engine and specify a desired WEB page from among a huge number of WEB pages on the Internet. You can browse the necessary information.
 また、商用データベース等の利用者は、キーワード、論理演算子、括弧等を含んだ検索式を作成して、商用データベースのなかから、目的の情報を取得することができる。 Also, a user of a commercial database or the like can create a search expression including keywords, logical operators, parentheses, etc., and obtain target information from the commercial database.
 しかしながら、検索によって所望の情報を取得しようとする場合、上記のようなキーワードの入力や、キーワード、論理演算子、括弧等を含んだ検索式の作成は煩雑であり、また、検索式の作成に慣れた者でないと、検索の意図通りの検索結果を得ることは難しい。 However, when obtaining desired information by searching, it is complicated to input a keyword as described above and to create a search expression including keywords, logical operators, parentheses, etc. Unless you are used to it, it is difficult to obtain search results as intended.
 そこで、例えば、ユーザが入力した自然文に基づいて検索式を自動的に作成する情報検索装置が提案されている(特許文献1参照)。 Therefore, for example, an information retrieval apparatus that automatically creates a retrieval formula based on a natural sentence input by a user has been proposed (see Patent Document 1).
 この情報検索装置は、ユーザが入力した自然文での検索質問に基づいて検索式を作成する。より具体的には、検索質問に含まれる用語の同義語、関連語、階層関係が格納された検索用語辞書を参照して演算子等を使用した検索式を作成し、作成された検索式を使用して検索対象のデータベースを検索する。また、検索結果を得た後、回答件数が適性か否かの問い合わせをユーザに行い、当該問い合わせの回答結果に応じ、上記の検索用語辞書を用いて検索式の再作成を行うように構成される。 This information search device creates a search formula based on a natural text search question entered by the user. More specifically, create a search expression using operators by referring to a search term dictionary that stores synonyms, related terms, and hierarchical relationships of terms included in the search question. Use to search the database to be searched. In addition, after obtaining the search results, the user is inquired whether the number of answers is appropriate, and the search expression is recreated using the search term dictionary according to the answer result of the inquiry. The
特開平5-128158号公報Japanese Patent Laid-Open No. 5-128158
 上述のような情報検索装置では、自然文で構成された検索文に基づいて検索式が作成されるが、検索式に含まれるキーワードや演算子は、ユーザによる問い合わせの回答結果に応じて適宜変更されるように構成されている。 In the information search apparatus as described above, a search expression is created based on a search sentence composed of natural sentences. However, keywords and operators included in the search expression are appropriately changed according to the answer result of the inquiry by the user. It is configured to be.
 例えば、情報検索装置は、ユーザが、検索結果の回答件数が少ないと判断した場合に、検索用語辞書を参照して(並列的に)キーワードを追加したり、上位化したりして置き換え、演算子を「AND」から「OR」に変更して、回答件数が多く得られるような検索式を再作成する。 For example, when the user determines that the number of answers of the search result is small, the information search device refers to the search term dictionary (in parallel) and adds or replaces keywords to replace the operator. Is changed from “AND” to “OR” to re-create a search expression that can obtain a large number of responses.
 このような情報検索装置では、検索文の作成に不慣れなユーザをアシストする意味では有効であるが、一方で、自然文に含まれる検索の真の意図やニュアンスが正しく表現されない場合がある。 Such an information search apparatus is effective in assisting a user who is unfamiliar with the creation of a search sentence, but on the other hand, the true intention or nuance of the search included in the natural sentence may not be expressed correctly.
 また、現在、一般的に用いられているインターネットの検索エンジンでは、例えば、ユーザが「華々しくない」や「彩りが良くない」といった検索ワードを入力すると、それぞれ「華々しい」や「彩り」といったキーワードによる検索が行われ、ユーザによって入力された検索ワードの否定的な意味が排除されてしまうことがある。 In addition, in the currently popular Internet search engines, for example, when a user inputs a search word such as “not gorgeous” or “not colorful”, “splendid” or “colored” respectively. A search using a keyword is performed, and the negative meaning of a search word input by a user may be eliminated.
 このようなインターネットの検索エンジンにおいても、上記のように、自然文に含まれる検索の真の意図やニュアンスが正しく表現されない場合がある。 Even in such an Internet search engine, as described above, the true intention and nuance of the search included in the natural sentence may not be expressed correctly.
 従って、本発明の目的は、ユーザが入力した自然文に基づいて、自動的に適切な検索文を生成することが可能な検索文生成システムを提供することにある。 Therefore, an object of the present invention is to provide a search sentence generation system capable of automatically generating an appropriate search sentence based on a natural sentence input by a user.
 また、本発明の目的は、ユーザが入力した自然文に基づいて、自然文に含まれる検索の真の意図やニュアンスが反映された検索文を生成することが可能な検索文生成システムを提供することにある。 Another object of the present invention is to provide a search sentence generation system capable of generating a search sentence that reflects the true intention and nuance of the search included in the natural sentence based on the natural sentence input by the user. There is.
 本発明は、以下のような検索文生成システム、及び検索文生成方法を提供する。 The present invention provides the following search sentence generation system and search sentence generation method.
 本発明の第1の実施態様に係る発明は、下記の構成を有する。
 ユーザから入力された自然文の入力文を受信する入力文受信手段と、
 前記入力文を解析して、複数の語に分割する入力文解析手段と、
 前記複数の語に基づいて、演算子(例えば、ANDやNOT)に対応する語(例えば、「で」や「以外の」)を判定し、判定された前記語に関連する対象語を判定する判定手段と、
 判定された前記語に対応する演算子と、当該語に関連すると判定された対象語の組を用いて、前記入力文に対応する検索文を生成する検索文生成手段とを備える検索文生成システム(例えば、検索文生成システム11)である。
The invention according to the first embodiment of the present invention has the following configuration.
An input sentence receiving means for receiving an input sentence of a natural sentence input from a user;
An input sentence analyzing means for analyzing the input sentence and dividing it into a plurality of words;
Based on the plurality of words, a word (for example, “de” or “other than”) corresponding to an operator (for example, AND or NOT) is determined, and a target word related to the determined word is determined. A determination means;
A search sentence generation system comprising: an operator corresponding to the determined word; and a search sentence generation means for generating a search sentence corresponding to the input sentence using a set of target words determined to be related to the word (For example, search sentence generation system 11).
 本発明のこのような構成によって、ユーザが入力した自然文に基づいて、自動的に適切な検索文を生成することが可能となり、ユーザは、検索文の作成に不慣れであったり、検索式の知識を有さない場合であっても、複雑な検索文を作成することなく、意図した検索結果を得ることができる。 Such a configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by the user, and the user is not accustomed to creating a search sentence, Even if there is no knowledge, an intended search result can be obtained without creating a complicated search sentence.
 本発明の第2の実施態様に係る発明は、第1の実施態様において下記の構成を有する。
 前記判定手段は、所定の判定基準に基づいて、否定の論理演算の演算子に対応する語を判定する(例えば、解析後の入力文において「以外の」や「ない」といった語が抽出された場合、当該語に、演算子「NOT」が対応付けられる)ように構成される。
The invention according to the second embodiment of the present invention has the following configuration in the first embodiment.
The determination means determines a word corresponding to an operator of a negative logical operation based on a predetermined determination criterion (for example, a word such as “other than” or “not” is extracted in an input sentence after analysis) In this case, an operator “NOT” is associated with the word).
 本発明のこのような構成によって、ユーザの入力文が否定的な条件を含む場合であっても、意図した検索結果を得ることができる。 With such a configuration of the present invention, an intended search result can be obtained even when a user's input sentence includes a negative condition.
 本発明の第3の実施態様に係る発明は、第1の実施態様において下記の構成を有する。
 前記検索文生成手段は、判定された前記語に対応する演算子と、当該語に関連すると判定された対象語の組のそれぞれにおける演算子と対象語の関係に基づいて、所定の演算子による論理演算の優先度が決定されるように、前記検索文を生成するように構成される(例えば、図8Bにおいて、検索文364の「服 NOT 青」と部分検索文363の(3)の「服 AND」が生成されている場合、両者の対象語「服」が共通し、検索文364では、「服」が、演算子「NOT」と対象語「青」で修飾されている関係にあることから、両者を「服」でまとめ、検索文365の「(服 NOT 青) AND」が生成される)。
The invention according to the third embodiment of the present invention has the following configuration in the first embodiment.
The search sentence generation means uses a predetermined operator based on a relationship between the operator corresponding to the determined word and the operator and the target word in each set of target words determined to be related to the word. The search sentence is generated so that the priority of the logical operation is determined (for example, in FIG. 8B, “clothes NOT blue” of the search sentence 364 and “3” of the partial search sentence 363 (3) “ When “clothes AND” is generated, the target word “clothes” of both is common, and in the search sentence 364, “clothes” is in a relationship modified with the operator “NOT” and the target word “blue”. Therefore, both are collected by “clothes”, and “(clothes NOT blue) AND” of the search sentence 365 is generated).
 本発明のこのような構成によって、ユーザが入力した自然文に基づいて、自動的に適切な検索文を生成することが可能となる。 This configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by a user.
 本発明の第4の実施態様に係る発明は、第1の実施態様において下記の構成を有する。
 前記判定手段は、前記複数の語について、演算子に対応する語を順次判定し(例えば、図7のステップS54において、入力文の解析結果から、演算子に対応する語を順に取得する)、
 前記検索文生成手段は、前記演算子に対応する語が判定されるたびに、判定された前記語に関する検索文を生成し、前記検索文を生成済みの検索文と統合するように構成される(例えば、図7のステップS56)。
The invention according to the fourth embodiment of the present invention has the following configuration in the first embodiment.
The determination means sequentially determines a word corresponding to an operator for the plurality of words (for example, in step S54 in FIG. 7, sequentially acquires words corresponding to the operator from the analysis result of the input sentence)
The search sentence generation unit is configured to generate a search sentence relating to the determined word each time a word corresponding to the operator is determined, and to integrate the search sentence with the generated search sentence. (For example, step S56 in FIG. 7).
 本発明のこのような構成によって、ユーザが入力した自然文に基づいて、自動的に適切な検索文を生成することが可能となる。 This configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by a user.
 本発明の第5の実施態様に係る発明は、下記の構成を有する。
 ユーザから入力された自然文の入力文に基づいて検索文を生成する検索文生成サーバにおいて実行される検索文生成方法であって、
 前記入力文を受信する入力文受信ステップと、
 前記入力文を解析して、複数の語に分割する入力文解析ステップと、
 前記複数の語に基づいて、演算子に対応する語を判定し、判定された前記語に関連する対象語を判定する判定ステップと、
 判定された前記語に対応する演算子と、当該語に関連すると判定された対象語の組を用いて、前記入力文に対応する検索文を生成する検索文生成ステップと、を有することを特徴とする検索文生成方法。
The invention according to the fifth embodiment of the present invention has the following configuration.
A search statement generation method that is executed in a search statement generation server that generates a search statement based on an input sentence of a natural sentence input by a user,
An input sentence receiving step of receiving the input sentence;
An input sentence analysis step of analyzing the input sentence and dividing it into a plurality of words;
A determination step of determining a word corresponding to an operator based on the plurality of words and determining a target word related to the determined word;
A search sentence generation step of generating a search sentence corresponding to the input sentence by using an operator corresponding to the determined word and a set of target words determined to be related to the word. Search sentence generation method.
 本発明のこのような構成によって、ユーザが入力した自然文に基づいて、自動的に適切な検索文を生成することが可能となり、ユーザは、検索文の作成に不慣れであったり、検索式の知識を有さない場合であっても、複雑な検索文を作成することなく、意図した検索結果を得ることができる。 Such a configuration of the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by the user, and the user is not accustomed to creating a search sentence, Even if there is no knowledge, an intended search result can be obtained without creating a complicated search sentence.
 本発明に係る検索文生成システムによって、ユーザが入力した自然文に基づいて、自動的に適切な検索文を生成することが可能となり、ユーザは、複雑な検索文を作成することなく、通常の会話で問いかけるような入力によって、所望の検索結果を得ることができる。 The search sentence generation system according to the present invention makes it possible to automatically generate an appropriate search sentence based on a natural sentence input by the user, and the user can create a normal search sentence without creating a complicated search sentence. A desired search result can be obtained by input that asks in conversation.
 また、本発明に係る検索文生成システムによって、ユーザが入力した自然文に基づいて、自然文に含まれる検索の真の意図やニュアンスが反映された検索文を生成することが可能となり、これによって、ユーザは、検索文の作成に不慣れであったり、検索式の知識を有さない場合であっても、複雑な検索文を作成することなく、意図した検索結果を得ることができる。 Further, the search sentence generation system according to the present invention makes it possible to generate a search sentence that reflects the true intention or nuance of the search included in the natural sentence based on the natural sentence input by the user. Even if the user is unfamiliar with the creation of the search sentence or does not have knowledge of the search formula, the user can obtain the intended search result without creating a complicated search sentence.
本発明の第1実施形態に係る検索文生成システムを含む情報提供システムの概要を示す図である。It is a figure which shows the outline | summary of the information provision system containing the search sentence production | generation system which concerns on 1st Embodiment of this invention. 図1の情報提供システムにおける応答文出力処理を例示したフローチャートである。It is the flowchart which illustrated the response sentence output process in the information provision system of FIG. 本発明の第1実施形態に係る検索文生成システムの検索文生成処理を例示したフローチャートである。It is the flowchart which illustrated search text generation processing of the search text generation system concerning a 1st embodiment of the present invention. 図1の情報提供システムでの、ユーザ端末における会話画面のグラフィカルユーザインフェースを例示した図である。It is the figure which illustrated the graphical user interface of the conversation screen in a user terminal in the information provision system of FIG. 本発明の第1実施形態に係る検索文生成サーバの検索文生成処理の概要を例示した図である。It is the figure which illustrated the outline | summary of the search text production | generation process of the search text production | generation server which concerns on 1st Embodiment of this invention. 本発明の第1実施形態に係る検索文生成サーバによる検索文生成例を示した図である。It is the figure which showed the example of search text generation by the search text generation server which concerns on 1st Embodiment of this invention. 本発明の第2実施形態に係る検索文生成サーバの検索文生成処理を例示したフローチャートである。It is the flowchart which illustrated search text generation processing of the search text generation server concerning a 2nd embodiment of the present invention. 本発明の第2実施形態に係る検索文生成サーバの検索文生成処理の概要を例示した図である。It is the figure which illustrated the outline | summary of the search text production | generation process of the search text production | generation server which concerns on 2nd Embodiment of this invention. 本発明の情報提供システムに関連するユーザ端末を構成するコンピュータのハードウェア構成の例を示す図である。It is a figure which shows the example of the hardware constitutions of the computer which comprises the user terminal relevant to the information provision system of this invention. 本発明の検索文生成システムに含まれる検索文生成サーバを構成するコンピュータのハードウェア構成の例を示す図である。It is a figure which shows the example of the hardware constitutions of the computer which comprises the search sentence production | generation server contained in the search sentence production | generation system of this invention.
 最初に、本発明の第1実施形態に係る検索文生成システム11を含む情報提供システム1について、図面を参照して説明する。 First, the information providing system 1 including the search statement generation system 11 according to the first embodiment of the present invention will be described with reference to the drawings.
 図1は、ユーザの入力に対応する応答を提供することで、ユーザとの会話を実現する情報提供システム1の概要を示す図である。情報提供システム1は、本発明の第1実施形態に係る検索文生成システム11を含むように構成されている。 FIG. 1 is a diagram showing an overview of an information providing system 1 that realizes a conversation with a user by providing a response corresponding to the user's input. The information providing system 1 is configured to include a search sentence generation system 11 according to the first embodiment of the present invention.
 ユーザ10は、ユーザ端末100において実行されるWEBブラウザにより表示されている所定のWEBページの入力エリアに、(例えば、ユーザ端末100のキーボードやタッチパネルを用いて)文字列を入力する。ここでユーザ10により入力される文字列は、一般的には、検索対象に対して情報検索を行うための検索文であり、自然言語によって表現される自然文である。 The user 10 inputs a character string (for example, using a keyboard or a touch panel of the user terminal 100) in an input area of a predetermined WEB page displayed by the WEB browser executed on the user terminal 100. Here, the character string input by the user 10 is generally a search sentence for performing an information search on the search target, and is a natural sentence expressed in a natural language.
 ここでは、ユーザ10からこのような情報検索に係る自然文が入力されることが前提であり、以降、ユーザからの入力を「入力文」と表現する。ただし、この入力文は、検索に利用されることが可能な限り、文章の形態でなくてもよい。また、「苦くない薬は何?」といったような、質問文や疑問形である必要もない。例えば、「苦くない薬」といった、質問文の一部を構成する入力文など、様々な形態の入力文を受け付けることができる。 Here, it is assumed that a natural sentence related to such information retrieval is input from the user 10, and hereinafter, the input from the user is expressed as an “input sentence”. However, this input sentence may not be in the form of a sentence as long as it can be used for the search. Nor does it need to be a question or question, such as “What is a non-bittering drug?” For example, various forms of input sentences, such as an input sentence that forms part of a question sentence, such as “non-bitter medicine” can be accepted.
 ユーザ10は、会話形式で情報提供システム1とやりとりを行い、ユーザ10が入力する入力文に対する応答として、その入力文に基づいて実行された検索の検索結果がユーザ端末100に表示される。 The user 10 interacts with the information providing system 1 in a conversation format, and the search result of the search executed based on the input sentence is displayed on the user terminal 100 as a response to the input sentence input by the user 10.
 なお、本実施形態では、ユーザ端末100で実行されるWEBブラウザが、入力文を入力する入力エリアや検索結果を表示するように構成されているが、ユーザ端末100で実行される他のアプリケーションが上記の入力エリアや検索結果の表示を行うようにしてもよい。 In the present embodiment, the WEB browser executed on the user terminal 100 is configured to display an input area for inputting an input sentence and a search result, but other applications executed on the user terminal 100 may be displayed. You may make it display said input area and a search result.
 次に、ユーザ10によりユーザ端末100から入力された入力文は、ユーザ端末100のWEBブラウザによりネットワーク500を介して情報提供システム1に提供される。ここで、ネットワーク500は、例えば、インターネットを含むネットワークである。 Next, the input sentence input from the user terminal 100 by the user 10 is provided to the information providing system 1 via the network 500 by the WEB browser of the user terminal 100. Here, the network 500 is a network including the Internet, for example.
 情報提供システム1は、情報提供サーバ200と検索文生成システム11を含むように構成される。また、検索文生成システム11は、検索文生成サーバ300を含み、検索文生成サーバ300は、入力文を解析するための解析用データ340を備えている。 The information providing system 1 is configured to include an information providing server 200 and a search statement generation system 11. The search sentence generation system 11 includes a search sentence generation server 300, and the search sentence generation server 300 includes analysis data 340 for analyzing an input sentence.
 情報提供システム1の情報提供サーバ200は、ユーザ端末100から入力文を受信すると、この入力文を検索文生成システム11に送信し、そこで生成された検索文を検索文生成システム11から受信する。検索文生成システム11では、入力文を受信すると、検索文生成サーバ300が、解析用データ340等を参照して、受信した入力文に基づいて検索文を生成する。 When the information providing server 200 of the information providing system 1 receives the input sentence from the user terminal 100, the information providing server 200 transmits the input sentence to the search sentence generation system 11, and receives the search sentence generated there from the search sentence generation system 11. In the search sentence generation system 11, when an input sentence is received, the search sentence generation server 300 refers to the analysis data 340 and the like and generates a search sentence based on the received input sentence.
 情報提供システム1の情報提供サーバ200は、検索文生成システム11から、生成された検索文を受信すると、この検索文を用いて、検索対象データ400に対する検索を実行し、検索結果をユーザ端末100に送信する。 When the information providing server 200 of the information providing system 1 receives the generated search sentence from the search sentence generating system 11, the information providing server 200 executes a search for the search target data 400 using the search sentence, and displays the search result as the user terminal 100. Send to.
 本実施形態では、検索対象データ400は、情報提供システム1の外部に配置されたデータベース等であるが、情報提供システム1が備えるデータであってもよい。また、検索対象データ400は、例えば、インターネットを経由してアクセス可能な資源に関するインデックス情報を格納するデータベースであってもよい。このようなデータベースを検索することにより、検索文の条件に合致するWEBページのリスト、WEBページのリンク情報、個々のWEBページの内容等をユーザ端末100に表示させることができる。 In this embodiment, the search target data 400 is a database or the like arranged outside the information providing system 1, but may be data included in the information providing system 1. The search target data 400 may be a database that stores index information related to resources accessible via the Internet, for example. By searching such a database, it is possible to display on the user terminal 100 a list of WEB pages that match the search statement conditions, link information of WEB pages, contents of individual WEB pages, and the like.
 また、検索対象データ400を、企業が提供する商品や、その商品の在庫情報等を含むように構成することもできる。このように、検索対象データ400を、様々な内容で構成することができる。 Also, the search target data 400 can be configured to include products provided by the company, inventory information of the products, and the like. Thus, the search target data 400 can be configured with various contents.
 ここで、情報提供システム1が不特定のユーザに利用可能なものとして提供される場合、ユーザ10としては、WEBページを閲覧する一般ユーザや、WEBサイトを提供する企業や組織(の商品やサービス等)に対して質問等を行う一般ユーザ等が想定され、ユーザ端末100は、通常、インターネット等に接続可能なユーザ10が所有するスマートフォンやPC(パーソナルコンピュータ)等が想定される。 Here, when the information providing system 1 is provided as usable for an unspecified user, the user 10 may be a general user who browses a WEB page, or a company or organization that provides a WEB site (products or services of For example, a general user who asks questions etc. is assumed, and the user terminal 100 is normally assumed to be a smartphone or a PC (personal computer) owned by the user 10 that can be connected to the Internet or the like.
 なお、図1では、ユーザ10が、WEBサイトの入力エリアに入力文を(例えば、ユーザ端末100のキーボードやタッチパネルを用いて)入力し、情報提供システム1に送信する構成としたが、情報提供システム1としては、どのような方法・ルートでこの入力文を受信してもよい。例えば、ユーザ10が音声によって検索に係る質問を発した場合に、この音声に対して音声認識処理を施して文字列に変換し、当該変換された文字列を入力文として情報提供システム1に提供することができる。また、印字された検索に係る質問事項等をOCR(Optical Character Recognition)によって文字列に変換し、当該変換された文字列を入力文として情報提供システム1に提供するといったことも考えられる。 In FIG. 1, the user 10 inputs an input sentence (for example, using the keyboard or touch panel of the user terminal 100) into the input area of the WEB site and transmits the input sentence to the information providing system 1. The system 1 may receive this input sentence by any method / route. For example, when the user 10 issues a query related to search by voice, the voice is subjected to voice recognition processing and converted into a character string, and the converted character string is provided to the information providing system 1 as an input sentence. can do. Further, it is also conceivable to convert the questionnaire related to the printed search into a character string by OCR (Optical Character Recognition), and provide the converted character string to the information providing system 1 as an input sentence.
 また、情報提供システム1からの応答文も、検索結果を表す文字情報だけでなく、図形、音声、映像等、様々な方法・ルートでユーザ端末100に提供されうる。 In addition, the response sentence from the information providing system 1 can be provided to the user terminal 100 not only by the character information indicating the search result but also by various methods / routes such as graphics, voice, and video.
 なお、本実施形態の情報提供サーバ200は、ユーザ端末100において実行されるWEBブラウザとhttp等のプロトコルによってデータ送受信を行い、当該WEBブラウザに所定のWEBページを表示させるように動作するWEBサーバの機能も有している。 The information providing server 200 according to the present embodiment is a WEB server that operates to perform data transmission / reception with a WEB browser executed on the user terminal 100 using a protocol such as http, and to display a predetermined WEB page on the WEB browser. It also has a function.
 また、情報提供サーバ200と検索文生成サーバ300は、ここでは、それぞれ1つのコンピュータとして示されているが、複数のコンピュータによって同様の機能を分散して実行するよう構成することもできる。また、情報提供サーバ200と検索文生成サーバ300を統合して1つのコンピュータで実現するようにしてもよい。 Also, the information providing server 200 and the search statement generation server 300 are each shown here as one computer, but the same function can be distributed and executed by a plurality of computers. Further, the information providing server 200 and the search statement generation server 300 may be integrated and realized by one computer.
 また、図1では、本発明の第1実施形態に係る検索文生成システム11を、上記のような情報提供システム1に組み込んだ例を示したが、検索文生成システム11を他の様々なシステムに利用することが可能である。検索文生成システム11は、少なくとも一部が自然文である、検索に係る入力文を受信し、その入力文に基づいて適切な検索文を生成するシステムである。 FIG. 1 shows an example in which the search sentence generation system 11 according to the first embodiment of the present invention is incorporated in the information providing system 1 as described above. However, the search sentence generation system 11 is various other systems. It is possible to use it. The search sentence generation system 11 is a system that receives an input sentence related to a search, at least a part of which is a natural sentence, and generates an appropriate search sentence based on the input sentence.
 図2は、本実施形態に係る情報提供システム1における応答文出力処理の流れを示すフローチャートである。この処理は、上述したように、情報提供サーバ200が、ユーザ10の入力した入力文に基づいて検索対象データ400の内容を検索し、検索結果をユーザ端末100に表示させるよう制御する。 FIG. 2 is a flowchart showing the flow of response text output processing in the information providing system 1 according to the present embodiment. As described above, this process controls the information providing server 200 to search the content of the search target data 400 based on the input sentence input by the user 10 and to display the search result on the user terminal 100.
 最初に、ステップS11において、情報提供サーバ200は、ユーザ端末100からユーザ10の入力文を受信したか否かを判定する。入力文を受信していない場合は(ステップS11のNO)、当該判定処理を繰り返し、受信している場合は(ステップS11のYES)、ステップS12に進む。 First, in step S <b> 11, the information providing server 200 determines whether or not the input sentence of the user 10 is received from the user terminal 100. If the input sentence has not been received (NO in step S11), the determination process is repeated. If the input sentence has been received (YES in step S11), the process proceeds to step S12.
 ステップS12において、情報提供サーバ200は、ユーザ端末100から受信したユーザ10の入力文を検索文生成サーバ300に送信し、そこで、入力文に基づいた検索文が生成される。 In step S12, the information providing server 200 transmits the input sentence of the user 10 received from the user terminal 100 to the search sentence generation server 300, where a search sentence based on the input sentence is generated.
 次に、ステップS13において、検索文生成サーバ300から、生成された検索文を受信し、ステップS14において、受信した検索文の形式を、検索対象データ400のデータ管理システムに合わせて調整する。例えば、検索文生成サーバ300では、所定の形式で検索文が生成されるが、検索対象データ400のデータ管理システムが、検索文生成サーバ300の形式とは異なる形式の検索文を受け付ける仕様となっている場合、これに合わせて、生成された検索文のフォーマットを変更する。例えば、商用データベースにおいて、SELECT文を含むSQLのフォーマットで検索文を構成する必要がある場合もあるし、インターネットの検索エンジンで利用するための検索文として構成する必要がある場合もある。 Next, in step S13, the generated search sentence is received from the search sentence generation server 300, and in step S14, the format of the received search sentence is adjusted according to the data management system of the search target data 400. For example, the search statement generation server 300 generates a search statement in a predetermined format, but the data management system of the search target data 400 has a specification that accepts a search statement in a format different from the format of the search statement generation server 300. If so, the format of the generated search statement is changed accordingly. For example, in a commercial database, there are cases where it is necessary to construct a search sentence in an SQL format including a SELECT sentence, and there are cases where it is necessary to construct it as a search sentence for use by an Internet search engine.
 次に、ステップS15において、調整後の検索文を用いて、検索対象データ400の内容を検索し、検索結果を取得する。検索結果は、例えば、検索文の条件を満たした項目の一覧と、各項目の詳細を表すデータへのリンク情報である。検索対象データ400が、インターネットを経由してアクセス可能な資源に関するインデックスを格納するデータベースである場合、検索結果は、例えば、検索文の条件を満たしたWEBページのリストと、各WEBページへのリンク情報である。 Next, in step S15, the content of the search target data 400 is searched using the adjusted search text, and the search result is acquired. The search result is, for example, a list of items that satisfy the search statement condition and link information to data representing details of each item. When the search target data 400 is a database that stores an index related to a resource that can be accessed via the Internet, the search result includes, for example, a list of WEB pages that satisfy the search statement condition and links to each WEB page. Information.
 次に、ステップS16において、情報提供サーバ200は、受信した検索結果を編集する。例えば、ユーザ端末100がWEBブラウザによってユーザ10の入力文を送信してきた場合は、検索結果の内容を表示するよう構成されたWEBページを(html言語等で)編集し、その後、ステップS17において、当該編集された検索結果をユーザ端末100に出力するよう制御する。 Next, in step S16, the information providing server 200 edits the received search result. For example, when the user terminal 100 transmits the input sentence of the user 10 by the WEB browser, the WEB page configured to display the contents of the search result is edited (in the html language or the like), and then in step S17, Control is performed to output the edited search result to the user terminal 100.
 図3は、第1実施形態に係る検索文生成サーバ300で実行される検索文生成処理の流れを示すフローチャートである。この処理は、上述したように、情報提供サーバ200から、ユーザ10の入力文を受信した場合に、その入力文に基づいて所定のルールにより検索文を生成し、生成された検索文を情報提供サーバ200に送信する。 FIG. 3 is a flowchart showing the flow of search statement generation processing executed by the search statement generation server 300 according to the first embodiment. In this process, as described above, when an input sentence of the user 10 is received from the information providing server 200, a search sentence is generated according to a predetermined rule based on the input sentence, and the generated search sentence is provided as information. Transmit to server 200.
 最初に、ステップS31において、検索文生成サーバ300は、情報提供サーバ200からユーザ10の入力文を受信したか否かを判定する。入力文を受信していない場合は(ステップS31のNO)、当該判定処理を繰り返し、受信している場合は(ステップS31のYES)、ステップS32に進む。 First, in step S31, the search statement generation server 300 determines whether or not the input statement of the user 10 has been received from the information providing server 200. If the input sentence has not been received (NO in step S31), the determination process is repeated. If the input sentence has been received (YES in step S31), the process proceeds to step S32.
 ステップS32において、検索文生成サーバ300は、受信した入力文の解析を行って、当該入力文から複数の語を抽出する。本実施形態では、解析用データ340に含まれる形態素データベースを用いて形態素解析を行う。これによって、入力文は、意味を持つ最小の単位(形態素)に分割される。例えば、それぞれの語が、名詞、動詞、助詞、助動詞、形容詞等の品詞に分類され抽出(分割)される。なお、本実施形態では、形態素解析が行われるように構成されるが、所定のキーワードや文字列パターンとの照合を行う解析や、辞書に登録されたワードとの比較を行う解析など、様々な方法で入力文の解析を行い、複数の語を抽出することができる。 In step S32, the search sentence generation server 300 analyzes the received input sentence and extracts a plurality of words from the input sentence. In the present embodiment, morpheme analysis is performed using a morpheme database included in the analysis data 340. As a result, the input sentence is divided into the smallest meaningful units (morphemes). For example, each word is classified and extracted (divided) into parts of speech such as nouns, verbs, particles, auxiliary verbs, and adjectives. In this embodiment, morphological analysis is performed. However, there are various types of analysis, such as analysis for matching with a predetermined keyword or character string pattern, and analysis for comparing with a word registered in a dictionary. It is possible to analyze the input sentence by the method and extract a plurality of words.
 また、このとき、生成される検索文の構成に直接関係しない語は削除、又は調整される。例えば、「苦くない薬は何?」といった入力文の場合、「は」、「何」、「?」等は、検索文に反映されないので、この段階で削除される。また、名詞や形容詞等は、必要に応じて、検索文で使用するのに好適な形式(文体)に変換される。 At this time, words that are not directly related to the structure of the generated search sentence are deleted or adjusted. For example, in the case of an input sentence such as “What is a non-bitter medicine?”, “Ha”, “what”, “?”, Etc. are not reflected in the search sentence, and are deleted at this stage. In addition, nouns, adjectives, and the like are converted into a form (style) suitable for use in a search sentence as necessary.
 次に、ステップS33において、入力文の解析結果(分割された語)から、検索文に含まれる演算子に対応する語を判定する。また、このとき、この演算子に対応する語に関連する対象語が判定される。ここで、対象語は、演算子による演算で必要となる要素を示す語であって、演算子に対応すると判定された語に(基本的に)隣接する名詞や形容詞等である。対象語は、検索文においては、検索対象データ400の検索対象項目や、検索値として定義されうる。 Next, in step S33, the word corresponding to the operator included in the search sentence is determined from the analysis result (divided word) of the input sentence. At this time, the target word related to the word corresponding to this operator is determined. Here, the target word is a word indicating an element necessary for the calculation by the operator, and is a noun, an adjective or the like (basically) adjacent to the word determined to correspond to the operator. The target word can be defined as a search target item or search value of the search target data 400 in the search sentence.
 例えば、入力文の一部から、「以外」「の」といった連続する形態素が抽出されている場合、この2つの語を統合した「以外の」の語は、隣接する(例えば、前後の)名詞等からなる対象語も考慮したうえで、論理演算のNOTに対応するものと判定する。さらに、入力文の一部から、「で」といった形態素が抽出されている場合、「で」の語は、隣接する名詞等からなる対象語も考慮したうえで、論理演算のANDに対応するものと判定する。 For example, when continuous morphemes such as “other than” “no” are extracted from a part of the input sentence, the word “other” obtained by integrating these two words is a noun adjacent (for example, before and after). In consideration of a target word made up of, etc., it is determined to correspond to NOT of the logical operation. Furthermore, when a morpheme such as “de” is extracted from a part of the input sentence, the word “de” corresponds to the AND of the logical operation in consideration of the target word composed of adjacent nouns etc. Is determined.
 さらに、本実施形態は、必要に応じて、解析用データ340を参照し、形容詞が否定形であるか否かを判定して、演算子に対応する語を判定し、結果として、検索文にNOTを含めるようにしたり、語に隣接する対象語の種類(例えば、品詞の種類)に基づいて、当該語がどの論理演算の演算子に対応するかを判定することができる。 Furthermore, according to the present embodiment, if necessary, the analysis data 340 is referred to, whether or not the adjective is a negative form, a word corresponding to the operator is determined, and as a result, It is possible to include NOT, and based on the type of the target word adjacent to the word (for example, the type of part of speech), it is possible to determine which logical operation operator the word corresponds to.
 このような本発明の構成により、入力文のなかの特定の文字列を単純に取り出して演算子に対応付けるということがなく、入力文の真の意図やニュアンスを正しく抽出して検索文に反映することができる。例えば、従来の技術では、入力文の「と」の文字列に演算子の「AND」が単純に対応付けられたり、「華々しくない」といった検索ワードにおける否定的な意味が検索文に反映されないが、本発明では、上記のように、入力解析によって複数に分割された語のなかから、演算子に対応する語と、その語に関連する対象語が判定され、演算子自体の判定は、関連する対象語を考慮して行われ、この結果、入力文の真の意図やニュアンスが検索文に正しく反映される。 With this configuration of the present invention, a specific character string in an input sentence is not simply extracted and associated with an operator, but the true intention or nuance of the input sentence is correctly extracted and reflected in the search sentence. be able to. For example, in the conventional technology, the operator “AND” is simply associated with the character string “to” of the input sentence, or the negative meaning in the search word “not gorgeous” is not reflected in the search sentence. However, in the present invention, as described above, the word corresponding to the operator and the target word related to the word are determined from the words divided into a plurality by the input analysis. This is performed in consideration of related target words, and as a result, the true intention and nuance of the input sentence are correctly reflected in the search sentence.
 次に、ステップS34において、ステップS32による入力文の解析結果と、ステップS33で判定された演算子とから、検索文のフレームを決定する。検索文フレームは、検索文の骨格を定義するもので、検索文における括弧の配置(検索式の評価順)、演算子、及び変数(後で対象語が配置される)で構成される。 Next, in step S34, the frame of the search sentence is determined from the analysis result of the input sentence in step S32 and the operator determined in step S33. The search sentence frame defines a skeleton of the search sentence, and includes a parenthesis arrangement (search order evaluation order), an operator, and a variable (a target word is arranged later) in the search sentence.
 次に、ステップS35において、ステップS32による入力文の解析結果に基づいて、検索文のフレームに含まれる変数の位置に、対応する対象語を配置して変数を対象語に置き換え、最終的な検索文を生成する。 Next, in step S35, based on the analysis result of the input sentence in step S32, the corresponding target word is placed at the position of the variable included in the frame of the search sentence, and the variable is replaced with the target word. Generate a statement.
 次に、ステップS36において、生成された検索文を情報提供サーバ200に送信する。 Next, in step S36, the generated search text is transmitted to the information providing server 200.
 上述したように、第1実施形態に係る検索文生成サーバ300では、受信した入力文を解析して、演算子に対応する語を判定し、検索文のフレームを決定してから当該検索文のフレームの変数の部分を対象語に置き換え、最終的な検索文を完成させる。このような検索文生成方法により、ユーザが入力した自然文に基づいて、自然文に含まれる検索の真の意図やニュアンスが反映された検索文が生成可能となる。 As described above, in the search sentence generation server 300 according to the first embodiment, the received input sentence is analyzed, the word corresponding to the operator is determined, the frame of the search sentence is determined, and then the search sentence of the search sentence is determined. Replace the variable part of the frame with the target word to complete the final search sentence. With such a search sentence generation method, it is possible to generate a search sentence that reflects the true intention or nuance of the search included in the natural sentence, based on the natural sentence input by the user.
 なお、第1実施形態に係る検索文生成サーバ300における検索文生成に関する、より具体的な方法については、後で、図5を参照して詳細に説明する。 Note that a more specific method related to search sentence generation in the search sentence generation server 300 according to the first embodiment will be described later in detail with reference to FIG.
 図4には、ユーザ端末100のディスプレイに表示される、会話画面120のグラフィカルユーザインフェース(GUI)の一例が示されている。会話画面120には、キャラクタ表示部121、入力応答表示部122が示されている。キャラクタ表示部121には、例えば、ユーザ10による入力文の入力や、検索結果の表示に対応して表情や動きを変化させるコンシェルジュのキャラクタが表示される。入力応答表示部122には、ユーザ10がユーザ端末100に入力した自然文の入力文が表示される。 FIG. 4 shows an example of a graphical user interface (GUI) of the conversation screen 120 displayed on the display of the user terminal 100. The conversation screen 120 shows a character display unit 121 and an input response display unit 122. The character display unit 121 displays, for example, a concierge character that changes facial expressions and movements in response to input text input by the user 10 and display of search results. The input response display unit 122 displays a natural sentence input sentence input by the user 10 to the user terminal 100.
 さらに、入力応答表示部122には、コンシェルジュの応答として、その入力文に基づいた検索が行われている旨のメッセージと、入力文に基づいて生成された検索文が示されている。その後、その下部に、コンシェルジュの応答として、情報提供システム1から送信されてきた、入力文に対応するの検索結果が表示される。 Furthermore, in the input response display unit 122, as a concierge response, a message indicating that a search based on the input sentence is being performed and a search sentence generated based on the input sentence are shown. Thereafter, the search result corresponding to the input sentence transmitted from the information providing system 1 is displayed as a concierge response at the bottom.
 また、図4に示すように、情報提供システム1からの検索結果としては、検索のヒット件数と、検索結果の詳細な情報を表示するためのリンクボタン122aが示されている。例えば、ユーザ10がリンクボタン122aを押下(又はタッチ)すると、ユーザ端末100の表示が、検索結果の詳細情報を表示する別のWEBページに遷移する。このWEBページには、例えば、検索文の条件を満たした項目の一覧と、各項目の詳細を表すデータへのリンク情報が表示される。 Further, as shown in FIG. 4, as a search result from the information providing system 1, a link button 122a for displaying the number of search hits and detailed information of the search result is shown. For example, when the user 10 presses (or touches) the link button 122a, the display of the user terminal 100 transitions to another WEB page that displays detailed information of the search result. On this WEB page, for example, a list of items that satisfy the conditions of the search statement and link information to data representing details of each item are displayed.
 図5は、第1実施形態に係る検索文生成サーバ300で実行される検索文生成処理について、例となる入力文350を用いて、より詳細に説明するものである。 FIG. 5 explains the search sentence generation process executed by the search sentence generation server 300 according to the first embodiment in more detail using an example input sentence 350.
 図5Aに示すように、検索文生成サーバ300は、ユーザ10から「青以外の服でS社製」との入力文350を受信し、その入力文350に対して形態素解析を実行する(図3のステップS32参照)。図5Aでは、形態素解析の結果が、解析結果351として示されている。 As shown in FIG. 5A, the search statement generation server 300 receives an input sentence 350 of “non-blue clothes made by S company” from the user 10 and performs morphological analysis on the input sentence 350 (FIG. 5A). 3 step S32). In FIG. 5A, the result of the morphological analysis is shown as an analysis result 351.
 形態素解析により、入力文350は、例えば、「青」、「以外」、「の」、「服」、「で」、「S社」、「製」の各語に分解される。ここで、例えば、「青」、「以外」、「服」、「S社」の語はそれぞれ名詞と判定され、「の」、「で」の語はそれぞれ助詞(格助詞)と判定され、「製」の語は接尾辞と判定される。なお、「S社」の語は、解析用データ340に含まれる固有名詞辞書により、社名を表す名詞と判定される。 By the morphological analysis, the input sentence 350 is decomposed into, for example, the words “blue”, “other than”, “no”, “clothes”, “de”, “S company”, and “made”. Here, for example, the words “blue”, “other than”, “clothes”, and “S company” are each determined as a noun, and the words “no” and “de” are each determined as a particle (case particle), The word “made” is determined to be a suffix. The word “Company S” is determined as a noun representing the company name by the proper noun dictionary included in the analysis data 340.
 次に、演算子に対応する語と対象語についての判定を行う(図3のステップS33参照)。最初に、解析結果351から、名詞である「以外」と助詞の「の」が抽出され、この2つの語は、(例えば、解析用データ340を参照することによって)連結されることで演算子「NOT」に対応すると判定される。自然文が、「P2以外のQ2」となっていれば、検索式は「Q2 NOT P2」として構成される(図5Aの符号353)。このような構成において、「以外の」の語に関連する対象語として、Q2とP2が判定される。この例では、Q2は名詞であり、P2は、Q2を修飾する名詞や形容詞である。 Next, the word corresponding to the operator and the target word are determined (see step S33 in FIG. 3). First, from the analysis result 351, “no” as a noun and “no” as a particle are extracted, and the two words are connected (for example, by referring to the analysis data 340) to be an operator. It is determined that it corresponds to “NOT”. If the natural sentence is “Q2 other than P2,” the search expression is configured as “Q2 NOT P2” (reference numeral 353 in FIG. 5A). In such a configuration, Q2 and P2 are determined as target words related to the word “other than”. In this example, Q2 is a noun, and P2 is a noun or adjective that modifies Q2.
 次に、解析結果351から、助詞の「で」が抽出され、この語は、「AND」に対応する。自然文が、「P1でQ1」となっていれば、検索式は「P1 AND Q1」として構成される(図5Aの符号352)。このような構成において、「で」の語に関連する対象語として、Q1とP1が判定される。この例では、Q1は名詞であり、P1は形容詞的表現であり、「AND」で接続される、並列の関係となっている。 Next, the particle “de” is extracted from the analysis result 351, and this word corresponds to “AND”. If the natural sentence is “Q1 in P1,” the search expression is configured as “P1 AND Q1” (reference numeral 352 in FIG. 5A). In such a configuration, Q1 and P1 are determined as target words related to the word “de”. In this example, Q1 is a noun, P1 is an adjective expression, and they are connected in parallel by “AND”.
 このようにして、解析結果351から、演算子と対象語の判定により、検索式「Q2 NOT P2」と「P1 AND Q1」が抽出されるが、これらの2つの検索式を統合して、検索文のフレーム354は、「(Y NOT X) AND Z」のように決定される(図3のステップS34参照)。ここでは、解析結果351に含まれる、演算子に対応する語(「以外の」と「で」)、及びその語に隣接する(例えば、前後の)対象語の配置関係と内容から、「P1」と「Q2」は同じ名詞「服」に対応し、「Q2」はさらに、「P2」と演算子「NOT」で修飾されている。 In this way, the search expressions “Q2 NOT P2” and “P1 AND Q1” are extracted from the analysis result 351 by determining the operator and the target word. These two search expressions are integrated and searched. The sentence frame 354 is determined as “(Y NOT X) AND Z” (see step S34 in FIG. 3). Here, from the arrangement relationship and contents of the words (“other than” and “de”) corresponding to the operator and the target words adjacent to (for example, before and after) included in the analysis result 351, “P1 "And" Q2 "correspond to the same noun" clothes ", and" Q2 "is further modified with" P2 "and the operator" NOT ".
 その結果、「P1」は、「Q2 NOT P2」に対応付けられるので、検索文のフレーム354は、まず、「(Q2 NOT P2) AND Q1」となる。ここで、検索式「Q2 NOT P2」は、「P1」が「Q2 NOT P2」に対応付けられたことに応じて括弧書きで組み込まれ、最終的に生成される検索文では、当該括弧書きのなかの論理演算が優先される。なお、論理演算の優先度が高いからといって、当該論理演算が先に実行されるとは限らない。検索文を解釈する検索エンジン等で、同義となる別の検索式に展開される可能性があるからである。 As a result, since “P1” is associated with “Q2 NOT P2”, the frame 354 of the search sentence is first “(Q2 NOT P2) AND Q1”. Here, the search expression “Q2 NOT P2” is included in parentheses in accordance with the fact that “P1” is associated with “Q2 NOT P2”. Some logical operations are prioritized. Note that just because a logical operation has a high priority, the logical operation is not always executed first. This is because a search engine or the like that interprets a search sentence may be expanded into another search expression having the same meaning.
 その後、Q2=Y、P2=X、Q1=Zによる置き換えで、上記のような「(Y NOT X) AND Z」が決定される(なお、上記のような、X、Y、Zによる置き換えは、説明を分かり易くするためのものに過ぎない)。 After that, by replacing with Q2 = Y, P2 = X, Q1 = Z, “(Y NOT X) AND Z” as described above is determined (Note that replacement with X, Y, Z as described above is Just to make the explanation easier to understand).
 次に、図5Bに示すように、解析結果351に基づいて、上記のX、Y、Zを当てはめると、X=「青」、Y=「服」、Z=「S社製」となり、最終的に生成される検索文355は、「(服 NOT 青) AND S社製」、となる。 Next, as shown in FIG. 5B, based on the analysis result 351, when X, Y, and Z are applied, X = “blue”, Y = “clothes”, Z = “manufactured by S company”. The search statement 355 generated automatically is “(clothing NOT blue) AND made by S company”.
 なお、解析結果351において分解された名詞の「青」は、(例えば、解析用データ340に含まれる同義語辞書等を用いて求められる)対応する形容詞の連体形「青い」に変換することができる。また、ここでは、解析結果351において分解された名詞「S社」と接尾辞「製」を統合した形容詞的表現「S社製」を上記のZに対応付けているが、名詞「S社」をZに対応付けてもよい。 The noun “blue” decomposed in the analysis result 351 may be converted into a corresponding adjective “blue” (for example, obtained using a synonym dictionary included in the analysis data 340). it can. Here, the adjective expression “manufactured by S company”, which is a combination of the noun “S company” and the suffix “manufactured” decomposed in the analysis result 351, is associated with the above Z, but the noun “S company”. May be associated with Z.
 以上のような検索文生成処理によって、ユーザ10が入力した「青以外の服でS社製」という、検索に係る入力文に基づいて、「(服 NOT 青) AND S社製」といった検索文355が生成されることになる。 Based on the input sentence related to the search, “clothes other than blue made by S company” input by the user 10 through the above-described search sentence generation processing, a search sentence such as “(clothes NOT blue) AND made by S company”. 355 will be generated.
 このような本発明の構成により、入力文のなかの特定の文字列が単に演算子に対応付けられるということがなく、入力文の真の意図やニュアンスを正しく抽出して検索文に反映することができる。 With such a configuration of the present invention, a specific character string in an input sentence is not simply associated with an operator, and the true intention or nuance of the input sentence is correctly extracted and reflected in the search sentence. Can do.
 図6は、図5を参照して説明した検索文生成処理について、様々なケースをまとめたものである。入力文において判定された語やその語に隣接する語に基づいて、演算子と対象語が決定され、最終的に検索文が生成される。ただし、本発明の検索文生成処理は、ここで列挙したケースに限られるものではない。 FIG. 6 summarizes various cases of the search statement generation processing described with reference to FIG. An operator and a target word are determined based on a word determined in the input sentence and a word adjacent to the word, and a search sentence is finally generated. However, the search statement generation processing of the present invention is not limited to the cases listed here.
 ケース1の例は、論理演算「NOT」の演算子に関するものであり、この演算子は「否定」を表すものである。入力文の例が、「苦くない薬」である場合、形態素解析によって、対象語の「苦く」の語は形容詞(連用形)と判定され、「ない」の語は形容詞(補助形容詞)と判定され、対象語の「薬」の語は名詞と判定される。ここで、演算子に対応する語は、「ない」である。この場合、「ない」の語は、形容詞「苦く」を否定する意味を有するものと判断され、この語が、演算子「NOT」に対応すると判定される。その後、「薬」と、形容詞「苦く」の終止形である「苦い」が、対象語として選択され、「薬 NOT 苦い」という検索文が生成される。 Example of Case 1 relates to the operator of the logical operation “NOT”, and this operator represents “Negation”. When the example of the input sentence is “non-bitter medicine”, the word “bitter” of the target word is determined as an adjective (consecutive form) and the word “not” is determined as an adjective (auxiliary adjective) by morphological analysis. The word “medicine” is determined as a noun. Here, the word corresponding to the operator is “none”. In this case, the word “absent” is determined to have the meaning of negating the adjective “bitter”, and this word is determined to correspond to the operator “NOT”. After that, “drug” and the end of the adjective “bitter” “bitter” are selected as target words, and a search sentence “drug NOT bitter” is generated.
 ケース3の例は、論理演算「OR」の演算子に関するものであり、この演算子は「論理和」を表すものである。入力文の例が、「赤いもしくは青い服」である場合、形態素解析によって、対象語の「赤い」の語は形容詞と判定され、「もしくは」の語は接続詞と判定され、対象語の「青い」の語は形容詞と判定される。ここで、演算子に対応する語は、「もしくは」である。この場合、「もしくは」の語は、上記のように論理和を意味するものと判断され、この語が、演算子「OR」に対応すると判定される。 Example of Case 3 relates to an operator of logical operation “OR”, and this operator represents “logical sum”. When the example of the input sentence is “red or blue clothes”, the word “red” of the target word is determined as an adjective, the word “or” is determined as a conjunction, and the target word “blue” is analyzed by morphological analysis. Is determined as an adjective. Here, the word corresponding to the operator is “or”. In this case, the word “or” is determined to mean a logical sum as described above, and this word is determined to correspond to the operator “OR”.
 ここで、例えば、部分検索文「赤い OR」、「OR 青い」、「青い AND 服」が生成され、これらの部分検索文が統合され、最終的な検索文が生成される(なお、ここで、入力文「青い服」は、「青い」、と「服」が対象語であり、演算子に対応する語は、「服」を修飾している「青い」となる)。また、上述の部分検索文「青い AND 服」は、対象語の「服」を演算子「AND」と対象語「青い」で修飾するものであり、部分検索文「赤い OR」と「OR 青い」を統合した検索文「赤い OR 青い」と統合される場合には、「赤い OR 青い」が、論理演算で優先されるように、括弧書きで組み込まれる。このような、検索文における括弧書きの利用は、すなわち、演算子に対応する語、及びその語に隣接する対象語の配置関係と内容から判断されるものである。 Here, for example, partial search sentences “red OR”, “OR blue”, “blue AND clothes” are generated, and these partial search sentences are integrated to generate a final search sentence (here, In the input sentence “blue clothes”, “blue” and “clothes” are the target words, and the word corresponding to the operator is “blue” that modifies “clothes”). In addition, the partial search sentence “blue AND clothes” described above modifies the target word “clothes” with the operator “AND” and the target word “blue”. The partial search sentences “red OR” and “OR blue” ”Is integrated with the search sentence“ red OR blue ”in parentheses so that“ red OR blue ”is prioritized in the logical operation. Such use of parentheses in a search sentence is determined from the arrangement relationship and contents of a word corresponding to an operator and a target word adjacent to the word.
 ケース2、4~6の例も、論理演算の演算子に関するものであり、ケース1と同様に、入力文に含まれる、演算子に対応する語に基づいて検索文が組み立てられる。ここでは、詳細な説明を省略する。 Cases 2, 4 to 6 also relate to operators of logical operations, and as in case 1, search sentences are assembled based on words corresponding to the operators included in the input sentence. Here, detailed description is omitted.
 ケース7の例は、比較演算「==」の演算子に関するものであり、この演算子は「等しい」を表すものである。例えば、入力文が「価格が1万円」である場合、演算子に対応する語は、助詞(格助詞)の「が」である。この場合、入力文は、形態素解析によって、「価格」、「が」、「1」、「万」、「円」に分割され、演算子に対応する「が」の語に隣接して「価格」の語、及び数量を示す「1万」(ここでは、数詞と、単位を表す数詞が結合される)が配置されていることによって、「が」の語が、演算子「==」に対応するものであると判定される。その後、「価格」が「price」に変換され、「1万」が「10000」に変換された後、それぞれが、対象語として選択され、「price==10000」という検索文が生成される。 Example of Case 7 relates to the operator of the comparison operation “==”, and this operator represents “equal”. For example, when the input sentence is “price is 10,000 yen”, the word corresponding to the operator is “ga” as a particle (case particle). In this case, the input sentence is divided into “price”, “ga”, “1”, “ten thousand”, “yen” by morphological analysis, and “price” is adjacent to the word “ga” corresponding to the operator. ”And the number“ 10,000 ”indicating the quantity (here, the number and the number representing the unit are combined), the word“ ga ”becomes the operator“ == ”. It is determined that it corresponds. Thereafter, “price” is converted to “price”, “10,000” is converted to “10000”, each is selected as a target word, and a search sentence “price == 10000” is generated.
 ケース7~12の例も、比較演算の演算子に関するものであり、ケース6と同様に、入力文に含まれる、演算子に対応する語に基づいて検索文が組み立てられる。ここでは、詳細な説明を省略する。 Examples of cases 7 to 12 also relate to operators for comparison operations, and as in case 6, search sentences are assembled based on words corresponding to the operators included in the input sentence. Here, detailed description is omitted.
 また、論理演算同士や、比較演算同士を複数組み合わせて検索を行うことができる。例えば、入力文が「苦くない錠剤の薬」であった場合、検索文は、「薬 AND 錠剤 NOT 苦い」のように生成されうる。 Also, a search can be performed by combining a plurality of logical operations and comparison operations. For example, when the input sentence is “a non-bitter tablet medicine”, the search sentence can be generated as “drug AND tablet NOT bitter”.
 さらに、論理演算と比較演算を複数組み合わせて検索を行うことができる。例えば、入力文が「価格が1万円以下の赤い服」であった場合、検索文は、「服 AND 赤い AND 価格 <= 10000」のように生成されうる。 Furthermore, a search can be performed by combining a plurality of logical operations and comparison operations. For example, if the input sentence is “red clothes whose price is 10,000 yen or less”, the search sentence can be generated as “clothes AND red AND price <= 10000”.
 図7は、本発明の第2実施形態に係る検索文生成サーバ300’で実行される検索文生成処理の流れを示すフローチャートである。検索文生成サーバ300’は、図1に示す情報提供システム1において、本発明の第1実施形態に係る検索文生成サーバ300に代えて配置することができる。 FIG. 7 is a flowchart showing the flow of a search statement generation process executed by the search statement generation server 300 'according to the second embodiment of the present invention. The search sentence generation server 300 ′ can be arranged in the information providing system 1 shown in FIG. 1 instead of the search sentence generation server 300 according to the first embodiment of the present invention.
 検索文生成サーバ300’で実行される検索文生成処理は、検索文生成サーバ300の検索文生成処理と同様に、情報提供サーバ200から、ユーザ10の入力文を受信した場合に、その入力文に基づいて所定のルールにより検索文を生成し、生成された検索文を情報提供サーバ200に送信する。 Similar to the search statement generation process of the search statement generation server 300, the search statement generation process executed by the search statement generation server 300 ′ receives the input sentence of the user 10 from the information providing server 200. Based on the above, a search sentence is generated according to a predetermined rule, and the generated search sentence is transmitted to the information providing server 200.
 最初に、ステップS51において、検索文生成サーバ300’は、情報提供サーバ200からユーザ10の入力文を受信したか否かを判定する。入力文を受信していない場合は(ステップS51のNO)、当該判定処理を繰り返し、受信している場合は(ステップS51のYES)、ステップS52に進む。 First, in step S51, the search statement generation server 300 'determines whether or not the input statement of the user 10 has been received from the information providing server 200. When the input sentence has not been received (NO in step S51), the determination process is repeated. When the input sentence has been received (YES in step S51), the process proceeds to step S52.
 ステップS52において、検索文生成サーバ300’は、受信した入力文の解析を行って、当該入力文から複数の語を抽出する。本実施形態では、解析用データ340に含まれる形態素データベースを用いて形態素解析を行う。これによって、入力文は、意味を持つ最小の単位(形態素)に分割される。例えば、それぞれの語が、名詞、動詞、助詞、助動詞、形容詞等の品詞に分類され抽出(分割)される。なお、本実施形態では、形態素解析が行われるように構成されるが、所定のキーワードとの照合を行う解析や、辞書に登録されたワードとの比較を行う解析など、様々な方法で入力文の解析を行い、複数の語を抽出することができる。 In step S52, the search sentence generation server 300 'analyzes the received input sentence and extracts a plurality of words from the input sentence. In the present embodiment, morpheme analysis is performed using a morpheme database included in the analysis data 340. As a result, the input sentence is divided into the smallest meaningful units (morphemes). For example, each word is classified and extracted (divided) into parts of speech such as nouns, verbs, particles, auxiliary verbs, and adjectives. In this embodiment, morphological analysis is performed. However, the input sentence is analyzed by various methods such as analysis for matching with a predetermined keyword and analysis for comparing with a word registered in a dictionary. Can be extracted and a plurality of words can be extracted.
 また、このとき、生成される検索文の構成に直接関係しない語は、検索文生成サーバ300の場合と同様に削除、又は調整される。また、名詞や形容詞等は、必要に応じて、検索文で使用するのに好適な形式(文体)に変換される。 At this time, words that are not directly related to the structure of the generated search sentence are deleted or adjusted in the same manner as the search sentence generation server 300. In addition, nouns, adjectives, and the like are converted into a form (style) suitable for use in a search sentence as necessary.
 次に、ステップS53において、入力文の解析結果(分割された語)のなかに、未処理の語(演算子に対応する語)が存在するか否かを判定する。未処理の語が存在しない場合(ステップS53のNO)、検索式の生成が終了しているので、ステップS60において、生成された検索文を情報提供サーバ200に送信し、ステップS51の処理(情報提供サーバ200からユーザ10の入力文を受信したか否かを判定する処理)を繰り返す。 Next, in step S53, it is determined whether or not an unprocessed word (word corresponding to the operator) exists in the analysis result (divided word) of the input sentence. If there is no unprocessed word (NO in step S53), the generation of the search expression has been completed. Therefore, in step S60, the generated search sentence is transmitted to the information providing server 200, and the process of step S51 (information The process of determining whether or not the input sentence of the user 10 has been received from the providing server 200 is repeated.
 未処理の語が存在する場合(ステップS53のYES)、ステップS54において、入力文の解析結果(分割された語)から、検索文に含まれる演算子に対応する語を、順に判定する。例えば、入力文の一部から、「以外」「の」といった連続する形態素が抽出されている場合、この2つの語を統合した「以外の」の語は、隣接する名詞等からなる対象語も考慮したうえで、論理演算のNOTに対応するものと判定する。さらに、入力文の一部から、「で」といった形態素が抽出されている場合、「で」の語は、隣接する名詞等からなる対象語も考慮したうえで、論理演算のANDに対応するものと判定する。 If there is an unprocessed word (YES in step S53), in step S54, the word corresponding to the operator included in the search sentence is sequentially determined from the analysis result (divided word) of the input sentence. For example, when continuous morphemes such as “other” and “no” are extracted from a part of the input sentence, the word “other” obtained by integrating these two words is the target word composed of adjacent nouns and the like. In consideration, it is determined that it corresponds to NOT of the logical operation. Furthermore, when a morpheme such as “de” is extracted from a part of the input sentence, the word “de” corresponds to the AND of the logical operation in consideration of the target word composed of adjacent nouns etc. Is determined.
 さらに、本実施形態は、必要に応じて、解析用データ340を参照し、形容詞が否定形であるか否かを判定して、演算子に対応する語を判定し、結果として、検索文にNOTを含めるようにしたり、語に隣接する対象語の種類に基づいて、当該語がどの論理演算の演算子に対応するかを判定することができる。 Furthermore, according to the present embodiment, if necessary, the analysis data 340 is referred to, whether or not the adjective is a negative form, a word corresponding to the operator is determined, and as a result, Notation can be included, or based on the type of the target word adjacent to the word, it can be determined which logical operation operator the word corresponds to.
 次に、ステップS55において、ステップS52による入力文の解析結果と、ステップS54で判定された1つの演算子とから、当該演算子に対応する対象語を判定する。ここで、対象語は、演算子に対応すると判定された語に隣接する名詞や形容詞等であり、検索文においては、検索対象データ400の検索対象項目や、検索値に相当するものである。 Next, in step S55, the target word corresponding to the operator is determined from the analysis result of the input sentence in step S52 and the one operator determined in step S54. Here, the target word is a noun, an adjective, or the like adjacent to the word determined to correspond to the operator, and corresponds to the search target item or the search value of the search target data 400 in the search sentence.
 次に、ステップS56において、ステップS54で判定された1つの演算子と、ステップS55において判定された対象語に基づいて、検索文を生成・統合する。なお、ステップS54ないしステップS56は、入力文の解析結果(分割された語)のなかに、未処理の語(演算子に対応する語)が存在する間繰り返されるので、検索文は、演算子に対応する語が現れるたびに、生成済みの検索文と順次統合されていくことになる。 Next, in step S56, search sentences are generated and integrated based on the one operator determined in step S54 and the target word determined in step S55. Note that steps S54 to S56 are repeated while there are unprocessed words (words corresponding to operators) in the analysis result (divided words) of the input sentence. Each time a word corresponding to the word appears, it is sequentially integrated with the generated search sentence.
 次に、ステップS57において、ステップS54で判定された1つの演算子に関連する対象語がまだあるか否かを判定する。対象語がない場合(ステップS57のNO)、ステップS53に戻り、入力文の解析結果(分割された語)のなかに、さらに未処理の語(演算子に対応する語)が存在するか否かを判定する。 Next, in step S57, it is determined whether there is still a target word related to one operator determined in step S54. When there is no target word (NO in step S57), the process returns to step S53, and whether or not there is a further unprocessed word (word corresponding to the operator) in the input sentence analysis result (divided word). Determine whether.
 対象語がある場合(ステップS57のYES)、ステップS58において、ステップS52による入力文の解析結果と、ステップS54で判定された1つの演算子とから、当該演算子に対応する対象語を判定し、次に、ステップS59において、ステップS54で判定された1つの演算子と、ステップS58において判定された対象語に基づいて、検索文を統合する。ステップS58の後、ステップS57に戻り、ステップS54で判定された1つの演算子に関連する対象語がまだあるか否かを判定する。 If there is a target word (YES in step S57), in step S58, the target word corresponding to the operator is determined from the analysis result of the input sentence in step S52 and the one operator determined in step S54. Next, in step S59, the search sentences are integrated based on the one operator determined in step S54 and the target word determined in step S58. After step S58, the process returns to step S57, and it is determined whether there is still a target word related to one operator determined in step S54.
 このように、第2実施形態に係る検索文生成サーバ300’では、受信した入力文を解析して、演算子に対応する語を順次判定し、その演算子の対象語が判定される都度、その演算子と対象語に基づいて検索文を組み立て、演算子に対応する語のすべてについてこの処理を繰り返すことにより、最終的な検索文を完成させる。このような検索文生成方法により、ユーザが入力した自然文に基づいて、自然文に含まれる検索の真の意図やニュアンスが反映された検索文が生成可能となる。 Thus, in the search sentence generation server 300 ′ according to the second embodiment, the received input sentence is analyzed, the words corresponding to the operator are sequentially determined, and each time the target word of the operator is determined, A search sentence is assembled based on the operator and the target word, and this process is repeated for all the words corresponding to the operator to complete the final search sentence. With such a search sentence generation method, it is possible to generate a search sentence that reflects the true intention or nuance of the search included in the natural sentence, based on the natural sentence input by the user.
 図8は、第2実施形態に係る検索文生成サーバ300’で実行される検索文生成処理について、例となる入力文360を用いて、より詳細に説明するものである。 FIG. 8 illustrates the search sentence generation process executed by the search sentence generation server 300 ′ according to the second embodiment in more detail using an example input sentence 360.
 図8Aに示すように、検索文生成サーバ300’は、ユーザ10から「青以外の服でS社製」との入力文360を受信し、その入力文360に対して形態素解析を実行する(図7のステップS52参照)。図8Aでは、形態素解析の結果が、解析結果361として示されている。 As illustrated in FIG. 8A, the search statement generation server 300 ′ receives an input sentence 360 “from clothes other than blue and made by S company” from the user 10, and executes morphological analysis on the input sentence 360 ( (See step S52 in FIG. 7). In FIG. 8A, the result of the morphological analysis is shown as an analysis result 361.
 形態素解析により、入力文360は、例えば、「青」、「以外」、「の」、「服」、「で」、「S社」、「製」の各語に分解される。ここで、例えば、「青」、「以外」、「服」、「S社」の語はそれぞれ名詞と判定され、「の」、「で」の語はそれぞれ助詞(格助詞)と判定され、「製」の語は接尾辞と判定される。なお、「S社」の語は、解析用データ340に含まれる固有名詞辞書により、社名を表す名詞と判定される。 Through the morphological analysis, the input sentence 360 is decomposed into, for example, the words “blue”, “other than”, “no”, “clothes”, “de”, “S company”, and “made”. Here, for example, the words “blue”, “other than”, “clothes”, and “S company” are each determined as a noun, and the words “no” and “de” are each determined as a particle (case particle), The word “made” is determined to be a suffix. The word “Company S” is determined as a noun representing the company name by the proper noun dictionary included in the analysis data 340.
 ここで、検索文の演算子となる最初の語についての判定を行い取得する(図7のステップS54参照)。最初に、解析結果361から、名詞である「以外」と助詞の「の」が抽出され、この2つの語は、(例えば、解析用データ340を参照することによって)連結されることで演算子「NOT」に対応すると判定される。次に、この演算子「NOT」に対応する対象語、すなわち、「以外の」の語に隣接する(例えば、前後の)語を解析結果361から求め、これを対象語として判定する(図7のステップS55参照)。 Here, the first word that is the operator of the search sentence is determined and acquired (see step S54 in FIG. 7). First, from the analysis result 361, the noun “other” and the particle “no” are extracted, and the two words are connected (for example, by referring to the analysis data 340) so that the operator It is determined that it corresponds to “NOT”. Next, a target word corresponding to the operator “NOT”, that is, a word adjacent to (for example, before and after) a word other than “not” is obtained from the analysis result 361 and determined as a target word (FIG. 7). Step S55).
 この処理の結果、演算子「NOT」に対応する語「以外の」と、この「以外の」の語に対応する対象語「青」が判定され、図8Aの判定結果362には、「青 以外の」の語の組が表されている。さらに、この「青 以外の」の語の組に基づいて、検索文「NOT 青」が生成され(図7のステップS56参照)、この検索文が、図8Aの部分検索文363の(1)として表されている。 As a result of this processing, the word “other than” corresponding to the operator “NOT” and the target word “blue” corresponding to the word “other than” are determined. The determination result 362 of FIG. A set of words other than "" is shown. Further, a search sentence “NOT blue” is generated based on the set of words other than “blue” (see step S56 in FIG. 7), and this search sentence is (1) of the partial search sentence 363 in FIG. 8A. It is expressed as
 この後、「以外の」の語に対応する対象語がまだ存在するか否かが判定され(図7のステップS57参照)、その結果、対象語「服」が判定され、図8Aの判定結果362には、「以外の 服」の語の組が表されている。さらに、この「以外の 服」の語の組に基づいて、検索文「服 NOT」が生成され(図7のステップS56参照)、この検索文が、図8Aの部分検索文363の(2)として表されている。 Thereafter, it is determined whether or not there is still a target word corresponding to the word “other than” (see step S57 in FIG. 7). As a result, the target word “clothes” is determined, and the determination result in FIG. 362 shows a set of words other than “clothes”. Further, a search sentence “clothes NOT” is generated based on the set of words other than “clothes” (see step S56 of FIG. 7), and this search sentence is the partial search sentence 363 of FIG. 8A (2). It is expressed as
 ここでさらに、部分検索文363の(1)と(2)が、図8Bに示すように統合され(図7のステップS56参照)、検索文364として「服 NOT 青」が生成される。 Here, (1) and (2) of the partial search sentence 363 are further integrated as shown in FIG. 8B (see step S56 in FIG. 7), and “clothes NOT blue” is generated as the search sentence 364.
 この後、「以外の」の語に対応する対象語がまだ存在するか否かが判定され(図7のステップS57参照)、その結果、対象語が存在しないので、検索文の演算子となる次の語についての判定を行い取得する(図7のステップS54参照)。ここで、解析結果361から、助詞の「で」が抽出され、「で」の語は、演算子「AND」に対応する。次に、この演算子「AND」に対応する対象語、すなわち、「で」の語に隣接する語を解析結果361から求め、これを対象語として判定する(図7のステップS55参照)。 Thereafter, it is determined whether or not there is still a target word corresponding to the word “other than” (see step S57 in FIG. 7). As a result, since the target word does not exist, it becomes an operator of the search sentence. The next word is determined and acquired (see step S54 in FIG. 7). Here, the particle “de” is extracted from the analysis result 361, and the word “de” corresponds to the operator “AND”. Next, an object word corresponding to the operator “AND”, that is, a word adjacent to the word “de” is obtained from the analysis result 361 and determined as an object word (see step S55 in FIG. 7).
 この処理の結果、演算子「AND」に対応する語「で」と、この「で」の語に対応する対象語「服」が判定され、図8Aの判定結果362には、「服 で」の語の組が表されている。さらに、この「服 で」の語の組に基づいて、検索文「服 AND」が生成され(図7のステップS56参照)、この検索文が、図8Aの部分検索文363の(3)として表されている。 As a result of this processing, the word “de” corresponding to the operator “AND” and the target word “clothes” corresponding to the word “de” are determined, and the determination result 362 of FIG. A set of words is represented. Further, a search sentence “clothes AND” is generated based on the set of words “clothes” (see step S56 in FIG. 7). It is represented.
 ここでさらに、部分検索文363の(1)、(2)、(3)が、図8Bに示すように統合され(図7のステップS56参照)、検索文365として「(服 NOT 青) AND」が生成される。このような統合処理は、図8Bに示した検索文364と、図8Aに示した部分検索文363の(3)を統合し、共通する対象語「服」の部分でまとめることによって行う。また、ここで、検索文364の「服 NOT 青」は、対象語の「服」を演算子「NOT」と対象語「青」で修飾するものであり、部分検索文である検索文「服 AND」と統合される場合には、上述のように、「服 NOT 青」が、論理演算で優先されるように、括弧書きで組み込まれる。このような、検索文における括弧書きの利用は、すなわち、演算子に対応する語、及びその語に隣接する対象語の配置関係と内容から判断されるものである。 Further, (1), (2), and (3) of the partial search sentence 363 are integrated as shown in FIG. 8B (see step S56 in FIG. 7), and “(clothing NOT blue) AND is used as the search sentence 365. Is generated. Such integration processing is performed by integrating the search sentence 364 shown in FIG. 8B and (3) of the partial search sentence 363 shown in FIG. 8A and grouping them together with the common target word “clothes”. In addition, “clothes NOT blue” in the search sentence 364 modifies the subject word “clothes” with the operator “NOT” and the target word “blue”, and the search sentence “clothes” that is a partial search sentence. When integrated with “AND”, as described above, “clothes NOT blue” is included in parentheses so that it is prioritized in the logical operation. Such use of parentheses in a search sentence is determined from the arrangement relationship and contents of a word corresponding to an operator and a target word adjacent to the word.
 この後、「で」の語に対応する対象語がまだ存在するか否かが判定され(図7のステップS57参照)、その結果、対象語「S社」、及び「製」が判定され、図8Aの判定結果362には、「で S社製」の語の組が表されている。なお、本実施形態では、解析結果361で「S社」と「製」が形態素としてそれぞれ抽出されているが、ここでは、対象語を形容詞的表現にするため、「S社」と「製」を結合して、「S社製」としている。もちろん、対象語として名詞である「S社」のみを選択することもできる。 Thereafter, it is determined whether or not the target word corresponding to the word “de” still exists (see step S57 in FIG. 7). As a result, the target words “S company” and “manufactured” are determined. In the determination result 362 of FIG. 8A, a set of words “made by S company” is represented. In the present embodiment, “Company S” and “Made” are extracted as morphemes in the analysis result 361, but here “S Company” and “Made” are used to make the target word an adjective expression. Are combined to make “S”. Of course, only “S company” which is a noun can be selected as the target word.
 さらに、この「で S社製」の語の組に基づいて、検索文「AND S社製」が生成され(図7のステップS56参照)、この検索文が、図8Aの部分検索文363の(4)として表されている。 Further, based on the set of words “manufactured by S company”, a search sentence “AND made by S company” is generated (see step S56 in FIG. 7), and this search sentence is converted to the partial search sentence 363 in FIG. 8A. It is expressed as (4).
 ここでさらに、部分検索文363の(1)~(4)が、図8Bに示すように統合され(図7のステップS56参照)、検索文366として「(服 NOT 青) AND S社製」が生成される。このような統合処理は、図8Bに示した検索文365と、図8Aに示した部分検索文363の(4)を統合し、共通する演算子「AND」の部分でまとめることによって行う。 Further, (1) to (4) of the partial search sentences 363 are integrated as shown in FIG. 8B (see step S56 in FIG. 7), and “(clothing NOT blue) AND S company” is used as the search sentence 366. Is generated. Such integration processing is performed by integrating the search statement 365 shown in FIG. 8B and (4) of the partial search statement 363 shown in FIG. 8A and grouping them together in a common operator “AND” portion.
 この後、「で」の語に対応する対象語がまだ存在するか否かが判定され(図7のステップS57参照)、その結果、対象語が存在しないので、検索文の演算子となる次の語についての判定を行い取得する(図7のステップS54参照)。ここで、解析結果361から、演算子に対応する未処理の語がないので、これまでの統合処理で生成された検索文366の「(服 NOT 青) AND S社製」が、生成された検索文として情報提供サーバ200に送信される(図7のステップS60参照)。 Thereafter, it is determined whether or not the target word corresponding to the word “de” still exists (see step S57 in FIG. 7). As a result, since the target word does not exist, the next as an operator of the search sentence. The word is determined and acquired (see step S54 in FIG. 7). Here, since there is no unprocessed word corresponding to the operator from the analysis result 361, the search statement 366 "(clothing NOT blue) AND S company" generated by the integration process so far is generated. It is transmitted to the information providing server 200 as a search sentence (see step S60 in FIG. 7).
 なお、本発明の第2実施形態に係る検索文生成サーバ300’で実行される検索文生成処理では、図7、及び図8に示すように、入力文全体の解析が一度に行われ(図7のステップS52)、その後、演算子に対応する語について1つずつ、順に検索文生成処理が行われるが(図7のステップS53~ステップS59)、入力文の解析を部分的に行いながら、演算子に対応する語が抽出された場合に、その都度、当該語に関する検索文生成処理を行うように制御することもできる。 In the search statement generation process executed by the search statement generation server 300 ′ according to the second embodiment of the present invention, as shown in FIGS. 7 and 8, the entire input sentence is analyzed at once (see FIG. 7 (step S52), and then the search sentence generation processing is performed in order for each word corresponding to the operator (step S53 to step S59 in FIG. 7), while partially analyzing the input sentence, When a word corresponding to the operator is extracted, it is possible to perform control so that a search sentence generation process related to the word is performed each time.
 次に、図9を参照して、本発明の実施形態に係るユーザ端末100を構成するコンピュータのハードウェア構成の例について説明する。ユーザ端末100は、例えば、スマートフォンといった携帯機器である。ただし、図9に示すユーザ端末100の構成は、その代表的な構成を例示したにすぎない。 Next, with reference to FIG. 9, an example of a hardware configuration of a computer constituting the user terminal 100 according to the embodiment of the present invention will be described. The user terminal 100 is a portable device such as a smartphone, for example. However, the configuration of the user terminal 100 illustrated in FIG. 9 is merely an example of a typical configuration.
 ユーザ端末100は、CPU(Central Processing Unit)101、メモリ102、カメラ103、GPS制御部104、GPS用アンテナ105、無線信号処理部106、無線通信用アンテナ107、オーディオ制御部108、マイクロフォン109、スピーカ110、ディスプレイ制御部111、入力機器インタフェース112、タッチスクリーン113、非接触ICカードリーダ/ライタ114、センサー115、補助記憶装置116、及び外部記録媒体インタフェース117を含んでいる。 The user terminal 100 includes a CPU (Central Processing Unit) 101, a memory 102, a camera 103, a GPS control unit 104, a GPS antenna 105, a radio signal processing unit 106, a radio communication antenna 107, an audio control unit 108, a microphone 109, and a speaker. 110, a display control unit 111, an input device interface 112, a touch screen 113, a non-contact IC card reader / writer 114, a sensor 115, an auxiliary storage device 116, and an external recording medium interface 117.
 CPU101は、ユーザ端末100の各構成要素の動作を制御し、OSの制御下で、各機能を実行する。 The CPU 101 controls the operation of each component of the user terminal 100 and executes each function under the control of the OS.
 メモリ102は通常RAM(Random Access Memory)で構成される。メモリ102には、CPU101で実行される各機能を実現するためのプログラムが実行時にロードされ、当該プログラムに必要なデータ等が一時的に記憶される。 The memory 102 is usually composed of RAM (Random Access Memory). In the memory 102, a program for realizing each function executed by the CPU 101 is loaded at the time of execution, and data necessary for the program is temporarily stored.
 カメラ103は、ユーザ端末100に内蔵される小型の撮像装置である。GPS制御部104は、GPS用アンテナ105を介して、複数のGPSから信号を受信し、ユーザ端末100の位置を調べる。 The camera 103 is a small imaging device built in the user terminal 100. The GPS control unit 104 receives signals from a plurality of GPSs via the GPS antenna 105 and checks the position of the user terminal 100.
 無線信号処理部106は、無線通信用アンテナ107を介して携帯電話基地局と通信を行い、他の機器との間での通話データの送受信や、(インターネットを介した)他端末との間でのWEBページやメールデータの送受信を制御する。また、無線信号処理部106は、無線通信用アンテナ107を用いて無線LANアクセスポイントとの間で無線LANによる通信を実現し、インターネット経由のデータ送受信を行う(ここでは、便宜上、無線信号処理部106と無線通信用アンテナ107が、携帯電話基地局との間の無線通信及び無線LANアクセスポイントとの間の無線通信を行うものとした)。 The wireless signal processing unit 106 communicates with the mobile phone base station via the wireless communication antenna 107, transmits / receives call data to / from other devices, and communicates with other terminals (via the Internet). Controls transmission / reception of WEB pages and mail data. The wireless signal processing unit 106 implements wireless LAN communication with the wireless LAN access point using the wireless communication antenna 107, and performs data transmission / reception via the Internet (here, for convenience, the wireless signal processing unit 106 and the antenna 107 for wireless communication perform wireless communication with the mobile phone base station and wireless communication with the wireless LAN access point).
 オーディオ制御部108は、マイクロフォン109とスピーカ110を制御して無線通信による通話を実現し、一方で、アプリケーションにおいて動画や音楽を再生する場合に、音声を出力するよう制御する。 The audio control unit 108 controls the microphone 109 and the speaker 110 to realize a telephone call by wireless communication. On the other hand, the audio control unit 108 controls to output sound when playing a video or music in the application.
 タッチスクリーン113は、例えば、LCD(Liquid Crystal Display)などで構成される表示装置で情報を表示するとともに、ユーザが指などで画面表面をタッチした(押した)位置を、抵抗膜方式や静電容量方式などのタッチセンサーで検知する。ディスプレイ制御部111は、CPU101が発行する描画データを処理して、例えば、WEBページや動画等を、タッチスクリーン113の表示装置に出力する。入力機器インタフェース112は、タッチスクリーン113のタッチセンサーが、ユーザによる操作を検知し、これを所定の信号としてCPU101に送信する。 The touch screen 113 displays information on a display device configured by, for example, an LCD (Liquid Crystal Display), and the position where the user touches (presses) the screen surface with a finger or the like is displayed on the touch screen 113. Detect with capacitive type touch sensor. The display control unit 111 processes drawing data issued by the CPU 101 and outputs, for example, a WEB page, a moving image, or the like to the display device of the touch screen 113. In the input device interface 112, the touch sensor of the touch screen 113 detects an operation by the user, and transmits this to the CPU 101 as a predetermined signal.
 非接触ICカードリーダ/ライタ114は、非接触ICチップが埋め込まれたカードがユーザ端末100の所定の位置にかざされた場合に、CPU101の指令に基づいて、そのICチップに記憶されているデータを読み取り、又は、ICチップに所定のデータを書き込む。 The non-contact IC card reader / writer 114 stores data stored in the IC chip based on a command from the CPU 101 when the card in which the non-contact IC chip is embedded is held over a predetermined position of the user terminal 100. Or write predetermined data to the IC chip.
 センサー115は、タッチスクリーン113のタッチセンサー以外のセンサーであり、モーションセンサー、光センサー、近接センサーなどがある。 The sensor 115 is a sensor other than the touch sensor of the touch screen 113, and includes a motion sensor, a light sensor, a proximity sensor, and the like.
 補助記憶装置116は、例えば、フラッシュメモリと呼ばれる半導体メモリやハードディスクで構成される。補助記憶装置116は、CPU101で実行される各機能を実現するためのプログラムを記憶するほか、各種データを記憶する。 The auxiliary storage device 116 is composed of, for example, a semiconductor memory called a flash memory or a hard disk. The auxiliary storage device 116 stores various data as well as programs for realizing the functions executed by the CPU 101.
 外部記録媒体インタフェース117は、外部記録媒体140にアクセスして、そこに記録されているデータを読み取る。外部記録媒体140は、例えば、可搬型のフラッシュメモリである。 The external recording medium interface 117 accesses the external recording medium 140 and reads data recorded therein. The external recording medium 140 is, for example, a portable flash memory.
 次に、図10を参照して、本発明の第1実施形態に係る検索文生成サーバ300を構成するコンピュータのハードウェア構成の例について説明する。ただし、図10に示す検索文生成サーバ300の構成は、その代表的な構成を例示したにすぎない。なお、このようなコンピュータの構成を、本発明の第2実施形態に係る検索文生成サーバ300’や、情報提供システム1の情報提供サーバ200等に採用してもよい。 Next, with reference to FIG. 10, an example of a hardware configuration of a computer constituting the search statement generation server 300 according to the first embodiment of the present invention will be described. However, the configuration of the search statement generation server 300 illustrated in FIG. 10 is merely an example of a typical configuration. Such a computer configuration may be employed in the search statement generation server 300 ′ according to the second embodiment of the present invention, the information provision server 200 of the information provision system 1, and the like.
 検索文生成サーバ300は、CPU301、RAM302、ROM(Read Only Memory)303、ネットワークインタフェース304、オーディオ制御部305、マイクロフォン306、スピーカ307、ディスプレイコントローラ308、ディスプレイ309、入力機器インタフェース310、キーボード311、マウス312、外部記憶装置313、外部記録媒体インタフェース314、及びこれらの構成要素を互いに接続するバス315を含んでいる。 The search statement generation server 300 includes a CPU 301, a RAM 302, a ROM (Read Only Memory) 303, a network interface 304, an audio control unit 305, a microphone 306, a speaker 307, a display controller 308, a display 309, an input device interface 310, a keyboard 311 and a mouse. 312, an external storage device 313, an external recording medium interface 314, and a bus 315 that connects these components to each other.
 CPU301は、検索文生成サーバ300の各構成要素の動作を制御し、OSの制御下で、本発明に係る各機能の実行を制御する。 The CPU 301 controls the operation of each component of the search statement generation server 300, and controls the execution of each function according to the present invention under the control of the OS.
 RAM302には、CPU301で実行される各処理を実行するためのプログラムや、それらのプログラムが実行中に使用するデータが一時的に格納される。ROM303には、検索文生成サーバ300の起動時に実行されるプログラム等が格納される。 The RAM 302 temporarily stores programs for executing each process executed by the CPU 301 and data used during the execution of these programs. The ROM 303 stores a program executed when the search statement generation server 300 is activated.
 ネットワークインタフェース304は、ネットワーク320に接続するためのインタフェースである。ネットワーク320は、例えば、情報提供サーバ200と接続するためのネットワークである。 The network interface 304 is an interface for connecting to the network 320. The network 320 is a network for connecting to the information providing server 200, for example.
 オーディオ制御部305は、マイクロフォン306とスピーカ307を制御して音声の入出力を制御する。ディスプレイコントローラ308は、CPU301が発行する描画命令を実際に処理するための専用コントローラである。ディスプレイ309は、例えば、LCDで構成される表示装置である。 The audio control unit 305 controls the microphone 306 and the speaker 307 to control audio input / output. The display controller 308 is a dedicated controller for actually processing a drawing command issued by the CPU 301. The display 309 is a display device configured by an LCD, for example.
 入力機器インタフェース310は、キーボード311やマウス312から入力された信号を受信して、その信号パターンに応じて所定の指令をCPU301に送信する。 The input device interface 310 receives a signal input from the keyboard 311 or the mouse 312 and transmits a predetermined command to the CPU 301 according to the signal pattern.
 外部記憶装置313は、例えば、ハードディスクや半導体メモリのような記憶装置であり、この装置内には上述したプログラムやデータが記録される。 The external storage device 313 is, for example, a storage device such as a hard disk or a semiconductor memory, and the above-described program and data are recorded in this device.
 外部記録媒体インタフェース314は、外部記録媒体330にアクセスして、そこに記録されているデータを読み取る。外部記録媒体330は、例えば、可搬型のフラッシュメモリなどである。CPU301で実行され、本発明の各機能を実現するためのプログラムは、この外部記録媒体インタフェース314を介して外部記録媒体330から提供されうる。また、本発明の各機能を実現するためのプログラムの他の流通形態としては、ネットワーク上の所定のサーバから、ネットワーク320及びネットワークインタフェース304を介して外部記憶装置313やRAM302に格納されるというルート等も考えられる。 The external recording medium interface 314 accesses the external recording medium 330 and reads data recorded therein. The external recording medium 330 is, for example, a portable flash memory. A program executed by the CPU 301 to realize each function of the present invention can be provided from the external recording medium 330 via the external recording medium interface 314. As another distribution form of the program for realizing each function of the present invention, a route that is stored in the external storage device 313 or the RAM 302 via a network 320 and a network interface 304 from a predetermined server on the network. Etc. are also conceivable.
 なお、ここまで、本発明の第1実施形態に係る検索文生成サーバ300、又は本発明の第2実施形態に係る検索文生成サーバ300’を含む検索文生成システム11、及び当該検索文生成システム11を含む情報提供システム1について、本発明を実施することができる具体例を示しながら説明してきたが、これらの具体例は本発明を説明するための一例に過ぎず、本発明の権利範囲はこれらの具体例に限定されることはない。これら以外の様々な方法や構成によって、本発明の技術的思想を実現することができる。 Until now, the search statement generation system 11 including the search statement generation server 300 according to the first embodiment of the present invention or the search statement generation server 300 ′ according to the second embodiment of the present invention, and the search statement generation system. 11 has been described with reference to specific examples in which the present invention can be implemented. However, these specific examples are merely examples for explaining the present invention, and the scope of rights of the present invention is limited. It is not limited to these specific examples. The technical idea of the present invention can be realized by various methods and configurations other than these.
 1 情報提供システム
 11 検索文生成システム
 100 ユーザ端末
 200 情報提供サーバ
 300 検索文生成サーバ
 300’ 検索文生成サーバ
 340 解析用データ
 400 検索対象データ
 500 ネットワーク
DESCRIPTION OF SYMBOLS 1 Information provision system 11 Search sentence generation system 100 User terminal 200 Information provision server 300 Search sentence generation server 300 'Search sentence generation server 340 Analysis data 400 Search object data 500 Network

Claims (5)

  1.  ユーザから入力された自然文の入力文を受信する入力文受信手段と、
     前記入力文を解析して、複数の語に分割する入力文解析手段と、
     前記複数の語に基づいて、演算子に対応する語を判定し、判定された前記語に関連する対象語を判定する判定手段と、
     判定された前記語に対応する演算子と、当該語に関連すると判定された対象語の組を用いて、前記入力文に対応する検索文を生成する検索文生成手段と、を備えることを特徴とする検索文生成システム。
    An input sentence receiving means for receiving an input sentence of a natural sentence input from a user;
    An input sentence analyzing means for analyzing the input sentence and dividing it into a plurality of words;
    Determining means for determining a word corresponding to an operator based on the plurality of words and determining a target word related to the determined word;
    Search sentence generation means for generating a search sentence corresponding to the input sentence by using an operator corresponding to the determined word and a set of target words determined to be related to the word. Search sentence generation system.
  2.  前記判定手段は、所定の判定基準に基づいて、否定の論理演算の演算子に対応する語を判定することを特徴とする、請求項1に記載の検索文生成システム。 2. The search sentence generation system according to claim 1, wherein the determination means determines a word corresponding to an operator of a negative logical operation based on a predetermined determination criterion.
  3.  前記検索文生成手段は、判定された前記語に対応する演算子と、当該語に関連すると判定された対象語の組のそれぞれにおける演算子と対象語の関係に基づいて、所定の演算子による論理演算の優先度が決定されるように、前記検索文を生成することを特徴とする、請求項1に記載の検索文生成システム。 The search sentence generation means uses a predetermined operator based on a relationship between the operator corresponding to the determined word and the operator and the target word in each set of target words determined to be related to the word. The search statement generation system according to claim 1, wherein the search statement is generated so that a priority of a logical operation is determined.
  4.  前記判定手段は、前記複数の語について、演算子に対応する語を順次判定し、
     前記検索文生成手段は、前記演算子に対応する語が判定されるたびに、判定された前記語に関する検索文を生成し、前記検索文を生成済みの検索文と統合することを特徴とする、請求項1に記載の検索文生成システム。
    The determination means sequentially determines words corresponding to operators for the plurality of words,
    The search sentence generation unit generates a search sentence relating to the determined word each time a word corresponding to the operator is determined, and integrates the search sentence with the generated search sentence. The search sentence generation system according to claim 1.
  5.  ユーザから入力された自然文の入力文に基づいて検索文を生成する検索文生成サーバにおいて実行される検索文生成方法であって、
     前記入力文を受信する入力文受信ステップと、
     前記入力文を解析して、複数の語に分割する入力文解析ステップと、
     前記複数の語に基づいて、演算子に対応する語を判定し、判定された前記語に関連する対象語を判定する判定ステップと、
     判定された前記語に対応する演算子と、当該語に関連すると判定された対象語の組を用いて、前記入力文に対応する検索文を生成する検索文生成ステップと、を有することを特徴とする検索文生成方法。
    A search statement generation method that is executed in a search statement generation server that generates a search statement based on an input sentence of a natural sentence input by a user,
    An input sentence receiving step of receiving the input sentence;
    An input sentence analysis step of analyzing the input sentence and dividing it into a plurality of words;
    A determination step of determining a word corresponding to an operator based on the plurality of words and determining a target word related to the determined word;
    A search sentence generation step of generating a search sentence corresponding to the input sentence by using an operator corresponding to the determined word and a set of target words determined to be related to the word. Search sentence generation method.
PCT/JP2019/018693 2018-06-04 2019-05-10 Search text generation system and search text generation method WO2019235131A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201980037385.XA CN112236768A (en) 2018-06-04 2019-05-10 Search text generation system and search text generation method
US17/059,831 US20210263915A1 (en) 2018-06-04 2019-05-10 Search Text Generation System and Search Text Generation Method
PH12020552089A PH12020552089A1 (en) 2018-06-04 2020-12-04 Search sentence generation system and search sentence generation method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2018107217A JP7139157B2 (en) 2018-06-04 2018-06-04 Search statement generation system and search statement generation method
JP2018-107217 2018-06-04

Publications (1)

Publication Number Publication Date
WO2019235131A1 true WO2019235131A1 (en) 2019-12-12

Family

ID=68770837

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/018693 WO2019235131A1 (en) 2018-06-04 2019-05-10 Search text generation system and search text generation method

Country Status (5)

Country Link
US (1) US20210263915A1 (en)
JP (1) JP7139157B2 (en)
CN (1) CN112236768A (en)
PH (1) PH12020552089A1 (en)
WO (1) WO2019235131A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102396881B1 (en) * 2021-08-18 2022-05-12 주식회사 비포플레이 Method, system and non-transitory computer-readable recording medium for supporting search of contents
JP2023036140A (en) * 2021-09-02 2023-03-14 株式会社日立製作所 Business data analysis device, business data analysis system and business data analysis method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08339383A (en) * 1995-04-11 1996-12-24 Ricoh Co Ltd Document retrieving device and dictionary preparing device
JPH11328194A (en) * 1998-05-13 1999-11-30 Nippon Telegr & Teleph Corp <Ntt> Keyword retrieval method and device and storage medium storing keyword retrieval program

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2792287B2 (en) * 1991-10-31 1998-09-03 日本電気株式会社 Information retrieval device
JP4011595B2 (en) * 1998-02-02 2007-11-21 株式会社リコー Electronic document retrieval system and recording medium
US8775459B2 (en) * 2005-01-07 2014-07-08 International Business Machines Corporation Method and apparatus for robust input interpretation by conversation systems
US20070106499A1 (en) * 2005-08-09 2007-05-10 Kathleen Dahlgren Natural language search system
JP2010003015A (en) * 2008-06-18 2010-01-07 Hitachi Software Eng Co Ltd Document search system
JP5306356B2 (en) * 2008-08-26 2013-10-02 インターナショナル・ビジネス・マシーンズ・コーポレーション SEARCH DEVICE, SEARCH METHOD, AND SEARCH PROGRAM USING PUBLIC SEARCH ENGINE
KR101023209B1 (en) * 2008-10-13 2011-03-18 한국전자통신연구원 Document translation apparatus and its method
RU2564629C1 (en) * 2014-03-31 2015-10-10 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Method of clustering of search results depending on semantics
CN104199965B (en) * 2014-09-22 2020-08-07 吴晨 Semantic information retrieval method
KR102033395B1 (en) * 2014-11-20 2019-10-18 한국전자통신연구원 Question answering system and method for structured knowledge-base using deep natrural language question analysis
US11151249B2 (en) * 2017-01-06 2021-10-19 Crowdstrike, Inc. Applications of a binary search engine based on an inverted index of byte sequences
US11481387B2 (en) * 2017-12-28 2022-10-25 Microsoft Technology Licensing, Llc Facet-based conversational search

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08339383A (en) * 1995-04-11 1996-12-24 Ricoh Co Ltd Document retrieving device and dictionary preparing device
JPH11328194A (en) * 1998-05-13 1999-11-30 Nippon Telegr & Teleph Corp <Ntt> Keyword retrieval method and device and storage medium storing keyword retrieval program

Also Published As

Publication number Publication date
US20210263915A1 (en) 2021-08-26
CN112236768A (en) 2021-01-15
PH12020552089A1 (en) 2021-08-02
JP7139157B2 (en) 2022-09-20
JP2019211987A (en) 2019-12-12

Similar Documents

Publication Publication Date Title
US11328004B2 (en) Method and system for intelligently suggesting tags for documents
JP6714024B2 (en) Automatic generation of N-grams and conceptual relationships from language input data
CN109196496B (en) Unknown word predictor and content integrated translator
US20150286943A1 (en) Decision Making and Planning/Prediction System for Human Intention Resolution
MXPA05002951A (en) Rendering tables with natural language commands.
JP2010517133A (en) Web site integrated search apparatus and method
JP7297458B2 (en) Interactive content creation support method
WO2020026366A1 (en) Patent evaluation determination method, patent evaluation determination device, and patent evaluation determination program
US11651147B2 (en) Method and system for intelligently detecting and modifying unoriginal content
WO2019235131A1 (en) Search text generation system and search text generation method
JP2006323517A (en) Text classification device and program
KR102088619B1 (en) System and method for providing variable user interface according to searching results
Bakar The development of an integrated corpus for Malay language
US20230103313A1 (en) User assistance system
US20230351091A1 (en) Presenting Intelligently Suggested Content Enhancements
JP7086710B2 (en) Information provision system
JP2004220226A (en) Document classification method and device for retrieved document
JP2020095653A (en) Creation support device and creation support method
NL2025417B1 (en) Intelligent Content Identification and Transformation
TWI220203B (en) Method for mobile device real-time edit and access on-line data
JP6796337B2 (en) Comment database creation device, comment database creation method, comment database creation program
WO2024067705A1 (en) Information acquisition method, device and system
JP2022051321A (en) Electronic dictionary, electronic dictionary system, information processing system, information processing method and program
Saraiva A Conversational Interface for Webpage Code Generation
Brändle MASTERARBEIT/MASTER’S THESIS

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19815736

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19815736

Country of ref document: EP

Kind code of ref document: A1