WO2021234844A1 - Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme - Google Patents

Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme Download PDF

Info

Publication number
WO2021234844A1
WO2021234844A1 PCT/JP2020/019902 JP2020019902W WO2021234844A1 WO 2021234844 A1 WO2021234844 A1 WO 2021234844A1 JP 2020019902 W JP2020019902 W JP 2020019902W WO 2021234844 A1 WO2021234844 A1 WO 2021234844A1
Authority
WO
WIPO (PCT)
Prior art keywords
search
question
query
search query
words
Prior art date
Application number
PCT/JP2020/019902
Other languages
English (en)
Japanese (ja)
Inventor
知史 三枝
裕一郎 関口
Original Assignee
日本電信電話株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電信電話株式会社 filed Critical 日本電信電話株式会社
Priority to PCT/JP2020/019902 priority Critical patent/WO2021234844A1/fr
Publication of WO2021234844A1 publication Critical patent/WO2021234844A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation

Definitions

  • This disclosure relates to a selection device, a priority determination device, an answer candidate acquisition device, a selection method, a priority determination method, an answer candidate acquisition method, and a program.
  • the call center administrator etc. confirms the response log (recorded data, text data obtained by voice recognition for the recorded data, etc.) that records the response with the customer by the operator, and adds it as FAQ.
  • the response log recorded data, text data obtained by voice recognition for the recorded data, etc.
  • the contents to be maintained as FAQ may change, and it costs a lot to maintain the FAQ.
  • Patent Document 1 a FAQ question is given based on a text that voice-recognizes recorded data obtained by recording a customer's response by an operator, an existing FAQ, and information on whether or not the FAQ was useful when the operator searched for the FAQ.
  • the method of servicing is described.
  • the search result for a certain query is presented to the operator, and the operator gives information as to whether or not the search result is useful.
  • the question (FAQ candidate) extracted from the recorded data, and the question that has already been prepared similar questions are aggregated and the aggregated question Among them, the set that does not include the questions that have already been prepared is extracted as the questions that should be added to the FAQ.
  • Non-Patent Document 1 As a method for the operator to obtain the answer to the question from the customer, there is a method in which the operator searches the business manual used as a reference for the answer by using a search query corresponding to the question.
  • a search query contains words and phrases that frequently appear throughout the business manual, even items that are not relevant as answers to the question may be searched as search results. Therefore, in order to remove words and phrases scattered in a document, a method of using the appearance information of the words and phrases is often used (see Non-Patent Document 1).
  • this method for example, even if a phrase frequently appears in a certain passage in the business manual, if the phrase frequently appears throughout the business manual, the priority of the phrase is lowered.
  • Patent Document 2 describes a method of obtaining a statement section of a customer's request from a voice log recording a conversation between an operator and a customer using the word appearance density, but this method is more appropriate. It is not intended for a search.
  • Non-Patent Document 1 In order to obtain answer candidates for FQA maintenance from the response log, it is necessary to create a search query to search the business manual.
  • a search query When the method described in Non-Patent Document 1 is used to create a search query, words and phrases that appear collectively in a specific range (such as a chapter or section) of a business manual are excluded from the search query, and appropriate search results are obtained. Can't be done.
  • the purpose of the present disclosure made in view of the above-mentioned problems is a selection device, a priority determination device, and an answer candidate acquisition that can select a phrase more suitable for a search query for searching a document to be searched. It is an object of the present invention to provide an apparatus, a selection method, a priority determination method, an answer candidate acquisition method, and a program.
  • the selection device has a calculation unit that calculates the degree of dispersion of the appearance positions of words and phrases in the document to be searched, and the search based on the degree of dispersion calculated by the calculation unit.
  • a word / phrase selection unit for selecting whether or not a word / phrase in the target document can be used in a search query for searching the search target document is provided.
  • the priority determination device includes a plurality of prioritization devices related to the document to be searched based on the frequency of appearance of words and phrases selected as usable in the search query by the selection device described above. The priority of words and phrases in the question sentence or the plurality of question sentences is determined.
  • the answer candidate acquisition device includes a question acquisition unit that acquires a question cluster consisting of a plurality of question sentences related to the document to be searched, and the above-mentioned priority determination device.
  • the plurality of question sentences constituting the question cluster are input, and the plurality of question sentences or the plurality of question sentences are determined based on the priority of the plurality of question sentences or the words and phrases in the plurality of question sentences determined by the above-mentioned priority determination device.
  • the search query generator that calculates the query score of words and phrases in the plurality of question sentences and generates the search query based on the calculated query score, and the generated search query are used to search the document to be searched.
  • a search score is calculated based on the frequency of appearance of words and phrases in the plurality of search results, and based on the calculated search score, the search score is calculated.
  • An output unit for determining the output order of search results for the search query is provided.
  • the selection method according to the present disclosure is based on a calculation step of calculating the degree of dispersion of the appearance positions of words and phrases in the document to be searched and the calculated degree of dispersion. Includes a selection step of selecting whether or not the phrase in the document can be used in a search query to search for the document to be searched.
  • the prioritization method includes a plurality of prioritization methods related to the document to be searched based on the frequency of appearance of words and phrases selected to be usable in the search query by the selection method described above. Includes a priority determination step that determines the priority of words in the interrogative text or the plurality of interrogative texts.
  • the answer candidate acquisition method is determined by the question acquisition step of acquiring a question cluster consisting of a plurality of question sentences related to the document to be searched and the above-mentioned priority determination method.
  • the query score of the words and phrases in the plurality of question sentences or the plurality of question sentences is calculated and calculated.
  • a search query generation step that generates the search query based on the generated query score
  • a search result acquisition step that searches the document to be searched using the generated search query and acquires the search result, and the acquisition.
  • a search score is calculated based on the frequency of appearance of words and phrases in the plurality of search results
  • an output step for determining the output order of the search results for the search query based on the calculated search score. including.
  • the program according to the present disclosure causes the computer to function as the above-mentioned selection device, priority determination device, or answer candidate acquisition device.
  • the selection device priority determination device, answer candidate acquisition device, selection method, priority determination method, answer candidate acquisition method, and program according to the present disclosure, words and phrases more suitable for a search query for searching a document to be searched. Can be selected.
  • FIG. 1 is a diagram showing a configuration example of the search system 10 according to the embodiment of the present disclosure.
  • the search system 10 according to the present embodiment generates a search query for searching a document to be searched, searches the document to be searched using the generated search query, and outputs a search result.
  • the business manual that the operator refers to when answering the question from the customer is the text to be searched
  • a search query for searching the business manual is generated, and the business manual is searched using the generated search query.
  • FAQ possible question and its answer
  • the search system 10 has a question determination unit 11, a selection unit 12 as a selection device, a priority determination unit 13 as a priority determination device, and a question candidate acquisition unit 14.
  • the question candidate acquisition unit 14 is an example of the question acquisition unit.
  • the pair output unit 17 is an example of an output unit.
  • the question candidate acquisition unit 14, the search query generation unit 15, the answer candidate acquisition unit 16, and the pair output unit 17 constitute an answer candidate acquisition device 18.
  • the question determination unit 11 inputs a response log recording the response of the operator to the customer.
  • the response log is, for example, recorded data obtained by recording a dialogue between an operator and a customer, text data obtained by converting the recorded data into text by voice recognition, and the like.
  • the question determination unit 11 acquires the question utterance (question sentence) in which the customer utters the question from the input response log.
  • the question determination unit 11 acquires a voice recognition result from the response log using, for example, a model learned by the classifier, and determines whether or not the question is uttered from the voice recognition result. Any known determination device can be used for the determination.
  • the question determination unit 11 outputs the acquired plurality of question utterances (question sentences) to the question candidate acquisition unit 14.
  • the selection unit 12 selects whether or not a word (word or phrase) in the business manual, which is the document to be searched, can be used in the search query, and outputs the selection result to the priority determination unit 13.
  • FIG. 2 is a diagram showing a configuration example of the selection unit 12.
  • the selection unit 12 as a selection device includes a calculation unit 121 and a phrase selection unit 126.
  • the calculation unit 121 calculates the degree of dispersion of the appearance positions of words and phrases in the business manual that is the document to be searched. As shown in FIG. 2, the calculation unit 121 includes a business manual division unit 122, a phrase division unit 123, an appearance frequency calculation unit 124, and an appearance dispersion calculation unit 125.
  • the business manual division unit 122 divides the business manual into line units.
  • the word / phrase division unit 123 divides the business manual divided into lines by the business manual division unit 122 into word / phrase units.
  • the appearance frequency calculation unit 124 calculates the appearance frequency in the business manual for each word / phrase divided by the word / phrase division unit 123.
  • the appearance dispersion calculation unit 125 calculates the degree of dispersion of the appearance position in the business manual for each word based on the appearance frequency and the appearance position of each word calculated by the appearance frequency calculation unit 124.
  • the word selection unit 126 selects whether or not the word can be used in the search query of the word in the business manual based on the degree of dispersion of the appearance position of the word calculated by the calculation unit 121, and determines the selection result as the priority determination unit 13. Output to.
  • the phrase selection unit 126 selects a phrase whose appearance position variance is larger than a predetermined threshold value as a phrase that cannot be used in a search query, and can use a phrase whose appearance position variance is less than or equal to a predetermined threshold value in a search query. Select a phrase.
  • Words and phrases with a large dispersion of appearance positions are words and phrases that frequently appear in the entire business manual. Including such words in your search query is likely to prevent you from getting the right answer to your question.
  • words and phrases that have a large dispersion of appearance positions as words and phrases that cannot be used in the search query words and phrases that frequently appear in the entire business manual can be excluded from the search query.
  • words and phrases that have a small dispersion of appearance positions as words and phrases that can be used in search queries words and phrases that appear frequently in a specific range can be used in search queries even if they appear in the entire business manual. Can be included. Therefore, it is possible to select a phrase that is more suitable for the search query. As a result, it becomes easier to obtain appropriate answers to the questions.
  • the priority determination unit 13 as the priority determination device has a plurality of interrogative sentences or a plurality of question sentences related to the business manual based on the frequency of occurrence of words and phrases selected as available in the search query by the selection unit 12.
  • the priority of words and phrases in the question sentence to be used for the search query is determined, and the result is output to the search query generation unit 15.
  • the priority determination unit 13 raises the priority of a question sentence that is selected to be usable in a search query and includes a phrase that frequently appears in a plurality of question sentences, for example. Further, the priority determination unit 13 raises the priority of the words and phrases selected to be usable in the search query, for example, the words and phrases that frequently appear in a plurality of interrogative sentences.
  • the question candidate acquisition unit 14 aggregates a plurality of question utterances (question sentences) output from the question determination unit 11 for each similar question sentence, and acquires a question cluster composed of a plurality of similar question sentences.
  • the question texts that make up the question cluster are candidate question texts (additional candidate question texts) that are added to the FAQ.
  • word2vec is used to extract words in each question sentence (question utterance), add the word vectors of the extracted words to obtain the utterance vector, and obtain the utterance vector of the obtained vector.
  • cosine similarity There is a method using cosine similarity.
  • the question candidate acquisition unit 14 outputs the acquired question cluster to the search query generation unit 15.
  • the search query generation unit 15 generates a search query for searching a business manual from the question sentences constituting the question cluster output from the question candidate acquisition unit 14.
  • the search query generation unit 15 inputs a plurality of question sentences constituting the question cluster into the priority determination unit 13.
  • the search query generation unit 15 is a query indicating the importance of the question sentence or the phrase included in the question sentence to the search query based on the priority of the question sentence or the phrase included in the question sentence determined by the priority determination unit 13. Calculate the score and generate a search query based on the calculated query score.
  • the search query generation unit 15 calculates the query score higher for the question sentence having the higher priority determined by the priority determination unit 13 among the plurality of question sentences constituting the question cluster, and the representative in the question cluster. Decide on a question.
  • the priority determination unit 13 raises the priority of a question sentence that is selected to be usable in a search query and includes a phrase that frequently appears in a plurality of question sentences, for example. .. Therefore, the search query generation unit 15 is a question sentence (plurality) that is a phrase that can be used in the search query and includes a phrase that frequently appears in other question sentences among the plurality of question sentences that constitute the question cluster.
  • the query score of the question sentence) that is close to the vector average of the question sentence) is calculated high, and the question is determined as the representative question.
  • the search query generation unit 15 may determine two or more question sentences as representative question sentences from a plurality of question sentences constituting the question cluster.
  • the search query generation unit 15 generates a search query by excluding words and phrases selected as unusable for the search query by the selection unit 12 from the determined representative question text.
  • the search query generation unit 15 may generate a search query based on the priority of words and phrases determined by the priority determination unit 13.
  • the search query generation unit 15 calculates, for example, a high query score of a phrase having a high priority, and generates a search query using the phrase having a high query score.
  • the priority determination unit 13 raises the priority of the phrase selected to be usable in the search query, for example, the phrase having a higher frequency of appearance in a plurality of interrogative sentences. Therefore, for example, assuming that the number of question sentences constituting the question cluster is M, the search query generation unit 15 generates a search query using words and phrases included in the question sentences of M / 2 or more.
  • the search query generation unit 15 outputs the generated search query to the answer candidate acquisition unit 16.
  • the search query generation unit 15 may output the query score of the search query (the query score of the question sentence or phrase that is the basis of the search query generation) to the answer candidate acquisition unit 16.
  • the answer candidate acquisition unit 16 searches the business manual using the search query generated by the search query generation unit 15 and acquires the search results.
  • the answer candidate acquisition unit 16 searches the business manual using each of the plurality of search queries and acquires the search results.
  • the search of the business manual using the search query can be performed by, for example, a search in sentence units or a search for partial documents in which sentences are combined in a certain unit.
  • the answer candidate acquisition unit 16 may search the business manual by combining a plurality of methods as described above.
  • the answer candidate acquisition unit 16 may acquire a plurality of (top N) search results.
  • the answer candidate acquisition unit 16 may use a phrase having a small dispersion value of the appearance position or a phrase appearing in the table of contents of the business manual to narrow down the search range.
  • the answer candidate acquisition unit 16 narrows down the search range using these words and phrases, and then performs a search using a search query from which the words and phrases used for narrowing down the search range are removed, so that the range is effective for FAQ maintenance. You can search.
  • the answer candidate acquisition unit 16 outputs the search query and the search result acquired by the search using the search query to the pair output unit 17.
  • the answer candidate acquisition unit 16 usually acquires a plurality of search results, and outputs the acquired plurality of search results to the pair output unit 17.
  • the answer candidate acquisition unit 16 may output the query score of the search query to the pair output unit 17.
  • the pair output unit 17 calculates a search score for each of the plurality of search results for the search query acquired by the answer candidate acquisition unit 16 based on the frequency of appearance of words and phrases in the plurality of search results.
  • the pair output unit 17 determines the output order of the search results for the search query based on the calculated search score, and outputs the pair of the search query and the search result in the determined order.
  • the pair output unit 17 calculates, for example, a high search score of a search result having a lot of duplication with other search results among a plurality of search results. Further, the pair output unit 17 may calculate, for example, the search score of each search result according to the frequency of appearance of words and phrases included in each of the plurality of search results in the plurality of search results.
  • the pair output unit 17 uses the query score of the search query and the search score of the search result obtained by the search by the search query.
  • the output order of the search results may be controlled based on the above.
  • the pair output unit 17 may control the output order of the pair of the search query and the search result based on the multiplication value of the query score and the search score.
  • FIG. 3 is a flowchart showing an example of the operation of the selection unit 12, and is a diagram for explaining a selection method by the selection unit 12 as a selection device.
  • the calculation unit 121 calculates the degree of dispersion of the appearance positions of words and phrases in the business manual that is the document to be searched (step S11). The calculation of the variance of the appearance position will be described in more detail, focusing on the operation of the appearance variance calculation unit 125.
  • the business manual is composed of one html (HyperTextMarkupLanguage) file or a text file.
  • the phrase dividing unit 123 extracts a phrase (noun or the like) by morphological analysis such as mecab, and acquires the extracted phrase and the line number in which the phrase appears.
  • the appearance frequency calculation unit 124 calculates the appearance frequency of the extracted words and phrases.
  • the appearance variance calculation unit 125 calculates the variance of the appearance position of the phrase from the line number where the extracted phrase appears.
  • the variance of the appearance position of the phrase X is calculated by dividing the sum of the differences from the average value of the appearance positions of X by the number of appearances N of the phrase X.
  • the calculation of the variance of the appearance position of a word by the appearance variance calculation unit 125 will be described using the following 10-line business manual as an example.
  • ⁇ Business manual> 1. How to set up the phone. 2. 2. The setting method when receiving a call is as follows. 3. 3. Select the menu, press the settings button, and select Phone ⁇ Incoming call. 4. To set the ringtone, select the ringtone and then the ringtone file. 5. To reject incoming calls, select Reject incoming calls and enter the number to reject. 6. The setting method when making a call is as follows. 7. Select the menu, press the settings button, and select Phone ⁇ Call. 8. To assign a number when making a call, select the numbering setting. 9. Operation method 10. others
  • the appearance variance calculation unit 125 obtains the normalized average appearance position by dividing the average value of the appearance positions of the phrase “telephone” by the total number of lines. Then, the appearance variance calculation unit 125 calculates the squared average of the value obtained by subtracting the normalized average appearance position from the value obtained by dividing the appearance position of the phrase “phone” by the total number of lines, thereby calculating the word “phone”. The variance of the appearance position of "" is calculated.
  • the variance of the appearance position of the phrase "menu” is calculated to be 0.2.
  • the appearance variance calculation unit 125 may calculate, for example, the variance of the appearance position of the phrase for each business manual. Further, in the present embodiment, the calculation unit 121 has described by using an example in which the business manual is divided into line units and the distribution value is calculated based on the line number, but the present invention is not limited to this. The calculation unit 121 may calculate the variance of the appearance position of the phrase by using the sentence number of the serial number assigned from the beginning of the business manual instead of the line number.
  • the phrase selection unit 126 selects whether or not the phrase in the business manual can be used in the search query for searching the business manual based on the calculated degree of dispersion (step S12).
  • the word selection unit 126 obtains, for example, the average value of the variance of the appearance positions of all words. Then, the phrase selection unit 126 selects, for example, a phrase whose appearance position variance is larger than the average value of the appearance position variance of all words as a phrase that cannot be used in the search query, and the variance of the appearance position is all words. Select words that are less than or equal to the average variance of the occurrence position as words that can be used in the search query. In this way, the phrase selection unit 126 selects whether or not the phrase in the business manual, which is the document to be searched, can be used in the search query, based on the degree of dispersion of the phrase. When a plurality of business files exist, the phrase selection unit 126 averages the variance of the appearance position in each business manual calculated for one phrase, and selects whether or not the phrase can be used in the search query. do.
  • the selection method according to the present embodiment is based on the step (calculation step) of calculating the degree of dispersion of the appearance position of the word in the document (business manual) to be searched and the calculated degree of dispersion. Includes a step (selection step) of selecting whether or not the phrase in the document can be used in a search query for searching the document to be searched.
  • FIG. 4 is a flowchart showing an example of the operation of the priority determination unit 13, and is a diagram for explaining a priority determination method by the priority determination unit 13 as a priority determination device.
  • the priority determination unit 13 determines the question text or the question based on the frequency of occurrence of the words and phrases selected to be usable in the search query by the selection method described with reference to FIG. 3 in each of the plurality of question texts related to the business manual.
  • the priority of words and phrases in a sentence is determined (step S21).
  • the priority determination method determines the frequency of appearance of words and phrases selected as usable in the search query by the selection method according to the present embodiment in each of the plurality of question sentences related to the document to be searched. Based on this, it includes a step of determining the priority of a question sentence or a phrase in the question sentence (priority determination step).
  • FIG. 5 is a flowchart showing an example of the operation of the answer candidate acquisition device 18, and is a diagram for explaining a method of acquiring answer candidates by the answer candidate acquisition device 18.
  • the question candidate acquisition unit 14 aggregates a plurality of question sentences output from the question determination unit 11 and acquires a question cluster (step S31).
  • the operator responds to customers by referring to the FAQ that has been prepared.
  • the categories or genres of FAQs to be maintained are determined, questions are extracted for each category or genre, and FAQs are maintained. Therefore, the question determination unit 11 narrows down the file group of the response log, for example, by using a keyword such as "security”.
  • the question determination unit 11 uses an utterance script for product sales or a product name as a keyword, and whether the response log after the place where the keyword appears is a question utterance (question sentence). Input to the question judge to judge whether or not, and acquire the question sentence (question candidate sentence).
  • the question candidate acquisition unit 14 extracts an arbitrary number of important words and phrases from the question candidate sentences.
  • the extracted important words and phrases are words and phrases indicating question candidate sentences.
  • the question candidate acquisition unit 14 used the frequency of appearance of words appearing in all question candidate sentences and the business manual referred to when creating the FAQ answer, and extracted by the inverse of the frequency of appearance of words in the business manual. Words are weighted and the vector expression of each question candidate sentence is obtained.
  • the question candidate acquisition unit 14 acquires a question cluster from the vector-expressed question candidate sentence.
  • the question candidate acquisition unit 14 acquires the question cluster by excluding the question text included in the existing (prepared) FAQ.
  • the question candidate acquisition unit 14 takes time to repeatedly confirm and answer questions from the response logs in the vicinity of the response log in which the question sentence appears. You may also include the part that is mentioned in the question candidate sentence. Specifically, the question candidate acquisition unit 14 uses the response log that is the basis for extracting the question sentence and the number of appearances of other question sentences after the utterance in which the question sentence appears as a score to ask a question. You may take out the candidate sentence. By doing this, it is possible to retrieve important questions that appear infrequently.
  • the search query generation unit 15 generates a search query based on the priority of the plurality of question sentences constituting the question cluster or the words and phrases in the plurality of question sentences determined by the priority determination method according to the present embodiment. (Step S32). For example, the search query generation unit 15 calculates a higher query score for a determined question sentence with a higher priority, and removes words and phrases selected by the selection unit 12 as unusable for a search query from the question sentence having a higher query score. And generate a search query. Further, the search query generation unit 15 calculates the query score higher for words that are selected to be usable for the search query, have a higher frequency of appearance in the question sentence, and have a higher priority, and use words with a higher query score. And generate a search query. In this way, the search query generation unit 15 generates a search query without using the phrase selected as unavailable for the search query.
  • the answer candidate acquisition unit 16 searches the business manual using the generated search query and acquires the search result (step S33).
  • the pair output unit 17 calculates a search score for each of the acquired search results based on the frequency of appearance of words and phrases in the plurality of search results, and determines the output order of the search results for the search query based on the calculated search score. (Step S34).
  • the answer candidate acquisition method includes a step of acquiring a question candidate (question acquisition step), a step of generating a search query (search query generation step), and a step of acquiring a search result (search result). It includes a step (acquisition step) and a step (output step) for controlling the output order of search results.
  • a question acquisition step a question cluster consisting of a plurality of question sentences related to the document to be searched is acquired.
  • search query generation step a plurality of question sentences or a plurality of question sentences or words are prioritized in a plurality of question sentences or a plurality of question sentences constituting the acquired question cluster determined by the priority determination method according to the present embodiment.
  • search result acquisition step the document to be searched is searched using the generated search query, and the search result is acquired.
  • search result acquisition step a search score is calculated for each of the acquired search results based on the frequency of appearance of words and phrases in the plurality of search results, and the output order of the search results for the search query is determined based on the calculated search score.
  • search query generation step multiple search queries may be generated from one question cluster.
  • the business manual in the search result acquisition step, the business manual may be searched using each of the generated plurality of search queries and the search results may be acquired.
  • the search score is calculated for each search result using multiple search queries, and the search result is based on the query score of the search query and the search score of the search result obtained by the search using the search query. The output order may be determined.
  • the word or phrase it is determined whether or not the word or phrase can be used in the search query in the search target document based on the degree of dispersion of the appearance position of the word or phrase in the search target document.
  • a computer to function as each part of the above-mentioned search system 10.
  • a computer stores a program describing processing contents that realize the functions of each part of the search system 10 in the storage unit of the computer, and the CPU (Central Processing Unit) of the computer reads and executes this program. It can be realized by letting it. That is, the program can cause the computer to function as the selection device 12 described above. The program can also cause the computer to function as the prioritization device 13 described above. Alternatively, the program may allow the computer to function as the answer candidate acquisition device 18 described above.
  • this program may be recorded on a computer-readable medium. It can be installed on a computer using a computer-readable medium.
  • the computer-readable medium on which the program is recorded may be a non-transient recording medium.
  • the non-transient recording medium is not particularly limited, but may be, for example, a recording medium such as a CD-ROM or a DVD-ROM. This program can also be provided via a network.
  • each component can be rearranged so as not to be logically inconsistent, and a plurality of components can be combined or divided into one.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

La présente divulgation concerne un dispositif de sélection (12) qui comprend : une unité de calcul (121) qui calcule le degré de dispersion de positions auxquelles une expression apparaît dans un document cible de récupération ; et une unité de sélection d'expression (126) qui sélectionne, sur la base du degré de dispersion calculé, si oui ou non l'expression dans le document cible de récupération doit être utilisée pour une interrogation de récupération pour récupérer le document cible de récupération.
PCT/JP2020/019902 2020-05-20 2020-05-20 Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme WO2021234844A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/019902 WO2021234844A1 (fr) 2020-05-20 2020-05-20 Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/019902 WO2021234844A1 (fr) 2020-05-20 2020-05-20 Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme

Publications (1)

Publication Number Publication Date
WO2021234844A1 true WO2021234844A1 (fr) 2021-11-25

Family

ID=78708302

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/019902 WO2021234844A1 (fr) 2020-05-20 2020-05-20 Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme

Country Status (1)

Country Link
WO (1) WO2021234844A1 (fr)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060080315A1 (en) * 2004-10-08 2006-04-13 The Greentree Group Statistical natural language processing algorithm for use with massively parallel relational database management system
US20080243820A1 (en) * 2007-03-27 2008-10-02 Walter Chang Semantic analysis documents to rank terms
US20120197905A1 (en) * 2011-02-02 2012-08-02 Microsoft Corporation Information retrieval using subject-aware document ranker
US20140244384A1 (en) * 2007-03-23 2014-08-28 Walter Chang Method and apparatus for performing targeted advertising in documents
US20190095802A1 (en) * 2017-09-25 2019-03-28 International Business Machines Corporation Heuristic and non-semantic prediction of the cost to find and review data relevant to a task

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060080315A1 (en) * 2004-10-08 2006-04-13 The Greentree Group Statistical natural language processing algorithm for use with massively parallel relational database management system
US20140244384A1 (en) * 2007-03-23 2014-08-28 Walter Chang Method and apparatus for performing targeted advertising in documents
US20080243820A1 (en) * 2007-03-27 2008-10-02 Walter Chang Semantic analysis documents to rank terms
US20120197905A1 (en) * 2011-02-02 2012-08-02 Microsoft Corporation Information retrieval using subject-aware document ranker
US20190095802A1 (en) * 2017-09-25 2019-03-28 International Business Machines Corporation Heuristic and non-semantic prediction of the cost to find and review data relevant to a task

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KANAKO ONISHI; ICHIRO KOBAYASHI: "An Information Enhancement Technique by using Linked Data", DEIM FORUM 2011, 4 August 2011 (2011-08-04), pages 1 - 8, XP009532380 *
TAKAAKI HASEGAWA: "Automatic Knowledge Assistance System Supporting Operator Responses", NTT TECHNICAL REVIEW, vol. 17, no. 9, 1 September 2019 (2019-09-01), pages 15 - 18, XP055874245 *

Similar Documents

Publication Publication Date Title
CN107832286B (zh) 智能交互方法、设备及存储介质
JP4924950B2 (ja) 質問応答データ編集装置、質問応答データ編集方法、質問応答データ編集プログラム
WO2018034118A1 (fr) Système de dialogue et programme informatique associé
JP6998680B2 (ja) 対話型業務支援システムおよび対話型業務支援プログラム
CN105979376A (zh) 一种推荐方法和装置
US11494434B2 (en) Systems and methods for managing voice queries using pronunciation information
CA2823835C (fr) Recherche et reponse vocale fondee sur la pertinence
JP2017037588A (ja) 情報処理装置及び情報処理プログラム
US20210026890A1 (en) Faq consolidation assistance device, faq consolidation assistance method, and program
KR102348084B1 (ko) 영상표시장치, 영상표시장치의 구동방법 및 컴퓨터 판독가능 기록매체
JP2019207648A (ja) 対話型業務支援システム
US11450306B2 (en) Systems and methods for generating synthesized speech responses to voice inputs by training a neural network model based on the voice input prosodic metrics and training voice inputs
CN110852095B (zh) 语句热点提取方法及系统
WO2019159986A1 (fr) Dispositif de fourniture d'informations, procédé de fourniture d'informations et programme
JP6873805B2 (ja) 対話支援システム、対話支援方法、及び対話支援プログラム
US20210034662A1 (en) Systems and methods for managing voice queries using pronunciation information
CA3143970A1 (fr) Systemes et procedes d'identification de types dynamiques dans des interrogations vocales
WO2021234844A1 (fr) Dispositif de sélection, dispositif de détermination de priorité, dispositif d'acquisition de candidat de réponse, procédé de sélection, procédé de détermination de priorité, procédé d'acquisition de candidat de réponse et programme
CN111460114A (zh) 检索方法、装置、设备及计算机可读存储介质
JP7028746B2 (ja) 質問生成装置および質問生成方法
US20230205794A1 (en) Generating search insight data
US20210035587A1 (en) Systems and methods for managing voice queries using pronunciation information
JP6882975B2 (ja) 対話ログ群からコンテキストを決定可能な対話シナリオ生成装置、プログラム及び方法
CN111354350A (zh) 语音处理方法及装置、语音处理设备、电子设备
US20220207066A1 (en) System and method for self-generated entity-specific bot

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20937107

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20937107

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP