WO2015058604A1

WO2015058604A1 - Apparatus and method for obtaining degree of association of question and answer pair and for search ranking optimization

Info

Publication number: WO2015058604A1
Application number: PCT/CN2014/086838
Authority: WO
Inventors: 孙林; 陈培军; 秦吉胜
Original assignee: 北京奇虎科技有限公司; 奇智软件（北京）有限公司
Priority date: 2013-10-21
Filing date: 2014-09-18
Publication date: 2015-04-30

Abstract

An apparatus and method for obtaining the degree of association of a question and answer pair, a method for the search ranking optimization of the question and answer pair, and an apparatus and method for determining the crawling frequency of a network resource point. The method for obtaining the degree of association of the question and answer pair comprises the following steps: performing a word extraction operation on the question content and answer content of a question and answer pair to be analyzed, to obtain at least one question word to be analyzed and at least one answer word to be analyzed; selecting at least one question and answer knowledge record from a question and answer knowledge library including a plurality of question and answer knowledge records according to the question word to be analyzed and the answer word to be analyzed, and calculating the degree of association of the question and answer pair to be analyzed according to the selected question and answer knowledge record. With the apparatus and method for obtaining the degree of association of the question and answer pair, the quality of the question and answer pair can be evaluated semantically, and the evaluation effect is better; in addition, the apparatus and method are easy to implement and excellent in universality.

Description

Obtaining a device and method for questioning and answering the degree of association and optimizing search ranking

Technical field

The present invention relates to the field of network data communication technologies, and in particular, to an apparatus and method for obtaining a correlation degree of a question and answer pair, an apparatus and method for optimizing a search ranking of a question and answer pair, and a method for determining a frequency of capturing network resource points. Apparatus and method.

Background technique

The Q&A community is a web application that generates content for users. The basic form is that users ask questions according to their own needs, and other users give answers. This form provides a new channel for users to access information on the web. However, since any user is free to create content, the quality of the information in the Q&A community is so different that there are a large number of low-quality Q&A pairs in the Q&A community. This not only brings a lot of inconvenience to users to find information, but also reduces the quality of the Q&A community. At the same time, the prior art method of judging the quality of question and answer depends more on the non-text features of the question and answer pair to evaluate the quality of the question and answer, which will affect its versatility.

In addition, when using the existing search technology for question-and-answer search, there are some low-quality question and answer pairs in the obtained search results, and the prior art method of sorting the search results depends more on the question and answer on the website and question and answer. The non-text features of the pair to sort the question and answer pairs will affect the accuracy and versatility of the search results.

At the same time, when using the existing search technology for question and answer search, it is difficult to judge the quality of the question and answer community as a network resource point. The prior art (for example, a crawler spider) sets the crawl frequency method for the network resource point, and relies more on Q&A analysis of links to websites, such methods are used for question-and-answer searches. They cannot be semantically analyzed. Q&A pairs cannot adjust the frequency of crawling (or crawling fineness, crawling frequency) according to the quality of network resource points. The accuracy and versatility of search results.

Summary of the invention

In view of the above problems, the present invention has been made in order to provide an apparatus and method for obtaining the degree of association of a question and answer pair that overcomes the above problems or at least partially solves the above problems, and an apparatus and method for optimizing a search ranking of a question and answer pair, And an apparatus and method for determining a crawl frequency of a network resource point.

According to an aspect of the present invention, there is provided an apparatus for obtaining a degree of association of a question and answer pair, the apparatus comprising: a question and answer knowledge base adapted to store a plurality of question and answer knowledge records; a word extraction unit adapted to the question and answer pair to be analyzed The problem content and the answer content are subjected to a word extraction operation to obtain at least one question word to be analyzed and at least one answer word to be analyzed; the correlation degree calculating unit is adapted to select at least the question answer knowledge base according to the question word to be analyzed and the answer word to be analyzed. A question and answer knowledge record that calculates the degree of association of the question and answer pairs to be analyzed based on the selected question and answer knowledge record.

According to another aspect of the present invention, there is provided an apparatus for optimizing a search ranking of a question and answer pair, the apparatus comprising: a question and answer knowledge base adapted to store a plurality of question and answer knowledge records; and a search unit adapted to receive a user's search request, Obtaining, according to the user's search request, a plurality of pairs of questions and answers to be analyzed that are matched with the search request; and the calculating unit is configured to acquire, according to the question and answer knowledge base, the degree of association of each question and answer pair to be analyzed; the search ranking unit is adapted to be according to the The degree of association of the question and answer pairs to be analyzed optimizes the search ranking of the question and answer pairs to be analyzed.

According to still another aspect of the present invention, an apparatus for determining a crawling frequency of a network resource point is provided, the apparatus comprising: a question and answer knowledge base adapted to store a plurality of question and answer knowledge records; and a resource analysis unit adapted to be configured by a network resource point Grasping a plurality of pairs of questions to be analyzed; the calculating unit is adapted to obtain an association degree of each question and answer pair to be analyzed according to the question and answer knowledge base; the crawling frequency determining unit determines the association according to the degree of association of the question and answer pairs to be analyzed The frequency of crawling network resource points.

According to another aspect of the present invention, a method for obtaining a degree of association of a question and answer pair is provided, the method comprising the steps of: performing a word extraction operation on a question content and an answer content of the question and answer pair to be analyzed, and obtaining at least one problem to be analyzed a word and at least one word to be analyzed; selecting at least one question and answer knowledge record from the question and answer knowledge base including the plurality of question and answer knowledge records according to the question word to be analyzed and the word to be analyzed, and calculating the question and answer to be analyzed according to the selected question and answer knowledge record The degree of association.

According to still another aspect of the present invention, a method for optimizing a search ranking of a question and answer pair is provided, the method comprising the steps of: receiving a search request of a user, and acquiring a plurality of to-be-matched matches with the search request according to the search request of the user The question and answer pair is analyzed; according to the question and answer knowledge base including the plurality of question and answer knowledge records, the degree of association of each question and answer pair to be analyzed is obtained; and the search ranking of the question and answer pair to be analyzed is optimized according to the degree of association of the question and answer pairs to be analyzed.

According to still another aspect of the present invention, a method for determining a crawling frequency of a network resource point is provided, the method comprising the steps of: capturing, by a network resource point, a plurality of question and answer pairs to be analyzed; according to the plurality of question and answer knowledge records The question and answer knowledge base obtains the degree of association of each question and answer pair to be analyzed; and determines the frequency of the crawling of the network resource points according to the degree of association of the question and answer pairs to be analyzed.

According to the technical solution of the present invention, multiple question and answer pairs are extracted from a webpage containing a question and answer pair, and multiple pieces are constructed according to the extracted question and answer pairs. The question and answer knowledge base of the question and answer knowledge record, the word extraction operation of the question and answer pair of the question and the answer, and at least one word to be analyzed and at least one word to be analyzed are obtained, and then according to the question word to be analyzed and the word to be analyzed Selecting at least one Q&A knowledge record from the Q&A knowledge base and calculating the correlation degree of the Q&A pairs to be analyzed according to the selected Q&A knowledge record can evaluate the quality of the Q&A pair from the semantic aspect and solve the prior art evaluation only on the lexical level. The problem of poor evaluation caused by the quality of the question and answer pair. At the same time, in the case of multiple question and answer pairs to be analyzed that are matched with the search request according to the user's search request, each question and question to be analyzed is obtained according to the question and answer knowledge base. The degree of association of the pair and the search ranking of the question and answer pair to be analyzed according to the degree of association of the question and answer pairs to be analyzed can evaluate the quality of the question and answer pair to be analyzed from the semantic aspect, and solve the problem that the prior art relies on the question and answer on the webpage and question and answer. Pair of non-text features to sort the question and answer pairs The problem of poor sorting effect; further, by grasping a plurality of question and answer pairs to be analyzed by the network resource point, obtaining the correlation degree of each question and answer pair to be analyzed according to the question and answer knowledge base and determining the correlation degree according to the question and answer pair to be analyzed The crawling frequency of the network resource point can determine the crawling frequency by evaluating the quality of the network resource point, and solves the problem that the prior art cannot select the crawling frequency according to the quality of the network resource point. Moreover, the solution of the present application is easy to implement and has high versatility.

The above description is only an overview of the technical solutions of the present invention, and the above-described and other objects, features and advantages of the present invention can be more clearly understood. Specific embodiments of the invention are set forth below.

DRAWINGS

Various other advantages and benefits will become apparent to those skilled in the art from a The drawings are only for the purpose of illustrating the preferred embodiments and are not to be construed as limiting. Throughout the drawings, the same reference numerals are used to refer to the same parts. In the drawing:

1 shows a flow chart of a method of obtaining a degree of association of a question and answer pair, in accordance with one embodiment of the present invention;

Figure 2 shows a detailed flow chart for building a Q&A knowledge base;

FIG. 3 is a schematic diagram showing an explanation model of the question and answer knowledge base obtained by using the steps shown in FIG. 2;

Figure 4 shows a detailed flow chart of step S200 of Figure 1;

FIG. 5 illustrates a block diagram of an apparatus for obtaining a degree of association of a question and answer pair, in accordance with one embodiment of the present invention; FIG.

6 shows a flow chart of a method for optimizing a search ranking of a question and answer pair, in accordance with one embodiment of the present invention;

7 shows a block diagram of an apparatus for optimizing a search ranking of a question and answer pair, in accordance with one embodiment of the present invention;

8 shows a flow chart of a method of determining a crawl frequency of a network resource point, in accordance with one embodiment of the present invention;

9 shows a block diagram of an apparatus for determining a crawl frequency of a network resource point, in accordance with one embodiment of the present invention;

Figure 10 shows a block diagram of an application server for performing the method according to the invention;

Figure 11 shows a storage unit for holding or carrying program code implementing the method according to the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

The existing method of obtaining the degree of association of question and answer pairs is to use text features and non-text features to describe the questions and answers of the question and answer pairs. Similarly, the existing method for obtaining a search ranking of a question and answer pair is to use a text feature and a non-text feature to describe the question and answer pair to rank the question and answer pair, or to answer questions based on the question and answer. Ranking. Text features mainly include textual visual features (such as punctuation density, average word length, text entropy, etc.) and text content features (such as text content word scale, question word density, related word coverage, etc.), and extract Chinese automatic errors widely used. Features (such as single-word density features, etc.); non-text features include user weightedness indicators, answer question status, answer answer time, user relationship interaction features, and so on. After extracting the features from the questions and answers respectively, a problem quality prediction model and an answer quality prediction model are respectively learned on the training set, and the output of the two models is used to evaluate the quality of the question and answer. However, when using the existing method of obtaining the degree of relevance of the question and answer pair to evaluate the quality of the answer, only the relevant word coverage feature is used to describe the semantic matching of the question and answer questions, which is not only at the lexical level. And did not consider the semantic matching of questions and answers. However, the semantic matching of questions and answers is precisely the core of question and answer. For example, the question is “Where is the capital of China?”, the answer 1 is “Beijing” and the answer 2 is “China's capital is Shanghai”. Then the question is “where is the capital of China” after the word segmentation and discarding the stop words, the answer 1 word segmentation result is “Beijing”, and the answer 2 word segmentation result is “China Capital Shanghai”. In the prior art, the semantic matching degree can be defined as: the number of words co-occurring in the question and the answer divided by the number of all the words in the question and the answer. Then the semantic matching degree of question 1 and answer 1 is: 0/4=0. The semantic matching degree of question 2 and answer 2 is: 2/4=0.5. Using the prior art, it is considered that the answer 2 and the question are more matching. And we know that this is obviously not appropriate.

Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While the embodiments of the present invention have been shown in the drawings, the embodiments Rather, these embodiments are provided so that this disclosure will be more fully understood and the scope of the disclosure will be fully disclosed.

1 shows a flow chart of a method of obtaining the degree of association of a question and answer pair, in accordance with one embodiment of the present invention. According to another aspect of the present invention, there is provided a method of obtaining a degree of association of a question and answer pair, the method comprising the following steps S100 and S200:

S100: performing a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtaining at least one question word to be analyzed and at least one answer word to be analyzed.

In an embodiment of the present invention, the word extraction operation of the question content and the answer content of the question and answer pair to be analyzed specifically includes: segmenting the question content and the answer content of the question and answer pair to be analyzed, removing the stop word, and word merge (word Join), and the operation of extracting entity words (such as nouns, verbs, etc.). Then, at least one problem word to be analyzed is obtained from the question content of the question and answer pair to be analyzed, and at least one answer word to be analyzed is obtained from the answer content of the question and answer pair to be analyzed.

S200: Select at least one question and answer knowledge record from the question and answer knowledge base including the plurality of question and answer knowledge records according to the problem word to be analyzed and the answer word to be analyzed, and calculate the correlation degree of the question and answer pair to be analyzed according to the selected question and answer knowledge record.

In step S200 of the embodiment, the problem content and the answer content of the analysis question and answer pair may be analyzed from the semantic aspect by using the question and answer knowledge base to obtain the correlation degree of the question and answer pair to be analyzed, and the evaluation effect is better and easy to implement.

Further, the question and answer knowledge base including a plurality of question and answer knowledge records is obtained by extracting a plurality of question and answer pairs from a webpage having a question and answer pair in advance, and constructing according to the extracted question and answer pairs. In one embodiment of the present invention, when a plurality of question and answer pairs are extracted from a web page having a question and answer pair, the category corresponding to the question and answer pair is captured. Then, when constructing the question and answer knowledge base according to the extracted question and answer pairs, the question and answer knowledge record is constructed according to the question and answer pair and the category corresponding to the question and answer pair. Each question and answer knowledge record in the obtained question and answer knowledge base corresponds to a category, which includes a question word (QW), an answer word (AW), and a semantic relevance between the question word and the answer word. .

By constructing a Q&A knowledge base including multiple Q&A knowledge records by using a large number of high-quality Q&A pairs extracted from web pages, the semantics between problem words and answer words of multiple Q&A knowledge records can be obtained based on the learning of massive information. Correlation; and by building a Q&A knowledge base using information extracted from web pages, the scope of application is broader and the method is more versatile.

Figure 2 shows a detailed flow chart for building a Q&A knowledge base. Specifically, the following steps S310, S320, and S330 are included:

S310. Extract a plurality of question and answer pairs from the webpage containing the question and answer pair in advance, and grab the category corresponding to the question and answer pair.

In this embodiment, by using a web crawler, data may be fetched from a webpage containing a high-quality question and answer pair on the Internet, and a question and answer pair may be extracted to ensure the quality of the extracted question and answer pair; the webpage including the high-quality question and answer pair includes cQA (Customer Quality Assurance) community, major professional forums, etc., can use the floor identification technology, according to the landlord (that is, the first user to post a question), the first floor, 2nd floor (ie in order The user who replies to the post) waits for the content of the reply as the answer to extract the question and answer pair. Since the webpage containing the high-quality question and answer pair includes the category information corresponding to each question and answer pair, the category corresponding to the question and answer pair can be grasped together while the question and answer pair is captured.

S320. For each question and answer pair, perform a word extraction operation on the question content and the answer content of the question and answer pair to obtain a question word set and an answer word set; and each of the question words and the answer word set in the question word set The answer words form an information record on each category corresponding to the question and answer pair.

In an embodiment of the present invention, the word extraction operation is performed on the question content and the answer content of each question and answer pair in the question and answer pairs extracted in step S310, specifically including the question content and the answer content of the question and answer pair. Word segmentation, removal of stop words, word merging, and operations for extracting entity words.

Then, at least one question word is obtained from the question content of each question and answer pair, and at least one answer word is obtained from the answer content of each question and answer pair, and the category set <C ₁ ,..., C _k ,... for the question and answer pair can be obtained. C _p >, question word set <QW ₁ ,...,QW _i ,...,QW _m >and answer word set <AW ₁ ,...,AW _j ,...,AW _n >.

Forming an information record on each of the question words (QW _i ) in the set of question words and each answer word (AW _j ) in the set of answer words, respectively, on each category (C _k ) corresponding to the question and answer pair, For example, <QW _i , AW _j , C _k >, then m*n*p information records can be formed.

S330. For each piece of information record, perform the following operations: calculate a probability that the answer word belongs to the category, calculate a degree of specificity of the answer word to the question word in the category, and calculate the problem word in the category. The strength of the answer word is explained; the above probability, the degree of specificity and the intensity are multiplied, and the obtained product is the semantic relevance of the answer word and the question word; the question word, the answer word and its semantic relevance A question and answer knowledge record <QW _i , AW _j , weight(QW _i , AW _j )> or <QW _i , AW _j , C _k , weight(QW _i , AW _j )> corresponding to the category is formed. Step S330 in this embodiment may be performed based on the mass information record after the massive question and answer pair obtained from the web page is subjected to the word extraction operation as described in step S320 to obtain a massive information record. The semantic relevance obtained based on massive information records is more accurate.

Preferably, the calculating the probability that the answer word belongs to the category includes:

The calculating the degree of specificity of each answer word on the question word in the category includes:

The calculating the strength of the question word in the category to be explained by each answer word, specifically comprising:

Multiply the above probability, specificity and intensity, including:

Weight(QWi, AWj|C=Ck)=P(Ck|AWj)*specific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C _k ) represents the probability of occurrence of the category C _k ; P(AW _j ) represents the probability that the answer is AW _j ; P(AW _j |C _k ) represents the probability that the C _k category belongs to AW _j ;

#(QW _i , AW _j ) indicates the number of times the question word is QW _i and the answer word is AW _j ;

#(AW _j ) indicates the number of times the answer word is AW _j .

From step S310, step S320 and step S330, a question and answer knowledge record can be obtained to construct a question and answer knowledge base. Figure 3 shows a schematic diagram of an explanatory model of a question and answer knowledge base obtained using the steps shown in Figure 2. It can be seen that for each question word QW _i , n question and answer knowledge records can be obtained for each of the category sets <C ₁ , . . . , C _k , . . . , C _p >. Of course, those skilled in the art can understand that if the calculated semantic relevance is 0, the corresponding question and answer knowledge record can be deleted; further, if the number of question and answer knowledge records in the question and answer knowledge base is too large, the question and answer knowledge is stored. The overhead of recording and calculating the degree of association of the question and answer pairs to be analyzed is too large, and a threshold can be preset, and the question and answer knowledge record whose semantic relevance is less than the threshold is deleted to reduce the overhead.

FIG. 4 shows a detailed flowchart of step S200 in FIG. 1. After obtaining at least one problem word to be analyzed and at least one word to be analyzed by step S100, step S200 specifically includes the following steps S210, S220, and S230:

S210: Select a question and answer knowledge record that matches the problem words included in the problem word to be analyzed and the included answer words and the answer words to be analyzed. In this embodiment, the matching of the problem words and the problem words to be analyzed refers to the sub-strings of the problem words to be analyzed and the problem words to be analyzed or the problem words to be analyzed are problem words; the matching words and the words to be analyzed match the words to be analyzed and The answer word is the same or the answer word to be analyzed is a substring of the answer word. In this embodiment, through step S210, a field matching or field search method is used to select a part of the question and answer knowledge record related to the question and answer pair to be analyzed from the question and answer knowledge base. .

S220. According to the question and answer knowledge record corresponding to the same category in the selected question and answer knowledge record, obtain the degree of association of the question and answer pairs to be analyzed for each category, and specifically include: the selected question and answer knowledge record corresponds to the same category The semantic relevance of the Q&A knowledge record is weighted and added, and the degree of association of the question and answer pairs to be analyzed for each category is obtained.

In this embodiment, the Q&A knowledge records selected by step S210 are grouped according to their corresponding categories, and the Q&A knowledge records corresponding to the same category are grouped; the semantic relevance of each group of Q&A knowledge records is weighted (for example, And adding a weight of 1 or 100), obtaining the degree of association of the question and answer pair to be analyzed for the category; thereby obtaining at least one (the number of degrees of association in the embodiment is the corresponding category of the question and answer pair to be analyzed The number) the degree of association.

S230. Select the maximum value of the correlation degree of the question and answer pairs to be analyzed for each category, and use the maximum value as the correlation degree of the question and answer pair to be analyzed.

Figure 5 illustrates a block diagram of an apparatus for obtaining the degree of association of a question and answer pair, in accordance with one embodiment of the present invention. The apparatus includes a question and answer knowledge base 100, a word extraction unit 200, and an associated degree calculation unit 300.

The question and answer knowledge base 100 is adapted to store a plurality of question and answer knowledge records; the question and answer knowledge base 100 of the present embodiment can be constructed by crawling a large number of question and answer pairs in the web page.

The word extracting unit 200 is adapted to perform a word extracting operation on the question content and the answer content of the question and answer pair to be analyzed, and obtain at least one question word to be analyzed and at least one answer word to be analyzed.

In an embodiment of the present invention, the word extracting unit 200 is adapted to perform word segmentation, remove stop words, word join, and extract entity words (for example, nouns) for the question content and the answer content of the question and answer pair to be analyzed. The operation of the verb, etc.) to obtain at least one word to be analyzed and at least one word to be analyzed.

The association degree calculation unit 300 is adapted to select at least one question and answer knowledge record from the question and answer knowledge base according to the problem word to be analyzed and the answer word to be analyzed, and calculate the correlation degree of the question and answer pair to be analyzed according to the selected question and answer knowledge record.

In an embodiment of the present invention, the correlation degree calculation unit 300 is adapted to select a question and answer knowledge record whose question words are matched with the question words to be analyzed and the included answer words match the answer words to be analyzed. In this embodiment, the matching of the problem words and the problem words to be analyzed refers to the sub-strings of the problem words to be analyzed and the problem words to be analyzed or the problem words to be analyzed are problem words; the matching words and the words to be analyzed match the words to be analyzed and The answer word is the same or the answer word to be analyzed is a substring of the answer word; according to the Q&A knowledge record corresponding to the same category in the selected question and answer knowledge record, the relevance of the question and answer pair to be analyzed for each category is obtained, more specific And adding the semantic relevance weights (for example, the weights of 1 or 100) corresponding to the same category of question and answer knowledge records in the selected question and answer knowledge records to obtain the association of the question and answer pairs to be analyzed respectively for each category. Degree, thereby obtaining at least one (the number of degrees of association in the embodiment, that is, the number of categories to be analyzed, the number of categories to be analyzed); the above-mentioned question and answer pairs to be analyzed are selected for each category The maximum value of the degree of association, with the maximum value as the degree of association of the question and answer pairs to be analyzed.

Using the question and answer knowledge base 100, the word extracting unit 200, and the associated degree calculating unit 300, selecting at least one question and answer knowledge record from the question and answer knowledge base by using the question word to be analyzed and the answer word to be analyzed, and calculating according to the selected question and answer knowledge record The degree of correlation between the question and answer pairs to be analyzed can be analyzed from the semantic aspect of the analysis question and answer pair. The evaluation effect is better and easier to implement. By using the information extracted from the web page to construct the question and answer knowledge base, the scope of application is wider and versatile. Stronger.

In this embodiment, the device further includes a question and answer knowledge base construction unit 400, and the question and answer knowledge base construction unit 400 is adapted to extract a plurality of question and answer pairs from the webpage containing the question and answer pair in advance, and construct a plurality of question and answer knowledge according to the extracted question and answer pairs. Recorded Q&A knowledge base. In the device shown in FIG. 5, the Q&A knowledge base is existing. Since the amount of information of the actual network is increasing, the information content changes rapidly, and the content of the Q&A knowledge base often needs to be updated, by adding a Q&A knowledge base building unit 400. Build (or update) the Q&A knowledge base to ensure the immediacy and reliability of the content of the Q&A knowledge base.

Preferably, when a plurality of question and answer pairs are extracted from the web page containing the question and answer pair, the question and answer knowledge base construction unit 400 grabs the category corresponding to the question and answer pair. In this embodiment, by using a web crawler, data may be fetched from a webpage containing a high-quality question and answer pair on the Internet, and a question and answer pair may be extracted to ensure the quality of the extracted question and answer pair; the webpage including the high-quality question and answer pair includes cQA community, major professional forums, etc. Since the webpage containing the high quality question and answer pair includes category information corresponding to each question and answer pair, the question and answer knowledge base construction unit 400 can grab the category corresponding to the question and answer pair while grabbing the question and answer pair.

In this embodiment, the question and answer knowledge base construction unit 400 is adapted to perform the following operations on each question and answer pair: performing a word extraction operation on the question content and the answer content of the question and answer pair to obtain a question word set and an answer word set, specifically The question and answer knowledge base construction unit 400 performs the word segmentation, the removal of the stop word, the word combination, and the operation of extracting the entity word for the problem content and the answer content of each of the question and answer pairs in the extracted question and answer pairs to obtain the question words and answers. a word; each of the question words in the set of question words and each answer word in the set of answer words form an information record on each of the categories corresponding to the question and answer pair. The question and answer knowledge base construction unit 400 is adapted to record, for each piece of information, an operation of calculating a probability that the answer word belongs to the category, and calculating a degree of specificity of the answer word to the question word on the category, The strength of the question word in the category to be explained by the answer word; multiplying the above probability, the degree of specificity and the intensity, the product obtained is the semantic relevance of the answer word and the question word; The answer words and their semantic relevance form a question and answer knowledge record corresponding to the category.

More specifically, the question and answer knowledge base construction unit 400 is adapted to calculate the probability that the answer word belongs to the category according to the following method:

More specifically, the question and answer knowledge base construction unit 400 is adapted to calculate the degree of specificity of the interpretation of the question words by the respective answer words on the category according to the following method:

More specifically, the question and answer knowledge base construction unit 400 is adapted to calculate the strength of the problem words explained by the respective answer words on the category according to the following method:

More specifically, the question and answer knowledge base construction unit 400 is adapted to multiply the above probability, specific degree, and intensity according to the following method:

Weight(QWi, AWj|C=Ck)=P(Ck|AWj)*specific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

#(AW _j ) indicates the number of times the answer word is AW _j .

The following can be used to illustrate the effects that can be achieved by using the embodiments of the present invention, such as the following question and answer pairs, the category is "medical health":

Through the word segmentation technology, the words to be analyzed and the words to be analyzed are as follows:

As can be seen from the word segmentation results, there is no relevant word coverage in the questions and answers, so if the existing technology is used, it is easy to think that the question and answer is low in relevance and low in quality. However, it is obvious that the question and answer pair is a high-quality question and answer pair.

If the method and apparatus of the present invention are used to process the above question and answer pairs, first, an existing Q&A knowledge base may be retrieved, or a Q&A knowledge base may be constructed by grasping the QQA community and the Q&A pairs of the major professional forums;

The second step is to answer the question and answer pair to be analyzed. After the word extraction operation, the word set to be analyzed is obtained. <Child, cough, snot>, the answer word set to be analyzed <symptoms, drugs, treatment, anti-virus, pediatric cold particles, description , dosage, cough, Chinese medicine, granules, antibiotics, amoxicillin, amoxicillin granules, granules, oral, roxithromycin, efficacy>, and the type of question and answer pair to be analyzed is “medical health”;

In the third step, according to the words to be analyzed and the category, a plurality of question and answer knowledge records matching the problem words and the words to be analyzed are selected from the question and answer knowledge base, thereby obtaining the following answer words and semantic relevance (for convenience of reading, The values of the semantic relevance in the table are the values that have been properly normalized):

In the fourth step, according to the answer words to be analyzed in the set of answers to be analyzed, based on the Q&A knowledge records selected in the third step, the Q&A knowledge records including the answer words and the answers to be analyzed are selected, and further Get the semantic relevance of the selected question and answer knowledge records. According to the analysis, the answers to the answers in this example that match the answer words in the Q&A knowledge record include: <Oral, Kechuan, Pediatric cold particles, examination, cough, treatment, flu symptoms, cold particles>.

The degree of correlation of the question and answer pairs to be analyzed may be calculated, and the degree of correlation of the question and answer pairs to be analyzed reaches 0.9 (under the condition that the correlation degree ranges from 0 to 1).

6 shows a flow chart of a method of optimizing a search ranking of a question and answer pair, in accordance with one embodiment of the present invention. The method includes the following steps S610, S620, and S630:

S610. Receive a search request of the user, and obtain a plurality of question and answer pairs to be analyzed that match the search request according to the search request of the user.

In an embodiment of the present invention, the network search technology may be used, for example, using a question and answer pair search engine to obtain a question and answer pair to be analyzed according to the user's search request.

S620: Obtain an association degree of each question and answer pair to be analyzed according to a Q&A knowledge base including a plurality of Q&A knowledge records.

In step S620 of the embodiment, the question content and the answer content of the question and answer pair may be analyzed from the semantic aspect by using the question and answer knowledge base to obtain the correlation degree of the question and answer pair to be analyzed, and the evaluation effect is better and easy to implement.

More specifically, the specific implementation manner of obtaining the degree of association of the question and answer pair to be analyzed in step S620 of the embodiment is substantially the same as the method of obtaining the degree of association of the question and answer pair as shown in FIG. repeat.

Further, the question and answer knowledge base including a plurality of question and answer knowledge records is obtained by extracting a plurality of question and answer pairs from a webpage having a question and answer pair in advance, and constructing according to the extracted question and answer pairs. In one embodiment of the present invention, when a plurality of question and answer pairs are extracted from a web page having a question and answer pair, the category corresponding to the question and answer pair is captured. Then, when constructing the question and answer knowledge base according to the extracted question and answer pairs, the question and answer knowledge record is constructed according to the question and answer pair and the category corresponding to the question and answer pair. Each question and answer knowledge record in the obtained question and answer knowledge base corresponds to a category, which includes a question word (QW), an answer word (AW), and a semantic relevance between the question word and the answer word. . By constructing a Q&A knowledge base including multiple Q&A knowledge records by using a large number of high-quality Q&A pairs extracted from web pages, the semantics between problem words and answer words of multiple Q&A knowledge records can be obtained based on the learning of massive information. Correlation; by using the information extracted from the web page to build a question-and-answer knowledge base, the scope of application is broader, and the method is more versatile.

More specifically, the method of the embodiment further includes the step of constructing the question and answer knowledge base, and the process of constructing the question and answer knowledge base is substantially the same as the process shown in FIG. 2; the interpretation model of the question and answer knowledge base of the present embodiment is as shown in FIG. The interpretation model is roughly the same. It will not be repeated here.

S630. Optimize a search ranking of the pair of questions to be analyzed according to the degree of association of the question and answer pairs to be analyzed.

Since the degree of association of the question and answer pairs to be analyzed reflects the quality, the search ranking of the question and answer pair to be analyzed can be optimized by using the degree of association, and the ranking effect is better.

The specific method may be the search ranking of the question-and-answer pair to be analyzed in the order of the degree of association of the question-and-answer pairs to be analyzed, that is, the search ranking of the question-and-answer pair with a high degree of relevance is ranked first; or may be based on the search first The ranking technique initially arranges the website to which the question and answer pair to be analyzed belongs, and calculates a search ranking of the pair of questions to be analyzed according to the degree of association between the sequence number of the preliminary arrangement and the question and answer pair to be analyzed, for example, the waiting The analysis question and answer is multiplied by the degree of association of the preliminary arrangement of the website to which it belongs, and the order of the result of the multiplication operation is used as the search ranking of the question and answer pair to be analyzed; The quality of the pair and the row of the website to which it belongs The combination of names, sorting pairs of questions and answers to be analyzed, users can get better results sorting quality when using Q&A.

7 shows a block diagram of an apparatus for optimizing a search ranking of a question and answer pair, in accordance with one embodiment of the present invention. The device includes a question and answer knowledge base 710, a search unit 720, a calculation unit 730, and a search ranking unit 740.

The question and answer knowledge base 710 is adapted to store a plurality of question and answer knowledge records. The question and answer knowledge base 710 of the present embodiment can be constructed by crawling a massive question and answer pair in a web page.

The searching unit 720 is adapted to receive a search request of the user, and obtain a plurality of question and answer pairs to be analyzed that match the search request according to the search request of the user.

In an embodiment of the present invention, the search unit 720 may be a question and answer pair search engine, and obtain a question and answer pair to be analyzed according to the user's search request; for example, the search unit 720 is a web search engine for question and answer search, and the receiving user passes The search request entered by the browser and the question and answer pair to be analyzed.

The calculating unit 730 is adapted to obtain the degree of association of each question and answer pair to be analyzed according to the question and answer knowledge base 710.

The calculation unit 730 of the present invention can analyze the problem content and the answer content of the analysis question and answer pair from the semantic aspect by using the question and answer knowledge base to obtain the correlation degree of the question and answer pair to be analyzed, and the evaluation effect is better and easy to implement. The question and answer knowledge base 710 constructs and includes a plurality of question and answer knowledge records using a large number of high quality question and answer pairs extracted from web pages, and can acquire semantics between problem words and answer words of multiple question and answer knowledge records based on learning of massive information. relativity.

The search ranking unit 740 is adapted to optimize the search ranking of the question and answer pair to be analyzed according to the degree of association of the question and answer pairs to be analyzed.

Since the degree of association of the question and answer pairs to be analyzed reflects the quality, the search ranking of the question and answer pair to be analyzed can be optimized by using the degree of association, and the ranking effect is better. The specific method may be the search ranking of the question-and-answer pair to be analyzed in the order of the degree of association of the question-and-answer pairs to be analyzed, that is, the search ranking of the question-and-answer pair with a high degree of relevance is ranked first; or may be based on the search first The ranking technique initially arranges the website to which the question and answer pair to be analyzed belongs, and calculates a search ranking of the pair of questions to be analyzed according to the degree of association between the sequence number of the preliminary arrangement and the question and answer pair to be analyzed, for example, the waiting The analysis question and answer is multiplied by the degree of association of the preliminary arrangement of the website to which it belongs, and the order of the result of the multiplication operation is used as the search ranking of the question and answer pair to be analyzed.

In this embodiment, the apparatus further includes a question and answer knowledge base construction unit 750, wherein the question and answer knowledge base construction unit 750 is adapted to extract a plurality of question and answer pairs from the webpage containing the question and answer pair in advance, and construct a plurality of question and answer knowledge according to the extracted question and answer pairs. Recorded Q&A knowledge base. In the device shown in FIG. 7, the Q&A knowledge base 710 is already existing. Since the information volume of the actual network is increasing, the information content changes rapidly, and the content of the Q&A knowledge base 710 often needs to be updated. The knowledge base building unit 750 constructs (or updates) the question and answer knowledge base 710, which can ensure the immediacy and reliability of the content of the question and answer knowledge base 710. The question and answer knowledge base construction unit 750 of the present embodiment is the same as the question and answer knowledge base construction unit 400 shown in FIG. 5, and the description thereof will not be repeated here.

The calculation unit 630 in FIG. 7 specifically includes a word extraction subunit and an associated degree calculation subunit (not shown).

The word extraction subunit is adapted to perform the word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtain at least one question word to be analyzed and at least one answer word to be analyzed.

In an embodiment of the present invention, the word extraction subunit is adapted to perform word segmentation, remove stop words, word join, and extract entity words (eg, nouns, the question content and the answer content of the question and answer pair to be analyzed. The operation of the verb, etc.) to obtain at least one word to be analyzed and at least one word to be analyzed.

The correlation degree calculation subunit is adapted to select at least one question and answer knowledge record from the question and answer knowledge base according to the problem word to be analyzed and the answer word to be analyzed, and calculate the correlation degree of the question and answer pair to be analyzed according to the selected question and answer knowledge record.

In an embodiment of the present invention, the correlation degree calculation subunit is adapted to select a question and answer knowledge record whose question words are matched with the question word to be analyzed and the included answer words match the answer words to be analyzed. In this embodiment, the matching of the problem words and the problem words to be analyzed refers to the sub-strings of the problem words to be analyzed and the problem words to be analyzed or the problem words to be analyzed are problem words; the matching words and the words to be analyzed match the words to be analyzed and The answer word is the same or the answer word to be analyzed is a substring of the answer word; according to the Q&A knowledge record corresponding to the same category in the selected question and answer knowledge record, the relevance of the question and answer pair to be analyzed for each category is obtained, more specific And adding the semantic relevance weights (for example, the weights of 1 or 100) corresponding to the same category of question and answer knowledge records in the selected question and answer knowledge records to obtain the association of the question and answer pairs to be analyzed respectively for each category. Degree, thereby obtaining at least one (the number of degrees of association in the embodiment, that is, the number of categories to be analyzed, the number of categories to be analyzed) is associated; selecting the above-mentioned question and answer pairs to be analyzed is the largest degree of association for each category The value, with the maximum value as the degree of association of the question and answer pairs to be analyzed.

FIG. 8 illustrates a flow chart of a method of determining a crawl frequency of a network resource point, in accordance with one embodiment of the present invention. The method includes the following steps S810, S820, and S830:

S810. The plurality of to-be-analyzed question and answer pairs are captured by the network resource point.

In an embodiment of the present invention, it may be a network resource point for determining a specific fetching frequency, for example, a Q&A community that needs to determine a fetching frequency, using a floor identification technology, according to the landlord (ie, the first post for a question) The user asks questions, and the content of the reply on the 2nd floor of the 1st floor (that is, the user who replies to the post in order) is the answer, to extract the question and answer pair to be analyzed.

S820. Obtain an association degree of each question and answer pair to be analyzed according to a Q&A knowledge base including a plurality of Q&A knowledge records.

In step S820 of the embodiment, the question content and the answer content of the question and answer pair may be analyzed semantically by using the question and answer knowledge base. The analysis is performed to obtain the degree of correlation of the question and answer pairs to be analyzed, and the evaluation effect is better and easier to implement.

More specifically, the specific implementation manner of obtaining the degree of association of the question and answer pair to be analyzed in step S820 of the embodiment is substantially the same as the method of obtaining the degree of association of the question and answer pair as shown in FIG. repeat.

More specifically, the method of the embodiment further includes the step of constructing a question and answer knowledge base, wherein the process of constructing the question and answer knowledge base is substantially the same as the process shown in FIG. 2; the interpretation model of the question and answer knowledge base of the present embodiment is as shown in FIG. 3 The explanatory models shown are roughly the same. It will not be repeated here.

S830. Determine a frequency of capturing the network resource point according to the correlation degree of the question and answer pair to be analyzed.

Since the degree of association of the question and answer pairs to be analyzed reflects the quality, the quality of the network resource points can be determined by using the correlation degree of the plurality of question and answer pairs to be analyzed, thereby determining the frequency of the network resource points.

The specific method may be that the average value of the correlation degree of the pair of questions to be analyzed is used as the crawling frequency of the network resource point, that is, the network resource point with a large average value (ie, good quality) of the associated degree The higher the frequency (for example, the frequency at which the spider crawler crawls the network resource point is high); or the spider crawler may be used to obtain the initial crawl frequency of the network resource point, and calculate the correlation degree of the question and answer pair to be analyzed. An average value, using the average value to adjust the initial crawl frequency to determine a crawl frequency of the network resource point, for example, an spider crawler may be used to obtain an initial crawl frequency of the network resource point, using the correlation degree The average value of the initial capture frequency is weighted (including multiplication, normalization, etc.) to determine the capture frequency of the network resource point, so that the capture frequency of the high-quality network resource point is improved, thereby optimizing Search quality.

In this embodiment, the correlation degree of the question and answer pair to be analyzed is analyzed by the network resource point, and the crawling frequency of the network resource point is determined according to the degree of association, so that the accuracy of the crawling result can be improved.

9 shows a block diagram of an apparatus for determining a crawl frequency of a network resource point, in accordance with one embodiment of the present invention. The apparatus includes a question and answer knowledge base 91, a resource analysis unit 920, a calculation unit 930, and a capture frequency acquisition unit 940.

The Q&A knowledge base 910 is adapted to store a plurality of Q&A knowledge records. The question and answer knowledge base 910 of the present embodiment can be constructed by crawling a large number of question and answer pairs in a web page.

The resource analysis unit 920 is adapted to capture a plurality of question and answer pairs to be analyzed by the network resource point.

In an embodiment of the present invention, the resource analysis unit 920 may determine a network resource point of a capture frequency for a specific need, for example, a question and answer community that needs to determine a crawl frequency, and use a floor identification technology according to the landlord (ie, for a problem first) The user who posts the question) asks questions, and the content of the reply on the 1st floor and the 2nd floor (that is, the user who replies to the post in order) is the answer, to extract the question and answer pair to be analyzed.

The calculating unit 930 is adapted to obtain the degree of association of each question and answer pair to be analyzed according to the question and answer knowledge base.

The calculation unit 930 of the present invention can analyze the problem content and the answer content of the analysis question and answer pair from the semantic aspect by using the question and answer knowledge base to obtain the correlation degree of the question and answer pair to be analyzed, and the evaluation effect is better and easy to implement. The Q&A knowledge base 910 is constructed using a large number of high-quality Q&A pairs extracted from web pages and includes a plurality of Q&A knowledge records, which can acquire semantics between problem words and answer words of multiple Q&A knowledge records based on learning of massive information. relativity.

The capture frequency determining unit 940 is adapted to determine a crawling frequency of the network resource point according to the correlation degree of the question and answer pair to be analyzed.

Since the degree of association of the question and answer pairs to be analyzed reflects the quality, the quality of the network resource points can be determined by using the correlation degree of the plurality of question and answer pairs to be analyzed, thereby determining the frequency of the network resource points. The specific method may be that the average value of the correlation degree of the pair of questions to be analyzed is used as the crawling frequency of the network resource point, that is, the network resource point with a large average value (ie, good quality) of the associated degree The higher the frequency (for example, the frequency at which the spider crawler crawls the network resource point is high); or the spider crawler may be used to obtain the initial crawl frequency of the network resource point, and calculate the correlation degree of the question and answer pair to be analyzed. An average value, using the average value to adjust the initial crawl frequency to determine a crawl frequency of the network resource point, for example, an spider crawler may be used to obtain an initial crawl frequency of the network resource point, using the correlation degree The average value of the initial capture frequency is weighted (including multiplication, normalization, etc.) to determine the capture frequency of the network resource point, so that the capture frequency of the high-quality network resource point is improved, thereby optimizing Search quality.

In this embodiment, the apparatus further includes a question and answer knowledge base construction unit 950, and the question and answer knowledge base construction unit 950 is adapted to extract a plurality of question and answer pairs from the webpage containing the question and answer pair in advance, and construct a plurality of question and answer knowledge according to the extracted question and answer pairs. Recorded Q&A knowledge base. In the apparatus shown in FIG. 9, the Q&A knowledge base 910 is existing. Since the amount of information of the actual network is increasing, the information content changes rapidly, and the content of the Q&A knowledge base 910 often needs to be updated. The knowledge base building unit 950 builds (or updates) the Q&A knowledge base to ensure the immediacy and reliability of the content of the Q&A knowledge base. The question and answer knowledge base construction unit 950 of the present embodiment is the same as the question and answer knowledge base construction unit 400 shown in FIG. 5, and the description thereof will not be repeated here.

The calculation unit 930 in FIG. 9 specifically includes a word extraction subunit and an associated degree calculation subunit (not shown).

The various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. It should be understood by those skilled in the art that a microprocessor or digital signal processor (DSP) can be used in practice to implement a device for obtaining the degree of association of a question and answer pair according to an embodiment of the present invention, and a device for optimizing search ranking of a question and answer pair. And some or all of the functions of some or all of the means for determining the frequency of crawling of network resource points. The invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.

For example, FIG. 10 illustrates a method for performing an association degree of obtaining a question and answer pair according to the present invention, a method of optimizing a search ranking of a question and answer pair, and a server for determining a frequency of crawling a network resource point, such as an application server. Block diagram. The application server traditionally includes a processor 1010 and a computer program product or computer readable medium in the form of a memory 1020. The memory 1020 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM. The memory 1020 has a memory space 1030 for executing program code 1031 of any of the above method steps. For example, storage space 1030 for program code may include various program code 1031 for implementing various steps in the above methods, respectively. The program code can be read from or written to one or more computer program products. These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks. Such a computer program product is typically a portable or fixed storage unit as described with reference to FIG. The storage unit may have a storage section, a storage space, and the like arranged similarly to the storage 1020 in the application server of FIG. The program code can be compressed, for example, in an appropriate form. Typically, the storage unit includes computer readable code 1131 ', ie, code that can be read by, for example, a processor, such as processor 1010, which, when executed by a server, causes the server to perform each of the methods described above. step.

"an embodiment," or "an embodiment," or "an embodiment," In addition, it is noted that the phrase "in one embodiment" is not necessarily referring to the same embodiment.

In the description provided herein, numerous specific details are set forth. However, it is understood that the embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of the description.

It is to be noted that the above-described embodiments are illustrative of the invention and are not intended to be limiting, and that the invention may be devised without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as a limitation. The word "comprising" does not exclude the presence of the elements or steps that are not recited in the claims. The word "a" or "an" The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means can be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.

In addition, it should be noted that the language used in the specification has been selected for the purpose of readability and teaching, and is not intended to be construed or limited. Therefore, many modifications and changes will be apparent to those skilled in the art without departing from the scope of the invention. The disclosure of the present invention is intended to be illustrative, and not restrictive, and the scope of the invention is defined by the appended claims.

Claims

A device for obtaining the degree of association of a question and answer pair, the device comprising:

Question and answer knowledge base, suitable for storing multiple Q&A knowledge records;

a word extracting unit, configured to perform a word extracting operation on the question content and the answer content of the question and answer pair to be analyzed, to obtain at least one question word to be analyzed and at least one answer word to be analyzed;

The correlation degree calculation unit is adapted to select at least one question and answer knowledge record from the question and answer knowledge base according to the problem word to be analyzed and the answer word to be analyzed, and calculate the correlation degree of the question and answer pair to be analyzed according to the selected question and answer knowledge record.
The apparatus of claim 1, wherein the apparatus further comprises a question and answer knowledge base building unit,

The question and answer knowledge base construction unit is adapted to extract a plurality of question and answer pairs from a webpage having a question and answer pair in advance, and construct a question and answer knowledge base including a plurality of question and answer knowledge records according to the extracted question and answer pairs;

The question and answer knowledge base construction unit is further adapted to: when extracting a plurality of question and answer pairs from the webpage having the question and answer pair, grab the category corresponding to the question and answer pair;

The question and answer knowledge base construction unit is further adapted to construct a question and answer knowledge record according to the question and answer pair and the category corresponding to the question and answer pair when constructing the question and answer knowledge base according to the extracted question and answer pair; each question and answer knowledge record corresponds to a category, Each includes a question word, an answer word, and a semantic relevance between the question word and the answer word.
The device according to claim 1 or 2, wherein

The correlation degree calculation unit is adapted to select a question and answer knowledge record whose question words are matched with the question word to be analyzed and the included answer words and the answer words to be analyzed match; according to the selected question and answer knowledge records, the same corresponds to the same The question and answer knowledge record of the category, the degree of association of the question and answer pairs to be analyzed for each category is obtained; the maximum value of the correlation degree of the question and answer pairs to be analyzed for each category is selected, and the maximum value is used as the question and answer pair to be analyzed. The degree of association.
The device according to claim 2, wherein

The question and answer knowledge base building unit is adapted to perform the following operations on each question and answer pair:

Performing a word extraction operation on the question content and the answer content of the question and answer pair to obtain a question word set and an answer word set; respectively, each question word in the question word set and each answer word in the answer word set are respectively associated with the question and answer pair Forming an information record on each of the corresponding categories;

The question and answer knowledge base building unit is adapted to record each piece of information and perform the following operations:

Calculating a probability that the answer word belongs to the category, calculating a degree of specificity of the answer word to the question word on the category, and calculating an intensity of the question word using the answer word in the category; The degree of specificity is multiplied by the intensity, and the resulting product is the semantic relevance of the answer word and the question word; the question word, the answer word, and its semantic relevance form a question and answer knowledge record corresponding to the category.
A device according to any one of claims 1 to 4, wherein

The association degree calculation unit is adapted to weight-add the semantic relevance of the question-and-answer knowledge records corresponding to the same category in the selected question-and-answer knowledge records to obtain the degree of association of the question-answer pairs to be analyzed for each category.
A device according to any one of claims 1 to 5, wherein

Optionally, the word extracting unit is adapted to perform word segmentation, remove stop words, word merge, and extract entity words from the question content and the answer content of the question and answer pair to be analyzed.
A device according to any one of claims 1 to 6, wherein

The question and answer knowledge base construction unit is adapted to calculate a probability that the answer word belongs to the category according to the following method:

The question and answer knowledge base construction unit is adapted to calculate a degree of specificity of the interpretation of the question word by each answer word in the category according to the following method:

The question and answer knowledge base construction unit is adapted to calculate the strength of the problem word explained by each answer word in the category according to the following method:

The question and answer knowledge base construction unit is adapted to multiply the above probability, specific degree and intensity according to the following method:

Weight(QWi, AWj|C=Ck)=P(Ck|AWj)*specific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C k ) represents the probability of occurrence of the category C k ; P(AW j ) represents the probability that the answer is AW j ; P(AW j |C k ) represents the probability that the C k category belongs to AW j ;

#(QW i , AW j ) indicates the number of times the question word is QW i and the answer word is AW j ;

#(AW j ) indicates the number of times the answer word is AW j .
A device for optimizing a search ranking of a question and answer pair, the device comprising:

Question and answer knowledge base, suitable for storing multiple Q&A knowledge records;

The search unit is adapted to receive a search request of the user, and obtain a plurality of question and answer pairs to be analyzed that match the search request according to the search request of the user;

a calculating unit, configured to obtain, according to the question and answer knowledge base, the degree of association of each question and answer pair to be analyzed;

The search ranking unit is adapted to optimize the search ranking of the pair of questions to be analyzed according to the degree of association of the question and answer pairs to be analyzed.
The apparatus of claim 8 wherein said computing unit comprises:

a word extraction subunit, which is adapted to perform a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtain at least one question word to be analyzed and at least one answer word to be analyzed;

The correlation degree calculation subunit is adapted to select at least one question and answer knowledge record from the question and answer knowledge base according to the problem word to be analyzed and the answer word to be analyzed, and calculate the correlation degree of the question and answer pair to be analyzed according to the selected question and answer knowledge record.
The device according to claim 8 or 9, wherein

The search ranking unit is adapted to use the order of relevance of the question and answer pairs to be analyzed as the search ranking of the question and answer pair to be analyzed.
The apparatus according to any one of claims 8 to 10, wherein the apparatus further comprises a question and answer knowledge base building unit,

The question and answer knowledge base construction unit is adapted to extract a plurality of question and answer pairs from a webpage having a question and answer pair in advance, and construct a question and answer knowledge base including a plurality of question and answer knowledge records according to the extracted question and answer pairs;

The question and answer knowledge base construction unit is further adapted to: when extracting a plurality of question and answer pairs from the webpage having the question and answer pair, grab the category corresponding to the question and answer pair;

The question and answer knowledge base construction unit is further adapted to construct a question and answer knowledge record according to the question and answer pair and the category corresponding to the question and answer pair when constructing the question and answer knowledge base according to the extracted question and answer pair; each question and answer knowledge record corresponds to a category, Each includes a question word, an answer word, and a semantic relevance between the question word and the answer word.
The apparatus according to any one of claims 8 to 11, wherein

The correlation degree calculation subunit is adapted to select a question and answer knowledge record whose question words are matched with the question word to be analyzed and the included answer words and the answer words to be analyzed match; according to the selected question and answer knowledge record corresponds to The question and answer knowledge record of the same category is obtained, and the degree of association of the question and answer pairs to be analyzed for each category is obtained; the maximum value of the correlation degree of the question and answer pairs to be analyzed for each category is selected, and the maximum value is used as the question and answer to be analyzed. The degree of association.
A device according to any one of claims 8 to 12, wherein

The association degree calculation sub-unit is adapted to weight-add the semantic relevance of the question-and-answer knowledge records corresponding to the same category in the selected question-and-answer knowledge record, to obtain the degree of association of the question-answer pairs to be analyzed for each category.
A device according to any one of claims 8 to 13, wherein

The word extraction subunit is adapted to perform word segmentation, remove stop words, word merge, and extract entity words for the question content and the answer content of the question and answer pair to be analyzed.
A device according to any one of claims 8 to 14, wherein

The question and answer knowledge base construction unit is adapted to perform the following operations on each question and answer pair: performing a word extraction operation on the question content and the answer content of the question and answer pair, obtaining a question word set and an answer word set; and making each of the question word sets Each of the answer words in the question word and the answer word set form an information record on each category corresponding to the question and answer pair;

The question and answer knowledge base construction unit is adapted to perform, for each piece of information record, an operation of calculating a probability that the answer word belongs to the category, calculating a degree of specificity of the answer word on the question word in the category, and calculating The strength of the question word in the category to be explained by the answer word; multiplying the above probability, the degree of specificity and the intensity, the product obtained is the semantic relevance of the answer word and the question word; The answer word and its semantic relevance form a question and answer knowledge record corresponding to the category.
A device according to any one of claims 8 to 15, wherein

The question and answer knowledge base construction unit is adapted to calculate a probability that the answer word belongs to the category according to the following method:

The question and answer knowledge base construction unit is adapted to calculate a degree of specificity of the interpretation of the question word by each answer word in the category according to the following method:

The question and answer knowledge base construction unit is adapted to calculate the strength of the problem word explained by each answer word in the category according to the following method:

The question and answer knowledge base construction unit is adapted to multiply the above probability, specific degree and intensity according to the following method:

Weight(QWi, AWj|C=Ck)=P(Ck|AWj)*specific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C k ) represents the probability of occurrence of the category C k ; P(AW j ) represents the probability that the answer is AW j ; P(AW j |C k ) represents the probability that the C k category belongs to AW j ;

#(QW i , AW j ) indicates the number of times the question word is QW i and the answer word is AW j ;

#(AW j ) indicates the number of times the answer word is AW j .
A device for determining a crawling frequency of a network resource point, the device comprising:

Question and answer knowledge base, suitable for storing multiple Q&A knowledge records;

The resource analysis unit is adapted to capture a plurality of question and answer pairs to be analyzed by the network resource point;

a calculating unit, configured to obtain, according to the question and answer knowledge base, the degree of association of each question and answer pair to be analyzed;

The capture frequency determining unit determines the crawling frequency of the network resource point according to the correlation degree of the question and answer pair to be analyzed.
The apparatus of claim 17, wherein the computing unit comprises:

a word extraction subunit, which is adapted to perform a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtain at least one question word to be analyzed and at least one answer word to be analyzed;

The correlation degree calculation subunit is adapted to select at least one question and answer knowledge record from the question and answer knowledge base according to the problem word to be analyzed and the answer word to be analyzed, and calculate the correlation degree of the question and answer pair to be analyzed according to the selected question and answer knowledge record.
The device according to claim 17 or 18, wherein

The capture frequency determining unit is configured to use, as the crawling frequency of the network resource point, an average value of the correlation degree of the question and answer pair to be analyzed; or use an spider crawler to obtain an initial crawling of the network resource point. Frequency, calculating an average value of the correlation degree of the question and answer pair to be analyzed, and using the average value to adjust the initial grab frequency to determine a crawling frequency of the network resource point.
The apparatus according to any one of claims 17 to 19, wherein the apparatus further comprises a question and answer knowledge base building unit,

The question and answer knowledge base construction unit is adapted to extract a plurality of question and answer pairs from a webpage having a question and answer pair in advance, and construct a question and answer knowledge base including a plurality of question and answer knowledge records according to the extracted question and answer pairs;

The question and answer knowledge base construction unit is further adapted to: when extracting a plurality of question and answer pairs from the webpage having the question and answer pair, grab the category corresponding to the question and answer pair;

The question and answer knowledge base construction unit is further adapted to construct a question and answer knowledge record according to the question and answer pair and the category corresponding to the question and answer pair when constructing the question and answer knowledge base according to the extracted question and answer pair; each question and answer knowledge record corresponds to a category, Each includes a question word, an answer word, and a semantic relevance between the question word and the answer word.
The apparatus according to any one of claims 17 to 20, wherein

The correlation degree calculation subunit is adapted to select a question and answer knowledge record whose question words are matched with the question word to be analyzed and the included answer words and the answer words to be analyzed match; according to the selected question and answer knowledge record corresponds to The question and answer knowledge record of the same category is obtained, and the degree of association of the question and answer pairs to be analyzed for each category is obtained; the maximum value of the correlation degree of the question and answer pairs to be analyzed for each category is selected, and the maximum value is used as the question and answer to be analyzed. The degree of association.
The apparatus according to any one of claims 17 to 21, wherein

The association degree calculation sub-unit is adapted to weight-add the semantic relevance of the question-and-answer knowledge records corresponding to the same category in the selected question-and-answer knowledge record, to obtain the degree of association of the question-answer pairs to be analyzed for each category.
The apparatus according to any one of claims 17 to 22, wherein

The word extraction subunit is adapted to perform word segmentation, remove stop words, word merge, and extract entity words for the question content and the answer content of the question and answer pair to be analyzed.
The apparatus according to any one of claims 17 to 23, wherein

The question and answer knowledge base construction unit is adapted to perform the following operations on each question and answer pair: performing a word extraction operation on the question content and the answer content of the question and answer pair, obtaining a question word set and an answer word set; and making each of the question word sets Each of the answer words in the question word and the answer word set form an information record on each category corresponding to the question and answer pair;

The question and answer knowledge base construction unit is adapted to perform, for each piece of information record, an operation of calculating a probability that the answer word belongs to the category, calculating a degree of specificity of the answer word on the question word in the category, and calculating Use the answer word for the question word on the category The strength of the interpretation of the language; multiplying the above probability, the degree of specificity and the intensity, the product obtained is the semantic relevance of the answer word and the question word; making the question word, the answer word and its semantic relevance form a Corresponds to the Q&A knowledge record for this category.
A device according to any one of claims 17 to 24, wherein

The question and answer knowledge base construction unit is adapted to calculate a probability that the answer word belongs to the category according to the following method:

The question and answer knowledge base construction unit is adapted to calculate a degree of specificity of the interpretation of the question word by each answer word in the category according to the following method:

The question and answer knowledge base construction unit is adapted to calculate the strength of the problem word explained by each answer word in the category according to the following method:

The question and answer knowledge base construction unit is adapted to multiply the above probability, specific degree and intensity according to the following method:

Weight(QWi,AWj|C=Ck)=P(Ck|AWj)*soecific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C k ) represents the probability of occurrence of the category C k ; P(AW j ) represents the probability that the answer is AW j ; P(AW j |C k ) represents the probability that the C k category belongs to AW j ;

#(QW i , AW j ) indicates the number of times the question word is QW i and the answer word is AW j ;

#(AW j ) indicates the number of times the answer word is AW j .
A method of obtaining the degree of association of a question and answer pair, the method comprising the following steps:

Performing a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtaining at least one word to be analyzed and at least one word to be analyzed;

According to the word to be analyzed and the word to be analyzed, at least one question and answer knowledge record is selected from the question and answer knowledge base including the plurality of question and answer knowledge records, and the degree of association of the question and answer pairs to be analyzed is calculated according to the selected question and answer knowledge record.
The method of claim 26, wherein the method further comprises:

Extracting multiple question and answer pairs from the web page containing the question and answer pairs, and constructing a question and answer knowledge base including multiple question and answer knowledge records according to the extracted question and answer pairs;

When extracting a plurality of question and answer pairs from a webpage having a question and answer pair, fetching a category corresponding to the question and answer pair;

When constructing the question and answer knowledge base according to the extracted question and answer pairs, construct a question and answer knowledge record according to the question and answer pair and the category corresponding to the question and answer pair;

Each question and answer knowledge record corresponds to a category, including a question word, an answer word, and a semantic relevance between the question word and the answer word.
The method according to claim 26 or 27, wherein

According to the problem word to be analyzed and the answer word to be analyzed, at least one question and answer knowledge record is selected from the question and answer knowledge base, and the correlation degree of the question and answer pair to be analyzed is calculated according to the selected question and answer knowledge record, which specifically includes:

Selecting a question and answer knowledge record that matches the question words included in the problem word to be analyzed and the included answer words and the answer words to be analyzed;

Correlating the question and answer knowledge records corresponding to the same category in the selected question and answer knowledge record, and obtaining the correlation degree of the question and answer pairs to be analyzed for each category;

The maximum number of associations of the question and answer pairs to be analyzed for each category is selected, and the maximum value is used as the correlation degree of the question and answer pairs to be analyzed.
A method according to the method of any of claims 26 to 28, wherein

According to the question and answer knowledge record corresponding to the same category in the selected question and answer knowledge record, the degree of association of the question and answer pair to be analyzed for each category is obtained, which specifically includes:

The semantic relevance of the question and answer knowledge records corresponding to the same category in the selected question and answer knowledge records is weighted and added, and the degree of association of the question and answer pairs to be analyzed for each category is obtained.
The method according to the method of any one of claims 26 to 29, wherein the constructing the question and answer knowledge base according to the question and answer pair and the category corresponding to the question and answer pair comprises:

For each question and answer pair, the word extraction operation is performed on the question content and the answer content of the question and answer pair, and the problem word set and the answer word set are obtained;

Having each question word in the question word set and each answer word in the answer word set in each of the question and answer pairs Form an information record on the category;

For each record of information, do the following:

Calculating a probability that the answer word belongs to the category, calculating a degree of specificity of the answer word to the question word on the category, and calculating an intensity of the question word using the answer word in the category;

Multiplying the above probability, specificity, and intensity, the resulting product is the semantic relevance of the answer word and the question word;

The question word, the answer word, and its semantic relevance are formed into a question and answer knowledge record corresponding to the category.
A method according to the method of any of claims 26 to 30, wherein

The calculating the probability that the answer word belongs to the category includes:

The calculating the degree of specificity of each answer word on the question word in the category includes:

The calculating the strength of the question word in the category to be explained by each answer word, specifically comprising:

Multiply the above probability, specificity and intensity, including:

Weight(QWi, AWj|C=Ck)=P(Ck|AWj)*specific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C k ) represents the probability of occurrence of the category C k ; P(AW j ) represents the probability that the answer is AW j ; P(AW j |C k ) represents the probability that the C k category belongs to AW j ;

#(QW i , AW j ) indicates the number of times the question word is QW i and the answer word is AW j ;

#(AW j ) indicates the number of times the answer word is AW j .
A method according to any of claims 26 to 31, wherein

Performing a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, specifically including: segmenting the question content and the answer content of the question and answer pair to be analyzed, removing the stop word, word merging, and extracting the entity word Operation.
A method for optimizing a search ranking of a question and answer pair, the method comprising the following steps:

Receiving a search request of the user, and acquiring a plurality of question and answer pairs to be analyzed that match the search request according to the search request of the user;

Obtain the correlation degree of each question and answer pair to be analyzed according to the Q&A knowledge base including multiple Q&A knowledge records;

The search ranking of the pair of questions to be analyzed is optimized according to the degree of association of the question and answer pairs to be analyzed.
The method according to claim 33, wherein said obtaining a degree of association of each question and answer pair to be analyzed according to a question and answer knowledge base comprising a plurality of question and answer knowledge records comprises performing the following operations for each question and answer pair to be analyzed:

Performing a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtaining at least one question word to be analyzed and at least one answer word to be analyzed;

According to the problem words to be analyzed and the words to be analyzed, at least one question and answer knowledge record is selected from the question and answer knowledge base, and the degree of association of the question and answer pairs to be analyzed is calculated according to the selected question and answer knowledge record.
The method according to claim 33 or claim 34, wherein the adjusting the search ranking of the question and answer pair to be analyzed according to the degree of association of the question and answer pair to be analyzed comprises:

The ranking of the degree of association of the question and answer pairs to be analyzed is used as the search ranking of the question and answer pair to be analyzed.
The method of any of claims 33 to 35, wherein the method further comprises:

Extracting multiple question and answer pairs from the web page containing the question and answer pairs, and constructing a question and answer knowledge base including multiple question and answer knowledge records according to the extracted question and answer pairs;

When extracting a plurality of question and answer pairs from a webpage having a question and answer pair, fetching a category corresponding to the question and answer pair;

When constructing the question and answer knowledge base according to the extracted question and answer pairs, construct a question and answer knowledge record according to the question and answer pair and the category corresponding to the question and answer pair;

Each question and answer knowledge record corresponds to a category, including a question word, an answer word, and a semantic relevance between the question word and the answer word.
A method according to any one of claims 33 to 36, wherein

According to the problem word to be analyzed and the answer word to be analyzed, at least one question and answer knowledge record is selected from the question and answer knowledge base, and the correlation degree of the question and answer pair to be analyzed is calculated according to the selected question and answer knowledge record, which specifically includes:

Selecting a question and answer knowledge record that matches the question words included in the problem word to be analyzed and the included answer words and the answer words to be analyzed;

Correlating the question and answer knowledge records corresponding to the same category in the selected question and answer knowledge record, and obtaining the correlation degree of the question and answer pairs to be analyzed for each category;

The maximum number of associations of the question and answer pairs to be analyzed for each category is selected, and the maximum value is used as the correlation degree of the question and answer pairs to be analyzed.
A method according to any one of claims 33 to 37, wherein

According to the question and answer knowledge record corresponding to the same category in the selected question and answer knowledge record, the degree of association of the question and answer pair to be analyzed for each category is obtained, which specifically includes:

The semantic relevance of the question and answer knowledge records corresponding to the same category in the selected question and answer knowledge records is weighted and added, and the degree of association of the question and answer pairs to be analyzed for each category is obtained.
A method according to any one of claims 33 to 38, wherein

And performing the word extraction operation on the problem content and the answer content of the question and answer pair to be analyzed, specifically including:

The problem content and answer content of the question and answer pair to be analyzed are performed by word segmentation, removal of stop words, word merging, and extraction of entity words.
A method according to any one of claims 33 to 39, wherein

The constructing the question and answer knowledge base according to the question and answer pair and the category corresponding to the question and answer pair, specifically includes:

For each question and answer pair, the word extraction operation is performed on the question content and the answer content of the question and answer pair, and the problem word set and the answer word set are obtained;

Having each of the question words in the set of question words and each answer word in the set of answer words form an information record on each category corresponding to the question and answer pair;

For each record of information, do the following:

Calculating a probability that the answer word belongs to the category, calculating a degree of specificity of the answer word to the question word on the category, and calculating an intensity of the question word using the answer word in the category;

Multiplying the above probability, specificity, and intensity, the resulting product is the semantic relevance of the answer word and the question word;

The question word, the answer word, and its semantic relevance are formed into a question and answer knowledge record corresponding to the category.
A method according to any one of claims 33 to 40, wherein

The calculating the probability that the answer word belongs to the category includes:

The calculating the degree of specificity of each answer word on the question word in the category includes:

The calculating the strength of the question word in the category to be explained by each answer word, specifically comprising:

Multiply the above probability, specificity and intensity, including:

Weight(QWi,AWj|C=Ck)=P(Ck|AWj)*specfic(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C k ) represents the probability of occurrence of the category C k ; P(AW j ) represents the probability that the answer is AW j ; P(AW j |C k ) represents the probability that the C k category belongs to AW j ;

#(QW i , AW j ) indicates the number of times the question word is QW i and the answer word is AW j ;

#(AW j ) indicates the number of times the answer word is AW j .
A method for determining a crawl frequency of a network resource point, the method comprising the following steps:

A plurality of question and answer pairs to be analyzed are captured by the network resource point;

Obtain the correlation degree of each question and answer pair to be analyzed according to the Q&A knowledge base including multiple Q&A knowledge records;

Determining the frequency of the network resource points according to the degree of association of the question and answer pairs to be analyzed.
The method according to claim 42, wherein said obtaining a degree of association of each question and answer pair to be analyzed according to a question and answer knowledge base including a plurality of question and answer knowledge records, comprising performing the following operations for each question and answer pair to be analyzed:

Performing a word extraction operation on the question content and the answer content of the question and answer pair to be analyzed, and obtaining at least one question word to be analyzed and at least one answer word to be analyzed;

According to the problem words to be analyzed and the words to be analyzed, at least one question and answer knowledge record is selected from the question and answer knowledge base, and the degree of association of the question and answer pairs to be analyzed is calculated according to the selected question and answer knowledge record.
The method according to claim 42 or 43, wherein said determining said network based on said degree of association of said question and answer pairs to be analyzed The crawling frequency of the network resource points, including:

Taking the average value of the correlation degree of the question and answer pair to be analyzed as the crawling frequency of the network resource point;

or,

Obtaining an initial crawling frequency of the network resource point by using a spider crawler, calculating an average value of the correlation degree of the question and answer pair to be analyzed, and using the average value to adjust the initial crawling frequency to determine the network resource point Take the frequency.
The method of any of claims 42 to 44, wherein the method further comprises:

Extracting multiple question and answer pairs from the web page containing the question and answer pairs, and constructing a question and answer knowledge base including multiple question and answer knowledge records according to the extracted question and answer pairs;

When extracting a plurality of question and answer pairs from a webpage having a question and answer pair, fetching a category corresponding to the question and answer pair;

When constructing the question and answer knowledge base according to the extracted question and answer pairs, construct a question and answer knowledge record according to the question and answer pair and the category corresponding to the question and answer pair;

Each question and answer knowledge record corresponds to a category, including a question word, an answer word, and a semantic relevance between the question word and the answer word.
A method according to any one of claims 42 to 45, wherein

According to the problem word to be analyzed and the answer word to be analyzed, at least one question and answer knowledge record is selected from the question and answer knowledge base, and the correlation degree of the question and answer pair to be analyzed is calculated according to the selected question and answer knowledge record, which specifically includes:

Selecting a question and answer knowledge record that matches the question words included in the problem word to be analyzed and the included answer words and the answer words to be analyzed;

Correlating the question and answer knowledge records corresponding to the same category in the selected question and answer knowledge record, and obtaining the correlation degree of the question and answer pairs to be analyzed for each category;

The maximum number of associations of the question and answer pairs to be analyzed for each category is selected, and the maximum value is used as the correlation degree of the question and answer pairs to be analyzed.
A method according to any one of claims 42 to 46, wherein

According to the question and answer knowledge record corresponding to the same category in the selected question and answer knowledge record, the degree of association of the question and answer pair to be analyzed for each category is obtained, which specifically includes:

The semantic relevance of the question and answer knowledge records corresponding to the same category in the selected question and answer knowledge records is weighted and added, and the degree of association of the question and answer pairs to be analyzed for each category is obtained.
A method according to any one of claims 42 to 47, wherein

And performing the word extraction operation on the problem content and the answer content of the question and answer pair to be analyzed, specifically including:

The problem content and answer content of the question and answer pair to be analyzed are performed by word segmentation, removal of stop words, word merging, and extraction of entity words.
A method according to any one of claims 42 to 48, wherein

The constructing the question and answer knowledge base according to the question and answer pair and the category corresponding to the question and answer pair, specifically includes:

For each question and answer pair, the word extraction operation is performed on the question content and the answer content of the question and answer pair, and the problem word set and the answer word set are obtained;

Having each of the question words in the set of question words and each answer word in the set of answer words form an information record on each category corresponding to the question and answer pair;

For each record of information, do the following:

Calculating a probability that the answer word belongs to the category, calculating a degree of specificity of the answer word to the question word on the category, and calculating an intensity of the question word using the answer word in the category;

Multiplying the above probability, specificity, and intensity, the resulting product is the semantic relevance of the answer word and the question word;

The question word, the answer word, and its semantic relevance are formed into a question and answer knowledge record corresponding to the category.
A method according to any one of claims 42 to 49, wherein

The calculating the probability that the answer word belongs to the category includes:

The calculating the degree of specificity of each answer word on the question word in the category includes:

The calculating the strength of the question word in the category to be explained by each answer word, specifically comprising:

Multiply the above probability, specificity and intensity, including:

Weight(QWi, AWj|C=Ck)=P(Ck|AWj)*specific(QWi,AWj|C=Ck)*interpret(QWi,AWj|C=Ck);

Where P(C k ) represents the probability of occurrence of the category C k ; P(AW j ) represents the probability that the answer is AW j ; P(AW j |C k ) represents the probability that the C k category belongs to AW j ;

#(QW i , AW j ) indicates the number of times the question word is QW i and the answer word is AW j ;

#(AW j ) indicates the number of times the answer word is AW j .
A computer program comprising computer readable code that, when executed on a computing device, causes the computing device to perform the method of any one of claims 26-50.
A computer readable medium storing the computer program of claim 51.