WO2021174814A1 - Answer verification method and apparatus for crowdsourcing task, computer device, and storage medium - Google Patents

Answer verification method and apparatus for crowdsourcing task, computer device, and storage medium Download PDF

Info

Publication number
WO2021174814A1
WO2021174814A1 PCT/CN2020/117671 CN2020117671W WO2021174814A1 WO 2021174814 A1 WO2021174814 A1 WO 2021174814A1 CN 2020117671 W CN2020117671 W CN 2020117671W WO 2021174814 A1 WO2021174814 A1 WO 2021174814A1
Authority
WO
WIPO (PCT)
Prior art keywords
answer
value
preset
target
credibility
Prior art date
Application number
PCT/CN2020/117671
Other languages
French (fr)
Chinese (zh)
Inventor
王健宗
李佳琳
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021174814A1 publication Critical patent/WO2021174814A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Definitions

  • This application relates to the field of data processing technology, and in particular to methods, devices, computer equipment, and storage media for verifying answers to crowdsourced tasks.
  • crowdsourcing tasks means that internally executed tasks are distributed to external execution objects for completion in order to shorten the task completion time.
  • the purpose of the embodiments of the present application is to propose an answer verification method for crowdsourced tasks, and solve the problem of low answer verification efficiency in the manner of manually verifying crowdsourced tasks in the prior art.
  • an embodiment of the present application provides an answer verification method for crowdsourcing tasks, including:
  • each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
  • semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M ⁇ N, and N is a positive integer;
  • the comparison result is that the maximum credibility value is greater than or equal to the preset standard value
  • the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
  • a technical solution adopted in this application is to provide an answer verification device for crowdsourcing tasks, including:
  • the initial answer obtaining module is used to obtain each answer answer corresponding to the target task from all the answer answers obtained from the client as an initial answer, wherein each answer answer corresponds to a answer object;
  • the semantic recognition result module is used to perform semantic recognition on each of the initial answers through natural language semantic recognition to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is positive Integer
  • the reference answer classification module is used to combine the semantic recognition results in pairs, and use each combination as a set of results, and use the similarity calculation method to count the similarity values between the semantic recognition results in each set of results , If the obtained similarity value is greater than the preset similarity threshold, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M ⁇ N, and N is a positive integer ;
  • the credibility value statistics module is used to determine the credibility value of each type of reference answer through a preset consistency check method
  • the credibility value comparison module is used to select the credibility value with the largest value from all the credibility values of the reference answers as the maximum credibility value, and compare the maximum credibility value with the predicted value. Set standard value comparison and get the comparison result;
  • the simulation answer obtaining module is configured to obtain the target task if the comparison result is that the maximum credibility value is less than the preset standard value, and input the target task into the preset model, and pass the The preset model gets the simulated answer;
  • the answer answer verification module is used to count the similarity values between the reference answer and the simulated answer for each type of the reference answer to obtain M similarity values, and select a numerical value from the M similarity values For the maximum similarity value, the reference answer corresponding to the similarity value with the largest numerical value is used as the target answer, and the response answer corresponding to the target answer is confirmed as the verified response answer.
  • a computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
  • each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
  • semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M ⁇ N, and N is a positive integer;
  • the comparison result is that the maximum credibility value is greater than or equal to the preset standard value
  • the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
  • each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
  • semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M ⁇ N, and N is a positive integer;
  • the comparison result is that the maximum credibility value is greater than or equal to the preset standard value
  • the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
  • a method for verifying answers to crowdsourcing tasks in the above scheme From all the answer answers obtained from the client, the answer answer of each answer object corresponding to the target task is obtained as the initial answer; and semantics of each initial answer Then, the semantic recognition results are combined in pairs, and each combination is used as a set of results.
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results.
  • the obtained similarity value is greater than Preset the similarity threshold, then use the two semantic recognition results in the group as the same type of reference answer to obtain the M type of reference answer; and determine the credibility value of each type of reference answer through the preset consistency check method ; Then select the highest credibility value from all the credibility values of the reference answers as the maximum credibility value, and compare the maximum credibility value with the preset standard value to obtain the comparison result; if the comparison result If the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the response answer corresponding to the target answer is confirmed as the verified answer.
  • the maximum credibility value and performing similarity value statistics the largest numerical similarity value is obtained, and finally the answer answers that have passed the verification are confirmed, which can effectively improve the answer verification efficiency of the crowdsourcing task.
  • FIG. 1 is a schematic diagram of an application environment of a method for verifying answers to crowdsourced tasks provided by an embodiment of the present application
  • FIG. 2 is an implementation flow chart of the method for verifying the answers of crowdsourcing tasks according to an embodiment of the present application
  • FIG. 3 is an implementation flow chart of step 2 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application
  • FIG. 4 is an implementation flowchart of step S21 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application
  • FIG. 5 is an implementation flow chart after step S5 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application
  • FIG. 6 is another implementation flowchart after step S5 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application
  • FIG. 7 is an implementation flowchart of step S57 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application
  • FIG. 8 is a schematic diagram of an answer verification device for crowdsourcing tasks provided by an embodiment of the present application.
  • Fig. 9 is a schematic diagram of a computer device provided by an embodiment of the present application.
  • the system architecture 100 may include terminal devices 101, 102, and 103, a network 104 and a server 105.
  • the network 104 is used to provide a medium for communication links between the terminal devices 101, 102, 103 and the server 105.
  • the network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on.
  • the user can use the terminal devices 101, 102, and 103 to interact with the server 105 through the network 104 to receive or send messages and so on.
  • Various communication client applications such as web browser applications, search applications, instant messaging tools, etc., may be installed on the terminal devices 101, 102, and 103.
  • the terminal devices 101, 102, 103 may be various electronic devices with display screens and supporting web browsing, including but not limited to smart phones, tablet computers, laptop computers, desktop computers, and so on.
  • the server 105 may be a server that provides various services, for example, a background server that provides support for pages displayed on the terminal devices 101, 102, and 103.
  • the method for verifying answers for crowdsourcing tasks provided by the embodiments of the present application is generally executed by a server. Accordingly, a device for verifying answers for crowdsourcing tasks is generally set in the server.
  • terminal devices, networks, and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks, and servers according to implementation needs.
  • FIG. 2 shows a specific implementation of the method for verifying the answers of crowdsourcing tasks.
  • the method of the present application is not limited to the sequence of the process shown in FIG. 2, and the method includes the following steps:
  • each answer answer corresponding to the target task is obtained as an initial answer, where each answer answer corresponds to a answer object.
  • the server pushes the target task to multiple objects. After the object answers the target task, a response answer is obtained, and the response answer is fed back to the server through the client.
  • the network transmission protocol receives the answer sent by each client as the initial answer.
  • each respondent On the server side, the information of each respondent is pre-stored. After the respondent responds to the task and gets a response, the mapping relationship between the response and the respondent is established. After obtaining the initial answer, obtain each initial corresponding response object, that is, obtain a list of all objects participating in the target task and feedback the response answer to the server.
  • the response object in this embodiment may specifically be a preset network model (machine learning model or neural network model), or may be a preset computing engine or a big data platform and other individuals with autonomous learning and discrimination capabilities.
  • a preset network model machine learning model or neural network model
  • the response object in this embodiment may specifically be a preset computing engine or a big data platform and other individuals with autonomous learning and discrimination capabilities.
  • the server will send and push multiple tasks at the same time, and may also receive answers from different tasks at the same time period.
  • the server will pre-select each task Set a task ID for each. The initial answer corresponding to the target task is selected from all the answer answers through the target task identifier.
  • all the response answers obtained from the client include three task identifiers, namely task A, task B, and task C.
  • the task identifier of the current crowdsourcing task is task A. Therefore, select The task identifier is the answer to task A, as the initial answer.
  • the task identification can be specifically represented by a combination of characters, numbers, or characters and numbers, for example, a task identification is "TPSB5201906280236".
  • the target task in this embodiment is a type of statement of objective facts, for example, recognizing and transcribing text on a given image.
  • an electronic device (such as the server shown in FIG. 1) on which a method for verifying answers to crowdsourcing tasks runs may be connected via a wired connection or a wireless connection.
  • wireless connection methods can include, but are not limited to, 3G/4G connection, WiFi connection, Bluetooth connection, WiMAX connection, Zigbee connection, UWB (ultra wideband) connection, and other wireless connection methods currently known or developed in the future .
  • S2 Used to perform semantic recognition on each initial answer through natural language semantic recognition, and obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer.
  • semantic recognition is performed on each initial answer, and the semantic recognition result of each initial answer is obtained.
  • NLP Natural Language Processing
  • Understanding natural language requires extensive knowledge about the external world and the ability to use and operate this knowledge.
  • Natural language cognition is also regarded as an AI-complete problem.
  • NLP tasks mainly refer to some tasks that involve the semantic understanding or parsing of natural language. Common NLP tasks include but are not limited to: Speech recognition, Chinese word segmentation, Part-of- speech tagging, text categorization, parsing, automatic summarization, question answering, information extraction, etc.
  • S3 Combine the semantic recognition results in pairs, and use each combination as a set of results.
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the obtained similarity value is greater than the expected value If the similarity threshold is set, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M ⁇ N, and N is a positive integer.
  • the similarity calculation is used to obtain the similarity value between each set of semantic recognition results, and the similarity threshold is preset, if If it is greater than the preset similarity threshold, any two semantic recognition results are classified as the preset similarity threshold.
  • the preset similarity threshold is a server-side preset setting, which is used to classify different semantic recognition results.
  • the more reasonable preset similarity thresholds are 0.9, 0.8, 0.7, 0.6, etc., and the specific settings can be based on actual conditions.
  • the situation is set, which is not limited here.
  • the preset similarity threshold value in this embodiment is 0.8.
  • the similarity calculation methods include, but are not limited to: Minkowski Distance, Manhattan Distance, Euclidean Distance, Chebyshev Distance, and Hamming Distance. Distance) and Mahalanobis Distance (Mahalanobis Distance) and so on.
  • the Euclidean distance is used to calculate the similarity value between the semantic recognition results in each group of results; the Euclidean distance calculation is used to quickly and efficiently calculate the similarity value between the semantic recognition results in each group of results.
  • the credibility value refers to the reliability of this type of reference answer.
  • the following formula is used to calculate the credibility value of each type of reference answer:
  • P ⁇ P 1 ,P 2 ,...,P m ⁇ is the reference answer set
  • P m is the number of answer answers contained in the m-th reference answer
  • CR m is the credibility value corresponding to the m-th reference answer
  • M is the number of reference answer categories
  • n is the number of initial answers.
  • S5 Select the credibility value with the largest value from the credibility values of all reference answers as the maximum credibility value, and compare the maximum credibility value with the preset standard value to obtain the comparison result.
  • the credibility value with the largest value is selected from the credibility value of each type of reference answer to compare with the preset standard value, and based on the comparison result, it is determined whether there is a reference answer that meets the requirements among all the classified reference answers.
  • the preset standard value is the server-side preset setting, which is used to evaluate whether the credibility value meets the requirements.
  • the reasonable range is (0.5,1).
  • the specific setting can be set according to the actual situation.
  • the preset standard value in this embodiment is 0.6.
  • the comparison result includes that the maximum credibility value is less than the preset standard value, and the maximum credibility value is greater than or equal to the preset standard value.
  • the maximum credibility value when the maximum credibility value is greater than or equal to the preset standard value, it indicates that the reference answer corresponding to the maximum credibility value is a trustworthy answer answer, which is taken as the target answer, and the answer answer corresponding to the target answer is confirmed To verify the answer answer passed.
  • the answer verification result is sent to the terminal devices 101, 102, 103, so that the respondent can learn the answer verification result.
  • the response answer of each response object corresponding to the crowdsourcing task is obtained as the initial answer; semantic recognition is performed on each initial answer, and then the semantic recognition results are divided into two Two combinations, and each combination is used as a set of results.
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results.
  • the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained; and the credibility value of each type of reference answer is determined through the preset consistency verification method; then all the reference answers of each type
  • the maximum credibility value of the answer is selected to obtain the comparison result; if the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer. And confirm the answer answer corresponding to the target answer as the answer answer that passed the verification.
  • step S2 semantic recognition is performed on each initial answer by means of natural language semantic recognition, and the semantic recognition results of N initial answers are obtained.
  • N is the number of response objects, and N is a positive integer.
  • S21 Perform word segmentation processing on the initial answer through the preset word segmentation method to obtain the basic word segmentation contained in the initial answer.
  • S22 Convert the basic word segmentation into word vectors, and cluster the word vectors through a clustering algorithm to obtain a cluster center corresponding to each word vector.
  • the semantic recognition result of the initial answer is obtained, so that the semantic recognition of the initial answer is more accurate, and the response to the crowdsourcing task is further improved.
  • the verification accuracy of the answer is performed by performing word segmentation processing on the initial answer and clustering the word vector through a clustering algorithm.
  • step S21 the initial answer is segmented through a preset word segmentation method to obtain the specific implementation process of the basic word segmentation contained in the initial answer.
  • the details are as follows:
  • S213 From the occurrence probabilities of K word segmentation sequences, select the word segmentation sequence corresponding to the occurrence probability that reaches the preset probability threshold as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as the basic word segmentation included in the initial answer .
  • the word segmentation analysis is performed through the basic sentence, and the occurrence probability of each word segmentation sequence is calculated, so that the basic word segmentation contained in the initial answer can be accurately obtained, so that the initial answer can obtain an accurate semantic recognition result.
  • Figure 5 shows a specific implementation after step S5. The specific implementation process is described in detail as follows:
  • a corresponding task model is preset, and the task model does not have the ability to learn independently, but a simulated answer with an accuracy that meets the requirements can be obtained.
  • the maximum credibility value is less than the preset standard value, the target task is obtained, and the target task is input into the preset model, and the simulated answer is obtained through the preset model.
  • a crowdsourcing task is text recognition and transliteration in an image
  • the preset model is an optical character recognition matching model
  • the credibility value is between 0.65 and 0.75
  • the preset credibility value It is 0.55.
  • step S50 when the calculated credibility value is lower than 0.55, the preset model is used to obtain a simulated answer.
  • the simulated answer is obtained through the preset model, and the similarity value between each reference answer and the simulated answer is obtained by calculating the similarity between each reference answer and the simulated answer, and the similarity value is ranked from large to small Arrange in order, select the reference answer with the largest similarity value as the target answer, and confirm the answer corresponding to the target answer as the verified answer.
  • step S3 the similarity calculation method has been shown in step S3, which is not redundant here.
  • the Euclidean distance between each reference answer and the simulated answer such as 0.9, 0.8, 0.5, 0.2, and 0.1, is obtained. Due to the Euclidean distance, the Euclidean distance is larger. If it is small, the similarity value between the reference answer and the simulated answer. Therefore, in the above Euclidean distance, the reference answer corresponding to the Euclidean distance of 0.1 should be selected as the target answer, and the answer corresponding to the target answer should be confirmed as the verified answer Answer the answer.
  • the simulated answer is obtained through the preset model, and the verified answer is determined, which can further improve the accuracy of the answer verification of the crowdsourcing task.
  • Figure 6 shows a specific implementation after step S6.
  • the specific implementation process is described in detail as follows:
  • the response object includes a first object and a second object.
  • the answer of the first object is the first answer
  • the answer of the second object is the second answer
  • both the first object and the second object have preset weights.
  • the preset weight of the first object is less than the preset weight of the second object.
  • the response object includes a first object and a second object, where the answer of the first object is the first answer, the answer of the second object is the second answer, and the first object and the first object Both objects have preset weights corresponding to them, and the preset weight of the first object is smaller than the preset weight of the second object.
  • the preset weight is based on the answer to the crowdsourcing task of the respondent in the past, and the answer weight set by the server can be used to evaluate the credibility of the answer of the respondent.
  • the first object and the second object can be assigned according to the preset weight.
  • the response answers corresponding to the two subjects are weighted, so that the distribution of the reliability value of the response answer is more reasonable, which is beneficial to improve the accuracy of the reliability value.
  • the answer answer cannot be verified, and the first answer answer and the second answer answer of the first answer object and the second answer object need to be obtained, and the first answer is assigned
  • the answer and the second answer have different weights, and the credibility of the answer is determined according to the weight assigned.
  • the preset weight is the server preset setting, and the reasonable range is (0.1, 1).
  • the specific setting can be set according to the actual situation, and it is not limited here.
  • the first preset in this embodiment Set the weight to 0.6, and the second preset weight to 0.8.
  • S55 Determine the credibility weight of each type of reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight, and the second preset weight.
  • crowdsourcing tasks have different answer question types, and correspondingly, there are also reference answer categories to which the answer should belong. According to the reference answer category to which the first answer answer and the second answer answer belong, identify the first preset weight and the second preset weight of the first answer answer and the second answer answer, and finally determine the credibility weight of each type of reference answer .
  • the credibility weight is the credibility ratio given to each type of reference answer by the server.
  • S56 Determine the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and the preset weight verification method.
  • the weighted reliability value is the reliability value corresponding to the answer answer.
  • the following calculation method is used to calculate the weighted credibility value of each type of reference answer:
  • P m is the number of answer answers contained in the m-th reference answer
  • CR is the weighted credibility value corresponding to the m-th reference answer
  • m is the number of reference answer categories
  • n is the number of initial answers.
  • the reference answer corresponding to the weighted credibility value is the best response answer for the crowdsourcing task, and by using it as the response answer of the crowdsourcing task, and confirming that it has passed the verification The answer to the answer.
  • step S57 shows a specific implementation of step S57.
  • the weighted credibility value with the largest value is selected as the target weighted credibility value, and the corresponding weighted credibility value of the target is obtained.
  • the reference answer is used as the target answer, and the answer corresponding to the target answer is confirmed as the verified answer.
  • the server divides the objects of the crowdsourcing task into a first object and a second object; the first object and the second object refer to different types of objects, rather than specifically one or two of them; through The historical response answers of the first object and the second object to the crowdsourcing task can be obtained, and the response level of the first object and the second object to the crowdsourcing task can be learned.
  • the accuracy rate corresponding to the responses of the first object and the second object is obtained, and the pair of the first object and the second object is further learned The ability to respond to crowdsourcing tasks.
  • the response accuracy rates of the first object and the second object are obtained. If a part of the first object has a higher accuracy rate and the accuracy rate exceeds the preset classification threshold, then the first object of this part is adjusted Is the second object; if a part of the second object has a low accuracy rate, and the accuracy rate is lower than the preset classification threshold, then adjust this part of the second object as the first object.
  • the preset classification threshold is the server-side preset setting, and the reasonable range is (0.6, 0.9).
  • the specific setting can be set according to the actual situation, and it is not limited here. Preferably, it is preset in this embodiment. Set the classification threshold to 0.6.
  • the preset weight of the first object and the second object by judging the response accuracy of the first object and the second object, and updating the preset weight of the first object and the preset weight of the second object, the preset weight of the first object and the second object The preset weights of is updated, so that the respondent pays more attention to the answers to the crowdsourcing task, which further improves the accuracy and efficiency of the crowdsourcing task.
  • the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through computer-readable instructions, which can be stored in a computer-readable storage medium.
  • the computer-readable instructions When executed, they may include the processes of the above-mentioned method embodiments.
  • the aforementioned storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), etc.
  • this application provides an embodiment of an answer verification device for crowdsourced tasks.
  • the device embodiment corresponds to the method embodiment shown in FIG. 2.
  • the device can be specifically applied to various electronic devices.
  • the answer verification device for crowdsourcing tasks in this embodiment includes: an initial answer obtaining module 81, a semantic recognition result module 82, a reference answer classification module 83, a credibility value statistics module 84, and credibility The value comparison module 85 and the answer verification module 86. in:
  • the initial answer obtaining module 81 is used to obtain each answer answer corresponding to the target task from all the answer answers obtained from the client as an initial answer, where each answer answer corresponds to a answer object;
  • the semantic recognition result module 82 is used to perform semantic recognition on each initial answer by means of natural language semantic recognition to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
  • the reference answer classification module 83 is used to combine the semantic recognition results in pairs and use each combination as a set of results.
  • the similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value of is greater than the preset similarity threshold, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M ⁇ N, and N is a positive integer;
  • the credibility value statistics module 84 is used to determine the credibility value of each type of reference answer through a preset consistency check method
  • the credibility value comparison module 85 is used to select the credibility value with the largest value from the credibility values of all the reference answers as the maximum credibility value, and compare the maximum credibility value with the preset standard value Compare, get the comparison result;
  • the answer answer verification module 86 is configured to, if the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is used as the target answer, and the answer answer corresponding to the target answer is confirmed as Verify the answer that passed the verification.
  • the semantic recognition result module 82 includes:
  • the basic word segmentation acquisition unit is used to perform word segmentation processing on the initial answer through a preset word segmentation method to obtain the basic word segmentation contained in the initial answer.
  • the word vector acquisition unit is used to convert the basic word segmentation into word vectors, and cluster the word vectors through a clustering algorithm to obtain the cluster center corresponding to each word vector.
  • the semantic recognition result unit is used to obtain the preset semantics corresponding to the cluster center corresponding to each word vector as the semantic recognition result of the initial answer.
  • the basic word segmentation acquisition unit includes:
  • the word segmentation sequence acquisition subunit is used to perform word segmentation analysis on the basic sentence to obtain K word segmentation sequences.
  • the word segmentation sequence occurrence probability subunit is used to calculate the occurrence probability of each word segmentation sequence based on the word sequence data of the preset training corpus for each word segmentation sequence to obtain the occurrence probability of K word segmentation sequences.
  • the target word segmentation sequence determination subunit is used to select the word segmentation sequence corresponding to the occurrence probability of the preset probability threshold from the occurrence probabilities of the K word segmentation sequences as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as The basic participle included in the initial answer.
  • the answer verification device for crowdsourcing tasks also includes:
  • the simulation answer obtaining module is used to obtain the target task if the comparison result is that the maximum credibility value is less than the preset standard value, and input the target task into the preset model to obtain the simulated answer through the preset model;
  • the target answer selection module is used to calculate the similarity value between the reference answer and the simulated answer for each type of reference answer to obtain M similarity values, and from the M similarity values, select the highest similarity value, and calculate the value
  • the reference answer corresponding to the largest similarity value is used as the target answer, and the answer answer corresponding to the target answer is confirmed as the answer answer that has passed the verification.
  • the answer verification device for crowdsourcing tasks also includes:
  • the preset weight setting module is used for answering objects including a first object and a second object, the answer of the first object is the first answer, the answer of the second object is the second answer, the first object and the second object Each corresponds to a preset weight, and the preset weight of the first object is less than the preset weight of the second object;
  • the preset weight obtaining module is configured to obtain the first preset weight corresponding to the first response answer and the second preset weight corresponding to the second response answer if the comparison result is that the maximum credibility value is less than the preset standard value;
  • the credibility weight determination module is used to determine the credibility weight of each type of reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight, and the second preset weight;
  • the weighted credibility value determination module is used to confirm the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and the preset weight check method;
  • the weighted credibility value selection module is used to select the weighted credibility value with the largest value as the target weighted credibility value, obtain the reference answer corresponding to the target weighted credibility value, and use it as the target answer. The answer is confirmed as the answer that passed the verification.
  • weighted credibility value selection module includes:
  • the historical response answer obtaining unit is used to obtain the historical response answers of the first object and the second object;
  • the response accuracy rate verification unit is used to determine the proportion of the response answers that passed the verification in the historical response answers of the first subject, obtain the response accuracy rate of the first subject, and determine the response answers that passed the verification in the historical response answers of the second subject Ratio, get the response accuracy rate of the second object;
  • the following new object unit is used to update the preset weight of the first object and the preset weight of the second object according to the response accuracy rate and the preset classification threshold to obtain the updated preset weight and updated weight of the first object The preset weight of the second object.
  • FIG. 9 is a block diagram of the basic structure of the computer device in this embodiment.
  • the computer device 9 includes a memory 91, a processor 92, and a network interface 93 that communicate with each other through a system bus. It should be pointed out that the figure only shows a computer device 9 with three types of components: memory 91, processor 92, and network interface 93, but it should be understood that it is not required to implement all the components shown, and alternative implementations are possible. More or fewer components. Among them, those skilled in the art can understand that the computer device here is a device that can automatically perform numerical calculation and/or information processing in accordance with pre-set or stored instructions.
  • Its hardware includes, but is not limited to, a microprocessor, a dedicated Integrated Circuit (Application Specific Integrated Circuit, ASIC), Programmable Gate Array (Field-Programmable Gate Array, FPGA), Digital Processor (Digital Signal Processor, DSP), embedded equipment, etc.
  • ASIC Application Specific Integrated Circuit
  • ASIC Application Specific Integrated Circuit
  • FPGA Field-Programmable Gate Array
  • DSP Digital Processor
  • Computer equipment can be computing equipment such as desktop computers, notebooks, palmtop computers, and cloud servers.
  • the computer equipment can interact with the user through a keyboard, a mouse, a remote control, a touch panel, or a voice control device.
  • the memory 91 includes at least one type of readable storage medium.
  • the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory ( SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disks, optical disks, etc.
  • the memory 91 may be an internal storage unit of the computer device 9, for example, the hard disk or memory of the computer device 9.
  • the memory 91 may also be an external storage device of the computer device 9, such as a plug-in hard disk, a smart media card (SMC), and a secure digital (Secure Digital, SMC) equipped on the computer device 9. SD) card, flash card (Flash Card), etc.
  • the memory 91 may also include both the internal storage unit of the computer device 9 and its external storage device.
  • the memory 91 is generally used to store an operating system and various application software installed in the computer device 9, such as a computer-readable instruction of a crowdsourced task sampling method.
  • the memory 91 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 92 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips.
  • the processor 92 is generally used to control the overall operation of the computer device 9.
  • the processor 92 is used to run the process code or process data stored in the memory 91, for example, run a computer-readable instruction of a crowdsourced task sampling method.
  • the network interface 93 may include a wireless network interface or a wired network interface, and the network interface 93 is generally used to establish a communication connection between the computer device 9 and other electronic devices.
  • the present application also provides another implementation manner, that is, a computer-readable storage medium is provided with a random inspection process stored in the computer-readable storage medium, and the random inspection process can be executed by at least one processor, so that the at least one processor executes as described above The steps of a sampling method for crowdsourcing tasks.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes a number of instructions to enable a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to execute the methods of the various embodiments of the present application.
  • a terminal device which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

An answer verification method and apparatus for a crowdsourcing task, a computer device, and a storage medium. The method comprises: acquiring respective response answers corresponding to a target task to serve as initial answers; performing semantic recognition on the respective initial answers; determining a plurality of reference answers according to semantic recognition results, and determining a credibility value for each type of reference answer; if the credibility value, upon comparison, is less than a preset standard value, acquiring, via a preset model, a sample answer corresponding to the target task; and calculating a similarity value between each reference answer and the sample answer, selecting a similarity value having the greatest numerical value, determining, as a target answer, a reference answer corresponding to the similarity value having the greatest numerical value, and determining that a response answer corresponding to the target answer is a successfully verified response answer. The method can improve the accuracy and efficiency of answer verification for crowdsourcing tasks.

Description

众包任务的答案验证方法、装置、计算机设备及存储介质Answer verification method, device, computer equipment and storage medium of crowdsourcing task
本申请要求于2020年03月02日提交中国专利局、申请号为202010135251.6,发明名称为“众包任务的答案验证方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed with the Chinese Patent Office on March 2, 2020, the application number is 202010135251.6, and the invention title is "Crowdsourcing task answer verification method, device, computer equipment and storage medium", all of which The content is incorporated in this application by reference.
技术领域Technical field
本申请涉及数据处理技术领域,尤其涉及众包任务的答案验证方法、装置、计算机设备及存储介质。This application relates to the field of data processing technology, and in particular to methods, devices, computer equipment, and storage media for verifying answers to crowdsourced tasks.
背景技术Background technique
随着网络技术的飞速发展,一些公司或者机构为了获取更多创意信息,或者高效便捷解决一些跨领域问题,往往会通过互联网向互联网对象发放众包任务,通过众包任务的方式,来解决这些问题,众包任务是指将内部执行的工作任务,分发给外部执行对象去完成以便缩短任务完成时间。With the rapid development of network technology, in order to obtain more creative information, or to solve some cross-domain problems efficiently and conveniently, some companies or institutions often issue crowdsourcing tasks to Internet objects through the Internet, and solve these problems through crowdsourcing tasks. The problem is that crowdsourcing tasks means that internally executed tasks are distributed to external execution objects for completion in order to shorten the task completion time.
发明人发现,由于不同的执行对象,其学习方式和实现逻辑的不同,因而,不同执行对象对同一任务的执行结果不一样,同时,由于执行对象具有学习能力,无法直接确定哪些执行对象提供的应答答案一定优于其他执行对象,应答答案也无法进行验证。在当前,为确保得到准确的任务答案,一般是将一个众包任务分发给多个执行对象,进而获取每个执行对象的应答答案,再通过人工筛选的方式,进行应答答案的验证,这种方式使得对于众包任务的答案验证准确性较低,且答案验证效率低,如何提高众包任务的答案验证效率,成了一个亟待解决的难题。The inventor found that due to the different learning methods and implementation logic of different execution objects, different execution objects have different execution results for the same task. At the same time, because the execution objects have the learning ability, it is impossible to directly determine which execution objects provide The response answer must be better than other execution objects, and the response answer cannot be verified. At present, in order to ensure accurate task answers, it is generally to distribute a crowdsourced task to multiple execution objects, and then obtain the answer answer of each execution object, and then verify the answer answer through manual screening. This method makes the accuracy of answer verification for crowdsourcing tasks low, and the efficiency of answer verification is low. How to improve the efficiency of answer verification for crowdsourcing tasks has become an urgent problem to be solved.
发明内容Summary of the invention
本申请实施例的目的在于提出一种众包任务的答案验证方法,解决现有技术人工验证众包任务的方式答案验证效率低的问题。The purpose of the embodiments of the present application is to propose an answer verification method for crowdsourced tasks, and solve the problem of low answer verification efficiency in the manner of manually verifying crowdsourced tasks in the prior art.
为了解决上述技术问题,本申请实施例提供一种众包任务的答案验证方法,包括:In order to solve the above technical problems, an embodiment of the present application provides an answer verification method for crowdsourcing tasks, including:
从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;From all the answer answers obtained by the client, obtain each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;By means of natural language semantic recognition, semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M≤N, and N is a positive integer;
通过预设的一致性校验方式,确定每类所述参考答案的可信度值;Determine the credibility value of each type of reference answer through a preset consistency check method;
从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;From all the credibility values of the reference answers, select the credibility value with the largest numerical value as the maximum credibility value, and compare the maximum credibility value with a preset standard value to obtain a comparison result;
若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
为解决上述技术问题,本申请采用的一个技术方案是:提供一种众包任务的答案验证装置,包括:In order to solve the above technical problems, a technical solution adopted in this application is to provide an answer verification device for crowdsourcing tasks, including:
初始答案获取模块,用于从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;The initial answer obtaining module is used to obtain each answer answer corresponding to the target task from all the answer answers obtained from the client as an initial answer, wherein each answer answer corresponds to a answer object;
语义识别结果模块,用于通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;The semantic recognition result module is used to perform semantic recognition on each of the initial answers through natural language semantic recognition to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is positive Integer
参考答案分类模块,用于将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;The reference answer classification module is used to combine the semantic recognition results in pairs, and use each combination as a set of results, and use the similarity calculation method to count the similarity values between the semantic recognition results in each set of results , If the obtained similarity value is greater than the preset similarity threshold, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M≤N, and N is a positive integer ;
可信度值统计模块,用于通过预设的一致性校验方式,确定每类所述参考答案的可信度值;The credibility value statistics module is used to determine the credibility value of each type of reference answer through a preset consistency check method;
可信度值对比模块,用于从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;The credibility value comparison module is used to select the credibility value with the largest value from all the credibility values of the reference answers as the maximum credibility value, and compare the maximum credibility value with the predicted value. Set standard value comparison and get the comparison result;
模拟答案获取模块,用于若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述目标任务,并将所述目标任务输入到预设模型中,通过所述预设模型得到模拟答案;The simulation answer obtaining module is configured to obtain the target task if the comparison result is that the maximum credibility value is less than the preset standard value, and input the target task into the preset model, and pass the The preset model gets the simulated answer;
应答答案验证模块,用于针对每类所述参考答案,统计所述参考答案与所述模拟答案的相似度值,得到M个相似度值,并从M个所述相似度值中,选取数值最大的相似度值,将所述数值最大的相似度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。The answer answer verification module is used to count the similarity values between the reference answer and the simulated answer for each type of the reference answer to obtain M similarity values, and select a numerical value from the M similarity values For the maximum similarity value, the reference answer corresponding to the similarity value with the largest numerical value is used as the target answer, and the response answer corresponding to the target answer is confirmed as the verified response answer.
一种计算机设备,包括存储器、处理器,以及存储在所述存储器中,并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;From all the answer answers obtained by the client, obtain each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;By means of natural language semantic recognition, semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M≤N, and N is a positive integer;
通过预设的一致性校验方式,确定每类所述参考答案的可信度值;Determine the credibility value of each type of reference answer through a preset consistency check method;
从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;From all the credibility values of the reference answers, select the credibility value with the largest numerical value as the maximum credibility value, and compare the maximum credibility value with a preset standard value to obtain a comparison result;
若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
一种计算机可读存储介质,所述计算机可读存储介质上存储有可执行代码,所述可执行代码被处理器执行时实现如下所述众包任务的答案验证方法的步骤:A computer-readable storage medium having executable code stored on the computer-readable storage medium, and when the executable code is executed by a processor, the steps of the method for verifying the answer of the crowdsourcing task as described below are realized:
从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;From all the answer answers obtained by the client, obtain each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;By means of natural language semantic recognition, semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M≤N, and N is a positive integer;
通过预设的一致性校验方式,确定每类所述参考答案的可信度值;Determine the credibility value of each type of reference answer through a preset consistency check method;
从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;From all the credibility values of the reference answers, select the credibility value with the largest numerical value as the maximum credibility value, and compare the maximum credibility value with a preset standard value to obtain a comparison result;
若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
以上方案中的一种众包任务的答案验证方法,通过从客户端获取的所有应答答案中,获取目标任务对应的每个应答对象的应答答案,作为初始答案;并对每个初始答案进行语义识别,然后将语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案;并通过预设的一致性校验方式,确定每类参考答案的可信度值;接着从所有的参考答案的可信度值中选取数值最大的可信度值,作为最大可信度值,并将最大可信度值与预设标准值对比,得到对比结果;若对比结果为最大可信度值大于或等于预设标准值,则将最大可信度值对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。通过确定最大可信度值,并进行相似度值统计,得出数值最大的相似度值,最终确认验证通过的应答答案,能够有效的提高众包任务的答案验证效率。A method for verifying answers to crowdsourcing tasks in the above scheme. From all the answer answers obtained from the client, the answer answer of each answer object corresponding to the target task is obtained as the initial answer; and semantics of each initial answer Then, the semantic recognition results are combined in pairs, and each combination is used as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the obtained similarity value is greater than Preset the similarity threshold, then use the two semantic recognition results in the group as the same type of reference answer to obtain the M type of reference answer; and determine the credibility value of each type of reference answer through the preset consistency check method ; Then select the highest credibility value from all the credibility values of the reference answers as the maximum credibility value, and compare the maximum credibility value with the preset standard value to obtain the comparison result; if the comparison result If the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the response answer corresponding to the target answer is confirmed as the verified answer. By determining the maximum credibility value and performing similarity value statistics, the largest numerical similarity value is obtained, and finally the answer answers that have passed the verification are confirmed, which can effectively improve the answer verification efficiency of the crowdsourcing task.
附图说明Description of the drawings
为了更清楚地说明本申请中的方案,下面将对本申请实施例描述中所需要使用的附图作一个简单介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the solution in this application more clearly, the following will briefly introduce the drawings used in the description of the embodiments of the application. Obviously, the drawings in the following description are some embodiments of the application. Ordinary technicians can obtain other drawings based on these drawings without creative work.
图1是本申请实施例提供的众包任务的答案验证方法的应用环境示意图;FIG. 1 is a schematic diagram of an application environment of a method for verifying answers to crowdsourced tasks provided by an embodiment of the present application;
图2根据本申请实施例提供的众包任务的答案验证方法的一实现流程图;FIG. 2 is an implementation flow chart of the method for verifying the answers of crowdsourcing tasks according to an embodiment of the present application;
图3是本申请实施例提供的众包任务的答案验证方法中步骤2的一实现流程图;FIG. 3 is an implementation flow chart of step 2 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application;
图4是本申请实施例提供的众包任务的答案验证方法中步骤S21的一实现流程图;FIG. 4 is an implementation flowchart of step S21 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application;
图5是本申请实施例提供的众包任务的答案验证方法中步骤S5之后的一实现流程图;FIG. 5 is an implementation flow chart after step S5 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application;
图6是本申请实施例提供的众包任务的答案验证方法中步骤S5之后的又一实现流程图;FIG. 6 is another implementation flowchart after step S5 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application;
图7是本申请实施例提供的众包任务的答案验证方法中步骤S57的一实现流程图;FIG. 7 is an implementation flowchart of step S57 in the method for verifying answers to crowdsourced tasks provided by an embodiment of the present application;
图8是本申请实施例提供的众包任务的答案验证装置示意图;FIG. 8 is a schematic diagram of an answer verification device for crowdsourcing tasks provided by an embodiment of the present application;
图9是本申请实施例提供的计算机设备的示意图。Fig. 9 is a schematic diagram of a computer device provided by an embodiment of the present application.
具体实施方式Detailed ways
除非另有定义,本文所使用的所有的技术和科学术语与属于本申请的技术领域的技术人员通常理解的含义相同;本文中在申请的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本申请;本申请的说明书和权利要求书及上述附图说明中的术语“包括”和“具有”以及它们的任何变形,意图在于覆盖不排他的包含。本申请的说明书和权利要求书或上述附图中的术语“第一”、“第二”等是用于区别不同对象,而不是用于描述特定顺序。Unless otherwise defined, all technical and scientific terms used herein have the same meanings as commonly understood by those skilled in the technical field of the application; the terms used in the specification of the application herein are only for describing specific embodiments. The purpose is not to limit the application; the terms "including" and "having" in the specification and claims of the application and the above-mentioned description of the drawings and any variations thereof are intended to cover non-exclusive inclusions. The terms "first", "second", etc. in the specification and claims of the present application or the above-mentioned drawings are used to distinguish different objects, rather than to describe a specific sequence.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本申请的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。The reference to "embodiments" herein means that a specific feature, structure, or characteristic described in conjunction with the embodiments may be included in at least one embodiment of the present application. The appearance of the phrase in various places in the specification does not necessarily refer to the same embodiment, nor is it an independent or alternative embodiment mutually exclusive with other embodiments. Those skilled in the art clearly and implicitly understand that the embodiments described herein can be combined with other embodiments.
为了使本技术领域的人员更好地理解本申请方案,下面将结合附图,对本申请实施例中的技术方案进行清楚、完整地描述。In order to enable those skilled in the art to better understand the solutions of the present application, the technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings.
下面结合附图和实施方式对本申请进行详细说明。The application will be described in detail below with reference to the drawings and implementations.
请参阅图1,系统架构100可以包括终端设备101、102、103,网络104和服务器105。网络104用以在终端设备101、102、103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等等。Referring to FIG. 1, the system architecture 100 may include terminal devices 101, 102, and 103, a network 104 and a server 105. The network 104 is used to provide a medium for communication links between the terminal devices 101, 102, 103 and the server 105. The network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, and so on.
用户可以使用终端设备101、102、103通过网络104与服务器105交互,以接收或发送消息等。终端设备101、102、103上可以安装有各种通讯客户端应用,例如网页浏览器应用、搜索类应用、即时通信工具等。The user can use the terminal devices 101, 102, and 103 to interact with the server 105 through the network 104 to receive or send messages and so on. Various communication client applications, such as web browser applications, search applications, instant messaging tools, etc., may be installed on the terminal devices 101, 102, and 103.
终端设备101、102、103可以是具有显示屏并且支持网页浏览的各种电子设备,包括但不限于智能手机、平板电脑、膝上型便携计算机和台式计算机等等。The terminal devices 101, 102, 103 may be various electronic devices with display screens and supporting web browsing, including but not limited to smart phones, tablet computers, laptop computers, desktop computers, and so on.
服务器105可以是提供各种服务的服务器,例如对终端设备101、102、103上显示的页面提供支持的后台服务器。The server 105 may be a server that provides various services, for example, a background server that provides support for pages displayed on the terminal devices 101, 102, and 103.
需要说明的是,本申请实施例所提供的一种众包任务的答案验证方法一般由服务器执行,相应地,一种众包任务的答案验证装置一般设置于服务器中。It should be noted that the method for verifying answers for crowdsourcing tasks provided by the embodiments of the present application is generally executed by a server. Accordingly, a device for verifying answers for crowdsourcing tasks is generally set in the server.
应该理解,图1中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。It should be understood that the numbers of terminal devices, networks, and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks, and servers according to implementation needs.
请参阅图2,图2示出了众包任务的答案验证方法的一种具体实施方式。Please refer to FIG. 2. FIG. 2 shows a specific implementation of the method for verifying the answers of crowdsourcing tasks.
需注意的是,若有实质上相同的结果,本申请的方法并不以图2所示的流程顺序为限,该方法包括如下步骤:It should be noted that if there are substantially the same results, the method of the present application is not limited to the sequence of the process shown in FIG. 2, and the method includes the following steps:
S1:从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个应答答案对应一个应答对象。S1: Among all the answer answers obtained from the client, each answer answer corresponding to the target task is obtained as an initial answer, where each answer answer corresponds to a answer object.
具体地,在生成或获取到目标任务之后,服务端将目标任务推送给多个对象,对象对目标任务进行回答后,得到一个应答答案,通过客户端将应答答案反馈给服务端,服务端通过网络传输协议,接收每个客户端发送的应答答案,并作为初始答案。Specifically, after the target task is generated or obtained, the server pushes the target task to multiple objects. After the object answers the target task, a response answer is obtained, and the response answer is fed back to the server through the client. The network transmission protocol receives the answer sent by each client as the initial answer.
在服务端,预先存储由每个应答对象的信息,在应答对象进行任务的回答反馈后,得到一个应答答案后,建立该应答答案与应答对象的映射关系。在获取到初始答案之后,获取每个初始对应的应答对象,也即,获取所有参与该目标任务并反馈应答答案到服务端的对象的名单。On the server side, the information of each respondent is pre-stored. After the respondent responds to the task and gets a response, the mapping relationship between the response and the respondent is established. After obtaining the initial answer, obtain each initial corresponding response object, that is, obtain a list of all objects participating in the target task and feedback the response answer to the server.
其中,本实施例中的应答对象,具体可以是预设的网络模型(机器学习模型或神经网络模型),也可以是预设的计算引擎或者大数据平台等具有自主学习能力和辨别能力的个体,此处不做具体限定。对于同一个众包任务,不同对象由于其结构和实现逻辑不同,会给出各自的应答答案。Among them, the response object in this embodiment may specifically be a preset network model (machine learning model or neural network model), or may be a preset computing engine or a big data platform and other individuals with autonomous learning and discrimination capabilities. , There is no specific limitation here. For the same crowdsourcing task, different objects will give their own answers due to their different structures and implementation logic.
需要说明的是,服务端同时会进行多个任务的派发推送,在同一时间段也可能接收到来自不同任务的应答答案,为了方便后续对众包参考答案的筛选,服务端预先对每个任务均设置一个任务标识。通过目标任务标识从所有应答答案中,选取出目标任务对应的初始答案。It should be noted that the server will send and push multiple tasks at the same time, and may also receive answers from different tasks at the same time period. In order to facilitate the subsequent screening of crowdsourced reference answers, the server will pre-select each task Set a task ID for each. The initial answer corresponding to the target task is selected from all the answer answers through the target task identifier.
例如,在一具体实施方式中,从客户端获取的所有应答答案中,包含三种任务标识,分别为任务A、任务B和任务C,当前众包任务的任务标识为任务A,因而,选取任务标识为任务A的应答答案,作为初始答案。For example, in a specific embodiment, all the response answers obtained from the client include three task identifiers, namely task A, task B, and task C. The task identifier of the current crowdsourcing task is task A. Therefore, select The task identifier is the answer to task A, as the initial answer.
其中,任务标识具体可以由字符、数字或者字符数字的组合的方式来表示,例如,一任务标识为“TPSB5201906280236”。Among them, the task identification can be specifically represented by a combination of characters, numbers, or characters and numbers, for example, a task identification is "TPSB5201906280236".
其中,本实施例中的目标任务为对客观事实的陈述类型,例如,对给定图像上的文字进行识别转写等。Among them, the target task in this embodiment is a type of statement of objective facts, for example, recognizing and transcribing text on a given image.
在本实施例中,一种众包任务的答案验证方法运行于其上的电子设备(例如图1所示的服务器),可以通过有线连接方式或者无线连接方式。需要指出的是,上述无线连接方式可以包括但不限于3G/4G连接、WiFi连接、蓝牙连接、WiMAX连接、Zigbee连接、UWB(ultra wideband)连接、以及其他现在已知或将来开发的无线连接方式。In this embodiment, an electronic device (such as the server shown in FIG. 1) on which a method for verifying answers to crowdsourcing tasks runs may be connected via a wired connection or a wireless connection. It should be pointed out that the above-mentioned wireless connection methods can include, but are not limited to, 3G/4G connection, WiFi connection, Bluetooth connection, WiMAX connection, Zigbee connection, UWB (ultra wideband) connection, and other wireless connection methods currently known or developed in the future .
S2:用于通过自然语言语义识别的方式,对每个初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为应答对象的数量,N为正整数。S2: Used to perform semantic recognition on each initial answer through natural language semantic recognition, and obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer.
具体地,通过自然语言语义识别的方法,对每个初始答案进行语义识别,得到每个初始答案的语义识别结果。Specifically, through the method of natural language semantic recognition, semantic recognition is performed on each initial answer, and the semantic recognition result of each initial answer is obtained.
其中,自然语言语义识别,主要通过采用NLP的方式,NLP(Natural Language Processing)又称自然语言处理,由于理解(understanding)自然语言,需要关于外在世界的广泛知识以及运用操作这些知识的能力,自然语言认知,同时也被视为一个人工智能完备(AI-complete)的问题。NLP任务主要是指涉及到自然语言的语义理解或解析的一些任务,常见的NLP任务包括但不限于:语音识别(Speech recognition)、中文自动分词(Chinese word segmentation)、词性标注(Part-of-speech tagging)、文本分类(Text categorization)、句法分析(Parsing)、自动摘要(Automatic summarization)、问答系统(Question answering)和信息抽取(Information extraction)等。Among them, natural language semantic recognition is mainly through the use of NLP. NLP (Natural Language Processing) is also called natural language processing. Understanding natural language requires extensive knowledge about the external world and the ability to use and operate this knowledge. Natural language cognition is also regarded as an AI-complete problem. NLP tasks mainly refer to some tasks that involve the semantic understanding or parsing of natural language. Common NLP tasks include but are not limited to: Speech recognition, Chinese word segmentation, Part-of- speech tagging, text categorization, parsing, automatic summarization, question answering, information extraction, etc.
S3:将语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数。S3: Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the obtained similarity value is greater than the expected value If the similarity threshold is set, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M≤N, and N is a positive integer.
具体的,通过将语义识别结果两两组合,并将每个组合作为一组结果,运用相似度计算,得出每一组语义识别结果之间的相似度值,通过预设相似度阈值,若大于预设相似度阈值,则将任意两个语义识别结果归为预设相似度阈值。Specifically, by combining the semantic recognition results in pairs, and using each combination as a set of results, the similarity calculation is used to obtain the similarity value between each set of semantic recognition results, and the similarity threshold is preset, if If it is greater than the preset similarity threshold, any two semantic recognition results are classified as the preset similarity threshold.
其中,预设相似度阈值为服务端预设设置,用于将不同的语义识别结果分别归类,较为合理的预设相似度阈值为0.9、0.8、0.7、0.6等,其具体设置可依据实际情况进行设定,此处不做限定,优选的,本实施例中预设相似度阈值为0.8。Among them, the preset similarity threshold is a server-side preset setting, which is used to classify different semantic recognition results. The more reasonable preset similarity thresholds are 0.9, 0.8, 0.7, 0.6, etc., and the specific settings can be based on actual conditions. The situation is set, which is not limited here. Preferably, the preset similarity threshold value in this embodiment is 0.8.
其中,相似度计算的方式包括但不限于:闵可夫斯基距离(Minkowski Distance)、曼哈顿距离(Manhattan Distance)、欧氏距离(Euclidean Distance)、切比雪夫距离(Chebyshev Distance)、汉明距离(Hamming Distance)和马氏距离(Mahalanobis Distance)等。Among them, the similarity calculation methods include, but are not limited to: Minkowski Distance, Manhattan Distance, Euclidean Distance, Chebyshev Distance, and Hamming Distance. Distance) and Mahalanobis Distance (Mahalanobis Distance) and so on.
优选的,采用欧氏距离计算每组结果中语义识别结果之间的相似度值;采用欧氏距离计算,能够快速高效的计算出每组结果中语义识别结果之间的相似度值。Preferably, the Euclidean distance is used to calculate the similarity value between the semantic recognition results in each group of results; the Euclidean distance calculation is used to quickly and efficiently calculate the similarity value between the semantic recognition results in each group of results.
S4:通过预设的一致性校验方式,确定每类参考答案的可信度值。S4: Determine the credibility value of each type of reference answer through a preset consistency check method.
其中,可信度值是指该类参考答案的可靠程度。Among them, the credibility value refers to the reliability of this type of reference answer.
在一实施例中,采用如下公式计算每类参考答案的可信度值:In one embodiment, the following formula is used to calculate the credibility value of each type of reference answer:
CR 1+CR 2+…+CR m=1 CR 1 +CR 2 +…+CR m =1
P={P 1,P 2,…,P m} P={P 1 ,P 2 ,…,P m }
Figure PCTCN2020117671-appb-000001
Figure PCTCN2020117671-appb-000001
其中,P={P 1,P 2,…,P m}为参考答案集合,P m为第m类参考答案包含的应答答案的数量,CR m为第m类参考答案对应的可信度值,m为参考答案类别数量,n为初始答案的数量。 Among them, P = {P 1 ,P 2 ,...,P m } is the reference answer set, P m is the number of answer answers contained in the m-th reference answer, and CR m is the credibility value corresponding to the m-th reference answer , M is the number of reference answer categories, n is the number of initial answers.
S5:从所有的参考答案的可信度值中选取数值最大的可信度值,作为最大可信度值,并将最大可信度值与预设标准值对比,得到对比结果。S5: Select the credibility value with the largest value from the credibility values of all reference answers as the maximum credibility value, and compare the maximum credibility value with the preset standard value to obtain the comparison result.
具体地,从每类参考答案的可信度值中选取数值最大的可信度值与预设标准值对比,根据对比结果,确定所有分类的参考答案中,是否存在符合要求的参考答案。Specifically, the credibility value with the largest value is selected from the credibility value of each type of reference answer to compare with the preset standard value, and based on the comparison result, it is determined whether there is a reference answer that meets the requirements among all the classified reference answers.
其中,预设标准值为服务端预设设置,用于评估可信度值是否符合要求的数值,较为合理的范围为(0.5,1),其具体设置可依据实际情况进行设定,此处不做限定,优选的,本实施例中预设标准值为0.6。Among them, the preset standard value is the server-side preset setting, which is used to evaluate whether the credibility value meets the requirements. The reasonable range is (0.5,1). The specific setting can be set according to the actual situation. Here It is not limited, and preferably, the preset standard value in this embodiment is 0.6.
其中,对比结果包括最大可信度值小于预设标准值、最大可信度值大于或等于预设标准值。Among them, the comparison result includes that the maximum credibility value is less than the preset standard value, and the maximum credibility value is greater than or equal to the preset standard value.
S6:若对比结果为最大可信度值大于或等于预设标准值,则将最大可信度值对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。S6: If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the response answer corresponding to the target answer is confirmed as the verified answer.
具体的,当最大可信度值大于或等于预设标准值,说明该最大可信度值对应的参考答案为值得信赖的应答答案,将其作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。Specifically, when the maximum credibility value is greater than or equal to the preset standard value, it indicates that the reference answer corresponding to the maximum credibility value is a trustworthy answer answer, which is taken as the target answer, and the answer answer corresponding to the target answer is confirmed To verify the answer answer passed.
向终端设备101、102、103发送答案验证结果,使得应答对象能够获知答案验证的结果。The answer verification result is sent to the terminal devices 101, 102, 103, so that the respondent can learn the answer verification result.
本实施例中,通过从客户端获取的所有应答答案中,获取众包任务对应的每个应答对象的应答答案,作为初始答案;并对每个初始答案进行语义识别,然后将语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案;并通过预设的一致性校验方法,确定每类参考答案的可信度值;接着从所有的每类参考答案的可信度值中选取最大可信度值,得到对比结果;若对比结果为最大可信度值大于或等于预设标准值,则将最大可信度值对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案通过确定最大可信度值,并进行相似度值统计,得出数值最大的相似度值,最终确认验证通过的应答答案,能够有效的提高众包任务的答案验证效率。In this embodiment, from all the response answers obtained from the client, the response answer of each response object corresponding to the crowdsourcing task is obtained as the initial answer; semantic recognition is performed on each initial answer, and then the semantic recognition results are divided into two Two combinations, and each combination is used as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the obtained similarity value is greater than the preset similarity threshold, the The two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained; and the credibility value of each type of reference answer is determined through the preset consistency verification method; then all the reference answers of each type The maximum credibility value of the answer is selected to obtain the comparison result; if the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer. And confirm the answer answer corresponding to the target answer as the answer answer that passed the verification. By determining the maximum credibility value, and performing similarity value statistics, the largest numerical similarity value is obtained, and finally the answer answer that has passed the verification is confirmed to be effective Improve the efficiency of answer verification for crowdsourcing tasks.
请参阅图3,图3示出了步骤S2的一种具体实施方式,步骤S2中,通过自然语言语义识别的方式,对每个初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为应答对象的数量,N为正整数的具体实现过程,详叙如下:Please refer to FIG. 3, which shows a specific implementation of step S2. In step S2, semantic recognition is performed on each initial answer by means of natural language semantic recognition, and the semantic recognition results of N initial answers are obtained. Among them, N is the number of response objects, and N is a positive integer. The specific implementation process is described in detail as follows:
S21:通过预设的分词方式,对初始答案进行分词处理,得到初始答案中包含的基础分词。S21: Perform word segmentation processing on the initial answer through the preset word segmentation method to obtain the basic word segmentation contained in the initial answer.
S22:将基础分词转换为词向量,并通过聚类算法,对词向量进行聚类,得到每个词向量对应的聚类中心。S22: Convert the basic word segmentation into word vectors, and cluster the word vectors through a clustering algorithm to obtain a cluster center corresponding to each word vector.
S23:获取每个词向量对应的聚类中心对应的预设语义,作为初始答案的语义识别结果。S23: Obtain the preset semantics corresponding to the cluster centers corresponding to each word vector as the semantic recognition result of the initial answer.
本实施例中,通过对初始答案进行分词处理和通过聚类算法对词向量进行聚类,得到初始答案的语义识别结果,使得对初始答案的语义识别更加准确,进一步提高了众包任务的应答答案的验证准确性。In this embodiment, by performing word segmentation processing on the initial answer and clustering the word vector through a clustering algorithm, the semantic recognition result of the initial answer is obtained, so that the semantic recognition of the initial answer is more accurate, and the response to the crowdsourcing task is further improved. The verification accuracy of the answer.
请参阅图4,图4示出了步骤S21的一种具体实施方式,步骤S21中,通过预设的分词方式,对初始答案进行分词处理,得到初始答案中包含的基础分词的具体实现过程,详叙如下:Please refer to FIG. 4, which shows a specific implementation of step S21. In step S21, the initial answer is segmented through a preset word segmentation method to obtain the specific implementation process of the basic word segmentation contained in the initial answer. The details are as follows:
S211:对基础语句进行分词解析,得到K个分词序列。S211: Perform word segmentation analysis on the basic sentence to obtain K word segmentation sequences.
S212:针对每个分词序列,依据预设的训练语料库的词序列数据,计算每个分词序列的发生概率,得到K个分词序列的发生概率。S212: For each word segmentation sequence, calculate the occurrence probability of each word segmentation sequence according to the word sequence data of the preset training corpus to obtain the occurrence probability of K word segmentation sequences.
S213:从K个分词序列的发生概率中,选取达到预设概率阈值的发生概率对应的分词序列,作为目标分词序列,并将目标分词序列中的每个分词,作为初始答案中包含的基础分词。S213: From the occurrence probabilities of K word segmentation sequences, select the word segmentation sequence corresponding to the occurrence probability that reaches the preset probability threshold as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as the basic word segmentation included in the initial answer .
本实施例中,通过基础语句进行分词解析,计算每个分词序列的发生概率,能够准确地得到初始答案中包含的基础分词,使得初始答案能够得到准确语义识别结果。In this embodiment, the word segmentation analysis is performed through the basic sentence, and the occurrence probability of each word segmentation sequence is calculated, so that the basic word segmentation contained in the initial answer can be accurately obtained, so that the initial answer can obtain an accurate semantic recognition result.
请参阅图5,图5示出了步骤S5之后的一种具体实施方式,具体实现过程,详叙如下:Please refer to Figure 5. Figure 5 shows a specific implementation after step S5. The specific implementation process is described in detail as follows:
S51:若对比结果为最大可信度值小于预设标准值,则获取目标任务,并将目标任务输入到预设模型中,通过预设模型得到模拟答案。S51: If the comparison result is that the maximum credibility value is less than the preset standard value, the target task is obtained, and the target task is input into the preset model, and the simulated answer is obtained through the preset model.
具体地,每类目标任务,均预设有对应的任务模型,还任务模型不具备自主学习能力,但可得到一个准确率符合要求的模拟答案。在最大可信度值小于预设标准值时,获取目标任务,并将目标任务输入到预设模型中,通过预设模型得到模拟答案。Specifically, for each type of target task, a corresponding task model is preset, and the task model does not have the ability to learn independently, but a simulated answer with an accuracy that meets the requirements can be obtained. When the maximum credibility value is less than the preset standard value, the target task is obtained, and the target task is input into the preset model, and the simulated answer is obtained through the preset model.
在一具体实施方式中,一众包任务为图像中的文字识别与转写,预设模型为光学字符文字识别匹配模型,可信度值在0.65至0.75之间,预设的可信度值为0.55,在步骤S50中,计算的可信度值低于0.55时,使用该预设模型来得到一个模拟答案。In a specific embodiment, a crowdsourcing task is text recognition and transliteration in an image, the preset model is an optical character recognition matching model, the credibility value is between 0.65 and 0.75, and the preset credibility value It is 0.55. In step S50, when the calculated credibility value is lower than 0.55, the preset model is used to obtain a simulated answer.
S52:针对每类参考答案,统计参考答案与模拟答案的相似度值,得到M个相似度值,并从M个相似度值中,选取数值最大的相似度值,将数值最大的相似度值对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。S52: For each type of reference answer, count the similarity values between the reference answer and the simulated answer to obtain M similarity values, select the highest similarity value from the M similarity values, and set the highest similarity value The corresponding reference answer is used as the target answer, and the answer answer corresponding to the target answer is confirmed as the answer answer that has passed the verification.
具体的,通过预设模型得到模拟答案,将每个参考答案与模拟答案,通过相似度计算的方式,得到每个参考答案与模拟答案的相似度值,将相似度值按由大到小的顺序排列,选取其中最大的相似度值的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。Specifically, the simulated answer is obtained through the preset model, and the similarity value between each reference answer and the simulated answer is obtained by calculating the similarity between each reference answer and the simulated answer, and the similarity value is ranked from large to small Arrange in order, select the reference answer with the largest similarity value as the target answer, and confirm the answer corresponding to the target answer as the verified answer.
其中,相似度计算的方式已在步骤S3中示出,此处不在累赘。Among them, the similarity calculation method has been shown in step S3, which is not redundant here.
在一具体实施方式中,通过采用欧式距离的方式,得到每个参考答案与模拟答案的欧氏距离,如0.9、0.8、0.5、0.2和0.1等,由于采用欧式距离的方式,欧氏距离越小,则其参考答案与模拟答案的相似度值,所以在上述欧氏距离中,应选取欧氏距离为0.1对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。In a specific embodiment, by adopting the Euclidean distance, the Euclidean distance between each reference answer and the simulated answer, such as 0.9, 0.8, 0.5, 0.2, and 0.1, is obtained. Due to the Euclidean distance, the Euclidean distance is larger. If it is small, the similarity value between the reference answer and the simulated answer. Therefore, in the above Euclidean distance, the reference answer corresponding to the Euclidean distance of 0.1 should be selected as the target answer, and the answer corresponding to the target answer should be confirmed as the verified answer Answer the answer.
本实施例中,通过预设模型得到模拟答案,并确定验证通过的应答答案,能够进一步的提高众包任务的答案验证的准确性。In this embodiment, the simulated answer is obtained through the preset model, and the verified answer is determined, which can further improve the accuracy of the answer verification of the crowdsourcing task.
请参阅图6,图6示出了步骤S6之后的一种具体实施方式,具体实现过程,详叙如下:Please refer to Figure 6. Figure 6 shows a specific implementation after step S6. The specific implementation process is described in detail as follows:
S53:应答对象包括第一对象和第二对象,第一对象的回答答案为第一应答答案,第二对象的回答答案为第二应答答案,第一对象和第二对象均对应有预设权重,且第一对象的预设权重小于第二对象的预设权重。S53: The response object includes a first object and a second object. The answer of the first object is the first answer, the answer of the second object is the second answer, and both the first object and the second object have preset weights. , And the preset weight of the first object is less than the preset weight of the second object.
具体的,在服务端中,应答对象包括第一对象和第二对象,其中,第一对象的回答答案为第一应答答案,第二对象的回答答案为第二应答答案,第一对象和第二对象均对应有预设权重,同时第一对象的预设权重小于第二对象的预设权重。Specifically, in the server, the response object includes a first object and a second object, where the answer of the first object is the first answer, the answer of the second object is the second answer, and the first object and the first object Both objects have preset weights corresponding to them, and the preset weight of the first object is smaller than the preset weight of the second object.
其中,预设权重是根据应答对象以往对众包任务的应答答案,服务器所设置的答案权重,可以用于评估应答对象的应答答案可信度。Among them, the preset weight is based on the answer to the crowdsourcing task of the respondent in the past, and the answer weight set by the server can be used to evaluate the credibility of the answer of the respondent.
应理解,通过赋予第一对象和第二对象均对应有预设权重,在计算的可信度值达不到要求(小于预设标准值)时,可根据预设权重对第一对象和第二对象对应的应答答案进行加权,使得应答答案的可信度值分布更为合理,有利于提高可信度值的准确性。It should be understood that by assigning preset weights to both the first object and the second object, when the calculated credibility value does not meet the requirements (less than the preset standard value), the first object and the second object can be assigned according to the preset weight. The response answers corresponding to the two subjects are weighted, so that the distribution of the reliability value of the response answer is more reasonable, which is beneficial to improve the accuracy of the reliability value.
S54:若对比结果为最大可信度值小于预设标准值,则获取第一应答答案对应的第一预设权重,以及第二应答答案对应的第二预设权重。S54: If the comparison result is that the maximum credibility value is less than the preset standard value, obtain the first preset weight corresponding to the first response answer and the second preset weight corresponding to the second response answer.
具体的,若最大可信度值小于预设标准值,则无法对应答答案进行验证,需要获取第一应答对象和第二应答对象的第一应答答案和第二应答答案,并赋予第一应答答案和第二应答答案不同的权重,根据赋予的权重确定应答答案的可信度。Specifically, if the maximum credibility value is less than the preset standard value, the answer answer cannot be verified, and the first answer answer and the second answer answer of the first answer object and the second answer object need to be obtained, and the first answer is assigned The answer and the second answer have different weights, and the credibility of the answer is determined according to the weight assigned.
其中,预设权重为服务端预设设置,较为合理的范围为(0.1,1),其具体设置可依据实际情况进行设定,此处不做限定,优选的,本实施例中第一预设权重为0.6,第二预设权重为0.8。Among them, the preset weight is the server preset setting, and the reasonable range is (0.1, 1). The specific setting can be set according to the actual situation, and it is not limited here. Preferably, the first preset in this embodiment Set the weight to 0.6, and the second preset weight to 0.8.
S55:根据第一应答答案和第二应答答案所属的参考答案类别、第一预设权重和第二预设权重,确定每类参考答案的可信度权重。S55: Determine the credibility weight of each type of reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight, and the second preset weight.
具体的,众包任务有不同的应答题目类型,相应的,也有应应答答案所属的参考答案类别。根据第一应答答案和第二应答答案所属的参考答案类别,识别第一应答答案和第二应答答案的第一预设权重和第二预设权重,最终确定每类参考答案的可信度权重。Specifically, crowdsourcing tasks have different answer question types, and correspondingly, there are also reference answer categories to which the answer should belong. According to the reference answer category to which the first answer answer and the second answer answer belong, identify the first preset weight and the second preset weight of the first answer answer and the second answer answer, and finally determine the credibility weight of each type of reference answer .
其中,可信度权重是服务端赋予每类参考答案的可信度比例。Among them, the credibility weight is the credibility ratio given to each type of reference answer by the server.
S56:根据每类参考答案的可信度权重和预设的权重校验方式,确定每类参考答案的加权可信度值。S56: Determine the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and the preset weight verification method.
其中,加权可信度值是应答答案对应的可靠度值。Among them, the weighted reliability value is the reliability value corresponding to the answer answer.
在一实施例中,采用如下计算方式计算每类参考答案的加权可信度值:In one embodiment, the following calculation method is used to calculate the weighted credibility value of each type of reference answer:
假设第一应答答案为P 1,则该参考答案集合为P={P 1,P 2,…,P m},对应的加权可信度值为CR={CR 1+0.5,CR 2,…,CR m}。 Assuming that the first answer is P 1 , then the reference answer set is P={P 1 ,P 2 ,...,P m }, and the corresponding weighted credibility value is CR={CR 1 +0.5,CR 2 ,... ,CR m }.
其中,P m为第m类参考答案包含的应答答案的数量,CR为第m类参考答案对应的加权可信度值,m为参考答案类别数量,n为初始答案的数量。 Among them, P m is the number of answer answers contained in the m-th reference answer, CR is the weighted credibility value corresponding to the m-th reference answer, m is the number of reference answer categories, and n is the number of initial answers.
S57:选取数值最大的加权可信度值,作为目标加权可信度值,获取目标加权可信度值对应的参考答案,作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。S57: Select the weighted credibility value with the largest numerical value as the target weighted credibility value, obtain the reference answer corresponding to the target weighted credibility value as the target answer, and confirm the response answer corresponding to the target answer as the verified response Answer.
具体的,通过获取最大的加权可信度值,即该加权可信度值对应的参考答案为众包任务最佳的应答答案,通过将其作为众包任务的应答答案,并确认为验证通过的应答答案。Specifically, by obtaining the largest weighted credibility value, that is, the reference answer corresponding to the weighted credibility value is the best response answer for the crowdsourcing task, and by using it as the response answer of the crowdsourcing task, and confirming that it has passed the verification The answer to the answer.
本实施例中,通过确定每类参考答案的可信度权重和计算每类参考答案的加权可信度值,获取到每类参考答案的可靠程度,提高众包任务的应答答案验证的准确性。In this embodiment, by determining the credibility weight of each type of reference answer and calculating the weighted credibility value of each type of reference answer, the reliability of each type of reference answer is obtained, and the accuracy of the answer verification of the crowdsourcing task is improved. .
请参阅图7,图7示出了步骤S57的一种具体实施方式,步骤S57中,选取数值最大的加权可信度值,作为目标加权可信度值,获取目标加权可信度值对应的参考答案,作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案,详叙如下:Please refer to Figure 7. Figure 7 shows a specific implementation of step S57. In step S57, the weighted credibility value with the largest value is selected as the target weighted credibility value, and the corresponding weighted credibility value of the target is obtained. The reference answer is used as the target answer, and the answer corresponding to the target answer is confirmed as the verified answer. The details are as follows:
S571:获取第一对象和第二对象的历史应答答案。S571: Obtain historical response answers of the first object and the second object.
具体的,服务端将众包任务的对象,分为第一对象和第二对象;第一对象和第二对象指的是不同类别的对象,而不是特指其中的一个或者两个对象;通过获取第一对象和第二对象对众包任务的历史应答答案,能够获知第一对象和第二对象对于众包任务的应答水平。Specifically, the server divides the objects of the crowdsourcing task into a first object and a second object; the first object and the second object refer to different types of objects, rather than specifically one or two of them; through The historical response answers of the first object and the second object to the crowdsourcing task can be obtained, and the response level of the first object and the second object to the crowdsourcing task can be learned.
S572:判断第一对象的历史应答答案中,验证通过的应答答案比例,得到第一对象的应答准确率,并判断第二对象的历史应答答案中,验证通过的应答答案比例,得到第二对象的应答准确率。S572: Judging the proportion of the first subject’s historical response answers and obtaining the response accuracy rate of the first subject, and judging the proportion of the second subject’s historical response answers, obtaining the second subject The response accuracy rate.
具体的,通过判断第一对象和第二对象的历史应答答案中,验证通过的应答答案比例,得到第一对象和第二对象的应答对应的准确率,进一步获知第一对象和第二对象对众包任务的应答能力。Specifically, by judging the ratio of the verified response answers in the historical response answers of the first object and the second object, the accuracy rate corresponding to the responses of the first object and the second object is obtained, and the pair of the first object and the second object is further learned The ability to respond to crowdsourcing tasks.
S573:根据应答准确率与预设的分类阈值,对第一对象的预设权重和第二对象的预设权重更新,得到更新后的第一对象的预设权重和更新后的第二对象的预设权重。S573: According to the response accuracy rate and the preset classification threshold, update the preset weight of the first object and the preset weight of the second object to obtain the updated preset weight of the first object and the updated second object. Default weight.
具体的,获取到第一对象和第二对象的应答准确率,如若部分第一对象有较高的准确率,且该准确率超过了预设的分类阈值,则将这一部分的第一对象调整为第二对象;如若部分第二对象存在较低的准确率,且该准确率低于预设的分类阈值,则将这一部分的第二对象调整为第一对象。Specifically, the response accuracy rates of the first object and the second object are obtained. If a part of the first object has a higher accuracy rate and the accuracy rate exceeds the preset classification threshold, then the first object of this part is adjusted Is the second object; if a part of the second object has a low accuracy rate, and the accuracy rate is lower than the preset classification threshold, then adjust this part of the second object as the first object.
其中,预设的分类阈值为服务端预设设置,较为合理的范围为(0.6,0.9),其具体设置可依据实际情况进行设定,此处不做限定,优选的,本实施例中预设分类阈值为0.6。Among them, the preset classification threshold is the server-side preset setting, and the reasonable range is (0.6, 0.9). The specific setting can be set according to the actual situation, and it is not limited here. Preferably, it is preset in this embodiment. Set the classification threshold to 0.6.
在本实施例中,通过判断第一对象和第二对象的应答准确率,并更新第一对象的预设权重和第二对象的预设权重,使得第一对象的预设权重和第二对象的预设权重得以更新,使得应答对象对众包任务的应答答案更加重视,进一步的高了众包任务的准确性和效率。In this embodiment, by judging the response accuracy of the first object and the second object, and updating the preset weight of the first object and the preset weight of the second object, the preset weight of the first object and the second object The preset weights of is updated, so that the respondent pays more attention to the answers to the crowdsourcing task, which further improves the accuracy and efficiency of the crowdsourcing task.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence number of each step in the foregoing embodiment does not mean the order of execution. The execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present application.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,该计算机可读指令可存储于一计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,前述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)等非易失性存储介质,或随机存储记忆体(Random Access Memory,RAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through computer-readable instructions, which can be stored in a computer-readable storage medium. When the computer-readable instructions are executed, they may include the processes of the above-mentioned method embodiments. Among them, the aforementioned storage medium may be a non-volatile storage medium such as a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), etc.
请参考图8,作为对上述图2所示方法的实现,本申请提供了一种众包任务的答案验证装置的一个实施例,该装置实施例与图2所示的方法实施例相对应,该装置具体可以应用于各种电子设备中。Please refer to FIG. 8. As an implementation of the method shown in FIG. 2, this application provides an embodiment of an answer verification device for crowdsourced tasks. The device embodiment corresponds to the method embodiment shown in FIG. 2. The device can be specifically applied to various electronic devices.
如图8所示,本实施例的一种众包任务的答案验证装置包括:初始答案获取模块81、语义识别结果模块82、参考答案分类模块83、可信度值统计模块84、可信度值对比模块85和应答答案验证模块86。其中:As shown in FIG. 8, the answer verification device for crowdsourcing tasks in this embodiment includes: an initial answer obtaining module 81, a semantic recognition result module 82, a reference answer classification module 83, a credibility value statistics module 84, and credibility The value comparison module 85 and the answer verification module 86. in:
初始答案获取模块81,用于从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个应答答案对应一个应答对象;The initial answer obtaining module 81 is used to obtain each answer answer corresponding to the target task from all the answer answers obtained from the client as an initial answer, where each answer answer corresponds to a answer object;
语义识别结果模块82,用于通过自然语言语义识别的方式,对每个初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为应答对象的数量,N为正整数;The semantic recognition result module 82 is used to perform semantic recognition on each initial answer by means of natural language semantic recognition to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
参考答案分类模块83,用于将语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;The reference answer classification module 83 is used to combine the semantic recognition results in pairs and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value of is greater than the preset similarity threshold, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M≤N, and N is a positive integer;
可信度值统计模块84,用于通过预设的一致性校验方式,确定每类参考答案的可信度值;The credibility value statistics module 84 is used to determine the credibility value of each type of reference answer through a preset consistency check method;
可信度值对比模块85,用于从所有的参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将最大可信度值与预设标准值对比,得到对比结果;The credibility value comparison module 85 is used to select the credibility value with the largest value from the credibility values of all the reference answers as the maximum credibility value, and compare the maximum credibility value with the preset standard value Compare, get the comparison result;
应答答案验证模块86,用于若对比结果为最大可信度值大于或等于预设标准值,则将最大可信度值对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。The answer answer verification module 86 is configured to, if the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is used as the target answer, and the answer answer corresponding to the target answer is confirmed as Verify the answer that passed the verification.
进一步的,语义识别结果模块82包括:Further, the semantic recognition result module 82 includes:
基础分词获取单元,用于通过预设的分词方式,对初始答案进行分词处理,得到初始答案中包含的基础分词。The basic word segmentation acquisition unit is used to perform word segmentation processing on the initial answer through a preset word segmentation method to obtain the basic word segmentation contained in the initial answer.
词向量获取单元,用于将基础分词转换为词向量,并通过聚类算法,对词向量进行聚类,得到每个词向量对应的聚类中心。The word vector acquisition unit is used to convert the basic word segmentation into word vectors, and cluster the word vectors through a clustering algorithm to obtain the cluster center corresponding to each word vector.
语义识别结果单元,用于获取每个词向量对应的聚类中心对应的预设语义,作为初始答案的语义识别结果。The semantic recognition result unit is used to obtain the preset semantics corresponding to the cluster center corresponding to each word vector as the semantic recognition result of the initial answer.
进一步的,基础分词获取单元包括:Further, the basic word segmentation acquisition unit includes:
分词序列获取子单元,用于对基础语句进行分词解析,得到K个分词序列。The word segmentation sequence acquisition subunit is used to perform word segmentation analysis on the basic sentence to obtain K word segmentation sequences.
分词序列发生概率子单元,用于针对每个分词序列,依据预设的训练语料库的词序列数据,计算每个分词序列的发生概率,得到K个分词序列的发生概率。The word segmentation sequence occurrence probability subunit is used to calculate the occurrence probability of each word segmentation sequence based on the word sequence data of the preset training corpus for each word segmentation sequence to obtain the occurrence probability of K word segmentation sequences.
目标分词序列确定子单元,用于从K个分词序列的发生概率中,选取达到预设概率阈值的发生概率对应的分词序列,作为目标分词序列,并将目标分词序列中的每个分词,作为初始答案中包含的基础分词。The target word segmentation sequence determination subunit is used to select the word segmentation sequence corresponding to the occurrence probability of the preset probability threshold from the occurrence probabilities of the K word segmentation sequences as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as The basic participle included in the initial answer.
进一步地,众包任务的答案验证装置还包括:Further, the answer verification device for crowdsourcing tasks also includes:
模拟答案获取模块,用于若对比结果为最大可信度值小于预设标准值,则获取目标任务,并将目标任务输入到预设模型中,通过预设模型得到模拟答案;The simulation answer obtaining module is used to obtain the target task if the comparison result is that the maximum credibility value is less than the preset standard value, and input the target task into the preset model to obtain the simulated answer through the preset model;
目标答案选取模块,用于针对每类参考答案,统计参考答案与模拟答案的相似度值,得到M个相似度值,并从M个相似度值中,选取数值最大的相似度值,将数值最大的相似度值对应的参考答案作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。The target answer selection module is used to calculate the similarity value between the reference answer and the simulated answer for each type of reference answer to obtain M similarity values, and from the M similarity values, select the highest similarity value, and calculate the value The reference answer corresponding to the largest similarity value is used as the target answer, and the answer answer corresponding to the target answer is confirmed as the answer answer that has passed the verification.
进一步地,众包任务的答案验证装置还包括:Further, the answer verification device for crowdsourcing tasks also includes:
预设权重设置模块,用于应答对象包括第一对象和第二对象,第一对象的回答答案为第一应答答案,第二对象的回答答案为第二应答答案,第一对象和第二对象均对应有预设权重,且第一对象的预设权重小于第二对象的预设权重;The preset weight setting module is used for answering objects including a first object and a second object, the answer of the first object is the first answer, the answer of the second object is the second answer, the first object and the second object Each corresponds to a preset weight, and the preset weight of the first object is less than the preset weight of the second object;
预设权重获取模块,用于若对比结果为最大可信度值小于预设标准值,则获取第一应答答案对应的第一预设权重,以及第二应答答案对应的第二预设权重;The preset weight obtaining module is configured to obtain the first preset weight corresponding to the first response answer and the second preset weight corresponding to the second response answer if the comparison result is that the maximum credibility value is less than the preset standard value;
可信度权重确定模块,用于根据第一应答答案和第二应答答案所属的参考答案类别、 第一预设权重和第二预设权重,确定每类参考答案的可信度权重;The credibility weight determination module is used to determine the credibility weight of each type of reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight, and the second preset weight;
加权可信度值确定模块,用于根据每类参考答案的可信度权重和预设的权重校验方式,确认每类参考答案的加权可信度值;The weighted credibility value determination module is used to confirm the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and the preset weight check method;
加权可信度值选取模块,用于选取数值最大的加权可信度值,作为目标加权可信度值,获取目标加权可信度值对应的参考答案,作为目标答案,并将目标答案对应的应答答案确认为验证通过的应答答案。The weighted credibility value selection module is used to select the weighted credibility value with the largest value as the target weighted credibility value, obtain the reference answer corresponding to the target weighted credibility value, and use it as the target answer. The answer is confirmed as the answer that passed the verification.
进一步地,加权可信度值选取模块包括:Further, the weighted credibility value selection module includes:
历史应答答案获取单元,用于获取第一对象和第二对象的历史应答答案;The historical response answer obtaining unit is used to obtain the historical response answers of the first object and the second object;
应答准确率验证单元,用于判断第一对象的历史应答答案中,验证通过的应答答案比例,得到第一对象的应答准确率,并判断第二对象的历史应答答案中,验证通过的应答答案比例,得到第二对象的应答准确率;The response accuracy rate verification unit is used to determine the proportion of the response answers that passed the verification in the historical response answers of the first subject, obtain the response accuracy rate of the first subject, and determine the response answers that passed the verification in the historical response answers of the second subject Ratio, get the response accuracy rate of the second object;
跟新对象单元,用于根据应答准确率与预设的分类阈值,对第一对象的预设权重和第二对象的预设权重更新,得到更新后的第一对象的预设权重和更新后的第二对象的预设权重。The following new object unit is used to update the preset weight of the first object and the preset weight of the second object according to the response accuracy rate and the preset classification threshold to obtain the updated preset weight and updated weight of the first object The preset weight of the second object.
为解决上述技术问题,本申请实施例还提供计算机设备。具体请参阅图9,图9为本实施例计算机设备基本结构框图。In order to solve the above technical problems, the embodiments of the present application also provide computer equipment. Please refer to FIG. 9 for details. FIG. 9 is a block diagram of the basic structure of the computer device in this embodiment.
计算机设备9包括通过系统总线相互通信连接存储器91、处理器92、网络接口93。需要指出的是,图中仅示出了具有三种组件存储器91、处理器92、网络接口93的计算机设备9,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。其中,本技术领域技术人员可以理解,这里的计算机设备是一种能够按照事先设定或存储的指令,自动进行数值计算和/或信息处理的设备,其硬件包括但不限于微处理器、专用集成电路(Application Specific Integrated Circuit,ASIC)、可编程门阵列(Field-Programmable Gate Array,FPGA)、数字处理器(Digital Signal Processor,DSP)、嵌入式设备等。The computer device 9 includes a memory 91, a processor 92, and a network interface 93 that communicate with each other through a system bus. It should be pointed out that the figure only shows a computer device 9 with three types of components: memory 91, processor 92, and network interface 93, but it should be understood that it is not required to implement all the components shown, and alternative implementations are possible. More or fewer components. Among them, those skilled in the art can understand that the computer device here is a device that can automatically perform numerical calculation and/or information processing in accordance with pre-set or stored instructions. Its hardware includes, but is not limited to, a microprocessor, a dedicated Integrated Circuit (Application Specific Integrated Circuit, ASIC), Programmable Gate Array (Field-Programmable Gate Array, FPGA), Digital Processor (Digital Signal Processor, DSP), embedded equipment, etc.
计算机设备可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。计算机设备可以与用户通过键盘、鼠标、遥控器、触摸板或声控设备等方式进行人机交互。Computer equipment can be computing equipment such as desktop computers, notebooks, palmtop computers, and cloud servers. The computer equipment can interact with the user through a keyboard, a mouse, a remote control, a touch panel, or a voice control device.
存储器91至少包括一种类型的可读存储介质,可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等。在一些实施例中,存储器91可以是计算机设备9的内部存储单元,例如该计算机设备9的硬盘或内存。在另一些实施例中,存储器91也可以是计算机设备9的外部存储设备,例如该计算机设备9上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。当然,存储器91还可以既包括计算机设备9的内部存储单元也包括其外部存储设备。本实施例中,存储器91通常用于存储安装于计算机设备9的操作系统和各类应用软件,例如一种众包任务的抽检方法的计算机可读指令等。此外,存储器91还可以用于暂时地存储已经输出或者将要输出的各类数据。The memory 91 includes at least one type of readable storage medium. The readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory ( SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disks, optical disks, etc. In some embodiments, the memory 91 may be an internal storage unit of the computer device 9, for example, the hard disk or memory of the computer device 9. In other embodiments, the memory 91 may also be an external storage device of the computer device 9, such as a plug-in hard disk, a smart media card (SMC), and a secure digital (Secure Digital, SMC) equipped on the computer device 9. SD) card, flash card (Flash Card), etc. Of course, the memory 91 may also include both the internal storage unit of the computer device 9 and its external storage device. In this embodiment, the memory 91 is generally used to store an operating system and various application software installed in the computer device 9, such as a computer-readable instruction of a crowdsourced task sampling method. In addition, the memory 91 can also be used to temporarily store various types of data that have been output or will be output.
处理器92在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器92通常用于控制计算机设备9的总体操作。本实施例中,处理器92用于运行存储器91中存储的流程代码或者处理数据,例如运行一种众包任务的抽检方法的计算机可读指令。In some embodiments, the processor 92 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips. The processor 92 is generally used to control the overall operation of the computer device 9. In this embodiment, the processor 92 is used to run the process code or process data stored in the memory 91, for example, run a computer-readable instruction of a crowdsourced task sampling method.
网络接口93可包括无线网络接口或有线网络接口,该网络接口93通常用于在计算机设备9与其他电子设备之间建立通信连接。The network interface 93 may include a wireless network interface or a wired network interface, and the network interface 93 is generally used to establish a communication connection between the computer device 9 and other electronic devices.
本申请还提供了另一种实施方式,即提供一种计算机可读存储介质,计算机可读存储介质存储有抽检流程,抽检流程可被至少一个处理器执行,以使至少一个处理器执行如上述的一种众包任务的抽检方法的步骤。所述计算机可读存储介质可以是非易失性,也可以 是易失性。The present application also provides another implementation manner, that is, a computer-readable storage medium is provided with a random inspection process stored in the computer-readable storage medium, and the random inspection process can be executed by at least one processor, so that the at least one processor executes as described above The steps of a sampling method for crowdsourcing tasks. The computer-readable storage medium may be non-volatile or volatile.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例的方法。Through the description of the above implementation manners, those skilled in the art can clearly understand that the above-mentioned embodiment method can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes a number of instructions to enable a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to execute the methods of the various embodiments of the present application.
显然,以上所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例,附图中给出了本申请的较佳实施例,但并不限制本申请的范围。本申请可以以许多不同的形式来实现,相反地,提供这些实施例的目的是使对本申请的公开内容的理解更加透彻全面。尽管参照前述实施例对本申请进行了详细的说明,对于本领域的技术人员来而言,其依然可以对前述各具体实施方式所记载的技术方案进行修改,或者对其中部分技术特征进行等效替换。凡是利用本申请说明书及附图内容所做的等效结构,直接或间接运用在其他相关的技术领域,均同理在本申请保护范围之内。Obviously, the embodiments described above are only a part of the embodiments of the present application, rather than all of the embodiments. The drawings show preferred embodiments of the present application, but do not limit the scope of the present application. The present application can be implemented in many different forms. On the contrary, the purpose of providing these examples is to make the understanding of the disclosure of the present application more thorough and comprehensive. Although this application has been described in detail with reference to the foregoing embodiments, for those skilled in the art, it is still possible for those skilled in the art to modify the technical solutions described in each of the foregoing specific embodiments, or equivalently replace some of the technical features. . All equivalent structures made by using the contents of the description and drawings of this application, directly or indirectly used in other related technical fields, are similarly within the protection scope of this application.

Claims (20)

  1. 一种众包任务的答案验证方法,包括:A method of answer verification for crowdsourcing tasks, including:
    从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;From all the answer answers obtained by the client, obtain each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
    通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;By means of natural language semantic recognition, semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
    将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M≤N, and N is a positive integer;
    通过预设的一致性校验方式,确定每类所述参考答案的可信度值;Determine the credibility value of each type of reference answer through a preset consistency check method;
    从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;From all the credibility values of the reference answers, select the credibility value with the largest numerical value as the maximum credibility value, and compare the maximum credibility value with a preset standard value to obtain a comparison result;
    若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
  2. 根据权利要求1所述众包任务的答案验证方法,其中,所述通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果包括:The answer verification method for crowdsourcing tasks according to claim 1, wherein said performing semantic recognition on each of said initial answers by means of natural language semantic recognition to obtain semantic recognition results of N initial answers comprises:
    通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词;Perform word segmentation processing on the initial answer through a preset word segmentation method to obtain the basic word segmentation contained in the initial answer;
    将所述基础分词转换为词向量,并通过聚类算法,对所述词向量进行聚类,得到每个所述词向量对应的聚类中心;Converting the basic word segmentation into word vectors, and clustering the word vectors through a clustering algorithm, to obtain a clustering center corresponding to each of the word vectors;
    获取每个所述词向量对应的聚类中心对应的预设语义,作为所述初始答案的语义识别结果。Obtain the preset semantics corresponding to the cluster centers corresponding to each of the word vectors as the semantic recognition result of the initial answer.
  3. 根据权利要求2所述众包任务的答案验证方法,其中,所述通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词包括:The answer verification method for crowdsourcing tasks according to claim 2, wherein said performing word segmentation processing on said initial answer through a preset word segmentation method to obtain the basic word segmentation contained in said initial answer comprises:
    对所述基础语句进行分词解析,得到K个分词序列;Perform word segmentation analysis on the basic sentence to obtain K word segmentation sequences;
    针对每个所述分词序列,依据所述预设的训练语料库的词序列数据,计算每个分词序列的发生概率,得到K个分词序列的发生概率;For each word segmentation sequence, calculate the occurrence probability of each word segmentation sequence according to the word sequence data of the preset training corpus to obtain the occurrence probability of K word segmentation sequences;
    从K个所述分词序列的发生概率中,选取达到预设概率阈值的发生概率对应的所述分词序列,作为目标分词序列,并将目标分词序列中的每个分词,作为所述初始答案中包含的基础分词。From the occurrence probabilities of the K word segmentation sequences, select the word segmentation sequence corresponding to the occurrence probability that reaches the preset probability threshold as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as the initial answer The basic participle included.
  4. 根据权利要求1所述众包任务的答案验证方法,其中,在所述从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果之后,所述众包任务的答案验证方法还包括:The method for verifying the answer of the crowdsourcing task according to claim 1, wherein, among the credibility values of all the reference answers, the credibility value with the largest numerical value is selected as the maximum credibility value, and After comparing the maximum credibility value with the preset standard value, and after obtaining the comparison result, the answer verification method of the crowdsourcing task further includes:
    若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述目标任务,并将所述目标任务输入到预设模型中,通过所述预设模型得到模拟答案;If the comparison result is that the maximum credibility value is less than the preset standard value, the target task is obtained, and the target task is input into a preset model, and a simulated answer is obtained through the preset model ;
    针对每类所述参考答案,统计所述参考答案与所述模拟答案的相似度值,得到M个相似度值,并从M个所述相似度值中,选取数值最大的相似度值,将所述数值最大的相似度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。For each type of the reference answer, count the similarity values between the reference answer and the simulated answer to obtain M similarity values, and select the highest similarity value from the M similarity values, and The reference answer corresponding to the similarity value with the largest numerical value is used as a target answer, and the answer answer corresponding to the target answer is confirmed as a verified answer answer.
  5. 根据权利要求1所述众包任务的答案验证方法,其中,在所述从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果之后,所述众包任务的答案验证方法还包括:The method for verifying the answer of the crowdsourcing task according to claim 1, wherein, among the credibility values of all the reference answers, the credibility value with the largest numerical value is selected as the maximum credibility value, and After comparing the maximum credibility value with the preset standard value, and after obtaining the comparison result, the answer verification method of the crowdsourcing task further includes:
    所述应答对象包括第一对象和第二对象,所述第一对象的回答答案为第一应答答案,所述第二对象的回答答案为第二应答答案,所述第一对象和所述第二对象均对应有预设权重,且所述第一对象的预设权重小于所述第二对象的预设权重;The answer object includes a first object and a second object, the answer of the first object is a first answer, the answer of the second object is a second answer, the first object and the first object Both objects have preset weights corresponding to them, and the preset weight of the first object is less than the preset weight of the second object;
    若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述第一应答答案对应的第一预设权重,以及所述第二应答答案对应的第二预设权重;If the comparison result is that the maximum credibility value is less than the preset standard value, a first preset weight corresponding to the first answer answer and a second preset weight corresponding to the second answer answer are obtained Weights;
    根据所述第一应答答案和第二应答答案所属的参考答案类别、所述第一预设权重和所述第二预设权重,确定每类所述参考答案的可信度权重;Determine the credibility weight of each type of the reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight and the second preset weight;
    根据每类所述参考答案的可信度权重和预设的权重校验方式,确定每类所述参考答案的加权可信度值;Determine the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and a preset weight check method;
    选取数值最大的所述加权可信度值,作为目标加权可信度值,获取所述目标加权可信度值对应的参考答案,作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。The weighted credibility value with the largest numerical value is selected as the target weighted credibility value, the reference answer corresponding to the target weighted credibility value is obtained as the target answer, and the response answer corresponding to the target answer is confirmed as Verify the answer that passed the verification.
  6. 根据权利要求5所述众包任务的答案验证方法,其中,所述选取数值最大的所述加权可信度值,作为目标加权可信度值,并将所述目标加权可信度值对应的参考答案,作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案之后,所述众包任务的答案验证方法还包括:The method for verifying the answer of the crowdsourcing task according to claim 5, wherein the weighted credibility value with the largest value is selected as the target weighted credibility value, and the target weighted credibility value corresponds to After the reference answer is used as the target answer, and after the answer answer corresponding to the target answer is confirmed as a verified answer answer, the answer verification method of the crowdsourcing task further includes:
    获取所述第一对象和所述第二对象的历史应答答案;Acquiring historical response answers of the first object and the second object;
    判断所述第一对象的历史应答答案中,验证通过的应答答案比例,得到所述第一对象的应答准确率,并判断所述第二对象的历史应答答案中,验证通过的应答答案比例,得到所述第二对象的应答准确率;Judging the proportion of the first subject’s historical response answers that have passed the verification, obtaining the response accuracy rate of the first subject, and judging the proportion of the second subject’s historical response answers that have passed the verification, Obtaining the response accuracy rate of the second object;
    根据所述应答准确率与预设的分类阈值,对所述第一对象的预设权重和所述第二对象的预设权重更新,得到更新后的第一对象的预设权重和更新后的第二对象的预设权重。According to the response accuracy rate and the preset classification threshold, the preset weight of the first object and the preset weight of the second object are updated to obtain the updated preset weight and updated weight of the first object The preset weight of the second object.
  7. 一种众包任务的答案验证装置,包括:An answer verification device for crowdsourcing tasks, including:
    初始答案获取模块,用于从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;The initial answer obtaining module is used to obtain each answer answer corresponding to the target task from all the answer answers obtained from the client as an initial answer, wherein each answer answer corresponds to a answer object;
    语义识别结果模块,用于通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;The semantic recognition result module is used to perform semantic recognition on each of the initial answers through natural language semantic recognition to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is positive Integer
    参考答案分类模块,用于将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;The reference answer classification module is used to combine the semantic recognition results in pairs, and use each combination as a set of results, and use the similarity calculation method to count the similarity values between the semantic recognition results in each set of results , If the obtained similarity value is greater than the preset similarity threshold, the two semantic recognition results in the group are regarded as the same type of reference answer, and M type of reference answer is obtained, where M≤N, and N is a positive integer ;
    可信度值统计模块,用于通过预设的一致性校验方式,确定每类所述参考答案的可信度值;The credibility value statistics module is used to determine the credibility value of each type of reference answer through a preset consistency check method;
    可信度值对比模块,用于从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;The credibility value comparison module is used to select the credibility value with the largest value from all the credibility values of the reference answers as the maximum credibility value, and compare the maximum credibility value with the predicted value. Set standard value comparison and get the comparison result;
    应答答案验证模块,用于若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。A response answer verification module, configured to, if the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, use the reference answer corresponding to the maximum credibility value as a target answer, and The answer answer corresponding to the target answer is confirmed as the answer answer passed the verification.
  8. 根据权利要求7所述众包任务的答案验证装置,其中,语义识别结果模块包括:The answer verification device for crowdsourcing tasks according to claim 7, wherein the semantic recognition result module comprises:
    基础分词获取单元,用于通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词;The basic word segmentation acquisition unit is configured to perform word segmentation processing on the initial answer through a preset word segmentation method to obtain the basic word segmentation contained in the initial answer;
    词向量获取单元,用于将所述基础分词转换为词向量,并通过聚类算法,对所述词向量进行聚类,得到每个所述词向量对应的聚类中心;A word vector acquiring unit, configured to convert the basic word segmentation into a word vector, and cluster the word vector through a clustering algorithm to obtain a cluster center corresponding to each word vector;
    语义识别结果单元,用于获取每个所述词向量对应的聚类中心对应的预设语义,作为所述初始答案的语义识别结果。The semantic recognition result unit is used to obtain the preset semantics corresponding to the cluster center corresponding to each of the word vectors as the semantic recognition result of the initial answer.
  9. 一种计算机设备,包括存储器、处理器,以及存储在所述存储器中,并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
    从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案, 其中,每个所述应答答案对应一个应答对象;From all the answer answers obtained by the client, obtain each answer answer corresponding to the target task as an initial answer, where each answer answer corresponds to a answer object;
    通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;By means of natural language semantic recognition, semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
    将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M≤N, and N is a positive integer;
    通过预设的一致性校验方式,确定每类所述参考答案的可信度值;Determine the credibility value of each type of reference answer through a preset consistency check method;
    从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;From all the credibility values of the reference answers, select the credibility value with the largest numerical value as the maximum credibility value, and compare the maximum credibility value with a preset standard value to obtain a comparison result;
    若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
  10. 根据权利要求9所述计算机设备,其中,所述通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果包括:9. The computer device according to claim 9, wherein said performing semantic recognition on each of said initial answers by means of natural language semantic recognition to obtain semantic recognition results of N initial answers comprises:
    通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词;Perform word segmentation processing on the initial answer through a preset word segmentation method to obtain the basic word segmentation contained in the initial answer;
    将所述基础分词转换为词向量,并通过聚类算法,对所述词向量进行聚类,得到每个所述词向量对应的聚类中心;Converting the basic word segmentation into word vectors, and clustering the word vectors through a clustering algorithm, to obtain a clustering center corresponding to each of the word vectors;
    获取每个所述词向量对应的聚类中心对应的预设语义,作为所述初始答案的语义识别结果。Obtain the preset semantics corresponding to the cluster centers corresponding to each of the word vectors as the semantic recognition result of the initial answer.
  11. 根据权利要求10所述计算机设备,其中,所述通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词包括:10. The computer device according to claim 10, wherein said performing word segmentation processing on said initial answer through a preset word segmentation method to obtain the basic word segmentation contained in said initial answer comprises:
    对所述基础语句进行分词解析,得到K个分词序列;Perform word segmentation analysis on the basic sentence to obtain K word segmentation sequences;
    针对每个所述分词序列,依据所述预设的训练语料库的词序列数据,计算每个分词序列的发生概率,得到K个分词序列的发生概率;For each word segmentation sequence, calculate the occurrence probability of each word segmentation sequence according to the word sequence data of the preset training corpus to obtain the occurrence probability of K word segmentation sequences;
    从K个所述分词序列的发生概率中,选取达到预设概率阈值的发生概率对应的所述分词序列,作为目标分词序列,并将目标分词序列中的每个分词,作为所述初始答案中包含的基础分词。From the occurrence probabilities of the K word segmentation sequences, select the word segmentation sequence corresponding to the occurrence probability that reaches the preset probability threshold as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as the initial answer The basic participle included.
  12. 根据权利要求9所述计算机设备,其中,在所述从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果之后,所述处理器执行所述计算机可读指令时还实现如下步骤:9. The computer device according to claim 9, wherein, among the reliability values of all the reference answers, the reliability value with the largest numerical value is selected as the maximum reliability value, and the maximum reliability value is selected as the maximum reliability value. The reliability value is compared with the preset standard value, and after the comparison result is obtained, the processor further implements the following steps when executing the computer-readable instruction:
    若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述目标任务,并将所述目标任务输入到预设模型中,通过所述预设模型得到模拟答案;If the comparison result is that the maximum credibility value is less than the preset standard value, the target task is obtained, and the target task is input into a preset model, and a simulated answer is obtained through the preset model ;
    针对每类所述参考答案,统计所述参考答案与所述模拟答案的相似度值,得到M个相似度值,并从M个所述相似度值中,选取数值最大的相似度值,将所述数值最大的相似度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。For each type of the reference answer, count the similarity values between the reference answer and the simulated answer to obtain M similarity values, and from the M similarity values, select the similarity value with the largest numerical value, and The reference answer corresponding to the similarity value with the largest numerical value is used as a target answer, and the answer answer corresponding to the target answer is confirmed as a verified answer answer.
  13. 根据权利要求9所述计算机设备,其中,在所述从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果之后,所述处理器执行所述计算机可读指令时还实现如下步骤:9. The computer device according to claim 9, wherein, among the reliability values of all the reference answers, the reliability value with the largest numerical value is selected as the maximum reliability value, and the maximum reliability value is selected as the maximum reliability value. The reliability value is compared with the preset standard value, and after the comparison result is obtained, the processor further implements the following steps when executing the computer-readable instruction:
    所述应答对象包括第一对象和第二对象,所述第一对象的回答答案为第一应答答案,所述第二对象的回答答案为第二应答答案,所述第一对象和所述第二对象均对应有预设权重,且所述第一对象的预设权重小于所述第二对象的预设权重;The answer object includes a first object and a second object, the answer of the first object is a first answer, the answer of the second object is a second answer, the first object and the first object Both objects have preset weights corresponding to them, and the preset weight of the first object is less than the preset weight of the second object;
    若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述第一应答答案对应的第一预设权重,以及所述第二应答答案对应的第二预设权重;If the comparison result is that the maximum credibility value is less than the preset standard value, a first preset weight corresponding to the first answer answer and a second preset weight corresponding to the second answer answer are obtained Weights;
    根据所述第一应答答案和第二应答答案所属的参考答案类别、所述第一预设权重和所述第二预设权重,确定每类所述参考答案的可信度权重;Determine the credibility weight of each type of the reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight and the second preset weight;
    根据每类所述参考答案的可信度权重和预设的权重校验方式,确定每类所述参考答案的加权可信度值;Determine the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and a preset weight check method;
    选取数值最大的所述加权可信度值,作为目标加权可信度值,获取所述目标加权可信度值对应的参考答案,作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。The weighted credibility value with the largest numerical value is selected as the target weighted credibility value, the reference answer corresponding to the target weighted credibility value is obtained as the target answer, and the response answer corresponding to the target answer is confirmed as Verify the answer that passed the verification.
  14. 根据权利要求13所述计算机设备,其中,所述选取数值最大的所述加权可信度值,作为目标加权可信度值,并将所述目标加权可信度值对应的参考答案,作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案之后,所述处理器执行所述计算机可读指令时还实现如下步骤:The computer device according to claim 13, wherein the selected weighted credibility value with the largest value is used as the target weighted credibility value, and the reference answer corresponding to the target weighted credibility value is used as the target Answer, and after confirming that the answer answer corresponding to the target answer is a verified answer answer, the processor further implements the following steps when executing the computer-readable instruction:
    获取所述第一对象和所述第二对象的历史应答答案;Acquiring historical response answers of the first object and the second object;
    判断所述第一对象的历史应答答案中,验证通过的应答答案比例,得到所述第一对象的应答准确率,并判断所述第二对象的历史应答答案中,验证通过的应答答案比例,得到所述第二对象的应答准确率;Judging the proportion of the first subject’s historical response answers that have passed the verification, obtaining the response accuracy rate of the first subject, and judging the proportion of the second subject’s historical response answers that have passed the verification, Obtaining the response accuracy rate of the second object;
    根据所述应答准确率与预设的分类阈值,对所述第一对象的预设权重和所述第二对象的预设权重更新,得到更新后的第一对象的预设权重和更新后的第二对象的预设权重。According to the response accuracy rate and the preset classification threshold, the preset weight of the first object and the preset weight of the second object are updated to obtain the updated preset weight and updated weight of the first object The preset weight of the second object.
  15. 一种计算机可读存储介质,所述计算机可读存储介质上存储有可执行代码,所述可执行代码被处理器执行时实现如下所述众包任务的答案验证方法的步骤:A computer-readable storage medium having executable code stored on the computer-readable storage medium, and when the executable code is executed by a processor, the steps of the method for verifying the answer of the crowdsourcing task as described below are realized:
    从客户端获取的所有应答答案中,获取目标任务对应的每个应答答案,作为初始答案,其中,每个所述应答答案对应一个应答对象;From all the answer answers obtained by the client, obtain each answer answer corresponding to the target task as an initial answer, wherein each answer answer corresponds to a answer object;
    通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果,其中,N为所述应答对象的数量,N为正整数;By means of natural language semantic recognition, semantic recognition is performed on each of the initial answers to obtain semantic recognition results of N initial answers, where N is the number of response objects, and N is a positive integer;
    将所述语义识别结果两两组合,并将每个组合作为一组结果,采用相似度计算的方式,统计每组结果中所述语义识别结果之间的相似度值,若得到的相似度值大于预设相似度阈值,则将所述组别中的两个语义识别结果作为同一类参考答案,得到M类参考答案,其中,M≤N,且N为正整数;Combine the semantic recognition results in pairs, and use each combination as a set of results. The similarity calculation method is used to calculate the similarity value between the semantic recognition results in each set of results. If the similarity value is obtained If it is greater than the preset similarity threshold, use the two semantic recognition results in the group as the same type of reference answer to obtain M type of reference answer, where M≤N, and N is a positive integer;
    通过预设的一致性校验方式,确定每类所述参考答案的可信度值;Determine the credibility value of each type of reference answer through a preset consistency check method;
    从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果;From all the credibility values of the reference answers, select the credibility value with the largest numerical value as the maximum credibility value, and compare the maximum credibility value with a preset standard value to obtain a comparison result;
    若所述对比结果为所述最大可信度值大于或等于所述预设标准值,则将所述最大可信度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。If the comparison result is that the maximum credibility value is greater than or equal to the preset standard value, the reference answer corresponding to the maximum credibility value is taken as the target answer, and the target answer is The answer is confirmed as the answer that passed the verification.
  16. 根据权利要求15所述计算机可读存储介质,其中,所述通过自然语言语义识别的方式,对每个所述初始答案进行语义识别,得到N个初始答案的语义识别结果包括:15. The computer-readable storage medium according to claim 15, wherein said performing semantic recognition on each of said initial answers by means of natural language semantic recognition to obtain semantic recognition results of N initial answers comprises:
    通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词;Perform word segmentation processing on the initial answer through a preset word segmentation method to obtain the basic word segmentation contained in the initial answer;
    将所述基础分词转换为词向量,并通过聚类算法,对所述词向量进行聚类,得到每个所述词向量对应的聚类中心;Converting the basic word segmentation into word vectors, and clustering the word vectors through a clustering algorithm, to obtain a clustering center corresponding to each of the word vectors;
    获取每个所述词向量对应的聚类中心对应的预设语义,作为所述初始答案的语义识别结果。Obtain the preset semantics corresponding to the cluster centers corresponding to each of the word vectors as the semantic recognition result of the initial answer.
  17. 根据权利要求16所述计算机可读存储介质,其中,所述通过预设的分词方式,对所述初始答案进行分词处理,得到所述初始答案中包含的基础分词包括:15. The computer-readable storage medium according to claim 16, wherein said performing word segmentation processing on said initial answer through a preset word segmentation method to obtain the basic word segmentation contained in said initial answer comprises:
    对所述基础语句进行分词解析,得到K个分词序列;Perform word segmentation analysis on the basic sentence to obtain K word segmentation sequences;
    针对每个所述分词序列,依据所述预设的训练语料库的词序列数据,计算每个分词序列的发生概率,得到K个分词序列的发生概率;For each word segmentation sequence, calculate the occurrence probability of each word segmentation sequence according to the word sequence data of the preset training corpus to obtain the occurrence probability of K word segmentation sequences;
    从K个所述分词序列的发生概率中,选取达到预设概率阈值的发生概率对应的所述分词序列,作为目标分词序列,并将目标分词序列中的每个分词,作为所述初始答案中包含的基础分词。From the occurrence probabilities of the K word segmentation sequences, select the word segmentation sequence corresponding to the occurrence probability that reaches the preset probability threshold as the target word segmentation sequence, and use each word segmentation in the target word segmentation sequence as the initial answer The basic participle included.
  18. 根据权利要求15所述计算机可读存储介质,其中,在所述从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果之后,所述计算机可读指令被所述处理器执行时,使得所述处理器执行如下步骤:The computer-readable storage medium according to claim 15, wherein, among the reliability values of all the reference answers, the reliability value with the largest numerical value is selected as the maximum reliability value, and all the reliability values The maximum credibility value is compared with the preset standard value, and after the comparison result is obtained, when the computer-readable instruction is executed by the processor, the processor is caused to perform the following steps:
    若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述目标任务,并将所述目标任务输入到预设模型中,通过所述预设模型得到模拟答案;If the comparison result is that the maximum credibility value is less than the preset standard value, the target task is obtained, and the target task is input into a preset model, and a simulated answer is obtained through the preset model ;
    针对每类所述参考答案,统计所述参考答案与所述模拟答案的相似度值,得到M个相似度值,并从M个所述相似度值中,选取数值最大的相似度值,将所述数值最大的相似度值对应的所述参考答案作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。For each type of the reference answer, count the similarity values between the reference answer and the simulated answer to obtain M similarity values, and select the highest similarity value from the M similarity values, and The reference answer corresponding to the similarity value with the largest numerical value is used as a target answer, and the answer answer corresponding to the target answer is confirmed as a verified answer answer.
  19. 根据权利要求15所述计算机可读存储介质,其中,在所述从所有的所述参考答案的可信度值中,选取数值最大的可信度值,作为最大可信度值,并将所述最大可信度值与预设标准值对比,得到对比结果之后,所述计算机可读指令被所述处理器执行时,使得所述处理器还执行如下步骤:The computer-readable storage medium according to claim 15, wherein, among the reliability values of all the reference answers, the reliability value with the largest numerical value is selected as the maximum reliability value, and all the reliability values The maximum credibility value is compared with the preset standard value, and after the comparison result is obtained, when the computer-readable instruction is executed by the processor, the processor further executes the following steps:
    所述应答对象包括第一对象和第二对象,所述第一对象的回答答案为第一应答答案,所述第二对象的回答答案为第二应答答案,所述第一对象和所述第二对象均对应有预设权重,且所述第一对象的预设权重小于所述第二对象的预设权重;The answer object includes a first object and a second object, the answer of the first object is a first answer, the answer of the second object is a second answer, the first object and the first object Both objects have preset weights corresponding to them, and the preset weight of the first object is less than the preset weight of the second object;
    若所述对比结果为所述最大可信度值小于所述预设标准值,则获取所述第一应答答案对应的第一预设权重,以及所述第二应答答案对应的第二预设权重;If the comparison result is that the maximum credibility value is less than the preset standard value, a first preset weight corresponding to the first answer answer and a second preset weight corresponding to the second answer answer are obtained Weights;
    根据所述第一应答答案和第二应答答案所属的参考答案类别、所述第一预设权重和所述第二预设权重,确定每类所述参考答案的可信度权重;Determine the credibility weight of each type of the reference answer according to the reference answer category to which the first answer answer and the second answer answer belong, the first preset weight and the second preset weight;
    根据每类所述参考答案的可信度权重和预设的权重校验方式,确定每类所述参考答案的加权可信度值;Determine the weighted credibility value of each type of reference answer according to the credibility weight of each type of reference answer and a preset weight check method;
    选取数值最大的所述加权可信度值,作为目标加权可信度值,获取所述目标加权可信度值对应的参考答案,作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案。The weighted credibility value with the largest numerical value is selected as the target weighted credibility value, the reference answer corresponding to the target weighted credibility value is obtained as the target answer, and the response answer corresponding to the target answer is confirmed as Verify the answer that passed the verification.
  20. 根据权利要求19所述计算机可读存储介质,其中,所述选取数值最大的所述加权可信度值,作为目标加权可信度值,并将所述目标加权可信度值对应的参考答案,作为目标答案,并将所述目标答案对应的应答答案确认为验证通过的应答答案之后,所述计算机可读指令被所述处理器执行时,使得所述处理器还执行如下步骤:18. The computer-readable storage medium of claim 19, wherein the weighted credibility value with the largest value is selected as a target weighted credibility value, and the reference answer corresponding to the target weighted credibility value , As the target answer, and after confirming that the answer answer corresponding to the target answer is a verified answer answer, when the computer-readable instruction is executed by the processor, the processor further executes the following steps:
    获取所述第一对象和所述第二对象的历史应答答案;Acquiring historical response answers of the first object and the second object;
    判断所述第一对象的历史应答答案中,验证通过的应答答案比例,得到所述第一对象的应答准确率,并判断所述第二对象的历史应答答案中,验证通过的应答答案比例,得到所述第二对象的应答准确率;Judging the proportion of the first subject’s historical response answers that have passed the verification, obtaining the response accuracy rate of the first subject, and judging the proportion of the second subject’s historical response answers that have passed the verification, Obtaining the response accuracy rate of the second object;
    根据所述应答准确率与预设的分类阈值,对所述第一对象的预设权重和所述第二对象的预设权重更新,得到更新后的第一对象的预设权重和更新后的第二对象的预设权重。According to the response accuracy rate and the preset classification threshold, the preset weight of the first object and the preset weight of the second object are updated to obtain the updated preset weight and updated weight of the first object The preset weight of the second object.
PCT/CN2020/117671 2020-03-02 2020-09-25 Answer verification method and apparatus for crowdsourcing task, computer device, and storage medium WO2021174814A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010135251.6 2020-03-02
CN202010135251.6A CN111460811A (en) 2020-03-02 2020-03-02 Crowdsourcing task answer verification method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2021174814A1 true WO2021174814A1 (en) 2021-09-10

Family

ID=71684147

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/117671 WO2021174814A1 (en) 2020-03-02 2020-09-25 Answer verification method and apparatus for crowdsourcing task, computer device, and storage medium

Country Status (2)

Country Link
CN (1) CN111460811A (en)
WO (1) WO2021174814A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114153961A (en) * 2022-02-07 2022-03-08 杭州远传新业科技有限公司 Knowledge graph-based question and answer method and system

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111460811A (en) * 2020-03-02 2020-07-28 平安科技(深圳)有限公司 Crowdsourcing task answer verification method and device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105117398A (en) * 2015-06-25 2015-12-02 扬州大学 Software development problem automatic answering method based on crowdsourcing
US10002177B1 (en) * 2013-09-16 2018-06-19 Amazon Technologies, Inc. Crowdsourced analysis of decontextualized data
CN109582581A (en) * 2018-11-30 2019-04-05 平安科技(深圳)有限公司 A kind of result based on crowdsourcing task determines method and relevant device
CN110363194A (en) * 2019-06-17 2019-10-22 深圳壹账通智能科技有限公司 Intelligently reading method, apparatus, equipment and storage medium based on NLP
CN111460811A (en) * 2020-03-02 2020-07-28 平安科技(深圳)有限公司 Crowdsourcing task answer verification method and device, computer equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10002177B1 (en) * 2013-09-16 2018-06-19 Amazon Technologies, Inc. Crowdsourced analysis of decontextualized data
CN105117398A (en) * 2015-06-25 2015-12-02 扬州大学 Software development problem automatic answering method based on crowdsourcing
CN109582581A (en) * 2018-11-30 2019-04-05 平安科技(深圳)有限公司 A kind of result based on crowdsourcing task determines method and relevant device
CN110363194A (en) * 2019-06-17 2019-10-22 深圳壹账通智能科技有限公司 Intelligently reading method, apparatus, equipment and storage medium based on NLP
CN111460811A (en) * 2020-03-02 2020-07-28 平安科技(深圳)有限公司 Crowdsourcing task answer verification method and device, computer equipment and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114153961A (en) * 2022-02-07 2022-03-08 杭州远传新业科技有限公司 Knowledge graph-based question and answer method and system

Also Published As

Publication number Publication date
CN111460811A (en) 2020-07-28

Similar Documents

Publication Publication Date Title
WO2020207431A1 (en) Document classification method, apparatus and device, and storage medium
WO2021051560A1 (en) Text classification method and apparatus, electronic device, and computer non-volatile readable storage medium
US20180075368A1 (en) System and Method of Advising Human Verification of Often-Confused Class Predictions
US20180068222A1 (en) System and Method of Advising Human Verification of Machine-Annotated Ground Truth - Low Entropy Focus
US20180068221A1 (en) System and Method of Advising Human Verification of Machine-Annotated Ground Truth - High Entropy Focus
WO2021073390A1 (en) Data screening method and apparatus, device and computer-readable storage medium
WO2021121198A1 (en) Semantic similarity-based entity relation extraction method and apparatus, device and medium
JP2022512065A (en) Image classification model training method, image processing method and equipment
CN111831826B (en) Training method, classification method and device of cross-domain text classification model
WO2021051586A1 (en) Interview answer text classification method, device, electronic apparatus and storage medium
WO2021139316A1 (en) Method and apparatus for establishing expression recognition model, and computer device and storage medium
WO2021174814A1 (en) Answer verification method and apparatus for crowdsourcing task, computer device, and storage medium
US20230004819A1 (en) Method and apparatus for training semantic retrieval network, electronic device and storage medium
WO2020143301A1 (en) Training sample validity detection method, computer device, and computer non-volatile storage medium
US20220121668A1 (en) Method for recommending document, electronic device and storage medium
US10762438B1 (en) Extracting questions and answers
CN107291774B (en) Error sample identification method and device
US20180005248A1 (en) Product, operating system and topic based
CN112084752A (en) Statement marking method, device, equipment and storage medium based on natural language
CN115730597A (en) Multi-level semantic intention recognition method and related equipment thereof
WO2019041528A1 (en) Method, electronic apparatus, and computer readable storage medium for determining polarity of news sentiment
WO2016200408A1 (en) Hybrid classification system
WO2021196935A1 (en) Data checking method and apparatus, electronic device, and storage medium
CN113505786A (en) Test question photographing and judging method and device and electronic equipment
CN110059743B (en) Method, apparatus and storage medium for determining a predicted reliability metric

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20922954

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20922954

Country of ref document: EP

Kind code of ref document: A1