WO2019056496A1 - Method for generating picture review probability interval and method for picture review determination - Google Patents

Method for generating picture review probability interval and method for picture review determination Download PDF

Info

Publication number
WO2019056496A1
WO2019056496A1 PCT/CN2017/108480 CN2017108480W WO2019056496A1 WO 2019056496 A1 WO2019056496 A1 WO 2019056496A1 CN 2017108480 W CN2017108480 W CN 2017108480W WO 2019056496 A1 WO2019056496 A1 WO 2019056496A1
Authority
WO
WIPO (PCT)
Prior art keywords
probability
picture
original
review
bad
Prior art date
Application number
PCT/CN2017/108480
Other languages
French (fr)
Chinese (zh)
Inventor
郑佳
赵骏
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019056496A1 publication Critical patent/WO2019056496A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a method, an apparatus, a storage medium, and a computer device for generating a picture review probability interval, and a picture retry method, apparatus, storage medium, and computer device.
  • the bad picture authentication system can feedback the single-category probability of identifying the pictures as one of normal, sexy and pornographic. Or the three probabilities corresponding to the identification of the picture as normal, sexy, and erotic.
  • a picture review probability interval generation method is provided.
  • a picture review method, apparatus, storage medium, and computer apparatus are provided.
  • a method for generating a picture review probability interval comprising: obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; and for each group of pictures having the same bad probability, the statistical review is determined as The first number of normal pictures, and the second number determined to be a bad picture; the picture review probability interval is regenerated according to the first quantity and the second quantity corresponding to each group of bad probabilities.
  • the method before the obtaining a bad probability of each picture in the picture set corresponding to the original review probability interval, includes: acquiring a preset picture set, where each picture is a bad picture initial probability; Calculating an initial probability corresponding to each picture, generating a bad probability that each picture in the preset picture set is a bad picture; setting an original review probability interval, and extracting a picture corresponding to the bad probability in the original review probability interval, forming A set of pictures corresponding to the original review probability interval.
  • the regenerating the picture review probability interval according to the first quantity and the second quantity corresponding to each set of the bad probability comprises: obtaining an original upper limit probability and an original lower limit probability of the original review probability interval; Calculating a two-category probability by using the original upper bound probability and the original lower bound probability, by which the sum of the original lower bound probability to the second quantity corresponding to the second classification probability can be obtained, and the second classification probability corresponds to the original upper limit probability a sum of quantities, the minimum of the sum of the two; calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval.
  • the calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval including: when the two classification probability is less than or equal to the preset probability, according to the formula Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper bound probability s 2 ' Or the new lower limit probability s 1 ', regenerating the picture review probability interval.
  • the method further includes: acquiring a picture system weight of the picture transmission system corresponding to the picture set; Calculating the original upper limit probability and the original lower limit probability according to the second classification probability, and regenerating the picture review probability interval, comprising: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight value Perform calculations to regenerate the image review probability interval.
  • a picture re-determination method comprising: obtaining a bad probability that a picture to be identified is a bad picture; determining whether the bad probability is within a preset picture review probability interval, and the picture review probability interval is according to the foregoing implementation
  • the picture review probability interval generation method described in the example is generated; if yes, determining that the picture to be identified requires a picture review.
  • a picture review probability interval generating device comprising: a bad probability obtaining module, configured to acquire a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; and a picture determination quantity statistics module, configured to: For each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number is determined as the bad picture; the picture review probability interval generation module is used to correspond to each group of bad probabilities. The first quantity and the second quantity regenerate the picture review probability interval.
  • a picture re-determination device comprising: a picture defect probability acquisition module to be used for obtaining a bad probability that a picture to be identified is a bad picture; a picture review judging module, configured to determine whether the bad probability is preset Within the picture review probability interval, the picture review probability interval is generated according to the picture review probability interval generation method described in the above embodiments; if yes, determining that the picture to be identified requires image review.
  • One or more non-transitory readable storage mediums storing computer readable instructions, when executed by one or more processors, cause the one or more processors to execute The following steps: obtaining the bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; for each picture having the same bad probability, the statistics are determined to be the first quantity of the normal picture, and the trial is determined. a second number of bad pictures; and regenerating a picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probabilities.
  • One or more non-transitory readable storage mediums storing computer readable instructions, when executed by one or more processors, cause the one or more processors to perform the steps of: acquiring The probability that the picture to be identified is a bad picture; determining whether the bad probability is within a preset picture review probability interval, the step of generating the picture review probability interval includes acquiring each picture set corresponding to the original review probability interval The picture is a bad probability of a bad picture; for each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number of bad pictures is determined as the bad picture; A quantity and a second quantity regenerate the picture review probability interval; and if so, determining that the picture to be identified requires a picture review.
  • a computer device comprising a memory and one or more processors, the memory storing computer readable instructions, the computer readable instructions being executed by the processor, causing the one or more processors to execute The following steps: obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; for each picture having the same bad probability, the statistics are determined to be the first number of normal pictures, and the trial is determined a second number of bad pictures; and regenerating a picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probabilities.
  • a computer device comprising a memory and one or more processors, the memory storing computer readable instructions, the computer readable instructions being executed by the processor, causing the one or more processors to execute The following steps: obtaining a bad probability that the picture to be identified is a bad picture; determining whether the bad probability is within a preset picture review probability interval, and the generating step of the picture review probability interval includes obtaining a corresponding interval corresponding to the original review probability interval The bad probability of each picture being a bad picture in the picture set; for each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number of bad pictures is determined as a bad picture; The first quantity and the second quantity corresponding to the probability regenerate the picture review probability interval; and if so, determining that the picture to be identified requires a picture review.
  • FIG. 1 is an application environment diagram of a method for generating a picture review probability interval in an embodiment
  • FIG. 2 is a flowchart of a method for generating a picture review probability interval in an embodiment
  • FIG. 3 is a flowchart of a method for generating a picture review probability interval in another embodiment
  • FIG. 4 is a flowchart of a method for generating a picture review probability interval in still another embodiment
  • Figure 5 is a flow chart of a method for resuming a picture in an embodiment
  • FIG. 6 is a structural block diagram of a picture review probability interval generating apparatus in an embodiment
  • FIG. 7 is a structural block diagram of a picture review probability interval generating apparatus in another embodiment.
  • Figure 8 is a block diagram showing the structure of a picture re-determining device in an embodiment
  • Figure 9 is a diagram showing the internal structure of a first computer device in one embodiment.
  • first, second and the like may be used to describe various elements, but these elements are not limited by these terms. These terms are only used to distinguish one element from another.
  • the first quantity is referred to as a second quantity, and similarly, the second quantity may be referred to as a first quantity. Both the first quantity and the second quantity are quantities, but they are not the same quantity.
  • the image review probability interval generation method provided by the embodiment of the present application can be applied to the application environment as shown in FIG. 1 .
  • the application environment includes a first computer device 102 and a second computer device 104.
  • the first computer device 102 and the second computer device 104 can be terminals or servers.
  • the terminal includes but is not limited to a mobile phone, a tablet computer, or a personal digital assistant or a wearable device.
  • the server may be an independent physical server or a server cluster composed of multiple physical servers.
  • the first computer device 102 and the second computer device can be the same type of computer device, or can be different types of computer devices.
  • the first computer device 102 can be used to execute the picture review probability interval generation method provided by the embodiment of the present application.
  • the second computer device 104 may be a terminal or server that stores a bad picture authentication system that can be used to identify bad pictures and output an initial probability of the picture.
  • the first computer device 102 can be connected to the second computer device 104.
  • the first computer device 102 can obtain data such as an initial probability that the second computer device 104 authenticates the picture, and the network connection includes, but is not limited to, a wireless network. Wired network, etc.
  • the first computer device 102 and the second computer device 104 can be the same computer device.
  • a picture review probability interval generation method is provided, which is applicable to the first computer device 102 in the application environment as shown in FIG. 1, the method comprising:
  • Step S202 Obtain a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture.
  • the original review probability interval may refer to a preset fixed review probability interval, or may be a previous adjusted review probability interval.
  • the review probability interval refers to the probability interval corresponding to the picture that needs to be manually reviewed in the process of bad picture identification.
  • the bad probability refers to the possible degree of the picture being a bad picture, and the bad probability is in the original review probability interval. According to the initial probability of the bad picture identification system output, the bad probability can be calculated according to a preset algorithm. Further, according to the original image of the original review probability interval, the probability of each picture being a bad picture may be statistically shaped. The image is poorly distributed in the original review probability interval. In the bad distribution of the image, the abscissa may be a bad probability, and the ordinate may be the number of pictures corresponding to the same bad probability.
  • Step S204 for each group of pictures having the same bad probability, the first number of the normal picture is determined as the normal picture, and the second number is determined as the bad picture.
  • the number of pictures in the original review probability interval is determined to be the number of normal pictures or bad pictures. Among them, in the picture with the same bad probability, the number of pictures that are determined to be normal pictures is the first number, and the number of pictures that are determined to be bad pictures is the second number.
  • Step S206 Regenerate the picture review probability interval according to the first quantity and the second quantity corresponding to each group of the bad probability.
  • the new picture review probability interval refers to the probability interval used to determine whether the picture is to be manually reviewed.
  • the bad probability of the newly received picture is in the new picture review probability interval calculated by the above method, the picture can be determined.
  • the first quantity distribution and the second quantity distribution are respectively formed according to the first quantity and the second quantity corresponding to the picture of each bad probability, the first quantity distribution and the second quantity corresponding to each group of the bad probability are further The quantity distribution regenerates the image review probability interval.
  • the first number of the normal picture is determined as the normal picture, and the second number is determined as the bad picture, according to the first quantity and the second number.
  • the quantity is calculated for the original review probability interval, and the image review probability interval is regenerated.
  • the method before step S202, the method further includes:
  • Step S302 Acquire a preset picture set, and each picture is an initial probability of a bad picture.
  • the preset picture set refers to a picture set formed by the picture system inputting a picture of the bad picture authentication system within a preset time period.
  • the initial probability is that the probability of each picture outputted by the image authentication system is different.
  • the output of the bad picture authentication system may be a single-category probability that the picture is identified as one of normal, sexy, and erotic, or the picture is identified as normal. Three pairs of sexy, erotic, and erotic Multiple category probabilities. Among them, the single category probability and the multi-category probability may be the confidence that the picture is the corresponding picture category.
  • the single category probability and the multi-class probability may range from 0 to 1.
  • Step S304 calculating an initial probability corresponding to each picture, and generating a bad probability that each picture in the preset picture set is a bad picture.
  • the probability of badness is the probability of having a uniform standard obtained by calculating according to the initial probability according to a preset algorithm. Uniform mapping of single category probabilities or multi-category probabilities to probabilities over the same reference range.
  • the preset picture set is obtained, and each picture is a picture category weight corresponding to an initial probability of the bad picture; and an initial corresponding to each picture according to the picture category weight corresponding to the initial probability of the same picture
  • the probability is calculated to generate a bad probability that each picture in the preset picture set is a bad picture.
  • the picture category weight refers to the weight corresponding to the picture category output by the bad picture authentication system.
  • the picture category weight corresponding to each picture is one of three categories: a preset normal weight, a preset sexy weight, and a preset porn weight; when the initial probability is a multi-category
  • the picture category weight value corresponding to each picture may be three, which are a preset normal weight, a preset sexy weight, and a preset porn weight.
  • Step S306 setting an original review probability interval, and extracting a picture corresponding to the bad probability in the original review probability interval, and forming a picture set corresponding to the original review probability interval.
  • the corresponding picture may be extracted according to the original review probability interval to form a picture set corresponding to the original review probability interval, and the picture set is a picture set that needs manual review. Further, the picture set corresponding to the original review probability interval may be extracted and stored in a database under the preset path, so that the picture reviewer can directly review the extracted picture.
  • the initial probability of the picture outputted by the bad picture identification system is uniformly mapped to the bad probability for comparison, so that the original review probability interval having the unified reference standard can be determined, and the picture set corresponding to the original review probability interval is determined.
  • the results of the manual review are used to calculate the original review probability interval, regenerate the image review probability interval, standardize the determination method of the image review probability interval, and improve the accuracy of the image review probability interval.
  • step S206 further comprising: acquiring an original review probability interval on the original The probability of the limit and the probability of the original lower limit; the probability of the second class is calculated according to the original upper limit probability and the original lower limit probability, and the sum of the second lower limit probability corresponding to the second classification probability and the second classification probability to the original upper limit can be obtained by the two classification probability The sum of the first quantity corresponding to the probability, the minimum of the sum of the two; the original upper limit probability and the original lower limit probability are calculated according to the second classification probability, and the picture review probability interval is regenerated.
  • the original upper bound probability and the original lower bound probability refer to the endpoints of the original review probability interval, and the value of the original upper bound probability is greater than the original lower bound probability.
  • the second classification probability refers to dividing the original review probability interval into two interval values, and the second classification probability is greater than the original lower limit threshold and less than the original upper limit threshold.
  • s 1 is the original lower bound probability
  • s 2 is the original upper bound probability
  • M i represents the second quantity with probability i
  • N i represents the first quantity with probability i, by taking versus The minimum of the sum of the two can be obtained as the second classification probability t.
  • the original upper bound probability and the original lower bound probability of the original review probability interval may be calculated according to a preset algorithm that is the same as the calculated bad probability. Since each bad picture identification system has a missed detection rate or a false detection rate, each bad picture authentication system has its own initial probability interval that needs to be reviewed, and the initial upper limit probability and the initial lower limit probability of the initial probability interval can be extracted. The calculation is performed to obtain the original upper limit probability and the original lower limit probability. The original upper bound probability and the original lower bound probability of the original review probability interval may also be directly determined according to the previously adjusted picture review probability interval.
  • the second quantity is distributed from the original lower limit probability to the integral of the second classification probability, and the first quantity is distributed to the integral of the second classification probability to the original upper limit probability.
  • the sum of the integrals of the two takes the minimum value to calculate the two-class probability.
  • the original upper limit probability and the original lower limit probability are calculated according to presets or simultaneously, and the picture review probability interval is regenerated.
  • the method further includes: acquiring the picture transmission system corresponding to the picture set Image system weight.
  • Step S206 includes: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight, and regenerating the picture review probability interval.
  • the picture system weight is one or more weights corresponding to the picture system.
  • the picture system includes, but is not limited to, an entertainment system, such as a blind date forum, a social platform, or the like, or a system in the financial field, such as a banking system, a securities system, and the like.
  • the picture system weight can be assigned according to the attributes or parameters of the picture system.
  • the value can be assigned according to the type of the picture, wherein the picture type includes but is not limited to the invoice document class, the customer document class, the voucher file class, the work circulation class, the customer communication class, etc., according to the degree of internal to external process business from low to High assignment; can also be assigned according to the frequency of image access, wherein the access frequency can be divided into external access frequency and download frequency, etc., according to the value of the frequency from low to high assignment; can also be assigned according to the importance of the image, for example, according to The degree of relevance of the image to the transaction is assigned, and can be assigned from low to high according to the degree of relevance of the image to the transaction voucher, promotion and customer management.
  • the original upper limit probability and the original lower limit probability may be calculated according to one or more system weights and two classification probabilities of the picture system, and the picture review probability interval is regenerated.
  • the sum of multiple system weights can also be used as the overall system weight for the picture system.
  • the weight of the integrated system is in the preset weight interval, the corresponding upper limit probability and the original lower limit probability are calculated separately or simultaneously.
  • the weights may be determined q 1, q 2 weights may be determined according to the picture access frequency, may be determined according to weight q 3 importance image based on the image type.
  • q 1 can take values from 1 to 10
  • q 2 takes values from 1 to 20
  • q 3 takes values from 1 to 10.
  • a corresponding calculation formula is used to regenerate the original upper limit probability and the original lower limit probability.
  • s 1 is the original lower bound probability
  • s 2 is the original upper bound probability
  • t is the second classification probability
  • s 1 ' is the new lower bound probability
  • s 2 ' is the new upper bound probability
  • the type of picture may be determined based on keywords in the URL (resource uniform locator) of the picture. For example, if the keyword related to the URL of a picture is goods, payment, payment, etc., the picture can be tagged with a document.
  • a database of keywords can be preset to generate a mapping relationship between keywords and types.
  • the keyword included in the URL is extracted, the keyword is matched with the preset keyword in the database, and the image type having a mapping relationship with the keyword that matches the matching is obtained as the image type corresponding to the image.
  • Corresponding image review probability intervals are generated for different picture system situations, so that the generated picture review probability interval is more in line with actual needs, and the accuracy of the picture review interval is improved.
  • corresponding picture system weights may also be available for different users.
  • the image system weight can be assigned according to the user's historical map data. For example, the value may be assigned according to the degree of the badness of the image sent in the user history record; the value may be assigned according to the image access frequency of the user history map; or may be performed according to the degree of badness of other forms of content such as text sent in the user history record. Assignment. For example, a user who has uploaded a pornographic picture and whose picture access frequency is greater than a preset threshold may be given a relatively high weight.
  • the poor according to the degree of user history send images to determine the weight c 1, may determine the weight value c based on the image user historical development of FIG access frequency 2, according to the user history issued text, etc.
  • the degree of badness of other forms of content determines the weight c 3 .
  • May be c 1, c 2, 3 c, and the weighted sum is determined integrated system weight C c 1 + c 2 + c 3 according to, and may define a c 1, c 2, c 3 range.
  • c 1 may take values from 1 to 10
  • c 2 may take values from 1 to 20
  • c 3 may take values from 1 to 10.
  • a corresponding calculation formula is used to regenerate the original upper limit probability and the original lower limit probability. For example, when the integrated system weight C is less than or equal to 20, adjust When the integrated system weight C is greater than 20, adjust Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, s 1 ' is the new lower bound probability, and s 2 ' is the new upper bound probability. Corresponding image review probability intervals are generated for the history records of different users, so that the generated image review probability interval is more in line with actual needs, and the accuracy of the image review interval is improved.
  • another method for generating a picture review probability interval includes:
  • Step S402 Acquire an initial probability that each picture is a bad picture, and calculate an initial probability corresponding to each picture to generate a bad probability that each picture in the preset picture set is a bad picture.
  • the initial probability is that the probability of each picture outputted by the image authentication system is different.
  • the output of the bad picture authentication system may be a single-category probability that the picture is identified as one of normal, sexy, and erotic, or the picture is identified as normal. Three multi-category probabilities corresponding to the three categories of sexy, erotic, and erotic.
  • a mapping formula of three single-category probabilities of normal, sexy, and erotic may be preset, respectively, to calculate a probability of generating a bad.
  • the mapping formula can be a bad probability.
  • the mapping formula can be a bad probability.
  • the mapping formula can be a bad probability
  • the mapping formula can be a bad probability
  • the probability of bad y corresponding to probability B is to The probability of bad y corresponding to probability C is Up to 1, so the probability of failure y ranges from 0 to 1.
  • the normal probability, the sexy probability, and the erotic probability corresponding to each picture may be integrated, and the probability of generating the defect is calculated according to a unified mapping formula.
  • the mapping formula may be preset as a bad probability.
  • Step S404 setting an original review probability interval, and extracting a picture corresponding to the bad probability in the original review probability interval, and forming a picture set corresponding to the original review probability interval.
  • the original review probability interval can be determined according to the mapping formula for determining the probability of failure.
  • the original review probability interval is in the value interval of the bad probability corresponding to the preset picture set.
  • Step S406 obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture.
  • Step S408 for each group of pictures having the same bad probability, the first number of the normal picture is determined as the normal picture, and the second number is determined as the bad picture.
  • the picture set corresponding to the original review probability interval is used as a picture set that needs to be manually reviewed, so that the picture reviewer determines the picture in the picture set as a normal picture or a bad picture.
  • a corresponding image review interface may be provided, and the image in the image set corresponding to the original review probability interval is displayed on the interface, and the image reviewer may select a normal image or a bad image according to the judgment. . Further, the corresponding picture may be displayed according to each bad probability. When the click operation of the picture reviewing personnel is detected, the picture may be marked with a corresponding label, and the number of labels of the unified type is counted as the first quantity or the second quantity.
  • the picture reviewer may click on the selected picture as a bad picture and mark it as "1", and the picture that has not been clicked as the normal picture is marked as "0", and the picture corresponding to the bad probability is performed.
  • the number of pictures marked as “0” under the probability of failure may be counted as the first quantity, and the number of pictures marked as “0” as the second quantity.
  • Step S410 Acquire an original upper bound probability and an original lower bound probability of the original review probability interval, and calculate a second classification probability according to the original upper bound probability and the original lower bound probability.
  • the sum of the second lower quantity probability to the second quantity corresponding to the second classification probability, the sum of the first quantity corresponding to the second classification probability to the original upper limit probability, and the sum of the sums of the two can be obtained by the two classification probability.
  • the original upper limit probability s 2 and the original lower limit probability s 1 corresponding to the original review probability interval are obtained, wherein the original upper limit probability s 2 and the original lower limit probability s 1 may be determined according to the missed detection rate and the false detection rate of the bad picture authentication system.
  • the first quantity distribution P 1 (x) may be generated according to the first quantity corresponding to each bad probability.
  • the second quantity distribution P may be generated according to the second quantity corresponding to each bad probability. 2 (x).
  • the formula Calculating generates a two-class probability t, wherein the two-class probability t is such that the integral of the second quantity distribution P 2 (x) between the original lower-order probability s 1 and the second-class probability t, and the first quantity distribution P 1 (x)
  • the integral of the classification probability t to the original upper limit probability s 2 and the sum of the integrals of the two takes the minimum value.
  • step S412 it is determined whether the two classification probability is greater than a preset probability.
  • step S314 When the two classification probability is less than or equal to the preset probability, step S314 is performed; when the second classification probability is greater than the preset probability, step S316 is performed.
  • step S414 the calculation is performed according to the preset first formula, and the upper limit probability is regenerated.
  • the first formula can be preset to Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability, and s 2 ' is the new upper bound probability.
  • step S416 the calculation is performed according to the preset second formula, and the lower limit probability is regenerated.
  • the second formula can be preset to Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability, and s 1 ' is the new lower bound probability.
  • Step S418 Regenerate the picture review probability interval according to the regenerated upper limit probability or the regenerated lower limit probability.
  • the picture review probability interval is regenerated, and the picture review probability interval is s 1 ' to s 2 '.
  • the initial probability of the picture outputted by the bad picture identification system is uniformly mapped to the bad probability for comparison, and the result of the manual review after the picture corresponding to the original review probability interval is counted as the normal picture.
  • a quantity, and a second quantity determined as a bad picture by the re-trial calculates the original review probability interval according to the first quantity and the second quantity, and regenerates the picture review probability interval.
  • the combined two-class probability is calculated by combining the results of the manual review. According to the size of the second classification probability and the preset probability, the adjustment of the review probability interval is divided into two cases, which makes the newly generated picture review probability interval range more accurate, reduces the workload of the picture review, and improves the efficiency of the bad picture review.
  • a picture retrieving method comprising:
  • Step S502 obtaining a bad probability that the picture to be identified is a bad picture.
  • the picture to be identified is a picture that needs to be judged whether or not an image review is to be performed.
  • the bad probability of obtaining the picture to be identified as a bad picture is consistent with the calculation method of calculating the probability of each picture in the preset picture set.
  • Step S504 determining whether the probability of failure is within a preset picture review probability interval.
  • the method for generating the image review probability interval includes: obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; for each group of pictures having the same bad probability, the statistically retried is determined as the first picture of the normal picture. The quantity, and the second number determined to be a bad picture; the picture review probability interval is regenerated according to the first quantity and the second quantity corresponding to each group of bad probabilities. If yes, go to step S506; if no, go to step S508.
  • step S506 it is determined that the picture to be identified needs to be reviewed.
  • step S508 it is determined that the picture to be identified does not need to be reviewed.
  • the method for generating a picture review probability interval further includes: acquiring a preset picture set in each of the picture sets corresponding to the original review probability interval, each picture being a bad picture, and each picture is The initial probability of a bad picture; the initial probability corresponding to each picture is calculated, and the bad probability that each picture in the preset picture set is a bad picture is generated; The probability interval is reviewed, and the picture corresponding to the bad probability in the original review probability interval is extracted to form a picture set corresponding to the original review probability interval.
  • the image review probability interval is regenerated according to the first quantity and the second quantity corresponding to each set of the bad probability, including: obtaining the original upper limit probability and the original lower limit probability of the original review probability interval; according to the original upper limit probability and the original lower limit
  • the probability of calculating the two-category probability, the sum of the second quantity corresponding to the second-class probability, and the sum of the first quantity corresponding to the two-class probability to the original upper-level probability, and the sum of the two can be obtained by the two-class probability Minimum value; the original upper limit probability and the original lower limit probability are calculated according to the two-class probability, and the picture review probability interval is regenerated.
  • the original upper bound probability and the original lower bound probability are calculated according to the second classification probability, and the image review probability interval is regenerated, including: when the second classification probability is less than or equal to the preset probability, according to the formula Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper limit probability s 2 ' or new The lower bound probability s 1 ', regenerates the picture review probability interval.
  • the method further includes: acquiring a picture system weight of the picture transmission system corresponding to the picture set; The original upper limit probability and the original lower limit probability are calculated, and the image review probability interval is regenerated, including: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight, and regenerating the picture review probability interval.
  • the initial probability of the picture outputted by the bad picture identification system is uniformly mapped to the bad probability for comparison, and the picture corresponding to the original review probability interval is manually reviewed.
  • the statistics are determined to be the first number of normal pictures, and the second number determined to be a bad picture, and the original review probability interval is calculated according to the first quantity and the second quantity, and the picture review probability interval is regenerated. .
  • the review probability interval is adjusted, so that the newly generated image review probability interval range is more accurate, reducing the workload of image review, thereby improving the efficiency of poor image review.
  • a picture review probability interval generation device 600 includes: a bad probability acquisition module 602, configured to acquire each picture in the picture set corresponding to the original review probability interval. The bad probability of the bad picture; the picture determination quantity statistics module 604 is configured to, for each group of pictures having the same bad probability, count the first number of the normal picture to be determined as the normal picture, and the second quantity determined as the bad picture by the repeated trial; The picture review probability interval generation module 606 is configured to regenerate the picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probability.
  • another picture review probability interval generating apparatus 700 is provided.
  • the apparatus further includes an original review picture set forming module 601, configured to acquire a preset picture set, and each picture is bad.
  • the initial probability of the picture; the initial probability corresponding to each picture is calculated, and the bad probability of each picture in the preset picture set as a bad picture is generated; the original review probability interval is set, and the bad probability corresponding to the original review probability interval is extracted.
  • the picture forms a set of pictures corresponding to the original review probability interval.
  • the picture review probability interval generation module 606 is further configured to obtain an original upper limit probability and an original lower limit probability of the original review probability interval; calculate a second classification probability according to the original upper limit probability and the original lower limit probability, and obtain the second classification probability by using the second classification probability The sum of the original lower bound probability to the second quantity corresponding to the second classification probability, the sum of the first quantity corresponding to the second classification probability to the original upper limit probability, and the sum of the sums; the original upper bound probability and the original according to the second classification probability The lower bound probability is calculated and the image review probability interval is regenerated.
  • the picture review probability interval generation module 606 is further configured to: when the two classification probability is less than or equal to the preset probability, according to the formula Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper limit probability s 2 ' or new The lower bound probability s 1 ', regenerates the picture review probability interval.
  • the picture review probability interval generation module 606 is further configured to acquire a picture system weight of the picture transmission system corresponding to the picture set; and calculate the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight value. , regenerate the image review probability interval.
  • a picture re-determination device 800 is provided.
  • the device includes: a picture defect probability acquisition module 802 to be identified, which is used to obtain a bad probability that a picture to be identified is a bad picture;
  • the determining module 804 is configured to determine whether the bad probability is within a preset image review probability interval, and the method for generating the image review probability interval includes: obtaining a bad probability that each image in the image set corresponding to the original review probability interval is a bad image
  • the statistics are determined to be the first number of normal pictures, and the second number determined to be a bad picture; the first quantity and the second quantity corresponding to each group of bad probabilities
  • the image review probability interval is regenerated; if it is, then the image to be identified needs to be reviewed.
  • the method before acquiring the bad probability that each picture is a bad picture in the picture set corresponding to the original review probability interval, the method includes: acquiring a preset picture set, each picture being an initial probability of a bad picture; The initial probability corresponding to the picture is calculated, and the bad probability of each picture in the preset picture set as a bad picture is generated; the original review probability interval is set, and the picture corresponding to the bad probability in the original review probability interval is extracted, and the original review probability interval is formed. Corresponding picture set.
  • the image review probability interval is regenerated according to the first quantity and the second quantity corresponding to each set of the bad probability, including: obtaining the original upper limit probability and the original lower limit probability of the original review probability interval; according to the original upper limit probability and the original lower limit Probability calculates the probability of two classifications, passing the two points
  • the class probability can obtain the sum of the original lower bound probability to the second quantity corresponding to the second classification probability, the sum of the first quantity corresponding to the second classification probability to the original upper limit probability, and the sum of the sums; the original according to the second classification probability
  • the upper limit probability and the original lower limit probability are calculated, and the picture review probability interval is regenerated.
  • the original upper bound probability and the original lower bound probability are calculated according to the second classification probability, and the image review probability interval is regenerated, including: when the second classification probability is less than or equal to the preset probability, according to the formula Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper limit probability s 2 ' or new The lower bound probability s 1 ', regenerates the picture review probability interval.
  • the method further includes: acquiring a picture system weight of the picture transmission system corresponding to the picture set; The original upper limit probability and the original lower limit probability are calculated, and the image review probability interval is regenerated, including: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight, and regenerating the picture review probability interval.
  • the above picture review probability interval generating means and picture retrieving means may be implemented in the form of a computer readable instruction which may be run on the first computer device as in FIG.
  • Each of the modules in the picture review probability interval generation device and the picture review device may be implemented in whole or in part by software, hardware, and combinations thereof.
  • Each of the above modules may be embedded in or independent of the memory of the computer device in hardware, or may be stored in the memory of the computer device in software form, so that the processor invokes the operations corresponding to the above modules.
  • the processor can It is a central processing unit (CPU), a microprocessor, a single chip microcomputer, and the like.
  • one or more non-transitory readable storage mediums storing computer readable instructions that, when executed by one or more processors, cause one or more processors to execute The steps of the picture review probability interval generation method in each of the above embodiments.
  • one or more non-transitory readable storage media having computer readable instructions that, when executed by one or more processors, cause one or more processors to execute The steps of the picture review method in the various embodiments described above.
  • a computer apparatus comprising a memory and one or more processors having stored therein computer readable instructions that, when executed by a processor, cause one or more processors to execute The steps of the picture review probability interval generation method in each of the above embodiments.
  • a computer apparatus comprising a memory and one or more processors having stored therein computer readable instructions that, when executed by a processor, cause one or more processors to execute The steps of the picture review method in the various embodiments described above.
  • the computer device described above may include, but is not limited to, a stand-alone physical server, or a server cluster of multiple physical servers.
  • FIG. 9 is a schematic diagram showing the internal structure of a computer device in an embodiment.
  • the computer device is applicable to the first computer device 102 in the application environment of FIG.
  • the computer device includes a processor coupled through a system bus, a non-volatile storage medium, an internal memory, and a network interface.
  • the processor of the computer device is used to provide computing and control capabilities to support the operation of the entire computer device.
  • a non-volatile storage medium of a computer device stores an operating system, a database, and computer readable instructions.
  • the database stores data related to a picture review probability interval generation method and a picture re-determination method provided by the foregoing various embodiments, for example, an initial probability corresponding to each picture set in a preset picture set may be stored.
  • the computer readable instructions are executable by the processor for implementing a picture review probability interval generation method and a picture retry method provided by the above various embodiments.
  • the internal memory in the computer device provides a cached operating environment for operating systems, databases, and computer readable instructions in a non-volatile storage medium.
  • the network interface can be an Ethernet card or a wireless network card, etc., for external use.
  • the terminal or server communicates, such as communicating with the second computer device 104 in the application environment of FIG. 1, to obtain an initial probability that each picture in the preset picture set is a bad picture.
  • FIG. 9 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation of the computer device to which the solution of the present application is applied.
  • the computer device may include more or fewer components than those in the figures, or some components may be combined, or have different component arrangements.
  • the computer device in the figure may also include a display screen or the like.
  • the readable storage medium which when executed, may include the flow of an embodiment of the methods as described above.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or the like.

Abstract

A method for generating a picture review probability interval, the method comprising: acquiring a probability of defectiveness by which each picture in a picture set corresponding to an original review probability interval is a defective picture; for each group of pictures having the same probability of defectiveness, counting a first number for pictures that are determined by means of review to be normal pictures and a second number for pictures that are determined by means of review to be defective pictures; newly generating a picture review probability interval according to a first number and second number corresponding to the probability of defectiveness of each group.

Description

图片复审概率区间生成方法及图片复审判定方法Image review probability interval generation method and picture complex trial method
本申请要求于2017年9月25日提交中国专利局,申请号为2017108769171,发明名称为“图片复审概率区间生成方法及图片复审判定方法”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application is required to be submitted to the Chinese Patent Office on September 25, 2017, the application number is 2017108769171, and the invention name is the priority of the Chinese patent application of the "picture review probability interval generation method and picture re-determination method", the entire contents of which are incorporated by reference. In this application.
技术领域Technical field
本申请涉及通信技术领域,特别是涉及一种图片复审概率区间生成方法、装置、存储介质和计算机设备及一种图片复审判定方法、装置、存储介质和计算机设备。The present invention relates to the field of communications technologies, and in particular, to a method, an apparatus, a storage medium, and a computer device for generating a picture review probability interval, and a picture retry method, apparatus, storage medium, and computer device.
背景技术Background technique
随着通信技术的发展,为了便于用户的交流沟通,越来越多的网络平台,如社交平台、金融平台等,都为用户提供图片上传下载的功能。这也使得不良图片的传播越来越容易。为了防止不良图片的传播,许多网络平台通过将传输的图片接入不良图片鉴别系统进行鉴别,不良图片鉴别系统反馈的可以是将图片鉴定为正常、性感、色情三类中一种的单类别概率,或者是将图片鉴定为正常、性感、色情三类时分别对应的三个概率。With the development of communication technology, in order to facilitate user communication, more and more network platforms, such as social platforms and financial platforms, provide users with the function of uploading and downloading pictures. This also makes the spread of bad pictures easier and easier. In order to prevent the spread of bad pictures, many network platforms identify the transmitted pictures by accessing the bad picture authentication system. The bad picture authentication system can feedback the single-category probability of identifying the pictures as one of normal, sexy and pornographic. Or the three probabilities corresponding to the identification of the picture as normal, sexy, and erotic.
然而,由于目前的不良图片鉴别系统存在漏检率或误检率,经过不良图片鉴别系统鉴定后的图片中仍有一部分图片需要进行人工复审来确认。在不良图片鉴别系统识别率有限的情况下,不良图片鉴别系统生成的图片复审概率区间比较大,因此人工复审的工作量也相应较大,需要耗费大量不必要的时间,从而导致不良图片的复审效率较低。However, due to the current flaw detection rate or false detection rate of the bad picture authentication system, some pictures in the pictures identified by the bad picture authentication system still need to be manually reviewed to confirm. In the case that the recognition rate of the bad picture identification system is limited, the picture review probability interval generated by the bad picture authentication system is relatively large, so the workload of the manual review is also relatively large, and it takes a lot of unnecessary time, which leads to the review of the bad picture. Less efficient.
发明内容Summary of the invention
根据本申请的各种实施例,提供一种图片复审概率区间生成方法、装置、存储介质和计算机设备。 According to various embodiments of the present application, a picture review probability interval generation method, apparatus, storage medium, and computer device are provided.
根据本申请的各种实施例,提供一种图片复审判定方法、装置、存储介质和计算机设备。In accordance with various embodiments of the present application, a picture review method, apparatus, storage medium, and computer apparatus are provided.
一种图片复审概率区间生成方法,所述方法包括:获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。A method for generating a picture review probability interval, the method comprising: obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; and for each group of pictures having the same bad probability, the statistical review is determined as The first number of normal pictures, and the second number determined to be a bad picture; the picture review probability interval is regenerated according to the first quantity and the second quantity corresponding to each group of bad probabilities.
在其中一个实施例中,在所述获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之前,包括:获取预设图片集中,每个图片的为不良图片初始概率;对每个图片对应的初始概率进行计算,生成所述预设图片集中每个图片为不良图片的不良概率;设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与所述原始复审概率区间相对应的图片集。In one embodiment, before the obtaining a bad probability of each picture in the picture set corresponding to the original review probability interval, the method includes: acquiring a preset picture set, where each picture is a bad picture initial probability; Calculating an initial probability corresponding to each picture, generating a bad probability that each picture in the preset picture set is a bad picture; setting an original review probability interval, and extracting a picture corresponding to the bad probability in the original review probability interval, forming A set of pictures corresponding to the original review probability interval.
在其中一个实施例中,所述根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间,包括:获取所述原始复审概率区间的原始上限概率和原始下限概率;根据所述原始上限概率和原始下限概率计算出二分类概率,通过所述二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In one embodiment, the regenerating the picture review probability interval according to the first quantity and the second quantity corresponding to each set of the bad probability comprises: obtaining an original upper limit probability and an original lower limit probability of the original review probability interval; Calculating a two-category probability by using the original upper bound probability and the original lower bound probability, by which the sum of the original lower bound probability to the second quantity corresponding to the second classification probability can be obtained, and the second classification probability corresponds to the original upper limit probability a sum of quantities, the minimum of the sum of the two; calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval.
在其中一个实施例中,所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:当二分类概率小于等于预设概率时,根据公式
Figure PCTCN2017108480-appb-000001
进行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;当二分类概率大于预设概率时,根据公式
Figure PCTCN2017108480-appb-000002
进 行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;根据所述新的上限概率s2’或所述新的下限概率s1’,重新生成图片复审概率区间。
In one embodiment, the calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval, including: when the two classification probability is less than or equal to the preset probability, according to the formula
Figure PCTCN2017108480-appb-000001
Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula
Figure PCTCN2017108480-appb-000002
Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper bound probability s 2 ' Or the new lower limit probability s 1 ', regenerating the picture review probability interval.
在其中一个实施例中,所述获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之后,还包括:获取所述图片集对应的图片传输系统的图片系统权值;所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:根据所述二分类概率和所述图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In one embodiment, after obtaining the bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further includes: acquiring a picture system weight of the picture transmission system corresponding to the picture set; Calculating the original upper limit probability and the original lower limit probability according to the second classification probability, and regenerating the picture review probability interval, comprising: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight value Perform calculations to regenerate the image review probability interval.
一种图片复审判定方法,所述方法包括:获取待鉴定图片为不良图片的不良概率;判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间根据上述各个实施例中所述的图片复审概率区间生成方法而生成;若是,则判定所述待鉴定图片需要进行图片复审。A picture re-determination method, the method comprising: obtaining a bad probability that a picture to be identified is a bad picture; determining whether the bad probability is within a preset picture review probability interval, and the picture review probability interval is according to the foregoing implementation The picture review probability interval generation method described in the example is generated; if yes, determining that the picture to be identified requires a picture review.
一种图片复审概率区间生成装置,所述装置包括:不良概率获取模块,用于获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;图片判定数量统计模块,用于对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;图片复审概率区间生成模块,用于根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。A picture review probability interval generating device, the device comprising: a bad probability obtaining module, configured to acquire a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; and a picture determination quantity statistics module, configured to: For each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number is determined as the bad picture; the picture review probability interval generation module is used to correspond to each group of bad probabilities. The first quantity and the second quantity regenerate the picture review probability interval.
一种图片复审判定装置,所述装置包括:待鉴定图片不良概率获取模块,用于获取待鉴定图片为不良图片的不良概率;图片复审判断模块,用于判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间根据上述各实施例中所述的图片复审概率区间生成方法生成;若是,则判定所述待鉴定图片需要进行图片复审。A picture re-determination device, the device comprising: a picture defect probability acquisition module to be used for obtaining a bad probability that a picture to be identified is a bad picture; a picture review judging module, configured to determine whether the bad probability is preset Within the picture review probability interval, the picture review probability interval is generated according to the picture review probability interval generation method described in the above embodiments; if yes, determining that the picture to be identified requires image review.
一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以 下步骤:获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;及根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。One or more non-transitory readable storage mediums storing computer readable instructions, when executed by one or more processors, cause the one or more processors to execute The following steps: obtaining the bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; for each picture having the same bad probability, the statistics are determined to be the first quantity of the normal picture, and the trial is determined. a second number of bad pictures; and regenerating a picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probabilities.
一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:获取待鉴定图片为不良图片的不良概率;判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间的生成步骤包括获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间;及若是,则判定所述待鉴定图片需要进行图片复审。One or more non-transitory readable storage mediums storing computer readable instructions, when executed by one or more processors, cause the one or more processors to perform the steps of: acquiring The probability that the picture to be identified is a bad picture; determining whether the bad probability is within a preset picture review probability interval, the step of generating the picture review probability interval includes acquiring each picture set corresponding to the original review probability interval The picture is a bad probability of a bad picture; for each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number of bad pictures is determined as the bad picture; A quantity and a second quantity regenerate the picture review probability interval; and if so, determining that the picture to be identified requires a picture review.
一种计算机设备,包括存储器和一个或多个处理器,所述存储器中存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;及根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。A computer device comprising a memory and one or more processors, the memory storing computer readable instructions, the computer readable instructions being executed by the processor, causing the one or more processors to execute The following steps: obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; for each picture having the same bad probability, the statistics are determined to be the first number of normal pictures, and the trial is determined a second number of bad pictures; and regenerating a picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probabilities.
一种计算机设备,包括存储器和一个或多个处理器,所述存储器中存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:获取待鉴定图片为不良图片的不良概率;判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间的生成步骤包括获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良 概率对应的第一数量和第二数量重新生成图片复审概率区间;及若是,则判定所述待鉴定图片需要进行图片复审。A computer device comprising a memory and one or more processors, the memory storing computer readable instructions, the computer readable instructions being executed by the processor, causing the one or more processors to execute The following steps: obtaining a bad probability that the picture to be identified is a bad picture; determining whether the bad probability is within a preset picture review probability interval, and the generating step of the picture review probability interval includes obtaining a corresponding interval corresponding to the original review probability interval The bad probability of each picture being a bad picture in the picture set; for each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number of bad pictures is determined as a bad picture; The first quantity and the second quantity corresponding to the probability regenerate the picture review probability interval; and if so, determining that the picture to be identified requires a picture review.
本申请的一个或多个实施例的细节在下面的附图和描述中提出。本申请的其它特征、目的和优点将从说明书、附图以及权利要求书变得明显。Details of one or more embodiments of the present application are set forth in the accompanying drawings and description below. Other features, objects, and advantages of the invention will be apparent from the description and appended claims.
附图说明DRAWINGS
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings to be used in the embodiments will be briefly described below. Obviously, the drawings in the following description are only some embodiments of the present application, Those skilled in the art can also obtain other drawings based on these drawings without any creative work.
图1为一个实施例中图片复审概率区间生成方法的应用环境图;1 is an application environment diagram of a method for generating a picture review probability interval in an embodiment;
图2为一个实施例中图片复审概率区间生成方法的流程图;2 is a flowchart of a method for generating a picture review probability interval in an embodiment;
图3为另一个实施例中图片复审概率区间生成方法的流程图;3 is a flowchart of a method for generating a picture review probability interval in another embodiment;
图4为又一个实施例中图片复审概率区间生成方法的流程图;4 is a flowchart of a method for generating a picture review probability interval in still another embodiment;
图5为一个实施例中图片复审判定方法的流程图;Figure 5 is a flow chart of a method for resuming a picture in an embodiment;
图6为一个实施例中图片复审概率区间生成装置的结构框图;6 is a structural block diagram of a picture review probability interval generating apparatus in an embodiment;
图7为另一个实施例中图片复审概率区间生成装置的结构框图;FIG. 7 is a structural block diagram of a picture review probability interval generating apparatus in another embodiment; FIG.
图8为一个实施例中图片复审判定装置的结构框图;Figure 8 is a block diagram showing the structure of a picture re-determining device in an embodiment;
图9为一个实施例中第一计算机设备的内部结构图。Figure 9 is a diagram showing the internal structure of a first computer device in one embodiment.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。In order to make the objects, technical solutions, and advantages of the present application more comprehensible, the present application will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the application and are not intended to be limiting.
可以理解,本申请所使用的术语“第一”、“第二”等可在本文中用于描述各种元件,但这些元件不受这些术语的限制。这些术语仅用于将第一个元件与另一个元件区分。举例来说,在不脱离本申请的范围的情况下,可以将 第一数量称为第二数量,且类似地,可将第二数量称为第一数量。第一数量和第二数量两者都是数量,但其不是同一数量。It will be understood that the terms "first", "second" and the like, as used herein, may be used to describe various elements, but these elements are not limited by these terms. These terms are only used to distinguish one element from another. For example, without departing from the scope of the application, The first quantity is referred to as a second quantity, and similarly, the second quantity may be referred to as a first quantity. Both the first quantity and the second quantity are quantities, but they are not the same quantity.
本申请实施例所提供的图片复审概率区间生成方法,可应用于如图1的应用环境中。参照图1,该应用环境包括第一计算机设备102和第二计算机设备104。第一计算机设备102和第二计算机设备104可为终端也可为服务器。其中,终端包括但不限于手机、平板电脑或者个人数字助理或穿戴式设备等,服务器可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群。第一计算机设备102和第二计算机设备可为相同类型的计算机设备,也可为不同类型的计算机设备。第一计算机设备102可用于执行本申请实施例所提供的图片复审概率区间生成方法。第二计算机设备104可以是存储有不良图片鉴别系统的终端或服务器,可用于鉴别不良图片并输出图片的初始概率。第一计算机设备102可与第二计算机设备104网络连接,比如说,第一计算机设备102可获取第二计算机设备104对图片进行鉴别得到的初始概率等数据,网络连接包括但不限于无线网络、有线网络等。The image review probability interval generation method provided by the embodiment of the present application can be applied to the application environment as shown in FIG. 1 . Referring to FIG. 1, the application environment includes a first computer device 102 and a second computer device 104. The first computer device 102 and the second computer device 104 can be terminals or servers. The terminal includes but is not limited to a mobile phone, a tablet computer, or a personal digital assistant or a wearable device. The server may be an independent physical server or a server cluster composed of multiple physical servers. The first computer device 102 and the second computer device can be the same type of computer device, or can be different types of computer devices. The first computer device 102 can be used to execute the picture review probability interval generation method provided by the embodiment of the present application. The second computer device 104 may be a terminal or server that stores a bad picture authentication system that can be used to identify bad pictures and output an initial probability of the picture. The first computer device 102 can be connected to the second computer device 104. For example, the first computer device 102 can obtain data such as an initial probability that the second computer device 104 authenticates the picture, and the network connection includes, but is not limited to, a wireless network. Wired network, etc.
在一个实施例中,第一计算机设备102和第二计算机设备104可为同一计算机设备。In one embodiment, the first computer device 102 and the second computer device 104 can be the same computer device.
在一个实施例中,如图2所示,提供了一种图片复审概率区间生成方法,该方法可应用于如图1所示的应用环境中的第一计算机设备102,该方法包括:In one embodiment, as shown in FIG. 2, a picture review probability interval generation method is provided, which is applicable to the first computer device 102 in the application environment as shown in FIG. 1, the method comprising:
步骤S202,获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率。Step S202: Obtain a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture.
原始复审概率区间可以指预设的固定的复审概率区间,也可以是前一次调整的复审概率区间。其中,复审概率区间是指在不良图片鉴定过程中,需要进行人工复审的图片所对应的概率区间。不良概率是指图片为不良图片的可能程度,不良概率处于原始复审概率区间中。可根据不良图片鉴定系统输出的初始概率,按照预设的算法计算得到不良概率。进一步地,可根据与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率,统计形 成原始复审概率区间上的图片不良分布,其中,图片不良分布中,横坐标可为不良概率,纵坐标可为相同不良概率所对应的图片数量。The original review probability interval may refer to a preset fixed review probability interval, or may be a previous adjusted review probability interval. Among them, the review probability interval refers to the probability interval corresponding to the picture that needs to be manually reviewed in the process of bad picture identification. The bad probability refers to the possible degree of the picture being a bad picture, and the bad probability is in the original review probability interval. According to the initial probability of the bad picture identification system output, the bad probability can be calculated according to a preset algorithm. Further, according to the original image of the original review probability interval, the probability of each picture being a bad picture may be statistically shaped. The image is poorly distributed in the original review probability interval. In the bad distribution of the image, the abscissa may be a bad probability, and the ordinate may be the number of pictures corresponding to the same bad probability.
步骤S204,对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量。Step S204, for each group of pictures having the same bad probability, the first number of the normal picture is determined as the normal picture, and the second number is determined as the bad picture.
统计原始复审概率区间中的图片被复审判定为正常图片或不良图片的数量。其中,相同不良概率的图片中,被复审判定为正常图片的图片数量为第一数量,被复审判定为不良图片的图片数量为第二数量。The number of pictures in the original review probability interval is determined to be the number of normal pictures or bad pictures. Among them, in the picture with the same bad probability, the number of pictures that are determined to be normal pictures is the first number, and the number of pictures that are determined to be bad pictures is the second number.
步骤S206,根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。Step S206: Regenerate the picture review probability interval according to the first quantity and the second quantity corresponding to each group of the bad probability.
新的图片复审概率区间是指用于判定图片是否要进行人工复审的概率区间,当新接收的图片的不良概率处于上述方法计算得到的新的图片复审概率区间之中时,可将该图片判定为需要进行人工复审。进一步地,还可根据每个不良概率的图片所对应的第一数量和第二数量,分别形成第一数量分布和第二数量分布之后,根据每组不良概率对应的第一数量分布和第二数量分布重新生成图片复审概率区间。The new picture review probability interval refers to the probability interval used to determine whether the picture is to be manually reviewed. When the bad probability of the newly received picture is in the new picture review probability interval calculated by the above method, the picture can be determined. For manual review. Further, after the first quantity distribution and the second quantity distribution are respectively formed according to the first quantity and the second quantity corresponding to the picture of each bad probability, the first quantity distribution and the second quantity corresponding to each group of the bad probability are further The quantity distribution regenerates the image review probability interval.
上述实施例中,通过结合原始复审概率区间对应图片集的人工复审结果,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量,根据第一数量和第二数量对原始复审概率区间进行计算,重新生成图片复审概率区间。通过结合人工复审后的结果,对复审概率区间进行调整,使得新生成的图片复审概率区间范围更加准确,减少图片复审的工作量,从而提高不良图片复审的效率。In the above embodiment, by combining the manual review results of the original review probability interval corresponding picture set, the first number of the normal picture is determined as the normal picture, and the second number is determined as the bad picture, according to the first quantity and the second number. The quantity is calculated for the original review probability interval, and the image review probability interval is regenerated. By combining the results of the manual review, the review probability interval is adjusted, so that the newly generated image review probability interval range is more accurate, reducing the workload of image review, thereby improving the efficiency of poor image review.
在一个实施例中,如图3所示,在步骤S202之前,还包括:In an embodiment, as shown in FIG. 3, before step S202, the method further includes:
步骤S302,获取预设图片集中,每个图片为不良图片的初始概率。Step S302: Acquire a preset picture set, and each picture is an initial probability of a bad picture.
预设图片集是指图片系统在预设时间段内输入不良图片鉴别系统的图片所构成的图片集。初始概率不良图片鉴别系统输出的每个图片鉴别后的概率,不良图片鉴别系统输出的可以是将图片鉴定为正常、性感、色情三类中一种的单类别概率,或者是将图片鉴定为正常、性感、色情三类时分别对应的三 个多类别概率。其中,单一类别概率和多类别概率可为图片为相应图片类别的置信度。单一类别概率和多类别概率的取值可为0至1。The preset picture set refers to a picture set formed by the picture system inputting a picture of the bad picture authentication system within a preset time period. The initial probability is that the probability of each picture outputted by the image authentication system is different. The output of the bad picture authentication system may be a single-category probability that the picture is identified as one of normal, sexy, and erotic, or the picture is identified as normal. Three pairs of sexy, erotic, and erotic Multiple category probabilities. Among them, the single category probability and the multi-category probability may be the confidence that the picture is the corresponding picture category. The single category probability and the multi-class probability may range from 0 to 1.
步骤S304,对每个图片对应的初始概率进行计算,生成预设图片集中每个图片为不良图片的不良概率。Step S304, calculating an initial probability corresponding to each picture, and generating a bad probability that each picture in the preset picture set is a bad picture.
不良概率是按照预设的算法根据初始概率进行计算而得到的具有统一标准的概率。将单类别概率或多类别概率统一映射为在同一参考范围上的概率。The probability of badness is the probability of having a uniform standard obtained by calculating according to the initial probability according to a preset algorithm. Uniform mapping of single category probabilities or multi-category probabilities to probabilities over the same reference range.
在一个实施例中,获取预设图片集中,每个图片为不良图片的初始概率所对应的图片类别权值;根据相同图片的初始概率所对应的图片类别权值,对每个图片对应的初始概率进行计算,生成预设图片集中每个图片为不良图片的不良概率。其中,图片类别权值是指不良图片鉴别系统输出的图片类别所对应的权值。当初始概率为单一类别概率时,每个图片所对应的图片类别权值为预设正常权值、预设性感权值、预设色情权值三类中的一种;当初始概率为多类别概率时,每个图片所对应的图片类别权值可为三个,分别是预设正常权值、预设性感权值和预设色情权值。In an embodiment, the preset picture set is obtained, and each picture is a picture category weight corresponding to an initial probability of the bad picture; and an initial corresponding to each picture according to the picture category weight corresponding to the initial probability of the same picture The probability is calculated to generate a bad probability that each picture in the preset picture set is a bad picture. The picture category weight refers to the weight corresponding to the picture category output by the bad picture authentication system. When the initial probability is a single category probability, the picture category weight corresponding to each picture is one of three categories: a preset normal weight, a preset sexy weight, and a preset porn weight; when the initial probability is a multi-category In the case of probability, the picture category weight value corresponding to each picture may be three, which are a preset normal weight, a preset sexy weight, and a preset porn weight.
步骤S306,设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与原始复审概率区间相对应的图片集。Step S306, setting an original review probability interval, and extracting a picture corresponding to the bad probability in the original review probability interval, and forming a picture set corresponding to the original review probability interval.
计算出每个图片为不良图片的不良概率之后,可根据原始复审概率区间提取出相应的图片,形成与原始复审概率区间相对应的图片集,该图片集为需要进行人工复审的图片集。进一步地,可将与原始复审概率区间相对应的图片集提取出来存储至预设路径下的数据库,使得图片复审人员能够直接对已提取出的图片进行复审。After calculating the bad probability that each picture is a bad picture, the corresponding picture may be extracted according to the original review probability interval to form a picture set corresponding to the original review probability interval, and the picture set is a picture set that needs manual review. Further, the picture set corresponding to the original review probability interval may be extracted and stored in a database under the preset path, so that the picture reviewer can directly review the extracted picture.
上述实施例中,通过将不良图片鉴定系统输出的图片的初始概率统一映射成不良概率进行比较,使得能够确定具有统一参考标准的原始复审概率区间,并根据原始复审概率区间相对应的图片集的人工复审结果,对原始复审概率区间进行计算,重新生成图片复审概率区间,规范了图片复审概率区间的确定方式,提高了图片复审概率区间的准确度。In the above embodiment, the initial probability of the picture outputted by the bad picture identification system is uniformly mapped to the bad probability for comparison, so that the original review probability interval having the unified reference standard can be determined, and the picture set corresponding to the original review probability interval is determined. The results of the manual review are used to calculate the original review probability interval, regenerate the image review probability interval, standardize the determination method of the image review probability interval, and improve the accuracy of the image review probability interval.
在一个实施例中,步骤S206,还包括:获取原始复审概率区间的原始上 限概率和原始下限概率;根据原始上限概率和原始下限概率计算出二分类概率,通过二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In an embodiment, step S206, further comprising: acquiring an original review probability interval on the original The probability of the limit and the probability of the original lower limit; the probability of the second class is calculated according to the original upper limit probability and the original lower limit probability, and the sum of the second lower limit probability corresponding to the second classification probability and the second classification probability to the original upper limit can be obtained by the two classification probability The sum of the first quantity corresponding to the probability, the minimum of the sum of the two; the original upper limit probability and the original lower limit probability are calculated according to the second classification probability, and the picture review probability interval is regenerated.
原始上限概率和原始下限概率是指原始复审概率区间的端点,且原始上限概率的值大于原始下限概率的值。二分类概率是指将原始复审概率区间分割成两个区间的值,二分类概率大于原始下限阈值且小于原始上限阈值。在将原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,两者的总和取最小值时,可得到二分类概率。可表示为公式
Figure PCTCN2017108480-appb-000003
其中,s1为原始下限概率,s2为原始上限概率,Mi表示概率为i的第二数量,Ni表示概率为i的第一数量,通过取
Figure PCTCN2017108480-appb-000004
Figure PCTCN2017108480-appb-000005
二者之和的最小值可求得二分类概率t。
The original upper bound probability and the original lower bound probability refer to the endpoints of the original review probability interval, and the value of the original upper bound probability is greater than the original lower bound probability. The second classification probability refers to dividing the original review probability interval into two interval values, and the second classification probability is greater than the original lower limit threshold and less than the original upper limit threshold. When the sum of the original lower bound probability to the second quantity corresponding to the second classification probability and the first quantity corresponding to the second upper limit probability to the original upper limit probability, the sum of the two takes the minimum value, the two classification probability is obtained. Can be expressed as a formula
Figure PCTCN2017108480-appb-000003
Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, M i represents the second quantity with probability i, and N i represents the first quantity with probability i, by taking
Figure PCTCN2017108480-appb-000004
versus
Figure PCTCN2017108480-appb-000005
The minimum of the sum of the two can be obtained as the second classification probability t.
在一个实施例中,原始复审概率区间的原始上限概率和原始下限概率,可按照与计算不良概率相同的预设的算法计算得到。由于每个不良图片鉴别系统都存在漏检率或误检率,每个不良图片鉴别系统有自身预设的需要进行复审的初始概率区间,可提取该初始概率区间的初始上限概率和初始下限概率进行计算,得到原始上限概率和原始下限概率。原始复审概率区间的原始上限概率和原始下限概率也可为根据前一次调整的图片复审概率区间直接确定的。In one embodiment, the original upper bound probability and the original lower bound probability of the original review probability interval may be calculated according to a preset algorithm that is the same as the calculated bad probability. Since each bad picture identification system has a missed detection rate or a false detection rate, each bad picture authentication system has its own initial probability interval that needs to be reviewed, and the initial upper limit probability and the initial lower limit probability of the initial probability interval can be extracted. The calculation is performed to obtain the original upper limit probability and the original lower limit probability. The original upper bound probability and the original lower bound probability of the original review probability interval may also be directly determined according to the previously adjusted picture review probability interval.
进一步地,还可在形成第一数量分布和第二数量分布之后,将第二数量分布在原始下限概率至二分类概率的积分,与第一数量分布在二分类概率至原始上限概率的积分,两者的积分之和取最小值计算得二分类概率。根据二分类概率对原始上限概率和原始下限概率按照预设的分别或者同时进行计算,重新生成图片复审概率区间。Further, after the first quantity distribution and the second quantity distribution are formed, the second quantity is distributed from the original lower limit probability to the integral of the second classification probability, and the first quantity is distributed to the integral of the second classification probability to the original upper limit probability. The sum of the integrals of the two takes the minimum value to calculate the two-class probability. According to the second classification probability, the original upper limit probability and the original lower limit probability are calculated according to presets or simultaneously, and the picture review probability interval is regenerated.
在一个实施例中,在获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之后,还包括:获取图片集对应的图片传输系统的 图片系统权值。步骤S206包括:根据二分类概率和图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In an embodiment, after obtaining the bad probability that each picture is a bad picture in the picture set corresponding to the original review probability interval, the method further includes: acquiring the picture transmission system corresponding to the picture set Image system weight. Step S206 includes: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight, and regenerating the picture review probability interval.
图片系统权值为与图片系统所对应的一个或多个权值。其中,图片系统包括但不限于娱乐性质的系统,比如相亲论坛、社交平台等,或者也可为金融领域的系统,比如银行系统、证券系统等。具体的,图片系统权值可根据图片系统的属性或参数进行赋值。比如,可根据图片类型进行赋值,其中,图片类型包括但不限于发票单据类、客户单据类、凭证文件类、工作流通类、客户交流沟通类等,按照内部至对外流程业务的程度由低到高赋值;还可以根据图片访问频率进行赋值,其中,访问频率可分为外部访问频率及下载频率等,按照频率的取值由低到高赋值;还可以根据图片重要程度进行赋值,比如可根据图片与交易的关联程度进行赋值,可按照图片与交易凭证、推广宣传及客户管理的相关程度由低到高赋值。进一步地,可根据图片系统的一个或多个系统权值和二分类概率,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。还可将多个系统权值的和作为该图片系统的综合系统权值。当综合系统权值在预设的权值区间时,采取相应的分别或同时对原始上限概率和原始下限概率进行计算。The picture system weight is one or more weights corresponding to the picture system. The picture system includes, but is not limited to, an entertainment system, such as a blind date forum, a social platform, or the like, or a system in the financial field, such as a banking system, a securities system, and the like. Specifically, the picture system weight can be assigned according to the attributes or parameters of the picture system. For example, the value can be assigned according to the type of the picture, wherein the picture type includes but is not limited to the invoice document class, the customer document class, the voucher file class, the work circulation class, the customer communication class, etc., according to the degree of internal to external process business from low to High assignment; can also be assigned according to the frequency of image access, wherein the access frequency can be divided into external access frequency and download frequency, etc., according to the value of the frequency from low to high assignment; can also be assigned according to the importance of the image, for example, according to The degree of relevance of the image to the transaction is assigned, and can be assigned from low to high according to the degree of relevance of the image to the transaction voucher, promotion and customer management. Further, the original upper limit probability and the original lower limit probability may be calculated according to one or more system weights and two classification probabilities of the picture system, and the picture review probability interval is regenerated. The sum of multiple system weights can also be used as the overall system weight for the picture system. When the weight of the integrated system is in the preset weight interval, the corresponding upper limit probability and the original lower limit probability are calculated separately or simultaneously.
在一个实施例中,可根据图片类型确定权值q1,可根据图片访问频率确定权值q2,可根据图片重要程度确定权值q3。可根据q1、q2、q3权值之和确定综合系统权值Q=q1+q2+q3,且可限定q1、q2、q3的取值范围。比如,q1可取值1至10,q2取值1至20,q3取值1至10。进一步地,可根据综合系统权值Q与预设阈值的大小,采取相应的计算公式重新生成原始上限概率和原始下限概率。比如,当综合系统权值Q小于等于20,调整
Figure PCTCN2017108480-appb-000006
当综合系统权值Q大于20,调整
Figure PCTCN2017108480-appb-000007
其中,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,s1’为新的下限概率,s2’为新的上限概率。
In one embodiment, the weights may be determined q 1, q 2 weights may be determined according to the picture access frequency, may be determined according to weight q 3 importance image based on the image type. According to q 1, q 2, 3 to determine the weighted sum of q and integrated system of weights Q = q 1 + q 2 + q 3, and may define q 1, q 2, q 3 range. For example, q 1 can take values from 1 to 10, q 2 takes values from 1 to 20, and q 3 takes values from 1 to 10. Further, according to the weight of the integrated system weight Q and the preset threshold, a corresponding calculation formula is used to regenerate the original upper limit probability and the original lower limit probability. For example, when the integrated system weight Q is less than or equal to 20, adjust
Figure PCTCN2017108480-appb-000006
When the integrated system weight Q is greater than 20, adjust
Figure PCTCN2017108480-appb-000007
Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, s 1 ' is the new lower bound probability, and s 2 ' is the new upper bound probability.
在一个实施例中,可根据图片的URL(资源统一定位符)中的关键字确定图片的类型。比如说,如果一张图片的URL相关的关键词为货物、货款、支付等,则可给图片打上单据的标签。可以预设关键词的数据库,生成关键词和类型的映射关系。当提取到URL中包含的关键词时,将该关键词与数据库中的预设关键词进行匹配,获取与匹配成功的关键词具有映射关系的图片类型,作为与该图片对应的图片类型。针对不同的图片系统的情况生成相应的图片复审概率区间,使得生成的图片复审概率区间更加符合实际需求,提高了图片复审区间的准确度。In one embodiment, the type of picture may be determined based on keywords in the URL (resource uniform locator) of the picture. For example, if the keyword related to the URL of a picture is goods, payment, payment, etc., the picture can be tagged with a document. A database of keywords can be preset to generate a mapping relationship between keywords and types. When the keyword included in the URL is extracted, the keyword is matched with the preset keyword in the database, and the image type having a mapping relationship with the keyword that matches the matching is obtained as the image type corresponding to the image. Corresponding image review probability intervals are generated for different picture system situations, so that the generated picture review probability interval is more in line with actual needs, and the accuracy of the picture review interval is improved.
在一个实施例中,针对不同的用户也可有相应的图片系统权值。该图片系统权值可根据用户的历史发图数据进行赋值。比如说,可以根据用户历史记录中发的图片的不良程度进行赋值;还可根据用户历史发图的图片访问频率进行赋值;还可根据用户历史记录中发的文本等其他形式内容的不良程度进行赋值。举例来说,针对一个历史习惯为上传过色情图片,且图片访问频率大于预设阈值的用户,可赋予相对较高的权值。In one embodiment, corresponding picture system weights may also be available for different users. The image system weight can be assigned according to the user's historical map data. For example, the value may be assigned according to the degree of the badness of the image sent in the user history record; the value may be assigned according to the image access frequency of the user history map; or may be performed according to the degree of badness of other forms of content such as text sent in the user history record. Assignment. For example, a user who has uploaded a pornographic picture and whose picture access frequency is greater than a preset threshold may be given a relatively high weight.
在一个实施例中,可根据用户历史记录中发的图片的不良程度确定权值c1,可根据用户历史发图的图片访问频率确定权值c2,可根据用户历史记录中发的文本等其他形式内容的不良程度确定权值c3。可根据c1、c2、c3权值之和确定综合系统权值C=c1+c2+c3,且可限定c1、c2、c3的取值范围。比如,c1可取值1至10,c2取值1至20,c3取值1至10。进一步地,可根据综合系统权值C与预设阈值的大小,采取相应的计算公式重新生成原始上限概率和原始下限概率。比如,当综合系统权值C小于等于20,调整
Figure PCTCN2017108480-appb-000008
当综合系统权值C大于20,调整
Figure PCTCN2017108480-appb-000009
其中,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,s1’为新的下限概率,s2’为新的上限概率。针对不同用户的历史记录生成相应的图片复审概率区间,使得生成的图片复审概率区间更加符合实际需求,提高了图片复审区间的准 确度。
In one embodiment, the poor according to the degree of user history send images to determine the weight c 1, may determine the weight value c based on the image user historical development of FIG access frequency 2, according to the user history issued text, etc. The degree of badness of other forms of content determines the weight c 3 . May be c 1, c 2, 3 c, and the weighted sum is determined integrated system weight C = c 1 + c 2 + c 3 according to, and may define a c 1, c 2, c 3 range. For example, c 1 may take values from 1 to 10, c 2 may take values from 1 to 20, and c 3 may take values from 1 to 10. Further, according to the weight of the integrated system weight C and the preset threshold, a corresponding calculation formula is used to regenerate the original upper limit probability and the original lower limit probability. For example, when the integrated system weight C is less than or equal to 20, adjust
Figure PCTCN2017108480-appb-000008
When the integrated system weight C is greater than 20, adjust
Figure PCTCN2017108480-appb-000009
Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, s 1 ' is the new lower bound probability, and s 2 ' is the new upper bound probability. Corresponding image review probability intervals are generated for the history records of different users, so that the generated image review probability interval is more in line with actual needs, and the accuracy of the image review interval is improved.
在一个实施例中,如图4所示,提供了另一种图片复审概率区间生成方法,该方法包括:In an embodiment, as shown in FIG. 4, another method for generating a picture review probability interval is provided, and the method includes:
步骤S402,获取预设图片集中,每个图片为不良图片的初始概率,对每个图片对应的初始概率进行计算,生成预设图片集中每个图片为不良图片的不良概率。Step S402: Acquire an initial probability that each picture is a bad picture, and calculate an initial probability corresponding to each picture to generate a bad probability that each picture in the preset picture set is a bad picture.
初始概率不良图片鉴别系统输出的每个图片鉴别后的概率,不良图片鉴别系统输出的可以是将图片鉴定为正常、性感、色情三类中一种的单类别概率,或者是将图片鉴定为正常、性感、色情三类时分别对应的三个多类别概率。The initial probability is that the probability of each picture outputted by the image authentication system is different. The output of the bad picture authentication system may be a single-category probability that the picture is identified as one of normal, sexy, and erotic, or the picture is identified as normal. Three multi-category probabilities corresponding to the three categories of sexy, erotic, and erotic.
在一个实施例中,当初始概率为单类别概率时,可分别预设正常、性感及色情三种单类别概率的映射公式,计算生成不良概率。比如说,针对鉴别为正常图片的单类别概率A,映射公式可为不良概率
Figure PCTCN2017108480-appb-000010
针对鉴别为性感图片的单类别概率B,映射公式可为不良概率
Figure PCTCN2017108480-appb-000011
针对鉴别为色情图片的单类别概率C,映射公式可为不良概率
Figure PCTCN2017108480-appb-000012
其中,若A、B、C的取值都为0至1,则与概率A相应的不良概率y取值为0至
Figure PCTCN2017108480-appb-000013
与概率B相应的不良概率y取值为
Figure PCTCN2017108480-appb-000014
Figure PCTCN2017108480-appb-000015
与概率C相应的不良概率y取值为
Figure PCTCN2017108480-appb-000016
至1,因此不良概率y的取值范围为0至1。通过将三种单类别概率映射为同一取值标准下的不良概率,降低了比较不良图片鉴别系统输出概率的难度,规范了不良概率的标准,提高了确定复审概率区间的准确度。
In one embodiment, when the initial probability is a single-category probability, a mapping formula of three single-category probabilities of normal, sexy, and erotic may be preset, respectively, to calculate a probability of generating a bad. For example, for a single-category probability A that is identified as a normal picture, the mapping formula can be a bad probability.
Figure PCTCN2017108480-appb-000010
For a single-category probability B identified as a sexy picture, the mapping formula can be a bad probability
Figure PCTCN2017108480-appb-000011
For a single-category probability C that is identified as an erotic image, the mapping formula can be a bad probability
Figure PCTCN2017108480-appb-000012
Wherein, if the values of A, B, and C are both 0 to 1, the probability y corresponding to the probability A is 0.
Figure PCTCN2017108480-appb-000013
The probability of bad y corresponding to probability B is
Figure PCTCN2017108480-appb-000014
to
Figure PCTCN2017108480-appb-000015
The probability of bad y corresponding to probability C is
Figure PCTCN2017108480-appb-000016
Up to 1, so the probability of failure y ranges from 0 to 1. By mapping the three single-category probabilities to the bad probability under the same value criterion, the difficulty of comparing the output probability of the bad picture identification system is reduced, the standard of the bad probability is standardized, and the accuracy of determining the review probability interval is improved.
在一个实施例中,当初始概率为多类别概率时,可综合每张图片所对应的正常概率、性感概率及色情概率,按照统一的映射公式,计算生成不良概率。比如说,针对相同图片所对应的正常概率D、性感概率E和色情概率F, 且三个概率之和可为1,可预设映射公式为不良概率
Figure PCTCN2017108480-appb-000017
通过综合三个多类别概率计算得到不良概率,降低了比较不良图片鉴别系统输出概率的难度,规范了不良概率的标准,提高了确定复审概率区间的准确度。
In one embodiment, when the initial probability is a multi-category probability, the normal probability, the sexy probability, and the erotic probability corresponding to each picture may be integrated, and the probability of generating the defect is calculated according to a unified mapping formula. For example, for the normal probability D, the sexy probability E, and the erotic probability F corresponding to the same picture, and the sum of the three probabilities may be 1, the mapping formula may be preset as a bad probability.
Figure PCTCN2017108480-appb-000017
By combining three multi-category probability calculations to obtain the bad probability, the difficulty of comparing the output probability of the bad picture identification system is reduced, the standard of the bad probability is standardized, and the accuracy of determining the review probability interval is improved.
步骤S404,设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与原始复审概率区间相对应的图片集。Step S404, setting an original review probability interval, and extracting a picture corresponding to the bad probability in the original review probability interval, and forming a picture set corresponding to the original review probability interval.
可根据确定不良概率的映射公式,确定原始复审概率区间。其中,原始复审概率区间在预设图片集所对应的不良概率的取值区间中。The original review probability interval can be determined according to the mapping formula for determining the probability of failure. The original review probability interval is in the value interval of the bad probability corresponding to the preset picture set.
步骤S406,获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率。Step S406, obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture.
步骤S408,对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量。Step S408, for each group of pictures having the same bad probability, the first number of the normal picture is determined as the normal picture, and the second number is determined as the bad picture.
将原始复审概率区间相对应的图片集作为需要进行人工复审的图片集,使得图片复审人员将该图片集中的图片判定为正常图片或不良图片。The picture set corresponding to the original review probability interval is used as a picture set that needs to be manually reviewed, so that the picture reviewer determines the picture in the picture set as a normal picture or a bad picture.
在一个实施例中,针对图片复审,可提供相应的图片复审界面,在该界面上展示原始复审概率区间相对应的图片集中的图片,图片复审人员可根据判断选择出其中的正常图片或不良图片。进一步地,可按照每个不良概率展示对应的图片,当检测到图片复审人员的点击操作,则可以给该图片打上相应的标签,统计统一类型的标签数量作为第一数量或第二数量。In an embodiment, for the image review, a corresponding image review interface may be provided, and the image in the image set corresponding to the original review probability interval is displayed on the interface, and the image reviewer may select a normal image or a bad image according to the judgment. . Further, the corresponding picture may be displayed according to each bad probability. When the click operation of the picture reviewing personnel is detected, the picture may be marked with a corresponding label, and the number of labels of the unified type is counted as the first quantity or the second quantity.
举例来说,可将图片复审人员点击选择的图片作为不良图片并标记为“1”,将没有被点击选择的图片作为正常图片并标记为“0”,当对该不良概率对应的图片都进行了复审之后,可以统计该不良概率下标记为“0”的图片数量作为第一数量,统计标记为“0”的图片数量作为第二数量。For example, the picture reviewer may click on the selected picture as a bad picture and mark it as "1", and the picture that has not been clicked as the normal picture is marked as "0", and the picture corresponding to the bad probability is performed. After the review, the number of pictures marked as “0” under the probability of failure may be counted as the first quantity, and the number of pictures marked as “0” as the second quantity.
步骤S410,获取原始复审概率区间的原始上限概率和原始下限概率,根据原始上限概率和原始下限概率计算出二分类概率。Step S410: Acquire an original upper bound probability and an original lower bound probability of the original review probability interval, and calculate a second classification probability according to the original upper bound probability and the original lower bound probability.
通过二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最 小值。获取原始复审概率区间对应的原始上限概率s2和原始下限概率s1,其中,原始上限概率s2和原始下限概率s1可根据不良图片鉴别系统的漏检率和误检率确定。The sum of the second lower quantity probability to the second quantity corresponding to the second classification probability, the sum of the first quantity corresponding to the second classification probability to the original upper limit probability, and the sum of the sums of the two can be obtained by the two classification probability. The original upper limit probability s 2 and the original lower limit probability s 1 corresponding to the original review probability interval are obtained, wherein the original upper limit probability s 2 and the original lower limit probability s 1 may be determined according to the missed detection rate and the false detection rate of the bad picture authentication system.
在一个实施例中,可根据每个不良概率所对应的第一数量生成第一数量分布P1(x),相应的,可根据每个不良概率所对应的第二数量生成第二数量分布P2(x)。进一步地,可根据公式
Figure PCTCN2017108480-appb-000018
计算生成二分类概率t,其中二分类概率t使得,将第二数量分布P2(x)在原始下限概率s1至二分类概率t的积分,与第一数量分布P1(x)在二分类概率t至原始上限概率s2的积分,两者的积分之和取最小值。
In an embodiment, the first quantity distribution P 1 (x) may be generated according to the first quantity corresponding to each bad probability. Correspondingly, the second quantity distribution P may be generated according to the second quantity corresponding to each bad probability. 2 (x). Further, according to the formula
Figure PCTCN2017108480-appb-000018
Calculating generates a two-class probability t, wherein the two-class probability t is such that the integral of the second quantity distribution P 2 (x) between the original lower-order probability s 1 and the second-class probability t, and the first quantity distribution P 1 (x) The integral of the classification probability t to the original upper limit probability s 2 , and the sum of the integrals of the two takes the minimum value.
步骤S412,判断二分类概率是否大于预设概率。In step S412, it is determined whether the two classification probability is greater than a preset probability.
当二分类概率小于等于预设概率时,执行步骤S314;当二分类概率大于预设概率时,执行步骤S316。When the two classification probability is less than or equal to the preset probability, step S314 is performed; when the second classification probability is greater than the preset probability, step S316 is performed.
步骤S414,按照预设的第一公式进行计算,重新生成上限概率。In step S414, the calculation is performed according to the preset first formula, and the upper limit probability is regenerated.
举例来说,第一公式可预设为
Figure PCTCN2017108480-appb-000019
其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率,s2’为新的上限概率。
For example, the first formula can be preset to
Figure PCTCN2017108480-appb-000019
Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability, and s 2 ' is the new upper bound probability.
步骤S416,按照预设的第二公式进行计算,重新生成下限概率。In step S416, the calculation is performed according to the preset second formula, and the lower limit probability is regenerated.
举例来说,第二公式可预设为
Figure PCTCN2017108480-appb-000020
其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率,s1’为新的下限概率。
For example, the second formula can be preset to
Figure PCTCN2017108480-appb-000020
Where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability, and s 1 ' is the new lower bound probability.
步骤S418,根据重新生成的上限概率或重新生成的下限概率,重新生成图片复审概率区间。Step S418: Regenerate the picture review probability interval according to the regenerated upper limit probability or the regenerated lower limit probability.
根据新的上限概率s2’或新的下限概率s1’,重新生成图片复审概率区间, 该图片复审概率区间为s1’至s2’。According to the new upper limit probability s 2 ' or the new lower limit probability s 1 ', the picture review probability interval is regenerated, and the picture review probability interval is s 1 ' to s 2 '.
上述实施例中,通过将不良图片鉴定系统输出的图片的初始概率统一映射成不良概率进行比较,根据原始复审概率区间所对应的图片经过人工复审后的结果,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量,根据第一数量和第二数量对原始复审概率区间进行计算,重新生成图片复审概率区间。通过结合人工复审后的结果,计算生成二分类概率。根据二分类概率与预设概率的大小将对复审概率区间的调整分成两种情况,使得新生成的图片复审概率区间范围更加准确,减少图片复审的工作量,从而提高不良图片复审的效率。In the above embodiment, the initial probability of the picture outputted by the bad picture identification system is uniformly mapped to the bad probability for comparison, and the result of the manual review after the picture corresponding to the original review probability interval is counted as the normal picture. A quantity, and a second quantity determined as a bad picture by the re-trial, calculates the original review probability interval according to the first quantity and the second quantity, and regenerates the picture review probability interval. The combined two-class probability is calculated by combining the results of the manual review. According to the size of the second classification probability and the preset probability, the adjustment of the review probability interval is divided into two cases, which makes the newly generated picture review probability interval range more accurate, reduces the workload of the picture review, and improves the efficiency of the bad picture review.
在一个实施例中,如图5所示,提供了一种图片复审判定方法,该方法包括:In one embodiment, as shown in FIG. 5, a picture retrieving method is provided, the method comprising:
步骤S502,获取待鉴定图片为不良图片的不良概率。Step S502, obtaining a bad probability that the picture to be identified is a bad picture.
待鉴定图片为需要进行判定是否要进行图像复审的图片。获取待鉴定图片为不良图片的不良概率与计算预设图片集中每个图片不良概率的计算方式一致。The picture to be identified is a picture that needs to be judged whether or not an image review is to be performed. The bad probability of obtaining the picture to be identified as a bad picture is consistent with the calculation method of calculating the probability of each picture in the preset picture set.
步骤S504,判断不良概率是否在预设的图片复审概率区间之内。Step S504, determining whether the probability of failure is within a preset picture review probability interval.
图片复审概率区间的生成方法包括:获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。若是,则执行步骤S506;若否,则执行步骤S508。The method for generating the image review probability interval includes: obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; for each group of pictures having the same bad probability, the statistically retried is determined as the first picture of the normal picture. The quantity, and the second number determined to be a bad picture; the picture review probability interval is regenerated according to the first quantity and the second quantity corresponding to each group of bad probabilities. If yes, go to step S506; if no, go to step S508.
步骤S506,判定待鉴定图片需要进行图片复审。In step S506, it is determined that the picture to be identified needs to be reviewed.
步骤S508,判定待鉴定图片不需要进行图片复审。In step S508, it is determined that the picture to be identified does not need to be reviewed.
在一个实施例中,图片复审概率区间的生成方法还包括:在获取与原始复审概率区间相对应的图片集中,每个图片为不良图片的不良概率之前,获取预设图片集中,每个图片为不良图片的初始概率;对每个图片对应的初始概率进行计算,生成预设图片集中每个图片为不良图片的不良概率;设定原 始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与原始复审概率区间相对应的图片集。In an embodiment, the method for generating a picture review probability interval further includes: acquiring a preset picture set in each of the picture sets corresponding to the original review probability interval, each picture being a bad picture, and each picture is The initial probability of a bad picture; the initial probability corresponding to each picture is calculated, and the bad probability that each picture in the preset picture set is a bad picture is generated; The probability interval is reviewed, and the picture corresponding to the bad probability in the original review probability interval is extracted to form a picture set corresponding to the original review probability interval.
在一个实施例中,根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间,包括:获取原始复审概率区间的原始上限概率和原始下限概率;根据原始上限概率和原始下限概率计算出二分类概率,通过二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In an embodiment, the image review probability interval is regenerated according to the first quantity and the second quantity corresponding to each set of the bad probability, including: obtaining the original upper limit probability and the original lower limit probability of the original review probability interval; according to the original upper limit probability and the original lower limit The probability of calculating the two-category probability, the sum of the second quantity corresponding to the second-class probability, and the sum of the first quantity corresponding to the two-class probability to the original upper-level probability, and the sum of the two, can be obtained by the two-class probability Minimum value; the original upper limit probability and the original lower limit probability are calculated according to the two-class probability, and the picture review probability interval is regenerated.
在一个实施例中,根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:当二分类概率小于等于预设概率时,根据公式
Figure PCTCN2017108480-appb-000021
进行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;当二分类概率大于预设概率时,根据公式
Figure PCTCN2017108480-appb-000022
进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;根据新的上限概率s2’或新的下限概率s1’,重新生成图片复审概率区间。
In an embodiment, the original upper bound probability and the original lower bound probability are calculated according to the second classification probability, and the image review probability interval is regenerated, including: when the second classification probability is less than or equal to the preset probability, according to the formula
Figure PCTCN2017108480-appb-000021
Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula
Figure PCTCN2017108480-appb-000022
Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper limit probability s 2 ' or new The lower bound probability s 1 ', regenerates the picture review probability interval.
在一个实施例中,获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之后,还包括:获取图片集对应的图片传输系统的图片系统权值;根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:根据二分类概率和图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In an embodiment, after obtaining the bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further includes: acquiring a picture system weight of the picture transmission system corresponding to the picture set; The original upper limit probability and the original lower limit probability are calculated, and the image review probability interval is regenerated, including: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight, and regenerating the picture review probability interval.
上述实施例中,通过将不良图片鉴定系统输出的图片的初始概率统一映射成不良概率进行比较,根据原始复审概率区间所对应的图片经过人工复审 后的结果,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量,根据第一数量和第二数量对原始复审概率区间进行计算,重新生成图片复审概率区间。通过结合人工复审后的结果,对复审概率区间进行调整,使得新生成的图片复审概率区间范围更加准确,减少图片复审的工作量,从而提高不良图片复审的效率。In the above embodiment, the initial probability of the picture outputted by the bad picture identification system is uniformly mapped to the bad probability for comparison, and the picture corresponding to the original review probability interval is manually reviewed. After the result, the statistics are determined to be the first number of normal pictures, and the second number determined to be a bad picture, and the original review probability interval is calculated according to the first quantity and the second quantity, and the picture review probability interval is regenerated. . By combining the results of the manual review, the review probability interval is adjusted, so that the newly generated image review probability interval range is more accurate, reducing the workload of image review, thereby improving the efficiency of poor image review.
在一个实施例中,如图6所示,提供了一种图片复审概率区间生成装置600,该装置包括:不良概率获取模块602,用于获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;图片判定数量统计模块604,用于对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;图片复审概率区间生成模块606,用于根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。In an embodiment, as shown in FIG. 6, a picture review probability interval generation device 600 is provided. The device includes: a bad probability acquisition module 602, configured to acquire each picture in the picture set corresponding to the original review probability interval. The bad probability of the bad picture; the picture determination quantity statistics module 604 is configured to, for each group of pictures having the same bad probability, count the first number of the normal picture to be determined as the normal picture, and the second quantity determined as the bad picture by the repeated trial; The picture review probability interval generation module 606 is configured to regenerate the picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probability.
在一个实施例中,如图7所示,提供了另一种图片复审概率区间生成装置700,该装置还包括原始复审图片集形成模块601,用于获取预设图片集中,每个图片为不良图片的初始概率;对每个图片对应的初始概率进行计算,生成预设图片集中每个图片为不良图片的不良概率;设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与原始复审概率区间相对应的图片集。In an embodiment, as shown in FIG. 7, another picture review probability interval generating apparatus 700 is provided. The apparatus further includes an original review picture set forming module 601, configured to acquire a preset picture set, and each picture is bad. The initial probability of the picture; the initial probability corresponding to each picture is calculated, and the bad probability of each picture in the preset picture set as a bad picture is generated; the original review probability interval is set, and the bad probability corresponding to the original review probability interval is extracted. The picture forms a set of pictures corresponding to the original review probability interval.
在一个实施例中,图片复审概率区间生成模块606还用于获取原始复审概率区间的原始上限概率和原始下限概率;根据原始上限概率和原始下限概率计算出二分类概率,通过二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In one embodiment, the picture review probability interval generation module 606 is further configured to obtain an original upper limit probability and an original lower limit probability of the original review probability interval; calculate a second classification probability according to the original upper limit probability and the original lower limit probability, and obtain the second classification probability by using the second classification probability The sum of the original lower bound probability to the second quantity corresponding to the second classification probability, the sum of the first quantity corresponding to the second classification probability to the original upper limit probability, and the sum of the sums; the original upper bound probability and the original according to the second classification probability The lower bound probability is calculated and the image review probability interval is regenerated.
在一个实施例中,图片复审概率区间生成模块606还用于当二分类概率小于等于预设概率时,根据公式
Figure PCTCN2017108480-appb-000023
进行计算,重新生成 上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;当二分类概率大于预设概率时,根据公式
Figure PCTCN2017108480-appb-000024
进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;根据新的上限概率s2’或新的下限概率s1’,重新生成图片复审概率区间。
In an embodiment, the picture review probability interval generation module 606 is further configured to: when the two classification probability is less than or equal to the preset probability, according to the formula
Figure PCTCN2017108480-appb-000023
Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula
Figure PCTCN2017108480-appb-000024
Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper limit probability s 2 ' or new The lower bound probability s 1 ', regenerates the picture review probability interval.
在一个实施例中,图片复审概率区间生成模块606还用于获取图片集对应的图片传输系统的图片系统权值;根据二分类概率和图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。在一个实施例中,如图8所示,提供了一种图片复审判定装置800,该装置包括:待鉴定图片不良概率获取模块802,用于获取待鉴定图片为不良图片的不良概率;图片复审判断模块804,用于判断不良概率是否在预设的图片复审概率区间之内,图片复审概率区间的生成方法包括:获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片中,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间;若是,则判定待鉴定图片需要进行图片复审。In an embodiment, the picture review probability interval generation module 606 is further configured to acquire a picture system weight of the picture transmission system corresponding to the picture set; and calculate the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight value. , regenerate the image review probability interval. In an embodiment, as shown in FIG. 8, a picture re-determination device 800 is provided. The device includes: a picture defect probability acquisition module 802 to be identified, which is used to obtain a bad probability that a picture to be identified is a bad picture; The determining module 804 is configured to determine whether the bad probability is within a preset image review probability interval, and the method for generating the image review probability interval includes: obtaining a bad probability that each image in the image set corresponding to the original review probability interval is a bad image For each group of pictures with the same bad probability, the statistics are determined to be the first number of normal pictures, and the second number determined to be a bad picture; the first quantity and the second quantity corresponding to each group of bad probabilities The image review probability interval is regenerated; if it is, then the image to be identified needs to be reviewed.
在一个实施例中,在获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之前,包括:获取预设图片集中,每个图片为不良图片的初始概率;对每个图片对应的初始概率进行计算,生成预设图片集中每个图片为不良图片的不良概率;设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与原始复审概率区间相对应的图片集。In an embodiment, before acquiring the bad probability that each picture is a bad picture in the picture set corresponding to the original review probability interval, the method includes: acquiring a preset picture set, each picture being an initial probability of a bad picture; The initial probability corresponding to the picture is calculated, and the bad probability of each picture in the preset picture set as a bad picture is generated; the original review probability interval is set, and the picture corresponding to the bad probability in the original review probability interval is extracted, and the original review probability interval is formed. Corresponding picture set.
在一个实施例中,根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间,包括:获取原始复审概率区间的原始上限概率和原始下限概率;根据原始上限概率和原始下限概率计算出二分类概率,通过二分 类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In an embodiment, the image review probability interval is regenerated according to the first quantity and the second quantity corresponding to each set of the bad probability, including: obtaining the original upper limit probability and the original lower limit probability of the original review probability interval; according to the original upper limit probability and the original lower limit Probability calculates the probability of two classifications, passing the two points The class probability can obtain the sum of the original lower bound probability to the second quantity corresponding to the second classification probability, the sum of the first quantity corresponding to the second classification probability to the original upper limit probability, and the sum of the sums; the original according to the second classification probability The upper limit probability and the original lower limit probability are calculated, and the picture review probability interval is regenerated.
在一个实施例中,根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:当二分类概率小于等于预设概率时,根据公式
Figure PCTCN2017108480-appb-000025
进行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;当二分类概率大于预设概率时,根据公式
Figure PCTCN2017108480-appb-000026
进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;根据新的上限概率s2’或新的下限概率s1’,重新生成图片复审概率区间。
In an embodiment, the original upper bound probability and the original lower bound probability are calculated according to the second classification probability, and the image review probability interval is regenerated, including: when the second classification probability is less than or equal to the preset probability, according to the formula
Figure PCTCN2017108480-appb-000025
Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; when the second classification probability is greater than the preset probability, According to the formula
Figure PCTCN2017108480-appb-000026
Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, x 0 is the preset probability; according to the new upper limit probability s 2 ' or new The lower bound probability s 1 ', regenerates the picture review probability interval.
在一个实施例中,获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之后,还包括:获取图片集对应的图片传输系统的图片系统权值;根据二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:根据二分类概率和图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。In an embodiment, after obtaining the bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further includes: acquiring a picture system weight of the picture transmission system corresponding to the picture set; The original upper limit probability and the original lower limit probability are calculated, and the image review probability interval is regenerated, including: calculating the original upper limit probability and the original lower limit probability according to the second classification probability and the picture system weight, and regenerating the picture review probability interval.
上述的图片复审概率区间生成装置和图片复审判定装置可以实现为一种计算机可读指令的形式,计算机可读指令可在如图9的第一计算机设备上运行。The above picture review probability interval generating means and picture retrieving means may be implemented in the form of a computer readable instruction which may be run on the first computer device as in FIG.
上述的图片复审概率区间生成装置和图片复审判定装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备的存储器中,也可以以软件形式存储于计算机设备的存储器中,以便于处理器调用执行以上各个模块对应的操作。该处理器可以 为中央处理单元(CPU)、微处理器、单片机等。Each of the modules in the picture review probability interval generation device and the picture review device may be implemented in whole or in part by software, hardware, and combinations thereof. Each of the above modules may be embedded in or independent of the memory of the computer device in hardware, or may be stored in the memory of the computer device in software form, so that the processor invokes the operations corresponding to the above modules. The processor can It is a central processing unit (CPU), a microprocessor, a single chip microcomputer, and the like.
在一个实施例中,提供了一个或多个存储有计算机可读指令的非易失性可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行上述各个实施例中图片复审概率区间生成方法的步骤。In one embodiment, there is provided one or more non-transitory readable storage mediums storing computer readable instructions that, when executed by one or more processors, cause one or more processors to execute The steps of the picture review probability interval generation method in each of the above embodiments.
在一个实施例中,提供了一个或多个存储有计算机可读指令的非易失性可读存储介质,计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行上述各个实施例中图片复审判定方法的步骤。In one embodiment, there is provided one or more non-transitory readable storage media having computer readable instructions that, when executed by one or more processors, cause one or more processors to execute The steps of the picture review method in the various embodiments described above.
在一个实施例中,提供了一种计算机设备,包括存储器和一个或多个处理器,存储器中存储有计算机可读指令,计算机可读指令被处理器执行时,使得一个或多个处理器执行上述各个实施例中图片复审概率区间生成方法的步骤。In one embodiment, a computer apparatus is provided comprising a memory and one or more processors having stored therein computer readable instructions that, when executed by a processor, cause one or more processors to execute The steps of the picture review probability interval generation method in each of the above embodiments.
在一个实施例中,提供了一种计算机设备,包括存储器和一个或多个处理器,存储器中存储有计算机可读指令,计算机可读指令被处理器执行时,使得一个或多个处理器执行上述各个实施例中图片复审判定方法的步骤。In one embodiment, a computer apparatus is provided comprising a memory and one or more processors having stored therein computer readable instructions that, when executed by a processor, cause one or more processors to execute The steps of the picture review method in the various embodiments described above.
在一个实施例中,上述的计算机设备可包括但不限于独立的物理服务器,或者是多个物理服务器构成的服务器集群。如图9,为一个实施例中计算机设备的内部结构示意图。该计算机设备可应用于图1的应用环境中的第一计算机设备102。该计算机设备包括通过系统总线连接的处理器、非易失性存储介质、内存储器和网络接口。其中,该计算机设备的处理器用于提供计算和控制能力,支撑整个计算机设备的运行。计算机设备的非易失性存储介质存储有操作系统、数据库和计算机可读指令。该数据库中存储有用于实现以上各个实施例所提供的一种图片复审概率区间生成方法和图片复审判定方法相关的数据,比如可存储有预设的图片集中每个图片集所对应的初始概率。该计算机可读指令可被处理器所执行,以用于实现以上各个实施例所提供的一种图片复审概率区间生成方法和一种图片复审判定方法。计算机设备中的内存储器为非易失性存储介质中的操作系统、数据库和计算机可读指令提供高速缓存的运行环境。网络接口可以是以太网卡或无线网卡等,用于与外部 的终端或服务器进行通信,比如与图1的应用环境中的第二计算机设备104进行通信,获取预设图片集中每个图片为不良图片的初始概率。In one embodiment, the computer device described above may include, but is not limited to, a stand-alone physical server, or a server cluster of multiple physical servers. FIG. 9 is a schematic diagram showing the internal structure of a computer device in an embodiment. The computer device is applicable to the first computer device 102 in the application environment of FIG. The computer device includes a processor coupled through a system bus, a non-volatile storage medium, an internal memory, and a network interface. The processor of the computer device is used to provide computing and control capabilities to support the operation of the entire computer device. A non-volatile storage medium of a computer device stores an operating system, a database, and computer readable instructions. The database stores data related to a picture review probability interval generation method and a picture re-determination method provided by the foregoing various embodiments, for example, an initial probability corresponding to each picture set in a preset picture set may be stored. The computer readable instructions are executable by the processor for implementing a picture review probability interval generation method and a picture retry method provided by the above various embodiments. The internal memory in the computer device provides a cached operating environment for operating systems, databases, and computer readable instructions in a non-volatile storage medium. The network interface can be an Ethernet card or a wireless network card, etc., for external use. The terminal or server communicates, such as communicating with the second computer device 104 in the application environment of FIG. 1, to obtain an initial probability that each picture in the preset picture set is a bad picture.
本领域技术人员可以理解,图9中示出的计算机设备的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。比如,该图中的计算机设备还可包括显示屏等。It will be understood by those skilled in the art that the structure of the computer device shown in FIG. 9 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation of the computer device to which the solution of the present application is applied. The computer device may include more or fewer components than those in the figures, or some components may be combined, or have different component arrangements. For example, the computer device in the figure may also include a display screen or the like.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)等。One of ordinary skill in the art can understand that all or part of the process of implementing the above embodiments can be completed by computer readable instructions, which can be stored in a non-volatile computer. The readable storage medium, which when executed, may include the flow of an embodiment of the methods as described above. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or the like.
以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above-described embodiments may be arbitrarily combined. For the sake of brevity of description, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction between the combinations of these technical features, All should be considered as the scope of this manual.
以上所述实施例仅表达了本申请的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本申请构思的前提下,还可以做出若干变形和改进,这些都属于本申请的保护范围。因此,本申请专利的保护范围应以所附权利要求为准。 The above-mentioned embodiments are merely illustrative of several embodiments of the present application, and the description thereof is more specific and detailed, but is not to be construed as limiting the scope of the invention. It should be noted that a number of variations and modifications may be made by those skilled in the art without departing from the spirit and scope of the present application. Therefore, the scope of the invention should be determined by the appended claims.

Claims (20)

  1. 一种图片复审概率区间生成方法,包括:A method for generating a picture review probability interval, comprising:
    获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;Obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture;
    对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;及For each group of pictures with the same probability of failure, the statistics are determined to be the first number of normal pictures, and the second number determined to be a bad picture; and
    根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。The picture review probability interval is regenerated according to the first quantity and the second quantity corresponding to each group of bad probabilities.
  2. 根据权利要求1所述的方法,其特征在于,在所述获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之前,还包括:The method according to claim 1, wherein before the obtaining, in the set of pictures corresponding to the original review probability interval, each picture is a bad probability of a bad picture, the method further comprises:
    获取预设图片集中,每个图片为不良图片的初始概率;Obtain the initial probability that each picture is a bad picture in the preset picture set;
    对每个图片对应的初始概率进行计算,生成所述预设图片集中每个图片为不良图片的不良概率;及Calculating an initial probability corresponding to each picture, and generating a bad probability that each picture in the preset picture set is a bad picture; and
    设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与所述原始复审概率区间相对应的图片集。The original review probability interval is set, and the picture corresponding to the bad probability in the original review probability interval is extracted, and a picture set corresponding to the original review probability interval is formed.
  3. 根据权利要求1所述的方法,其特征在于,所述根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间,包括:The method according to claim 1, wherein the regenerating the picture review probability interval according to the first quantity and the second quantity corresponding to each set of the bad probability comprises:
    获取所述原始复审概率区间的原始上限概率和原始下限概率;Obtaining an original upper limit probability and an original lower limit probability of the original review probability interval;
    根据所述原始上限概率和原始下限概率计算出二分类概率,通过所述二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;及Calculating a binary classification probability according to the original upper bound probability and the original lower bound probability, by which the sum of the original lower bound probability to the second quantity corresponding to the second classification probability may be obtained, corresponding to the second classification probability to the original upper bound probability The sum of the first quantities, the sum of the sum of the two; and
    根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the binary classification probability, and regenerating the picture review probability interval.
  4. 根据权利要求3述的方法,其特征在于,所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:The method according to claim 3, wherein the calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval, comprising:
    当二分类概率小于等于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100001
    进 行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;
    When the two classification probability is less than or equal to the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100001
    Calculate and regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    当二分类概率大于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100002
    进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;及
    When the two classification probability is greater than the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100002
    Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    根据所述新的上限概率s2’或所述新的下限概率s1’,重新生成图片复审概率区间。The picture review probability interval is regenerated according to the new upper limit probability s 2 ' or the new lower limit probability s 1 '.
  5. 根据权利要求3所述的方法,其特征在于,在所述获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率之后,还包括:The method according to claim 3, further comprising: after the obtaining the bad probability that each picture is a bad picture in the picture set corresponding to the original review probability interval, further comprising:
    获取所述图片集对应的图片传输系统的图片系统权值;及Obtaining a picture system weight of the picture transmission system corresponding to the picture set; and
    所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:The calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval, including:
    根据所述二分类概率和所述图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the two classification probability and the picture system weight, and regenerating the picture review probability interval.
  6. 一种图片复审判定方法,所述方法包括:A picture re-determination method, the method comprising:
    获取待鉴定图片为不良图片的不良概率;Obtaining a bad probability that the picture to be identified is a bad picture;
    判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间根据权利要求1至5中任一项所述图片复审概率区间生成方法生成;及Determining whether the bad probability is within a preset picture review probability interval, and the picture review probability interval is generated according to the picture review probability interval generation method according to any one of claims 1 to 5;
    若是,则判定所述待鉴定图片需要进行图片复审。If yes, determining that the picture to be identified requires a picture review.
  7. 一种图片复审概率区间生成装置,其特征在于,所述装置包括:A picture review probability interval generating device, wherein the device comprises:
    不良概率获取模块,用于获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;a bad probability acquisition module, configured to obtain a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture;
    图片判定数量统计模块,用于对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量; 及a picture determination quantity statistics module, configured to determine, for each group of pictures having the same bad probability, a first number of normal pictures to be determined as a normal picture, and a second quantity to be determined as a bad picture by the repeated trial; and
    图片复审概率区间生成模块,用于根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。The picture review probability interval generating module is configured to regenerate the picture review probability interval according to the first quantity and the second quantity corresponding to each set of bad probability.
  8. 一种图片复审判定装置,其特征在于,所述装置包括:A picture re-determination device, characterized in that the device comprises:
    待鉴定图片不良概率获取模块,用于获取待鉴定图片为不良图片的不良概率;A bad probability acquisition module for identifying a picture, which is used to obtain a bad probability that the picture to be identified is a bad picture;
    图片复审判断模块,用于判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间根据权利要求1至5中任一项所述图片复审概率区间生成方法生成;若是,则判定所述待鉴定图片需要进行图片复审。a picture review judging module, configured to determine whether the bad probability is within a preset picture review probability interval, and the picture review probability interval is generated according to the picture review probability interval generation method according to any one of claims 1 to 5; If yes, determining that the picture to be identified requires a picture review.
  9. 一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:One or more non-transitory readable storage mediums storing computer readable instructions, when executed by one or more processors, cause the one or more processors to perform the following steps:
    获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;Obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture;
    对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;及For each group of pictures with the same probability of failure, the statistics are determined to be the first number of normal pictures, and the second number determined to be a bad picture; and
    根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间。The picture review probability interval is regenerated according to the first quantity and the second quantity corresponding to each group of bad probabilities.
  10. 根据权利要求9所述的存储介质,其特征在于,在所述处理器所执行的获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率的步骤之前,还包括:The storage medium according to claim 9, wherein before the step of acquiring, by the processor, the probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further comprises:
    获取预设图片集中,每个图片为不良图片的初始概率;Obtain the initial probability that each picture is a bad picture in the preset picture set;
    对每个图片对应的初始概率进行计算,生成所述预设图片集中每个图片为不良图片的不良概率;及Calculating an initial probability corresponding to each picture, and generating a bad probability that each picture in the preset picture set is a bad picture; and
    设定原始复审概率区间,并提取位于原始复审概率区间的不良概率对应的图片,形成与所述原始复审概率区间相对应的图片集。The original review probability interval is set, and the picture corresponding to the bad probability in the original review probability interval is extracted, and a picture set corresponding to the original review probability interval is formed.
  11. 根据权利要求9所述的存储介质,其特征在于,所述处理器所执行 的根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间的步骤,包括:A storage medium according to claim 9, wherein said processor executes The steps of regenerating the image review probability interval according to the first quantity and the second quantity corresponding to each group of bad probabilities, including:
    获取所述原始复审概率区间的原始上限概率和原始下限概率;Obtaining an original upper limit probability and an original lower limit probability of the original review probability interval;
    根据所述原始上限概率和原始下限概率计算出二分类概率,通过所述二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;及Calculating a binary classification probability according to the original upper bound probability and the original lower bound probability, by which the sum of the original lower bound probability to the second quantity corresponding to the second classification probability may be obtained, corresponding to the second classification probability to the original upper bound probability The sum of the first quantities, the sum of the sum of the two; and
    根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the binary classification probability, and regenerating the picture review probability interval.
  12. 根据权利要求11所述的存储介质,其特征在于,所述处理器所执行的根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间的步骤,包括:The storage medium according to claim 11, wherein the step of calculating, by the processor, the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval comprises:
    当二分类概率小于等于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100003
    进行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;
    When the two classification probability is less than or equal to the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100003
    Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    当二分类概率大于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100004
    进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;及
    When the two classification probability is greater than the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100004
    Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    根据所述新的上限概率s2’或所述新的下限概率s1’,重新生成图片复审概率区间。The picture review probability interval is regenerated according to the new upper limit probability s 2 ' or the new lower limit probability s 1 '.
  13. 根据权利要求11所述的存储介质,其特征在于,在所述处理器所执行的获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率的步骤之后,还包括:The storage medium according to claim 11, wherein after the step of obtaining, by the processor, the probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further includes:
    获取所述图片集对应的图片传输系统的图片系统权值;及Obtaining a picture system weight of the picture transmission system corresponding to the picture set; and
    所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重 新生成图片复审概率区间,包括:Calculating the original upper bound probability and the original lower bound probability according to the two classification probability, The newly generated image review probability interval includes:
    根据所述二分类概率和所述图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the two classification probability and the picture system weight, and regenerating the picture review probability interval.
  14. 一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行以下步骤:One or more non-transitory readable storage mediums storing computer readable instructions, when executed by one or more processors, cause the one or more processors to perform the following steps:
    获取待鉴定图片为不良图片的不良概率;Obtaining a bad probability that the picture to be identified is a bad picture;
    判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间的生成步骤包括获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间;及Determining whether the bad probability is within a preset picture review probability interval, and the step of generating the picture review probability interval includes obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; The group has the same bad probability picture, the statistics are determined to be the first number of normal pictures, and the second number is determined as the bad picture; the first quantity and the second quantity corresponding to each group of bad probability are regenerated. Probability interval; and
    若是,则判定所述待鉴定图片需要进行图片复审。If yes, determining that the picture to be identified requires a picture review.
  15. 一种计算机设备,包括存储器和一个或多个处理器,所述存储器中存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device comprising a memory and one or more processors, the memory storing computer readable instructions, the computer readable instructions being executed by the processor, causing the one or more processors to execute The following steps:
    获取所述原始复审概率区间的原始上限概率和原始下限概率;Obtaining an original upper limit probability and an original lower limit probability of the original review probability interval;
    根据所述原始上限概率和原始下限概率计算出二分类概率,通过所述二分类概率能够获得原始下限概率至二分类概率所对应的第二数量之和,与二分类概率至原始上限概率所对应的第一数量之和,二者总和的最小值;及Calculating a binary classification probability according to the original upper bound probability and the original lower bound probability, by which the sum of the original lower bound probability to the second quantity corresponding to the second classification probability may be obtained, corresponding to the second classification probability to the original upper bound probability The sum of the first quantities, the sum of the sum of the two; and
    根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the binary classification probability, and regenerating the picture review probability interval.
  16. 根据权利要求15所述的计算机设备,其特征在于,所述处理器所执行的根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间的步骤,包括:The computer device according to claim 15, wherein the step of calculating, by the processor, the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval comprises:
    当二分类概率小于等于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100005
    进 行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;
    When the two classification probability is less than or equal to the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100005
    Calculate and regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    当二分类概率大于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100006
    进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;及
    When the two classification probability is greater than the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100006
    Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    根据所述新的上限概率s2’或所述新的下限概率s1’,重新生成图片复审概率区间。The picture review probability interval is regenerated according to the new upper limit probability s 2 ' or the new lower limit probability s 1 '.
  17. 根据权利要求15所述的计算机设备,其特征在于,在所述处理器所执行的获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率的步骤之后,还包括:The computer device according to claim 15, wherein after the step of acquiring, by the processor, the probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further comprises:
    获取所述图片集对应的图片传输系统的图片系统权值;及Obtaining a picture system weight of the picture transmission system corresponding to the picture set; and
    所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:The calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval, including:
    根据所述二分类概率和所述图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the two classification probability and the picture system weight, and regenerating the picture review probability interval.
  18. 根据权利要求17所述的计算机设备,其特征在于,所述处理器所执行的根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间的步骤,包括:The computer device according to claim 17, wherein the step of calculating, by the processor, the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval comprises:
    当二分类概率小于等于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100007
    进行计算,重新生成上限概率s2’,其中,s1为原始下限概率,s2为原始上限概率,t为二分类概率,x0为预设概率;
    When the two classification probability is less than or equal to the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100007
    Perform calculation to regenerate the upper limit probability s 2 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    当二分类概率大于预设概率时,根据公式
    Figure PCTCN2017108480-appb-100008
    进行计算,重新生成下限概率s1’,其中,s1为原始下限概率,s2为原始上限概率,t为二 分类概率,x0为预设概率;及
    When the two classification probability is greater than the preset probability, according to the formula
    Figure PCTCN2017108480-appb-100008
    Calculate and regenerate the lower bound probability s 1 ', where s 1 is the original lower bound probability, s 2 is the original upper bound probability, t is the second classification probability, and x 0 is the preset probability;
    根据所述新的上限概率s2’或所述新的下限概率s1’,重新生成图片复审概率区间。The picture review probability interval is regenerated according to the new upper limit probability s 2 ' or the new lower limit probability s 1 '.
  19. 根据权利要求17所述的计算机设备,其特征在于,在所述处理器所执行的获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率的步骤之后,还包括:The computer device according to claim 17, wherein after the step of acquiring, by the processor, the probability that each picture in the picture set corresponding to the original review probability interval is a bad picture, the method further comprises:
    获取所述图片集对应的图片传输系统的图片系统权值;及Obtaining a picture system weight of the picture transmission system corresponding to the picture set; and
    所述根据所述二分类概率对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间,包括:The calculating the original upper limit probability and the original lower limit probability according to the two classification probability, and regenerating the picture review probability interval, including:
    根据所述二分类概率和所述图片系统权值,对原始上限概率和原始下限概率进行计算,重新生成图片复审概率区间。Calculating the original upper bound probability and the original lower bound probability according to the two classification probability and the picture system weight, and regenerating the picture review probability interval.
  20. 一种计算机设备,包括存储器和一个或多个处理器,所述存储器中存储有计算机可读指令,所述计算机可读指令被所述处理器执行时,使得所述一个或多个处理器执行以下步骤:A computer device comprising a memory and one or more processors, the memory storing computer readable instructions, the computer readable instructions being executed by the processor, causing the one or more processors to execute The following steps:
    获取待鉴定图片为不良图片的不良概率;Obtaining a bad probability that the picture to be identified is a bad picture;
    判断所述不良概率是否在预设的图片复审概率区间之内,所述图片复审概率区间的生成步骤包括获取与原始复审概率区间相对应的图片集中每个图片为不良图片的不良概率;对于每组具有相同不良概率的图片,统计被复审判定为正常图片的第一数量,和被复审判定为不良图片的第二数量;根据每组不良概率对应的第一数量和第二数量重新生成图片复审概率区间;及Determining whether the bad probability is within a preset picture review probability interval, and the step of generating the picture review probability interval includes obtaining a bad probability that each picture in the picture set corresponding to the original review probability interval is a bad picture; The group has the same bad probability picture, the statistics are determined to be the first number of normal pictures, and the second number is determined as the bad picture; the first quantity and the second quantity corresponding to each group of bad probability are regenerated. Probability interval; and
    若是,则判定所述待鉴定图片需要进行图片复审。 If yes, determining that the picture to be identified requires a picture review.
PCT/CN2017/108480 2017-09-25 2017-10-31 Method for generating picture review probability interval and method for picture review determination WO2019056496A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710876917.1 2017-09-25
CN201710876917.1A CN107729924B (en) 2017-09-25 2017-09-25 Picture review probability interval generation method and picture review determination method

Publications (1)

Publication Number Publication Date
WO2019056496A1 true WO2019056496A1 (en) 2019-03-28

Family

ID=61206890

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/108480 WO2019056496A1 (en) 2017-09-25 2017-10-31 Method for generating picture review probability interval and method for picture review determination

Country Status (2)

Country Link
CN (1) CN107729924B (en)
WO (1) WO2019056496A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108491866B (en) * 2018-03-06 2022-09-13 平安科技(深圳)有限公司 Pornographic picture identification method, electronic device and readable storage medium
CN109491970A (en) * 2018-10-11 2019-03-19 平安科技(深圳)有限公司 Imperfect picture detection method, device and storage medium towards cloud storage
CN109886865B (en) * 2019-01-07 2024-01-23 平安科技(深圳)有限公司 Method, device, computer equipment and storage medium for automatically shielding bad information
CN112445410B (en) * 2020-12-07 2023-04-18 北京小米移动软件有限公司 Touch event identification method and device and computer readable storage medium
CN113095563A (en) * 2021-04-07 2021-07-09 全球能源互联网研究院有限公司 Method and device for reviewing prediction result of artificial intelligence model

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101359372A (en) * 2008-09-26 2009-02-04 腾讯科技(深圳)有限公司 Training method and device of classifier, and method apparatus for recognising sensitization picture
CN106485528A (en) * 2015-09-01 2017-03-08 阿里巴巴集团控股有限公司 The method and apparatus of detection data
CN106776842A (en) * 2016-11-28 2017-05-31 腾讯科技(上海)有限公司 Multi-medium data detection method and device

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101996180A (en) * 2009-08-12 2011-03-30 升东网络科技发展(上海)有限公司 Picture examination and filter system and method
CN101692639A (en) * 2009-09-15 2010-04-07 西安交通大学 Bad webpage recognition method based on URL
CN102340424B (en) * 2010-07-21 2013-12-04 中国移动通信集团山东有限公司 Bad message detection method and bad message detection device
JP5707291B2 (en) * 2011-09-29 2015-04-30 株式会社日立ハイテクノロジーズ Charged particle beam system that supports image classification
CN103064858B (en) * 2011-10-19 2016-03-30 北京千橡网景科技发展有限公司 Method and apparatus for objectionable image detection in social networking websites
CN102663093B (en) * 2012-04-10 2014-07-09 中国科学院计算机网络信息中心 Method and device for detecting bad website
CN105654057A (en) * 2015-12-31 2016-06-08 中国建设银行股份有限公司 Picture auditing system and picture auditing method based on picture contents
CN107145904A (en) * 2017-04-28 2017-09-08 北京小米移动软件有限公司 Determination method, device and the storage medium of image category

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101359372A (en) * 2008-09-26 2009-02-04 腾讯科技(深圳)有限公司 Training method and device of classifier, and method apparatus for recognising sensitization picture
CN106485528A (en) * 2015-09-01 2017-03-08 阿里巴巴集团控股有限公司 The method and apparatus of detection data
CN106776842A (en) * 2016-11-28 2017-05-31 腾讯科技(上海)有限公司 Multi-medium data detection method and device

Also Published As

Publication number Publication date
CN107729924A (en) 2018-02-23
CN107729924B (en) 2019-02-19

Similar Documents

Publication Publication Date Title
WO2019056496A1 (en) Method for generating picture review probability interval and method for picture review determination
EP3971798A1 (en) Data processing method and apparatus, and computer readable storage medium
WO2022179138A1 (en) Image processing method and apparatus, and computer device and storage medium
CN111178410A (en) Illegal picture identification method and device
WO2021189855A1 (en) Image recognition method and apparatus based on ct sequence, and electronic device and medium
CN109214501B (en) Method and apparatus for identifying information
WO2022105119A1 (en) Training corpus generation method for intention recognition model, and related device thereof
JP6664585B2 (en) Information processing apparatus, information processing method, and information processing program
CN107291774B (en) Error sample identification method and device
WO2021196935A1 (en) Data checking method and apparatus, electronic device, and storage medium
CN112214770B (en) Malicious sample identification method, device, computing equipment and medium
WO2021174869A1 (en) User image data processing method, apparatus, computer device, and storage medium
WO2022105120A1 (en) Text detection method and apparatus from image, computer device and storage medium
CN113641823B (en) Text classification model training, text classification method, device, equipment and medium
JP2022153339A (en) Record matching in database system (computer-implemented method, computer program and computer system for record matching in database system)
CN112182520B (en) Identification method and device of illegal account number, readable medium and electronic equipment
CN113609018A (en) Test method, training method, device, apparatus, medium, and program product
CN109885710B (en) User image depicting method based on differential evolution algorithm and server
CN113052509A (en) Model evaluation method, model evaluation apparatus, electronic device, and storage medium
CN113869904A (en) Suspicious data identification method, device, electronic equipment, medium and computer program
CN112784990A (en) Training method of member inference model
CN112149391A (en) Information processing method, information processing apparatus, terminal device, and storage medium
CN115250200B (en) Service authorization authentication method and related equipment thereof
CN116383391B (en) Text classification method and device
CN111818114B (en) Information sharing method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17926146

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 25/09/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 17926146

Country of ref document: EP

Kind code of ref document: A1