CN112231076A - Data annotation task scheduling method based on intelligent optimization - Google Patents

Data annotation task scheduling method based on intelligent optimization Download PDF

Info

Publication number
CN112231076A
CN112231076A CN202010985546.2A CN202010985546A CN112231076A CN 112231076 A CN112231076 A CN 112231076A CN 202010985546 A CN202010985546 A CN 202010985546A CN 112231076 A CN112231076 A CN 112231076A
Authority
CN
China
Prior art keywords
individual
population
wei
subscript
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010985546.2A
Other languages
Chinese (zh)
Other versions
CN112231076B (en
Inventor
陈远存
郭肇禄
谭力江
蔡岳城
李凯辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oking Information Industry Co ltd
Original Assignee
Guangdong Oking Information Industry Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oking Information Industry Co ltd filed Critical Guangdong Oking Information Industry Co ltd
Priority to CN202010985546.2A priority Critical patent/CN112231076B/en
Publication of CN112231076A publication Critical patent/CN112231076A/en
Application granted granted Critical
Publication of CN112231076B publication Critical patent/CN112231076B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5016Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals the resource being the memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a data annotation task scheduling method based on intelligent optimization. Aiming at the defect that the traditional whale optimization algorithm is easy to fall into local optimization when solving the data annotation task scheduling problem, the invention provides an improved whale optimization algorithm for solving the data annotation task scheduling problem. In the improved whale optimization algorithm, an optimization rate is calculated based on an adaptive value of an individual, a proximity is calculated based on an Euclidean distance of the individual, the optimization rate and the proximity are subjected to information fusion by utilizing an orientation coefficient to construct an orientation rate of the individual, the individual is selected according to the orientation rate to guide searching, the convergence speed of the algorithm is improved, meanwhile, the diversity of a population is kept, the probability of being trapped into local optimization can be reduced, and therefore the execution efficiency of a data labeling task is improved.

Description

Data annotation task scheduling method based on intelligent optimization
Technical Field
The invention relates to the field of cloud computing task scheduling, in particular to a data annotation task scheduling method based on intelligent optimization.
Background
In recent years, artificial intelligence techniques are being pursued to various aspects of human life. Artificial intelligence technology has become an indispensable part of human life, for example, intelligent navigation, parking lot intelligent barrier, fingerprint attendance, face-brushing payment, and the like. Annotation data is the basis for artificial intelligence techniques. The accuracy and the usability of the artificial intelligence technology can be improved better only if the artificial intelligence technology has massive marking data. People have been able to collect vast amounts of data through internet technology. However, annotating the mass data collected is a very time consuming task.
In order to realize the labeling of mass data, researchers propose an intelligent data labeling method based on weak supervision and unsupervised. However, the intelligent data annotation method is a very computing resource consuming task. The intelligent data annotation method is realized by adopting a data annotation system based on cloud computing. In a data annotation system based on cloud computing, technicians often need to solve the scheduling problem of data annotation tasks [ zuo, d., Liu, h., Gao, l.,&Li,S.(2011).An improved differential evolution algorithm for the task assignment problem.Engineering Applications of Artificial Intelligence,24(4),616-624.]: the data annotation system based on cloud computing is provided with ND station annotation servers, wherein each annotation server has a memory size value MZkiAnd throughput value PZkiWherein ki 1,2, ND; now given MT data labeling tasks, wherein the memory size value required by each data labeling task is TRti(ii) a The processing capability value required for each data labeling task is PRtiAnd the time value required by the ti-th data annotation task to be executed and completed on the ki-th annotation server is given as ETki,tiWherein ti is 1, 2.. said, MT; the MT data annotation tasks are required to be distributed to ND annotation servers for execution, and one data annotation task can be distributed to only one server for execution; scheduling method for data annotation task required to be determinedThe method minimizes the total execution time of MT data labeling tasks, and also requires that the total amount of memory size required by all tasks distributed to the ki server cannot exceed MZkiThe total amount of processing power required for all tasks allocated to the ki server cannot exceed the PZki. The data labeling task scheduling problem is an NP completeness problem, and a better scheduling scheme is difficult to find in an acceptable time by using a traditional method. To this end, researchers use intelligent optimization methods to solve.
Whale optimization algorithm is an intelligent optimization method (Mirjalli, S., & Lewis, A. (2016.). The while optimization algorithm in engineering software,95, 51-67.) proposed in recent years, and obtains satisfactory results in solving many practical engineering problems. However, the traditional whale optimization algorithm is easy to fall into local optimization when solving the scheduling problem of the data annotation task, so that the execution efficiency of the scheduled data annotation task is not high.
Disclosure of Invention
The invention provides a data annotation task scheduling method based on intelligent optimization. The method overcomes the defect that the traditional whale optimization algorithm is easy to fall into local optimization when applied to the scheduling problem of the data annotation task to a certain extent, and can improve the execution efficiency of the data annotation task.
The technical scheme of the invention is as follows: a data annotation task scheduling method based on intelligent optimization comprises the following steps:
step 1, inputting the number ND of the labeling servers, and then inputting the number MT of the data labeling tasks;
step 2, inputting the memory size value MZ of the ND station mark serverkiAnd throughput value PZkiWherein subscript ki ═ 1, 2., ND;
step 3, inputting the memory size TR required by the MT data labeling taskstiAnd a processing capability value PR required for the MT data tagging tasktiWherein subscript ti 1, 2. ·, MT;
step 4, inputting the time required for the ti-th data annotation task to be executed and completed on the ki station annotation serverValue between ETki,tiWherein subscript ki 1, 2.., ND, and subscript ti 1, 2.., MT;
step 5, inputting a population scale WN, and then inputting a maximum iteration number MaxT;
step 6, setting the current iteration time t to be 0;
step 7, randomly generating WN individual composition population WP ═ { WA1,WA2,...,WAri,...,WAWNWhere WAri=[WAri,1,WAri,2,...,WAri,wk,...,WAri,MT]Represents the ri th individual in the population and individual WAsriThe distribution weight of MT data labeling tasks is stored; WAri,wkRepresenting an individual WAriThe distribution weight of the wk data marking task stored in the database; wherein the individual subscript ri is 1, 2. Task subscript wk — 1, 2.., MT;
step 8, calculating the adaptive value of each individual in the population; WA for individuals in a populationri Individual subscript ri 1, 2.., WN, with the fitness value calculation process: first, the individual WAriDecoding the stored distribution weight values of the MT data annotation tasks into an annotation server distribution list SEL, and converting the annotation server distribution list SEL into a distribution state matrix STA; then calculating the individual WA according to equation (1)riAdapted value of (WFit)ri
Figure BDA0002689063310000021
Wherein the STAki,tiA value representing the element of the kth row and the tth column of the assignment state matrix STA; pw1 represents a memory penalty factor; pw2 represents a processing power penalty factor; MFb represents a memory over-limit (i.e., a quantity value by which the memory demand exceeds the limit); PFb denotes a processing capacity over limit; max represents a maximum function;
step 9, storing the individual with the minimum fitness value in the population to the optimal individual WBestA;
step 10, then calculate the quality PWFit of each individual in the population according to equation (2)wei
Figure BDA0002689063310000031
Wherein, subscript wei ═ 1, 2.., WN; WFitweiRepresenting the fitness value of the wei-th individual in the population;
step 11, calculating the preferred rate ESP of each individual in the population according to the formula (3)wei
Figure BDA0002689063310000032
Wherein, subscript wei ═ 1, 2.., WN; wti are cumulative subscripts;
step 12, setting the counting variable wi to 1, and then calculating the orientation coefficient dcw according to the formula (4):
Figure BDA0002689063310000033
step 13, calculating individual WA according to formula (5)wiThe Euclidean distance WID between each body in the populationwei
WIDwei=dist(WAwi,WAwei) (5);
Wherein, subscript wei ═ 1, 2.., WN; dist represents the calculation of the Euclidean distance function; WAwiRepresenting the wi th individual in the population; WAweiRepresents the wei-th individual in the population; WIDweiRepresenting an individual WAwiAnd individual WAweiThe euclidean distance between;
step 14, calculating the individual WA according to equation (6)wiProximity WAP to each individual in a populationwei
Figure BDA0002689063310000034
Wherein, subscript wei ═ 1, 2.., WN; wti are cumulative subscripts;
step 15, according to the formula (7)Calculating the orientation ratio GP of each individual in the populationwei
Figure BDA0002689063310000035
Wherein, subscript wei ═ 1, 2.., WN;
step 16, according to the orientation rate GP of each individual in the populationweiSelecting directional individual DIE from the population by adopting a roulette selection method;
step 17, randomly generating a real number wrp; if wrp is less than 0.5, go to step 20, otherwise go to step 18;
step 18, executing the spiral hunting operator to generate new individual NV according to the formula (8)wi
NVwi=DIE+|2×wr1×DIE-WAwi|×exp(wr2)×cos(2×π×wr2) (8);
Wherein wr1 is a random real number between [0,1 ]; wr2 is a random real number between [ -1,1 ]; pi is the circumference ratio; exp represents an exponential function with a natural constant e as the base; cos represents a cosine function;
step 19, go to step 25;
step 20, setting the step factor AC to be 2 × ad × rp-ad; wherein rp is [0,1]]Random real number in between, convergence factor
Figure BDA0002689063310000041
Step 21, if the absolute value of the step factor AC is less than 1, go to step 22, otherwise go to step 24;
step 22, perform the bracketing operator to generate new individual NV according to equation (9)wi
NVwi=DIE-AC×|2×wr1×DIE-WAwi| (9);
Step 23, go to step 25;
step 24, execute the prey search operator to generate new individual NV according to the formula (10)wi
NVwi=WArsi-AC×|2×wr1×WAwi-WArsi| (10);
Wherein, WArsiIs an individual randomly selected from a population;
step 25, calculating New Individual NVwiAn adaptation value of;
step 26, if new individual NVwiIs smaller than the individual WAwiUsing the new individual NV in the populationwiReplacement of individual WAswiOtherwise, keeping the individual WAwiThe change is not changed;
step 27, setting a counting variable wi-wi + 1; if wi is greater than WN, go to step 28, otherwise go to step 13;
step 28, finding out the individual with the minimum adaptive value from the population and storing the individual with the minimum adaptive value into the optimal individual WBestA;
step 29, setting the current iteration time t as t + 1;
step 30, if the current iteration time t is less than the maximum iteration time MaxT, the step 10 is carried out, otherwise, the step 31 is carried out;
and step 31, decoding an annotation server distribution list of MT data annotation tasks from the optimal individual WBestA, and obtaining a scheduling scheme of the data annotation tasks.
The invention has the beneficial effects that: when the data annotation task scheduling problem is solved, the spiral hunting operator and the surrounding hunting operator of the traditional whale optimization algorithm directly utilize the optimal individuals in the population to guide searching, so that the individuals in the population greedily point to the optimal individuals to search, the diversity of the population is easily lost, and the defect of local optimization is caused. In order to improve the defects of the traditional whale optimization algorithm, the invention provides an improved whale optimization algorithm for solving the scheduling problem of the data annotation task. In the improved whale optimization algorithm, the adaptive value of an individual and the Euclidean distance close to the current individual are comprehensively utilized to select a directional individual, then the directional individual is utilized to guide searching, the diversity of a population is enhanced, and the convergence speed of the algorithm is kept, so that the probability of trapping in local optimum is reduced, and the execution efficiency of a data labeling task is improved.
Drawings
FIG. 1 is a process of decoding an allocation weight into an allocation list of a tag server, and a process of converting the allocation list of the tag server into an allocation state matrix;
FIG. 2 is a flowchart of a data annotation task scheduling method based on intelligent optimization.
Detailed Description
By way of example, and with reference to the accompanying drawings: FIG. 1 is a process of decoding an allocation weight into an allocation list of a tag server, and a process of converting the allocation list of the tag server into an allocation state matrix; fig. 2 is a flowchart of a data annotation task scheduling method based on intelligent optimization, and further details the technical scheme of the present invention.
Example (b):
in this embodiment, referring to fig. 1 and fig. 2, the specific implementation steps of the present invention are as follows:
step 1, inputting the number ND of the annotation servers as 100, and then inputting the number MT of the data annotation tasks as 300; the annotation server refers to a server used for executing a data tagging task; the data annotation task refers to a given batch of data and requires to label the given data;
step 2, inputting the memory size value MZ of 100 marking serverskiAnd throughput value PZkiWherein the subscript ki 1, 2.., 100; the processing capacity value refers to the number of instructions which can be executed by the annotation server in each second;
step 3, inputting memory size values TR required by 300 data labeling taskstiAnd a processing capability value PR required for 300 data tagging taskstiWherein subscript ti ═ 1, 2.., 300;
step 4, inputting the time value ET required by the ti-th data annotation task to be executed and completed on the ki station annotation serverki,tiWherein subscript ki 1, 2.., ND, and subscript ti 1, 2.., MT;
step 5, inputting a population scale WN of 300, and then inputting a maximum iteration number MaxT of 1000;
step 6, setting the current iteration time t to be 0;
step 7, randomly generating WN individual composition population WP ═ { WA1,WA2,...,WAri,...,WAWNWhere WAri=[WAri,1,WAri,2,...,WAri,wk,...,WAri,MT]Represents the ri th individual in the population and individual WAsriThe distribution weight of MT data labeling tasks is stored; WAri,wkRepresenting an individual WAriThe distribution weight of the wk data marking task stored in the database; wherein the individual subscript ri is 1, 2. Task subscript wk — 1, 2.., MT; the distribution weight value of the data labeling task is [1, ND]Real numbers in between;
step 8, calculating the adaptive value of each individual in the population; WA for individuals in a populationri Individual subscript ri 1, 2.., WN, with the fitness value calculation process: first, the individual WAriDecoding the stored distribution weight values of the MT data annotation tasks into an annotation server distribution list SEL, and converting the annotation server distribution list SEL into a distribution state matrix STA; then calculating the individual WA according to equation (1)riAdapted value of (WFit)ri
Figure BDA0002689063310000061
Wherein the STAki,tiA value representing the element of the kth row and the tth column of the assignment state matrix STA; pw1 represents a memory penalty factor; pw2 represents a processing power penalty factor; MFb represents a memory over-limit; PFb denotes a processing capacity over limit; max represents a maximum function;
said subjecting individual WAriDecoding the stored distribution weight values of the MT data annotation tasks into an annotation server distribution list SEL, wherein the specific process comprises the following steps: to individual WAriRounding the distribution weight of the stored MT data annotation tasks to obtain MT integers, and sequentially storing the MT integers into an annotation server distribution list SEL, wherein the 1 st integer in the annotation server distribution list SEL represents the number of an annotation server distributed by the 1 st data annotation task; annotation Server Allocation List SEL 2 ndThe integer represents the number of the annotation server allocated by the 2 nd data annotation task; the 3 rd integer in the allocation list SEL of the annotation server represents the number of the annotation server allocated by the 3 rd data annotation task; … …, respectively; the MT integer in the allocation list SEL of the annotation server represents the serial number of the annotation server allocated by the MT data annotation task;
the specific process of converting the allocation list SEL of the annotation server into the allocation state matrix STA is as follows: first, the size of the assignment state matrix STA is set to ND rows MT columns, and each element of the assignment state matrix STA is set to 0; then, sequentially judging MT data annotation tasks according to an annotation server distribution list SEL, if the et-th data annotation task is distributed in a kd-th annotation server, setting the value of the et-th row and the et-th column element of a kd-th row of a distribution state matrix STA to be 1, and otherwise, setting the value of the et-th row and the et-th column element of the kd-th row of the distribution state matrix STA to be 0; wherein the integer et ═ 1, 2.., MT; an integer kd ═ 1,2,. and ND;
step 9, storing the individual with the minimum fitness value in the population to the optimal individual WBestA;
step 10, then calculate the quality PWFit of each individual in the population according to equation (2)wei
Figure BDA0002689063310000071
Wherein, subscript wei ═ 1, 2.., WN; WFitweiThe fitness value of the wei-th individual in the population is shown, wherein the other meaning is when WFit iswei<At 0;
step 11, calculating the preferred rate ESP of each individual in the population according to the formula (3)wei
Figure BDA0002689063310000072
Wherein, subscript wei ═ 1, 2.., WN; wti are cumulative subscripts;
step 12, setting the counting variable wi to 1, and then calculating the orientation coefficient dcw according to the formula (4):
Figure BDA0002689063310000073
step 13, calculating individual WA according to formula (5)wiThe Euclidean distance WID between each body in the populationwei
WIDwei=dist(WAwi,WAwei) (5);
Wherein, subscript wei ═ 1, 2.., WN; dist represents the calculation of the Euclidean distance function; WAwiRepresenting the wi th individual in the population; WAweiRepresents the wei-th individual in the population; WIDweiRepresenting an individual WAwiAnd individual WAweiThe euclidean distance between;
step 14, calculating the individual WA according to equation (6)wiProximity WAP to each individual in a populationwei
Figure BDA0002689063310000074
Wherein, subscript wei ═ 1, 2.., WN; wti are cumulative subscripts;
step 15, calculating the orientation rate GP of each individual in the population according to the formula (7)wei
Figure BDA0002689063310000075
Wherein the subscript wei ═ 1, 2., WN, wherein the other meaning is when wei ═ wi;
step 16, according to the orientation rate GP of each individual in the populationweiSelecting directional individual DIE from the population by adopting a roulette selection method;
step 17, randomly generating a real number wrp; if wrp is less than 0.5, go to step 20, otherwise go to step 18;
step 18, executing the spiral hunting operator to generate new individual NV according to the formula (8)wi
NVwi=DIE+|2×wr1×DIE-WAwi|×exp(wr2)×cos(2×π×wr2) (8);
Wherein wr1 is a random real number between [0,1 ]; wr2 is a random real number between [ -1,1 ]; pi is the circumference ratio; exp represents an exponential function with a natural constant e as the base; cos represents a cosine function;
step 19, go to step 25;
step 20, setting the step factor AC to be 2 × ad × rp-ad; wherein rp is [0,1]]Random real number in between, convergence factor
Figure BDA0002689063310000081
Step 21, if the absolute value of the step factor AC is less than 1, go to step 22, otherwise go to step 24;
step 22, perform the bracketing operator to generate new individual NV according to equation (9)wi
NVwi=DIE-AC×|2×wr1×DIE-WAwi| (9);
Step 23, go to step 25;
step 24, execute the prey search operator to generate new individual NV according to the formula (10)wi
NVwi=WArsi-AC×|2×wr1×WAwi-WArsi| (10);
Wherein, WArsiIs an individual randomly selected from a population;
step 25, calculating New Individual NVwiAn adaptation value of;
step 26, if new individual NVwiIs smaller than the individual WAwiUsing the new individual NV in the populationwiReplacement of individual WAswiOtherwise, keeping the individual WAwiThe change is not changed;
step 27, setting a counting variable wi-wi + 1; if wi is greater than WN, go to step 28, otherwise go to step 13;
step 28, finding out the individual with the minimum adaptive value from the population and storing the individual with the minimum adaptive value into the optimal individual WBestA;
step 29, setting the current iteration time t as t + 1;
step 30, if the current iteration time t is less than the maximum iteration time MaxT, the step 10 is carried out, otherwise, the step 31 is carried out;
and step 31, decoding an annotation server distribution list of MT data annotation tasks from the optimal individual WBestA, and obtaining a scheduling scheme of the data annotation tasks.

Claims (1)

1. The data annotation task scheduling method based on intelligent optimization is characterized by comprising the following steps:
step 1, inputting the number ND of the labeling servers, and then inputting the number MT of the data labeling tasks;
step 2, inputting the memory size value MZ of the ND station mark serverkiAnd throughput value PZkiWherein subscript ki ═ 1, 2., ND;
step 3, inputting the memory size TR required by the MT data labeling taskstiAnd a processing capability value PR required for the MT data tagging tasktiWherein subscript ti 1, 2. ·, MT;
step 4, inputting the time value ET required by the ti-th data annotation task to be executed and completed on the ki station annotation serverki,tiWherein subscript ki 1, 2.., ND, and subscript ti 1, 2.., MT;
step 5, inputting a population scale WN, and then inputting a maximum iteration number MaxT;
step 6, setting the current iteration time t to be 0;
step 7, randomly generating WN individual composition population WP ═ { WA1,WA2,...,WAri,...,WAWNWhere WAri=[WAri,1,WAri,2,...,WAri,wk,...,WAri,MT]Represents the ri th individual in the population and individual WAsriThe distribution weight of MT data labeling tasks is stored; WAri,wkRepresenting an individual WAriThe distribution weight of the wk data marking task stored in the database; wherein the individual subscript ri is 1, 2. Task subscript wk — 1, 2.., MT;
step (ii) of8, calculating the adaptive value of each individual in the population; WA for individuals in a populationriIndividual subscript ri 1, 2.., WN, with the fitness value calculation process: first, the individual WAriDecoding the stored distribution weight values of the MT data annotation tasks into an annotation server distribution list SEL, and converting the annotation server distribution list SEL into a distribution state matrix STA; then calculating the individual WA according to equation (1)riAdapted value of (WFit)ri
Figure FDA0002689063300000011
Wherein the STAki,tiA value representing the element of the kth row and the tth column of the assignment state matrix STA; pw1 represents a memory penalty factor; pw2 represents a processing power penalty factor; MFb represents a memory over-limit; PFb denotes a processing capacity over limit; max represents a maximum function;
step 9, storing the individual with the minimum fitness value in the population to the optimal individual WBestA;
step 10, then calculate the quality PWFit of each individual in the population according to equation (2)wei
Figure FDA0002689063300000021
Wherein, subscript wei ═ 1, 2.., WN; WFitweiRepresenting the fitness value of the wei-th individual in the population;
step 11, calculating the preferred rate ESP of each individual in the population according to the formula (3)wei
Figure FDA0002689063300000022
Wherein, subscript wei ═ 1, 2.., WN; wti are cumulative subscripts;
step 12, setting the counting variable wi to 1, and then calculating the orientation coefficient dcw according to the formula (4):
Figure FDA0002689063300000023
step 13, calculating individual WA according to formula (5)wiThe Euclidean distance WID between each body in the populationwei
WIDwei=dist(WAwi,WAwei) (5);
Wherein, subscript wei ═ 1, 2.., WN; dist represents the calculation of the Euclidean distance function; WAwiRepresenting the wi th individual in the population; WAweiRepresents the wei-th individual in the population; WIDweiRepresenting an individual WAwiAnd individual WAweiThe euclidean distance between;
step 14, calculating the individual WA according to equation (6)wiProximity WAP to each individual in a populationwei
Figure FDA0002689063300000024
Wherein, subscript wei ═ 1, 2.., WN; wti are cumulative subscripts;
step 15, calculating the orientation rate GP of each individual in the population according to the formula (7)wei
Figure FDA0002689063300000025
Wherein, subscript wei ═ 1, 2.., WN;
step 16, according to the orientation rate GP of each individual in the populationweiSelecting directional individual DIE from the population by adopting a roulette selection method;
step 17, randomly generating a real number wrp; if wrp is less than 0.5, go to step 20, otherwise go to step 18;
step 18, executing the spiral hunting operator to generate new individual NV according to the formula (8)wi
NVwi=DIE+|2×wr1×DIE-WAwi|×exp(wr2)×cos(2×π×wr2) (8);
Wherein wr1 is a random real number between [0,1 ]; wr2 is a random real number between [ -1,1 ]; pi is the circumference ratio; exp represents an exponential function with a natural constant e as the base; cos represents a cosine function;
step 19, go to step 25;
step 20, setting the step factor AC to be 2 × ad × rp-ad; wherein rp is [0,1]]Random real number in between, convergence factor
Figure FDA0002689063300000031
Step 21, if the absolute value of the step factor AC is less than 1, go to step 22, otherwise go to step 24;
step 22, perform the bracketing operator to generate new individual NV according to equation (9)wi
NVwi=DIE-AC×|2×wr1×DIE-WAwi| (9);
Step 23, go to step 25;
step 24, execute the prey search operator to generate new individual NV according to the formula (10)wi
NVwi=WArsi-AC×|2×wr1×WAwi-WArsi| (10);
Wherein, WArsiIs an individual randomly selected from a population;
step 25, calculating New Individual NVwiAn adaptation value of;
step 26, if new individual NVwiIs smaller than the individual WAwiUsing the new individual NV in the populationwiReplacement of individual WAswiOtherwise, keeping the individual WAwiThe change is not changed;
step 27, setting a counting variable wi-wi + 1; if wi is greater than WN, go to step 28, otherwise go to step 13;
step 28, finding out the individual with the minimum adaptive value from the population and storing the individual with the minimum adaptive value into the optimal individual WBestA;
step 29, setting the current iteration time t as t + 1;
step 30, if the current iteration time t is less than the maximum iteration time MaxT, the step 10 is carried out, otherwise, the step 31 is carried out;
and step 31, decoding an annotation server distribution list of MT data annotation tasks from the optimal individual WBestA, and obtaining a scheduling scheme of the data annotation tasks.
CN202010985546.2A 2020-09-18 2020-09-18 Data annotation task scheduling method based on intelligent optimization Active CN112231076B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010985546.2A CN112231076B (en) 2020-09-18 2020-09-18 Data annotation task scheduling method based on intelligent optimization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010985546.2A CN112231076B (en) 2020-09-18 2020-09-18 Data annotation task scheduling method based on intelligent optimization

Publications (2)

Publication Number Publication Date
CN112231076A true CN112231076A (en) 2021-01-15
CN112231076B CN112231076B (en) 2022-04-08

Family

ID=74107226

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010985546.2A Active CN112231076B (en) 2020-09-18 2020-09-18 Data annotation task scheduling method based on intelligent optimization

Country Status (1)

Country Link
CN (1) CN112231076B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112801380A (en) * 2021-02-02 2021-05-14 广东奥博信息产业股份有限公司 Load prediction method for data annotation cloud server
CN112836759A (en) * 2021-02-09 2021-05-25 重庆紫光华山智安科技有限公司 Method and device for evaluating machine-selected picture, storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9275347B1 (en) * 2015-10-09 2016-03-01 AlpacaDB, Inc. Online content classifier which updates a classification score based on a count of labeled data classified by machine deep learning
CN108509335A (en) * 2018-01-31 2018-09-07 浙江理工大学 Software Test Data Generation Method based on genetic algorithm optimization
CN109886589A (en) * 2019-02-28 2019-06-14 长安大学 A method of low-carbon Job-Shop is solved based on whale optimization algorithm is improved
CN109902873A (en) * 2019-02-28 2019-06-18 长安大学 A method of the cloud manufacturing resource allocation based on modified whale algorithm

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9275347B1 (en) * 2015-10-09 2016-03-01 AlpacaDB, Inc. Online content classifier which updates a classification score based on a count of labeled data classified by machine deep learning
CN108509335A (en) * 2018-01-31 2018-09-07 浙江理工大学 Software Test Data Generation Method based on genetic algorithm optimization
CN109886589A (en) * 2019-02-28 2019-06-14 长安大学 A method of low-carbon Job-Shop is solved based on whale optimization algorithm is improved
CN109902873A (en) * 2019-02-28 2019-06-18 长安大学 A method of the cloud manufacturing resource allocation based on modified whale algorithm

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张斯琪等: ""能耗和噪声约束下的柔性车间调度决策优化"", 《小型微型计算机系统》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112801380A (en) * 2021-02-02 2021-05-14 广东奥博信息产业股份有限公司 Load prediction method for data annotation cloud server
CN112836759A (en) * 2021-02-09 2021-05-25 重庆紫光华山智安科技有限公司 Method and device for evaluating machine-selected picture, storage medium and electronic equipment
CN112836759B (en) * 2021-02-09 2023-05-30 重庆紫光华山智安科技有限公司 Machine-selected picture evaluation method and device, storage medium and electronic equipment

Also Published As

Publication number Publication date
CN112231076B (en) 2022-04-08

Similar Documents

Publication Publication Date Title
CN112231076B (en) Data annotation task scheduling method based on intelligent optimization
CN106228185B (en) A kind of general image classifying and identifying system neural network based and method
CN105118090B (en) A kind of point cloud filtering method of adaptive complicated landform structure
CN116188584B (en) Method and system for identifying object polishing position based on image
CN115048539B (en) Social media data online retrieval method and system based on dynamic memory
CN111596622B (en) Flexible job shop scheduling method of ECM (electronic model control) rule distribution estimation algorithm
CN109345684A (en) A kind of multinational paper money number recognition methods based on GMDH-SVM
CN112148213B (en) Annotated data storage server allocation method based on evolution optimization
CN110796268A (en) Method and device for determining hyper-parameters of business processing model
CN111191941A (en) Method for solving SEVM model problem based on artificial bee colony algorithm
CN116452373A (en) Multi-target genetic algorithm-based intelligent generation method and system for block building body quantity
CN111310121A (en) New energy output probability prediction method and system
CN114089699B (en) Method for solving mixed flow shop scheduling based on cell type membrane calculation model
CN112991402B (en) Wen Wudian cloud registration method and system based on improved differential evolution algorithm
CN108629356A (en) A kind of date storage method and device towards power load classification application
CN113552881A (en) Multi-path planning data set generation method for neural network training
CN114399152B (en) Method and device for optimizing comprehensive energy scheduling of industrial park
CN111275078B (en) Optimization method of support vector machine for part image recognition
CN116977652B (en) Workpiece surface morphology generation method and device based on multi-mode image generation
CN112711256B (en) Automatic generation method for target search observation points of mobile robot
CN115907430A (en) Multi-unmanned aerial vehicle sequence multi-task cooperative allocation method and system
CN106855918A (en) Process the principal component analytical method of extensive matrix data
CN116229103A (en) Target detection method based on multi-scale positive and negative sample selection
CN118053515A (en) DNN-TL-GA-based metal cluster structure optimization method and optimization system
CN111144460A (en) Algorithm for improving classification performance of convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Data annotation task scheduling method based on intelligent optimization

Effective date of registration: 20220930

Granted publication date: 20220408

Pledgee: Guangdong Shunde Rural Commercial Bank Co.,Ltd. science and technology innovation sub branch

Pledgor: GUANGDONG OKING INFORMATION INDUSTRY CO.,LTD.

Registration number: Y2022980017199