CN111985545B - Target data detection method, device, equipment and medium based on artificial intelligence - Google Patents
Target data detection method, device, equipment and medium based on artificial intelligence Download PDFInfo
- Publication number
- CN111985545B CN111985545B CN202010797375.0A CN202010797375A CN111985545B CN 111985545 B CN111985545 B CN 111985545B CN 202010797375 A CN202010797375 A CN 202010797375A CN 111985545 B CN111985545 B CN 111985545B
- Authority
- CN
- China
- Prior art keywords
- data
- target
- sub
- detected
- target data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 188
- 238000013473 artificial intelligence Methods 0.000 title claims abstract description 44
- 238000012706 support-vector machine Methods 0.000 claims abstract description 58
- 238000010845 search algorithm Methods 0.000 claims abstract description 33
- 238000012549 training Methods 0.000 claims description 46
- 238000012795 verification Methods 0.000 claims description 19
- 230000006870 function Effects 0.000 claims description 18
- 238000000034 method Methods 0.000 claims description 17
- 239000003016 pheromone Substances 0.000 claims description 15
- 230000004044 response Effects 0.000 claims description 10
- 238000010276 construction Methods 0.000 claims description 6
- 238000005457 optimization Methods 0.000 claims description 6
- 238000012216 screening Methods 0.000 abstract description 14
- 238000005516 engineering process Methods 0.000 abstract description 3
- 238000004590 computer program Methods 0.000 description 12
- 230000000694 effects Effects 0.000 description 8
- 238000012545 processing Methods 0.000 description 8
- 238000007726 management method Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 5
- 230000006399 behavior Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000001680 brushing effect Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/018—Certifying business or products
- G06Q30/0185—Product, service or business identity fraud
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/08—Insurance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Biology (AREA)
- Development Economics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Business, Economics & Management (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Economics (AREA)
- Evolutionary Computation (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Technology Law (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to artificial intelligence, and provides a target data detection method, device, equipment and medium based on artificial intelligence, which can select data from data to be detected to construct a target data set according to a target label, select data corresponding to a sub label from the data to be detected to construct a sub data set, calculate the intersection of the target data set and the sub data set to obtain a sub intersection, train a single support vector machine model by the sub intersection in combination with a global search algorithm to obtain a detection model, input the data to be detected into the detection model, output a detection result, integrate the data with the detection result as target data, and further automatically detect and screen the target data in an artificial intelligence mode, thereby solving the problems of low screening efficiency, high error rate and poor reliability. The invention also relates to a blockchain technology, and a detection model and target data can be stored in the blockchain.
Description
Technical Field
The present invention relates to the field of artificial intelligence technologies, and in particular, to a target data detection method, apparatus, device, and medium based on artificial intelligence.
Background
At present, with the rapid development of insurance industry, teams of agents are also growing, and partial agents have situations of replacing or brushing data (such as using equipment of other people to do questions) so as to generate a large amount of false data, and the false data can cause larger data noise, reduce data quality and influence the usability of the data due to lower authenticity, so that how to quickly and accurately detect target data becomes a problem to be solved.
For the above-mentioned cases, a solution generally adopted in the industry is to screen data with rules set in advance, for example: and manually setting a screening principle, reserving data conforming to the principle, and deleting data not conforming to the principle. The method only relies on artificially defined rules to carry out data statistics, so that the screening efficiency is low, mistakes are easy to occur, and the reliability of the final detection result is poor.
When the machine is adopted to automatically screen target data, the training samples are difficult to acquire, and the parameters of the traditional support vector machine model are difficult to determine, so that the performance of the model is poor, and the global property and the robustness of the model are also required to be improved.
Disclosure of Invention
In view of the foregoing, it is necessary to provide a method, apparatus, device and medium for detecting target data based on artificial intelligence, which can automatically detect and screen target data in an artificial intelligence manner, and solve the problems of low screening efficiency, high error rate and poor reliability.
An artificial intelligence based target data detection method, the artificial intelligence based target data detection method comprising:
determining a target tag in response to the received target data detection instruction;
acquiring data to be detected corresponding to a user to be detected;
selecting data from the data to be detected according to the target tag to construct a target data set;
Acquiring at least one pre-defined sub-label, selecting data corresponding to each sub-label from the data to be detected, and constructing at least one sub-data set corresponding to each sub-label;
calculating the intersection of the target data set and the at least one sub data set to obtain at least one sub intersection;
Training a single-class support vector machine model by the at least one sub-intersection by combining a global search algorithm to obtain a detection model;
Inputting the data to be detected into the detection model, and outputting a detection result, wherein the detection result comprises a target and a non-target;
and integrating the data with the detection result as a target as the target data.
According to a preferred embodiment of the present invention, the determining the target tag includes one or more of the following combinations:
Analyzing the method body of the target data detection instruction to obtain data carried by the target data detection instruction, acquiring a preset tag, matching the preset tag with the data carried by the target data detection instruction, and determining the matched data as the target tag; or alternatively
Acquiring historical target data, identifying keywords of the historical target data as first keywords, calculating the occurrence frequency of the first keywords, and acquiring the first keywords with highest occurrence frequency as the target labels.
According to a preferred embodiment of the present invention, the selecting data from the data to be detected according to the target tag to construct a target data set includes:
Identifying a keyword of each datum in the data to be detected as a second keyword;
dividing the data with the same second keywords in the data to be detected into one category, and naming each category by the same second keywords;
configuring the weight of each category in the corresponding data;
determining the category with the weight greater than or equal to the configuration weight as the target category of the corresponding data;
Defining a label of the corresponding data according to the target category;
And matching the target label in the defined label, and acquiring data corresponding to the label matched with the target label to construct the target data set.
According to a preferred embodiment of the present invention, the training a single-class support vector machine model with the at least one sub-intersection in combination with a global search algorithm, to obtain a detection model includes:
Integrating the data in the at least one sub-intersection to obtain a training sample;
Determining a kernel function of a single-class support vector machine model, and determining initial parameters of the kernel function;
Optimizing the initial parameters by adopting the global search algorithm to obtain an optimized single-class support vector machine model;
and training the optimized single-class support vector machine model by using the training sample to obtain the detection model.
According to a preferred embodiment of the present invention, the optimizing the initial parameters by using a global search algorithm, to obtain an optimized single-class support vector machine model includes:
Determining the highest iteration times and search step length of the global search algorithm;
Taking the initial parameter as an initial position, and carrying out iterative search of the pheromone by the search step length until the highest iterative times are reached, stopping searching, and obtaining the current pheromone;
And replacing the initial parameters by taking the position corresponding to the current pheromone as the optimized parameters to obtain the optimized single-class support vector machine model.
According to a preferred embodiment of the present invention, the artificial intelligence based target data detection method further includes:
randomly acquiring data from the target data set to construct a verification set;
inputting the data in the verification set into the detection model, and outputting a first detection result;
acquiring data of which the first detection result is a target, and calculating the proportion of the acquired data in the verification set;
When the proportion is greater than or equal to the configuration proportion, determining that the detection model passes verification, and storing the detection model into a blockchain; or alternatively
And when the proportion is smaller than the configuration proportion, determining that the detection model fails to pass verification, updating the at least one sub-label, updating the at least one sub-data set according to the updated at least one sub-label, and carrying out optimization training on the detection model by the updated at least one sub-data set.
According to a preferred embodiment of the present invention, after integrating the data whose detection result is the target as the target data, the artificial intelligence-based target data detection method further includes:
Calculating the duty ratio of the target data in the data to be detected;
when the duty ratio is lower than the configuration duty ratio, generating warning information according to the duty ratio;
Determining the associated user of the user to be detected;
And sending the warning information to the terminal equipment of the user to be detected and the terminal equipment of the associated user.
An artificial intelligence based target data detection apparatus, the artificial intelligence based target data detection apparatus comprising:
a determining unit for determining a target tag in response to the received target data detection instruction;
The acquisition unit is used for acquiring to-be-detected data corresponding to the to-be-detected user;
the construction unit is used for selecting data from the data to be detected according to the target label to construct a target data set;
The construction unit is further used for acquiring at least one pre-defined sub-label, selecting data corresponding to each sub-label from the data to be detected, and constructing at least one sub-data set corresponding to each sub-label;
A calculating unit, configured to calculate an intersection of the target data set and the at least one sub data set, to obtain at least one sub intersection;
the training unit is used for combining a global search algorithm, training a single-class support vector machine model by the at least one sub-intersection to obtain a detection model;
the input unit is used for inputting the data to be detected into the detection model and outputting a detection result, wherein the detection result comprises a target and a non-target;
and the integration unit is used for integrating the data with the detection result as the target data.
An electronic device, the electronic device comprising:
a memory storing at least one instruction; and
And the processor executes the instructions stored in the memory to realize the target data detection method based on the artificial intelligence.
A computer-readable storage medium having stored therein at least one instruction for execution by a processor in an electronic device to implement the artificial intelligence based target data detection method.
According to the technical scheme, the target label can be determined in response to the received target data detection instruction, the data to be detected corresponding to the user to be detected is obtained, the target data set is constructed according to the target label, the target data set is selected from the data to be detected, at least one pre-defined sub-label is obtained, the data corresponding to each sub-label is selected from the data to be detected, at least one sub-data set corresponding to each sub-label is constructed, the intersection of the target data set and the at least one sub-data set is calculated, at least one sub-intersection is obtained, the global search algorithm is combined, the single support vector machine model is trained by the at least one sub-intersection, the detection model is obtained, the data to be detected is input into the detection model, the detection result is output, the detection result comprises the target data and the non-target data, the integrated detection result is the target data as the target data, further the detection and screening of the target data can be automatically carried out in an artificial intelligent mode, and the problems of low screening efficiency, high error rate and poor reliability are solved.
Drawings
FIG. 1 is a flow chart of a preferred embodiment of the artificial intelligence based target data detection method of the present invention.
FIG. 2 is a functional block diagram of a preferred embodiment of the artificial intelligence based target data detection apparatus of the present invention.
FIG. 3 is a schematic diagram of an electronic device implementing a preferred embodiment of an artificial intelligence based target data detection method according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in detail with reference to the accompanying drawings and specific embodiments.
FIG. 1 is a flow chart of a preferred embodiment of the artificial intelligence based target data detection method of the present invention. The order of the steps in the flowchart may be changed and some steps may be omitted according to various needs.
The target data detection method based on artificial intelligence is applied to one or more electronic devices, wherein the electronic devices are devices capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions, and the hardware of the electronic devices comprises, but is not limited to, microprocessors, application SPECIFIC INTEGRATED Circuits (ASICs), programmable gate arrays (Field-Programmable GATE ARRAY, FPGA), digital processors (DIGITAL SIGNAL processors, DSPs), embedded devices and the like.
The electronic device may be any electronic product that can interact with a user in a human-computer manner, such as a Personal computer, a tablet computer, a smart phone, a Personal digital assistant (Personal DIGITAL ASSISTANT, PDA), a game console, an interactive internet protocol television (Internet Protocol Television, IPTV), a smart wearable device, etc.
The electronic device may also include a network device and/or a user device. Wherein the network device includes, but is not limited to, a single network server, a server group composed of a plurality of network servers, or a Cloud based Cloud Computing (Cloud Computing) composed of a large number of hosts or network servers.
The network in which the electronic device is located includes, but is not limited to, the internet, a wide area network, a metropolitan area network, a local area network, a virtual private network (Virtual Private Network, VPN), and the like.
S10, determining a target label in response to the received target data detection instruction.
In at least one embodiment of the present invention, the target data detection instructions may be triggered by a designated person to execute according to actual needs.
Of course, the target data detection instruction may also be configured to trigger periodically, so as to screen target data periodically, and avoid that other related service scenarios influence the execution effect of the task due to noise generated by non-target data when the task is executed by using the data.
In this embodiment, when a piece of data carries the target tag, the piece of data can be determined as target data. For example: the target tag may be: an operation of importing an address book, etc. is performed.
In at least one embodiment of the present invention, the determining the target tag includes, but is not limited to, one or more of the following:
Analyzing the method body of the target data detection instruction to obtain data carried by the target data detection instruction, acquiring a preset tag, matching the preset tag with the data carried by the target data detection instruction, and determining the matched data as the target tag; or alternatively
Through the implementation mode, when the target label is carried in the target data detection instruction, the target label can be directly extracted by the preset label, so that the time for data processing is saved, and the accuracy of the target label is ensured.
Acquiring historical target data, identifying keywords of the historical target data as first keywords, calculating the occurrence frequency of the first keywords, and acquiring the first keywords with highest occurrence frequency as the target labels.
By the embodiment, when the target tag is not carried in the target data detection instruction, the target tag can be determined through historical target data.
Of course, in other embodiments, the target tag may be determined in other ways, and the invention is not limited.
For example: and receiving the label uploaded by the user as the target label.
S11, obtaining to-be-detected data corresponding to the to-be-detected user.
In this embodiment, the account number of the user to be detected is obtained, all data corresponding to the account number is obtained, and the obtained data is determined to be the data to be detected.
It can be understood that the data corresponding to the account number is not necessarily the data generated by the user to be detected, if other people log in the account number of the user to be detected to operate, the generated data is not the actual data of the user to be detected, and can be regarded as data to be brushed or substituted, namely, non-target data, and the actual data of the user to be detected is the target data.
S12, selecting data from the data to be detected according to the target label to construct a target data set.
In this embodiment, since the data in the target data set all carry the target tag, the data in the target data set are all target data.
In at least one embodiment of the present invention, the selecting data from the data to be detected according to the target tag to construct a target data set includes:
Identifying a keyword of each datum in the data to be detected as a second keyword;
dividing the data with the same second keywords in the data to be detected into one category, and naming each category by the same second keywords;
configuring the weight of each category in the corresponding data;
determining the category with the weight greater than or equal to the configuration weight as the target category of the corresponding data;
Defining a label of the corresponding data according to the target category;
And matching the target label in the defined label, and acquiring data corresponding to the label matched with the target label to construct the target data set.
The configuration weights can be configured in a self-defined manner according to actual requirements, and the invention is not limited.
After naming each category by the same keyword, a certain weight needs to be assigned to each category to distinguish importance degrees of different categories, and in particular, multiple data indexes can be comprehensively considered when determining the weight, such as: number of interviews, number of attractive interviews, etc.
S13, at least one pre-defined sub-label is obtained, and data corresponding to each sub-label is selected from the data to be detected to construct at least one sub-data set corresponding to each sub-label.
Wherein the at least one sub-tag belongs to an extension to the target tag.
For example: the at least one sub-tag may include, but is not limited to: AI interviews were performed with attendance records.
It should be noted that, the manner of constructing at least one sub-data set corresponding to each sub-label is similar to the manner of constructing the target data set, and is not described herein.
S14, calculating the intersection of the target data set and the at least one sub data set to obtain at least one sub intersection.
It should be noted that, the data with the at least one sub-tag and the target tag may be determined as target data.
Further, the data in the at least one sub-intersection obtained after the intersection processing is performed belongs to target data, and other data can be marked and extracted by the data in the at least one sub-intersection so as to detect the target data.
By the aid of the method and the device, data expansion can be further carried out on the basis of the target data set, sufficient data are obtained to train the model, and the model training effect is improved.
S15, combining a global search algorithm, and training a single-Class support vector machine (One-Class Support Vector Machine, one-Class SVM) model by using the at least One sub-intersection to obtain a detection model.
It should be noted that, in a practical application scenario, only positive samples can be defined in many cases, but not negative samples. For example: when predicting whether the user has a child or not through the search record of the user, it is not absolute that the user does not search for words such as "baby", "early education", and it is determined that the user has no child. A single class support vector machine model may be adapted to the scene.
In at least one embodiment of the present invention, the training a single-class support vector machine model with the at least one sub-intersection in combination with a global search algorithm, to obtain a detection model includes:
Integrating the data in the at least one sub-intersection to obtain a training sample;
Determining a kernel function of a single-class support vector machine model, and determining initial parameters of the kernel function;
Optimizing the initial parameters by adopting the global search algorithm to obtain an optimized single-class support vector machine model;
and training the optimized single-class support vector machine model by using the training sample to obtain the detection model.
Through the embodiment, the training sample can be constructed based on the pre-marked sub-intersection, and the training sample is further used for training the single-class support vector machine model, so that the target of the data to be detected can be intelligently detected by using the detection model obtained through training.
In addition, under normal conditions, parameters of the single-class support vector machine model are difficult to determine, and the global search algorithm is adopted to optimize and determine the parameters of the single-class support vector machine model, so that the global property and the robustness of the single-class support vector machine model are improved.
Specifically, the optimizing the initial parameters by using a global search algorithm to obtain the optimized single-class support vector machine model comprises the following steps:
Determining the highest iteration times and search step length of the global search algorithm;
Taking the initial parameter as an initial position, and carrying out iterative search of the pheromone by the search step length until the highest iterative times are reached, stopping searching, and obtaining the current pheromone;
And replacing the initial parameters by taking the position corresponding to the current pheromone as the optimized parameters to obtain the optimized single-class support vector machine model.
Through the implementation mode, the parameters of the single-class support vector machine model can be automatically determined through the global search algorithm, so that the performance of the single-class support vector machine model is improved, and the problem that the parameters of the single-class support vector machine model are difficult to determine is solved.
In at least one embodiment of the present invention, the artificial intelligence based target data detection method further includes:
randomly acquiring data from the target data set to construct a verification set;
inputting the data in the verification set into the detection model, and outputting a first detection result;
acquiring data of which the first detection result is a target, and calculating the proportion of the acquired data in the verification set;
When the proportion is greater than or equal to the configuration proportion, determining that the detection model passes verification, and storing the detection model into a blockchain; or alternatively
The configuration proportion can be configured in a self-defined manner according to actual requirements, and the invention is not limited.
By the above embodiment, the detection model can be verified with a small amount of target data to ensure usability of the detection model.
Meanwhile, the detection model is stored in a blockchain so as to further ensure the safety of the detection model.
And when the proportion is smaller than the configuration proportion, determining that the detection model fails to pass verification, updating the at least one sub-label, updating the at least one sub-data set according to the updated at least one sub-label, and carrying out optimization training on the detection model by the updated at least one sub-data set.
Through the embodiment, when the detection model fails to pass verification, the corresponding label optimization training can be timely adjusted to train the detection model, so that the adaptability of the detection model is enhanced, and meanwhile, the flexibility of the detection model is improved.
S16, inputting the data to be detected into the detection model, and outputting a detection result, wherein the detection result comprises a target and a non-target.
Specifically, when the detection result is a first identifier (such as 1 or Y), determining that the detection result is a target; or when the detection result is a second identifier (such as 0 or N), determining that the detection result is a non-target.
Through the implementation mode, the detection and screening of the target data can be automatically performed in an artificial intelligence mode, and the problems of low screening efficiency, high error rate and poor reliability caused by data statistics according to the artificially defined rules in the prior art are solved.
And S17, integrating the data with the detection result as a target as the target data.
Through the implementation mode, all target data can be automatically screened from the data to be detected by combining an artificial intelligence means, and compared with a traditional mode, the method is more efficient, is not easy to make mistakes, and has stronger practicability.
In order to prevent the data from being tampered, the target data may also be saved to the blockchain.
In at least one embodiment of the present invention, after integrating the data with the detection result as the target data, the target data detection method based on artificial intelligence further includes:
Calculating the duty ratio of the target data in the data to be detected;
when the duty ratio is lower than the configuration duty ratio, generating warning information according to the duty ratio;
Determining the associated user of the user to be detected;
And sending the warning information to the terminal equipment of the user to be detected and the terminal equipment of the associated user.
The warning information is used for warning that non-target data generated by the user to be detected are excessive currently.
The configuration duty ratio can be configured in a self-defined way according to actual requirements, and the invention is not limited.
Wherein the associated user may include, but is not limited to: and the superior leadership and attendance management personnel of the user to be detected.
Through the embodiment, the warning information is sent to the terminal equipment of the user to be detected, so that the warning effect on the user to be detected can be achieved, the illegal behavior is avoided, meanwhile, the warning information is sent to the terminal equipment of the associated user, the attention of related personnel can be drawn, the problem is timely processed when the problem occurs, the problem is avoided from being enlarged, and the effect of timely stopping the damage is achieved.
According to the technical scheme, the target label can be determined in response to the received target data detection instruction, the data to be detected corresponding to the user to be detected is obtained, the target data set is constructed according to the target label, the target data set is selected from the data to be detected, at least one pre-defined sub-label is obtained, the data corresponding to each sub-label is selected from the data to be detected, at least one sub-data set corresponding to each sub-label is constructed, the intersection of the target data set and the at least one sub-data set is calculated, at least one sub-intersection is obtained, the global search algorithm is combined, the single support vector machine model is trained by the at least one sub-intersection, the detection model is obtained, the data to be detected is input into the detection model, the detection result is output, the detection result comprises the target data and the non-target data, the integrated detection result is the target data as the target data, further the detection and screening of the target data can be automatically carried out in an artificial intelligent mode, and the problems of low screening efficiency, high error rate and poor reliability are solved.
FIG. 2 is a functional block diagram of a preferred embodiment of the artificial intelligence based object data detection device of the present invention. The artificial intelligence based target data detecting apparatus 11 includes a determining unit 110, an acquiring unit 111, a constructing unit 112, a calculating unit 113, a training unit 114, an input unit 115, and an integrating unit 116. The module/unit referred to in the present invention refers to a series of computer program segments capable of being executed by the processor 13 and of performing a fixed function, which are stored in the memory 12. In the present embodiment, the functions of the respective modules/units will be described in detail in the following embodiments.
The determining unit 110 determines the target tag in response to the received target data detection instruction.
In at least one embodiment of the present invention, the target data detection instructions may be triggered by a designated person to execute according to actual needs.
Of course, the target data detection instruction may also be configured to trigger periodically, so as to screen target data periodically, and avoid that other related service scenarios influence the execution effect of the task due to noise generated by non-target data when the task is executed by using the data.
In this embodiment, when a piece of data carries the target tag, the piece of data can be determined as target data. For example: the target tag may be: an operation of importing an address book, etc. is performed.
In at least one embodiment of the present invention, the determining unit 110 determines the target tag includes, but is not limited to, one or more of the following:
Analyzing the method body of the target data detection instruction to obtain the data carried by the target data detection instruction, acquiring a preset tag, matching the preset tag with the data carried by the target data detection instruction, and determining the matched data as the target tag.
Through the implementation mode, when the target label is carried in the target data detection instruction, the target label can be directly extracted by the preset label, so that the time for data processing is saved, and the accuracy of the target label is ensured.
Or acquiring historical target data, identifying keywords of the historical target data as first keywords, calculating the occurrence frequency of the first keywords, and acquiring the first keywords with highest occurrence frequency as the target labels.
By the embodiment, when the target tag is not carried in the target data detection instruction, the target tag can be determined through historical target data.
Of course, in other embodiments, the target tag may be determined in other ways, and the invention is not limited.
For example: and receiving the label uploaded by the user as the target label.
The acquisition unit 111 acquires data to be detected corresponding to a user to be detected.
In this embodiment, the account number of the user to be detected is obtained, all data corresponding to the account number is obtained, and the obtained data is determined to be the data to be detected.
It can be understood that the data corresponding to the account number is not necessarily the data generated by the user to be detected, if other people log in the account number of the user to be detected to operate, the generated data is not the actual data of the user to be detected, and can be regarded as data to be brushed or substituted, namely, non-target data, and the actual data of the user to be detected is the target data.
The construction unit 112 selects data from the data to be detected to construct a target data set according to the target tag.
In this embodiment, since the data in the target data set all carry the target tag, the data in the target data set are all target data.
In at least one embodiment of the present invention, the constructing unit 112 selects data from the data to be detected according to the target tag to construct a target data set including:
Identifying a keyword of each datum in the data to be detected as a second keyword;
dividing the data with the same second keywords in the data to be detected into one category, and naming each category by the same second keywords;
configuring the weight of each category in the corresponding data;
determining the category with the weight greater than or equal to the configuration weight as the target category of the corresponding data;
Defining a label of the corresponding data according to the target category;
And matching the target label in the defined label, and acquiring data corresponding to the label matched with the target label to construct the target data set.
The configuration weights can be configured in a self-defined manner according to actual requirements, and the invention is not limited.
After naming each category by the same keyword, a certain weight needs to be assigned to each category to distinguish importance degrees of different categories, and in particular, multiple data indexes can be comprehensively considered when determining the weight, such as: number of interviews, number of attractive interviews, etc.
The construction unit 112 acquires at least one sub-tag defined in advance, and selects data corresponding to each sub-tag from the data to be detected to construct at least one sub-data set corresponding to each sub-tag.
Wherein the at least one sub-tag belongs to an extension to the target tag.
For example: the at least one sub-tag may include, but is not limited to: AI interviews were performed with attendance records.
It should be noted that, the manner of constructing at least one sub-data set corresponding to each sub-label is similar to the manner of constructing the target data set, and is not described herein.
The calculation unit 113 calculates an intersection of the target data set and the at least one sub data set, resulting in at least one sub intersection.
It should be noted that, the data with the at least one sub-tag and the target tag may be determined as target data.
Further, the data in the at least one sub-intersection obtained after the intersection processing is performed belongs to target data, and other data can be marked and extracted by the data in the at least one sub-intersection so as to detect the target data.
By the aid of the method and the device, data expansion can be further carried out on the basis of the target data set, sufficient data are obtained to train the model, and the model training effect is improved.
The training unit 114 combines the global search algorithm to train a single Class support vector machine (One-Class Support Vector Machine, one-Class SVM) model with the at least One sub-intersection to obtain a detection model.
It should be noted that, in a practical application scenario, only positive samples can be defined in many cases, but not negative samples. For example: when predicting whether the user has a child or not through the search record of the user, it is not absolute that the user does not search for words such as "baby", "early education", and it is determined that the user has no child. A single class support vector machine model may be adapted to the scene.
In at least one embodiment of the present invention, the training unit 114 in combination with the global search algorithm trains a single-class support vector machine model with the at least one sub-intersection, and obtaining the detection model includes:
Integrating the data in the at least one sub-intersection to obtain a training sample;
Determining a kernel function of a single-class support vector machine model, and determining initial parameters of the kernel function;
Optimizing the initial parameters by adopting the global search algorithm to obtain an optimized single-class support vector machine model;
and training the optimized single-class support vector machine model by using the training sample to obtain the detection model.
Through the embodiment, the training sample can be constructed based on the pre-marked sub-intersection, and the training sample is further used for training the single-class support vector machine model, so that the target of the data to be detected can be intelligently detected by using the detection model obtained through training.
In addition, under normal conditions, parameters of the single-class support vector machine model are difficult to determine, and the global search algorithm is adopted to optimize and determine the parameters of the single-class support vector machine model, so that the global property and the robustness of the single-class support vector machine model are improved.
Specifically, the training unit 114 optimizes the initial parameters by using a global search algorithm, and the obtaining the optimized single-class support vector machine model includes:
Determining the highest iteration times and search step length of the global search algorithm;
Taking the initial parameter as an initial position, and carrying out iterative search of the pheromone by the search step length until the highest iterative times are reached, stopping searching, and obtaining the current pheromone;
And replacing the initial parameters by taking the position corresponding to the current pheromone as the optimized parameters to obtain the optimized single-class support vector machine model.
Through the implementation mode, the parameters of the single-class support vector machine model can be automatically determined through the global search algorithm, so that the performance of the single-class support vector machine model is improved, and the problem that the parameters of the single-class support vector machine model are difficult to determine is solved.
In at least one embodiment of the invention, randomly acquiring data from the target dataset to construct a validation set;
inputting the data in the verification set into the detection model, and outputting a first detection result;
acquiring data of which the first detection result is a target, and calculating the proportion of the acquired data in the verification set;
Further, when the ratio is greater than or equal to a configuration ratio, determining that the detection model is validated, and saving the detection model to a blockchain.
The configuration proportion can be configured in a self-defined manner according to actual requirements, and the invention is not limited.
By the above embodiment, the detection model can be verified with a small amount of target data to ensure usability of the detection model.
Meanwhile, the detection model is stored in a blockchain so as to further ensure the safety of the detection model.
Or when the proportion is smaller than the configuration proportion, determining that the detection model is not verified, updating the at least one sub-label, updating the at least one sub-data set according to the updated at least one sub-label, and carrying out optimization training on the detection model by the updated at least one sub-data set.
Through the embodiment, when the detection model fails to pass verification, the corresponding label optimization training can be timely adjusted to train the detection model, so that the adaptability of the detection model is enhanced, and meanwhile, the flexibility of the detection model is improved.
The input unit 115 inputs the data to be detected to the detection model, and outputs a detection result, wherein the detection result includes a target and a non-target.
Specifically, when the detection result is a first identifier (such as 1 or Y), determining that the detection result is a target; or when the detection result is a second identifier (such as 0 or N), determining that the detection result is a non-target.
Through the implementation mode, the detection and screening of the target data can be automatically performed in an artificial intelligence mode, and the problems of low screening efficiency, high error rate and poor reliability caused by data statistics according to the artificially defined rules in the prior art are solved.
The integration unit 116 integrates the data whose detection result is the target as the target data.
Through the implementation mode, all target data can be automatically screened from the data to be detected by combining an artificial intelligence means, and compared with a traditional mode, the method is more efficient, is not easy to make mistakes, and has stronger practicability.
In order to prevent the data from being tampered, the target data may also be saved to the blockchain.
In at least one embodiment of the present invention, after integrating the data with the detection result as the target data, the duty ratio of the target data in the data to be detected is calculated;
when the duty ratio is lower than the configuration duty ratio, generating warning information according to the duty ratio;
Determining the associated user of the user to be detected;
And sending the warning information to the terminal equipment of the user to be detected and the terminal equipment of the associated user.
The warning information is used for warning that non-target data generated by the user to be detected are excessive currently.
The configuration duty ratio can be configured in a self-defined way according to actual requirements, and the invention is not limited.
Wherein the associated user may include, but is not limited to: and the superior leadership and attendance management personnel of the user to be detected.
Through the embodiment, the warning information is sent to the terminal equipment of the user to be detected, so that the warning effect on the user to be detected can be achieved, the illegal behavior is avoided, meanwhile, the warning information is sent to the terminal equipment of the associated user, the attention of related personnel can be drawn, the problem is timely processed when the problem occurs, the problem is avoided from being enlarged, and the effect of timely stopping the damage is achieved.
According to the technical scheme, the target label can be determined in response to the received target data detection instruction, the data to be detected corresponding to the user to be detected is obtained, the target data set is constructed according to the target label, the target data set is selected from the data to be detected, at least one pre-defined sub-label is obtained, the data corresponding to each sub-label is selected from the data to be detected, at least one sub-data set corresponding to each sub-label is constructed, the intersection of the target data set and the at least one sub-data set is calculated, at least one sub-intersection is obtained, the global search algorithm is combined, the single support vector machine model is trained by the at least one sub-intersection, the detection model is obtained, the data to be detected is input into the detection model, the detection result is output, the detection result comprises the target data and the non-target data, the integrated detection result is the target data as the target data, further the detection and screening of the target data can be automatically carried out in an artificial intelligent mode, and the problems of low screening efficiency, high error rate and poor reliability are solved.
Fig. 3 is a schematic structural diagram of an electronic device according to a preferred embodiment of the present invention for implementing an artificial intelligence-based target data detection method.
The electronic device 1 may comprise a memory 12, a processor 13 and a bus, and may further comprise a computer program stored in the memory 12 and executable on the processor 13, such as an artificial intelligence based target data detection program.
It will be appreciated by those skilled in the art that the schematic diagram is merely an example of the electronic device 1 and does not constitute a limitation of the electronic device 1, the electronic device 1 may be a bus type structure, a star type structure, the electronic device 1 may further comprise more or less other hardware or software than illustrated, or a different arrangement of components, for example, the electronic device 1 may further comprise an input-output device, a network access device, etc.
It should be noted that the electronic device 1 is only used as an example, and other electronic products that may be present in the present invention or may be present in the future are also included in the scope of the present invention by way of reference.
The memory 12 includes at least one type of readable storage medium including flash memory, a removable hard disk, a multimedia card, a card memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, etc. The memory 12 may in some embodiments be an internal storage unit of the electronic device 1, such as a mobile hard disk of the electronic device 1. The memory 12 may also be an external storage device of the electronic device 1 in other embodiments, such as a plug-in mobile hard disk, a smart memory card (SMART MEDIA CARD, SMC), a Secure Digital (SD) card, a flash memory card (FLASH CARD) or the like, which are provided on the electronic device 1. Further, the memory 12 may also include both an internal storage unit and an external storage device of the electronic device 1. The memory 12 may be used not only for storing application software installed in the electronic device 1 and various types of data, such as code of an artificial intelligence-based target data detection program, but also for temporarily storing data that has been output or is to be output.
The processor 13 may be comprised of integrated circuits in some embodiments, for example, a single packaged integrated circuit, or may be comprised of multiple integrated circuits packaged with the same or different functions, including one or more central processing units (Central Processing unit, CPU), microprocessors, digital processing chips, graphics processors, various control chips, and the like. The processor 13 is a Control Unit (Control Unit) of the electronic device 1, connects the respective components of the entire electronic device 1 using various interfaces and lines, executes or executes programs or modules stored in the memory 12 (for example, executes an artificial intelligence-based target data detection program or the like), and invokes data stored in the memory 12 to perform various functions of the electronic device 1 and process the data.
The processor 13 executes the operating system of the electronic device 1 and various types of applications installed. The processor 13 executes the application program to implement the steps of the various embodiments of the artificial intelligence based target data detection method described above, such as the steps shown in fig. 1.
Illustratively, the computer program may be partitioned into one or more modules/units that are stored in the memory 12 and executed by the processor 13 to complete the present invention. The one or more modules/units may be a series of instruction segments of a computer program capable of performing a specific function for describing the execution of the computer program in the electronic device 1. For example, the computer program may be divided into a determining unit 110, an acquiring unit 111, a constructing unit 112, a calculating unit 113, a training unit 114, an input unit 115, an integrating unit 116.
The integrated units implemented in the form of software functional modules described above may be stored in a computer readable storage medium. The software functional modules are stored in a storage medium and include instructions for causing a computer device (which may be a personal computer, a computer device, or a network device, etc.) or a processor (processor) to perform portions of the artificial intelligence-based target data detection method according to various embodiments of the invention.
The integrated modules/units of the electronic device 1 may be stored in a computer readable storage medium if implemented in the form of software functional units and sold or used as separate products. Based on this understanding, the present invention may also be implemented by a computer program for instructing a relevant hardware device to implement all or part of the procedures of the above-mentioned embodiment method, where the computer program may be stored in a computer readable storage medium and the computer program may be executed by a processor to implement the steps of each of the above-mentioned method embodiments.
Wherein the computer program comprises computer program code which may be in source code form, object code form, executable file or some intermediate form etc. The computer readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM).
Further, the computer-usable storage medium may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function, and the like; the storage data area may store data created from the use of blockchain nodes, and the like.
The blockchain is a novel application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, encryption algorithm and the like. The blockchain (Blockchain), essentially a de-centralized database, is a string of data blocks that are generated in association using cryptographic methods, each of which contains information from a batch of network transactions for verifying the targeting (anti-counterfeiting) of the information and generating the next block. The blockchain may include a blockchain underlying platform, a platform product services layer, an application services layer, and the like.
The bus may be a peripheral component interconnect standard (PERIPHERAL COMPONENT INTERCONNECT, PCI) bus, or an extended industry standard architecture (extended industry standard architecture, EISA) bus, among others. The bus may be classified as an address bus, a data bus, a control bus, etc. For ease of illustration, only one arrow is shown in FIG. 3, but only one bus or one type of bus is not shown. The bus is arranged to enable a connection communication between the memory 12 and at least one processor 13 or the like.
Although not shown, the electronic device 1 may further comprise a power source (such as a battery) for powering the various components, which may preferably be logically connected to the at least one processor 13 via a power management means, so as to perform functions such as charge management, discharge management, and power consumption management via the power management means. The power supply may also include one or more of any of a direct current or alternating current power supply, recharging device, power failure detection circuit, power converter or inverter, power status indicator, etc. The electronic device 1 may further include various sensors, bluetooth modules, wi-Fi modules, etc., which will not be described herein.
Further, the electronic device 1 may also comprise a network interface, optionally the network interface may comprise a wired interface and/or a wireless interface (e.g. WI-FI interface, bluetooth interface, etc.), typically used for establishing a communication connection between the electronic device 1 and other electronic devices.
The electronic device 1 may optionally further comprise a user interface, which may be a Display, an input unit, such as a Keyboard (Keyboard), or a standard wired interface, a wireless interface. Alternatively, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch, or the like. The display may also be referred to as a display screen or display unit, as appropriate, for displaying information processed in the electronic device 1 and for displaying a visual user interface.
It should be understood that the embodiments described are for illustrative purposes only and are not limited to this configuration in the scope of the patent application.
Fig. 3 shows only an electronic device 1 with components 12-13, it being understood by a person skilled in the art that the structure shown in fig. 3 does not constitute a limitation of the electronic device 1, and may comprise fewer or more components than shown, or may combine certain components, or a different arrangement of components.
In connection with fig. 1, the memory 12 in the electronic device 1 stores a plurality of instructions to implement an artificial intelligence based target data detection method, the processor 13 being executable to implement:
determining a target tag in response to the received target data detection instruction;
acquiring data to be detected corresponding to a user to be detected;
selecting data from the data to be detected according to the target tag to construct a target data set;
Acquiring at least one pre-defined sub-label, selecting data corresponding to each sub-label from the data to be detected, and constructing at least one sub-data set corresponding to each sub-label;
calculating the intersection of the target data set and the at least one sub data set to obtain at least one sub intersection;
Training a single-class support vector machine model by the at least one sub-intersection by combining a global search algorithm to obtain a detection model;
Inputting the data to be detected into the detection model, and outputting a detection result, wherein the detection result comprises a target and a non-target;
and integrating the data with the detection result as a target as the target data.
Specifically, the specific implementation method of the above instructions by the processor 13 may refer to the description of the relevant steps in the corresponding embodiment of fig. 1, which is not repeated herein.
In the several embodiments provided in the present invention, it should be understood that the disclosed systems, devices, and methods may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the modules is merely a logical function division, and there may be other manners of division when actually implemented.
The modules described as separate components may or may not be physically separate, and components shown as modules may or may not be physical units, may be located in one place, or may be distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional module in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units can be realized in a form of hardware or a form of hardware and a form of software functional modules.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof.
The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference signs in the claims shall not be construed as limiting the claim concerned.
Furthermore, it is evident that the word "comprising" does not exclude other elements or steps, and that the singular does not exclude a plurality. A plurality of units or means recited in the system claims can also be implemented by means of software or hardware by means of one unit or means. The terms second, etc. are used to denote a name, but not any particular order.
Finally, it should be noted that the above-mentioned embodiments are merely for illustrating the technical solution of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications and equivalents may be made to the technical solution of the present invention without departing from the spirit and scope of the technical solution of the present invention.
Claims (7)
1. The target data detection method based on the artificial intelligence is characterized by comprising the following steps of:
determining a target tag in response to the received target data detection instruction;
acquiring data to be detected corresponding to a user to be detected;
selecting data from the data to be detected according to the target tag to construct a target data set;
Acquiring at least one pre-defined sub-label, selecting data corresponding to each sub-label from the data to be detected, and constructing at least one sub-data set corresponding to each sub-label;
calculating the intersection of the target data set and the at least one sub data set to obtain at least one sub intersection;
Training a single-class support vector machine model with the at least one sub-intersection in combination with a global search algorithm to obtain a detection model, comprising: integrating the data in the at least one sub-intersection to obtain a training sample; determining a kernel function of a single-class support vector machine model, and determining initial parameters of the kernel function; optimizing the initial parameters by adopting the global search algorithm to obtain an optimized single-class support vector machine model; training the optimized single-class support vector machine model by using the training sample to obtain the detection model; the optimizing the initial parameters by adopting a global search algorithm, and the obtaining the optimized single-class support vector machine model comprises the following steps: determining the highest iteration times and search step length of the global search algorithm; taking the initial parameter as an initial position, and carrying out iterative search of the pheromone by the search step length until the highest iterative times are reached, stopping searching, and obtaining the current pheromone; taking the position corresponding to the current pheromone as an optimized parameter to replace the initial parameter to obtain the optimized single-class support vector machine model;
Inputting the data to be detected into the detection model, and outputting a detection result, wherein the detection result comprises a target and a non-target;
Integrating the data with the detection result as a target as the target data;
Calculating the duty ratio of the target data in the data to be detected; when the duty ratio is lower than the configuration duty ratio, generating warning information according to the duty ratio; determining the associated user of the user to be detected; and sending the warning information to the terminal equipment of the user to be detected and the terminal equipment of the associated user.
2. The artificial intelligence based target data detection method of claim 1, wherein the determining target tags comprises a combination of one or more of:
Analyzing the method body of the target data detection instruction to obtain data carried by the target data detection instruction, acquiring a preset tag, matching the preset tag with the data carried by the target data detection instruction, and determining the matched data as the target tag; or alternatively
Acquiring historical target data, identifying keywords of the historical target data as first keywords, calculating the occurrence frequency of the first keywords, and acquiring the first keywords with highest occurrence frequency as the target labels.
3. The artificial intelligence based target data detection method of claim 1, wherein the selecting data from the data to be detected to construct a target data set according to the target tag comprises:
Identifying a keyword of each datum in the data to be detected as a second keyword;
dividing the data with the same second keywords in the data to be detected into one category, and naming each category by the same second keywords;
configuring the weight of each category in the corresponding data;
determining the category with the weight greater than or equal to the configuration weight as the target category of the corresponding data;
Defining a label of the corresponding data according to the target category;
And matching the target label in the defined label, and acquiring data corresponding to the label matched with the target label to construct the target data set.
4. The artificial intelligence based target data detection method of claim 1, further comprising:
randomly acquiring data from the target data set to construct a verification set;
inputting the data in the verification set into the detection model, and outputting a first detection result;
acquiring data of which the first detection result is a target, and calculating the proportion of the acquired data in the verification set;
When the proportion is greater than or equal to the configuration proportion, determining that the detection model passes verification, and storing the detection model into a blockchain; or alternatively
And when the proportion is smaller than the configuration proportion, determining that the detection model fails to pass verification, updating the at least one sub-label, updating the at least one sub-data set according to the updated at least one sub-label, and carrying out optimization training on the detection model by the updated at least one sub-data set.
5. An artificial intelligence based target data detection apparatus, comprising:
a determining unit for determining a target tag in response to the received target data detection instruction;
The acquisition unit is used for acquiring to-be-detected data corresponding to the to-be-detected user;
the construction unit is used for selecting data from the data to be detected according to the target label to construct a target data set;
The construction unit is further used for acquiring at least one pre-defined sub-label, selecting data corresponding to each sub-label from the data to be detected, and constructing at least one sub-data set corresponding to each sub-label;
A calculating unit, configured to calculate an intersection of the target data set and the at least one sub data set, to obtain at least one sub intersection;
The training unit is configured to train a single-class support vector machine model with the at least one sub-intersection in combination with a global search algorithm to obtain a detection model, and includes: integrating the data in the at least one sub-intersection to obtain a training sample; determining a kernel function of a single-class support vector machine model, and determining initial parameters of the kernel function; optimizing the initial parameters by adopting the global search algorithm to obtain an optimized single-class support vector machine model; training the optimized single-class support vector machine model by using the training sample to obtain the detection model; the optimizing the initial parameters by adopting a global search algorithm, and the obtaining the optimized single-class support vector machine model comprises the following steps: determining the highest iteration times and search step length of the global search algorithm; taking the initial parameter as an initial position, and carrying out iterative search of the pheromone by the search step length until the highest iterative times are reached, stopping searching, and obtaining the current pheromone; taking the position corresponding to the current pheromone as an optimized parameter to replace the initial parameter to obtain the optimized single-class support vector machine model;
the input unit is used for inputting the data to be detected into the detection model and outputting a detection result, wherein the detection result comprises a target and a non-target;
an integrating unit for integrating the data with the detection result as a target as the target data;
Calculating the duty ratio of the target data in the data to be detected; when the duty ratio is lower than the configuration duty ratio, generating warning information according to the duty ratio; determining the associated user of the user to be detected; and sending the warning information to the terminal equipment of the user to be detected and the terminal equipment of the associated user.
6. An electronic device, the electronic device comprising:
a memory storing at least one instruction; and
A processor executing instructions stored in the memory to implement the artificial intelligence based target data detection method of any one of claims 1 to 4.
7. A computer-readable storage medium, characterized by: the computer-readable storage medium has stored therein at least one instruction that is executed by a processor in an electronic device to implement the artificial intelligence based target data detection method of any one of claims 1 to 4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010797375.0A CN111985545B (en) | 2020-08-10 | 2020-08-10 | Target data detection method, device, equipment and medium based on artificial intelligence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010797375.0A CN111985545B (en) | 2020-08-10 | 2020-08-10 | Target data detection method, device, equipment and medium based on artificial intelligence |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111985545A CN111985545A (en) | 2020-11-24 |
CN111985545B true CN111985545B (en) | 2024-05-17 |
Family
ID=73445448
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010797375.0A Active CN111985545B (en) | 2020-08-10 | 2020-08-10 | Target data detection method, device, equipment and medium based on artificial intelligence |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111985545B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112925958A (en) * | 2021-02-05 | 2021-06-08 | 深圳力维智联技术有限公司 | Multi-source heterogeneous data adaptation method, device, equipment and readable storage medium |
CN114004443A (en) * | 2021-09-13 | 2022-02-01 | 广东省国土资源测绘院 | Detection method and device for illegal building data |
CN114580562A (en) * | 2022-03-16 | 2022-06-03 | 北京珞安科技有限责任公司 | Abnormal data detection method and device based on process flow |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106709506A (en) * | 2016-11-28 | 2017-05-24 | 广东工业大学 | Method for identifying and classifying species and different origins of Chinese herbal medicine |
WO2019119515A1 (en) * | 2017-12-22 | 2019-06-27 | 深圳云天励飞技术有限公司 | Face analysis and filtering method, device, embedded apparatus, dielectric and integrated circuit |
CN110059734A (en) * | 2019-04-02 | 2019-07-26 | 唯思科技(北京)有限公司 | A kind of training method, object identification method, device, robot and the medium of target identification disaggregated model |
CN110348490A (en) * | 2019-06-20 | 2019-10-18 | 宜通世纪科技股份有限公司 | A kind of soil quality prediction technique and device based on algorithm of support vector machine |
CN110516251A (en) * | 2019-08-29 | 2019-11-29 | 秒针信息技术有限公司 | A kind of construction method, construction device, equipment and the medium of electric business entity recognition model |
CN111476324A (en) * | 2020-06-28 | 2020-07-31 | 平安国际智慧城市科技股份有限公司 | Traffic data labeling method, device, equipment and medium based on artificial intelligence |
-
2020
- 2020-08-10 CN CN202010797375.0A patent/CN111985545B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106709506A (en) * | 2016-11-28 | 2017-05-24 | 广东工业大学 | Method for identifying and classifying species and different origins of Chinese herbal medicine |
WO2019119515A1 (en) * | 2017-12-22 | 2019-06-27 | 深圳云天励飞技术有限公司 | Face analysis and filtering method, device, embedded apparatus, dielectric and integrated circuit |
CN110059734A (en) * | 2019-04-02 | 2019-07-26 | 唯思科技(北京)有限公司 | A kind of training method, object identification method, device, robot and the medium of target identification disaggregated model |
CN110348490A (en) * | 2019-06-20 | 2019-10-18 | 宜通世纪科技股份有限公司 | A kind of soil quality prediction technique and device based on algorithm of support vector machine |
CN110516251A (en) * | 2019-08-29 | 2019-11-29 | 秒针信息技术有限公司 | A kind of construction method, construction device, equipment and the medium of electric business entity recognition model |
CN111476324A (en) * | 2020-06-28 | 2020-07-31 | 平安国际智慧城市科技股份有限公司 | Traffic data labeling method, device, equipment and medium based on artificial intelligence |
Also Published As
Publication number | Publication date |
---|---|
CN111985545A (en) | 2020-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111950621B (en) | Target data detection method, device, equipment and medium based on artificial intelligence | |
CN111985545B (en) | Target data detection method, device, equipment and medium based on artificial intelligence | |
CN112446025A (en) | Federal learning defense method and device, electronic equipment and storage medium | |
CN112801718B (en) | User behavior prediction method, device, equipment and medium | |
CN111949708B (en) | Multi-task prediction method, device, equipment and medium based on time sequence feature extraction | |
CN111639153B (en) | Query method and device based on legal knowledge graph, electronic equipment and medium | |
CN113806434B (en) | Big data processing method, device, equipment and medium | |
CN112396547B (en) | Course recommendation method, device, equipment and medium based on unsupervised learning | |
CN114997263B (en) | Method, device, equipment and storage medium for analyzing training rate based on machine learning | |
CN111340240A (en) | Method and device for realizing automatic machine learning | |
CN115081538A (en) | Customer relationship identification method, device, equipment and medium based on machine learning | |
US10762089B2 (en) | Open ended question identification for investigations | |
CN114612194A (en) | Product recommendation method and device, electronic equipment and storage medium | |
CN113256181A (en) | Risk factor prediction method, device, equipment and medium | |
CN114201482A (en) | Dynamic population distribution statistical method and device, electronic equipment and readable storage medium | |
CN112651782B (en) | Behavior prediction method, device, equipment and medium based on dot product attention scaling | |
CN111950707B (en) | Behavior prediction method, device, equipment and medium based on behavior co-occurrence network | |
CN112700261B (en) | Method, device, equipment and medium for detecting single file of brushing on basis of suspicious communities | |
CN112115890B (en) | Drunk driving identification method, device, equipment and medium based on artificial intelligence | |
CN116823437A (en) | Access method, device, equipment and medium based on configured wind control strategy | |
CN112330080B (en) | Factor screening method, device, equipment and medium based on connectivity graph | |
CN113657546B (en) | Information classification method, device, electronic equipment and readable storage medium | |
CN111859985B (en) | AI customer service model test method and device, electronic equipment and storage medium | |
CN113204962B (en) | Word sense disambiguation method, device, equipment and medium based on graph expansion structure | |
CN114722146A (en) | Supply chain asset checking method, device, equipment and medium based on artificial intelligence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |