CN116739512A - Data analysis management system and method based on artificial intelligent cloud platform - Google Patents

Data analysis management system and method based on artificial intelligent cloud platform Download PDF

Info

Publication number
CN116739512A
CN116739512A CN202310669585.5A CN202310669585A CN116739512A CN 116739512 A CN116739512 A CN 116739512A CN 202310669585 A CN202310669585 A CN 202310669585A CN 116739512 A CN116739512 A CN 116739512A
Authority
CN
China
Prior art keywords
evaluation
state
application
file
account
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310669585.5A
Other languages
Chinese (zh)
Other versions
CN116739512B (en
Inventor
黄宗威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Zhongwu Big Data Development Group Co ltd
Original Assignee
Harbin Rongmei Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Rongmei Technology Co ltd filed Critical Harbin Rongmei Technology Co ltd
Priority to CN202310669585.5A priority Critical patent/CN116739512B/en
Publication of CN116739512A publication Critical patent/CN116739512A/en
Application granted granted Critical
Publication of CN116739512B publication Critical patent/CN116739512B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06311Scheduling, planning or task assignment for a person or group
    • G06Q10/063114Status monitoring or status determination for a person or group

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Quality & Reliability (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Game Theory and Decision Science (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a data analysis management system and method based on an artificial intelligent cloud platform, and belongs to the technical field of flow management. A set of business process management method is established in a cloud platform mode and is applied to the field of content assessment scenes; firstly, classifying all received application files according to application types, and calculating the text similarity of each application file; secondly, acquiring the type and the similarity of the file which is evaluated by each evaluation account for the last time, and distributing application files with different text similarity under different types or the same type to the corresponding evaluation account to ensure that the file received by each evaluation account has the difference with the application file which is evaluated by the evaluation account for the last time; and finally, evaluating the account, calculating the reliability of the corresponding application file through an evaluation result, adjusting the state of the application file according to the reliability, and determining whether re-evaluation is needed according to the state information of the application file, so that the reliability of the whole evaluation link is improved, and the false evaluation probability is reduced.

Description

Data analysis management system and method based on artificial intelligent cloud platform
Technical Field
The invention relates to the technical field of flow management, in particular to a data analysis management system and method based on an artificial intelligent cloud platform.
Background
With the continuous development of information technology, various industries are accelerating the digital transformation, and office automation systems have become an indispensable tool in daily work. More and more businesses are transferred to online processing, great convenience is brought, and meanwhile, the office efficiency is also greatly improved. Meanwhile, in order to ensure the accuracy of data and the standard perfection of the flow, a plurality of evaluation links are generally inserted into the business flow to ensure the accuracy of data and the flow, improve the working efficiency and reduce the invalid operation probability.
At present, the assessment link is generally full-automatic treatment or semi-automatic treatment. The full-automatic processing is usually used for simple assessment operation, and assessment work is automatically completed by a computer in a data analysis or cloud computing mode, so that the business is high in repeatability, low in complexity, simple and easy to process. However, in some cases, the evaluation link needs complex judgment and review, and no very clear evaluation standard exists, so that the task cannot be simply performed by a computer, and often needs participation of people to be cooperatively completed, so that a semi-automatic processing mode is generally adopted. The processing mode has the advantages that more flexible and reasonable decisions can be made according to human experience and judgment, and the stiffness and mechanization caused by flow automation are avoided. However, at the same time, a series of problems are caused by the participation of the person. For example, the pressure and fatigue caused by repeated assessment of the same type of file over a long period of time causes erroneous assessment; or, the evaluation result caused by rough browsing and simple scrolling is lack of accuracy. These problems not only affect the quality and accuracy of the assessment of the file, but also waste a lot of resources. Therefore, how to optimize the evaluation flow and improve the evaluation efficiency and accuracy becomes a problem that needs to be solved currently.
Disclosure of Invention
The invention aims to provide a data analysis management system and method based on an artificial intelligence cloud platform, which are used for solving the problems in the background technology.
In order to solve the technical problems, the invention provides the following technical scheme: the system comprises a data acquisition module, a data processing module, an operation management module and a data storage module.
The data acquisition module is used for acquiring the received application file information, the evaluation account information and the evaluation process information; the data processing module is used for processing the application file information, classifying according to application types, calculating the text similarity of each application file and distributing the text similarity to different evaluation accounts; comprehensively analyzing and evaluating account information and evaluation process information, and calculating the reliability of the application file; the running management module adjusts the state of the corresponding application file according to the reliability, and determines the file flow direction through the file state; the data storage module is used for carrying out backup storage on all the information.
The data acquisition module comprises a file information acquisition unit, an account information acquisition unit and a rating information acquisition unit.
The file information acquisition unit is used for acquiring type information, content information and state information of all application files. The type information refers to the application type of the application file, the content information refers to the main text in the application file, and the state information refers to the current state of the application file, including the state to be evaluated, the state to be evaluated and the state to be re-evaluated.
The system receives in real time a large number of different types of application files, which may have not yet been rated, or which may have been rated to be re-rated. Each application file contains a large amount of text information, and different types of application file formats are different.
The account information acquisition unit is used for acquiring the historical misjudgment rate of the estimated account, the estimated account state, the type of the application file of the last evaluation and the similarity. The historical misjudgment rate refers to the ratio of the number of evaluation errors in the historical evaluation application files of the corresponding evaluation account to the total number of the evaluation application files, and the evaluation account state comprises an idle state and a busy state.
The evaluation information acquisition unit is used for acquiring the evaluation time and the evaluation result of the evaluation application file. The evaluation time refers to the time it takes for the evaluation account to evaluate from the beginning of the evaluation to the end of the evaluation, and the evaluation result includes pass and reject.
The evaluation object uses the evaluation account to perform evaluation work, and the evaluation result is passed or rejected. Because the evaluation link is one link in the whole business process, the evaluation result can influence other links whether passing or refusing. When the evaluation result is passing, the application file flows into the next link; and when the assessment result is refusal, the application file flows back to the last link. When the next link or the last link feeds back the erroneous judgment of the link, the system automatically distributes the corresponding application files in a classified mode to carry out re-assessment. If the subsequent final evaluation result is not consistent with the previous evaluation result, the previous evaluation error is described, the ratio of the number of evaluation errors in the historical evaluation application files of each evaluation account to the total number of the evaluation application files is counted as the historical misjudgment rate, and the higher the historical misjudgment rate is, the greater the misjudgment possibility of the evaluation account is represented, and the lower the reliability degree of the evaluation result is.
The data processing module comprises a file classification dispatch unit and an assessment behavior analysis unit.
The file classification and dispatch unit is used for classifying the application files with all states of to-be-evaluated states or review states and dispatching the application files to the evaluation account in the idle state for evaluation. Firstly, classifying all application files according to type information, wherein the application files with the same type are one type; secondly, performing text similarity calculation on all application files in each class and reference files corresponding to the type, and binding the obtained similarity with the application files; and finally, judging the application files which should be dispatched at this time according to the type and the similarity of the application files which are evaluated by the evaluation account for the last time.
The type or similarity of the application file being dispatched should be different from the type or similarity of the file last assessed by the corresponding assessment account. When all the idle state evaluation accounts can be distributed to application files with different types, the distribution is directly carried out. When the evaluation account in the partial idle state cannot be distributed to the application files with different types, the partial evaluation account preferentially selects the application file with the largest similarity difference with the last evaluation file of the corresponding evaluation account in the corresponding class for distribution, and the rest evaluation accounts normally distribute the application files with different types.
The evaluation behavior analysis unit is used for calculating the reliability of the evaluation process. Substituting the historical misjudgment rate, the evaluation time and the evaluation result of the evaluation account into a formula, and calculating to obtain the reliability, wherein the reliability is bound with the corresponding application file.
The reliability determines the reliability of the assessment behavior. The higher the reliability, the higher the reliability of the assessment behavior, and the lower the reliability.
The operation management module comprises a state management unit and a flow direction management unit.
The state management unit is used for adjusting the states of the evaluation account and the application file. Evaluating account states includes a free state and a busy state. The idle state refers to the assessment account not being subject to assessment work, and the busy state refers to the assessment account being subject to assessment work. The states of the application file include a state to be evaluated, an evaluated state and a re-evaluation state. The state to be evaluated refers to that the application file is not evaluated, the evaluated state refers to that the application file is evaluated and the reliability is not in a reliability abnormal section, and the re-evaluation state refers to that the application file is evaluated and the reliability is in a reliability abnormal section.
When the evaluation account is evaluating the application file, the system automatically adjusts the state of the evaluation account to a busy state, and the system does not send any task any more in the state. When the assessment of the assessment account is finished, the system automatically adjusts the assessment account state to a idle state in which the system can dispatch tasks.
When the application file is dispatched for the first time and the evaluation is not completed yet, the system automatically sets the state of the corresponding application file as the state to be evaluated. And when the evaluation is finished, judging whether the reliability of the application file is in a reliability abnormal section, setting the corresponding application file state as a re-evaluation state if the reliability of the application file is in the reliability abnormal section, and setting the corresponding application file state as an evaluated state if the reliability of the application file is not in the reliability abnormal section.
The flow direction management unit is used for judging the flow direction according to the state of the application file. And when the state of the application file is the rated state, ending the evaluation, and sending the evaluation result to the next link. When the application file state is the re-evaluation state, the evaluation is not finished, the re-evaluation is needed, the corresponding application file is sent to the data processing module, and the re-classification distribution is carried out for evaluation.
The data storage module is used for storing the application file information, the evaluation account information, the assessment process information and the state adjustment information into a database for subsequent tracing operation.
A data analysis management method based on an artificial intelligence cloud platform comprises the following steps:
s1, classifying all application files and then distributing the application files to each evaluation account;
s2, evaluating the application file by the evaluation account, and collecting evaluation information;
s3, calculating reliability through the assessment information, and adjusting the state;
s4, defining a flow direction according to the state of the application file.
In S1, the classification and dispatch steps for the application file are as follows:
s101, collecting type information and content information of all application files; the type information refers to the application type of the application file, and the content information refers to the main text in the application file.
S102, acquiring type information of all application files, and dividing the application files with the same type into the same class, wherein one or more application files are arranged in each class.
S103, acquiring main body texts of all application files through content information. And simultaneously, acquiring the main text of the reference file corresponding to each class. And calculating the similarity between the main text of each application file and the main text of the reference file corresponding to the class of the application file, and binding the obtained similarity with the application file. The method comprises the following steps:
s103-1, word segmentation processing is carried out on the text content, and each text is converted into a word list.
S103-2, removing repeated words in each word list to obtain two different word sets.
S103-3, combining two different word sets to obtain a total word set.
S103-4, constructing a vector for each word in the total word set, wherein each dimension of the vector represents the number of times the word appears in the text. The similarity between the two vectors is calculated using the formula:
where cos (θ) is the similarity of the application documents, |a| represents the modulus of vector a, |b| represents the modulus of vector b, and a×b represents the inner product of vector a and vector b.
S104, obtaining the file types and the similarity of the last evaluation of all the evaluation accounts, finding out application files of different types from the application files, and judging whether all the evaluation accounts can be distributed to the application files of different types. If yes, directly distributing; if the result is no, the type of all the application files cannot meet the type requirement of all the evaluation accounts, and part of the evaluation accounts can be distributed with the application files with the same type as the last evaluation file. In this case, the part of the evaluation accounts preferentially select the application files with the largest similarity difference with the last evaluation file of the corresponding evaluation account in the corresponding class to be distributed, and the rest of the evaluation accounts normally distribute the application files with different types. The formula is as follows:
x=argmax{|n-s|:s∈S}
wherein argmax represents the similarity with the maximum difference between the last evaluation file similarity of the evaluation account, x represents the application file corresponding to argmax, n represents the last evaluation file similarity of the evaluation account, S represents the element in the corresponding class, and S represents the set of all the similarities in the corresponding class.
In S2, the evaluation information refers to information of all the evaluation accounts in the evaluation process, including a historical misjudgment rate, an evaluation time and an evaluation result of each evaluation account. The historical misjudgment rate refers to the ratio of the number of rating errors in the historical rating application files of the corresponding rating account to the total number of rating application files. The evaluation time refers to the time it takes for an evaluation account to evaluate from the beginning of the evaluation to the end of the evaluation. The assessment results include pass and reject.
In S2 and S3, the reliability calculation and file status setting steps are as follows:
s301, acquiring the historical misjudgment rates of all the evaluation accounts which are executing the assessment work, modifying the states of the evaluation accounts into busy states, and marking.
S302, when the marked evaluation account is detected to finish the evaluation work, the state of the corresponding evaluation account is modified to be in an idle state, marking is canceled, and meanwhile, the evaluation time and the evaluation result are collected.
S303, substituting the historical misjudgment rate, the evaluation time and the evaluation result of the evaluation account into a formula, and calculating to obtain the reliability, wherein the reliability is bound with the corresponding application file. The reliability calculation formula is as follows:
wherein KKD is the reliability of the application file; p is the type of the evaluation result, the result is the refuting value-1, and the result is the passing value 1; e is the influence weight of the historical misjudgment rate; h is the historical misjudgment rate of the estimated account; f is an evaluation time influence coefficient, and the value is greater than 1; t is the evaluation time; k is the reliability of the last assessment.
S304, judging whether the reliability of the application file is in a reliability abnormal section, if so, adjusting the state of the corresponding application file from the state to be evaluated to the re-evaluation state, and if not, adjusting the state of the corresponding application file from the state to be evaluated to the evaluated state. The judgment formula is as follows:
wherein result is the application file state adjustment result, K min K is the minimum value of the reliability abnormal interval max The reliability abnormal interval maximum value KKD is the application file reliability.
In S4, the states of the application file include an evaluated state and a review state. The rated state refers to the rated and reliable application files meeting the requirements, and the re-rated state refers to the rated and reliable application files not meeting the requirements.
And when the state of the application file is the rated state, ending the evaluation, and sending the evaluation result to the next link. When the state of the application file is the re-evaluation state, the re-evaluation is needed, the step is skipped to the step S1, and the re-classification dispatch is carried out for evaluation.
Compared with the prior art, the invention has the following beneficial effects:
1. according to the method, each application file is classified and the text similarity is calculated in the dispatching stage, the application files with different file types or text similarity which are evaluated by the account last time are dispatched to each evaluation account, the situation that the evaluation account repeatedly evaluates the application files with the same type or text high similarity is avoided, and the probability of reducing misjudgment in repeated work is reduced.
2. The invention judges the reliability degree of the evaluation action by analyzing the historical misjudgment rate of the evaluation account and the evaluation time of the time, and defines the evaluation accuracy by judging whether the reliability is in the reliability abnormal section. Under the condition of abnormal reliability, the method and the system are automatically distributed to other evaluation accounts for re-evaluation, and a plurality of evaluation accounts evaluate the same application file, so that the accuracy of evaluation can be effectively improved.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention. In the drawings:
FIG. 1 is a schematic diagram of a data analysis management system and method based on an artificial intelligence cloud platform;
FIG. 2 is a flow chart of a system and method for data analysis management based on an artificial intelligence cloud platform.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1, the present invention provides the following technical solutions: the system comprises a data acquisition module, a data processing module, an operation management module and a data storage module.
The data acquisition module is used for acquiring the received application file information, the evaluation account information and the evaluation process information; the data processing module is used for processing the application file information, classifying according to application types, calculating the text similarity of each application file and distributing the text similarity to different evaluation accounts; comprehensively analyzing and evaluating account information and evaluation process information, and calculating the reliability of the application file; the running management module adjusts the state of the corresponding application file according to the reliability, and determines the file flow direction through the file state; the data storage module is used for carrying out backup storage on all the information.
The data acquisition module comprises a file information acquisition unit, an account information acquisition unit and an assessment information acquisition unit.
The file information acquisition unit is used for acquiring type information, content information and state information of all application files. The type information refers to the application type of the application file, the content information refers to the main text in the application file, and the state information refers to the current state of the application file, including the state to be evaluated, the state to be evaluated and the state to be re-evaluated.
The system receives in real time a large number of different types of application files, which may have not yet been rated, or which may have been rated to be re-rated. Each application file contains a large amount of text information, and different types of application file formats are different.
The account information acquisition unit is used for acquiring the historical misjudgment rate of the estimated account, the estimated account state, the type of the application file of the last evaluation and the similarity. The historical misjudgment rate refers to the ratio of the number of evaluation errors in the historical evaluation application files of the corresponding evaluation account to the total number of the evaluation application files, and the evaluation account state comprises an idle state and a busy state.
The evaluation information acquisition unit is used for acquiring the evaluation time and the evaluation result of the evaluation application file. The evaluation time refers to the time it takes for the evaluation account to evaluate from the beginning of the evaluation to the end of the evaluation, and the evaluation result includes pass and reject.
The evaluation object uses the evaluation account to perform evaluation work, and the evaluation result is passed or rejected. Because the evaluation link is one link in the whole business process, the evaluation result can influence other links whether passing or refusing. When the evaluation result is passing, the application file flows into the next link; and when the assessment result is refusal, the application file flows back to the last link. When the next link or the last link feeds back the erroneous judgment of the link, the system automatically distributes the corresponding application files in a classified mode to carry out re-assessment. If the subsequent final evaluation result is not consistent with the previous evaluation result, the previous evaluation error is described, the ratio of the number of evaluation errors in the historical evaluation application files of each evaluation account to the total number of the evaluation application files is counted as the historical misjudgment rate, and the higher the historical misjudgment rate is, the greater the misjudgment possibility of the evaluation account is represented, and the lower the reliability degree of the evaluation result is.
The data processing module comprises a file classification dispatch unit and an assessment behavior analysis unit.
The file classification and dispatch unit is used for classifying all application files with states to be evaluated or reevaluated and dispatching the application files to the evaluation account in the idle state for evaluation. Firstly, classifying all application files according to type information, wherein the application files with the same type are one type; secondly, performing text similarity calculation on all application files in each class and reference files corresponding to the type, and binding the obtained similarity with the application files; and finally, judging the application files which should be dispatched at this time according to the type and the similarity of the application files which are evaluated by the evaluation account for the last time.
The type or similarity of the application file being dispatched should be different from the type or similarity of the file last assessed by the corresponding assessment account. When all the idle state evaluation accounts can be distributed to application files with different types, the distribution is directly carried out. When the evaluation account in the partial idle state cannot be distributed to the application files with different types, the partial evaluation account preferentially selects the application file with the largest similarity difference with the last evaluation file of the corresponding evaluation account in the corresponding class for distribution, and the rest evaluation accounts normally distribute the application files with different types.
The assessment behavior analysis unit is used for calculating the reliability of the assessment process. Substituting the historical misjudgment rate, the evaluation time and the evaluation result of the evaluation account into a formula, and calculating to obtain the reliability, wherein the reliability is bound with the corresponding application file.
The reliability determines the reliability of the assessment behavior. The higher the reliability, the higher the reliability of the assessment behavior, and the lower the reliability.
The operation management module comprises a state management unit and a flow direction management unit.
The state management unit is used for adjusting the states of the evaluation account and the application file. Evaluating account states includes a free state and a busy state. The idle state refers to the assessment account not being subject to assessment work, and the busy state refers to the assessment account being subject to assessment work. The states of the application file include a state to be evaluated, an evaluated state and a re-evaluation state. The state to be evaluated refers to that the application file is not evaluated, the evaluated state refers to that the application file is evaluated and the reliability is not in a reliability abnormal section, and the re-evaluation state refers to that the application file is evaluated and the reliability is in a reliability abnormal section.
When the evaluation account is evaluating the application file, the system automatically adjusts the state of the evaluation account to a busy state, and the system does not send any task any more in the state. When the assessment of the assessment account is finished, the system automatically adjusts the assessment account state to a idle state in which the system can dispatch tasks.
When the application file is dispatched for the first time and the evaluation is not completed yet, the system automatically sets the state of the corresponding application file as the state to be evaluated. And when the evaluation is finished, judging whether the reliability of the application file is in a reliability abnormal section, setting the corresponding application file state as a re-evaluation state if the reliability of the application file is in the reliability abnormal section, and setting the corresponding application file state as an evaluated state if the reliability of the application file is not in the reliability abnormal section.
The flow direction management unit is used for judging the flow direction according to the state of the application file. And when the state of the application file is the rated state, ending the evaluation, and sending the evaluation result to the next link. When the application file state is the re-evaluation state, the evaluation is not finished, the re-evaluation is needed, the corresponding application file is sent to the data processing module, and the re-classification distribution is carried out for evaluation.
The data storage module is used for storing the application file information, the evaluation account information, the assessment process information and the state adjustment information into the database for subsequent tracing operation.
Referring to fig. 2, the present invention provides a technical solution, a data analysis management method based on an artificial intelligence cloud platform, the method includes the following steps:
s1, classifying all application files and then distributing the application files to each evaluation account;
s2, evaluating the application file by the evaluation account, and collecting evaluation information;
s3, calculating reliability through the assessment information, and adjusting the state;
s4, defining a flow direction according to the state of the application file.
In S1, the classification and dispatch steps for the application file are as follows:
s101, collecting type information and content information of all application files; the type information refers to the application type of the application file, and the content information refers to the main text in the application file.
S102, acquiring type information of all application files, and dividing the application files with the same type into the same class, wherein one or more application files are arranged in each class.
S103, acquiring main body texts of all application files through content information. And simultaneously, acquiring the main text of the reference file corresponding to each class. And calculating the similarity between the main text of each application file and the main text of the reference file corresponding to the class of the application file, and binding the obtained similarity with the application file. The method comprises the following steps:
s103-1, word segmentation processing is carried out on the text content, and each text is converted into a word list.
S103-2, removing repeated words in each word list to obtain two different word sets.
S103-3, combining two different word sets to obtain a total word set.
S103-4, constructing a vector for each word in the total word set, wherein each dimension of the vector represents the number of times the word appears in the text. The similarity between the two vectors is calculated using the formula:
where cos (θ) is the similarity of the application documents, |a| represents the modulus of vector a, |b| represents the modulus of vector b, and a×b represents the inner product of vector a and vector b.
S104, obtaining the file types and the similarity of the last evaluation of all the evaluation accounts, finding out application files of different types from the application files, and judging whether all the evaluation accounts can be distributed to the application files of different types. If yes, directly distributing; if the result is no, the type of all the application files cannot meet the type requirement of all the evaluation accounts, and part of the evaluation accounts can be distributed with the application files with the same type as the last evaluation file. In this case, the part of the evaluation accounts preferentially select the application files with the largest similarity difference with the last evaluation file of the corresponding evaluation account in the corresponding class to be distributed, and the rest of the evaluation accounts normally distribute the application files with different types. The formula is as follows:
x=argmax{|n-s|:s∈S}
wherein argmax represents the similarity with the maximum difference between the last evaluation file similarity of the evaluation account, x represents the application file corresponding to argmax, n represents the last evaluation file similarity of the evaluation account, S represents the element in the corresponding class, and S represents the set of all the similarities in the corresponding class.
In S2, the evaluation information refers to information of all the evaluation accounts in the evaluation process, including a historical misjudgment rate, an evaluation time and an evaluation result of each evaluation account. The historical misjudgment rate refers to the ratio of the number of rating errors in the historical rating application files of the corresponding rating account to the total number of rating application files. The evaluation time refers to the time it takes for an evaluation account to evaluate from the beginning of the evaluation to the end of the evaluation. The assessment results include pass and reject.
In S2 and S3, the reliability calculation and file status setting steps are as follows:
s301, acquiring the historical misjudgment rates of all the evaluation accounts which are executing the assessment work, modifying the states of the evaluation accounts into busy states, and marking.
S302, when the marked evaluation account is detected to finish the evaluation work, the state of the corresponding evaluation account is modified to be in an idle state, marking is canceled, and meanwhile, the evaluation time and the evaluation result are collected.
S303, substituting the historical misjudgment rate, the evaluation time and the evaluation result of the evaluation account into a formula, and calculating to obtain the reliability, wherein the reliability is bound with the corresponding application file. The reliability calculation formula is as follows:
wherein KKD is the reliability of the application file; p is the type of the evaluation result, the result is the refuting value-1, and the result is the passing value 1; e is the influence weight of the historical misjudgment rate; h is the historical misjudgment rate of the estimated account; f is an evaluation time influence coefficient, and the value is greater than 1; t is the evaluation time; k is the reliability of the last assessment.
S304, judging whether the reliability of the application file is in a reliability abnormal section, if so, adjusting the state of the corresponding application file from the state to be evaluated to the re-evaluation state, and if not, adjusting the state of the corresponding application file from the state to be evaluated to the evaluated state. The judgment formula is as follows:
wherein result is the application file state adjustment result, K min K is the minimum value of the reliability abnormal interval max The reliability abnormal interval maximum value KKD is the application file reliability.
In S4, the states of the application file include an evaluated state and a review state. The rated state refers to the rated and reliable application files meeting the requirements, and the re-rated state refers to the rated and reliable application files not meeting the requirements.
And when the state of the application file is the rated state, ending the evaluation, and sending the evaluation result to the next link. When the state of the application file is the re-evaluation state, the re-evaluation is needed, the step is skipped to the step S1, and the re-classification dispatch is carried out for evaluation.
Embodiment one:
assuming that the historical misjudgment rates of the A, B and C three evaluation accounts are 5%, 10% and 15% respectively, the three evaluation accounts are respectively assigned with X, Y and Z application files of different types, and after the evaluation is completed, the following results are obtained:
a: evaluation time: evaluation results were carried out for 120 seconds: through;
b: evaluation time: 150 seconds, evaluation result: performing revetment;
c: evaluation time: evaluation results 80 seconds: through;
assuming that the influence weight of the historical misjudgment rate is 10, the influence weight of the evaluation time is 1.2, the reliability abnormal interval is [ -30, +30], and the reliability of the application files corresponding to A, B and C is:
X:
Y:
Z:
the X state is evaluated, the evaluation result is passed, and the evaluation is finished; the Y and Z states are reevaluated and the evaluation is continued;
and respectively distributing the Y and the Z to the C and the A evaluation accounts for evaluation, and obtaining the following products after the evaluation is completed:
c: evaluation time: evaluation results were carried out for 120 seconds: performing revetment;
a: evaluation time: evaluation results were carried out for 60 seconds: performing revetment;
the reliability of the application files corresponding to C and A is:
Y:
Z:
the Y state is evaluated, the evaluation result is refused, and the evaluation is finished; z state is reevaluation, and evaluation is continued;
and (3) distributing the Z to the evaluation account B for evaluation, and obtaining:
b: evaluation time: evaluation results were carried out for 120 seconds: performing revetment;
the reliability of the application file corresponding to B is:
B:
and the Z state is evaluated, the evaluation result is refused, and the evaluation is finished.
It is noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Finally, it should be noted that: the foregoing description is only a preferred embodiment of the present invention, and the present invention is not limited thereto, but it is to be understood that modifications and equivalents of some of the technical features described in the foregoing embodiments may be made by those skilled in the art, although the present invention has been described in detail with reference to the foregoing embodiments. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A data analysis management system based on an artificial intelligence cloud platform is characterized in that: the system comprises a data acquisition module, a data processing module, an operation management module and a data storage module;
the data acquisition module is used for acquiring the received application file information, the evaluation account information and the evaluation process information; the data processing module is used for processing the application file information, classifying according to application types, calculating the text similarity of each application file and distributing the text similarity to different evaluation accounts; comprehensively analyzing and evaluating account information and evaluation process information, and calculating the reliability of the application file; the running management module adjusts the state of the corresponding application file according to the reliability, and determines the file flow direction through the file state; the data storage module is used for carrying out backup storage on all the information.
2. The data analysis management system based on the artificial intelligence cloud platform according to claim 1, wherein: the data acquisition module comprises a file information acquisition unit, an account information acquisition unit and an assessment information acquisition unit;
the file information acquisition unit is used for acquiring type information, content information and state information of all application files; the type information refers to the application type of the application file, the content information refers to the main text in the application file, and the state information refers to the current state of the application file, including a state to be evaluated, an evaluated state and a re-evaluation state;
the account information acquisition unit is used for acquiring the historical misjudgment rate of the estimated account, the estimated account state, the type of the application file which is evaluated for the last time and the similarity; the historical misjudgment rate refers to the ratio of the number of evaluation errors in the historical evaluation application files of the corresponding evaluation account to the total number of the evaluation application files, and the evaluation account state comprises an idle state and a busy state;
the evaluation information acquisition unit is used for acquiring the evaluation time and the evaluation result of the evaluation application file; the evaluation time refers to the time it takes for the evaluation account to evaluate from the beginning of the evaluation to the end of the evaluation, and the evaluation result includes pass and reject.
3. The data analysis management system based on the artificial intelligence cloud platform according to claim 1, wherein: the data processing module comprises a file classification dispatch unit and an assessment behavior analysis unit;
the file classification and dispatch unit is used for classifying the application files with all states of to-be-evaluated states or review states and dispatching the application files to an evaluation account in an idle state for evaluation; firstly, classifying all application files according to type information, wherein the application files with the same type are one type; secondly, performing text similarity calculation on all application files in each class and reference files corresponding to the type, and binding the obtained similarity with the application files; finally, judging the application files to be dispatched according to the type and the similarity of the application files which are evaluated by the evaluation account for the last time;
the evaluation behavior analysis unit is used for calculating the reliability of the evaluation process; substituting the historical misjudgment rate, the evaluation time and the evaluation result of the evaluation account into a formula, and calculating to obtain the reliability, wherein the reliability is bound with the corresponding application file.
4. The data analysis management system based on the artificial intelligence cloud platform according to claim 1, wherein: the operation management module comprises a state management unit and a flow direction management unit;
the state management unit is used for adjusting the states of the evaluation account and the application file; evaluating account states including a free state and a busy state; the idle state refers to that the evaluation account is not doing evaluation work, and the busy state refers to that the evaluation account is doing evaluation work; the states of the application file comprise a state to be evaluated, an evaluated state and a re-evaluation state; the state to be evaluated refers to that the application file is not evaluated, the evaluated state refers to that the application file is evaluated and the reliability is not in a reliability abnormal interval, and the re-evaluation state refers to that the application file is evaluated and the reliability is in a reliability abnormal interval;
the flow direction management unit is used for judging the flow direction according to the state of the application file; when the state of the application file is the rated state, ending the evaluation, and sending the evaluation result to the next link; when the application file state is the re-evaluation state, the evaluation is not finished, the re-evaluation is needed, the corresponding application file is sent to the data processing module, and the re-classification distribution is carried out for evaluation.
5. The data analysis management system based on the artificial intelligence cloud platform according to claim 1, wherein: the data storage module is used for storing the application file information, the evaluation account information, the assessment process information and the state adjustment information into a database for subsequent tracing operation.
6. The data analysis management method based on the artificial intelligence cloud platform is characterized by comprising the following steps of:
s1, classifying all application files and then distributing the application files to each evaluation account;
s2, evaluating the application file by the evaluation account, and collecting evaluation information;
s3, calculating reliability through the assessment information, and adjusting the state;
s4, defining a flow direction according to the state of the application file.
7. The data analysis and management method based on the artificial intelligence cloud platform according to claim 6, wherein in S1, the classification and dispatch steps of the application files are as follows:
s101, collecting type information and content information of all application files; the type information refers to the application type of the application file, and the content information refers to the main text in the application file;
s102, acquiring type information of all application files, and dividing the application files with the same type into the same class;
s103, acquiring main text of all application files through content information; simultaneously, acquiring a main text of a reference file corresponding to each class; and calculating the similarity between the main text of each application file and the main text of the reference file corresponding to the class of the application file, and binding the obtained similarity with the application file, wherein the steps are as follows:
s103-1, performing word segmentation processing on text content, and converting each text into a word list;
s103-2, removing repeated words in each word list to obtain two different word sets;
s103-3, combining two different word sets to obtain a total word set;
s103-4, constructing a vector for each word in the total word set, wherein each dimension of the vector represents the number of times the word appears in the text; the similarity between the two vectors is calculated using the formula:
where cos (θ) is the similarity of the application documents, |a| represents the modulus of vector a, |b| represents the modulus of vector b, and a×b represents the inner product of vector a and vector b;
s104, obtaining the file types and the similarity of the last evaluation of all the evaluation accounts, finding out application files of different types from the evaluation accounts in all the application files, and judging whether all the evaluation accounts can be distributed to the application files of different types; if yes, directly distributing; if the result is negative, the type of all the application files cannot meet the type requirement of all the evaluation accounts, and part of the evaluation accounts can be distributed with the application files with the same type as the last evaluation file; in this case, the part of the evaluation account preferentially selects the application files with the largest similarity difference with the last evaluation file of the corresponding evaluation account in the corresponding class for distribution, and the rest of the evaluation accounts normally distribute the application files with different types; the formula is as follows:
x=argmax{|n-s|:s∈S}
wherein argmax represents the similarity with the maximum difference between the last evaluation file similarity of the evaluation account, x represents the application file corresponding to argmax, n represents the last evaluation file similarity of the evaluation account, S represents the element in the corresponding class, and S represents the set of all the similarities in the corresponding class.
8. The data analysis management method based on the artificial intelligence cloud platform as claimed in claim 6, wherein the method comprises the following steps: in S2, the evaluation information refers to information of all the evaluation accounts in the evaluation process, wherein the information comprises a historical misjudgment rate, evaluation time and an evaluation result of each evaluation account; the historical misjudgment rate refers to the ratio of the number of evaluation errors in the historical evaluation application files of the corresponding evaluation account to the total number of the evaluation application files; the evaluation time refers to the time taken by the evaluation account from the beginning of the evaluation to the end of the evaluation; the assessment results include pass and reject.
9. The data analysis management method based on the artificial intelligence cloud platform as claimed in claim 6, wherein the method comprises the following steps: in S2 and S3, the reliability calculation and file status setting steps are as follows:
s301, acquiring the historical misjudgment rates of all the evaluation accounts which are executing the assessment work, modifying the states of the evaluation accounts into busy states and marking;
s302, when the marked evaluation account is detected to finish the evaluation work, modifying the state of the corresponding evaluation account into an idle state, canceling marking, and collecting evaluation time and evaluation results;
s303, substituting the historical misjudgment rate, the evaluation time and the evaluation result of the evaluation account into a formula, and calculating to obtain the reliability, wherein the reliability is bound with the corresponding application file; the reliability calculation formula is as follows:
wherein KKD is the reliability of the application file, p is the type of the evaluation result, e is the influence weight of the historical misjudgment rate, h is the historical misjudgment rate of the evaluation account, f is the influence coefficient of the evaluation time, t is the evaluation time, and k is the reliability of the last evaluation;
s304, judging whether the reliability of the application file is in a reliability abnormal section, if so, adjusting the state of the corresponding application file from the state to be evaluated to the re-evaluation state, and if not, adjusting the state of the corresponding application file from the state to be evaluated to the evaluated state; the judgment formula is as follows:
wherein result is the application file state adjustment result, K min K is the minimum value of the reliability abnormal interval max The reliability abnormal interval maximum value KKD is the application file reliability.
10. The data analysis management method based on the artificial intelligence cloud platform as claimed in claim 6, wherein the method comprises the following steps: in S4, the state of the application file includes an evaluated state and a review state; the rated state refers to the rated application files with the reliability meeting the requirements, and the re-rated state refers to the rated application files with the reliability not meeting the requirements;
when the state of the application file is the rated state, ending the evaluation, and sending the evaluation result to the next link; when the state of the application file is the re-evaluation state, the re-evaluation is needed, the step is skipped to the step S1, and the re-classification dispatch is carried out for evaluation.
CN202310669585.5A 2023-06-07 2023-06-07 Data analysis management system and method based on artificial intelligent cloud platform Active CN116739512B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310669585.5A CN116739512B (en) 2023-06-07 2023-06-07 Data analysis management system and method based on artificial intelligent cloud platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310669585.5A CN116739512B (en) 2023-06-07 2023-06-07 Data analysis management system and method based on artificial intelligent cloud platform

Publications (2)

Publication Number Publication Date
CN116739512A true CN116739512A (en) 2023-09-12
CN116739512B CN116739512B (en) 2024-07-26

Family

ID=87905584

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310669585.5A Active CN116739512B (en) 2023-06-07 2023-06-07 Data analysis management system and method based on artificial intelligent cloud platform

Country Status (1)

Country Link
CN (1) CN116739512B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101446970A (en) * 2008-12-15 2009-06-03 腾讯科技(深圳)有限公司 Method for censoring and process text contents issued by user and device thereof
CN112016830A (en) * 2020-08-27 2020-12-01 广东电网有限责任公司 Patent file evaluation task allocation method and device
CN113780438A (en) * 2021-09-15 2021-12-10 山东恒信科技服务有限公司 Science and technology project application tutoring system based on big data
CN114202195A (en) * 2021-12-09 2022-03-18 国网四川省电力公司天府新区供电公司 Design file workload evaluation system and evaluation method based on private chain
CN115759945A (en) * 2022-09-07 2023-03-07 南方电网数字平台科技(广东)有限公司 Project intelligent evaluation method and device based on bid document
CN115908062A (en) * 2022-12-26 2023-04-04 广西壮族自治区农业科学院 Intellectual property full-period management system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101446970A (en) * 2008-12-15 2009-06-03 腾讯科技(深圳)有限公司 Method for censoring and process text contents issued by user and device thereof
CN112016830A (en) * 2020-08-27 2020-12-01 广东电网有限责任公司 Patent file evaluation task allocation method and device
CN113780438A (en) * 2021-09-15 2021-12-10 山东恒信科技服务有限公司 Science and technology project application tutoring system based on big data
CN114202195A (en) * 2021-12-09 2022-03-18 国网四川省电力公司天府新区供电公司 Design file workload evaluation system and evaluation method based on private chain
CN115759945A (en) * 2022-09-07 2023-03-07 南方电网数字平台科技(广东)有限公司 Project intelligent evaluation method and device based on bid document
CN115908062A (en) * 2022-12-26 2023-04-04 广西壮族自治区农业科学院 Intellectual property full-period management system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张文俊;: "企业专利管理系统中专利分析评估模块功能研究", 江苏科技信息, no. 09, 30 March 2017 (2017-03-30) *
肖刘明镜;周志;邹小军;胡俊峰;: "一种基于领域本体的稿件―审阅人相关度度量方法", 中文信息学报, no. 02, 15 March 2017 (2017-03-15) *

Also Published As

Publication number Publication date
CN116739512B (en) 2024-07-26

Similar Documents

Publication Publication Date Title
CN109949152A (en) A kind of personal credit's violation correction method
CN110991474A (en) Machine learning modeling platform
CN111061709A (en) Data cleaning automation method and system for similar repeated records
CN110689437A (en) Communication construction project financial risk prediction method based on random forest
CN113033909A (en) Portable user analysis method, device, equipment and computer storage medium
CN116739512B (en) Data analysis management system and method based on artificial intelligent cloud platform
CN116739332B (en) Industrial chain intelligent management method, system and medium based on big data
CN111292099A (en) Intelligent station anti-ticket-swiping method and anti-ticket-swiping system
CN111752541A (en) Pay routing method based on Rete algorithm
CN109145207B (en) Information personalized recommendation method and device based on classification index prediction
CN111797922B (en) Text image classification method and device
CN115544250A (en) Data processing method and system
CN115438190A (en) Power distribution network fault decision-making assisting knowledge extraction method and system
CN110263156B (en) Intelligent order dispatching method for big data of government-enterprise service
CN114372810A (en) Funding account identification and funding transaction relation network analysis method for funding person
CN109308565B (en) Crowd performance grade identification method and device, storage medium and computer equipment
CN113191569A (en) Enterprise management method and system based on big data
CN112991131A (en) Government affair data processing method suitable for electronic government affair platform
CN114637849B (en) Legal relation cognition method and system based on artificial intelligence
CN110443305A (en) Self-adaptive features processing method and processing device
CN117807559B (en) Multi-mode data fusion method and system
CN107577644A (en) Landslide Remedial Measures on Some method for optimizing based on AOWEA operators
WO2022227213A1 (en) Industry recommendation method and apparatus, computer device and storage medium
CN114120091A (en) Group pedigree identification method, device, equipment and medium
CN118152087A (en) Data distribution method and system for distributed computing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20240703

Address after: 223800, 2nd Floor, Building 1, Huating, Lijingwan, Sucheng District, Suqian City, Jiangsu Province

Applicant after: Jiangsu Zhongwu Big Data Development Group Co.,Ltd.

Country or region after: China

Address before: No. 5-1, 1st floor, and No. 13-2, 3rd floor, CD Building, Permanent Commercial Building, Nangang Concentration District, Economic and Technological Development Zone, Harbin City, Heilongjiang Province, 150000 yuan

Applicant before: HARBIN RONGMEI TECHNOLOGY Co.,Ltd.

Country or region before: China

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant