CN117312411A - Personnel document file rapid searching system and method based on data analysis - Google Patents
Personnel document file rapid searching system and method based on data analysis Download PDFInfo
- Publication number
- CN117312411A CN117312411A CN202311272805.7A CN202311272805A CN117312411A CN 117312411 A CN117312411 A CN 117312411A CN 202311272805 A CN202311272805 A CN 202311272805A CN 117312411 A CN117312411 A CN 117312411A
- Authority
- CN
- China
- Prior art keywords
- data
- module
- analysis
- personnel
- file
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000007405 data analysis Methods 0.000 title claims abstract description 39
- 238000000034 method Methods 0.000 title claims description 18
- 238000004458 analytical method Methods 0.000 claims abstract description 26
- 238000007418 data mining Methods 0.000 claims abstract description 21
- 238000007781 pre-processing Methods 0.000 claims abstract description 20
- 238000013079 data visualisation Methods 0.000 claims abstract description 19
- 238000013480 data collection Methods 0.000 claims description 16
- 238000012545 processing Methods 0.000 claims description 8
- 238000004140 cleaning Methods 0.000 claims description 7
- 230000002159 abnormal effect Effects 0.000 claims description 6
- 238000005065 mining Methods 0.000 claims description 6
- 238000012300 Sequence Analysis Methods 0.000 claims description 4
- 238000000611 regression analysis Methods 0.000 claims description 4
- 238000010586 diagram Methods 0.000 claims description 3
- 238000011161 development Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/248—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/26—Visual data mining; Browsing structured data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/105—Human resources
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Entrepreneurship & Innovation (AREA)
- Computational Linguistics (AREA)
- Strategic Management (AREA)
- Quality & Reliability (AREA)
- Probability & Statistics with Applications (AREA)
- Mathematical Physics (AREA)
- Fuzzy Systems (AREA)
- Software Systems (AREA)
- Economics (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of personnel file searching, and discloses a personnel file rapid searching system based on data analysis, which comprises a data collector, a data preprocessing module, a data mining module, a data analysis module, a data visualization module and a data output module; according to the invention, the data collector, the data preprocessing module, the data mining module, the data analysis module, the data visualization module and the data output module are arranged, so that personnel file data can be collected, cleaned, mined and analyzed, finally, the personnel file data is presented and output in a chart mode, and the personnel file meeting the conditions can be quickly and accurately searched through the analysis of the data, so that the efficiency and the accuracy of personnel management are improved.
Description
Technical Field
The invention relates to the technical field of personnel file searching, in particular to a system and a method for quickly searching personnel file based on data analysis.
Background
The personnel file is file material for recording personal experience, moral style, business ability, work performance and other contents formed in personnel cultivation, selection, optional work and other works, and is important personnel information data of personnel in post.
Today, the technological informatization development is high, and the traditional personnel file management system has been replaced by personnel file informatization management, wherein the personnel file informatization management refers to the improvement of the current personnel file management work by adopting the modern advanced information technology, and is the necessary trend and result of the personnel file management work development.
The existing personal document file quick search system can only adopt a search mode to search and find the personal document file, however, the document search efficiency can be influenced by a plurality of factors such as the number of search keywords, the search accuracy and the data volume of a database, so that the search efficiency of the single search mode under different search environments can not be ensured, and therefore, the personal document file quick search system and the method based on data analysis are provided.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a personnel file quick searching system and method based on data analysis, which improves the efficiency and accuracy of personnel file searching.
In order to achieve the above purpose, the present invention provides the following technical solutions: the system comprises a data collector, a data preprocessing module, a data mining module, a data analysis module, a data visualization module and a data output module, and is characterized in that:
and a data collection module: the personal file data collection device is used for collecting personal file data;
and a data preprocessing module: for cleaning data;
the data mining module is used for mining potential rules in the data;
the data analysis module is used for analyzing the data;
the data visualization module is used for presenting the data in a chart form;
and the data output module is used for outputting the result.
Preferably, the data collection module obtains personnel profile information of the staff based on the personnel management system of the enterprise.
Preferably, the data preprocessing module removes duplicate information, processes missing data, and detects and processes outliers based on the personnel document files collected by the data collection module.
Preferably, the data mining module is used for mining potential data distribution, association and classification based on the personnel document files processed by the data preprocessing module.
Preferably, the data analysis module performs higher-level data processing and analysis based on potential rules in the personnel file data mined by the data mining module, and the analysis modes of the data analysis module comprise data clustering, regression analysis and time sequence analysis.
Preferably, the data visualization module outputs the analysis data analyzed by the data analysis module into a graph through a word port.
Preferably, the data output module outputs the chart output by the data visualization module to form a report and a document.
A personal document file quick searching method based on data analysis comprises the following steps:
s1, data collection: collecting personnel file data, including employee names, sexes, ages, academies, working experiences, salary levels, job position changes, salary adjustment and performance assessment;
s2, data preprocessing: cleaning the collected data, including deleting repeated data, processing missing values and eliminating abnormal values;
s3, data mining: potential rules in the data are mined, including distribution conditions of staff salary levels and salary differences of staff in different schools;
s4, data analysis: analyzing the data, including data classification, data clustering and association rule analysis;
s5, data visualization: presenting the analysis result in a chart form, wherein the chart form comprises a distribution diagram of staff salary levels and salary difference histograms of staff of different students;
s6, data output: and outputting analysis results, wherein the analysis results comprise average value, median and mode information of staff salary levels and salary difference conditions of different students staff.
Compared with the prior art, the invention has the following beneficial effects:
according to the invention, the data collector, the data preprocessing module, the data mining module, the data analysis module, the data visualization module and the data output module are arranged, so that personnel file data can be collected, cleaned, mined and analyzed, finally, the personnel file data is presented and output in a chart mode, and the personnel file meeting the conditions can be quickly and accurately searched through the analysis of the data, so that the efficiency and the accuracy of personnel management are improved.
According to the method, the repeated information is removed, the missing data is processed, abnormal values are detected and processed, potential data distribution, association and classification in the personnel file are excavated, and higher-level data processing and analysis such as data clustering, regression analysis and time sequence analysis are performed on the personnel file, so that different data of the personnel file are differentiated, and all information of personnel can be searched through inquiring part of information of the personnel, and matching of the information of the personnel with corresponding posts is facilitated, so that the personnel file is conveniently called.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention may be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
FIG. 1 is a flow chart of the personal document file system of the present invention;
FIG. 2 is a flowchart of the method for recording the personal document of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art without the inventive effort, are intended to be within the scope of the present invention, based on the embodiments herein.
Embodiment one:
referring to fig. 1, the system for quickly searching a personal document file based on data analysis in this embodiment includes a data collector, a data preprocessing module, a data mining module, a data analysis module, a data visualization module and a data output module, and specifically includes the following steps:
and a data collection module: the personal file data collection device is used for collecting personal file data;
and a data preprocessing module: for cleaning data;
the data mining module is used for mining potential rules in the data;
the data analysis module is used for analyzing the data;
the data visualization module is used for presenting the data in a chart form;
and the data output module is used for outputting the result.
Specifically, the structure of the invention is similar to the existing structure, and the main improvement point of the invention is to improve the efficiency and accuracy of searching the personnel file; according to the invention, the data collector, the data preprocessing module, the data mining module, the data analysis module, the data visualization module and the data output module are arranged, so that personnel file data can be collected, cleaned, mined and analyzed, finally, the personnel file data is presented and output in a chart mode, and the personnel file meeting the conditions can be quickly and accurately searched through the analysis of the data, so that the efficiency and the accuracy of personnel management are improved.
Specifically, the data collection module acquires personnel file information of staff based on a personnel management system of an enterprise; the personnel management system of the enterprise is a system for managing personnel, and comprises personnel files, personnel transfer, performance assessment and the like, and the data collection module is used for collecting personnel files and new personnel files of the enterprise by adding the personnel file, so that the personnel file deletion is reduced, and the follow-up accuracy of personnel file searching is ensured.
Specifically, the data preprocessing module removes repeated information, processes missing data and detects and processes abnormal values based on the personnel file collected by the data collecting module; and cleaning, normalizing, converting and the like are performed on the collected data so as to ensure the accuracy and consistency of the data. For example, the date format is uniformly converted into a standard format, and text information (such as job position and salary) is converted into numerical data.
Specifically, the data mining module is used for mining potential data distribution, association and classification based on the personnel document files processed by the data preprocessing module; the processed personnel file is mined, the law of the personnel file is found, the data of the corresponding personnel file can be conveniently queried through key information during query, and the query efficiency and accuracy are improved.
Specifically, the data analysis module performs higher-level data processing and analysis based on potential rules in personnel file data mined by the data mining module, and the analysis modes of the data analysis module comprise data clustering, regression analysis and time sequence analysis; preprocessing and excavating the personnel file, and then carrying out higher-level processing and analysis on personnel file data to form accurate classification so as to carry out quick and accurate query; and a quick search model, such as a classification model based on a machine learning algorithm or an association rule model based on a data mining algorithm, can be constructed according to the rule mined by the data mining module so as to facilitate quick analysis of personnel document archival data.
Specifically, the data visualization module outputs analysis data analyzed by the data analysis module into a chart through a word port; the personnel file data of the classified personnel file are output into a chart through word, so that personnel can conveniently review the chart, the data can be conveniently compared and analyzed, and whether the data has poor deviation with the existing data or not is observed so as to correct the data in time.
Specifically, the data output module outputs the chart output by the data visualization module to form a report and a document; when a user makes a query request, the system performs quick matching and screening according to the query condition, returns a candidate list meeting the condition, and outputs and displays the candidate document file in a word chart mode, so that the user can quickly select a proper candidate.
Embodiment two:
referring to fig. 2, the method for quickly searching a personal document file based on data analysis in this embodiment is as follows:
s1, data collection: collecting personnel file data, including employee names, sexes, ages, academies, working experiences, salary levels, job position changes, salary adjustment and performance assessment; by collecting information which can influence posts in various aspects of the personnel file, the accuracy of personnel file information is ensured when the personnel file is searched, so that the optimal personnel file is searched.
S2, data preprocessing: cleaning the collected data, including deleting repeated data, processing missing values and eliminating abnormal values; in the data collection process, abnormal information such as repetition, errors and the like can exist, and the efficiency and the accuracy of personnel document file data in query are ensured by deleting, correcting, adjusting and the like the information.
S3, data mining: potential rules in the data are mined, including distribution conditions of staff salary levels and salary differences of staff in different schools; by excavating and comparing the pretreated personnel file, rules can be excavated through different conditions of staff, so that people meeting the conditions can be obtained rapidly when the personnel file is inquired based on the rules.
S4, data analysis: analyzing the data, including data classification, data clustering and association rule analysis; by means of higher-level analysis of the data, the personnel file records are classified, and related inquiry is carried out through inquired information, so that the inquiry accuracy of the personnel file records is improved, and the working efficiency is improved.
S5, data visualization: presenting the analysis result in a chart form, wherein the chart form comprises a distribution diagram of staff salary levels and salary difference histograms of staff of different students; the screened candidate information is displayed to the user in a simple and clear mode, and different personnel document files can be compared in a chart form to select an optimal personnel document file, so that the method is convenient to use
S6, data output: outputting analysis results, including average value, median and mode information of staff salary levels and salary difference conditions of different students; staff is classified based on salary, query results presented by charts are output through input of query information, and optimal personnel document files can be selected from the query results.
It is to be understood that the above examples of the present invention are provided for clarity of illustration only and are not limiting of the embodiments of the present invention. Other variations or modifications of the above teachings will be apparent to those of ordinary skill in the art. It is not necessary here nor is it exhaustive of all embodiments. Any modification, equivalent replacement, improvement, etc. which come within the spirit and principles of the invention are desired to be protected by the following claims.
Claims (8)
1. The personnel file quick searching system based on data analysis is characterized by comprising a data collector, a data preprocessing module, a data mining module, a data analysis module, a data visualization module and a data output module, and specifically comprises the following steps:
and a data collection module: the personal file data collection device is used for collecting personal file data;
and a data preprocessing module: for cleaning data;
the data mining module is used for mining potential rules in the data;
the data analysis module is used for analyzing the data;
the data visualization module is used for presenting the data in a chart form;
and the data output module is used for outputting the result.
2. The rapid personnel file searching system based on data analysis of claim 1, wherein the data collection module obtains personnel file information of the personnel in the personnel management system of the enterprise.
3. The rapid personal document file searching system based on data analysis of claim 2 wherein the data preprocessing module removes duplicate information, processes missing data and detects and processes outliers based on the personal document file collected by the data collection module.
4. A system for quickly searching personal document files based on data analysis according to claim 3, wherein the data mining module is used for mining potential data distribution, association and classification based on the personal document files processed by the data preprocessing module.
5. The system for quickly searching the personnel document file based on the data analysis according to claim 4, wherein the data analysis module performs higher-level data processing and analysis based on potential rules in the personnel document file data mined by the data mining module, and the analysis modes of the data analysis module comprise data clustering, regression analysis and time sequence analysis.
6. The rapid personal document archive lookup system based on data analysis of claim 5 wherein the data visualization module outputs the analyzed data based on the data analysis module as a graph through a word port.
7. The rapid personal document file searching system based on data analysis of claim 6, wherein the data output module outputs the chart outputted by the data visualization module to form a report and a document.
8. A personnel file quick searching method based on data analysis is characterized by comprising the following steps:
s1, data collection: collecting personnel file data, including employee names, sexes, ages, academies, working experiences, salary levels, job position changes, salary adjustment and performance assessment;
s2, data preprocessing: cleaning the collected data, including deleting repeated data, processing missing values and eliminating abnormal values;
s3, data mining: potential rules in the data are mined, including distribution conditions of staff salary levels and salary differences of staff in different schools;
s4, data analysis: analyzing the data, including data classification, data clustering and association rule analysis;
s5, data visualization: presenting the analysis result in a chart form, wherein the chart form comprises a distribution diagram of staff salary levels and salary difference histograms of staff of different students;
s6, data output: and outputting analysis results, wherein the analysis results comprise average value, median and mode information of staff salary levels and salary difference conditions of different students staff.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311272805.7A CN117312411A (en) | 2023-09-28 | 2023-09-28 | Personnel document file rapid searching system and method based on data analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311272805.7A CN117312411A (en) | 2023-09-28 | 2023-09-28 | Personnel document file rapid searching system and method based on data analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117312411A true CN117312411A (en) | 2023-12-29 |
Family
ID=89245770
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311272805.7A Pending CN117312411A (en) | 2023-09-28 | 2023-09-28 | Personnel document file rapid searching system and method based on data analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117312411A (en) |
-
2023
- 2023-09-28 CN CN202311272805.7A patent/CN117312411A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Thor et al. | Introducing CitedReferencesExplorer (CRExplorer): A program for reference publication year spectroscopy with cited references standardization | |
Zhou et al. | A map of threats to validity of systematic literature reviews in software engineering | |
US10970315B2 (en) | Method and system for disambiguating informational objects | |
CN112800113B (en) | Bidding auditing method and system based on data mining analysis technology | |
US7953724B2 (en) | Method and system for disambiguating informational objects | |
KR102213627B1 (en) | Analysis software management system and analysis software management method | |
KR20190039758A (en) | Data analysis support device and data analysis support system | |
DE102012221251A1 (en) | Semantic and contextual search of knowledge stores | |
CN112581189A (en) | Intelligent supplier recommendation system and method | |
CN113408890A (en) | Artificial intelligence-based method and system for generating evaluation report after industrial investment project | |
CN116384889A (en) | Intelligent analysis method for information big data based on natural language processing technology | |
CN111143370B (en) | Method, apparatus and computer-readable storage medium for analyzing relationships between a plurality of data tables | |
CN111104483A (en) | ICT system fault analysis and auxiliary discrimination method based on machine learning | |
CN112598142B (en) | Wind turbine maintenance working quality inspection auxiliary method and system | |
CN117312411A (en) | Personnel document file rapid searching system and method based on data analysis | |
Al-Zubidy et al. | Review of systematic literature review tools | |
CN113239145A (en) | Resume retrieval method based on job description | |
KR20060114569A (en) | An operating methods for patent information system | |
CN116401212B (en) | Personnel file quick searching system based on data analysis | |
CN112699005A (en) | Server hardware fault monitoring method, electronic equipment and storage medium | |
Nedelkoska et al. | Eight decades of changes in occupational tasks, computerization and the gender pay gap | |
CN115858738B (en) | Enterprise public opinion information similarity identification method | |
US11816112B1 (en) | Systems and methods for automated process discovery | |
CN117435777B (en) | Automatic construction method and system for industrial chain map | |
CN117877039A (en) | Data identification and data management method for periodic inspection report of oil refining chemical equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |