CN112506986A - Specific professional talent skill requirement mining system based on web recruitment information - Google Patents


Info

Publication number
CN112506986A
Authority
CN
China
Prior art keywords
data
module
information
recruitment
web
Prior art date
Legal status
Pending
Application number
CN202011307168.9A
Other languages
Chinese (zh)
Inventor
罗南超
钟静
Current Assignee
ABA Teachers University
Original Assignee
ABA Teachers University
Priority date
2020-11-19
Filing date
2020-11-19
Publication date
2021-03-16
Application filed by ABA Teachers University
Priority to CN202011307168.9A
Publication of CN112506986A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21 Design, administration or maintenance of databases
    • G06F16/215 Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/24569 Query processing with adaptation to specific hardware, e.g. adapted for using GPUs or SSDs
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • G06F16/2465 Query processing support for facilitating data mining operations in structured databases
    • G06F16/248 Presentation of query results
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G06F16/254 Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G06F16/285 Clustering or classification
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G06Q10/105 Human resources

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Quality & Reliability (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Software Systems (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a system for mining the skill requirements of specific professional talents based on web recruitment information, and relates to the technical field of recruitment information processing. The system comprises a web access module, an information storage cloud library, an information retrieval platform and a data analysis platform. The web access module acquires recruitment information published on the web; the information storage cloud library collects and stores the recruitment information published on the web and updates it in real time; the information retrieval platform screens, from the information storage cloud library, the specific recruitment information the user wants to retrieve; and the data analysis platform analyzes and classifies the information screened by the information retrieval platform and then presents the results as data. With this system, enterprises' skill requirements for a particular profession can be queried quickly, conveniently and accurately, which is efficient and saves a large amount of labor.

Description

Specific professional talent skill requirement mining system based on web recruitment information
Technical Field
The invention relates to the technical field of recruitment information processing, in particular to a system for mining the skill requirement of a specific professional talent based on web recruitment information.
Background
With the spread and development of the internet, recruitment information has gradually moved from printed newspapers to recruitment websites, which have become the main channel through which enterprises publish, and applicants obtain, recruitment information. To recruit high-end talent, an enterprise may publish recruitment information on several different recruitment websites. Compiling statistics on the recruitment requirements of different occupations from this information can support the training and development of talent for particular occupations.
At present, compiling statistics on the skill requirements for a specific profession requires extensive manual searching and screening on the internet, followed by statistical analysis of the collected data, before any conclusion can be reached.
Disclosure of Invention
(I) Technical problem to be solved
In view of the deficiencies of the prior art, the invention provides a system for mining the skill requirements of specific professional talents based on web recruitment information. It addresses the heavy workload, long turnaround and low efficiency of manually compiling statistics on enterprises' skill requirements for specific professional talents.
(II) Technical scheme
To achieve the above purpose, the invention adopts the following technical scheme: a system for mining the skill requirements of specific professional talents based on web recruitment information, comprising a web access module, an information storage cloud library, an information retrieval platform and a data analysis platform;
the web access module is used for acquiring the recruitment information published on the web;
the information storage cloud library is used for collecting and storing the recruitment information published on the web and updating it in real time;
the information retrieval platform is used for screening, from the information storage cloud library, the specific recruitment information the user wants to retrieve;
and the data analysis platform is used for analyzing and classifying the information screened by the information retrieval platform and then presenting the results as data.
Preferably, the information storage cloud library comprises a data acquisition module, a data cleaning module, a data deleting module and a data storage module, wherein the data acquisition module is connected with the data cleaning module, the data cleaning module is connected with the data storage module, and the data deleting module is connected with the data storage module;
the data acquisition module is used for acquiring talent recruitment information of all kinds from the web in real time;
the data cleaning module is used for receiving the web page recruitment information acquired by the data acquisition module and removing useless information from it;
the data storage module is used for storing the cleaned, useful recruitment information;
and the data deleting module is used for deleting outdated and duplicate data from the recruitment information, with a configurable data time limit.
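The patent does not say how the data deleting module decides what is outdated or repeated. The following is a minimal Python sketch under stated assumptions: the `Posting` record layout, the `prune` function, the 180-day default limit, and the duplicate key (title, company, region) are all hypothetical choices made for illustration, not details taken from the source.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class Posting:
    """One stored recruitment posting (hypothetical record layout)."""
    title: str
    company: str
    region: str
    published: date


def prune(postings, today, max_age_days=180):
    """Drop postings older than the time limit, then drop repeats."""
    cutoff = today - timedelta(days=max_age_days)
    seen = set()
    kept = []
    for p in postings:
        if p.published < cutoff:
            continue  # outdated: older than the configurable time limit
        # Assumed duplicate rule: same normalized title and company, same region.
        key = (p.title.strip().lower(), p.company.strip().lower(), p.region)
        if key in seen:
            continue  # repeated posting, e.g. collected from two websites
        seen.add(key)
        kept.append(p)
    return kept


postings = [
    Posting("Data Analyst", "Acme", "Chengdu", date(2020, 11, 1)),
    Posting("data analyst ", "ACME", "Chengdu", date(2020, 11, 5)),
    Posting("Data Analyst", "Acme", "Chengdu", date(2019, 1, 1)),
]
fresh = prune(postings, today=date(2020, 11, 19))
print(len(fresh))
```

Keeping the first occurrence of each key is one possible policy; a real deployment might instead keep the most recent copy.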
Preferably, the input end of the data acquisition module is connected with the output end of the web access module.
Preferably, the information retrieval platform comprises a retrieval target determining module, a data extraction module, a primary data screening module, a secondary data screening module, a final data screening module and a data transmission module, wherein the data extraction module is connected with the primary data screening module, the retrieval target determining module is connected with the primary data screening module, the primary data screening module is connected with the secondary data screening module, the secondary data screening module is connected with the final data screening module, and the final data screening module is connected with the data transmission module;
the retrieval target determining module is used by the user to input the information of the target to be retrieved and the screening levels;
the data extraction module is used for extracting recruitment information from the information storage cloud library;
the primary, secondary and final data screening modules are used for receiving the recruitment information transmitted by the data extraction module and screening it level by level according to the retrieval conditions set in the retrieval target determining module;
and the data transmission module is used for transmitting the data obtained after the multi-level screening to the data analysis platform.
Preferably, the retrieval items of the retrieval target determining module include the specific profession name and keywords, the retrieval time range, the retrieval region, and the like.
Preferably, the data analysis platform comprises a data statistics module, a data classification module and a data output module;
the data statistics module is used for performing statistics on all the retrieved data;
the data classification module is used for classifying the recruitment data after the retrieval is finished;
and the data output module is used for outputting and displaying the recruitment data after the analysis is finished.
Preferably, the classification criteria of the data classification module include the size of the recruiting organization, the wage offered, the age range sought, and the like.
Preferably, the output of the data output module is an electronic chart, specifically a pie chart, a data trend line chart, an Excel table and the like.
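The classification and statistics steps of the data analysis platform can be illustrated with a short Python sketch. The record fields (`company_size`, `wage`), the wage thresholds, and the function names `wage_band` and `classify` are assumptions made for this example; the patent names the classification criteria but not how they are computed.

```python
from collections import Counter


def wage_band(monthly_wage):
    """Map a monthly wage to a coarse band (thresholds are illustrative)."""
    if monthly_wage < 5000:
        return "under 5k"
    if monthly_wage < 10000:
        return "5k-10k"
    return "10k and above"


def classify(records, key):
    """Count records per class; `key` maps a record to its class label."""
    return Counter(key(r) for r in records)


records = [
    {"company_size": "small", "wage": 4500},
    {"company_size": "large", "wage": 12000},
    {"company_size": "large", "wage": 8000},
]
by_wage = classify(records, lambda r: wage_band(r["wage"]))
by_size = classify(records, lambda r: r["company_size"])
print(dict(by_wage))
print(dict(by_size))
```

The resulting counts are the kind of data a pie chart or table in the data output module could be drawn from.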
(III) Advantageous effects
The invention provides a system for mining the skill requirements of specific professional talents based on web recruitment information, with the following beneficial effects:
1. The system includes an information storage cloud library that collects and stores recruitment information published on the web in real time and removes useless information, such as advertisements and links, from the web page content. When a user needs recruitment information, it can be retrieved directly from the information storage cloud library rather than from the web pages, which saves a large amount of time.
2. The system includes an information retrieval platform in which retrieval conditions can be set for the specific occupation of interest; the information in the cloud library is then screened level by level against those conditions, so that the related recruitment information can be retrieved quickly and accurately.
3. The system includes a data analysis platform that compiles classified statistics on the retrieved data and presents them to the user as charts organized by the classification criteria, so that enterprises' skill requirements for a particular profession can be queried quickly, conveniently and accurately.
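The removal of links and similar page clutter described in effect 1 might look like the following Python sketch, which uses only the standard library's `html.parser`. The class name `PostingTextExtractor`, the `clean` helper, and the rule that every `<a>`, `<script>` and `<style>` element counts as useless content are assumptions for illustration; the patent does not specify how advertisements are detected.

```python
from html.parser import HTMLParser


class PostingTextExtractor(HTMLParser):
    """Collect visible text, skipping links, scripts, and styles."""

    SKIPPED = {"a", "script", "style"}  # assumed to carry no posting content

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def clean(html):
    """Return the page text with skipped elements removed."""
    parser = PostingTextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


page = (
    '<div><h1>Java Engineer</h1><a href="/ad">Buy now!</a>'
    "<p>Requires: Spring, MySQL</p></div>"
)
print(clean(page))
```

Treating every link as clutter is deliberately crude; a production cleaner would likely need site-specific rules.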
Drawings
Fig. 1 is a schematic structural diagram of a system for mining the skill requirement of a specific professional based on web recruitment information according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example:
As shown in Fig. 1, an embodiment of the invention provides a system for mining the skill requirements of specific professional talents based on web recruitment information, which comprises a web access module, an information storage cloud library, an information retrieval platform and a data analysis platform;
the Web access module is used for acquiring the recruitment information released on the Web;
the information storage cloud library is used for collecting and storing various recruitment information published on the web and updating the recruitment information in real time, and comprises a data acquisition module used for acquiring talent recruitment information in all aspects from the web in real time; the data cleaning module is used for receiving the webpage recruitment information acquired by the data acquisition module and removing useless information in the webpage recruitment information; the data deleting module is used for deleting some outdated data and repeated data in the recruitment information, and data time limit can be set; the data storage module is used for storing the processed useful recruitment information; the data acquisition module is connected with the data cleaning module, the data cleaning module is connected with the data storage module, and the data deletion module is connected with the data storage module; the input end of the data acquisition module is connected with the output end of the web access module;
the information retrieval platform is used for screening, from the information storage cloud library, the specific recruitment information the user wants to retrieve, and comprises a retrieval target determining module, a data extraction module, a primary data screening module, a secondary data screening module, a final data screening module and a data transmission module, wherein the data extraction module is connected with the primary data screening module, the retrieval target determining module is connected with the primary data screening module, the primary data screening module is connected with the secondary data screening module, the secondary data screening module is connected with the final data screening module, and the final data screening module is connected with the data transmission module; the retrieval target determining module is used by the user to input the information of the target to be retrieved and the screening levels; the data extraction module is used for extracting recruitment information from the information storage cloud library; the primary, secondary and final data screening modules are used for receiving the recruitment information transmitted by the data extraction module and screening it level by level according to the retrieval conditions set in the retrieval target determining module; the data transmission module is used for transmitting the data obtained after the multi-level screening to the data analysis platform;
the data analysis platform is used for analyzing and classifying the information screened by the information retrieval platform and then presenting the results as data, and comprises a data statistics module, a data classification module and a data output module; the data statistics module is used for compiling statistics on all the retrieved data; the data classification module is used for classifying the retrieved recruitment data, and its classification criteria include the size of the recruiting organization, the wage offered, the age range sought, and the like; and the data output module is used for outputting and displaying the analyzed recruitment data, and its output is an electronic chart, specifically a pie chart, a data trend line chart, an Excel table and the like.
The specific operation flow of the system is as follows:
S1. The user enters the specific occupation and keywords to be searched in the retrieval target determining module and sets the retrieval scope, such as the region and time range;
S2. The data extraction module extracts recruitment information from the data storage module and transmits it to the primary data screening module;
S3. The primary data screening module performs the first-level screening according to the specific occupation set in step S1 and transmits the screened data to the secondary data screening module;
S4. The secondary data screening module performs the second-level screening according to the region set in step S1 and transmits the screened data to the final data screening module;
S5. The final data screening module performs the final screening according to the time range set in step S1 and transmits the screened data to the data transmission module;
S6. The data transmission module transmits the retrieved data to the data analysis platform;
S7. The data statistics module and the data classification module of the data analysis platform compile classified statistics on the data;
S8. The data output module outputs the analysis results as a pie chart, a data trend line chart, an Excel table and the like.
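Steps S1 to S5 above can be sketched as three filters applied in sequence. This is a hedged illustration only: the `Query` fields, the dictionary keys `title`, `region` and `published`, the substring match on the title, and the function names are all hypothetical, since the patent describes the order of the screening stages but not their matching rules.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Query:
    """Retrieval target entered in S1 (field names are hypothetical)."""
    occupation: str
    regions: set
    start: date
    end: date


def primary(postings, q):
    """S3: keep postings whose title mentions the occupation."""
    return [p for p in postings if q.occupation.lower() in p["title"].lower()]


def secondary(postings, q):
    """S4: keep postings located in one of the requested regions."""
    return [p for p in postings if p["region"] in q.regions]


def final(postings, q):
    """S5: keep postings published inside the requested time range."""
    return [p for p in postings if q.start <= p["published"] <= q.end]


def screen(postings, q):
    """Run the three stages in order; each stage narrows the previous output."""
    for stage in (primary, secondary, final):
        postings = stage(postings, q)
    return postings


store = [
    {"title": "Senior Nurse", "region": "Sichuan", "published": date(2020, 10, 1)},
    {"title": "Nurse", "region": "Beijing", "published": date(2020, 10, 2)},
    {"title": "Nurse", "region": "Sichuan", "published": date(2019, 3, 1)},
    {"title": "Accountant", "region": "Sichuan", "published": date(2020, 10, 3)},
]
q = Query("nurse", {"Sichuan"}, date(2020, 1, 1), date(2020, 12, 31))
hits = screen(store, q)
print([p["title"] for p in hits])
```

Because each stage only removes records, running the cheapest or most selective filter first reduces the work left for later stages, which is one plausible reading of why the patent screens by occupation before region and time.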
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (8)

1. The system for mining the skill requirements of specific professionals based on web recruitment information is characterized in that: the system comprises a web access module, an information storage cloud library, an information retrieval platform and a data analysis platform;
the Web access module is used for acquiring the recruitment information released on the Web;
the information storage cloud library is used for collecting and storing various recruitment information released on the web and updating the recruitment information in real time;
the information retrieval platform is used for screening the specific recruitment information required to be retrieved by the user from the information storage cloud library;
and the data analysis platform is used for analyzing and classifying the information screened by the information retrieval platform and then displaying the information in a data mode.
2. The web recruitment information based specific professional talent skill requirement mining system of claim 1, wherein: the information storage cloud library comprises a data acquisition module, a data cleaning module, a data deleting module and a data storage module, wherein the data acquisition module is connected with the data cleaning module, the data cleaning module is connected with the data storage module, and the data deleting module is connected with the data storage module;
the data acquisition module is used for acquiring talent recruitment information of all aspects from the web in real time;
the data cleaning module is used for receiving the webpage recruitment information acquired by the data acquisition module and removing useless information in the webpage recruitment information;
the data storage module is used for storing the processed useful recruitment information;
and the data deleting module is used for deleting some outdated data and repeated data in the recruitment information, and the data time limit can be set.
3. The web recruitment information based specific professional talent skill requirement mining system according to claim 1 or 2, wherein: and the input end of the data acquisition module is connected with the output end of the web access module.
4. The web recruitment information based specific professional talent skill requirement mining system of claim 1, wherein: the information retrieval platform comprises a retrieval target determining module, a data extracting module, a primary data screening module, a secondary data screening module, a final data screening module and a data transmission module, wherein the data extracting module is connected with the primary data screening module;
the retrieval target determining module is used for inputting information of a target to be retrieved and a screening grade by a user;
the data extraction module is used for extracting recruitment information from the information storage cloud library;
the primary data screening module, the secondary data screening module and the final data screening module are used for receiving the recruitment information transmitted by the data extraction module and screening it level by level according to the retrieval conditions set in the retrieval target determining module;
and the data transmission module is used for transmitting the data obtained after the multistage screening to the data analysis platform.
5. The web recruitment information based specific professional talent skill requirement mining system of claim 4, wherein: the retrieval items included by the retrieval target determining module comprise specific professional names, keywords, retrieval time ranges, retrieval areas and the like.
6. The web recruitment information based specific professional talent skill requirement mining system of claim 1, wherein: the data analysis platform comprises a data statistics module, a data classification module and a data output module;
the data statistics module is used for performing statistics on all the retrieved data;
the data classification module is used for classifying the recruitment data after the retrieval is finished;
and the data output module is used for outputting and displaying the recruitment data after the analysis is finished.
7. The web recruitment information based specific professional talent skill requirement mining system of claim 6, wherein: the classification criteria of the data classification module include the size of the recruiting organization, the wage offered, the age range sought, and the like.
8. The web recruitment information based specific professional talent skill requirement mining system of claim 6, wherein: the output of the data output module is an electronic chart, specifically a pie chart, a data trend line chart, an Excel table and the like.

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202011307168.9A, published as CN112506986A (en) | 2020-11-19 | 2020-11-19 | Specific professional talent skill requirement mining system based on web recruitment information

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN202011307168.9A, published as CN112506986A (en) | 2020-11-19 | 2020-11-19 | Specific professional talent skill requirement mining system based on web recruitment information

Publications (1)

Publication Number | Publication Date
CN112506986A (en) | 2021-03-16

Family

ID=74958964

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN202011307168.9A (Pending), published as CN112506986A (en) | Specific professional talent skill requirement mining system based on web recruitment information | 2020-11-19 | 2020-11-19

Country Status (1)

Country | Link
CN (1) | CN112506986A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN104462431A (en) * | 2014-12-16 | 2015-03-25 | 浪潮软件集团有限公司 | Method for crawling web page recruitment information
CN107203872A (en) * | 2017-05-26 | 2017-09-26 | 山东省科学院情报研究所 | Region demand for talent based on big data quantifies analysis method
CN110443582A (en) * | 2019-08-06 | 2019-11-12 | 安徽赛福贝特信息技术有限公司 | A kind of human resource data processing system based on cloud computing platform
CN111414522A (en) * | 2020-02-18 | 2020-07-14 | 北京网聘咨询有限公司 | Recruitment information visualization analysis system based on web crawler

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
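Zhong Jing (钟静) et al.: "Text mining of professional skill requirements based on Web recruitment information" (基于Web 招聘信息的专业技能需求文本挖掘) *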

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210316