CN106096913A - A kind of resume mail analysis system and method based on cloud service - Google Patents
A kind of resume mail analysis system and method based on cloud service Download PDFInfo
- Publication number
- CN106096913A CN106096913A CN201610412262.8A CN201610412262A CN106096913A CN 106096913 A CN106096913 A CN 106096913A CN 201610412262 A CN201610412262 A CN 201610412262A CN 106096913 A CN106096913 A CN 106096913A
- Authority
- CN
- China
- Prior art keywords
- resume
- class
- experience
- cloud service
- method based
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/105—Human resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/137—Hierarchical processing, e.g. outlines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/107—Computer-aided management of electronic mailing [e-mailing]
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Entrepreneurship & Innovation (AREA)
- Data Mining & Analysis (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Business, Economics & Management (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Economics (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- Computer Hardware Design (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The present invention devises a kind of resume mail analysis system and method based on cloud service, can mate the resume of free-format, and mate work experience and the education experience of segmentation while ensureing coupling accuracy.The present invention can be the resume all fields of extraction of various different file format, different language, free typesetting, is formatted as the resume of the consolidation form of standard.The file format supported has doc, docx, pdf, txt and html etc., and the language of support has Chinese and English.The parsing of Chinese Resume is extracted accuracy more than 95%, per minute can resolve 200 300 parts of resumes, 15 30 ten thousand parts of resumes can be processed every day, save artificial resume and process the time and reach 99.9%, save human cost more than 85%.The present invention disposes on the internet, can be the analysis service of the HR department offer resume of each recruitment website, software company, headhunter/talent agency and enterprise.
Description
Technical field
The present invention relates to field of computer technology, particularly relate to a kind of resume mail resolution system based on cloud service and side
Method.
Background technology
Along with the development of Internet technology, obtain by the personnel recruitment mode of Email reception application resume is continuous
Favor to each recruitment person.
At present, recruitment person typically can announce the recruitment mailbox for receiving application resume in the job notice issued.This
Sample, the recruitment mailbox that applicant can announce according to advertising unit, it is sent directly to resume mail recruit in mailbox.Meanwhile,
The resume mail received in recruitment mailbox is resolved by recruitment person by resume resolution system, extracts in keyword and key
Hold.Thus alleviate operating pressure, improve efficiency.
Existing a lot of technical schemes simple keyword match method of employing, the name in resume to be extracted, one
Surely before the content of name, to find the keyword of " name ", before sex, to find " sex " keyword, use this simply
The method of keyword match, can only mate the resume of specific format, low to the resume matching accuracy rate of free-format.
And existing a lot of scheme can not mate the segmentation experience in work experience and education experience, causes knowing
What stage this talent is within the specific time.
Summary of the invention
For defect and the technical problem of existence of above-mentioned prior art, the present invention devises a kind of letter based on cloud service
Go through mail analysis system and method, the resume of free-format, and coupling point can be mated while ensureing coupling accuracy
The work experience of section and education experience.
The technical solution adopted for the present invention to solve the technical problems is: a kind of resume mail based on cloud service resolves system
System and method, system refers to a kind of application system based on cloud service exploitation, and its implementing procedure is as follows:
All kinds of resume mails received are uploaded in system by 1, enterprise HR login system;
2, system judges whether resume form is supported, is resolved by the resume file that format character assembly system requires;
3, derive after the resume consolidation form after system will resolve;
Further, the file format of described resume mail resolution system support has doc, docx, pdf, txt and html,
The language supported includes Chinese and English;
Further, resume is resolved by described resume mail resolution system according to following method:
First extract resume language, initialize resolver according to different language, then use the method that branch extracts, elder generation according to
Class in the feature extraction resumes such as keyword, then carries out extracting to work experience class, education experience class and project experiences class and divides
Section, then to class and stage extraction field, if not extracting key message class again resume text is once recalled coupling.
Further, described class is: have the text of a certain common trait, such as the essential information class in resume, work
Make experience class, education experience class, project experiences class etc..
Further, described in be segmented into: refer to the text that apoplexy due to endogenous wind time phase associates, such as work experience, education experience,
The text of certain time period in project experiences.
Further, described field: refer to express in resume text the least unit of physical meaning, such as " male ", " female "
This is only content, does not has headed field, and also one is the most substantial word of " place of working: Shanghai " this existing title
Section.
Further, for there is no the resume content of keyword, use the algorithm of the characteristic matching in high in the clouds, the surname to resume
The contents such as name, company, industry, position, function, school, specialty, certificate and the feature database being deployed on cloud mate, greatly
The accuracy that improve coupling and integrity degree.
Further, using backtracking algorithm for comprising the content of time period in resume content, the time period is one and compares
Special text message, is generally present in work experience, education experience and project experiences, if according to keyword match
Time do not match work experience, education experience and the category information of project experiences, but text occurs in that again the time period, at this moment
It is accomplished by near the time period finding the characteristic information of class, such as have found so-and-so company similar, then can substantially determine this
It is work experience, the most just can start backtracking algorithm, again go to extract work experience, perform one time the most again and extract segmentation,
Extract the process of field.
Resume mail analysis system and method based on foregoing invention, compared with prior art, the present invention can be various
Different file formats, different language, the resume of free typesetting extract all fields, are formatted as the letter of the consolidation form of standard
Go through.The file format supported has doc, docx, pdf, txt and html etc., and the language of support has Chinese and English.To Chinese Resume
Parsing extract accuracy more than 95%, per minute can resolve 200 300 parts of resumes, 15 30 ten thousand can be processed every day
Part resume, saves artificial resume and processes the time and reach 99.9%, save human cost more than 85%.The present invention is deployed in the Internet
On, can be the solution of the HR department offer resume of each recruitment website, software company, headhunter/talent agency and enterprise
Analysis service.
Accompanying drawing explanation
Fig. 1 is the general thought schematic flow sheet of the present invention.
Fig. 2 is the resume process of analysis figure of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawings the detailed description of the invention of the present invention is elaborated, be further elucidated with advantages of the present invention and
Outstanding contributions relative to prior art, it is possible to understand that, following embodiment is only detailed to preferred embodiment of the present invention
Describe in detail bright, should not be construed as any restriction to technical solution of the present invention.
As shown in Figure 1-2, the present invention provides a kind of resume mail analysis system and method based on cloud service, and system refers to
A kind of application system based on cloud service exploitation, its implementing procedure is as follows:
All kinds of resume mails received are uploaded in system by 1, enterprise HR login system;
2, system judges whether resume form is supported, is resolved by the resume file that format character assembly system requires;
3, derive after the resume consolidation form after system will resolve;
Further, the file format of described resume mail resolution system support has doc, docx, pdf, txt and html,
The language supported includes Chinese and English;
Further, resume is resolved by described resume mail resolution system according to following method:
First extract resume language, initialize resolver according to different language, then use the method that branch extracts, elder generation according to
Class in the feature extraction resumes such as keyword, then carries out extracting to work experience class, education experience class and project experiences class and divides
Section, then to class and stage extraction field, if not extracting key message class again resume text is once recalled coupling.
Further, described class is: have the text of a certain common trait, such as the essential information class in resume, work
Make experience class, education experience class, project experiences class etc..
Further, described in be segmented into: refer to the text that apoplexy due to endogenous wind time phase associates, such as work experience, education experience,
The text of certain time period in project experiences.
Further, described field: refer to express in resume text the least unit of physical meaning, such as " male ", " female "
This is only content, does not has headed field, and also one is the most substantial word of " place of working: Shanghai " this existing title
Section.
Further, for there is no the resume content of keyword, use the algorithm of the characteristic matching in high in the clouds, the surname to resume
The contents such as name, company, industry, position, function, school, specialty, certificate and the feature database being deployed on cloud mate, greatly
The accuracy that improve coupling and integrity degree.
Further, using backtracking algorithm for comprising the content of time period in resume content, the time period is one and compares
Special text message, is generally present in work experience, education experience and project experiences, if according to keyword match
Time do not match work experience, education experience and the category information of project experiences, but text occurs in that again the time period, at this moment
It is accomplished by near the time period finding the characteristic information of class, such as have found so-and-so company similar, then can substantially determine this
It is work experience, the most just can start backtracking algorithm, again go to extract work experience, perform one time the most again and extract segmentation,
Extract the process of field.
Resume mail analysis system and method based on foregoing invention, compared with prior art, the present invention can be various
Different file formats, different language, the resume of free typesetting extract all fields, are formatted as the letter of the consolidation form of standard
Go through.The file format supported has doc, docx, pdf, txt and html etc., and the language of support has Chinese and English.To Chinese Resume
Parsing extract accuracy more than 95%, per minute can resolve 200 300 parts of resumes, 15 30 ten thousand can be processed every day
Part resume, saves artificial resume and processes the time and reach 99.9%, save human cost more than 85%.The present invention is deployed in the Internet
On, can be the solution of the HR department offer resume of each recruitment website, software company, headhunter/talent agency and enterprise
Analysis service.
Claims (5)
1. a resume mail analysis system and method based on cloud service, it is characterised in that: described system refers to take based on cloud
A kind of application system of business exploitation, makes whole system realize deriving with consolidation form after resume resolves.
A kind of resume mail analysis system and method based on cloud service the most according to claim 1, it is characterised in that: institute
The file format stating resume mail resolution system support has doc, docx, pdf, txt and html, the language of support include Chinese and
English.
A kind of resume mail analysis system and method based on cloud service the most according to claim 1, it is characterised in that: institute
State resume mail resolution system according to following method, resume to be resolved: first extract resume language, at the beginning of different language
Begin to dissolve parser, then use the method that branch extracts, first according to the class in the feature extraction resumes such as keyword, then to the warp that works
Go through class, education experience class and project experiences class to carry out extracting segmentation, then to class and stage extraction field, if the key of not extracting
Resume text is once recalled coupling by info class again.
A kind of resume mail analysis system and method based on cloud service the most according to claim 1, it is characterised in that: right
In there is no the resume content of keyword, use the algorithm of the characteristic matching in high in the clouds, to the name of resume, company, industry, position,
The contents such as function, school, specialty, certificate and the feature database being deployed on cloud mate.
A kind of resume mail analysis system and method based on cloud service the most according to claim 1, it is characterised in that: right
The content comprising the time period in resume content uses backtracking algorithm, and the time period is the text message that a comparison is special, typically
Occur in work experience, education experience and project experiences, if not matching work warp according to keyword match when
Go through, educate experience and the category information of project experiences, but text occurs in that again the time period, be at this moment accomplished by near the time period
Find the characteristic information of class, such as have found so-and-so company similar, then can substantially determine that this is work experience, the most just may be used
To start backtracking algorithm, again go to extract work experience, perform one time the most again and extract segmentation, extract the process of field.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610412262.8A CN106096913A (en) | 2016-06-14 | 2016-06-14 | A kind of resume mail analysis system and method based on cloud service |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610412262.8A CN106096913A (en) | 2016-06-14 | 2016-06-14 | A kind of resume mail analysis system and method based on cloud service |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106096913A true CN106096913A (en) | 2016-11-09 |
Family
ID=57846506
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610412262.8A Pending CN106096913A (en) | 2016-06-14 | 2016-06-14 | A kind of resume mail analysis system and method based on cloud service |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106096913A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108874928A (en) * | 2018-05-31 | 2018-11-23 | 平安科技(深圳)有限公司 | Resume data information analyzing and processing method, device, equipment and storage medium |
CN110020327A (en) * | 2019-04-16 | 2019-07-16 | 上海大易云计算股份有限公司 | A kind of resume resolution system based on vertical search engine |
CN111241270A (en) * | 2018-11-12 | 2020-06-05 | 马上消费金融股份有限公司 | Resume processing method and device |
CN111339776A (en) * | 2020-02-17 | 2020-06-26 | 北京字节跳动网络技术有限公司 | Resume parsing method and device, electronic equipment and computer-readable storage medium |
-
2016
- 2016-06-14 CN CN201610412262.8A patent/CN106096913A/en active Pending
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108874928A (en) * | 2018-05-31 | 2018-11-23 | 平安科技(深圳)有限公司 | Resume data information analyzing and processing method, device, equipment and storage medium |
CN108874928B (en) * | 2018-05-31 | 2024-02-02 | 平安科技(深圳)有限公司 | Resume data information analysis processing method, device, equipment and storage medium |
CN111241270A (en) * | 2018-11-12 | 2020-06-05 | 马上消费金融股份有限公司 | Resume processing method and device |
CN110020327A (en) * | 2019-04-16 | 2019-07-16 | 上海大易云计算股份有限公司 | A kind of resume resolution system based on vertical search engine |
CN111339776A (en) * | 2020-02-17 | 2020-06-26 | 北京字节跳动网络技术有限公司 | Resume parsing method and device, electronic equipment and computer-readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106096913A (en) | A kind of resume mail analysis system and method based on cloud service | |
CN107145584B (en) | Resume parsing method based on n-gram model | |
US7664323B2 (en) | Scalable hash-based character recognition | |
CN102722479A (en) | A method and device for realizing language translation | |
CN102541948A (en) | Method and device for extracting document structure | |
CN112580308A (en) | Document comparison method and device, electronic equipment and readable storage medium | |
CN105787047A (en) | Extraction, analysis and conversion method of resume information | |
CN106227808A (en) | A kind of method removing mail interference information and method for judging rubbish mail | |
CN104820962B (en) | A kind of printing watermark generation method instead of artificial signature | |
CN109271616B (en) | Intelligent extraction method based on bibliographic characteristic value of standard literature | |
CN103678280A (en) | Translation task fragmentization method | |
RU2580424C1 (en) | Method of detecting insignificant lexical items in text messages and computer | |
Clausner et al. | Efficient ocr training data generation with aletheia | |
CN112182141A (en) | Key information extraction method, device, equipment and readable storage medium | |
US20210182677A1 (en) | Identifying Portions of Electronic Communication Documents Using Machine Vision | |
Baker et al. | Comparing approaches to mathematical document analysis from PDF | |
CN105573981A (en) | Method and device for extracting Chinese names of people and places | |
CN103020037A (en) | Official document standardized calibration system | |
CN110852359B (en) | Family tree identification method and system based on deep learning | |
EP4167106A1 (en) | Method and apparatus for data structuring of text | |
CN111858886B (en) | Object and viewpoint extraction system for airport comments | |
KR101686114B1 (en) | Method of automatic conversion to hanja by the koreansentence unit using an add-in program | |
CN114120304A (en) | Entity identification method, device and computer program product | |
CN113806368A (en) | System and method for identifying document and automatically establishing database | |
CN110096574B (en) | Scheme for establishing and subsequently optimizing and expanding data set in E-commerce comment classification task |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20161109 |
|
WD01 | Invention patent application deemed withdrawn after publication |