CN106096913A - A kind of resume mail analysis system and method based on cloud service - Google Patents

Publication number
CN106096913A
Authority
CN
China
Prior art keywords
resume
class
experience
cloud service
method based
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610412262.8A
Other languages
Chinese (zh)
Inventor
包谞斌
胡健
钱宏立
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiaxing Flying Knife Software Technology Co Ltd
Original Assignee
Jiaxing Flying Knife Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-06-14
Filing date
2016-06-14
Publication date
2016-11-09

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G06Q10/105 Human resources
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/137 Hierarchical processing, e.g. outlines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G06Q10/107 Computer-aided management of electronic mailing [e-mailing]

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Computer Hardware Design (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The present invention devises a resume email parsing system and method based on cloud service that can match free-format resumes while maintaining matching accuracy, and can match the segmented entries of work experience and education experience. The invention can extract all fields from resumes in various file formats, languages, and free layouts, and format them into resumes in a standard unified format. The supported file formats include doc, docx, pdf, txt, and html, and the supported languages are Chinese and English. For Chinese resumes, parsing and extraction accuracy exceeds 95%. The system can parse 200 to 300 resumes per minute and process 150,000 to 300,000 resumes per day, saving up to 99.9% of manual resume processing time and more than 85% of labor cost. The invention is deployed on the Internet and can provide resume parsing services to recruitment websites, software companies, headhunters and talent agencies, and the HR departments of enterprises.

Description

A kind of resume mail analysis system and method based on cloud service
Technical field
The present invention relates to the field of computer technology, and in particular to a resume email parsing system and method based on cloud service.
Background
With the development of Internet technology, recruiting by receiving application resumes via email has steadily gained favor among recruiters.
At present, a recruiter typically publishes a recruitment mailbox for receiving application resumes in the job notice it issues. Applicants can then send their resume emails directly to the mailbox published by the hiring organization. The recruiter uses a resume parsing system to parse the resume emails received in the recruitment mailbox and extract keywords and key content, which reduces workload and improves efficiency.
Many existing technical schemes use simple keyword matching. To extract the name from a resume, for example, the keyword "name" must be found immediately before the name itself, and the keyword "sex" must be found before the sex. This simple keyword matching can only match resumes in a specific format, and its matching accuracy on free-format resumes is low.
Many existing schemes also cannot match the segmented entries within work experience and education experience, so it is impossible to know what stage the candidate was at during a specific period.
Summary of the invention
To address the defects and technical problems of the prior art described above, the present invention devises a resume email parsing system and method based on cloud service that can match free-format resumes while maintaining matching accuracy, and can match the segmented entries of work experience and education experience.
The technical solution adopted by the present invention to solve this technical problem is a resume email parsing system and method based on cloud service, in which the system is an application system developed on a cloud service. Its implementation flow is as follows:
1. An enterprise HR user logs in to the system and uploads the received resume emails of all kinds to the system;
2. The system determines whether each resume's format is supported, and parses the resume files whose format meets the system's requirements;
3. The system converts the parsed resumes to a unified format and exports them.
Further, the file formats supported by the resume email parsing system are doc, docx, pdf, txt, and html, and the supported languages are Chinese and English.
Further, the resume email parsing system parses a resume according to the following method:
First, the language of the resume is extracted and a parser is initialized for that language. Step-by-step extraction is then applied: the classes in the resume are first extracted according to features such as keywords; segments are then extracted from the work experience class, the education experience class, and the project experience class; fields are then extracted from the classes and segments. If a key information class has not been extracted, the resume text is matched once more by backtracking.
Further, a class is a body of text sharing a common feature, such as the basic information class, work experience class, education experience class, or project experience class in a resume.
Further, a segment is the text within a class that is associated with one time period, such as the text for a particular period in the work experience, education experience, or project experience.
Further, a field is the smallest unit of resume text that expresses an actual meaning. Some fields, such as "male" or "female", consist of content only and have no title; others, such as "place of work: Shanghai", have both a title and content.
Further, for resume content that has no keywords, a cloud-side feature matching algorithm is used: content such as the name, company, industry, position, job function, school, major, and certificates in the resume is matched against a feature library deployed in the cloud, which greatly improves the accuracy and completeness of matching.
Further, a backtracking algorithm is used for resume content that contains time periods. A time period is a rather distinctive piece of text that generally appears in work experience, education experience, and project experience. If keyword matching fails to find the class information for work experience, education experience, or project experience, yet time periods appear in the text, the system looks for class features near the time period. For example, if something resembling a company name is found, the text can largely be determined to be work experience. The backtracking algorithm can then be started to extract the work experience again, after which the segment extraction and field extraction steps are performed once more.
Compared with the prior art, the resume email parsing system and method of the invention described above can extract all fields from resumes in various file formats, languages, and free layouts, and format them into resumes in a standard unified format. The supported file formats include doc, docx, pdf, txt, and html, and the supported languages are Chinese and English. For Chinese resumes, parsing and extraction accuracy exceeds 95%. The system can parse 200 to 300 resumes per minute and process 150,000 to 300,000 resumes per day, saving up to 99.9% of manual resume processing time and more than 85% of labor cost. The invention is deployed on the Internet and can provide resume parsing services to recruitment websites, software companies, headhunters and talent agencies, and the HR departments of enterprises.
Brief description of the drawings
Fig. 1 is a schematic flowchart of the overall concept of the present invention.
Fig. 2 is a flowchart of resume parsing in the present invention.
Detailed description
The specific embodiments of the present invention are described in detail below with reference to the accompanying drawings, to further illustrate the advantages of the invention and its outstanding contributions relative to the prior art. It should be understood that the following embodiments are only detailed descriptions of preferred embodiments of the invention and should not be construed as limiting its technical solution in any way.
As shown in Figs. 1 and 2, the present invention provides a resume email parsing system and method based on cloud service, in which the system is an application system developed on a cloud service. Its implementation flow is as follows:
1. An enterprise HR user logs in to the system and uploads the received resume emails of all kinds to the system;
2. The system determines whether each resume's format is supported, and parses the resume files whose format meets the system's requirements;
3. The system converts the parsed resumes to a unified format and exports them.
Further, the file formats supported by the resume email parsing system are doc, docx, pdf, txt, and html, and the supported languages are Chinese and English.
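Step 2 of the flow above (checking whether a resume's format is supported) could be sketched as follows. This is only an illustrative sketch: the set of formats comes from the text, while the names `SUPPORTED_FORMATS`, `is_supported`, and `split_uploads` are hypothetical, and deciding by file extension is an assumption, since the text does not say how the format is determined.

```python
from pathlib import Path

# Formats named in the text; the names used here are hypothetical.
SUPPORTED_FORMATS = {"doc", "docx", "pdf", "txt", "html"}


def is_supported(filename: str) -> bool:
    """Return True when the file extension is one of the supported formats."""
    suffix = Path(filename).suffix.lower().lstrip(".")
    return suffix in SUPPORTED_FORMATS


def split_uploads(filenames):
    """Separate uploaded files into those that can be parsed and those rejected."""
    accepted = [name for name in filenames if is_supported(name)]
    rejected = [name for name in filenames if not is_supported(name)]
    return accepted, rejected
```

Only the accepted files would go on to parsing; how rejected files are reported to the HR user is not described in the text.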
Further, the resume email parsing system parses a resume according to the following method:
First, the language of the resume is extracted and a parser is initialized for that language. Step-by-step extraction is then applied: the classes in the resume are first extracted according to features such as keywords; segments are then extracted from the work experience class, the education experience class, and the project experience class; fields are then extracted from the classes and segments. If a key information class has not been extracted, the resume text is matched once more by backtracking.
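The first two stages of this method (language detection, then keyword-based class extraction) might look like the sketch below. Everything in it is an assumption made for illustration: the text does not say how the language is detected, so a simple share-of-Chinese-characters heuristic is used, and the keyword lists in `SECTION_KEYWORDS` are invented examples rather than the system's actual keywords.

```python
import re

CJK = re.compile(r"[\u4e00-\u9fff]")

# Hypothetical per-language heading keywords used to initialise the parser.
SECTION_KEYWORDS = {
    "zh": {
        "basic": ["基本信息", "个人信息"],
        "work": ["工作经历", "工作经验"],
        "education": ["教育经历", "教育背景"],
        "project": ["项目经验", "项目经历"],
    },
    "en": {
        "basic": ["personal information", "basic information"],
        "work": ["work experience", "employment"],
        "education": ["education"],
        "project": ["project"],
    },
}


def detect_language(text: str) -> str:
    """Guess 'zh' or 'en' from the share of Chinese characters (a rough heuristic)."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return "en"
    cjk_count = sum(1 for ch in chars if CJK.match(ch))
    return "zh" if cjk_count / len(chars) > 0.3 else "en"


def extract_classes(text: str, language: str) -> dict:
    """Group lines into classes; a line starting with a heading keyword opens a class."""
    keywords = SECTION_KEYWORDS[language]
    classes = {}
    current = "basic"  # lines before any heading are treated as basic information
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        heading = next(
            (name for name, words in keywords.items()
             if any(stripped.lower().startswith(word) for word in words)),
            None,
        )
        if heading is not None:
            current = heading
            continue
        classes.setdefault(current, []).append(stripped)
    return classes
```

A resume without recognisable headings would end up entirely in the `basic` class here, which is the situation the feature matching and backtracking steps described later are meant to handle.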
Further, a class is a body of text sharing a common feature, such as the basic information class, work experience class, education experience class, or project experience class in a resume.
Further, a segment is the text within a class that is associated with one time period, such as the text for a particular period in the work experience, education experience, or project experience.
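Since a segment is defined by its time period, one plausible way to split a class into segments is to start a new segment at every line that contains a date range. The sketch below assumes this approach; the `TIME_PERIOD` pattern and the `split_segments` helper are hypothetical and cover only a few common date notations.

```python
import re

# Matches ranges such as "2010.09-2014.06", "2010-2014", or "2014.07 - present".
TIME_PERIOD = re.compile(
    r"(?P<start>\d{4}(?:[./-]\d{1,2})?)\s*(?:-|~|至|to)\s*"
    r"(?P<end>\d{4}(?:[./-]\d{1,2})?|present|now|至今)",
    re.IGNORECASE,
)


def split_segments(lines):
    """Start a new segment at each line containing a time period.

    Lines that appear before the first time period are ignored in this sketch.
    """
    segments = []
    for line in lines:
        match = TIME_PERIOD.search(line)
        if match:
            segments.append(
                {"start": match["start"], "end": match["end"], "lines": [line]}
            )
        elif segments:
            segments[-1]["lines"].append(line)
    return segments
```

Each resulting segment keeps its start and end so that later steps can tell what stage the candidate was at during a given period.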
Further, a field is the smallest unit of resume text that expresses an actual meaning. Some fields, such as "male" or "female", consist of content only and have no title; others, such as "place of work: Shanghai", have both a title and content.
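The two kinds of field can be told apart with a small helper. In this sketch, a titled field is recognised by a colon separating title and content, and a content-only field is recognised through a lookup table. Both the `TITLED_FIELD` pattern and the `CONTENT_ONLY` table are assumptions for illustration; the text does not specify how fields are recognised.

```python
import re

# A titled field: a short title, a colon (ASCII or full-width), then the content.
TITLED_FIELD = re.compile(r"^\s*(?P<title>[^:：]{1,20})[:：]\s*(?P<content>.+?)\s*$")

# Hypothetical lookup that names content-only fields by their value.
CONTENT_ONLY = {"male": "gender", "female": "gender", "男": "gender", "女": "gender"}


def extract_field(line: str):
    """Return a field record for a line, or None when no field is recognised."""
    match = TITLED_FIELD.match(line)
    if match:
        return {
            "name": match["title"].strip(),
            "content": match["content"],
            "titled": True,
        }
    key = line.strip().lower()
    if key in CONTENT_ONLY:
        return {"name": CONTENT_ONLY[key], "content": line.strip(), "titled": False}
    return None
```

For a content-only field the `name` is inferred from the value, which is why such fields depend on some form of feature knowledge rather than on keywords in the resume itself.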
Further, for resume content that has no keywords, a cloud-side feature matching algorithm is used: content such as the name, company, industry, position, job function, school, major, and certificates in the resume is matched against a feature library deployed in the cloud, which greatly improves the accuracy and completeness of matching.
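Feature matching of this kind can be illustrated with a dictionary lookup. The sketch below stands in for the cloud-deployed feature library with a small in-memory dictionary; the entries in `FEATURE_LIBRARY` are made-up examples, and plain substring matching is an assumption, since the text does not describe the matching algorithm itself.

```python
# Stand-in for the feature library deployed in the cloud (example entries only).
FEATURE_LIBRARY = {
    "company": {"acme ltd", "beta inc"},
    "school": {"example university"},
    "position": {"software engineer", "product manager"},
}


def match_features(text: str, library=FEATURE_LIBRARY) -> dict:
    """Find library entries that occur in the text, grouped by field type."""
    lowered = text.lower()
    found = {}
    for field, entries in library.items():
        hits = sorted(entry for entry in entries if entry in lowered)
        if hits:
            found[field] = hits
    return found
```

Because the lookup does not rely on a preceding keyword such as "company" or "school", it can label content in free-format resumes where simple keyword matching finds nothing.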
Further, a backtracking algorithm is used for resume content that contains time periods. A time period is a rather distinctive piece of text that generally appears in work experience, education experience, and project experience. If keyword matching fails to find the class information for work experience, education experience, or project experience, yet time periods appear in the text, the system looks for class features near the time period. For example, if something resembling a company name is found, the text can largely be determined to be work experience. The backtracking algorithm can then be started to extract the work experience again, after which the segment extraction and field extraction steps are performed once more.
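What the text calls backtracking can be read as a second pass over the resume that is triggered by time periods. The sketch below follows that reading: for each line with a time period, it inspects the neighbouring lines for hints of a class that the keyword pass missed. The hint patterns in `CLASS_HINTS`, the one-line `window`, and the function name are all hypothetical, and the real system would presumably consult the cloud feature library instead of fixed patterns.

```python
import re

TIME_PERIOD = re.compile(
    r"\d{4}(?:[./]\d{1,2})?\s*(?:-|~|至|to)\s*(?:\d{4}(?:[./]\d{1,2})?|present|至今)",
    re.IGNORECASE,
)

# Hypothetical class hints: company-like and school-like name fragments.
CLASS_HINTS = {
    "work": re.compile(r"公司|\b(?:ltd|inc|co)\b", re.IGNORECASE),
    "education": re.compile(r"大学|学院|\b(?:university|college)\b", re.IGNORECASE),
}


def backtrack_classes(lines, found_classes, window=1):
    """Recover classes missed by keyword matching by looking near time periods."""
    recovered = {}
    for index, line in enumerate(lines):
        if not TIME_PERIOD.search(line):
            continue
        # Text on the same line and within `window` lines on either side.
        nearby = " ".join(lines[max(0, index - window): index + window + 1])
        for name, hint in CLASS_HINTS.items():
            if name not in found_classes and hint.search(nearby):
                recovered.setdefault(name, []).append(line)
                break
    return recovered
```

The lines recovered for a class would then be passed through segment extraction and field extraction again, as the paragraph above describes.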
Compared with the prior art, the resume email parsing system and method of the invention described above can extract all fields from resumes in various file formats, languages, and free layouts, and format them into resumes in a standard unified format. The supported file formats include doc, docx, pdf, txt, and html, and the supported languages are Chinese and English. For Chinese resumes, parsing and extraction accuracy exceeds 95%. The system can parse 200 to 300 resumes per minute and process 150,000 to 300,000 resumes per day, saving up to 99.9% of manual resume processing time and more than 85% of labor cost. The invention is deployed on the Internet and can provide resume parsing services to recruitment websites, software companies, headhunters and talent agencies, and the HR departments of enterprises.

Claims (5)

1. A resume email parsing system and method based on cloud service, characterized in that: the system is an application system developed on a cloud service, and the system as a whole exports resumes in a unified format after they are parsed.
2. The resume email parsing system and method based on cloud service according to claim 1, characterized in that: the file formats supported by the resume email parsing system are doc, docx, pdf, txt, and html, and the supported languages are Chinese and English.
3. The resume email parsing system and method based on cloud service according to claim 1, characterized in that: the resume email parsing system parses a resume according to the following method: first, the language of the resume is extracted and a parser is initialized for that language; step-by-step extraction is then applied, in which the classes in the resume are first extracted according to features such as keywords, segments are then extracted from the work experience class, education experience class, and project experience class, and fields are then extracted from the classes and segments; if a key information class has not been extracted, the resume text is matched once more by backtracking.
4. The resume email parsing system and method based on cloud service according to claim 1, characterized in that: for resume content that has no keywords, a cloud-side feature matching algorithm is used, in which content such as the name, company, industry, position, job function, school, major, and certificates in the resume is matched against a feature library deployed in the cloud.
5. The resume email parsing system and method based on cloud service according to claim 1, characterized in that: a backtracking algorithm is used for resume content that contains time periods; a time period is a rather distinctive piece of text that generally appears in work experience, education experience, and project experience; if keyword matching fails to find the class information for work experience, education experience, or project experience, yet time periods appear in the text, class features are sought near the time period; for example, if something resembling a company name is found, the text can largely be determined to be work experience; the backtracking algorithm can then be started to extract the work experience again, after which the segment extraction and field extraction steps are performed once more.
CN201610412262.8A 2016-06-14 2016-06-14 A kind of resume mail analysis system and method based on cloud service Pending CN106096913A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610412262.8A CN106096913A (en) 2016-06-14 2016-06-14 A kind of resume mail analysis system and method based on cloud service

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610412262.8A CN106096913A (en) 2016-06-14 2016-06-14 A kind of resume mail analysis system and method based on cloud service

Publications (1)

Publication Number Publication Date
CN106096913A true CN106096913A (en) 2016-11-09

Family

ID=57846506

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610412262.8A Pending CN106096913A (en) 2016-06-14 2016-06-14 A kind of resume mail analysis system and method based on cloud service

Country Status (1)

Country Link
CN (1) CN106096913A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108874928A (en) * 2018-05-31 2018-11-23 平安科技(深圳)有限公司 Resume data information analyzing and processing method, device, equipment and storage medium
CN110020327A (en) * 2019-04-16 2019-07-16 上海大易云计算股份有限公司 A kind of resume resolution system based on vertical search engine
CN111241270A (en) * 2018-11-12 2020-06-05 马上消费金融股份有限公司 Resume processing method and device
CN111339776A (en) * 2020-02-17 2020-06-26 北京字节跳动网络技术有限公司 Resume parsing method and device, electronic equipment and computer-readable storage medium

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108874928A (en) * 2018-05-31 2018-11-23 平安科技(深圳)有限公司 Resume data information analyzing and processing method, device, equipment and storage medium
CN108874928B (en) * 2018-05-31 2024-02-02 平安科技(深圳)有限公司 Resume data information analysis processing method, device, equipment and storage medium
CN111241270A (en) * 2018-11-12 2020-06-05 马上消费金融股份有限公司 Resume processing method and device
CN110020327A (en) * 2019-04-16 2019-07-16 上海大易云计算股份有限公司 A kind of resume resolution system based on vertical search engine
CN111339776A (en) * 2020-02-17 2020-06-26 北京字节跳动网络技术有限公司 Resume parsing method and device, electronic equipment and computer-readable storage medium

Similar Documents

Publication Publication Date Title
CN106096913A (en) A kind of resume mail analysis system and method based on cloud service
CN107145584B (en) Resume parsing method based on n-gram model
US7664323B2 (en) Scalable hash-based character recognition
CN102722479A (en) A method and device for realizing language translation
CN102541948A (en) Method and device for extracting document structure
CN112580308A (en) Document comparison method and device, electronic equipment and readable storage medium
CN105787047A (en) Extraction, analysis and conversion method of resume information
CN106227808A (en) A kind of method removing mail interference information and method for judging rubbish mail
CN104820962B (en) A kind of printing watermark generation method instead of artificial signature
CN109271616B (en) Intelligent extraction method based on bibliographic characteristic value of standard literature
CN103678280A (en) Translation task fragmentization method
RU2580424C1 (en) Method of detecting insignificant lexical items in text messages and computer
Clausner et al. Efficient ocr training data generation with aletheia
CN112182141A (en) Key information extraction method, device, equipment and readable storage medium
US20210182677A1 (en) Identifying Portions of Electronic Communication Documents Using Machine Vision
Baker et al. Comparing approaches to mathematical document analysis from PDF
CN105573981A (en) Method and device for extracting Chinese names of people and places
CN103020037A (en) Official document standardized calibration system
CN110852359B (en) Family tree identification method and system based on deep learning
EP4167106A1 (en) Method and apparatus for data structuring of text
CN111858886B (en) Object and viewpoint extraction system for airport comments
KR101686114B1 (en) Method of automatic conversion to hanja by the koreansentence unit using an add-in program
CN114120304A (en) Entity identification method, device and computer program product
CN113806368A (en) System and method for identifying document and automatically establishing database
CN110096574B (en) Scheme for establishing and subsequently optimizing and expanding data set in E-commerce comment classification task

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20161109
