CN107004167A - Recruit through open public examination standardization and data de-duplication - Google Patents

Recruit through open public examination standardization and data de-duplication Download PDF

Info

Publication number
CN107004167A
CN107004167A CN201580064463.7A CN201580064463A CN107004167A CN 107004167 A CN107004167 A CN 107004167A CN 201580064463 A CN201580064463 A CN 201580064463A CN 107004167 A CN107004167 A CN 107004167A
Authority
CN
China
Prior art keywords
open public
public examination
standardization
title
recruited
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201580064463.7A
Other languages
Chinese (zh)
Other versions
CN107004167B (en
Inventor
D.哈德克
G.B.马丁
J.博林格尔
L.M.沃尔
J.维姆布纳拉延
S.卡马特
P.戈文达拉简
A.D.迪尔
O.索尔森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
邻客音公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US14/502,261 external-priority patent/US10043157B2/en
Priority claimed from US14/502,224 external-priority patent/US20160092838A1/en
Application filed by 邻客音公司 filed Critical 邻客音公司
Publication of CN107004167A publication Critical patent/CN107004167A/en
Application granted granted Critical
Publication of CN107004167B publication Critical patent/CN107004167B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/105Human resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Abstract

Describe for from third party system obtain it is non-paid recruit through open public examination be standardized and data de-duplication technology.Non-paid recruit through open public examination is obtained from third party system by social networking service.The non-paid position recruited through open public examination and description, which are standardized and be combined into, standardizes non-paid recruit through open public examination.Data de-duplication process is performed to prevent non-paid recruit through open public examination of the standardization from substituting the paying in the social networking service and recruit through open public examination, and prevents the standardization is non-paid from recruiting through open public examination substitute in the social networking service more authoritative and non-paid recruit through open public examination.

Description

Recruit through open public examination standardization and data de-duplication
Priority request
This PCT application case require September in 2014 submit within 30th it is entitled " recruit through open public examination standardization and data de-duplication (JOB POSTING STANDARDIZATION AND DEDUPLICATION)" No. 14/502,224 U.S. Patent application The benefit of priority of case, and require that September in 2014 submits on the 30th entitled " recruit through open public examination standardization and repeated data Delete(JOB POSTING STANDARDIZATION AND DEDUPLICATION)" No. 14/502,261 United States Patent (USP) The benefit of priority of application case, two application cases are all incorporated herein by reference.
Technical field
The present invention relates generally to the data handling system recruited through open public examination for trustship, and in certain embodiments, relates to And for be present in recruiting through open public examination on different third party systems be standardized and data de-duplication technology.
Background technology
In typically work trusteeship service, company representative, which will recruit through open public examination, is published to work trusteeship service so that work The user of trusteeship service may search for, browse and in some cases, apply for the work associated with specifically recruiting through open public examination.Make For that the exchange recruited through open public examination can be presented to the user of work trusteeship service, some expenses will generally be paid by issuing the company recruited through open public examination With.
Brief description of the drawings
Illustrated some embodiments without limitation by means of example in each figure of accompanying drawing.
Fig. 1 is the network for illustrating the network environment suitable for social networking service according to some example embodiments.
Fig. 2 is the block diagram for the component for illustrating the social networking system according to some example embodiments.
Fig. 3 A are to illustrate performed for standardizing the disclosure obtained from third party system according to some example embodiments The flow chart of the operation for trapping module and the woprk standardization module of being worked during the method for recruitment.
Fig. 3 B are to illustrate performed for standardizing the disclosure obtained from third party system according to some example embodiments The flow chart of the optional operation of woprk standardization module during the method for recruitment.
Fig. 4 A are to illustrate performed for recruiting through open public examination for obtaining from third party system according to some example embodiments Carry out the data de-duplication module that worked during the method for data de-duplication, and optionally work trapping module and/or work The flow chart of the operation of standardized module.
Fig. 4 B are to illustrate performed for recruiting through open public examination for obtaining from third party system according to some example embodiments Carry out the flow chart of the optional operation of work data de-duplication module during the method for data de-duplication.
Fig. 5 is to illustrate that the block diagram of the example of the machine of one or more embodiments can be implemented thereon.
Embodiment
The method, system and computer program product of work trusteeship service, the work is provided separately in present invention description Trusteeship service is recruited through open public examination to paying with non-paid(Sometimes referred to as job posting)The service of varying level is provided.In detailed below In description, for illustrative purposes, illustrate numerous specific details to provide to disclosed theme various aspects It is thorough to understand.It will be apparent, however, to one skilled in the art that the present invention can be put into practice in the case of these no specific details Disclosed theme.In other cases, well-known method, program and component is not yet described in detail, in order to avoid obscure this hair Bright disclosed theme.
According to some embodiments, work trusteeship service(For example, associated with social networking system)Trustship is paid and unpaid Expense recruits through open public examination both.For example, by the module of recruiting through open public examination for the trusteeship service that works, the user of work trusteeship service can provide On specific job vacancy information and generate paying and recruit through open public examination.Recruit through open public examination generally by the company of job vacancy can be obtained Or the title of tissue, the position title of job vacancy, the describing of job function, required or suggestion technical ability, education degree and card Book and/or speciality etc. are constituted.As the exchange for paying some expenses, paying, which is recruited through open public examination, will be eligible to be presented to user(For example, The personnel for the social networking system that work trusteeship service is integrated).
In certain embodiments, work trusteeship service can pay to recruit through open public examination and be recruited through open public examination with non-paid with trustship.One In the case of a little, paying is recruited through open public examination and can be directly listed in work trusteeship service, and non-paid recruit through open public examination can be from third party System is received.However, the data format recruited through open public examination received from third party system may be public for it with work trusteeship service Open the data format mismatch that recruitment is used.In addition, can represent to be held in the palm by work from recruiting through open public examination for third party system reception What pipe service was listed recruits through open public examination.
In addition to paying and recruiting through open public examination, work trusteeship service can be from third party's recruitment website of different hosted outsides Intake is recruited through open public examination.In certain embodiments, automatic computing engine program(For example, " bot " or " spider ")Automatically phase " is captured " Close internet site and find recruiting through open public examination for intake.In certain embodiments, cooperate from by one or more third parties Recruited through open public examination in the data feeding that partner keeps.Work trusteeship service storage paying is recruited through open public examination recruits through open public examination with non-paid Both make another entity on behalf it store paying and recruits through open public examination and recruit through open public examination both with non-paid, i.e. by recruiting through open public examination Module generates and has paid recruiting through open public examination and being obtained from third party website and not yet to society for expense to social networking system Network system is handed over to pay recruiting through open public examination for expense.
In certain embodiments, it is non-paid recruit through open public examination it is only qualified by job search interface to social networking service Personnel are presented.Therefore, it is non-paid or it is free recruit through open public examination will generally be only presented to be properly termed as " positive job hunting candidate " or The social networking service personnel of " positive job hunter ".These positive job hunters are typically actively to participate in finding new employment machine The personnel of meeting.Paying recruit through open public examination it is also qualified the personnel of social networking service are presented to by searching interface, but also pass through Various different other channels are presented to these personnel.For example, work recommended engine can match person profiles with recruiting through open public examination, Target is to be recruited through open public examination correlation based on the profile data of personnel(I.e., it may be possible to which personnel are of interest to be recruited through open public examination)It is presented to The personnel of social networking service.
In certain embodiments, the data format recruited through open public examination received from third party system may be with social networking system Work trusteeship service the data format that uses recruited through open public examination for it mismatch.In such embodiment, work trusteeship service Standardize from recruiting through open public examination that third party system is received so that recruit through open public examination and be desirably integrated into work trusteeship service.
In certain embodiments, the expression of recruiting through open public examination received from third party system has been integrated into work trusteeship service Recruit through open public examination.In such embodiment, work trusteeship service, which is performed, recruits through open public examination data de-duplication, and if it is determined that new Recruit through open public examination and be better than(For example, more authoritative)Integrated recruits through open public examination, then recruited through open public examination with new instead of integrated Recruit through open public examination.
The different operating of case method described herein can be at least partly performed by one or more processors, it is described Processor is temporarily configured to(For example, passing through software instruction)Or be permanently configured to perform associative operation.It is no matter temporary Configure to when property and still permanently configure, such processor may be constructed the place for performing one or more operations or function Manage module or object that device is implemented.In some example embodiments, it is real that the module and object being mentioned above can include processor The module and/or object applied.
Similarly, approach described herein can at least partly be that processor is implemented.For example, in the operation of method At least some modules that can be implemented by one or more processors or processor are performed.The execution of some operations can be distributed in Among one or more processors, do not only reside in individual machine or computer, and cross over multiple machines or computer portion Administration.In some example embodiments, one or more processors can be located in single position(For example, in home environment, office In room environmental, at server zone etc.), and in other embodiments, processor can cross over multiple position distributions.
One or more processors can be also used for supporting " in cloud computing environment or software is serviced(“SaaS”)It is upper The hereafter performance of interior associative operation.For example, at least some operations in operation can be by computer(For example, including processing The machine of device)Group performs, and these operations can pass through network(For example, internet)And pass through one or more interface suitables (For example, application programming interfaces(API))Access.
Fig. 1 is the network for illustrating the network environment 100 suitable for social networking service according to some example embodiments. Network environment 100 includes server machine 110, database 115 and the device 150 for user 152, all to pass through network 190 are communicably coupled to each other.Server machine 110 can form all or part of of network system 105 (For example, the server system based on cloud, which is configured to provide one or more services, arrives device 130 and 150).Database 115 Recruiting through open public examination for social networking service can be stored.Server machine 110, first device 130 and second device 150 can be with Each it is implemented on completely or partially in computer system, following article is relative to described by Fig. 5.
User 152 is also illustrated in Fig. 1.User 152 can be human user(For example, the mankind), machine customer(For example, logical Cross computer of the software program configuration to be interacted with device 150)Or its any suitable combination(For example, the people aided in by machine Class or the machine by human supervision).User 152 is not a part for network environment 100, but associated with device 150.One In a little embodiments, device 150 is desktop computer, vehicle computer, tablet personal computer, guider, attachment device for displaying audio, intelligence Energy phone or the wearable device operated by user 152(For example, intelligent watch or intelligent glasses).
Any one in machine, database or device shown in Fig. 1 can be implemented on by software(For example, one or many Individual software module)Modification(For example, configuration or programming)In the all-purpose computer of special-purpose computer, to be used for the machine to perform One or more of functionality described herein of device, database or device.Can be real for example, being discussed below with respect to Fig. 5 Apply the computer system of any one or more in method described herein.As used herein, " database " is data Storage resource and can by text of storage construct, form, electrical form, relevant database(For example, Object-Relationship Database), triple warehouse, individual-layer data storage device or its any data suitably combined.In addition, illustrated in fig. 1 Machine, database or device in any two or more can be combined into individual machine, and for any individual machine, The function described herein of database or device can be segmented in multiple machines, database or device.
Network 190 can realize machine, database and device(For example, server machine 110 and device 130)Between or Among communication any network.Therefore, network 190 can be cable network, wireless network(For example, mobile or cellular network) Or its any suitable combination.Network 190 can include composition dedicated network, common network(For example, internet)Or its is any One or more parts of appropriate combination.Therefore, network 190 can include and incorporate LAN(LAN), wide area network(WAN), because Special net, mobile telephone network(For example, cellular network), wired telephone network(For example, plain old telephone system(POTS)Net Network), radio data network(For example, WiFi or WiMax networks)Or one or more parts of its any appropriate combination.Net Any one or more parts of network 190 can transmit information by transmission media.As used herein, " transmission media " refers to It can transmit(For example, transmission)Instruction is for machine(For example, for the one or more processors of this machine)Any nothing performed Shape(For example, temporary)Media, and comprising numeral or analog communication signal or other invisible media to promote the logical of this software Letter.
Fig. 2 is the block diagram for the component for illustrating the social networking system 210 according to some example embodiments.Social networking system 210 be Fig. 1 example based on network system 105.In certain embodiments, social networking system 210 captures mould comprising work Block 202, application server module 204, woprk standardization module 206 and work data de-duplication module 208, Suo Youmo Block is all arranged to communicate with one another(For example, passing through cross tie part, bus, shared memory, switch etc.).
Illustrate although Fig. 2 will recruit through open public examination database 220 for single database, recruiting through open public examination database 220 can be with Comprising multiple databases, the database can be located in a position or multiple positions.Similarly, although Fig. 2 is recruited open It is, different from social networking system 210, but in certain embodiments, to recruit through open public examination database 220 and be incorporated to engage the explanation of database 220 In social networking system 210.
In certain embodiments, work trapping module 202 is captured from third party system 170, receives or otherwise obtained Take and recruit through open public examination.As described in Fig. 3 A and 3B, in certain embodiments, database is recruited through open public examination that will recruit through open public examination to be integrated into Before in 220, the standardization of woprk standardization module 206 is recruited through open public examination.As described by figures 4 a and 4b, if this it is integrated will not Produce substitute it is excellent recruit through open public examination poor recruit through open public examination, then work data de-duplication module 208 will recruit through open public examination integrated To recruiting through open public examination in database 220.
In some cases, work trapping module 202, woprk standardization module 206 and/or work data de-duplication mould Block 208 is configured to off line and/or periodically processing data.For example, work trapping module 202 can include server, institute Server is stated periodically to recruit through open public examination from the acquisition of interested third party internet website.Third party is recruited through open public examination and is standardized Can be computation-intensive with data de-duplication;It therefore, it can off line completion woprk standardization and/or repeated data deleted Remove.
It will such as be further described relative to Fig. 3 A to 3B, work trapping module 202 combines woprk standardization module 206 can be with Non-paid recruit through open public examination is obtained and standardizes to recruit through open public examination in database 220 to be integrated into.
Hardware can be used(For example, the one or more processors of machine)Or the combination of hardware and software come implement herein Described in module in any one or more.For example, any module described herein can be with configuration processor(For example, Among the one or more processors of machine), to perform the operation described herein for the module.In addition, these moulds Any two in block or more can be combined into individual module, and can for the functions described herein of individual module To be segmented among multiple modules.In addition, according to different instances embodiment, here depicted as individual machine, database or The module of embodiment can be across multiple machines, database or device distribution in device.
In certain embodiments, recruit through open public examination database 220 and contain the one group of predefined duty recognized by work trusteeship service Position title.For example, described group of predefined position title can include such as " customer manager ", " system engineer ", " sale warp The position title of reason " etc..In certain embodiments, recruit through open public examination database 220 and contain one group recognized by work trusteeship service Predefined seniority level.For example, described group of predefined seniority level can comprising such as " trainee ", " primary ", The qualifications and record of service level of " middle rank ", " senior ", " management ", " manager " etc..
Fig. 3 A are to illustrate performed for standardizing the disclosure obtained from third party system according to some example embodiments The flow chart of the operation for trapping module 202 and the woprk standardization module 206 of being worked during the method 300 of recruitment.It can use above The operation in method 300 is performed by network system 105 relative to Fig. 2 modules described.As shown in fig. 3, method 300 include operation 302,304,306,308 and 310.
By obtaining and standardizing recruiting through open public examination from third party system, except social networking system is paid to its user Present recruit through open public examination outside, the work trusteeship service of social networking system 210 can also present to its user and come from other works What work was originated recruits through open public examination.
At operation 302, first instance(For example, the work trusteeship service of social networking system 210)Obtain(For example, making With work trapping module 202)Represent the data recruited through open public examination on third party system 170.Recruit through open public examination comprising position title and Job description.In certain embodiments, recruit through open public examination also comprising at least one in following item:Employing unit's title, recruitment row Industry, the geographical position of work and required technical ability.
At operation 304, the position title recruited through open public examination is standardized(For example, using woprk standardization module 206)With With the predefined position title recognized by first instance.In certain embodiments, one of method 350 illustrated in Fig. 3 B or Multiple operations 352 to 362 perform the part for position title standardisation process.
At operation 306, standardize job description to meet the data format recognized by first instance.In some embodiments In, standardization job description is included performs spell check/correction and/or syntax check/correction to job description.
At operation 308, standardization position title and standardization job description are combined to during standardization recruits through open public examination.One In a little embodiments, the extraneous information of such as metadata is also in standardization is recruited through open public examination.
At operation 310, standardization, which is recruited through open public examination, is integrated into first instance(For example, social networking system 210)Recruitment System(For example, work trusteeship service)In.In certain embodiments, standardization recruit through open public examination it is integrated before, to standardization Recruit through open public examination execution work data de-duplication process(For example, Fig. 4 A method 400).
Fig. 3 B are to illustrate performed for standardizing the disclosure obtained from third party system according to some example embodiments The flow chart of the optional operation of woprk standardization module 206 during the method 350 of recruitment.It can use above in relation to Fig. 2 descriptions Module operation in method 350 is performed by network system 105.As shown in Figure 3 B, method 350 comprising operation 352, 354th, 356,358,360,362,364 and 366.
At operation 352, the undesirable character occurred in position title is removed.For example, in certain embodiments, Fullstop is undesirable in position title.If the position title in recruiting through open public examination be " S.E.in San Francisco, C.A. ", then " position title after SE in San Francisco, CA " modification will be produced by removing fullstop.In some embodiments In, undesirable character is removed by the regular expression applied to position title.
At operation 354, geographical position determines to remove in position title and from position title.If for example, input The position title of this operation is " SE in San Francisco, CA ", then the position title of output will be " SE ".
At operation 356, when representing abbreviation, the word or phrase recognized with first instance substitutes the contracting in position title Write.If for example, the position title for inputting this operation is " SE ", then position title will be " Systems Engineer ".
In certain embodiments, eliminated and abridged using the context in the context and/or job description in position title Ambiguity.In certain embodiments, the ambiguity of abbreviation is eliminated by reference to the word repeatedly occurred in job description.For example, Abbreviation " SE " can represent such as " system engineer ", " sales engineer ", " sports editor ", " cleaner ", " structure work Cheng Shi ", " senior engineer " etc. predefined position title.In certain embodiments, if occurred in job description and pre- Define the potential matching of position title, then this can increase the probability correctly matched during this potential matching.
At operation 358, the word of position title is divided into the list of word.If for example, inputting the position name of this operation Title is " system engineer ", then the output of this operation will be word " system " and the list of " engineer ".
At operation 360, produce the word in word list is possible to arrangement.If for example, the list of word is " system " and " engineer ", then it will be " system engineer " and " engineer's system " that may arrange.
At operation 362, the arrangement of word is selected as at least one predefined position name with being recognized by first instance Claim the standardization position title of most tight fit.For example, if possible arrangement is " system engineer " and " engineer's system ", that " system engineer " will be selected as standardization position title.
At operation 364, it is determined that being numbered corresponding to the position title of standardization position title.If for example, standardization duty Position title is " system engineer ", then the corresponding position title numbering in specific works trusteeship service can be 525.
In addition, at operation 364, it is determined that the seniority level numbered corresponding to position title.For example, corresponding to " being The seniority level of the position title numbering 525 of system engineer " can be " middle rank ".
At operation 366, position title numbering and seniority level are in standardization is recruited through open public examination.In some realities Apply in example, before standardization is recruited through open public examination is integrated into and recruits through open public examination in database 220, position title numbering and seniority Level is in standardization is recruited through open public examination.
Fig. 4 A are to illustrate performed for recruiting through open public examination for obtaining from third party system according to some example embodiments Carry out work data de-duplication module 208 during the method 400 of data de-duplication, and the trapping module 202 that optionally works And/or the flow chart of the operation of woprk standardization module 206.The module above in relation to Fig. 2 descriptions can be used by based on net The system 105 of network performs the operation in method 400.As shown in Figure 4 A, method 400 includes operation 402,404,406,408,410 With 412.
By to recruiting through open public examination carry out data de-duplication, the work of social networking system 210 from third party system Trusteeship service can be organized to recruit through open public examination to the repetition that identical work is presented in its user.
At operation 402, optionally, first instance(For example, the work trusteeship service of social networking system 210)Obtain (For example, using work trapping module 202)Represent the data recruited through open public examination on third party system 170.In certain embodiments, Recruit through open public examination comprising at least one in following item:Position title, job description, employing unit's title, recruitment industry, work Geographical position and required technical ability.In certain embodiments, the operation 402 of method 400 is substantially similar to the operation of method 300 302。
At operation 404, optionally, the position title recruited through open public examination is standardized(For example, using woprk standardization module 206)To match the predefined position title recognized by first instance.In certain embodiments, method 350 illustrated in Fig. 3 B One or more operations 352 to 362 perform for position title standardisation process a part.
At operation 406, the first source value is assigned to standardization and recruited through open public examination.In certain embodiments, at least partly by The Source Type of third party system determines the first source value.For example, in certain embodiments, recognizing three third party's Source Types:Work The website of employing unit, electronics applicant tracking system(ATS)With electronics recruitment website.ATS example includes Taleo, ADP Deng.The example of electronics recruitment website includes Monster.com, Indeed, Craigslist etc..
In certain embodiments, there is the level of Source Type.For example, the website of work employing unit is in Source Type level It is considered as highest, electronics ATS is considered as second high in Source Type level, and electronics recruitment website is considered as most in Source Type level It is low.Therefore, recruiting through open public examination of obtaining of website is had by oneself with recruiting through open public examination high source than what is obtained from electronics ATS from employing unit Value, from recruiting through open public examination of obtaining of electronics ATS and then with recruiting through open public examination high source value than what is obtained from electronics recruitment website.
In addition, for recruiting through open public examination for being obtained from the source in identical sources type, source value can be different.For example, from Dice.com obtain recruit through open public examination can have recruit through open public examination high source value than what is obtained from Craigslist.In some realities Apply in example, source value can be assigned to different types of source by the keeper of work trusteeship service(For example, passing through user interface).
At operation 408, produce and standardize the hashed value recruited through open public examination and the hashed value is assigned to standardization public affairs Open recruitment.In certain embodiments, hashed value is produced based on standardization position title, geographical position and employing unit's title.
In certain embodiments, using the method for the comparison data in addition to hash, for example, verification and, statistical analysis Method and machine learning method, such as neutral net or other supervised learning methods.
At operation 410, it is determined that recruiting through open public examination substantially similar recruit through open public examination with the presence or absence of in social network with standardization In the work trusteeship service of network system 210.In some embodiments using hash, by by hashed value and social networking system The multiple hashed values recruited through open public examination in 210 work trusteeship service make this determination compared to relatively, for mark at operation 408 Standardization recruits through open public examination the generation hashed value.
In some embodiments using hash, if the hashed value that standardization is recruited through open public examination is with being integrated into work trustship The hashed value recruited through open public examination in service is matched enough, then standardization is recruited through open public examination and integrated recruiting through open public examination is considered as base It is similar in sheet.In certain embodiments, if standardizing the hashed value recruited through open public examination and being integrated into work trusteeship service The hashed value recruited through open public examination is matched enough, then perform the comparison of the text of two job descriptions recruited through open public examination.In some realities Apply in example, compare and be related to calculating or compare two similarity measurements having calculated that recruited through open public examination.For example, the similar systems of Jie Kade Number can be used for comparing two recruit through open public examination between similitude.
In some embodiments using the comparative approach in addition to hash, different comparison techniques are determined for public affairs Open a large amount of similitudes between recruitment.For example, the like attribute recruited through open public examination can be performed and/or interior keyword is recruited through open public examination Comparison to determine a large amount of similitudes.
In certain embodiments, if standardization recruit through open public examination and it is integrated recruit through open public examination it is essentially similar, then tool There is recruiting through open public examination for highest source value to be stored in work trusteeship service, and be rejected with recruiting through open public examination for relatively low source value.Two It is individual recruit through open public examination with identical source value in the case of, earliest recruit through open public examination will be preserved.
In certain embodiments, if standardization recruit through open public examination and it is integrated recruit through open public examination it is essentially similar, then protect Two are deposited to recruit through open public examination and the true disclosure that shows of directional user when display is recruited through open public examination or only before display is recruited through open public examination Recruitment.If for example, display when recruiting through open public examination of specific works, the paying of work recruit through open public examination have expired and previously not yet Show that paying is recruited through open public examination to user, then actually recruit through open public examination display standardization.If overdue paying is recruited through open public examination Previously shown to user, then overdue paying is recruited through open public examination to be shown as recruiting through open public examination for specific works to user.
In certain embodiments, if it is determined that substantially similar recruit through open public examination is not present in work trusteeship service, then Standardization, which is recruited through open public examination, to be integrated into work trusteeship service.
At operation 412, in work trusteeship service, recruit through open public examination the substantially similar disclosure of replacement with standardization and recruit Engage.In certain embodiments, have been identified as not being paid for recruiting through open public examination and standard in response to substantially similar recruiting through open public examination Change the source value recruited through open public examination to be more than the substantially similar source value recruited through open public examination in work trusteeship service and perform replacement.In symbol Replacement when closing these conditions prevents the non-paid paying recruited through open public examination in replacement work trusteeship service from recruiting through open public examination, and prevents It is relatively low authoritative non-paid to recruit through open public examination substitute in work trusteeship service more authoritative and non-paid recruit through open public examination.
Fig. 4 B are to illustrate performed for recruiting through open public examination for obtaining from third party system according to some example embodiments Carry out the flow chart of the optional operation of work data de-duplication module 208 during the method 450 of data de-duplication.Method 450 In operation can use above in relation to Fig. 2 describe module performed by network system 105.As shown in Figure 4 B, side Method 450 includes operation 452 and 454.
At operation 452, determine that substantially similar recruiting through open public examination is paid for recruiting through open public examination.In certain embodiments, extremely It is at least partly based on whether social networking system 210 collects remuneration and make this by least one client of social networking system 210 It is determined that, substantially similar open trick is presented with least one user 152 to the work trusteeship service of social networking system 210 Engage.In certain embodiments, this determination is made to prevent the non-paid work trustship for recruiting through open public examination replacement social networking system 210 Paying in service is recruited through open public examination.
At operation 454, after the related work search that user 152 submits is received, to the use of social networking system 210 Family 152 is presented standardization and recruited through open public examination.In certain embodiments, the user 152 of social networking system 210 is in social networking system Job search is submitted in 210.In such systems, social networking system 210 is presented and submitted job search phase to user 152 One group closed is recruited through open public examination.In certain embodiments, presentation recruit through open public examination can be recruited through open public examination comprising paying, non-paid disclosure Recruitment, or its a certain combination.
Fig. 5 illustrates that technology discussed herein can be performed thereon(For example, method)In the example of any one or more The block diagram of machine 500.In alternative embodiments, machine 500 can serve as self-contained unit or can connect(For example, networking)Arrive it Its machine.In networked deployment, machine 500 can be in server machine, client machine or server-client network environment Operated in both abilities.In instances, machine 500 can be served as between peer(P2P)(Or other distributions)In network environment Peer machine.Machine 500 can be personal computer(PC), tablet PC, set top box(STB), personal digital assistant(PDA)、 Mobile phone, network appliance, network router, switch or bridger, or be able to carry out(Sequentially or otherwise)Specifying will Any machine of the instruction for the action taken by the machine., although the only machine of instruction sheet one, but term " machine " also will in addition It is considered as comprising individually or collectively execute instruction collection(Or multiple set)To perform appointing in method discussed herein What one or many and(For example, cloud computing, software are service(SaaS), the configuration of other computer clusters)Any collection of machines.
As described in this article, example can include logic or multiple components or mechanism, or can be by logic or multiple groups Part or mechanism operation.Circuit group is comprising hardware(For example, ball bearing made, door, logic etc.)Tangible entity in the electricity implemented Gather on road.Circuit group membership can be flexible with time and underlying hardware changeability.Circuit group is included in can be single when operating Solely or in combination perform the member of assigned operation.In instances, the hardware of circuit group can be immutably designed to carry out specific Operation(For example, hardwire).In instances, the hardware of circuit group can include the physical assemblies changeably connected(For example, performing Unit, transistor, ball bearing made etc.), the physical assemblies comprising changing for physically(For example, constant centralized particle Magnetic, electrically removable placement etc.)To encode the instruction of specific operation.When connecting physical assemblies, the basis electricity that hardware is constituted It is as the same that characteristic for example changes over conductor or vice versa from insulator.Instruction makes embedded hardware(For example, execution unit or load machine Structure)The member of the circuit group in hardware can be produced via variable connection, to perform the portion of specific operation when in operation Point.Therefore, computer-readable media is communicably coupled to other components of circuit group membership when device is operated. In example, any one in physical assemblies can be used in the more than one member in more than one circuit group.For example, in operation In, execution unit can be used at a time point in the first circuit of the first circuit group, and by the first circuit group Two circuits are reused by the tertiary circuit in second circuit group in different time.
Machine(For example, computer system)500 can include hardware processor 502(For example, CPU(CPU)、 Graphics processing unit(GPU), hardware processor core, or its any combinations), main storage 504 and static memory 506, institute Stating some or all of element element can be via cross tie part(For example, bus)508 communicate with one another.Machine 500 can enter one Step includes display unit 510, alphanumeric input device 512(For example, keyboard)And user interface(UI)Guider 514(Example Such as, mouse).In instances, display unit 510, input unit 512 and UI guiders 514 can be touch-screen displays.Machine Device 500 can additionally comprise storage device(For example, driver element)516th, signal generation device 518(For example, loudspeaker), network Interface arrangement 520 and one or more sensors 521, for example, global positioning system(GPS)Sensor, compass, accelerometer or Other sensors.Machine 500 can include o controller 528, for example, serially(For example, USB(USB)), simultaneously It is capable or other wired or wireless(For example, infrared(IR), near-field communication(NFC)Deng)Connection with one or more peripheral units (For example, printer, card reader etc.)Communicate or control one or more of peripheral units.
Storage device 516 can be embodied comprising storage thereon in technology described herein or function any one or One or more groups of data structures that are multiple or being utilized by any one or more in technology described herein or function or Instruction 524(For example, software)Machine-readable medium 522.Instruction 524 can also during it is performed by machine 500 completely or Reside at least in part in main storage 504, in static memory 506 or in hardware processor 502.In instances, hardware One in processor 502, main storage 504, static memory 506 or storage device 516 or any combinations may be constructed machine Device readable media.
Although machine-readable medium 522 is illustrated for single medium, term " machine-readable medium " can include by with It is set to the single medium or multiple media for storing one or more instructions 524(For example, centralized or distributed database, and/or Associated Cache and server).
Term " machine-readable medium " can be used to be performed by machine 500 and make machine comprising that can store, encode or carry Device 500 performs the instruction of any one or more in the technology of the present invention, or can store, encodes or carry and instructed by this class Using or with the associated data structure of such instruction any media.Non-limiting machine-readable medium example can be comprising solid State memory and optics and magnetic medium.In instances, centralized machine-readable medium includes the machine with multiple particles Readable media, the particle has constant(For example, static)Quality.Therefore, centralized machine-readable medium is that non-transitory is passed Broadcast signal.The instantiation of centralized machine-readable medium can be included:Nonvolatile memory, such as semiconductor memory are filled Put(For example, EPROM(EPROM)Or Electrically Erasable Read Only Memory(EEPROM))Deposited with flash memory Reservoir device;Disk, such as internal hard drive and removable disk;Magnetic optical disc;And CD-ROM and DVD-ROM disks.
Instruction 524 can further utilize any one in multiple host-host protocols(For example, frame relay, Internet Protocol (IP), transmission control protocol(TCP), UDP(UDP), HTTP(HTTP)Deng)Connect via network Mouth device 520 is transmitted or received on communication network 526 using transmission media.Instance communications network can include LAN (LAN), wide area network(WAN), packet data network(For example, internet), mobile telephone network(For example, cellular network), it is simple Fogey phone(POTS)Network and radio data network(For example, referred to as Wi-Fi IEEE(IEEE) 802.11 series standards, the series standards of IEEE 802.16 for being referred to as WiMax), IEEE 802.15.4 series standards, it is at the same level between (P2P)Network, and other networks.In instances, Network Interface Unit 520 can include one or more physical jacks(Example Such as, Ethernet, coaxial or telephone jack)Or one or more antennas are to be connected to communication network 526.In instances, network connects Mouth device 520 can use single input and multi-output comprising multiple antennas(SIMO), multiple-input and multiple-output(MIMO)Or multi input Single output(MISO)At least one in technology wirelessly communicates.Term " transmission media ", which should be considered as including, can store, compile Code carries any invisible media of the instruction to be performed by machine 500, and comprising numeral or analog communication signal or to promote Other invisible media of the communication of this software.
Additional annotations and example embodiment:
Example 1 includes the theme of following item(For example, method, the component acted for execution, or the machine comprising instruction can Media are read, the instruction makes machine-executed actions when being performed by machine):Obtained by first instance and represent third party's recruitment system The data recruited through open public examination on system, the packet contains position title and job description;Standardization position title is to match by the At least one in multiple predefined position titles of one Entity recognition;Standardization job description is recognized with meeting by first instance Data format;Standardization position title and standardization job description are combined into standardization and recruited through open public examination;And will standardization Recruit through open public examination and be integrated into the recruitment system of first instance.
Example 2 can include the theme of example 1, or optionally can be combined with the theme with comprising wherein standardizing Position title includes the undesirable character for removing appearance, and the removing is performed using at least one regular expression.
Example 3 can include example 1 to 2 in one or any combination of theme, or can optionally with the theme Combination is with comprising wherein standardization position title includes at least one in following item:Determine the geographical position in position title And geographical position determined by being removed from position title;Or determine employing unit's title in position title and from position Employing unit's title determined by being removed in title.
Example 4 can include example 1 to 3 in one or any combination of theme, or can optionally with the theme Combination is with comprising wherein standardization position title, which is included in, represents that the word or phrase that are recognized during abbreviation with first instance substitute duty Abbreviation in the title of position.
Example 5 can include example 1 to 4 in one or any combination of theme, or can optionally with the theme Combination is with comprising wherein substitute comprising using at least one in the context in the context and job description in position title Eliminate the ambiguity of abbreviation.
Example 6 can include example 1 to 5 in one or any combination of theme, or can optionally with the theme Combination is with comprising wherein standardization position title is included:Position title including orderly multiple words is divided into the row of word Table;Multiple arrangements of word are produced according to the list of word;And most tight fit is selected from multiple arrangements of word by The arrangement of the word of at least one in multiple predefined position titles of one Entity recognition.
Example 7 can include example 1 to 6 in one or any combination of theme, or can optionally with the theme Combination is with comprising wherein standardization position title is further comprising the position title numbering determined corresponding to standardization position title With seniority level, and wherein position title numbering and seniority level in standardization is recruited through open public examination.
Example 8 can include example 1 to 7 in one or any combination of theme, or can optionally with the theme Combination is with comprising wherein the knowledge for including geographical position, employing unit's title, recruitment industry and workmanship is recruited through open public examination in standardization Not at least one.
Example 9 can include example 1 to 8 in one or any combination of theme, or can optionally with the theme Combine to include theme(For example, unit or system)Including:Machine comprising memory He at least one processor;Can The work trapping module performed by machine, it is configured to obtain the disclosure represented in third party's recruitment system by first instance The data of recruitment, the packet contains position title and job description;And the woprk standardization module that can be performed by machine, its It is configured to:Position title is standardized to match at least one in the multiple predefined position titles recognized by first instance; Job description is standardized to meet the data format recognized by first instance;Will standardization position title and standardization job description Standardization is combined into recruit through open public examination;And be integrated into standardizing to recruit through open public examination in the recruitment system of first instance.
Example 10 can include the theme of example 9, or optionally can be combined with the theme with comprising wherein standardizing Position title includes the undesirable character for removing appearance, and the removing is performed using at least one regular expression.
Example 11 can include example 9 to 10 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein standardization position title includes at least one in following item:Determine the geographical position in position title Put and identified geographical position is removed from position title;Or determine employing unit's title in position title and from duty Employing unit's title determined by being removed in the title of position.
Example 12 can include example 9 to 11 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein standardization position title, which is included in, represents that the word or phrase that are recognized during abbreviation with first instance are substituted Abbreviation in position title.
Example 13 can include example 9 to 12 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein substitute comprising using at least one in the context in the context and job description in position title The individual ambiguity for eliminating abbreviation.
Example 14 can include example 9 to 13 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein standardization position title is included:Position title including orderly multiple words is divided into word List;Multiple arrangements of word are produced according to the list of word;And selected from multiple arrangements of word most tight fit by The arrangement of the word of at least one in multiple predefined position titles of first instance identification.
Example 15 can include example 9 to 14 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein standardization position title is further compiled comprising the position title for determining to correspond to standardization position title Number and seniority level, and wherein position title numbering and seniority level in standardization is recruited through open public examination.
Example 16 can include example 9 to 15 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein standardization is recruited through open public examination comprising geographical position, employing unit's title, recruitment industry and workmanship At least one in identification.
Example 17 can include example 1 to 16 in one or any combination of theme, or can optionally with the master Topic combines to include theme(For example, method, the component acted for execution, or the machine-readable medium comprising instruction, it is described Instruction makes machine-executed actions when being performed by machine)Including:The disclosure represented on third party system is obtained by first instance The data of recruitment;Normalized number produces standardization and recruited through open public examination according to this;First source value is assigned into standardization to recruit through open public examination, it is described First source value is at least partly determined by the Source Type of third party system;Produce and standardize the first hashed value recruited through open public examination and incite somebody to action First hashed value is distributed to standardization and recruited through open public examination;It is determined that substantially similar with the second source value and the second hashed value Recruit through open public examination and be present in the recruitment system of first instance;And recruited through open public examination in the recruitment system of first instance with standardization Replacement is substantially similar to recruit through open public examination, and described substitute performs in response to following item:Substantially similar recruiting through open public examination has been recognized For be not paid for recruiting through open public examination and the first source value be more than the second source value.
Example 18 can include the theme of example 17, or optionally can be combined with the theme with comprising wherein representing The packet recruited through open public examination on third party system contains position title, geographical position and employing unit's title, wherein standardization is public Open recruitment and include standardization position title, and wherein based on standardization position title, geographical position and employing unit's title Produce the first hashed value that standardization is recruited through open public examination.
Example 19 can include example 17 to 18 in one or any combination of theme, or can optionally with the master Topic combination is with comprising the Source Type of wherein third party system is the website of employing unit, electronics applicant tracking system and electronics At least one in recruitment website.
Example 20 can include example 17 to 19 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein the source value of the website of employing unit is more than the source value of electronics applicant tracking system, and wherein electric The source value of sub- applicant tracking system is more than the source value of electronics recruitment website.
Example 21 can include example 17 to 20 in one or any combination of theme, or can optionally with the master Topic combination wherein determining that substantially similar recruiting through open public examination is present in the recruitment system of first instance with comprising including first Hashed value is compared with the multiple hashed values recruited through open public examination in the recruitment system of first instance, and the multiple hashed value includes the Two hashed values.
Example 22 can include example 17 to 21 in one or any combination of theme, or can optionally with the master Topic combines to be determined by least one client of first instance substantially similar comprising whether remuneration is collected based on first instance Recruit through open public examination and be paid for recruiting through open public examination, to be presented substantially similar at least one user of the recruitment system of first instance Recruit through open public examination.
Example 23 can include example 17 to 22 in one or any combination of theme, or can optionally with the master After topic combination is searched for the related work submitted included in the user received by the recruitment system of first instance, presented to user Standardization is recruited through open public examination.
Example 24 can include example 1 to 23 in one or any combination of theme, or can optionally with the master Topic combines to include theme(For example, unit or system)Including:Machine comprising memory He at least one processor; The work trapping module that can be performed by machine, it is configured to obtain the public affairs represented in third party's recruitment system by first instance Open the data of recruitment;The woprk standardization module that can be performed by machine, it is configured to standardization and recruited through open public examination;And can be by machine The work data de-duplication module that device is performed, it is configured to:First source value is assigned into standardization to recruit through open public examination, described One source value is at least partly determined by the Source Type of third party system;Produce the first hashed value for recruiting through open public examination of standardization and by institute State the first hashed value and be assigned to standardization and recruit through open public examination;It is determined that the substantially similar public affairs with the second source value and the second hashed value Recruitment is opened to be present in the recruitment system of first instance;And recruited through open public examination and replace with standardization in the recruitment system of first instance Recruited through open public examination for substantially similar, described substitute performs in response to following item:Substantially similar recruiting through open public examination has been identified as It is not paid for recruiting through open public examination and the first source value is more than the second source value.
Example 25 can include the theme of example 24, or optionally can be combined with the theme with comprising wherein representing The packet recruited through open public examination on third party system contains position title, geographical position and employing unit's title, wherein standardization is public Open recruitment and include standardization position title, and wherein based on standardization position title, geographical position and employing unit's title Produce the first hashed value that standardization is recruited through open public examination.
Example 26 can include example 24 to 25 in one or any combination of theme, or can optionally with the master Topic combination is with comprising the Source Type of wherein third party system is the website of employing unit, electronics applicant tracking system and electronics At least one in recruitment website.
Example 27 can include example 24 to 26 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein the source value of the website of employing unit is more than the source value of electronics applicant tracking system, and wherein electric The source value of sub- applicant tracking system is more than the source value of electronics recruitment website.
Example 28 can include example 24 to 27 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein the data de-duplication module that works is configured to by by the recruitment of the first hashed value and first instance The multiple hashed values recruited through open public examination in system compare and at least partly determine that substantially similar recruiting through open public examination is present in the In the recruitment system of one entity, the multiple hashed value includes the second hashed value.
Example 29 can include example 24 to 28 in one or any combination of theme, or can optionally with the master Topic combination is with comprising wherein the data de-duplication module that works is configured to be at least partially based on whether first instance collects remuneration And determine that substantially similar recruiting through open public examination is paid for recruiting through open public examination by least one client of first instance, with to first instance At least one user of recruitment system present and substantially similar recruit through open public examination.
Example 30 can include example 24 to 29 in one or combination theme, or can optionally with the theme group Close with comprising presentation module is configured to receiving the rear recruitment system to first instance for the related work search that user submits User present standardization recruit through open public examination.
Each in these non-limiting examples can be individually present, or can be with various arrangements or combination and other examples One or more of combination.
General term used herein in the field of computer network and computer system.The term is in this area In it is known and for convenience, be provided only as non-limiting examples.Therefore, unless otherwise indicated, otherwise in claims The explanation of corresponding term be not limited to any specific definitions.Therefore, the term used in claims should be given most widely Reasonable dismissal.
Although being described herein and describing specific embodiment, those of ordinary skill in the art, which will be appreciated that, calculates reality Any arrangement of existing identical purpose can replace shown specific embodiment.Those of ordinary skill in the art are readily apparent that many is repaiied Change.Therefore, present application is expected to cover any modification or change.
Reference discussed in detail above comprising to accompanying drawing, the part that the accompanying drawing formation is described in detail.Accompanying drawing by means of Illustrate the specific embodiment for showing to put into practice.These embodiments are also referred to as " example ".Such example can be included Element in addition to those shown or described elements.However, the present inventor it is also contemplated that wherein only provide it is shown or The example of those described elements.In addition, the present inventor is it is also contemplated that using relative to instantiation(Or one or more than one Individual aspect)Or relative to other examples shown herein or described(Or its one or more than one aspect)And show or retouch Any combinations for those elements stated or the example of arrangement(Or its one or more than one aspect).
All publication, patent and the patent document referred in this document is incorporated herein in entirety by reference, It is general just as being individually incorporated to by reference.Used between this document and those documents being incorporated by reference In the case that method is inconsistent, the usage in the bibliography being incorporated to should be considered as supplementing the usage of this document;For non-adjustable Sum it is inconsistent, the usage in this document plays a leading role.
In this document, as in patent document institute it is common and using term " one " with comprising one or more than one, this with Any other example of " at least one " or " one or more " is used unrelated.In this document, term "or" is used to refer to Non-exclusionism or, make it that unless otherwise directed, otherwise " A or B " include " A rather than B ", " B rather than A " and " A and B ".It is literary herein In offering, term "comprising" and " wherein(in which)" it is used as corresponding term " comprising " and " wherein(wherein)" it is popular etc. Imitate term.In addition, in the dependent claims, term "comprising" and " comprising " are open, that is to say, that comprising except power System, device, article or the process of element outside those elements listed in sharp claim after this term are regarded as In the range of claims.In addition, in the dependent claims, term " first ", " second " and " the 3rd " etc. is only used Mark, and be not intended to apply numerical requirements to its object.
Method example described herein can be at least in part by machine or computer-implemented.Some examples can be wrapped Have available for configuration electronic installation to perform the computer-readable matchmaker of the instruction of the method as described in the above example containing coding Body or machine-readable medium.The embodiment of such method can include code, such as microcode, assembler language code, senior language Say code etc..This code can include the computer-readable instruction for being used for performing various methods.The code can form meter The part of calculation machine program product.In addition, in instances, code can visibly be stored for example during performing or at other times On one or more volatibility, non-transitory or non-volatile tangible computer readable media.These tangible computers are readable The example of media can include, but are not limited to, hard disk, moveable magnetic disc, removable CD(For example, CD and digital video magnetic Disk), cassette tape, storage card or rod, random access memory(RAM), read-only storage(ROM)Deng.
Above description is contemplated to illustrative and not restrictive.For example, examples detailed above(Or in terms of one or more) Can be with combination with one another.Such as those skilled in the art can use other embodiments after above description is consulted.Offer meets 37C.F.R.§1.72(b)Summary with allow reader quickly determine technology disclosure essence and by summary be not used in Explain or the summary is submitted in the understanding of scope or meaning of limitation claims.In addition, in above embodiment In, various features can be grouped together to simplify the present invention.This situation should not be construed to it is expected that the announcement do not advocated is special It is required to levy for any claim.On the contrary, present subject matter can be the institute than specific disclosed embodiment There is feature to lack.Therefore, appended claims are incorporated into embodiment hereby, each of which claim conduct Separate embodiments and be individually present, and expected such embodiment can be combined with each other with various combinations or arrangement.The scope of embodiment The full breadth of the equivalent that should be authorized by reference to appended claims and this claims is determined.

Claims (45)

1. a kind of method, it includes:
The data recruited through open public examination represented in third party's recruitment system are obtained by first instance, the packet contains position title And job description;
The position title is standardized to match at least one in the multiple predefined position titles recognized by the first instance It is individual;
The job description is standardized to meet the data format recognized by the first instance;
The standardization position title and the standardization job description are combined into standardization and recruited through open public examination;And
The standardization is recruited through open public examination and is integrated into the recruitment system of the first instance.
2. according to the method described in claim 1, wherein standardizing the position title includes the undesirable of removing appearance Character, the removing is performed using at least one regular expression.
3. according to the method described in claim 1, wherein standardizing the position title includes at least one in the following It is individual:
Determine the geographical position in the position title and remove the identified geographical position from the position title;Or
Determine employing unit's title in the position title and remove the identified employment list from the position title Position title.
4. according to the method described in claim 1, it is included in wherein standardizing the position title when representing abbreviation with described the The word or phrase of one Entity recognition substitute the abbreviation in the position title.
5. method according to claim 4, includes using the context in the position title and is reported wherein substituting The ambiguity of at least one elimination abbreviation in context in the description of position.
6. according to the method described in claim 1, included wherein standardizing the position title:
The position title including orderly multiple words is divided into the list of word;
Multiple arrangements of word are produced according to the list of word;And
The multiple predefined duty for selecting most tight fit to be recognized by the first instance from the multiple arrangement of word The arrangement of the word of at least one in the title of position.
7. method according to claim 6, wherein standardizing the position title further comprising determination corresponding to described Standardize the position title numbering and seniority level of position title, and wherein described position title numbering and the work Qualifications and record of service level is in the standardization is recruited through open public examination.
8. according to the method described in claim 1, wherein the standardization is recruited through open public examination comprising geographical position, employing unit's name Claim, recruit at least one in the identification of industry and workmanship.
9. a kind of system, it includes:
Machine, it includes memory and at least one processor;
The work trapping module that can be performed by the machine, it, which is configured to obtain by first instance, represents third party's recruitment system The data recruited through open public examination on system, the packet contains position title and job description;And
The woprk standardization module that can be performed by the machine, it is configured to:
The position title is standardized to match at least one in the multiple predefined position titles recognized by the first instance It is individual;
The job description is standardized to meet the data format recognized by the first instance;
The standardization position title and the standardization job description are combined into standardization and recruited through open public examination;And
The standardization is recruited through open public examination and is integrated into the recruitment system of the first instance.
10. system according to claim 9, wherein standardizing the position title includes the undesirable of removing appearance Character, the removing is performed using at least one regular expression.
11. system according to claim 9, wherein standardizing the position title includes at least one in the following It is individual:
Determine the geographical position in the position title and remove the identified geographical position from the position title;Or
Determine employing unit's title in the position title and remove the identified employment list from the position title Position title.
12. system according to claim 9, it is included in wherein standardizing the position title when representing abbreviation with described the The word or phrase of one Entity recognition substitute the abbreviation in the position title.
13. system according to claim 12, wherein substituting comprising using the context in the position title and described The ambiguity of at least one elimination abbreviation in context in job description.
14. system according to claim 9, is included wherein standardizing the position title:
The position title including orderly multiple words is divided into the list of word;
Multiple arrangements of word are produced according to the list of word;And
The multiple predefined duty for selecting most tight fit to be recognized by the first instance from the multiple arrangement of word The arrangement of the word of at least one in the title of position.
15. system according to claim 14, wherein standardize the position title further corresponds to institute comprising determination State the position title numbering and seniority level of standardization position title, and wherein described position title numbering and the work Make qualifications and record of service level in the standardization is recruited through open public examination.
16. system according to claim 9, wherein the standardization is recruited through open public examination comprising geographical position, employing unit's name Claim, recruit at least one in the identification of industry and workmanship.
17. a kind of non-transitory machine-readable storage media including instructing, the instruction is by the one or more of machine Reason device performs the machine when performing includes following operation:
The data recruited through open public examination represented in third party's recruitment system are obtained by first instance, the packet contains position title And job description;
The position title is standardized to match at least one in the multiple predefined position titles recognized by the first instance It is individual;
The job description is standardized to meet the data format recognized by the first instance;
The standardization position title and the standardization job description are combined into standardization and recruited through open public examination;And
The standardization is recruited through open public examination and is integrated into the recruitment system of the first instance.
18. non-transitory machine-readable storage media according to claim 17, wherein standardizing the position title bag Containing the undesirable character for removing appearance, the removing is performed using at least one regular expression.
19. non-transitory machine-readable storage media according to claim 17, wherein standardizing the position title bag Containing at least one in the following:
Determine the geographical position in the position title and remove the identified geographical position from the position title;Or
Determine employing unit's title in the position title and remove the identified employment list from the position title Position title.
20. non-transitory machine-readable storage media according to claim 17, wherein standardizing the position title bag It is contained in the abbreviation in the word recognized when representing abbreviation with the first instance or the phrase replacement position title.
21. non-transitory machine-readable storage media according to claim 20, the position is used wherein substituting and including The ambiguity of at least one elimination abbreviation in the context in context and the job description in title.
22. non-transitory machine-readable storage media according to claim 17, wherein standardizing the position title bag Contain:
The position title including orderly multiple words is divided into the list of word;
Multiple arrangements of word are produced according to the list of word;And
The multiple predefined duty for selecting most tight fit to be recognized by the first instance from the multiple arrangement of word The arrangement of the word of at least one in the title of position.
23. non-transitory machine-readable storage media according to claim 22, enters wherein standardizing the position title One step is numbered and seniority level comprising the position title for determining to correspond to the standardization position title, and wherein described Position title is numbered and the seniority level is in the standardization is recruited through open public examination.
24. non-transitory machine-readable storage media according to claim 17, wherein bag is recruited through open public examination in the standardization Containing geographical position, employing unit's title, recruitment industry and workmanship identification at least one.
25. a kind of method, it includes:
The data recruited through open public examination represented on third party system are obtained by first instance;
The data are standardized to recruit through open public examination to produce standardization;
First source value is assigned into the standardization to recruit through open public examination, first source value is at least partly by the third party system Source Type is determined;
Generation is described to be standardized the first hashed value recruited through open public examination and first hashed value is assigned into the standardization public affairs Open recruitment;
It is determined that substantially similar the recruiting through open public examination with the second source value and the second hashed value is present in the trick of the first instance Engage in system;And
In the recruitment system of the first instance, the replacement substantially similar public affairs are recruited through open public examination with the standardization Recruitment is opened, described substitute performs in response to following item:
Substantially similar the recruiting through open public examination has been identified as not being paid for recruiting through open public examination, and
First source value is more than second source value.
26. method according to claim 25, wherein represent on the third party system it is described recruit through open public examination it is described Packet contains position title, geographical position and employing unit's title, wherein the standardization is recruited through open public examination comprising standardization position Title, and wherein produced described based on the standardization position title, the geographical position and employing unit's title Standardize first hashed value recruited through open public examination.
27. method according to claim 25, wherein the Source Type of the third party system is the net of employing unit Stand, at least one in electronics applicant tracking system and electronics recruitment website.
28. the source value of the website of method according to claim 27, wherein employing unit, which is more than electronics job hunter, tracks system The source value of system, and wherein the source value of electronics applicant tracking system is more than the source value of electronics recruitment website.
29. method according to claim 25, wherein described determine that substantially similar the recruiting through open public examination is present in institute State and included in first hashed value and the recruitment system of the first instance in the recruitment system of first instance The multiple hashed values recruited through open public examination compare, the multiple hashed value include second hashed value.
30. method according to claim 25, it includes:
Based on the first instance whether collect remuneration and by least one client of the first instance determine it is described substantially Similar recruit through open public examination is that the paying is recruited through open public examination, with least one user of the recruitment system of the first instance Described substantially similar recruit through open public examination is presented.
31. method according to claim 25, it further comprises:
After the related work search that the user received by the recruitment system of the first instance submits, to the user The standardization is presented to recruit through open public examination.
32. a kind of system, it includes:
Machine, it includes memory and at least one processor;
The work trapping module that can be performed by the machine, it, which is configured to obtain by first instance, represents third party's recruitment system The data recruited through open public examination on system;
The woprk standardization module that can be performed by the machine, it is configured to recruit through open public examination described in standardization;And
The work data de-duplication module that can be performed by the machine, it is configured to:
First source value is assigned into the standardization to recruit through open public examination, first source value is at least partly by the third party system Source Type is determined;
Generation is described to be standardized the first hashed value recruited through open public examination and first hashed value is assigned into the standardization public affairs Open recruitment;
It is determined that substantially similar the recruiting through open public examination with the second source value and the second hashed value is present in the trick of the first instance Engage in system;And
In the recruitment system of the first instance, the replacement substantially similar public affairs are recruited through open public examination with the standardization Recruitment is opened, described substitute performs in response to following item:
Substantially similar the recruiting through open public examination has been identified as not being paid for recruiting through open public examination, and
First source value is more than second source value.
33. system according to claim 32, wherein represent on the third party system it is described recruit through open public examination it is described Packet contains position title, geographical position and employing unit's title, wherein the standardization is recruited through open public examination comprising standardization position Title, and wherein produced described based on the standardization position title, the geographical position and employing unit's title Standardize first hashed value recruited through open public examination.
34. system according to claim 32, wherein the Source Type of the third party system is the net of employing unit Stand, at least one in electronics applicant tracking system and electronics recruitment website.
35. the source value of the website of system according to claim 34, wherein employing unit, which is more than electronics job hunter, tracks system The source value of system, and wherein the source value of electronics applicant tracking system is more than the source value of electronics recruitment website.
36. system according to claim 32, wherein the work data de-duplication module is configured to by by institute State the first hashed value compared with the multiple hashed values recruited through open public examination in the recruitment system of the first instance and at least Part determines that substantially similar the recruiting through open public examination is present in the recruitment system of the first instance, the multiple to dissipate Train value includes second hashed value.
37. system according to claim 32, wherein the work data de-duplication module is configured at least partly Whether remuneration is collected based on the first instance and is determined by least one client of the first instance described substantially similar Recruit through open public examination be it is described paying recruit through open public examination, with least one user of the recruitment system of the first instance present Described substantially similar recruits through open public examination.
38. system according to claim 32, it further comprises module is presented, and the presentation module is configured to connecing The mark is presented in the user for receiving the recruitment system of the backward first instance for the related work search that the user submits Standardization is recruited through open public examination.
39. a kind of non-transitory machine-readable storage media including instructing, the instruction is by the one or more of machine Reason device performs the machine when performing includes following operation:
The data recruited through open public examination represented on third party system are obtained by first instance;
The data are standardized to recruit through open public examination to produce standardization;
First source value is assigned into the standardization to recruit through open public examination, first source value is at least partly by the third party system Source Type is determined;
Generation is described to be standardized the first hashed value recruited through open public examination and first hashed value is assigned into the standardization public affairs Open recruitment;
It is determined that substantially similar the recruiting through open public examination with the second source value and the second hashed value is present in the trick of the first instance Engage in system;And
In the recruitment system of the first instance, the replacement substantially similar public affairs are recruited through open public examination with the standardization Recruitment is opened, described substitute performs in response to following item:
Substantially similar the recruiting through open public examination has been identified as not being paid for recruiting through open public examination, and
First source value is more than second source value.
40. the non-transitory machine-readable storage media according to claim 39, wherein representing on the third party system The packet recruited through open public examination contain position title, geographical position and employing unit's title, wherein the standardization is public Open recruitment and include standardization position title, and wherein based on the standardization position title, the geographical position and the use People's organization and produce and described standardize first hashed value recruited through open public examination.
41. the non-transitory machine-readable storage media according to claim 39, wherein the third party system is described Source Type is at least one in the website of employing unit, electronics applicant tracking system and electronics recruitment website.
42. non-transitory machine-readable storage media according to claim 41, the wherein source value of the website of employing unit More than the source value of electronics applicant tracking system, and wherein, the source value of electronics applicant tracking system is more than electronics The source value of recruitment website.
43. the non-transitory machine-readable storage media according to claim 39, wherein substantially class described in the determination As recruit through open public examination in the recruitment system for be present in the first instance and include first hashed value and described first The multiple hashed values recruited through open public examination in the recruitment system of entity compare, and the multiple hashed value dissipates comprising described second Train value.
44. the non-transitory machine-readable storage media according to claim 39, it includes being used for based on described first in fact Whether body collects remuneration and determines that substantially similar the recruiting through open public examination is institute by least one client of the first instance State paying to recruit through open public examination, to be presented described substantially similar at least one user of the recruitment system of the first instance The instruction recruited through open public examination.
45. the non-transitory machine-readable storage media according to claim 39, it further comprises being used to receive institute The standardization is presented in the user for stating the recruitment system of the backward first instance of the related work search of user's submission The instruction recruited through open public examination.
CN201580064463.7A 2014-09-30 2015-03-25 Publication recruitment normalization and deduplication Active CN107004167B (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US14/502,261 US10043157B2 (en) 2014-09-30 2014-09-30 Job posting standardization and deduplication
US14/502261 2014-09-30
US14/502,224 US20160092838A1 (en) 2014-09-30 2014-09-30 Job posting standardization and deduplication
US14/502224 2014-09-30
PCT/US2015/022480 WO2016053382A1 (en) 2014-09-30 2015-03-25 Job posting standardization and deduplication

Publications (2)

Publication Number Publication Date
CN107004167A true CN107004167A (en) 2017-08-01
CN107004167B CN107004167B (en) 2022-04-19

Family

ID=55631209

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580064463.7A Active CN107004167B (en) 2014-09-30 2015-03-25 Publication recruitment normalization and deduplication

Country Status (2)

Country Link
CN (1) CN107004167B (en)
WO (1) WO2016053382A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10043157B2 (en) 2014-09-30 2018-08-07 Microsoft Technology Licensing, Llc Job posting standardization and deduplication

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10997560B2 (en) 2016-12-23 2021-05-04 Google Llc Systems and methods to improve job posting structure and presentation
US9996523B1 (en) 2016-12-28 2018-06-12 Google Llc System for real-time autosuggestion of related objects
US10607273B2 (en) 2016-12-28 2020-03-31 Google Llc System for determining and displaying relevant explanations for recommended content

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060229899A1 (en) * 2005-03-11 2006-10-12 Adam Hyder Job seeking system and method for managing job listings
US20080065633A1 (en) * 2006-09-11 2008-03-13 Simply Hired, Inc. Job Search Engine and Methods of Use
US20080065630A1 (en) * 2006-09-08 2008-03-13 Tong Luo Method and Apparatus for Assessing Similarity Between Online Job Listings
CN101512594A (en) * 2006-05-16 2009-08-19 鲍恩蒂乔布斯有限公司 Method to facilitate engagement and communication between a company and a recruiter
CN101520867A (en) * 2009-04-03 2009-09-02 汤溪蔚 Method and system for convenient network job hunting and recruitment
US7720791B2 (en) * 2005-05-23 2010-05-18 Yahoo! Inc. Intelligent job matching system and method including preference ranking
CN102378973A (en) * 2009-03-30 2012-03-14 爱萨有限公司 System and method for data deduplication
US20130290205A1 (en) * 2012-04-30 2013-10-31 Gild, Inc. Recruiting service graphical user interface

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8375026B1 (en) * 2006-01-13 2013-02-12 CareerBuilder, LLC Method and system for matching data sets of non-standard formats
US8271473B2 (en) * 2007-06-25 2012-09-18 Jobs2Web, Inc. System and method for career website optimization
US8473503B2 (en) * 2011-07-13 2013-06-25 Linkedin Corporation Method and system for semantic search against a document collection
US20140149206A1 (en) * 2012-11-29 2014-05-29 Linkedin Corporation Combined sponsored and unsponsored content group

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060229899A1 (en) * 2005-03-11 2006-10-12 Adam Hyder Job seeking system and method for managing job listings
US7720791B2 (en) * 2005-05-23 2010-05-18 Yahoo! Inc. Intelligent job matching system and method including preference ranking
CN101512594A (en) * 2006-05-16 2009-08-19 鲍恩蒂乔布斯有限公司 Method to facilitate engagement and communication between a company and a recruiter
US20080065630A1 (en) * 2006-09-08 2008-03-13 Tong Luo Method and Apparatus for Assessing Similarity Between Online Job Listings
US8099415B2 (en) * 2006-09-08 2012-01-17 Simply Hired, Inc. Method and apparatus for assessing similarity between online job listings
US20080065633A1 (en) * 2006-09-11 2008-03-13 Simply Hired, Inc. Job Search Engine and Methods of Use
CN102378973A (en) * 2009-03-30 2012-03-14 爱萨有限公司 System and method for data deduplication
CN101520867A (en) * 2009-04-03 2009-09-02 汤溪蔚 Method and system for convenient network job hunting and recruitment
US20130290205A1 (en) * 2012-04-30 2013-10-31 Gild, Inc. Recruiting service graphical user interface

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10043157B2 (en) 2014-09-30 2018-08-07 Microsoft Technology Licensing, Llc Job posting standardization and deduplication

Also Published As

Publication number Publication date
CN107004167B (en) 2022-04-19
WO2016053382A1 (en) 2016-04-07

Similar Documents

Publication Publication Date Title
JP6784308B2 (en) Programs that update facility characteristics, programs that profile facilities, computer systems, and how to update facility characteristics
CN105532030B (en) For analyzing the devices, systems, and methods of the movement of target entity
US20190340538A1 (en) Identifying entities using a deep-learning model
US9183497B2 (en) Performance-efficient system for predicting user activities based on time-related features
Morabito Big data and analytics
US10003926B2 (en) Predicting human movement behaviors using location services model
JP6693502B2 (en) Information processing apparatus, information processing method, and program
CN104123398B (en) A kind of information-pushing method and device
CN104508739B (en) Dynamic language model
CN106796550B (en) Information delivery device and method
US10699320B2 (en) Marketplace feed ranking on online social networks
US20150161529A1 (en) Identifying Related Events for Event Ticket Network Systems
US20120076367A1 (en) Auto tagging in geo-social networking system
KR20180055876A (en) Mobile service terminal, system and data processing method used for airport service
US20160066041A1 (en) Mobility enhanced advertising on internet protocol television
GB2547395A (en) User maintenance system and method
US10445386B2 (en) Search result refinement
Cui et al. Travel behavior classification: an approach with social network and deep learning
CN110096645A (en) Information recommendation method, device, equipment and medium
US20180336529A1 (en) Job posting standardization and deduplication
Serra-Cantallops et al. Host community resignation to nightclub tourism
CN111179031A (en) Training method, device and system for commodity recommendation model
CN107004167A (en) Recruit through open public examination standardization and data de-duplication
KR20200102500A (en) Method, apparatus and selection engine for classification matching of videos
EP3188086A1 (en) Identifying entities using a deep-learning model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20180503

Address after: Washington State

Applicant after: Micro soft technique license Co., Ltd

Address before: American California

Applicant before: LINKEDIN CORPORATION

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant