WO2017215245A1 - Method and device for searching resumes - Google Patents

Method and device for searching resumes Download PDF

Info

Publication number
WO2017215245A1
WO2017215245A1 PCT/CN2016/113202 CN2016113202W WO2017215245A1 WO 2017215245 A1 WO2017215245 A1 WO 2017215245A1 CN 2016113202 W CN2016113202 W CN 2016113202W WO 2017215245 A1 WO2017215245 A1 WO 2017215245A1
Authority
WO
WIPO (PCT)
Prior art keywords
weight
resume
related word
keyword
search
Prior art date
Application number
PCT/CN2016/113202
Other languages
French (fr)
Chinese (zh)
Inventor
李贤�
Original Assignee
广州视源电子科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州视源电子科技股份有限公司 filed Critical 广州视源电子科技股份有限公司
Publication of WO2017215245A1 publication Critical patent/WO2017215245A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying

Definitions

  • the present invention relates to the field of computer information retrieval, and in particular, to a resume search method and apparatus.
  • the search engine will search for keywords and related when searching for resumes. Words use the same weight value to search, then the frequency of hits determines the ranking of the searched resumes, the search results are likely to be biased towards a higher frequency related words, on the other hand, when the number of related words is much more than In the case of keywords, the keywords may be overwhelmed by related words during the retrieval process, and it is difficult to highlight the role of keywords as the main search keywords in the process of searching for resumes.
  • the embodiment of the invention provides a resume search method, which balances the weights of keywords and related words in the resume search process, and improves the efficiency of resume search.
  • the embodiment of the invention provides a resume search method, including:
  • the related word set receives a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword;
  • the keyword, the weight value of the keyword, each related word, and the weight value of each of the related words are submitted to a search engine to search for a resume from the resume database.
  • the first mode is selected as the weight calculation mode, and based on the selected weight calculation mode, according to each of the related The degree of relevance of the word to the keyword, and the weight value of each related word and the weight value of the keyword when calculating the resume search are calculated, specifically:
  • the first weight formula Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
  • the second mode is selected as the weight calculation mode, based on the selected weight calculation manner, according to Calculating the weight value of each related word and the weight value of the keyword when performing the resume search, the specificity of each related word and the keyword, specifically:
  • Base score formula Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
  • the method further includes:
  • search engine to be used from the resume database.
  • Search for a resume specifically:
  • the keyword and each of the related words included in the resume are displayed in a highlighted form.
  • an embodiment of the present invention further provides a resume search device, including:
  • a receiving module configured to receive a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword ;
  • a weight calculation module configured to select a weight calculation manner according to the number of related words in the related word set, and calculate, according to the selected weight calculation manner, based on the correlation degree between each of the related words and the keyword, The weight value of each related word in the resume search and the weight value of the keyword;
  • a search module configured to submit the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words to a search engine to search for a resume from the resume database .
  • the weight calculation module includes:
  • a first calculating unit configured to: when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, select the first mode as the weight calculation mode, according to the first weight formula Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
  • the weight calculation module further includes:
  • a reference score calculation unit configured to: when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, select the second mode as the weight calculation mode, according to the reference score formula Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the correlation of all related words recorded in the related word set;
  • a sum calculation unit for summing the reference scores of all the related words to obtain a total score of the reference scores sum
  • a fourth calculating unit configured to determine whether a reference score of the i-th related word is greater than W A /n; if yes, according to a fourth weight formula Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula Calculating a weight value W i of the i-th related word when performing a resume search; wherein n is a second weight coefficient; k is a number of related words in the related word set.
  • the weight calculation module further includes:
  • a mean value calculation unit configured to: according to the mean value formula, before determining whether the reference score of the i-th related word is greater than W A /n Calculate the mean of the benchmark score among them, The average of the correlations of all related words;
  • a determining unit configured to determine, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
  • An updating unit configured to determine that a reference score S i of the i-th related word is greater than the average score By updating the formula Updating the reference score S i of the i-th related word.
  • searching module specifically includes:
  • a weight associating unit configured to associate, according to a weight association format of the search engine, a weight value occupied by the keyword and the keyword as a first combination, and a weight value occupied by each of the related words Corresponding related words are associated with the second combination;
  • Searching a display unit configured to submit the first combination and the second combination to a search engine, for the search engine to search for a resume from the resume database, and display according to a sorting algorithm built in the search engine The searched resume; wherein the keyword and each of the related words included in the resume are displayed in a highlighted form.
  • the resume search method and apparatus can select different weight calculation algorithms according to the number of related words to calculate the weight of the keywords and related words in the search process, and fully consider the number of related words to the keywords.
  • the weight of the keyword is too large or too small.
  • the weight of the keyword is too large or too small, which makes the process of resume search more reasonable, and reduces the resume search again. The occurrence of the resume search improves the efficiency of the search.
  • FIG. 1 is a schematic flow chart of an embodiment of a resume search method provided by the present invention
  • FIG. 2 is a schematic structural diagram of an embodiment of a resume search device provided by the present invention.
  • FIG. 3 is a schematic structural diagram of an embodiment of a weight calculation module of a resume search device provided by the present invention.
  • FIG. 4 is a schematic structural diagram of an embodiment of a search module of a resume search device provided by the present invention.
  • FIG. 1 is a schematic flowchart diagram of an embodiment of a resume search method provided by the present invention, where the method includes the following steps:
  • S1 Receive a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword;
  • the first mode is selected as the weight calculation mode, based on the selected weight calculation mode, according to Calculating the weight value of each related word and the weight value of the keyword when performing the resume search, and calculating the weight of each related word and the keyword, the specific implementation process is:
  • the first weight formula Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
  • the weight calculation algorithm is used for calculation, which is specifically the implementation process of step S4.
  • the second mode is selected as the weight calculation mode, based on the selected
  • the weight calculation method calculates the weight value of each related word and the weight value of the keyword when performing the resume search according to the correlation degree of each of the related words and the keyword, and the specific implementation The process is:
  • Base score formula Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
  • the purpose of calculating the benchmark score of each related word in the related word set is to base the correlation degree of the related words on the same benchmark, and represent the degree of correlation between the related words and the keywords in the form of a score.
  • the reference value is preferably r max -r min +1; the first weight formula adopts a logarithmic formula, and when the total value of the benchmark score grows too fast, the weight value of the keyword still maintains a moderate growth, and thus, the relevant words can be In a relatively large number of cases, avoid the occurrence of excessive weight of keywords; when calculating the weight of related words, the second weight formula and the third weight formula are used respectively to calculate, considering the relevant words.
  • the weight value of the related word is greater than the weight value of the keyword, or is much smaller than the weight value of the keyword, that is, during the process of searching for the resume, It can avoid the situation that the relevant words are overwhelmed by keywords and the keywords are too prominent; the setting of the second weight coefficient can be determined by the actual situation, and the general value is 2 or 4.
  • step S2 further includes the following steps:
  • the correlation degree of the related words in the related word set is based on the same benchmark, and the benchmark is the same as the above benchmark, and the purpose is to represent the average correlation between the related word set and the keyword in the form of a score.
  • Degree when the degree of correlation between a related word and a keyword (ie, the benchmark score) is greater than the average degree of correlation (average score) between the related word set and the keyword, the degree of correlation between the related word and the keyword may be The difference between the correlation degree of the related words and the average degree of relevance of the keywords is magnified 10 times as a reference score of the degree of relevance of the related words to the keywords; otherwise, the related words are related to the keywords.
  • the original benchmark score of the degree is unchanged. This method can more accurately express the relevance of the related words and keywords, making the relationship between related words and keywords more reasonable.
  • step S3 is:
  • the keyword and each of the related words included in the resume are displayed in a highlighted form.
  • the commonly used search engine is the solr search engine, and the above weight association format can be associated according to the following format: keyword ⁇ keyword weight, related word 1 ⁇ related word 1 weight, related word 2 ⁇ related words 2 weights...
  • the resume search method of the embodiment of the present invention can select different weight calculation algorithms according to the number of related words to calculate the weight of the keywords and related words in the search process, and fully consider the number of related words to search for the keywords.
  • the influence of the process balancing the weights of keywords and related words in the resume search process, avoiding the occurrence of too many related words when the related words are too large, causing the keyword weights to be too large or too small, when the related words are too small, the keywords Weight too large or too small
  • the imbalance situation makes the process of resume search more reasonable, reduces the occurrence of resume search again, and improves the efficiency of resume search.
  • the resume search device can implement the entire flow of the resume search method, and the specific structure thereof is as follows:
  • the receiving module 10 is configured to receive a keyword and a related word set for performing a resume search from a resume database, wherein the related word set records a plurality of related words, and each related word is related to the keyword degree;
  • the weight calculation module 20 is configured to select a weight calculation manner according to the number of related words in the related word set, and calculate, according to the selected weight calculation manner, according to the correlation degree between each of the related words and the keyword The weight value of each related word and the weight value of the keyword when performing a resume search;
  • the search module 30 is configured to submit the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words to a search engine to search from the resume database. resume.
  • the weight calculation module 20 includes:
  • the first calculating unit 21 is configured to: when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, select the first mode as the weight calculation mode, according to the first weight formula Calculating a weight value W i of the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words in the related word set;
  • the weight calculation module 20 further includes:
  • the reference score calculation unit 23 is configured to: when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, select the second mode as the weight calculation mode, according to the reference score formula Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
  • the sum calculation unit 24 is configured to sum the reference scores of all the related words to obtain a reference score total value sum of the related word set;
  • a fourth calculating unit 26 configured to determine whether a reference score of the i-th related word is greater than W A /n; if yes, according to a fourth weight formula Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula Calculating a weight value W i occupied by the i-th related word in the resume search; wherein n is a second weight coefficient; k is the number of related words in the related word set.
  • the weight calculation module 20 further includes:
  • the mean value calculating unit 27 is configured to: according to the average score formula, before determining whether the reference score of the i-th related word is greater than W A /n Calculate the mean of the benchmark score among them, The average of the correlations of all related words in the related word set;
  • the determining unit 28 is configured to determine, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
  • the updating unit 29 is configured to: when determining that the reference score S i of the ith related word is greater than the reference score mean By updating the formula Updating the reference score S i of the i-th related word.
  • FIG. 4 it is a schematic structural diagram of an embodiment of a search module of the resume search device provided by the present invention.
  • the search module 30 specifically includes:
  • the weight associating unit 31 is configured to associate, according to a weight association format of the search engine, a weight value occupied by the keyword and the keyword as a first combination, and a weight of each of the related words The value is associated with its corresponding related word as a second combination;
  • a search display unit 32 configured to submit the first combination and the second combination to a search engine, for the search engine to search for a resume from the resume database, and according to a sorting algorithm built in the search engine
  • the searched resume is displayed; wherein the keyword and each of the related words included in the resume are displayed in a highlighted form.
  • the resume search device provided by the embodiment of the present invention can select different weight calculation algorithms according to the number of related words to calculate the weight of the keywords and related words in the search process, and fully consider the number of related words to search for the keywords.
  • the influence of the process balancing the weights of keywords and related words in the resume search process, avoiding the occurrence of too many related words when the related words are too large, causing the keyword weights to be too large or too small.
  • the keywords are caused.
  • the weight imbalance is too large or too small, making the process of resume search more reasonable, reducing the occurrence of resume search again, improving resume search The efficiency of the cable.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).

Abstract

A method and device for searching resumes, the method comprising: receiving keywords for searching resumes in a resume database and a word set of words relevant to said keywords (S1); on the basis of the number of relevant words in said relevant word set, selecting a weighting calculation mode, then, on the basis of said selected weighting calculation mode and the degree of relevance of each relevant word to the keywords, calculating the weight of each relevant word and the weight of the keywords when performing a resume search (S2); forwarding to a search engine the keywords and the weights thereof, and the relevant words and the weights thereof, so as to retrieve resumes from a resume database (S3). Balancing the weights of keywords and relevant words in the process of searching resumes enhances the efficiency of resume searches.

Description

简历搜索方法和装置Resume search method and device 技术领域Technical field
本发明涉及计算机信息检索领域,尤其涉及一种简历搜索方法和装置。The present invention relates to the field of computer information retrieval, and in particular, to a resume search method and apparatus.
背景技术Background technique
通常在对简历进行搜索匹配过程中,一般是通过关键词+相关词检进行检索的,但是发明人在实施过程中发现该方案存在以下问题:在进行简历搜索时搜索引擎会将关键词和相关词取相同的权重值进行搜索,那么命中的频次决定了搜索出来的简历的排序,则搜索结果极有可能偏向某个频次较高的相关词,另一方面,当相关词的数量大大多于关键词时,则在检索过程中关键词可能会被相关词淹没,难以突出关键词相对于相关词在搜索出简历过程中作为主要作检索关键词的作用。Usually in the process of searching and matching resumes, it is generally searched by keyword + related word test, but the inventor found in the implementation process that the program has the following problems: the search engine will search for keywords and related when searching for resumes. Words use the same weight value to search, then the frequency of hits determines the ranking of the searched resumes, the search results are likely to be biased towards a higher frequency related words, on the other hand, when the number of related words is much more than In the case of keywords, the keywords may be overwhelmed by related words during the retrieval process, and it is difficult to highlight the role of keywords as the main search keywords in the process of searching for resumes.
发明内容Summary of the invention
本发明实施例提出一种简历搜索方法,平衡关键词和相关词在简历搜索过程中所占的权重,提高简历搜索的效率。The embodiment of the invention provides a resume search method, which balances the weights of keywords and related words in the resume search process, and improves the efficiency of resume search.
本发明实施例提出一种简历搜索方法,包括:The embodiment of the invention provides a resume search method, including:
接收用于从简历数据库中进行简历搜索的关键词和相关词集;其中,所述相关词集记载有多个相关词,以及每一个相关词与所述关键词的相关度;Receiving a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword;
根据所述相关词集中的相关词的数量选择权重计算方式,并基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值;Selecting a weight calculation manner according to the number of related words in the related word set, and calculating, according to the selected weight calculation manner, each of the related words and the keyword correlation degree, each of the resume search The weight value of the related words and the weight value of the keyword;
将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历。The keyword, the weight value of the keyword, each related word, and the weight value of each of the related words are submitted to a search engine to search for a resume from the resume database.
进一步地,当所述相关词集中的相关词的数量处于上限词量和下限词量之间时,选择第一方式作为权重计算方式,则基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值,具体为: Further, when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, the first mode is selected as the weight calculation mode, and based on the selected weight calculation mode, according to each of the related The degree of relevance of the word to the keyword, and the weight value of each related word and the weight value of the keyword when calculating the resume search are calculated, specifically:
根据第一权重公式
Figure PCTCN2016113202-appb-000001
计算出所述相关词集中第i个相关词在进行简历搜索时所占的权重值Wi;其中,rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
According to the first weight formula
Figure PCTCN2016113202-appb-000001
Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
根据第二权重公式WA=m×sumW,计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词,m为第一权重系数,sumW为所述相关词集中所有相关词的权重值的总和。Calculating, according to the second weight formula W A =m×sumW, the weight value W A of the keyword when performing the resume search; wherein A is the keyword, m is the first weight coefficient, and sumW is The sum of the weight values of all related words in the relevant word set.
进一步地,当所述相关词集中的相关词的数量不处于所述上限词量和所述下限词量之间时,选择第二方式作为权重计算方式,则基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值,具体为:Further, when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, the second mode is selected as the weight calculation mode, based on the selected weight calculation manner, according to Calculating the weight value of each related word and the weight value of the keyword when performing the resume search, the specificity of each related word and the keyword, specifically:
根据基准分值公式
Figure PCTCN2016113202-appb-000002
计算出所述相关词集中第i个相关词的基准分值Si;其中,ri为所述相关词集中的第i个相关词的相关度;rmin为在所述相关词集中所记载的所有相关词的相关度的最小值;rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
Base score formula
Figure PCTCN2016113202-appb-000002
Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
对所述所有相关词的基准分值进行求和,获得基准分值总值sum;Calculating the benchmark scores of all the related words to obtain a total score of the benchmark scores sum;
根据第三权重公式WA=5+log1.5(sum+1),计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词;Calculating, according to the third weight formula W A =5+log 1.5 (sum+1), a weight value W A of the keyword when performing a resume search; wherein A is the keyword;
判断所述第i个相关词的基准分值是否大于WA/n;若是,则根据第四权重公式
Figure PCTCN2016113202-appb-000003
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;若否,则根据第五权重公式
Figure PCTCN2016113202-appb-000004
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;其中,n为第二权重系数;k为所述相关词集中的相关词的数量。
Determining whether the reference score of the i-th related word is greater than W A /n; if yes, according to the fourth weight formula
Figure PCTCN2016113202-appb-000003
Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula
Figure PCTCN2016113202-appb-000004
Calculating a weight value W i of the i-th related word when performing a resume search; wherein n is a second weight coefficient; k is a number of related words in the related word set.
再进一步地,在所述判断所述第i个相关词的基准分值是否大于WA/n之前,还包括:Further, before the determining whether the reference score of the i-th related word is greater than W A /n, the method further includes:
根据均值公式
Figure PCTCN2016113202-appb-000005
计算基准分值均值
Figure PCTCN2016113202-appb-000006
其中,
Figure PCTCN2016113202-appb-000007
为所述所有相关词的相关度的平均值;
According to the mean formula
Figure PCTCN2016113202-appb-000005
Calculate the mean of the benchmark score
Figure PCTCN2016113202-appb-000006
among them,
Figure PCTCN2016113202-appb-000007
The average of the correlations of all related words;
对于所述相关词集中的第i个相关词,判断所述第i个相关词的基准分值Si是否大于所述基准分值均值
Figure PCTCN2016113202-appb-000008
Determining, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
Figure PCTCN2016113202-appb-000008
若是,通过更新公式
Figure PCTCN2016113202-appb-000009
更新所述第i个相关词的基准分值Si
If yes, by updating the formula
Figure PCTCN2016113202-appb-000009
Updating the reference score S i of the i-th related word.
更进一步地,所述将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词所占的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历,具体为:Further, the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words are submitted to a search engine to be used from the resume database. Search for a resume, specifically:
根据搜索引擎的权重关联格式,将所述关键词所占的权重值和所述关键词相关联作为第一组合,以及将所述每一个相关词所占的权重值与其对应的相关词相关联作为第二组合;Correlating a weight value occupied by the keyword with the keyword as a first combination according to a weight association format of a search engine, and associating a weight value of each of the related words with a corresponding related word thereof As a second combination;
将所述第一组合和所述第二组合提交给搜索引擎,以供所述搜索引擎从所述简历数据库中搜索出简历,并根据所述搜索引擎内置的排序算法显示搜索出的简历;其中,所述简历中包含的所述关键词和所述每一个相关词以高亮形式显示。Submitting the first combination and the second combination to a search engine for the search engine to search for a resume from the resume database, and displaying the searched resume according to a sorting algorithm built into the search engine; The keyword and each of the related words included in the resume are displayed in a highlighted form.
相应地,本发明实施例还提供一种简历搜索装置,包括:Correspondingly, an embodiment of the present invention further provides a resume search device, including:
接收模块,用于接收用于从简历数据库中进行简历搜索的关键词和相关词集;其中,所述相关词集记载有多个相关词,以及每一个相关词与所述关键词的相关度;a receiving module, configured to receive a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword ;
权重计算模块,用于根据所述相关词集中的相关词的数量选择权重计算方式,并基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值;a weight calculation module, configured to select a weight calculation manner according to the number of related words in the related word set, and calculate, according to the selected weight calculation manner, based on the correlation degree between each of the related words and the keyword, The weight value of each related word in the resume search and the weight value of the keyword;
搜索模块,用于将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历。a search module, configured to submit the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words to a search engine to search for a resume from the resume database .
进一步地,所述权重计算模块包括:Further, the weight calculation module includes:
第一计算单元,用于当所述相关词集中的相关词的数量处于上限词量和下限词量之间时,选择第一方式作为权重计算方式,根据第一权重公式
Figure PCTCN2016113202-appb-000010
计算出所述相关词集中第i个相关词在进行简历搜索时所占的权重值Wi;其中,rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
a first calculating unit, configured to: when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, select the first mode as the weight calculation mode, according to the first weight formula
Figure PCTCN2016113202-appb-000010
Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
第二计算单元,用于根据第二权重公式WA=m×sumW,计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词,m为第一权重系数,sumW为所述相关词集中所有相关词的权重值的总和。a second calculating unit, configured to calculate, according to the second weight formula W A =m×sumW, a weight value W A of the keyword when performing a resume search; wherein A is the keyword, m is the first A weighting coefficient, sumW is the sum of the weighting values of all related words in the related word set.
进一步地,所述权重计算模块还包括:Further, the weight calculation module further includes:
基准分值计算单元,用于当所述相关词集中的相关词的数量不处于所述上限词量和所述下限词量之间时,选择第二方式作为权重计算方式,根据基准分值公式
Figure PCTCN2016113202-appb-000011
计算出所述相关词集中第i个相关词的基准分值Si;其中,ri为所述相关词集中的第i个相关词的相关度;rmin为在所述相关词集中所记载的所有相关词的相关度的最小值;rmax为在所述相 关词集中所记载的所有相关词的相关度的最大值;
a reference score calculation unit, configured to: when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, select the second mode as the weight calculation mode, according to the reference score formula
Figure PCTCN2016113202-appb-000011
Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the correlation of all related words recorded in the related word set;
求和计算单元,用于对所述所有相关词的基准分值进行求和,获得基准分值总值sum;a sum calculation unit for summing the reference scores of all the related words to obtain a total score of the reference scores sum;
第三计算单元,用于根据第三权重公式WA=5+log1.5(sum+1),计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词;a third calculating unit, configured to calculate, according to the third weight formula W A =5+log 1.5 (sum+1), a weight value W A of the keyword when performing a resume search; wherein A is the Key words;
第四计算单元,用于判断所述第i个相关词的基准分值是否大于WA/n;若是,则根据第四权重公式
Figure PCTCN2016113202-appb-000012
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;若否,则根据第五权重公式
Figure PCTCN2016113202-appb-000013
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;其中,n为第二权重系数;k为所述相关词集中的相关词的数量。
a fourth calculating unit, configured to determine whether a reference score of the i-th related word is greater than W A /n; if yes, according to a fourth weight formula
Figure PCTCN2016113202-appb-000012
Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula
Figure PCTCN2016113202-appb-000013
Calculating a weight value W i of the i-th related word when performing a resume search; wherein n is a second weight coefficient; k is a number of related words in the related word set.
再进一步地,所述权重计算模块还包括:Further, the weight calculation module further includes:
均值计算单元,用于在所述判断所述第i个相关词的基准分值是否大于WA/n之前,根据均值公式
Figure PCTCN2016113202-appb-000014
计算基准分值均值
Figure PCTCN2016113202-appb-000015
其中,
Figure PCTCN2016113202-appb-000016
为所述所有相关词的相关度的平均值;
a mean value calculation unit, configured to: according to the mean value formula, before determining whether the reference score of the i-th related word is greater than W A /n
Figure PCTCN2016113202-appb-000014
Calculate the mean of the benchmark score
Figure PCTCN2016113202-appb-000015
among them,
Figure PCTCN2016113202-appb-000016
The average of the correlations of all related words;
判断单元,用于对于所述相关词集中的第i个相关词,判断所述第i个相关词的基准分值Si是否大于所述基准分值均值
Figure PCTCN2016113202-appb-000017
a determining unit, configured to determine, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
Figure PCTCN2016113202-appb-000017
更新单元,用于当判断所述第i个相关词的基准分值Si是大于所述平均分值
Figure PCTCN2016113202-appb-000018
时,通过更新公式
Figure PCTCN2016113202-appb-000019
更新所述第i个相关词的基准分值Si
An updating unit, configured to determine that a reference score S i of the i-th related word is greater than the average score
Figure PCTCN2016113202-appb-000018
By updating the formula
Figure PCTCN2016113202-appb-000019
Updating the reference score S i of the i-th related word.
更进一步地,所述搜索模块具体包括:Further, the searching module specifically includes:
权重相联单元,用于根据搜索引擎的权重关联格式,将所述关键词所占的权重值和所述关键词相关联作为第一组合,以及将所述每一个相关词所占的权重值与其对应的相关词相关联作为第二组合;a weight associating unit, configured to associate, according to a weight association format of the search engine, a weight value occupied by the keyword and the keyword as a first combination, and a weight value occupied by each of the related words Corresponding related words are associated with the second combination;
搜索显示单元,用于将所述第一组合和所述第二组合提交给搜索引擎,以供所述搜索引擎从所述简历数据库中搜索出简历,并根据所述搜索引擎内置的排序算法显示搜索出的简历;其中,所述简历中包含的所述关键词和所述每一个相关词以高亮形式显示。Searching a display unit, configured to submit the first combination and the second combination to a search engine, for the search engine to search for a resume from the resume database, and display according to a sorting algorithm built in the search engine The searched resume; wherein the keyword and each of the related words included in the resume are displayed in a highlighted form.
实施本发明实施例,具有如下有益效果:Embodiments of the present invention have the following beneficial effects:
本发明实施例提供的简历搜索方法和装置,能根据相关词的数量选取不同的权重计算算法,来进行计算关键词和相关词在搜索过程所占的权重,充分考虑相关词的数量对关键词在检索过程的影响,平衡关键词和相关词在简历搜索过程中所占的权重,避免在权重计算过程 中出现当相关词过多时导致关键词权重过大或过小,当相关词过少时关键词的权重过大或过小的权重失衡情况,使得简历检索的过程更为合理,减少再次进行简历搜索的发生,提高简历搜索的效率。The resume search method and apparatus provided by the embodiments of the present invention can select different weight calculation algorithms according to the number of related words to calculate the weight of the keywords and related words in the search process, and fully consider the number of related words to the keywords. In the impact of the retrieval process, balance the weights of keywords and related words in the resume search process, avoiding the weight calculation process When there are too many related words, the weight of the keyword is too large or too small. When the number of related words is too small, the weight of the keyword is too large or too small, which makes the process of resume search more reasonable, and reduces the resume search again. The occurrence of the resume search improves the efficiency of the search.
附图说明DRAWINGS
图1是本发明提供的简历搜索方法的一个实施例的流程示意图;1 is a schematic flow chart of an embodiment of a resume search method provided by the present invention;
图2是本发明提供的简历搜索装置的一个实施例的结构示意图;2 is a schematic structural diagram of an embodiment of a resume search device provided by the present invention;
图3是本发明提供的简历搜索装置的权重计算模块的一个实施例的结构示意图;3 is a schematic structural diagram of an embodiment of a weight calculation module of a resume search device provided by the present invention;
图4是本发明提供的简历搜索装置的搜索模块的一个实施例的结构示意图。4 is a schematic structural diagram of an embodiment of a search module of a resume search device provided by the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, but not all embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
参见图1,是本发明提供的简历搜索方法的一个实施例的流程示意图,该方法包括以下步骤:FIG. 1 is a schematic flowchart diagram of an embodiment of a resume search method provided by the present invention, where the method includes the following steps:
S1,接收用于从简历数据库中进行简历搜索的关键词和相关词集;其中,所述相关词集记载有多个相关词,以及每一个相关词与所述关键词的相关度;S1. Receive a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword;
S2,根据所述相关词集中的相关词的数量选择权重计算方式,并基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值;S2, selecting a weight calculation manner according to the number of related words in the related word set, and calculating, according to the selected weight calculation manner, based on the correlation degree between each of the related words and the keyword, calculating a resume search The weight value of each related word and the weight value of the keyword;
S3,将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历。S3. Submit the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words to a search engine to search for a resume from the resume database.
进一步地,上述步骤S2中,当所述相关词集中的相关词的数量处于上限词量和下限词量之间时,选择第一方式作为权重计算方式,则基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值,具体的实施过程为:Further, in the above step S2, when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, the first mode is selected as the weight calculation mode, based on the selected weight calculation mode, according to Calculating the weight value of each related word and the weight value of the keyword when performing the resume search, and calculating the weight of each related word and the keyword, the specific implementation process is:
根据第一权重公式
Figure PCTCN2016113202-appb-000020
计算出所述相关词集中第i个相关词在进行简历搜索时所占的权重值Wi;其中,rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
According to the first weight formula
Figure PCTCN2016113202-appb-000020
Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
根据第二权重公式WA=m×sumW,计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词,m为第一权重系数,sumW为所述相关词集中所有相关词的权重值的总和。Calculating, according to the second weight formula W A =m×sumW, the weight value W A of the keyword when performing the resume search; wherein A is the keyword, m is the first weight coefficient, and sumW is The sum of the weight values of all related words in the relevant word set.
需要说明的是,采用上述权重计算算法,进行计算关键词和相关词的权重值,计算过程简单,能快速计算出关键词和相关词的权重值,但本权重计算算法仅适合相关词的数量适当时,才能够即能快速计算出权重值,并能确保关键词所占的权重值处于恰当的区间内。而当相关词的数量过多或过小时,即不在上述上限词量和下限词量的范围之内,采用第二权重计算算法进行计算,具体为步骤S4的实施过程。It should be noted that, by using the above weight calculation algorithm, the weight values of the keywords and related words are calculated, the calculation process is simple, and the weight values of the keywords and related words can be quickly calculated, but the weight calculation algorithm is only suitable for the number of related words. When appropriate, the weight value can be quickly calculated and the weight value of the keyword can be guaranteed to be in the appropriate interval. When the number of related words is too much or too small, that is, not within the range of the upper limit quantity and the lower limit word quantity, the second weight calculation algorithm is used for calculation, which is specifically the implementation process of step S4.
进一步地,上述步骤S2中,当所述相关词集中的相关词的数量不处于所述上限词量和所述下限词量之间时,选择第二方式作为权重计算方式,则基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值,具体的实施过程为:Further, in the above step S2, when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, the second mode is selected as the weight calculation mode, based on the selected The weight calculation method calculates the weight value of each related word and the weight value of the keyword when performing the resume search according to the correlation degree of each of the related words and the keyword, and the specific implementation The process is:
根据基准分值公式
Figure PCTCN2016113202-appb-000021
计算出所述相关词集中第i个相关词的基准分值Si;其中,ri为所述相关词集中的第i个相关词的相关度;rmin为在所述相关词集中所记载的所有相关词的相关度的最小值;rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
Base score formula
Figure PCTCN2016113202-appb-000021
Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
对所述所有相关词的基准分值进行求和,获得基准分值总值sum;Calculating the benchmark scores of all the related words to obtain a total score of the benchmark scores sum;
根据第三权重公式WA=5+log1.5(sum+1),计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词;Calculating, according to the third weight formula W A =5+log 1.5 (sum+1), a weight value W A of the keyword when performing a resume search; wherein A is the keyword;
判断所述第i个相关词的基准分值是否大于WA/n;若是,则根据第四权重公式
Figure PCTCN2016113202-appb-000022
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;若否,则根据第五权重公式
Figure PCTCN2016113202-appb-000023
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;其中,n为第二权重系数;k为所述相关词集中的相关词的数量。
Determining whether the reference score of the i-th related word is greater than W A /n; if yes, according to the fourth weight formula
Figure PCTCN2016113202-appb-000022
Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula
Figure PCTCN2016113202-appb-000023
Calculating a weight value W i of the i-th related word when performing a resume search; wherein n is a second weight coefficient; k is a number of related words in the related word set.
需要说明的是,计算相关词集中每个相关词的基准分值的目的是将相关词的相关度基于同一个基准,以一个分值的形式,表示该相关词与关键词的相关程度,该基准值优选为rmax-rmin+1;第一权重公式采用对数形式的公式,在基准分值总值增长过快时,关键词的权重值仍保持缓和增长,因而,可以在相关词相对较多的情况下,避免关键词所占权重过大这种情况的出现;在进行相关词的权重计算时,分别采用第二权重公式和第三权重公式进行计 算,是考虑到当相关词的基准分值过大和过小这两种情况出现时,避免计算出该相关词的权重值大于关键词的权重值,或者是大大小于关键词的权重值,也就是说在检索简历过程中,能避免相关词过多把关键词淹没、以及出现关键词过于突出的情况;对于第二权重系数的设置,可由实际情况来决定,一般取值为2或4.It should be noted that the purpose of calculating the benchmark score of each related word in the related word set is to base the correlation degree of the related words on the same benchmark, and represent the degree of correlation between the related words and the keywords in the form of a score. The reference value is preferably r max -r min +1; the first weight formula adopts a logarithmic formula, and when the total value of the benchmark score grows too fast, the weight value of the keyword still maintains a moderate growth, and thus, the relevant words can be In a relatively large number of cases, avoid the occurrence of excessive weight of keywords; when calculating the weight of related words, the second weight formula and the third weight formula are used respectively to calculate, considering the relevant words. When the two benchmarks of the benchmark scores are too large or too small, it is avoided that the weight value of the related word is greater than the weight value of the keyword, or is much smaller than the weight value of the keyword, that is, during the process of searching for the resume, It can avoid the situation that the relevant words are overwhelmed by keywords and the keywords are too prominent; the setting of the second weight coefficient can be determined by the actual situation, and the general value is 2 or 4.
再进一步地,上述步骤S2的具体实施过程还包括以下步骤:Further, the specific implementation process of the foregoing step S2 further includes the following steps:
在所述判断所述第i个相关词的基准分值是否大于WA/n之前,根据均值公式
Figure PCTCN2016113202-appb-000024
计算基准分值均值
Figure PCTCN2016113202-appb-000025
其中,
Figure PCTCN2016113202-appb-000026
为所述所有相关词的相关度的平均值;
Before the determining whether the reference score of the i-th related word is greater than W A /n, according to the mean value formula
Figure PCTCN2016113202-appb-000024
Calculate the mean of the benchmark score
Figure PCTCN2016113202-appb-000025
among them,
Figure PCTCN2016113202-appb-000026
The average of the correlations of all related words;
对于所述相关词集中的第i个相关词,判断所述第i个相关词的基准分值Si是否大于所述基准分值均值
Figure PCTCN2016113202-appb-000027
Determining, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
Figure PCTCN2016113202-appb-000027
若是,通过更新公式
Figure PCTCN2016113202-appb-000028
更新所述第i个相关词的基准分值Si
If yes, by updating the formula
Figure PCTCN2016113202-appb-000028
Updating the reference score S i of the i-th related word.
需要说明的是,将该相关词集中的相关词的相关度均值基于同一个基准,且该基准与上述基准相同,目的是以一个分值的形式,表示该相关词集与关键词的平均相关程度;当一个相关词与关键词的相关程度(即基准分值)大于该相关词集与关键词的平均相关程度(平均分值)时,可将该相关词与关键词的相关程度,与该相关词集与关键词的平均相关程度这两个相关程度的差异值放大10倍,作为该相关词与关键词的相关程度的基准分值;反之,则保持该相关词与关键词的相关程度的原基准分值不变,这种方式更能确切地表达该相关词与关键词的相关程度,使得相关词与关键词之间的关系更为合理。It should be noted that the correlation degree of the related words in the related word set is based on the same benchmark, and the benchmark is the same as the above benchmark, and the purpose is to represent the average correlation between the related word set and the keyword in the form of a score. Degree; when the degree of correlation between a related word and a keyword (ie, the benchmark score) is greater than the average degree of correlation (average score) between the related word set and the keyword, the degree of correlation between the related word and the keyword may be The difference between the correlation degree of the related words and the average degree of relevance of the keywords is magnified 10 times as a reference score of the degree of relevance of the related words to the keywords; otherwise, the related words are related to the keywords. The original benchmark score of the degree is unchanged. This method can more accurately express the relevance of the related words and keywords, making the relationship between related words and keywords more reasonable.
更进一步地,上述步骤S3的具体实施过程为:Further, the specific implementation process of the above step S3 is:
根据搜索引擎的权重关联格式,将所述关键词所占的权重值和所述关键词相关联作为第一组合,以及将所述每一个相关词所占的权重值与其对应的相关词相关联作为第二组合;Correlating a weight value occupied by the keyword with the keyword as a first combination according to a weight association format of a search engine, and associating a weight value of each of the related words with a corresponding related word thereof As a second combination;
将所述第一组合和所述第二组合提交给搜索引擎,以供所述搜索引擎从所述简历数据库中搜索出简历,并根据所述搜索引擎内置的排序算法显示搜索出的简历;其中,所述简历中包含的所述关键词和所述每一个相关词以高亮形式显示。Submitting the first combination and the second combination to a search engine for the search engine to search for a resume from the resume database, and displaying the searched resume according to a sorting algorithm built into the search engine; The keyword and each of the related words included in the resume are displayed in a highlighted form.
需要说明的是,一般采用的搜索引擎为solr搜索引擎,则上述权重关联格式,可依据如下格式进行关联:关键词^关键词权重、相关词1^相关词1权重、相关词2^相关词2权重......It should be noted that the commonly used search engine is the solr search engine, and the above weight association format can be associated according to the following format: keyword ^ keyword weight, related word 1^ related word 1 weight, related word 2^ related words 2 weights...
实施本发明实施例的简历搜索方法,能根据相关词的数量选取不同的权重计算算法,来进行计算关键词和相关词在搜索过程所占的权重,充分考虑相关词的数量对关键词在检索过程的影响,平衡关键词和相关词在简历搜索过程中所占的权重,避免在权重计算过程中出现当相关词过多时导致关键词权重过大或过小,当相关词过少时关键词的权重过大或过小的权 重失衡情况,使简历检索的过程更为合理,减少再次进行简历搜索的发生,提高简历搜索的效率。The resume search method of the embodiment of the present invention can select different weight calculation algorithms according to the number of related words to calculate the weight of the keywords and related words in the search process, and fully consider the number of related words to search for the keywords. The influence of the process, balancing the weights of keywords and related words in the resume search process, avoiding the occurrence of too many related words when the related words are too large, causing the keyword weights to be too large or too small, when the related words are too small, the keywords Weight too large or too small The imbalance situation makes the process of resume search more reasonable, reduces the occurrence of resume search again, and improves the efficiency of resume search.
参见图2,是本发明提供的简历搜索装置的一个实施例的结构示意图,该简历搜索装置能实施上述简历搜索方法的全部流程,其具体结构如下:2 is a schematic structural diagram of an embodiment of a resume search device provided by the present invention. The resume search device can implement the entire flow of the resume search method, and the specific structure thereof is as follows:
接收模块10,用于接收用于从简历数据库中进行简历搜索的关键词和相关词集;其中,所述相关词集记载有多个相关词,以及每一个相关词与所述关键词的相关度;The receiving module 10 is configured to receive a keyword and a related word set for performing a resume search from a resume database, wherein the related word set records a plurality of related words, and each related word is related to the keyword degree;
权重计算模块20,用于根据所述相关词集中的相关词的数量选择权重计算方式,并基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值;The weight calculation module 20 is configured to select a weight calculation manner according to the number of related words in the related word set, and calculate, according to the selected weight calculation manner, according to the correlation degree between each of the related words and the keyword The weight value of each related word and the weight value of the keyword when performing a resume search;
搜索模块30,用于将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历。The search module 30 is configured to submit the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words to a search engine to search from the resume database. resume.
进一步地,参见图3,是本发明提供的简历搜索装置的权重计算模块的一个实施例的结构示意图;该权重计算模块20包括:Further, referring to FIG. 3, which is a schematic structural diagram of an embodiment of a weight calculation module of a resume search device provided by the present invention; the weight calculation module 20 includes:
第一计算单元21,用于当所述相关词集中的相关词的数量处于上限词量和下限词量之间时,选择第一方式作为权重计算方式,根据第一权重公式
Figure PCTCN2016113202-appb-000029
计算出所述相关词集中第i个相关词在进行简历搜索时所占的权重值Wi;其中,rmax为在所述相关词集中的所有相关词的相关度的最大值;
The first calculating unit 21 is configured to: when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, select the first mode as the weight calculation mode, according to the first weight formula
Figure PCTCN2016113202-appb-000029
Calculating a weight value W i of the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words in the related word set;
第二计算单元22,用于根据第二权重公式WA=m×sumW,计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词,m为第一权重系数,sumW为所述相关词集中所有相关词的权重值的总和。a second calculating unit 22, configured to calculate, according to the second weight formula W A =m×sumW, a weight value W A occupied by the keyword when performing a resume search; wherein A is the keyword, and m is The first weight coefficient, sumW, is the sum of the weight values of all related words in the related word set.
进一步地,如图3所示,上述权重计算模块20还包括:Further, as shown in FIG. 3, the weight calculation module 20 further includes:
基准分值计算单元23,用于当所述相关词集中的相关词的数量不处于所述上限词量和所述下限词量之间时,选择第二方式作为权重计算方式,根据基准分值公式
Figure PCTCN2016113202-appb-000030
计算出所述相关词集中第i个相关词的基准分值Si;其中,ri为所述相关词集中的第i个相关词的相关度;rmin为在所述相关词集中所记载的所有相关词的相关度的最小值;rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
The reference score calculation unit 23 is configured to: when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, select the second mode as the weight calculation mode, according to the reference score formula
Figure PCTCN2016113202-appb-000030
Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
求和计算单元24,用于对所述所有相关词的基准分值进行求和,获得所述相关词集的基准分值总值sum; The sum calculation unit 24 is configured to sum the reference scores of all the related words to obtain a reference score total value sum of the related word set;
第三计算单元25,用于根据第三权重公式WA=5+log1.5(sum+1),计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词;a third calculating unit 25, configured to calculate, according to the third weight formula W A =5+log 1.5 (sum+1), a weight value W A of the keyword when performing a resume search; wherein, A is Key words
第四计算单元26,用于判断所述第i个相关词的基准分值是否大于WA/n;若是,则根据第四权重公式
Figure PCTCN2016113202-appb-000031
计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;若否,则根据第五权重公式
Figure PCTCN2016113202-appb-000032
计算出所述第i个相关词词在进行简历搜索时所占的权重值Wi;其中,n为第二权重系数;k为所述相关词集中的相关词的数量。
a fourth calculating unit 26, configured to determine whether a reference score of the i-th related word is greater than W A /n; if yes, according to a fourth weight formula
Figure PCTCN2016113202-appb-000031
Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula
Figure PCTCN2016113202-appb-000032
Calculating a weight value W i occupied by the i-th related word in the resume search; wherein n is a second weight coefficient; k is the number of related words in the related word set.
再进一步地,如图3所示,上述权重计算模块20还包括:Further, as shown in FIG. 3, the weight calculation module 20 further includes:
均值计算单元27,用于在所述判断所述第i个相关词的基准分值是否大于WA/n之前,根据平均分值公式
Figure PCTCN2016113202-appb-000033
计算基准分值均值其中,
Figure PCTCN2016113202-appb-000035
为所述相关词集中的所有相关词的相关度的平均值;
The mean value calculating unit 27 is configured to: according to the average score formula, before determining whether the reference score of the i-th related word is greater than W A /n
Figure PCTCN2016113202-appb-000033
Calculate the mean of the benchmark score among them,
Figure PCTCN2016113202-appb-000035
The average of the correlations of all related words in the related word set;
判断单元28,用于对于所述相关词集中的第i个相关词,判断所述第i个相关词的基准分值Si是否大于所述基准分值均值
Figure PCTCN2016113202-appb-000036
The determining unit 28 is configured to determine, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
Figure PCTCN2016113202-appb-000036
更新单元29,用于当判断所述第i个相关词的基准分值Si是大于所述基准分值均值
Figure PCTCN2016113202-appb-000037
时,通过更新公式
Figure PCTCN2016113202-appb-000038
更新所述第i个相关词的基准分值Si
The updating unit 29 is configured to: when determining that the reference score S i of the ith related word is greater than the reference score mean
Figure PCTCN2016113202-appb-000037
By updating the formula
Figure PCTCN2016113202-appb-000038
Updating the reference score S i of the i-th related word.
更进一步地,参见图4,是本发明提供的简历搜索装置的搜索模块的一个实施例的结构示意图;该搜索模块30具体包括:Further, referring to FIG. 4, it is a schematic structural diagram of an embodiment of a search module of the resume search device provided by the present invention; the search module 30 specifically includes:
权重相联单元31,用于根据搜索引擎的权重关联格式,将所述关键词所占的权重值和所述关键词相关联作为第一组合,以及将所述每一个相关词所占的权重值与其对应的相关词相关联作为第二组合;The weight associating unit 31 is configured to associate, according to a weight association format of the search engine, a weight value occupied by the keyword and the keyword as a first combination, and a weight of each of the related words The value is associated with its corresponding related word as a second combination;
搜索显示单元32,用于将所述第一组合和所述第二组合提交给搜索引擎,以供所述搜索引擎从所述简历数据库中搜索出简历,并根据所述搜索引擎内置的排序算法显示搜索出的简历;其中,所述简历中包含的所述关键词和所述每一个相关词以高亮形式显示。a search display unit 32, configured to submit the first combination and the second combination to a search engine, for the search engine to search for a resume from the resume database, and according to a sorting algorithm built in the search engine The searched resume is displayed; wherein the keyword and each of the related words included in the resume are displayed in a highlighted form.
本发明实施例提供的简历搜索装置,能根据相关词的数量选取不同的权重计算算法,来进行计算关键词和相关词在搜索过程所占的权重,充分考虑相关词的数量对关键词在检索过程的影响,平衡关键词和相关词在简历搜索过程中所占的权重,避免在权重计算过程中出现当相关词过多时导致关键词权重过大或过小,当相关词过少时导致关键词的权重过大或过小的权重失衡情况,使简历检索的过程更为合理,减少再次进行简历搜索的发生,提高简历搜 索的效率。The resume search device provided by the embodiment of the present invention can select different weight calculation algorithms according to the number of related words to calculate the weight of the keywords and related words in the search process, and fully consider the number of related words to search for the keywords. The influence of the process, balancing the weights of keywords and related words in the resume search process, avoiding the occurrence of too many related words when the related words are too large, causing the keyword weights to be too large or too small. When the related words are too small, the keywords are caused. The weight imbalance is too large or too small, making the process of resume search more reasonable, reducing the occurrence of resume search again, improving resume search The efficiency of the cable.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(Read-Only Memory,ROM)或随机存储记忆体(Random Access Memory,RAM)等。One of ordinary skill in the art can understand that all or part of the process of implementing the foregoing embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a computer readable storage medium. When executed, the flow of an embodiment of the methods as described above may be included. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也视为本发明的保护范围。 The above is a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It is the scope of protection of the present invention.

Claims (10)

  1. 一种简历搜索方法,其特征在于,包括:A resume search method, characterized in that it comprises:
    接收用于从简历数据库中进行简历搜索的关键词和相关词集;其中,所述相关词集记载有多个相关词,以及每一个相关词与所述关键词的相关度;Receiving a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword;
    根据所述相关词集中的相关词的数量选择权重计算方式,并基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值;Selecting a weight calculation manner according to the number of related words in the related word set, and calculating, according to the selected weight calculation manner, each of the related words and the keyword correlation degree, each of the resume search The weight value of the related words and the weight value of the keyword;
    将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历。The keyword, the weight value of the keyword, each related word, and the weight value of each of the related words are submitted to a search engine to search for a resume from the resume database.
  2. 如权利要求1所述的简历搜索方法,其特征在于,当所述相关词集中的相关词的数量处于上限词量和下限词量之间时,选择第一方式作为权重计算方式,则基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值,具体为:The resume search method according to claim 1, wherein when the number of related words in the related word set is between the upper limit word amount and the lower limit word amount, the first mode is selected as the weight calculation method, The weight calculation method is selected, and according to the correlation degree of each of the related words and the keyword, the weight value of each related word and the weight value of the keyword when calculating the resume search are calculated, specifically for:
    根据第一权重公式
    Figure PCTCN2016113202-appb-100001
    计算出所述相关词集中第i个相关词在进行简历搜索时所占的权重值Wi;其中,rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
    According to the first weight formula
    Figure PCTCN2016113202-appb-100001
    Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
    根据第二权重公式WA=m×sum W,计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词,m为第一权重系数,sum W为所述相关词集中所有相关词的权重值的总和。Calculating, according to the second weight formula W A =m×sum W, the weight value W A of the keyword when performing the resume search; wherein A is the keyword, m is the first weight coefficient, and sum W The sum of the weight values of all related words in the related word set.
  3. 如权利要求1或2所述简历搜索方法,其特征在于,当所述相关词集中的相关词的数量不处于所述上限词量和所述下限词量之间时,选择第二方式作为权重计算方式,则基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值,具体为:The resume search method according to claim 1 or 2, wherein when the number of related words in the related word set is not between the upper limit word amount and the lower limit word amount, the second mode is selected as the weight The calculation method is based on the selected weight calculation manner, and based on the correlation degree of each of the related words and the keyword, the weight value of each related word and the keyword is calculated when performing the resume search. The weight value, which is:
    根据基准分值公式
    Figure PCTCN2016113202-appb-100002
    计算出所述相关词集中第i个相关词的基准分值Si;其中,ri为所述相关词集中的第i个相关词的相关度;rmin为在所述相关词集中所记载的所有相关词的相关度的最小值;rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
    Base score formula
    Figure PCTCN2016113202-appb-100002
    Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
    对所述所有相关词的基准分值进行求和,获得基准分值总值sum; Calculating the benchmark scores of all the related words to obtain a total score of the benchmark scores sum;
    根据第三权重公式WA=5+log1.5(sum+1),计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词;Calculating, according to the third weight formula W A =5+log 1.5 (sum+1), a weight value W A of the keyword when performing a resume search; wherein A is the keyword;
    判断所述第i个相关词的基准分值是否大于WA/n;若是,则根据第四权重公式
    Figure PCTCN2016113202-appb-100003
    计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;若否,则根据第五权重公式
    Figure PCTCN2016113202-appb-100004
    计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;其中,n为第二权重系数;k为所述相关词集中的相关词的数量。
    Determining whether the reference score of the i-th related word is greater than W A /n; if yes, according to the fourth weight formula
    Figure PCTCN2016113202-appb-100003
    Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula
    Figure PCTCN2016113202-appb-100004
    Calculating a weight value W i of the i-th related word when performing a resume search; wherein n is a second weight coefficient; k is a number of related words in the related word set.
  4. 如权利要求3所述的简历搜索方法,其特征在于,在所述判断所述第i个相关词的基准分值是否大于WA/n之前,还包括:The resume search method according to claim 3, further comprising: before determining whether the reference score of the i-th related word is greater than W A /n,
    根据均值公式
    Figure PCTCN2016113202-appb-100005
    计算基准分值均值
    Figure PCTCN2016113202-appb-100006
    其中,
    Figure PCTCN2016113202-appb-100007
    为所述所有相关词的相关度的平均值;
    According to the mean formula
    Figure PCTCN2016113202-appb-100005
    Calculate the mean of the benchmark score
    Figure PCTCN2016113202-appb-100006
    among them,
    Figure PCTCN2016113202-appb-100007
    The average of the correlations of all related words;
    对于所述相关词集中的第i个相关词,判断所述第i个相关词的基准分值Si是否大于所述基准分值均值
    Figure PCTCN2016113202-appb-100008
    Determining, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
    Figure PCTCN2016113202-appb-100008
    若是,通过更新公式
    Figure PCTCN2016113202-appb-100009
    更新所述第i个相关词的基准分值Si
    If yes, by updating the formula
    Figure PCTCN2016113202-appb-100009
    Updating the reference score S i of the i-th related word.
  5. 如权利要求1所述的简历搜索的方法,其特征在于,所述将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词所占的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历,具体为:The method of searching for a resume according to claim 1, wherein said keyword, said weight value of said keyword, each related word, and a weight value of said each related word Submitted to the search engine to search for a resume from the resume database, specifically:
    根据搜索引擎的权重关联格式,将所述关键词所占的权重值和所述关键词相关联作为第一组合,以及将所述每一个相关词所占的权重值与其对应的相关词相关联作为第二组合;Correlating a weight value occupied by the keyword with the keyword as a first combination according to a weight association format of a search engine, and associating a weight value of each of the related words with a corresponding related word thereof As a second combination;
    将所述第一组合和所述第二组合提交给搜索引擎,以供所述搜索引擎从所述简历数据库中搜索出简历,并根据所述搜索引擎内置的排序算法显示搜索出的简历;其中,所述简历中包含的所述关键词和所述每一个相关词以高亮形式显示。Submitting the first combination and the second combination to a search engine for the search engine to search for a resume from the resume database, and displaying the searched resume according to a sorting algorithm built into the search engine; The keyword and each of the related words included in the resume are displayed in a highlighted form.
  6. 一种简历搜索装置,其特征在于,包括:A resume search device, comprising:
    接收模块,用于接收用于从简历数据库中进行简历搜索的关键词和相关词集;其中,所述相关词集记载有多个相关词,以及每一个相关词与所述关键词的相关度; a receiving module, configured to receive a keyword and a related word set for performing a resume search from a resume database; wherein the related word set records a plurality of related words, and a relevance of each related word to the keyword ;
    权重计算模块,用于根据所述相关词集中的相关词的数量选择权重计算方式,并基于所选择的权重计算方式,根据所述每一个相关词与所述关键词的相关度,计算出在进行简历搜索时每一个相关词所占的权重值及所述关键词所占的权重值;a weight calculation module, configured to select a weight calculation manner according to the number of related words in the related word set, and calculate, according to the selected weight calculation manner, based on the correlation degree between each of the related words and the keyword, The weight value of each related word in the resume search and the weight value of the keyword;
    搜索模块,用于将所述关键词、所述关键词所占的权重值、每一个相关词和所述每一个相关词的权重值提交给搜索引擎,以从所述简历数据库中搜索出简历。a search module, configured to submit the keyword, the weight value occupied by the keyword, each related word, and the weight value of each of the related words to a search engine to search for a resume from the resume database .
  7. 如权利要求6所述的简历搜索装置,其特征在于,所述权重计算模块包括:The resume search device according to claim 6, wherein the weight calculation module comprises:
    第一计算单元,用于当所述相关词集中的相关词的数量处于上限词量和下限词量之间时,选择第一方式作为权重计算方式,根据第一权重公式
    Figure PCTCN2016113202-appb-100010
    计算出所述相关词集中第i个相关词在进行简历搜索时所占的权重值Wi;其中,rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
    a first calculating unit, configured to: when the number of related words in the related word set is between the upper limit word quantity and the lower limit word quantity, select the first mode as the weight calculation mode, according to the first weight formula
    Figure PCTCN2016113202-appb-100010
    Calculating a weight value W i occupied by the i-th related word in the related word set when performing a resume search; wherein r max is a maximum value of relevance of all related words recorded in the related word set;
    第二计算单元,用于根据第二权重公式WA=m×sum W,计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词,m为第一权重系数,sum W为所述相关词集中所有相关词的权重值的总和。a second calculating unit, configured to calculate, according to the second weight formula W A =m×sum W, a weight value W A of the keyword when performing a resume search; wherein A is the keyword, and m is The first weight coefficient, sum W, is the sum of the weight values of all related words in the related word set.
  8. 如权利要求6或7所述简历搜索装置,其特征在于,所述权重计算模块还包括:The resume search device according to claim 6 or 7, wherein the weight calculation module further comprises:
    基准分值计算单元,用于当所述相关词集中的相关词的数量不处于所述上限词量和所述下限词量之间时,选择第二方式作为权重计算方式,根据基准分值公式
    Figure PCTCN2016113202-appb-100011
    计算出所述相关词集中第i个相关词的基准分值Si;其中,ri为所述相关词集中的第i个相关词的相关度;rmin为在所述相关词集中所记载的所有相关词的相关度的最小值;rmax为在所述相关词集中所记载的所有相关词的相关度的最大值;
    a reference score calculation unit, configured to: when the number of related words in the related word set is not between the upper limit word amount and the lower limit word quantity, select the second mode as the weight calculation mode, according to the reference score formula
    Figure PCTCN2016113202-appb-100011
    Calculating a reference score S i of the i-th related word in the related word set; wherein r i is a correlation degree of the i-th related word in the related word set; r min is recorded in the related word set The minimum value of the relevance of all related words; r max is the maximum value of the relevance of all related words recorded in the related word set;
    求和计算单元,用于对所述所有相关词的基准分值进行求和,获得基准分值总值sum;a sum calculation unit for summing the reference scores of all the related words to obtain a total score of the reference scores sum;
    第三计算单元,用于根据第三权重公式WA=5+log1.5(sum+1),计算出所述关键词在进行简历搜索时所占的权重值WA;其中,A为所述关键词;a third calculating unit, configured to calculate, according to the third weight formula W A =5+log 1.5 (sum+1), a weight value W A of the keyword when performing a resume search; wherein A is the Key words;
    第四计算单元,用于判断所述第i个相关词的基准分值是否大于WA/n;若是,则根据第四权重公式
    Figure PCTCN2016113202-appb-100012
    计算出所述第i个相关词在进行简历搜索时所占的权重值Wi;若否,则根据第五权重公式
    Figure PCTCN2016113202-appb-100013
    计算出所述第i个相关词在进行简历搜索 时所占的权重值Wi;其中,n为第二权重系数;k为所述相关词集中的相关词的数量。
    a fourth calculating unit, configured to determine whether a reference score of the i-th related word is greater than W A /n; if yes, according to a fourth weight formula
    Figure PCTCN2016113202-appb-100012
    Calculating a weight value W i of the i-th related word when performing a resume search; if not, according to the fifth weight formula
    Figure PCTCN2016113202-appb-100013
    Calculating a weight value W i of the i-th related word when performing a resume search; wherein n is a second weight coefficient; k is the number of related words in the related word set.
  9. 如权利要求8所述的简历搜索装置,其特征在于,所述权重计算模块还包括:The resume search device according to claim 8, wherein the weight calculation module further comprises:
    均值计算单元,用于在所述判断所述第i个相关词的基准分值是否大于WA/n之前,根据均值公式
    Figure PCTCN2016113202-appb-100014
    计算基准分值均值
    Figure PCTCN2016113202-appb-100015
    其中,
    Figure PCTCN2016113202-appb-100016
    为所述所有相关词的相关度的平均值;
    a mean value calculation unit, configured to: according to the mean value formula, before determining whether the reference score of the i-th related word is greater than W A /n
    Figure PCTCN2016113202-appb-100014
    Calculate the mean of the benchmark score
    Figure PCTCN2016113202-appb-100015
    among them,
    Figure PCTCN2016113202-appb-100016
    The average of the correlations of all related words;
    判断单元,用于对于所述相关词集中的第i个相关词,判断所述第i个相关词的基准分值Si是否大于所述基准分值均值
    Figure PCTCN2016113202-appb-100017
    a determining unit, configured to determine, for the i-th related word in the related word set, whether the reference score S i of the i-th related word is greater than the reference score mean
    Figure PCTCN2016113202-appb-100017
    更新单元,用于当判断所述第i个相关词的基准分值Si是大于所述平均分值
    Figure PCTCN2016113202-appb-100018
    时,通过更新公式
    Figure PCTCN2016113202-appb-100019
    更新所述第i个相关词的基准分值Si
    An updating unit, configured to determine that a reference score S i of the i-th related word is greater than the average score
    Figure PCTCN2016113202-appb-100018
    By updating the formula
    Figure PCTCN2016113202-appb-100019
    Updating the reference score S i of the i-th related word.
  10. 如权利要求6所述的简历搜索的装置,其特征在于,所述搜索模块具体包括:The device for searching for a resume according to claim 6, wherein the search module specifically comprises:
    权重相联单元,用于根据搜索引擎的权重关联格式,将所述关键词所占的权重值和所述关键词相关联作为第一组合,以及将所述每一个相关词所占的权重值与其对应的相关词相关联作为第二组合;a weight associating unit, configured to associate, according to a weight association format of the search engine, a weight value occupied by the keyword and the keyword as a first combination, and a weight value occupied by each of the related words Corresponding related words are associated with the second combination;
    搜索显示单元,用于将所述第一组合和所述第二组合提交给搜索引擎,以供所述搜索引擎从所述简历数据库中搜索出简历,并根据所述搜索引擎内置的排序算法显示搜索出的简历;其中,所述简历中包含的所述关键词和所述每一个相关词以高亮形式显示。 Searching a display unit, configured to submit the first combination and the second combination to a search engine, for the search engine to search for a resume from the resume database, and display according to a sorting algorithm built in the search engine The searched resume; wherein the keyword and each of the related words included in the resume are displayed in a highlighted form.
PCT/CN2016/113202 2016-06-17 2016-12-29 Method and device for searching resumes WO2017215245A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610436857.7 2016-06-17
CN201610436857.7A CN105956195B (en) 2016-06-17 2016-06-17 Resume search method and apparatus

Publications (1)

Publication Number Publication Date
WO2017215245A1 true WO2017215245A1 (en) 2017-12-21

Family

ID=56905991

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/113202 WO2017215245A1 (en) 2016-06-17 2016-12-29 Method and device for searching resumes

Country Status (2)

Country Link
CN (1) CN105956195B (en)
WO (1) WO2017215245A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111737399A (en) * 2020-05-28 2020-10-02 北京百度网讯科技有限公司 Method and device for expanding question and answer set, electronic equipment and readable storage medium
CN112925872A (en) * 2019-12-05 2021-06-08 北京沃东天骏信息技术有限公司 Data searching method and device

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105956195B (en) * 2016-06-17 2019-03-29 广州视源电子科技股份有限公司 Resume search method and apparatus
CN106095982B (en) * 2016-06-17 2019-03-29 广州视源电子科技股份有限公司 Resume search method and device
CN112396401A (en) * 2020-11-23 2021-02-23 五八到家有限公司 Method and equipment for pushing resume of worker and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101140587A (en) * 2007-10-15 2008-03-12 深圳市迅雷网络技术有限公司 Searching method and apparatus
CN103226618A (en) * 2013-05-21 2013-07-31 焦点科技股份有限公司 Related word extracting method and system based on data market mining
US20130226936A1 (en) * 2012-02-24 2013-08-29 Hon Hai Precision Industry Co., Ltd. Electronic device and method for searching related terms
CN103425687A (en) * 2012-05-21 2013-12-04 阿里巴巴集团控股有限公司 Retrieval method and system based on queries
CN105653553A (en) * 2014-11-14 2016-06-08 腾讯科技(深圳)有限公司 Term weight generation method and device
CN105956195A (en) * 2016-06-17 2016-09-21 广州视源电子科技股份有限公司 Resume search method and apparatus
CN106095982A (en) * 2016-06-17 2016-11-09 广州视源电子科技股份有限公司 Resume searching method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4733458B2 (en) * 2005-07-29 2011-07-27 株式会社野村総合研究所 Search system and search method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101140587A (en) * 2007-10-15 2008-03-12 深圳市迅雷网络技术有限公司 Searching method and apparatus
US20130226936A1 (en) * 2012-02-24 2013-08-29 Hon Hai Precision Industry Co., Ltd. Electronic device and method for searching related terms
CN103425687A (en) * 2012-05-21 2013-12-04 阿里巴巴集团控股有限公司 Retrieval method and system based on queries
CN103226618A (en) * 2013-05-21 2013-07-31 焦点科技股份有限公司 Related word extracting method and system based on data market mining
CN105653553A (en) * 2014-11-14 2016-06-08 腾讯科技(深圳)有限公司 Term weight generation method and device
CN105956195A (en) * 2016-06-17 2016-09-21 广州视源电子科技股份有限公司 Resume search method and apparatus
CN106095982A (en) * 2016-06-17 2016-11-09 广州视源电子科技股份有限公司 Resume searching method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112925872A (en) * 2019-12-05 2021-06-08 北京沃东天骏信息技术有限公司 Data searching method and device
CN111737399A (en) * 2020-05-28 2020-10-02 北京百度网讯科技有限公司 Method and device for expanding question and answer set, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN105956195B (en) 2019-03-29
CN105956195A (en) 2016-09-21

Similar Documents

Publication Publication Date Title
WO2017215245A1 (en) Method and device for searching resumes
WO2017215243A1 (en) Method and device for searching resumes
JP5450842B2 (en) Determination of word information entropy
US9141728B2 (en) Methods and systems for creating and using an adaptive thesaurus
US6915295B2 (en) Information searching method of profile information, program, recording medium, and apparatus
WO2017215242A1 (en) Method and device for searching resumes
US20120226681A1 (en) Facet determination using query logs
US10152478B2 (en) Apparatus, system and method for string disambiguation and entity ranking
US20100153388A1 (en) Methods and apparatus for result diversification
JP2008519328A5 (en)
CN104994424B (en) A kind of method and apparatus for building audio and video standard data set
US8489604B1 (en) Automated resource selection process evaluation
WO2021082123A1 (en) Information recommendation method and apparatus, and electronic device
US10762091B2 (en) Interactive feedback for changes in search relevancy parameters
WO2018157790A1 (en) Method and device for determining related entity, computing device and storage medium
JP4375626B2 (en) Search service system and method for providing input order of keywords by category
US20130144815A1 (en) Making predictions regarding evaluation of functions for a database environment
CN112287094A (en) Similar case text retrieval system
CN112328775B (en) Case text information retrieval system
JP2011221872A (en) Knowledge quantity estimation apparatus and program
US9785676B2 (en) Systems and methods for providing ordered results for search queries
CN108170665B (en) Keyword expansion method and device based on comprehensive similarity
JP5310196B2 (en) Classification system revision support program, classification system revision support device, and classification system revision support method
US11468006B2 (en) Systems and methods to facilitate prioritization of documents in electronic discovery
CN106934007A (en) The method for pushing and device of related information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16905352

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16905352

Country of ref document: EP

Kind code of ref document: A1