CN102385587A - Method and system for identifying name - Google Patents

Method and system for identifying name Download PDF

Info

Publication number
CN102385587A
CN102385587A CN201010270770XA CN201010270770A CN102385587A CN 102385587 A CN102385587 A CN 102385587A CN 201010270770X A CN201010270770X A CN 201010270770XA CN 201010270770 A CN201010270770 A CN 201010270770A CN 102385587 A CN102385587 A CN 102385587A
Authority
CN
China
Prior art keywords
name
candidate
initiation sequence
entry
frequency meter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201010270770XA
Other languages
Chinese (zh)
Other versions
CN102385587B (en
Inventor
罗长升
方高林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tencent Computer Systems Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201010270770.XA priority Critical patent/CN102385587B/en
Publication of CN102385587A publication Critical patent/CN102385587A/en
Application granted granted Critical
Publication of CN102385587B publication Critical patent/CN102385587B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Character Discrimination (AREA)

Abstract

The invention provides method and system for identifying a name, being applicable to the field of internet and searching, and the method comprises the following steps: storing the name identified in initial sequence and the times that the name appears in a name frequency list, determining the candidate name according to lexical items in the initial sequence, and taking the candidate name as the identified name for marking if the candidate name appears in the name frequency list and the appearing times exceed preset time threshold. The technical scheme of the invention has the advantage of improving the name identifying accuracy.

Description

A kind of recognition methods of name and system
Technical field
The invention belongs to internet and search field, relate in particular to a kind of recognition methods and system of name.
Background technology
Along with Internet development, the user is increasing to be searched for the name of China through search software in the internet.The recognition methods of existing name is specially: Automatic Extraction Role Information from corpus (being stored data base); Take the Viterbi algorithm to carry out character labeling to cutting the speech result; On the basis of role's sequence, carry out the pattern maximum match, finally realize the identification of Chinese personal name.
According to the technical scheme that prior art provided, find to exist in the prior art following technical matters:
The method of the technical scheme that prior art provides is carried out character labeling to cutting the speech result, so when cutting the speech result and mistake occurs, easily to the name identification error, the identification error rate is high.
Summary of the invention
The embodiment of the invention provides a kind of recognition methods of name, when the recognition methods that is intended to solve prior art mistake occurs to cutting the speech result, and easily to the name identification error, the problem that the identification error rate is high.
The embodiment of the invention is achieved in that a kind of recognition methods of name, and said method comprises the steps:
The number of times of name that identifies in the initiation sequence and the appearance of this name is stored in the name frequency meter; Confirm candidate's name according to the entry in this initiation sequence;
Appear in this name frequency meter like this candidate's name, and occurrence number is when surpassing the preset times threshold value, with this candidate's name as the name that identifies.
The present invention also provides a kind of recognition system of name, and said system comprises:
Storage unit, the number of times that the name that is used for initiation sequence is identified and this name occur is stored in the name frequency meter;
Confirm the unit, be used for confirming candidate's name according to the entry of this initiation sequence;
Recognition unit is used for appearing at this name frequency meter at this candidate's name, and when preset times occurring and surpassing frequency threshold value, with this candidate's name as the name that identifies.
The embodiment of the invention compared with prior art; Beneficial effect is: technical scheme of the present invention is set up the name frequency meter to the name of initiation sequence with this name occurrence number; Confirm candidate's name according to the entry of this initiation sequence then, and the name in this candidate's name and this name frequency meter is compared, as appear in this name frequency meter; And when the number of times in this name frequency meter surpasses frequency threshold value; Confirm this candidate name that leaks identification by name, because this method is that error correction is carried out on the basis with the initiation sequence, so it has when cutting the speech result and mistake occurs; Can carry out the processing of error correction to the recognition result (being initiation sequence) of prior art, so it has the advantage that improves the name recognition accuracy.
Description of drawings
Fig. 1 is the process flow diagram of the recognition methods of a kind of name provided by the invention;
Fig. 2 is the process flow diagram that the embodiment of the invention one provides a kind of recognition methods of name;
Fig. 3 provides a kind of name correction process flow diagram for the embodiment of the invention one;
Fig. 4 is the process flow diagram that the embodiment of the invention two provides a kind of recognition methods of name;
Fig. 5 is the process flow diagram that the embodiment of the invention three provides a kind of recognition methods of name;
Fig. 6 provides a kind of structural drawing of recognition system of name for the present invention.
Embodiment
In order to make the object of the invention, technical scheme and advantage clearer,, the present invention is further elaborated below in conjunction with accompanying drawing and embodiment.Should be appreciated that specific embodiment described herein only in order to explanation the present invention, and be not used in qualification the present invention.
The present invention provides a kind of recognition methods of name, and this method is as shown in Figure 1, specifically comprises the steps:
S10, the number of times that the name that identifies in the initiation sequence and this name are occurred are stored in the name frequency meter;
Need to prove that above-mentioned initiation sequence can be sequence after name handled through preliminary identification.The method that above-mentioned identification is handled can be the method for prior art, and for example the Viterbi algorithm can certainly be other recognition methods, as long as this method can tentatively identify name, the present invention does not limit to the concrete manifestation form of this recognition methods.
S11, confirm candidate's name according to the entry in this initiation sequence;
S12, appear in this name frequency meter, and occurrence number is when surpassing the preset times threshold value like this candidate's name, with this candidate's name as the name that identifies.
Optional, this method can also comprise: mark the name that this identifies, and the number of times that in initiation sequence, occurs according to this candidate's name upgrades the name frequency meter.
Above-mentioned preset times threshold users can set up on their own in advance, and for example 1,2,3 or the like, the present invention does not limit to the concrete value of this frequency threshold value.
Optional, that realizes that the concrete grammar of S11 can be in subordinate's mode is any, can certainly be the combination in any in subordinate's mode.
Mode A, two or more continuous in initiation sequence entries are combined into candidate's name;
Mode B, candidate's name formed in first Chinese character of the back entry of the name entry of two words in the initiation sequence and this entry;
Mode C, candidate's name formed in the first two word of triliteral name entry in the initiation sequence.
Need to prove that people's name recognition method provided by the invention is mainly used in the identification of Chinese name,, then also can be applied to other literal, for example other minority name family literal etc. of the language of the Manchus or some if the name of other literal has the characteristic of Chinese name.
The method that present embodiment provides is set up the name frequency meter to the name of initiation sequence with this name occurrence number, confirms candidate's name according to the entry of this initiation sequence then, and the name in this candidate's name and this name frequency meter is compared; As appear in this name frequency meter; And when the number of times in this name frequency meter surpasses frequency threshold value, confirm this candidate name that leaks identification by name, with this candidate's name sign; And upgrade this name frequency meter; Because this method is that error correction is carried out on the basis with the initiation sequence,, can carry out the processing of error correction to the recognition result (being initiation sequence) of prior art so it has when cutting the speech result and mistake occurs; So it can solve the traditional difficult problem in the name identification emphatically: no surname name identification and name identification ambiguity, thus can improve the name recognition accuracy.
Embodiment one:
Present embodiment provides a kind of recognition methods of name; The technological scene that present embodiment is realized is: the method that present embodiment provides is accomplished by identification equipment, and this identification equipment specifically can do, digital electric equipment such as computing machine, portable terminal, PDA; Present embodiment is example with Chinese; Present embodiment is the recognition methods that example is explained present embodiment with the hypomere document, need to prove, the hypomere literal can be the sequence after the process identification disposal methods of prior art; The sequence unification that explanation for ease, present embodiment will be passed through after recognition methods identification is handled is called initiation sequence.Shown in this initiation sequence is specific as follows:
Be promoted to " topic player " Ceng Yike enrage Bao Xiaobai/nr/nr because of detonieren before and became the person who attract people's attention that night once more.In ground " support group " judging panel still very good once be lost can/original music and the pure and fresh typhoon of nr.Be called as " sheep angel " once be lost can/ the original works that remains oneself " Leo " that nr brings.Ceng Yike/nr remains " dispute can ".Ceng Yike/nr and second takes turns minimum " little swallow " Li Li/nr of score and carries out ultimate PK and fight to the finish.Once be lost this moment can/the ballot score of nr and Li Li/nr is 0: 2.Li Li/the nr on her next door has drawn time her to say then: keep your wool on.Three words of " wanting to swear at people " of Ceng Yike/nr are very clear.The reporter find once to be lost can/nr write blog all through the night.To having carried out sincere apology because of the said improper language of comfort Li Li/nr in the match.Ceng Yike/nr representes to abandon because of the comment in the external world music dream of oneself.Li Li/nr also in the blog of oneself for once be lost can/the nr clarification.But being lost of PK can still be unable to bear tear successively.Li Lixi/nr gets the 13rd in my elegant whole nation of my type of 2007 Sprites.Li Lifang/nr is able to win.Once be lost/ difference that nr can a ticket is defeated by Liu Xijun/nr and transfers to undetermined.
Wherein, the entry of " nr " in the above-mentioned initiation sequence is the name that identifies.Above-mentioned entry can be more predefined speech in the dictionary, for example " can ", " whole nation " or the like, can certainly be artificial some speech that are provided with, for example " Li Lianjie ", " Cheng Long ", " Jordon " etc.; Need to prove that the entry in the initiation sequence separates through space character, for example in " but successively ", entry " but " and entry " successively " separate through space character.The method that present embodiment provides is as shown in Figure 2, specifically comprises the steps:
S20, the number of times that the name that identifies in this initiation sequence and this name are occurred are stored in the name frequency meter;
The name frequency meter of above-mentioned initiation sequence specifically can be as shown in table 1:
Table 1:
Name Number of times
Ceng Yike 10
Bao Xiaobai 1
Li Li 7
Li Lixi 1
Li Lifang 1
Once be lost 1
S21, two or more continuous entries are combined into candidate's name;
Need to prove that above-mentioned entry can be the entry of single word, for example " "; Certainly in actual conditions, also can be the entry of a plurality of words, for example " language ".
Need to prove; During if any continuous a plurality of individual character entry; Candidate's name of its composition also can be a plurality of candidate's names; Here be combined into candidate's example by name with two continuous individual character entries, " drawn time she say " can be formed 4 candidate's names, be respectively: " leaving behind ", " time ", " down she ", " she says ".Need to prove that combinations thereof becomes the number of name list of the candidates words bar to be generally 2,3,4; Certainly the definition of this number is just stipulated by the custom of current Chinese name number of words; Do not get rid of when custom changes; The name number of words becomes numbers of words such as 8,9,10, and for example the number of the foreigner's Chinese name is and surpasses 4 number, above-mentioned a plurality of can the setting according to actual conditions.
S22, appear in the above-mentioned name frequency meter, and occurrence number is during greater than frequency threshold value, this candidate's name as the name mark that identifies, and is upgraded this name frequency meter like this candidate's name.
The concrete mode of this name frequency meter of above-mentioned renewal can for: the number of times to this candidate's name of occurring in this name frequency meter upgrades, and for example this candidate's name occurred 2 times, then the number of times to this candidate's name in this name frequency meter is increased by 2 times.
The process flow diagram of the correction in the present embodiment method is as shown in Figure 3, wherein, can the name of S20 be stored in the name frequency meter of Fig. 3, and error correction can be accomplished the operation of S21 and S22.
The method that present embodiment provides is set up the name frequency meter to the name of initiation sequence with this name occurrence number; Then candidate's names formed in continuous two or more entries and compare, as appear in this name frequency meter, and the number of times in this name frequency meter is during above frequency threshold value with the name in this name frequency meter; Confirm this candidate name that leaks identification by name; With this candidate's name sign, and upgrade this name frequency meter, because this method is that error correction is carried out on the basis with the initiation sequence; So it has when cutting the speech result and mistake occurs; Can carry out the processing of error correction to the recognition result (being initiation sequence) of prior art, so it can solve the traditional difficult problem in the name identification emphatically: no surname name identification and name are discerned ambiguity, thereby can improve the advantage of name recognition accuracy.
Embodiment two:
Present embodiment provides a kind of recognition methods of name, and the technological scene that present embodiment provides is identical with the technological scene that embodiment one provides, and this method is as shown in Figure 4, comprises the steps:
S40, the number of times that the name that identifies in this initiation sequence and this name are occurred are stored in the name frequency meter;
This name frequency meter specifically can be as shown in table 1.
S41, candidate's name formed in first Chinese character of the back entry of the name entry of two words and this entry;
The implementation method of S41 is described with a real example below, " once was lost/nr " with the name entry of above-mentioned two words here, a back entry of this entry be " can ", the candidate who then forms " Ceng Yike " by name.Certainly in actual conditions, also can candidate's name be formed in preceding two Chinese characters of the back entry of the name entry of two words and this entry.
S42, appear in the above-mentioned name frequency meter, and occurrence number is during greater than frequency threshold value, this candidate's name as the name mark that identifies, and is upgraded this name frequency meter like this candidate's name.
Here suppose that frequency threshold value is 3 times, certainly in actual conditions, can be arranged to other numeral; For example 2,4 or 1 or the like; Because the number of times that candidate's name " Ceng Yike " occurs in the name frequency meter is 10 times, greater than frequency threshold value, so " Ceng Yike " carried out the name mark; And upgrade the name frequency meter, the name frequency meter after the renewal is as shown in table 2:
Table 2
Name Number of times
Ceng Yike 11
Bao Xiaobai 1
Li Li 7
Li Lixi 1
Li Lifang 1
Once be lost 1
Sequence after the name that identifies marked is:
" beautiful side/nr is able to win.Ceng Yike/nr is defeated by Liu Xijun with the difference of a ticket ".Need to prove, owing to this mark is only changed the row second from the bottom of above-mentioned initiation sequence, so only write the delegation of change here.
The method that present embodiment provides is set up the name frequency meter to the name of initiation sequence with this name occurrence number; The name of then first Chinese character of the back entry of the name entry of two words and this entry being formed in candidate's name and this name frequency meter is compared; As appear in this name frequency meter; And when the number of times in this name frequency meter surpasses frequency threshold value, confirm this candidate name that leaks identification by name, with this candidate's name sign; And upgrade this name frequency meter; Because this method is that error correction is carried out on the basis with the initiation sequence,, can carry out the processing of error correction to the recognition result (being initiation sequence) of prior art so it has when cutting the speech result and mistake occurs; So it can solve the traditional difficult problem in the name identification emphatically: no surname name identification and name identification ambiguity, for example the ambiguity name perhaps becomes the identification problem of composer of ci poetry's name with context; Thereby can improve the advantage of name recognition accuracy.
Embodiment three:
Present embodiment provides a kind of recognition methods of name, and the technological scene that present embodiment provides is identical with the technological scene that embodiment one provides, and this method is as shown in Figure 4, comprises the steps:
S50, the number of times that the name that identifies in this initiation sequence and this name are occurred are stored in the name frequency meter;
This name frequency meter specifically can be as shown in table 1.
S51, candidate's name formed in the first two word of triliteral name entry;
The implementation method of S51 is described with a real example below, and here with above-mentioned triliteral name entry " Ceng Yike/nr ", the candidate of composition is by name: " once being lost "; The candidate that " Li Lixi/nr " and " Li Lifang/nr " forms is by name: " Li Li ".
S52, appear in the above-mentioned name frequency meter, and occurrence number is during greater than frequency threshold value, this candidate's name as the name mark that identifies, and is upgraded this name frequency meter like this candidate's name.
Here suppose that frequency threshold value is 3 times, because the occurrence number of " once being lost " is for once, so it is not greater than frequency threshold value; And the number of times that " Li Li " occurs is 7 times, greater than frequency threshold value, so " Li Lixi/nr " is modified as " Li Li/nr happiness "; " Li Licheng/nr " is modified as " Li Li/nr becomes "; And upgrade the name frequency meter, the name frequency meter after the renewal is as shown in table 3:
Table 3:
Figure BSA00000254941500081
Figure BSA00000254941500091
Sequence after the name that identifies marked is:
" water.Li Li/nr gains the 13rd in my elegant whole nation of my type of 2007 Sprites happily.Li Li/nr side is able to win.Once be lost/ difference that nr can a ticket is defeated by Liu Xijun "
The method that present embodiment provides is set up the name frequency meter to the name of initiation sequence with this name occurrence number; Then candidate's name formed in the first two word in the triliteral name entry and compare, as appear in this name frequency meter, and the number of times in this name frequency meter is during above frequency threshold value with the name in this name frequency meter; Confirm this candidate name that leaks identification by name; With this candidate's name sign, and upgrade this name frequency meter, because this method is that error correction is carried out on the basis with the initiation sequence; So it has when cutting the speech result and mistake occurs; Can carry out the processing of error correction to the recognition result (being initiation sequence) of prior art, so it can solve the traditional difficult problem in the name identification emphatically: no surname name identification and name are discerned ambiguity, thereby can improve the advantage of name recognition accuracy.
The present invention also provides a kind of recognition system of name, and this system is as shown in Figure 6, comprising:
Storage unit 61 is stored in the number of times of name that identifies in the initiation sequence and the appearance of this name in the name frequency meter;
Confirm that the entry in unit 62 these initiation sequences confirms candidate's name;
Recognition unit 63 appears in this name frequency meter at this candidate's name, and occurrence number marks this candidate's name when surpassing the preset times threshold value as the name that identifies.
The definition of above-mentioned initiation sequence can be referring to the associated description among the method embodiment.
Optional, said system can also comprise:
Mark updating block 64 these names that identify of mark, and the number of times that in initiation sequence, occurs according to this candidate's name upgrades the name frequency meter.
Optional, above-mentioned definite unit 62 can comprise in the following module any or a plurality of:
Composite module 621 is combined into candidate's name with two or more continuous in this initiation sequence entries continuously;
Composite module 622 is formed candidate's name with first Chinese character or preceding two Chinese characters of a back entry of the name entry of two words in this initiation sequence and this entry;
Form module 623 candidate's name formed in the first two word of triliteral name entry in this initiation sequence.
The system that present embodiment provides sets up the name frequency meter to the name of initiation sequence with this name occurrence number, confirms candidate's name according to the entry of this initiation sequence then, and the name in this candidate's name and this name frequency meter is compared; As appear in this name frequency meter; And when the number of times in this name frequency meter surpasses frequency threshold value, confirm this candidate name that leaks identification by name, with this candidate's name sign; And upgrade this name frequency meter; Because this system is that error correction is carried out on the basis with the initiation sequence,, can carry out the processing of error correction to the recognition result (being initiation sequence) of prior art so it has when cutting the speech result and mistake occurs; So it can solve the traditional difficult problem in the name identification emphatically: no surname name identification and name are discerned ambiguity, thereby can improve the advantage of name recognition accuracy.
It should be noted that among the said system embodiment that each included unit is just divided according to function logic, but is not limited to above-mentioned division, as long as can realize function corresponding; In addition, the concrete title of each functional unit also just for the ease of mutual differentiation, is not limited to protection scope of the present invention.
In addition; One of ordinary skill in the art will appreciate that all or part of step that realizes in the foregoing description method is to instruct relevant hardware to accomplish through program; Corresponding program can be stored in a kind of computer-readable recording medium; The above-mentioned storage medium of mentioning can be a ROM (read-only memory), disk or CD etc.
In sum, technical scheme provided by the invention has the advantage that is difficult for the name identification error.
The above is merely preferred embodiment of the present invention, not in order to restriction the present invention, all any modifications of within spirit of the present invention and principle, being done, is equal to and replaces and improvement etc., all should be included within protection scope of the present invention.

Claims (10)

1. the recognition methods of a name is characterized in that, said method comprises the steps:
The number of times of name that identifies in the initiation sequence and the appearance of this name is stored in the name frequency meter,
Confirm candidate's name according to the entry in this initiation sequence;
Appear in this name frequency meter like this candidate's name, and occurrence number is when surpassing the preset times threshold value, with this candidate's name as the name that identifies.
2. method according to claim 1 is characterized in that, said method also comprises the steps: this candidate's name as after the name that identifies
Mark the name that this identifies, and the number of times that in initiation sequence, occurs according to this candidate's name upgrades the name frequency meter.
3. method according to claim 1 is characterized in that, saidly confirms that according to the entry in this initiation sequence the step of candidate's name specifically comprises:
Two or more continuous in this initiation sequence entries are combined into candidate's name.
4. method according to claim 1 is characterized in that, saidly confirms that according to the entry in this initiation sequence the step of candidate's name specifically comprises:
Candidate's name formed in first Chinese character or the first two Chinese character of the back entry of the name entry of two words in this initiation sequence and this entry.
5. method according to claim 1 is characterized in that, saidly confirms that according to the entry in this initiation sequence the step of candidate's name specifically comprises:
Candidate's name formed in the first two word of triliteral name entry in this initiation sequence.
6. the recognition system of a name is characterized in that, said system comprises:
Storage unit, the number of times that the name that is used for initiation sequence is identified and this name occur is stored in the name frequency meter;
Confirm the unit, be used for confirming candidate's name according to the entry of this initiation sequence;
Recognition unit is used for appearing at this name frequency meter at this candidate's name, and occurrence number is when surpassing the preset times threshold value, with this candidate's name as the name that identifies.
7. system according to claim 6 is characterized in that, said system also comprises:
The mark updating block be used to mark the name that this identifies, and the number of times that in initiation sequence, occurs according to this candidate's name upgrades the name frequency meter.
8. system according to claim 6 is characterized in that, said candidate unit comprises:
Composite module is used for two or more entries that this initiation sequence is continuous and is combined into candidate's name continuously.
9. system according to claim 6 is characterized in that, said candidate unit comprises:
Composite module is used for candidate's name formed in first Chinese character or preceding two Chinese characters of the back entry of the name entry of two words of this initiation sequence and this entry.
10. system according to claim 6 is characterized in that, said candidate unit comprises:
Form module, be used for candidate's name formed in the first two word of the triliteral name entry of this initiation sequence.
CN201010270770.XA 2010-08-27 2010-08-27 Method and system for identifying name Active CN102385587B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010270770.XA CN102385587B (en) 2010-08-27 2010-08-27 Method and system for identifying name

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010270770.XA CN102385587B (en) 2010-08-27 2010-08-27 Method and system for identifying name

Publications (2)

Publication Number Publication Date
CN102385587A true CN102385587A (en) 2012-03-21
CN102385587B CN102385587B (en) 2014-07-30

Family

ID=45825008

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010270770.XA Active CN102385587B (en) 2010-08-27 2010-08-27 Method and system for identifying name

Country Status (1)

Country Link
CN (1) CN102385587B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103823859A (en) * 2014-02-21 2014-05-28 安徽博约信息科技有限责任公司 Name recognition algorithm based on combination of decision-making tree rules and multiple statistic models
CN105373530A (en) * 2015-12-03 2016-03-02 北京锐安科技有限公司 Chinese name identification method and apparatus
CN112016272A (en) * 2019-10-29 2020-12-01 河南拓普计算机网络工程有限公司 Bidding information review expert identification system and method
CN113792186A (en) * 2021-08-16 2021-12-14 青岛海尔科技有限公司 Method and device for name retrieval, electronic equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101088082A (en) * 2004-10-25 2007-12-12 英孚威尔公司 Full text query and search systems and methods of use
CN101645134A (en) * 2005-07-29 2010-02-10 富士通株式会社 Integral place name recognition method and integral place name recognition device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101088082A (en) * 2004-10-25 2007-12-12 英孚威尔公司 Full text query and search systems and methods of use
CN101645134A (en) * 2005-07-29 2010-02-10 富士通株式会社 Integral place name recognition method and integral place name recognition device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
和雪娟等: "基于统计和规则混合策略的中国人名识别研究", 《云南民族大学学报(自然科学版)》, vol. 18, no. 1, 31 January 2009 (2009-01-31), pages 70 - 72 *
罗智勇: "一种基于可信度的人名识别方法", 《中文信息学报》, vol. 19, no. 3, 30 June 2005 (2005-06-30) *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103823859A (en) * 2014-02-21 2014-05-28 安徽博约信息科技有限责任公司 Name recognition algorithm based on combination of decision-making tree rules and multiple statistic models
CN103823859B (en) * 2014-02-21 2017-02-22 安徽博约信息科技股份有限公司 Name recognition algorithm based on combination of decision-making tree rules and multiple statistic models
CN105373530A (en) * 2015-12-03 2016-03-02 北京锐安科技有限公司 Chinese name identification method and apparatus
CN112016272A (en) * 2019-10-29 2020-12-01 河南拓普计算机网络工程有限公司 Bidding information review expert identification system and method
CN113792186A (en) * 2021-08-16 2021-12-14 青岛海尔科技有限公司 Method and device for name retrieval, electronic equipment and storage medium
CN113792186B (en) * 2021-08-16 2023-07-11 青岛海尔科技有限公司 Method, device, electronic equipment and storage medium for name retrieval

Also Published As

Publication number Publication date
CN102385587B (en) 2014-07-30

Similar Documents

Publication Publication Date Title
US10783171B2 (en) Address search method and device
CN105718586B (en) The method and device of participle
CN103186524B (en) A kind of place name identification method and apparatus
CN102866782B (en) Input method and input method system for improving sentence generating efficiency
CN104281649B (en) Input method and device and electronic equipment
US20150302056A1 (en) Method, system, and storage medium for information search
CN104572625A (en) Recognition method of named entity
CN102750282B (en) Synonym template mining method and device as well as synonym mining method and device
CN101013342A (en) Chinese online input method based on Chinese network word base
CN102693279A (en) Method, device and system for fast calculating comment similarity
CN102385587B (en) Method and system for identifying name
CN103324626A (en) Method for setting multi-granularity dictionary and segmenting words and device thereof
CN106383814A (en) Word segmentation method of English social media short text
CN108650546B (en) Barrage processing method, computer-readable storage medium and electronic device
CN101271449B (en) Method and device for reducing vocabulary and Chinese character string phonetic notation
CN102193920A (en) Name word stock generating method and device as well as text input system
CN106959943B (en) Language identification updating method and device
CN106326206B (en) Entity extraction method based on grammar template
CN111091834B (en) Text and audio alignment method and related product
CN104615782B (en) Address matching process based on sliding window maximum matching algorithm
CN103500163A (en) Method and device for recognizing event key progress
CN102866783B (en) Syncopation method of Chinese phonetic string and system thereof
CN101539433A (en) Searching method with first letter of pinyin and intonation in navigation system and device thereof
CN102033891A (en) Retrieval method for Chinese information, retrieval engine for Chinese information and embedded terminal
CN103167087A (en) Method and system of searching cell phone contact persons

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20151230

Address after: The South Road in Guangdong province Shenzhen city Fiyta building 518057 floor 5-10 Nanshan District high tech Zone

Patentee after: Shenzhen Tencent Computer System Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518044 Zhenxing Road, SEG Science Park 2 East Room 403

Patentee before: Tencent Technology (Shenzhen) Co., Ltd.