CN105187600B - Recursion-based telephone number recognition method and device - Google Patents

Publication number
CN105187600B
Authority
CN
China
Prior art keywords
telephone number
identified
digit
strings
cutting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510643026.2A
Other languages
Chinese (zh)
Other versions
CN105187600A (en)
Inventor
马健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Application filed by Beijing Qihoo Technology Co Ltd and Qizhi Software Beijing Co Ltd
Priority to CN201510643026.2A
Publication of CN105187600A
Application granted
Publication of CN105187600B

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The present invention provides a recursion-based telephone number recognition method and device. The method includes: a preprocessing operation, which performs preprocessing related to the telephone number format on an original telephone number string to be recognized and obtains a processed target telephone number string to be recognized; a division operation, which, starting from the initial position, divides the target telephone number string to be recognized according to a division rule that conforms to the telephone number format and obtains a number string of a first specified number of digits; a recognition operation, which identifies the category of the telephone number corresponding to the number string of the first specified number of digits; and a recursive operation, which, if part of the telephone number string remains to be recognized, recurses on the remaining string until all of it has been recognized. Embodiments of the invention rely on this recursive operation, repeating the recursion on the remaining telephone number string until all of it has been recognized.

Description

Recursion-based telephone number recognition method and device
Technical field
The present invention relates to the technical field of Internet applications, and in particular to a recursion-based telephone number recognition method and device.
Background
POI (Point of Interest) data is the cornerstone of the entire digital map and navigation industry, and in the current mobile Internet era map information data has become even more indispensable. Massive numbers of web pages contain a large amount of POI information, and each POI record includes a name, an address, a longitude and latitude, a telephone number and similar information. The quality of POI data varies from one web page to another, and because the telephone is an important way of contacting a point of interest, the accuracy of the telephone number is an important indicator of the quality of a POI record.
Web pages contain hundreds of millions of POI records, and telephone numbers are presented in complex and varied ways: a single POI record may include several fixed-line or mobile numbers that are interleaved and run together. In addition, POI information extracted from the Internet may contain a large amount of erroneous data, telephone numbers included, and a wrong telephone number harms the user experience in applications. How to accurately recognize the telephone numbers in web page POI information has therefore become a technical problem in urgent need of a solution.
Summary of the invention
In view of the above problems, the present invention is proposed to provide a recursion-based telephone number recognition method and a corresponding device that overcome the above problems or at least partially solve them.
According to one aspect of the present invention, a recursion-based telephone number recognition method is provided, including:
A preprocessing operation: performing preprocessing related to the telephone number format on an original telephone number string to be recognized, to obtain a processed target telephone number string to be recognized;
A division operation: starting from the initial position, dividing the target telephone number string to be recognized according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits;
A recognition operation: identifying the category of the telephone number corresponding to the number string of the first specified number of digits;
A recursive operation: if part of the telephone number string remains to be recognized, recursing on the remaining telephone number string until all of it has been recognized.
Optionally, recursing on the remaining telephone number string to be recognized includes:
Performing the preprocessing operation, the division operation and the recognition operation on the remaining telephone number string to be recognized.
Optionally, performing the preprocessing related to the telephone number format on the original telephone number string to be recognized, to obtain the processed target telephone number string to be recognized, includes:
Determining whether the original telephone number string to be recognized contains a specified separator;
If the original telephone number string to be recognized contains a specified separator, splitting the original telephone number string to be recognized at the separator, to obtain at least two target telephone number strings to be recognized.
Optionally, the specified separator includes at least one of the following: pause mark (、), comma, semicolon, slash, backslash, vertical bar.
Optionally, after the at least two target telephone number strings to be recognized are obtained by splitting, the method further includes:
For each target telephone number string to be recognized, determining whether its head carries a national area code;
If so, removing the national area code from the head of the target telephone number string to be recognized.
Optionally, after the national area code is removed from the head of the target telephone number string to be recognized, the method further includes:
Analyzing the target telephone number string to be recognized from which the national area code has been removed;
If the head of the target telephone number string to be recognized carries a regional area code and that regional area code is incomplete, supplementing the regional area code to make it complete;
If the head of the target telephone number string to be recognized carries a regional area code and that regional area code is repeated, deduplicating the regional area code.
Optionally, identifying the category of the telephone number corresponding to the number string of the first specified number of digits includes:
Judging whether the number string of the first specified number of digits matches the attribute features of a first category of telephone number;
If so, determining at least two detection digit counts according to the attribute features of the first category of telephone number;
Splitting the target telephone number string to be recognized with each detection digit count in turn, to obtain split results;
According to the split results, choosing the optimal detection digit count from the at least two detection digit counts and using it to complete the number string of the first specified number of digits.
Optionally, splitting the target telephone number string to be recognized with each detection digit count in turn, to obtain split results, includes:
For each detection digit count, using it to split the part of the target telephone number string to be recognized that follows the number string of the first specified number of digits, to obtain a first split number and a second split number;
Comparing the first split number with the second split number and determining the number of positions at which the two carry the same digit, as the split result corresponding to that detection digit count.
Optionally, choosing the optimal detection digit count from the at least two detection digit counts according to the split results and using it to complete the number string of the first specified number of digits includes:
Comparing the numbers of identical-digit positions corresponding to the detection digit counts;
Choosing, from the detection digit counts, the one with the largest number of identical-digit positions as the optimal detection digit count;
Completing the number string of the first specified number of digits with that many further digits.
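As a non-limiting illustration, the choice of the optimal detection digit count described above can be sketched in Python. The function names are hypothetical and do not appear in the original disclosure, and the candidate counts 7 and 8 are an assumption based on the 7- or 8-digit regional numbers mentioned in the description.

```python
def count_matching_positions(first: str, second: str) -> int:
    """Count the positions at which the two split numbers carry the same digit."""
    return sum(a == b for a, b in zip(first, second))


def choose_detection_digits(remaining: str, candidates=(7, 8)) -> int:
    """Return the candidate count whose two consecutive splits agree most.

    `remaining` is the part of the string that follows the area code.
    The candidates (7, 8) are an assumption, not a value fixed by the source.
    On a tie the earlier candidate is kept.
    """
    best, best_score = candidates[0], -1
    for width in candidates:
        first = remaining[:width]               # first split number
        second = remaining[width:2 * width]     # second split number
        score = count_matching_positions(first, second)
        if score > best_score:
            best, best_score = width, score
    return best
```

For the string "70104577010457" a count of 7 yields two identical split numbers, so 7 is chosen; for "6990619869906199" a count of 8 yields two numbers that differ only in the last digit, so 8 is chosen.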
Optionally, after judging whether the number string of the first specified number of digits matches the attribute features of the first category of telephone number, the method further includes:
If the number string of the first specified number of digits does not match the attribute features of the first category of telephone number, choosing a new division rule that conforms to the telephone number format and dividing the target telephone number string to be recognized again, to obtain a number string of a second specified number of digits;
Judging whether the number string of the second specified number of digits matches the attribute features of a second category of telephone number;
If so, completing the number string of the second specified number of digits according to the attribute features of the second category of telephone number.
Optionally, the original telephone number string to be recognized is obtained through the following steps:
Obtaining point of interest (POI) information from a web page;
Extracting the original telephone number string to be recognized from the POI information.
According to another aspect of the present invention, a recursion-based telephone number recognition device is also provided, including:
A preprocessing module, adapted to perform preprocessing related to the telephone number format on an original telephone number string to be recognized, to obtain a processed target telephone number string to be recognized;
A division module, adapted to divide, starting from the initial position, the target telephone number string to be recognized according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits;
A recognition module, adapted to identify the category of the telephone number corresponding to the number string of the first specified number of digits;
A recursion module, adapted to recurse, if part of the telephone number string remains to be recognized, on the remaining telephone number string until all of it has been recognized.
Optionally, the recursion module is further adapted to:
For the remaining telephone number string to be recognized, trigger the preprocessing module to perform the preprocessing operation, the division module to perform the division operation and the recognition module to perform the recognition operation, until all of the remaining telephone number string has been recognized.
Optionally, the preprocessing module is further adapted to:
Determine whether the original telephone number string to be recognized contains a specified separator;
If the original telephone number string to be recognized contains a specified separator, split the original telephone number string to be recognized at the separator, to obtain at least two target telephone number strings to be recognized.
Optionally, the specified separator includes at least one of the following: pause mark (、), comma, semicolon, slash, backslash, vertical bar.
Optionally, the preprocessing module is further adapted to:
After the at least two target telephone number strings to be recognized are obtained by splitting, determine, for each target telephone number string to be recognized, whether its head carries a national area code;
If so, remove the national area code from the head of the target telephone number string to be recognized.
Optionally, the preprocessing module is further adapted to:
After the national area code is removed from the head of the target telephone number string to be recognized, analyze the target telephone number string to be recognized from which the national area code has been removed;
If the head of the target telephone number string to be recognized carries a regional area code and that regional area code is incomplete, supplement the regional area code to make it complete;
If the head of the target telephone number string to be recognized carries a regional area code and that regional area code is repeated, deduplicate the regional area code.
Optionally, the recognition module is further adapted to:
Judge whether the number string of the first specified number of digits matches the attribute features of a first category of telephone number;
If so, determine at least two detection digit counts according to the attribute features of the first category of telephone number;
Split the target telephone number string to be recognized with each detection digit count in turn, to obtain split results;
According to the split results, choose the optimal detection digit count from the at least two detection digit counts and use it to complete the number string of the first specified number of digits.
Optionally, the recognition module is further adapted to:
For each detection digit count, use it to split the part of the target telephone number string to be recognized that follows the number string of the first specified number of digits, to obtain a first split number and a second split number;
Compare the first split number with the second split number and determine the number of positions at which the two carry the same digit, as the split result corresponding to that detection digit count.
Optionally, the recognition module is further adapted to:
Compare the numbers of identical-digit positions corresponding to the detection digit counts;
Choose, from the detection digit counts, the one with the largest number of identical-digit positions as the optimal detection digit count;
Complete the number string of the first specified number of digits with that many further digits.
Optionally, the division module is further adapted to: after the recognition module has judged whether the number string of the first specified number of digits matches the attribute features of the first category of telephone number, and if it does not match them, choose a new division rule that conforms to the telephone number format and divide the target telephone number string to be recognized again, to obtain a number string of a second specified number of digits;
The recognition module is further adapted to judge whether the number string of the second specified number of digits matches the attribute features of a second category of telephone number and, if so, to complete the number string of the second specified number of digits according to the attribute features of the second category of telephone number.
Optionally, the device further includes an acquisition module, adapted to obtain the original telephone number string to be recognized through the following steps:
Obtaining point of interest (POI) information from a web page;
Extracting the original telephone number string to be recognized from the POI information.
In the embodiments of the present invention, preprocessing related to the telephone number format is first performed on the original telephone number string to be recognized, so that the preprocessed target telephone number string to be recognized is consistent with the telephone number format. Subsequent recognition is then based on the preprocessed target string, which improves the recognition rate of telephone numbers. The embodiments also take into account the features of different categories of telephone number (such as fixed-line or mobile numbers): the target telephone number string to be recognized is divided using the division rule of the telephone number format that corresponds to each category, and the category is identified from the resulting number string of the first specified number of digits, which achieves effective recognition of telephone numbers of different categories. Further, after the category of the telephone number corresponding to the number string of the first specified number of digits has been identified, if part of the telephone number string remains to be recognized, the embodiments recurse on the remaining string until all of it has been recognized.
In addition, the embodiments of the present invention exploit the fact that two fixed-line or mobile numbers belonging to the same organization are highly similar: recognition uses a scheme in which the target telephone number string to be recognized is tested first and the detection digit count is then judged, which further improves the accuracy of telephone number recognition.
The above description is only an overview of the technical solution of the present invention. So that the technical means of the present invention can be understood more clearly and implemented in accordance with the contents of the specification, and so that the above and other objects, features and advantages of the present invention become more apparent, specific embodiments of the present invention are set out below.
From the following detailed description of specific embodiments of the present invention in conjunction with the accompanying drawings, those skilled in the art will better understand the above and other objects, advantages and features of the present invention.
Description of the drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art on reading the following detailed description of the preferred embodiments. The drawings serve only to illustrate the preferred embodiments and are not to be considered a limitation of the present invention. Throughout the drawings, the same reference numbers refer to the same parts. In the drawings:
Fig. 1 shows a flow chart of a recursion-based telephone number recognition method according to an embodiment of the present invention;
Fig. 2 shows a flow chart of identifying the category of the telephone number corresponding to the number string of the first specified number of digits according to an embodiment of the present invention;
Fig. 3 shows a flow chart of a recursion-based telephone number recognition method according to another embodiment of the present invention;
Fig. 4 shows a structural diagram of a recursion-based telephone number recognition device according to an embodiment of the present invention; and
Fig. 5 shows a structural diagram of a recursion-based telephone number recognition device according to another embodiment of the present invention.
Detailed description of the embodiments
Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure can be understood more thoroughly and its scope conveyed completely to those skilled in the art.
To solve the above technical problem, an embodiment of the present invention provides a recursion-based telephone number recognition method. Fig. 1 shows a flow chart of a recursion-based telephone number recognition method according to an embodiment of the present invention. Referring to Fig. 1, the method may include at least steps S102 to S108.
Step S102, preprocessing operation: perform preprocessing related to the telephone number format on the original telephone number string to be recognized, to obtain a processed target telephone number string to be recognized.
Step S104, division operation: starting from the initial position, divide the target telephone number string to be recognized according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits.
Step S106, recognition operation: identify the category of the telephone number corresponding to the number string of the first specified number of digits.
Step S108, recursive operation: if part of the telephone number string remains to be recognized, recurse on the remaining telephone number string until all of it has been recognized.
In the embodiments of the present invention, preprocessing related to the telephone number format is first performed on the original telephone number string to be recognized, so that the preprocessed target telephone number string to be recognized is consistent with the telephone number format. Subsequent recognition is then based on the preprocessed target string, which improves the recognition rate of telephone numbers. The embodiments also take into account the features of different categories of telephone number (such as fixed-line or mobile numbers): the target telephone number string to be recognized is divided using the division rule of the telephone number format that corresponds to each category, and the category is identified from the resulting number string of the first specified number of digits, which achieves effective recognition of telephone numbers of different categories. Further, after the category of the telephone number corresponding to the number string of the first specified number of digits has been identified, if part of the telephone number string remains to be recognized, the embodiments recurse on the remaining string until all of it has been recognized.
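The control flow of steps S102 to S108 can be illustrated with a minimal Python sketch. It is not the implementation of the embodiment: the function names are hypothetical, the preprocessing merely strips non-digit characters, and only two deliberately simplified division rules (11-digit mobile numbers and 10-digit numbers beginning with 400 or 800) are assumed.

```python
def preprocess(raw: str) -> str:
    """S102 (simplified assumption): keep only the digit characters."""
    return "".join(ch for ch in raw if ch.isdigit())


def divide_and_identify(digits: str):
    """S104 and S106 with two assumed rules; returns (number, category)."""
    if len(digits) >= 11 and digits[0] == "1" and digits[1] in "345789":
        return digits[:11], "mobile"
    if len(digits) >= 10 and digits[:3] in ("400", "800"):
        return digits[:10], "service"
    # No rule matched: consume everything so that the recursion terminates.
    return digits, "unknown"


def recognize(raw: str) -> list:
    """S108: recurse on whatever remains after the first number is taken."""
    digits = preprocess(raw)
    if not digits:
        return []
    number, category = divide_and_identify(digits)
    remaining = digits[len(number):]
    return [(number, category)] + recognize(remaining)
```

Each recursive call runs preprocessing, division and recognition again on the remainder, and the recursion stops once no digits are left.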
The recursion-based telephone number recognition method provided by the embodiments of the present invention can effectively recognize the telephone numbers in POI information. That is, before step S102 above, the original telephone number string to be recognized can first be obtained: specifically, POI information is obtained from a web page, and the original telephone number string to be recognized is then extracted from the POI information.
Telephone information in web pages falls mainly into mobile numbers and fixed-line numbers. Taking Chinese city, district and county telephone numbers as an example, a mobile number has 11 digits and generally begins with 13, 14, 15, 17, 18 or 19; its correctness and home region can be judged from its first 7 digits using a mobile number attribution table. Fixed-line numbers are divided into official 10-digit numbers beginning with 400 or 800, ordinary 7- or 8-digit regional numbers with a 3- or 4-digit area code, special official 5-digit numbers (such as 10086 and 95522) and special 3-digit numbers (such as 110, 119 and 114). A fixed-line number may also include an extension number.
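The number shapes listed above can be written down as patterns. The following Python sketch is only an illustration under stated assumptions: the category names and the `classify` function are hypothetical, the 5-digit and 3-digit patterns accept any digits rather than a list of real special numbers, and the lookup of the first 7 digits in a mobile number attribution table is not modeled.

```python
import re

# Patterns derived from the shapes described in the text; names are illustrative.
PATTERNS = [
    ("mobile", re.compile(r"1[345789]\d{9}")),              # 11 digits, begins 13/14/15/17/18/19
    ("service_400_800", re.compile(r"[48]00\d{7}")),        # 10 digits, begins 400 or 800
    ("regional_landline", re.compile(r"0\d{2,3}-?\d{7,8}")),  # 3- or 4-digit area code + 7 or 8 digits
    ("special_5_digit", re.compile(r"\d{5}")),              # e.g. 10086, 95522
    ("special_3_digit", re.compile(r"\d{3}")),              # e.g. 110, 119, 114
]


def classify(number: str) -> str:
    """Return the first category whose pattern matches the whole string."""
    for name, pattern in PATTERNS:
        if pattern.fullmatch(number):
            return name
    return "unknown"
```

A 10-digit string such as "1380233318" matches none of the patterns, which is consistent with Table 1 describing that mobile number as incomplete.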
Web pages contain hundreds of millions of POI records, and telephone numbers are presented in complex and varied ways: a single POI record may include several fixed-line or mobile numbers that are interleaved and run together. Table 1 lists some of the ways in which Chinese city, district and county telephone numbers are presented in web pages. The embodiments of the present invention subsequently recognize the disorderly telephone numbers in web pages according to the features of Chinese city, district and county telephone numbers described above.
It should be noted that the telephone number recognition method provided by the embodiments of the present invention can also be combined with the features of telephone numbers of other countries in order to recognize the telephone numbers of those countries effectively.
Table 1
Telephone number | Explanation
400-890-0000转805530 | The extension number is introduced by a Chinese character (转, "transfer to")
86-0877-70104577010457 | The number is preceded by 86, and several telephone numbers are run together without a separator
0852-8719889868719669 | The national area code 86 appears in the middle of the telephone number string
028-84876877,1380233318 | A mobile number and a fixed-line number appear together, and the mobile number is incomplete
07710771324579718602365784 | The regional area code is repeated
286990619869906199 | The regional area code lacks its leading 0
0755-13651464541 | The mobile number is preceded by a regional area code
As Table 1 shows, telephone numbers are presented in web pages in complex and varied ways. To improve the recognition rate, in step S102 above the embodiments of the present invention can perform preprocessing related to the telephone number format on the original telephone number string to be recognized, so that the preprocessed target telephone number string to be recognized matches the telephone number format as closely as possible.
In the embodiments of the present invention, the preprocessing related to the telephone number format may include pre-splitting at separators, recognition and removal of the national area code, and supplementing and deduplication of the regional area code.
First, when pre-splitting at separators, it is determined whether the original telephone number string to be recognized contains a specified separator. If it does, the original telephone number string to be recognized is split at the separator, to obtain at least two target telephone number strings to be recognized. If it does not, no pre-splitting is needed. Here, the specified separator may be a pause mark "、", comma ",", semicolon ";", slash "/", backslash "\", vertical bar "|" and so on; the invention is not limited to these.
For example, for the original telephone number string to be recognized "028-84876877,1380233318" in Table 1 above, it is determined that the string contains a specified separator (the comma ","). The string is split at this separator, and the resulting target telephone number strings to be recognized are "028-84876877" and "1380233318".
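The pre-splitting step can be sketched in Python as follows. The function name `pre_split` is hypothetical, and including the full-width comma and semicolon alongside their ASCII forms is an assumption made here because Chinese web pages commonly use both.

```python
import re

# Pause mark, comma, semicolon, slash, backslash and vertical bar.
# The full-width comma and semicolon are an added assumption.
SEPARATORS = "、,，;；/\\|"
_SPLIT = re.compile("[" + re.escape(SEPARATORS) + "]")


def pre_split(raw: str) -> list:
    """Split the original string at any specified separator, dropping empty parts."""
    parts = [part.strip() for part in _SPLIT.split(raw)]
    return [part for part in parts if part]
```

A string without any specified separator comes back as a single-element list, which corresponds to the case in which no pre-splitting is needed.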
Second, recognition and removal of the national area code. To distinguish the telephone numbers of different countries, a national area code is usually added in front of a telephone number. Taking Chinese telephone numbers as an example, 86 is usually added in front of the number. When no international call is being made, however, the national area code serves no practical purpose, so it can be removed.
In the embodiments of the present invention, after the at least two target telephone number strings to be recognized are obtained by splitting, it is determined for each target telephone number string to be recognized whether its head carries a national area code. If so, the national area code is removed from the head of the target telephone number string to be recognized. If not, no removal is needed.
For an original telephone number string to be recognized that did not need pre-splitting in the separator step, it is likewise determined whether the head of the string carries a national area code. If so, the national area code is removed from the head of the original telephone number string to be recognized. If not, no removal is needed.
In an embodiment of the present invention, taking the Chinese area code 86 as an example, common forms of 86 include +86, 086, 0086 and 86, and the embodiment can judge from the number of remaining digits whether 86 is the Chinese area code. For example, for the original telephone number string to be recognized "86-0877-70104577010457" in Table 1 above, 86 is judged from the remaining digits to be the Chinese area code and is removed, giving the processed target telephone number string to be recognized "0877-70104577010457". The symbol "-" that follows 86 is removed as well.
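A minimal Python sketch of this step is given below. The source states only that the remaining digits are used to decide whether 86 is the national area code; the concrete check used here (at least 10 remaining digits, beginning with 0 or 1) is an assumption, and the function name is hypothetical.

```python
import re

# Common written forms of the Chinese national area code, with an optional
# hyphen or space after it.
_PREFIX = re.compile(r"^(?:\+86|0086|086|86)[-\s]?")


def strip_country_code(number: str) -> str:
    """Remove a leading 86 only if what remains still looks like a number."""
    match = _PREFIX.match(number)
    if not match:
        return number
    rest = number[match.end():]
    digits = re.sub(r"\D", "", rest)
    # Assumed plausibility check: the remainder should begin with a regional
    # area code (leading 0) or a mobile prefix (leading 1) and be long enough.
    if len(digits) >= 10 and digits[0] in "01":
        return rest
    return number
```

With this check, a short local number that merely begins with the digits 86 is left untouched.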
Furthermore supplement is carried out at trivial number over the ground and when duplicate removal, can wait knowing to eliminating the target after national area code Other telephone number strings are analyzed, if the head that analysis obtains target telephone number strings to be identified has regional area code and this area Area code is imperfect, then supplementing this area's area code keeps it complete;If analyzing the head for obtaining target telephone number strings to be identified has Regional area code and the repetition of this area's area code, then carry out duplicate removal processing to this area's area code.
In the step of carrying out pre- cutting according to separator, for the original electricity to be identified of pre- slicing operation need not be carried out Number series is talked about, or in the step of national area code is identified and is removed, for the original of operation need not be removed Telephone number strings to be identified then further analyze the original telephone number strings to be identified, original are waited for if analysis obtains this Identify that the head of telephone number strings has regional area code and this area's area code is imperfect, then supplementing this area's area code keeps it complete; If the head that analysis obtains the original telephone number strings to be identified has regional area code and this area's area code repeats, to this area Area code carries out duplicate removal processing.
For example, the original telephone number string to be identified "286990619869906199" in Table 1 above is analyzed. Its head has a regional area code, and that area code is incomplete, so the area code is supplemented to make it complete, giving the target telephone number string to be identified "0286990619869906199".
As another example, the original telephone number string to be identified "07710771324579718602365784" in Table 1 above is analyzed. Its head has a regional area code, and that area code is repeated, so the area code is de-duplicated, giving the target telephone number string to be identified "0771324579718602365784".
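The two examples above can be reproduced with a short sketch. The table `AREA_CODES` is a hypothetical sample, and the rule that an "incomplete" area code is one missing its leading "0" is an assumption drawn from the "28" to "028" example; the source does not define incompleteness more precisely.

```python
# Hypothetical sample; a real system would load the full regional area code table.
AREA_CODES = ("028", "0755", "0771", "0877")

def normalize_area_code(s: str) -> str:
    """De-duplicate a doubled leading area code, or restore a dropped leading 0."""
    # De-duplication: the same area code appears twice in a row at the head.
    for code in AREA_CODES:
        if s.startswith(code + code):
            return s[len(code):]
    # Supplement (assumption): the head matches an area code without its leading 0.
    if not s.startswith("0"):
        for code in AREA_CODES:
            if s.startswith(code[1:]):
                return "0" + s
    return s
```

A plain prefix test like this can misfire on numbers that merely happen to start with the same digits, which is presumably why the text speaks of "analysis" rather than a fixed rule.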
In embodiments of the present invention, after the Chinese city, district, and county telephone numbers shown in Table 1 above undergo the preprocessing operations described above, the processed target telephone number strings to be identified are obtained, as shown in Table 2. The preprocessing operations mentioned above include separator-based pre-splitting, identification and removal of the national area code, and supplementation and de-duplication of the regional area code. The present invention does not limit the order in which they are executed; in practice, the order can be set according to actual requirements. For example, any single preprocessing operation may be executed alone; or pre-splitting by separator may be performed first, followed by identification and removal of the national area code, and then supplementation and de-duplication of the regional area code. As another example, the national area code may be identified and removed first, then the regional area code supplemented and de-duplicated, and then pre-splitting by separator performed. As yet another example, the national area code may be identified and removed first, then pre-splitting by separator performed, and then the regional area code supplemented and de-duplicated, and so on.
Table 2
It should be noted that, in the embodiments of the present invention, the preprocessing operations related to the telephone number format that are performed on the original telephone number string to be identified are not limited to the several preprocessing modes above. In practice, corresponding preprocessing operations can be performed according to the characteristics of telephone numbers in different countries, so that the target telephone number string to be identified after preprocessing conforms to the telephone number format as closely as possible, thereby improving the recognition rate of telephone numbers.
After the processed target telephone number string to be identified is obtained in step S102, in step S104 the target string is divided, starting from the initial position, according to a division rule conforming to the telephone number format, obtaining a number string of the first specified digit count. Here, the division rule can be chosen according to the characteristics of different categories of telephone numbers (such as fixed-line or mobile numbers).
Next, in step S106, the category of the telephone number corresponding to the number string of the first specified digit count is identified. An embodiment of the present invention provides an optional scheme for this: judge whether the number string of the first specified digit count satisfies the attribute features of a first category of telephone number; if it does, complete the number string according to the attribute features of the first category of telephone number, obtaining the telephone number corresponding to the number string of the first specified digit count.
Further, for completing the number string of the first specified digit count according to the attribute features of the first category of telephone number, the present invention provides an optional scheme. According to the attribute features of the first category of telephone number, the number of completion digits is determined. Then, starting from the division position in the target telephone number string to be identified that corresponds to the number string of the first specified digit count, that many digits are intercepted. Finally, the intercepted digits are appended to the end of the number string of the first specified digit count.
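The interception-and-append completion just described amounts to simple string slicing. The following is a minimal sketch; the function name `complete_number` and its return shape (the completed number plus whatever is left over) are illustrative assumptions.

```python
def complete_number(target: str, prefix_len: int, completion_len: int):
    """Complete the leading `prefix_len` digits with the next `completion_len` digits.

    Returns (completed_number, remaining_string).
    """
    prefix = target[:prefix_len]                             # number string of the first specified digit count
    extra = target[prefix_len:prefix_len + completion_len]   # digits intercepted from the division position
    remaining = target[prefix_len + completion_len:]         # left for later recognition
    return prefix + extra, remaining
```

For an 11-digit mobile number with a 7-digit prefix, the completion digit count is 4; for a fixed-line number with area code "028" and an 8-digit subscriber number, it is 8.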
If the number string of the first specified digit count does not satisfy the attribute features of the first category of telephone number, a new division rule conforming to the telephone number format is chosen and the target telephone number string to be identified is divided again, obtaining a number string of the second specified digit count. It is then judged whether the number string of the second specified digit count satisfies the attribute features of a second category of telephone number; if so, the number string is completed according to the attribute features of the second category of telephone number, obtaining the telephone number corresponding to the number string of the second specified digit count.
Taking Chinese city, district, and county telephone numbers as an example, consider choosing a division rule conforming to the mobile phone number format. A mobile phone number has 11 digits, and its validity and home region can be judged from its first 7 digits (mobile numbers generally begin with 13, 14, 15, 17, 18, or 19, and a mobile number attribution table can be used to judge the validity and home region of the first 7 digits). The target telephone number string to be identified can therefore be divided according to the division rule conforming to the mobile phone number format, obtaining a number string whose first specified digit count is 7.
In addition, consider choosing a division rule conforming to the fixed-line telephone number format. Fixed-line numbers are divided into 10-digit official numbers beginning with 400 or 800, ordinary 7- or 8-digit regional numbers carrying a 3- or 4-digit area code, and special 5-digit official numbers. The target telephone number string to be identified can therefore be divided according to the division rule conforming to the fixed-line telephone number format, obtaining a number string whose first specified digit count is 3, 4, or 5.
For example, the original telephone number string to be identified extracted from POI information is "+8613651464541,28-84876877". The preprocessing operations related to the telephone number format are performed on it in sequence: separator-based pre-splitting, identification and removal of the national area code, and identification and supplementation of the regional area code. The processed target telephone number strings to be identified are "13651464541" and "028-84876877". Further, starting from the initial position, the target string "13651464541" is divided according to the division rule conforming to the mobile phone number format, obtaining the number string "1365146", whose first specified digit count is 7. Alternatively, starting from the initial position, the target string "028-84876877" is divided according to the division rule conforming to the fixed-line telephone number format, obtaining the number string "028", whose first specified digit count is 3.
In an embodiment of the present invention, if the head of the target telephone number string to be identified has a regional area code, then, starting from the initial position and according to the division rule conforming to the mobile phone number format, the target string with the head area code removed is divided, obtaining a number string whose first specified digit count is 7. For example, in Table 2 above, the target telephone number string to be identified is "0755-13651464541", and its head has the regional area code "0755". Starting from the initial position, the target string with the head area code removed is divided according to the division rule conforming to the mobile phone number format, obtaining the number string "1365146", whose first specified digit count is 7.
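The "0755-13651464541" case can be sketched as below. The area code tuple and the function name are hypothetical, and stripping a "-" after the area code is an assumption based on the example string.

```python
# Hypothetical sample of regional area codes.
AREA_CODES = ("0755", "028")

def mobile_prefix_after_area_code(target: str) -> str:
    """Drop a leading regional area code (and a following '-'), then take 7 digits."""
    for code in AREA_CODES:
        if target.startswith(code):
            target = target[len(code):].lstrip("-")
            break
    return target[:7]
```

The returned 7-digit string is what would then be checked against the mobile number attribution table.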
In an embodiment of the present invention, the division rule conforming to the mobile phone number format can be chosen first to divide the target telephone number string to be identified, obtaining a number string whose first specified digit count is 7. It is then judged whether this 7-digit number string satisfies the attribute features of the first category of telephone number (that is, mobile numbers). If so, the 7-digit number string is completed according to the attribute features of the first category of telephone number (that is, mobile numbers), obtaining the corresponding telephone number (that is, a mobile number).
Again taking the original telephone number string to be identified "+8613651464541,28-84876877" as an example, a preprocessing operation related to the telephone number format is performed on it, such as deleting the national area code, and the processed target telephone number string to be identified is "13651464541,28-84876877". Further, starting from the initial position, the target string is divided according to the division rule conforming to the mobile phone number format, obtaining the number string "1365146", whose first specified digit count is 7. According to step S106, the telephone number corresponding to this 7-digit number string can then be identified as the mobile number "13651464541".
If the number string whose first specified digit count is 7 does not satisfy the attribute features of the first category of telephone number (that is, mobile numbers), the division rule conforming to the fixed-line telephone number format is chosen and the target telephone number string to be identified is divided again, obtaining a number string whose second specified digit count is 3, 4, or 5. It is then judged whether this number string satisfies the attribute features of the second category of telephone number (that is, fixed-line numbers). If so, the number string is completed according to the attribute features of the second category of telephone number (that is, fixed-line numbers), obtaining the corresponding telephone number (that is, a fixed-line number).
For example, in Table 2 above, after the preprocessing operation is performed on the original telephone number string to be identified "286990619869906199", the target telephone number string to be identified "0286990619869906199" is obtained. Next, starting from the initial position, the target string is divided according to the division rule conforming to the mobile phone number format, obtaining the number string "0286990", whose first specified digit count is 7. This 7-digit number string does not satisfy the attribute features of the first category of telephone number (that is, mobile numbers), so the division rule conforming to the fixed-line telephone number format is chosen and the target string is divided again, obtaining the number string "028", whose second specified digit count is 3. The telephone number corresponding to the 3-digit number string "028" is identified as a fixed-line number, which is either "0286990619" (7-digit subscriber number) or "02869906198" (8-digit subscriber number).
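The "mobile rule first, fixed-line rule as fallback" order used in this example can be sketched as follows. The prefix check stands in for the mobile number attribution table, `AREA_CODES` is a hypothetical sample, and the special 5-digit official numbers are omitted because the source does not list them; all names are illustrative assumptions.

```python
MOBILE_PREFIXES = ("13", "14", "15", "17", "18", "19")
AREA_CODES = {"028", "0755", "0771", "0877"}  # hypothetical sample

def classify(target: str):
    """Return (category, leading number string) for the head of `target`."""
    head7 = target[:7]
    # First rule: a 7-digit head that looks like the start of a mobile number.
    if len(head7) == 7 and head7.isdigit() and head7.startswith(MOBILE_PREFIXES):
        return "mobile", head7
    # Fallback rule: a 3- or 4-digit head that is 400/800 or a known area code.
    for n in (3, 4):
        head = target[:n]
        if head in ("400", "800") or head in AREA_CODES:
            return "fixed", head
    return None, ""
```

Swapping the two branches gives the alternative order in which the fixed-line rule is tried first.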
In another embodiment of the invention, the division rule conforming to the fixed-line telephone number format can instead be chosen first to divide the target telephone number string to be identified, obtaining a number string whose first specified digit count is 3, 4, or 5. It is then judged whether this number string satisfies the attribute features of the first category of telephone number (that is, fixed-line numbers). If so, the number string is completed according to the attribute features of the first category of telephone number (that is, fixed-line numbers), obtaining the corresponding telephone number (that is, a fixed-line number).
If the number string whose first specified digit count is 3, 4, or 5 does not satisfy the attribute features of the first category of telephone number (that is, fixed-line numbers), the division rule conforming to the mobile phone number format is chosen and the target telephone number string to be identified is divided again, obtaining a number string whose second specified digit count is 7. It is then judged whether this 7-digit number string satisfies the attribute features of the second category of telephone number (that is, mobile numbers). If so, the 7-digit number string is completed according to the attribute features of the second category of telephone number (that is, mobile numbers), obtaining the corresponding telephone number (that is, a mobile number).
The settings listed above, in which the first specified digit count is 7, the first category is mobile numbers, the second specified digit count is 3, 4, or 5, and the second category is fixed-line numbers, or alternatively the first specified digit count is 3, 4, or 5, the first category is fixed-line numbers, the second specified digit count is 7, and the second category is mobile numbers, are chosen according to the characteristics of Chinese city, district, and county telephone numbers. It should be noted that, for identifying telephone numbers of other countries, the first specified digit count, the first category of telephone number, the second specified digit count, and the second category of telephone number can be set according to the characteristics of those countries' telephone numbers.
In another embodiment of the invention, an embodiment provides another optional scheme for identifying, in step S106 above, the category of the telephone number corresponding to the number string of the first specified digit count. Fig. 2 shows a flowchart of identifying the category of the telephone number corresponding to the number string of the first specified digit count according to an embodiment of the invention. Referring to Fig. 2, the method may include at least steps S202 to S210.
Step S202: judge whether the number string of the first specified digit count satisfies the attribute features of the first category of telephone number; if so, proceed to step S204; otherwise, proceed to step S210.
Step S204: determine at least two detection digit counts according to the attribute features of the first category of telephone number.
Step S206: split the target telephone number string to be identified using each detection digit count in turn, obtaining a splitting result for each.
In this step, for each detection digit count, the part of the target telephone number string to be identified that follows the number string of the first specified digit count is split using that detection digit count, obtaining a first split number and a second split number. The first and second split numbers are compared, and the count of positions at which the two have the same digit is taken as the splitting result for that detection digit count.
Step S208: according to the splitting results, choose the optimal detection digit count from the at least two detection digit counts and use it to complete the number string of the first specified digit count.
In this step, the counts of identical digits for the detection digit counts are compared, the detection digit count with the largest count of identical digits is chosen as the optimal detection digit count, and the number string of the first specified digit count is completed with that many digits.
Step S210: choose a new division rule conforming to the telephone number format, divide the target telephone number string to be identified again to obtain a new number string of the first specified digit count, and return to step S202.
In the example above, the telephone number corresponding to the number string "028", whose first specified digit count is 3, is identified as a fixed-line number. Because this fixed-line number does not begin with 400 or 800, two detection digit counts, 7 and 8, are determined.
For the detection digit count of 7, the part of the target telephone number string to be identified that follows the number string of the first specified digit count (that is, "6990619869906199") is split using this detection digit count, obtaining the first split number "6990619" and the second split number "8699061". The count of positions at which the two have the same digit is 1.
For the detection digit count of 8, the part of the target telephone number string to be identified that follows the number string of the first specified digit count (that is, "6990619869906199") is split using this detection digit count, obtaining the first split number "69906198" and the second split number "69906199". The count of positions at which the two have the same digit is 7.
Then, from the detection digit counts of 7 and 8, the one with the largest count of identical digits is chosen as the optimal detection digit count; that is, 8 is chosen. Completing the number string "028" of the first specified digit count with 8 digits gives the fixed-line number "02869906198". The rationale for this calculation is that two fixed-line or mobile numbers appearing in the same telephone unit tend to be highly similar.
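The comparison of detection digit counts in steps S204 to S208 can be sketched as follows. The function names are illustrative assumptions, and the tie-breaking behavior (the first candidate wins) is also an assumption, since the source does not say what happens when two detection digit counts score equally.

```python
def count_matches(a: str, b: str) -> int:
    """Count positions at which two split numbers carry the same digit."""
    return sum(x == y for x, y in zip(a, b))

def best_detection_length(rest: str, candidates=(7, 8)) -> int:
    """Pick the detection digit count whose two consecutive splits agree most."""
    scores = {}
    for n in candidates:
        first, second = rest[:n], rest[n:2 * n]   # first and second split numbers
        scores[n] = count_matches(first, second)
    # On a tie, max() keeps the earliest candidate (assumption, not from the source).
    return max(candidates, key=lambda n: scores[n])
```

Applied to "6990619869906199", the 7-digit split scores 1 and the 8-digit split scores 7, so 8 is selected, matching the worked example.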
In another embodiment of the present invention, after the telephone number corresponding to the number string of the first or second specified digit count is obtained by completion, that telephone number can be output. For example, after the fixed-line number "02869906198" is identified from the target telephone number string to be identified "0286990619869906199", the fixed-line number "02869906198" can be output.
Further, for the remaining telephone number string to be identified "69906199", the preprocessing operation in step S102, the division operation in step S104, and the identification operation in step S106 need to be executed again, until the remaining string has been fully identified. That is, the regional area code "028" is first supplemented, giving the target telephone number string to be identified "02869906199". Then, starting from the initial position, the target string "02869906199" is divided according to the division rule conforming to the fixed-line telephone number format, obtaining the number string "028", whose first specified digit count is 3. According to step S108, the telephone number corresponding to this 3-digit number string can then be identified as the fixed-line number "02869906199".
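The recursion over the remaining string can be sketched for the fixed-line case as follows. This is a simplified illustration under stated assumptions: a single known area code is passed in, only the 7- and 8-digit detection digit counts are considered, and a remainder of exactly 7 or 8 digits is taken whole (a rule the source does not spell out but which the example requires).

```python
def count_matches(a: str, b: str) -> int:
    return sum(x == y for x, y in zip(a, b))

def recognize(target: str, area_code: str = "028", found=None):
    """Recursively peel fixed-line numbers off `target` until nothing remains."""
    found = [] if found is None else found
    if not target.startswith(area_code):
        target = area_code + target            # re-supplement the area code
    rest = target[len(area_code):]
    if len(rest) < 7:
        return found                           # too short to be a number; stop
    if len(rest) in (7, 8):
        n = len(rest)                          # assumption: take the whole remainder
    else:
        n = max((7, 8), key=lambda k: count_matches(rest[:k], rest[k:2 * k]))
    found.append(area_code + rest[:n])
    remaining = rest[n:]
    return recognize(remaining, area_code, found) if remaining else found
```

Calling `recognize("0286990619869906199")` yields both fixed-line numbers from the worked example, with the second obtained on the recursive call.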
As another example, in Table 2 above, the target telephone number string to be identified is "400-890-0000 ext. 805530". Starting from the initial position, the target string is divided according to the division rule conforming to the fixed-line telephone number format, obtaining the number string "400", whose first specified digit count is 3. According to step S108, the telephone number corresponding to this 3-digit number string can then be identified as the fixed-line number "400-890-0000". The remaining telephone number string to be identified, "ext. 805530", is identified as an extension number and is appended to the end of the fixed-line number "400-890-0000", giving "400-890-0000 ext. 805530".
The implementation of the recursion-based telephone number recognition method provided by the present invention is described in detail below through a specific embodiment. Taking Chinese city, district, and county telephone numbers as an example, in this embodiment POI information is obtained from web pages, and original telephone number strings to be identified are extracted from the POI information. Fig. 3 shows a flowchart of a method for identifying telephone numbers according to another embodiment of the present invention. Referring to Fig. 3, the method may include at least steps S302 to S316.
Step S302: pre-split the original telephone number string to be identified according to separators.
In this step, it is determined whether the original telephone number string to be identified contains a specified separator. If it does, the original string is split at that separator, obtaining at least two target telephone number strings to be identified. Conversely, if the original string does not contain a specified separator, no pre-splitting is performed. Here, the specified separator can be an enumeration comma "、", a comma ",", a semicolon ";", a slash "/", a backslash "\", a vertical bar "|", and so on; the invention is not limited thereto.
For example, for the original telephone number string to be identified "028-84876877,1380233318" in Table 1 above, it is determined that the string contains a specified separator (that is, the comma ","). The original string is split at this separator, and the target telephone number strings to be identified after splitting are "028-84876877" and "1380233318".
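Step S302 can be sketched with a regular-expression split. The separator set below follows the list given in the text, with full-width variants of the comma and semicolon added as an assumption; the hyphen is deliberately not a separator because it occurs inside numbers such as "028-84876877".

```python
import re

# Separators named in the text; full-width "，" and "；" are an added assumption.
SEPARATORS = "、,，;；/\\|"
_SPLIT_RE = re.compile("[" + re.escape(SEPARATORS) + "]")

def pre_split(raw: str) -> list:
    """Split a raw string at any specified separator; drop empty pieces."""
    return [part.strip() for part in _SPLIT_RE.split(raw) if part.strip()]
```

A string without any separator comes back as a single-element list, which corresponds to the "no pre-splitting" branch.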
Step S304: remove the leading 86.
In this step, after the at least two target telephone number strings to be identified are obtained by splitting, it is determined for each target string whether its head has a national area code. If so, the national area code at the head of the target string is removed. Conversely, if the head of the target string does not have a national area code, no removal is performed.
For an original telephone number string to be identified that did not require pre-splitting in the separator-based pre-splitting step, it is further determined whether the head of the original string has a national area code. If so, the national area code at the head of the original string is removed. Conversely, if the head of the original string does not have a national area code, no removal is performed.
Taking the Chinese national area code 86 as an example, its common forms include +86, 086, 0086, and 86. The embodiment can judge whether 86 is the Chinese national area code according to the number of remaining telephone digits. For example, for the original telephone number string to be identified "86-0877-70104577010457" in Table 1 above, 86 is judged to be the Chinese national area code according to the number of remaining digits, so it is removed, and the processed target telephone number string to be identified is "0877-70104577010457". The "-" symbol following 86 is removed as well.
Step S306: supplement and de-duplicate the regional area code.
In this step, the target telephone number string to be identified, from which the national area code has been removed, can be analyzed. If the analysis shows that the head of the target string has a regional area code and that area code is incomplete, the area code is supplemented to make it complete. If the analysis shows that the head of the target string has a regional area code and that area code is repeated, the area code is de-duplicated.
For an original telephone number string to be identified that did not require pre-splitting in the separator-based pre-splitting step, or that did not require removal in the national area code identification and removal step, the original string itself is analyzed further. If the analysis shows that its head has a regional area code and that area code is incomplete, the area code is supplemented to make it complete. If the analysis shows that its head has a regional area code and that area code is repeated, the area code is de-duplicated.
For example, the original telephone number string to be identified "286990619869906199" in Table 1 above is analyzed. Its head has a regional area code, and that area code is incomplete, so the area code is supplemented to make it complete, giving the target telephone number string to be identified "0286990619869906199".
As another example, the original telephone number string to be identified "07710771324579718602365784" in Table 1 above is analyzed. Its head has a regional area code, and that area code is repeated, so the area code is de-duplicated, giving the target telephone number string to be identified "0771324579718602365784".
Step S308: determine, from the first 7 digits of the target telephone number string to be identified, whether it is a mobile number; if not, proceed to step S310; if so, proceed to step S312.
In this step, the division rule conforming to the mobile phone number format is chosen to divide the target telephone number string to be identified, obtaining a number string whose first specified digit count is 7. It is then judged whether this 7-digit number string satisfies the attribute features of the first category of telephone number (that is, mobile numbers). If so, the 7-digit number string is completed according to the attribute features of the first category of telephone number (that is, mobile numbers), obtaining the corresponding telephone number (that is, a mobile number).
Step S310: backward detection digit count judgment.
In this step, if the number string whose first specified digit count is 7 in step S308 does not satisfy the attribute features of the first category of telephone number (that is, mobile numbers), the division rule conforming to the fixed-line telephone number format is chosen and the target telephone number string to be identified is divided again, obtaining a number string whose second specified digit count is 3, 4, or 5. It is then judged whether this number string satisfies the attribute features of the second category of telephone number (that is, fixed-line numbers). If so, the number string is completed according to the attribute features of the second category of telephone number (that is, fixed-line numbers), obtaining the corresponding telephone number (that is, a fixed-line number).
For example, in Table 2 above, after the preprocessing operation is performed on the original telephone number string to be identified "286990619869906199", the target telephone number string to be identified "0286990619869906199" is obtained. Next, starting from the initial position, the target string is divided according to the division rule conforming to the mobile phone number format, obtaining the number string "0286990", whose first specified digit count is 7. This 7-digit number string does not satisfy the attribute features of the first category of telephone number (that is, mobile numbers), so the division rule conforming to the fixed-line telephone number format is chosen and the target string is divided again, obtaining the number string "028", whose second specified digit count is 3. The telephone number corresponding to the 3-digit number string "028" is identified as a fixed-line number, which is either "0286990619" (7-digit subscriber number) or "02869906198" (8-digit subscriber number).
In the example above, the telephone number identified from the target telephone number string to be identified "0286990619869906199" for the 3-digit number string of the second specified digit count is a fixed-line number, either "0286990619" (7-digit subscriber number) or "02869906198" (8-digit subscriber number). To choose a suitable number of completion digits and improve the recognition rate of telephone numbers, the embodiment of the present invention provides a backward detection digit count judgment scheme for completing the number string of the second specified digit count according to the attribute features of the second category of telephone number. That is, at least two detection digit counts are determined according to the attribute features of the second category of telephone number, and the target string is then split using each detection digit count in turn, obtaining splitting results. Afterwards, according to the splitting results, the optimal detection digit count is chosen from the at least two detection digit counts and used to complete the number string of the second specified digit count.
Further, for each detection digit count, the part of the target telephone number string to be identified that follows the number string of the second specified digit count is split using that detection digit count, obtaining a first split number and a second split number. The first and second split numbers are compared, and the count of positions at which the two have the same digit is taken as the splitting result for that detection digit count. Then the counts of identical digits for the detection digit counts are compared, the detection digit count with the largest count of identical digits is chosen as the optimal detection digit count, and the number string of the second specified digit count is completed with that many digits.
In the example above, the telephone number corresponding to the number string "028", whose second specified digit count is 3, is identified as a fixed-line number, either "0286990619" (7-digit subscriber number) or "02869906198" (8-digit subscriber number). To choose a suitable number of completion digits, two detection digit counts, 7 and 8, are determined.
For 7 detection digits, using the detection digit to target telephone number strings to be identified, the second specified digit Number series after telephone number strings (that is, 6990619869906199) carry out cutting, obtain the first cutting number " 6990619 " and the second cutting number " 8699061 " determines that the identical digit of number is 1 on the two corresponding position.
For 8 detection digits, using the detection digit to target telephone number strings to be identified, the second specified digit Number series after telephone number strings (that is, 6990619869906199) carry out cutting, obtain the first cutting number " 69906198 " and the second cutting number " 69906199 " determines that the identical digit of number is 7 on the two corresponding position.
Then, from 7 and 8 detection digits, it is maximum as optimized detection to choose the identical digit of corresponding number Digit, i.e. the detection digit of selection 8 are optimal to number series " 028 " completion of the second specified digit as optimized detection digit The fixed-line telephone that detection digit obtains is " 02869906198 ".Here, the foundation of this computational methods is selected to occur from same Two fixed-line telephones or mobile phone in telephone unit have prodigious similitude.
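The comparison in the example above can be sketched in Python as follows. This is only a minimal illustration under assumptions: the function names `count_matching_positions` and `choose_detection_digits` are hypothetical and do not come from the embodiment, and ties are resolved in favour of the first candidate, which the text does not specify.

```python
def count_matching_positions(rest: str, n: int) -> int:
    """Cut the first two n-digit numbers from `rest` and count the
    positions at which they carry the same digit."""
    first, second = rest[:n], rest[n:2 * n]
    return sum(a == b for a, b in zip(first, second))


def choose_detection_digits(rest: str, candidates=(7, 8)) -> int:
    """Return the candidate digit count with the most matching positions.
    Assumption: on a tie, the first candidate wins."""
    return max(candidates, key=lambda n: count_matching_positions(rest, n))


target = "0286990619869906199"
area_code, rest = target[:3], target[3:]   # "028" and the string after it
best = choose_detection_digits(rest)       # optimal detection digit count
completed = area_code + rest[:best]        # area code completed with `best` digits
```

With the string from the example, the 7-digit cut matches at 1 position and the 8-digit cut at 7 positions, so the area code is completed with 8 digits.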
Step S312: judge whether there is an error; if not, continue to step S314; if so, end this flow.
In this step, it can be judged whether the telephone number corresponding to the number string whose first specified number of digits is 7 is accurate, for example whether digits are missing or whether the number is empty. It can also be judged whether the telephone number determined by the backward detection-digit judgment in S310 is accurate.
Step S314: output the telephone number.
Step S316: judge whether the length of the remaining telephone number string is greater than 0; if so, return to step S304; if not, end this flow.
In the embodiment of the present invention, a preprocessing operation related to the telephone number format is first performed on the original telephone number string to be identified (in order: pre-cutting by separator, identification and removal of the national area code, and supplementing and de-duplication of the regional area code), so that the target telephone number string to be identified obtained after preprocessing conforms to the telephone number format. This makes it easier to subsequently recognize telephone numbers in the preprocessed target string and improves the recognition rate. In addition, the embodiment takes into account the features of different categories of telephone number (fixed-line and mobile): the target telephone number string to be identified is divided using the division rules of the telephone number formats of the different categories, and the category of the corresponding telephone number is identified from the number string of the first specified number of digits obtained by the division, achieving effective recognition of different categories of telephone number.
Further, after the category of the telephone number corresponding to the number string of the first specified number of digits has been identified, if a remaining telephone number string to be identified still exists, the embodiment applies the recursive operation to it repeatedly until all of the remaining string has been identified.
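The recursive structure described above can be sketched as follows. This is a simplified illustration, not the embodiment itself: the function name `recognize` and the two division rules (an 11-digit mobile number starting with "1", and a fixed-line number made of a 3-digit area code starting with "0" plus 8 digits) are assumptions made for the sketch, and the preprocessing and detection-digit steps are omitted.

```python
def recognize(s: str) -> list:
    """Peel one telephone number off the head of `s`, then recurse on the rest."""
    if not s:                                   # nothing left: recursion ends
        return []
    if s.startswith("1") and len(s) >= 11:      # assumed mobile rule: 11 digits
        head, kind = s[:11], "mobile"
    elif s.startswith("0") and len(s) >= 11:    # assumed fixed-line rule: 3 + 8 digits
        head, kind = s[:11], "fixed-line"
    else:                                       # no rule matches: stop (error case)
        return []
    # Recursive operation: repeat on the remaining string to be identified.
    return [(kind, head)] + recognize(s[len(head):])


numbers = recognize("1381234567802869906198")
```

Each call handles only the head of the string, so the recursion ends when the length of the remaining string reaches 0, mirroring the check in step S316.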
In addition, the embodiment makes use of the fact that two fixed-line or mobile telephone numbers of the same unit tend to be very similar: it probes and identifies the target telephone number string to be identified using the backward detection-digit judgment scheme, further improving the accuracy of telephone number recognition.
Based on the recursion-based telephone number recognition method provided by the above embodiments, and on the same inventive concept, an embodiment of the present invention further provides a recursion-based telephone number recognition device. Fig. 4 shows a schematic structural diagram of the recursion-based telephone number recognition device according to an embodiment of the present invention. As shown in Fig. 4, the device may include at least a preprocessing module 410, a division module 420, an identification module 430 and a recursion module 440.
The functions of the components of the recursion-based telephone number recognition device of the embodiment, and the connections between them, are now described:
The preprocessing module 410 is adapted to perform a preprocessing operation related to the telephone number format on the original telephone number string to be identified, to obtain a processed target telephone number string to be identified;
The division module 420, coupled with the preprocessing module 410, is adapted to divide the target telephone number string to be identified, starting from the initial position and according to a division rule that conforms to the telephone number format, to obtain the number string of the first specified number of digits;
The identification module 430, coupled with the division module 420, is adapted to identify the category of the telephone number corresponding to the number string of the first specified number of digits;
The recursion module 440, coupled with the identification module 430, is adapted to, if a remaining telephone number string to be identified still exists, repeatedly apply recursion to the remaining telephone number string to be identified until all of it has been identified.
In an embodiment of the present invention, the recursion module 440 is further adapted to:
For the remaining telephone number string to be identified, trigger the preprocessing module to perform the preprocessing operation, the division module to perform the division operation and the identification module to perform the identification operation, until all of the remaining telephone number string to be identified has been identified.
In an embodiment of the present invention, the preprocessing module 410 is further adapted to:
Determine whether the original telephone number string to be identified contains a specified separator;
If the original telephone number string to be identified contains a specified separator, cut the original telephone number string to be identified at the separator, to obtain at least two target telephone number strings to be identified.
In an embodiment of the present invention, the specified separator includes at least one of the following: pause mark (、), comma, semicolon, slash, backslash, vertical bar.
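The pre-cutting by separator can be sketched with Python's standard `re` module as follows. The name `pre_cut` is hypothetical, and the character set below (ASCII punctuation plus the pause mark 、) is an assumption; the embodiment does not say whether full-width variants of the other separators are also covered.

```python
import re

# Assumed separator set: pause mark, comma, semicolon, slash, backslash, vertical bar.
SEPARATORS = "、,;/\\|"
_SPLIT_RE = re.compile("[" + re.escape(SEPARATORS) + "]")


def pre_cut(raw: str) -> list:
    """Cut the original string at every specified separator, dropping empty parts."""
    return [part for part in _SPLIT_RE.split(raw) if part]


parts = pre_cut("02869906198/69906199")
```

A string without any separator comes back as a single-element list, so the later steps can treat both cases uniformly.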
In an embodiment of the present invention, the preprocessing module 410 is further adapted to:
After the at least two target telephone number strings to be identified have been obtained by cutting, determine, for each target telephone number string to be identified, whether its head carries a national area code;
If so, remove the national area code from the head of the target telephone number string to be identified.
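A minimal sketch of the national area code check follows. The prefixes "+86" and "0086" and the name `strip_national_code` are assumptions chosen for illustration; the text above does not list which forms of the national area code are recognized.

```python
# Assumed written forms of the national area code; not taken from the embodiment.
NATIONAL_PREFIXES = ("+86", "0086")


def strip_national_code(s: str) -> str:
    """Remove a national area code from the head of `s`, if one is present."""
    for prefix in NATIONAL_PREFIXES:
        if s.startswith(prefix):
            return s[len(prefix):]
    return s                      # no national area code: leave the string as is


stripped = strip_national_code("+8602869906198")
```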
In an embodiment of the present invention, the preprocessing module 410 is further adapted to:
After the national area code has been removed from the head of the target telephone number string to be identified, analyze the target telephone number string to be identified from which the national area code has been removed;
If the head of the target telephone number string to be identified carries a regional area code and the regional area code is incomplete, supplement the regional area code so that it is complete;
If the head of the target telephone number string to be identified carries a regional area code and the regional area code is repeated, de-duplicate the regional area code.
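The two regional area code repairs can be sketched as follows. This rests on assumptions the text above does not confirm: "incomplete" is taken to mean that the leading "0" of a known area code was dropped, the table `KNOWN_AREA_CODES` is a hypothetical two-entry sample, and the name `normalize_area_code` is invented for the sketch.

```python
# Hypothetical sample of known regional area codes (illustration only).
KNOWN_AREA_CODES = ("010", "028")


def normalize_area_code(s: str, area_codes=KNOWN_AREA_CODES) -> str:
    """De-duplicate a repeated area code and restore an assumed missing leading 0."""
    for code in area_codes:
        # Repeated area code, e.g. "028028...": keep a single copy.
        while s.startswith(code + code):
            s = s[len(code):]
        # Assumed form of "incomplete": the leading "0" was dropped, e.g. "28...".
        if not s.startswith("0") and s.startswith(code[1:]):
            s = "0" + s
    return s


fixed = normalize_area_code("2869906198")
```

A real implementation would need a fuller area code table and a way to tell a truncated area code apart from a local number that happens to start with the same digits; the sketch ignores that ambiguity.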
In an embodiment of the present invention, the identification module 430 is further adapted to:
Judge whether the number string of the first specified number of digits conforms to the attribute features of a first-category telephone number;
If so, determine at least two detection digit counts according to the attribute features of the first-category telephone number;
Cut the target telephone number string to be identified using each detection digit count in turn, to obtain cutting results;
According to the cutting results, choose the optimal detection digit count from the at least two detection digit counts and use it to complete the number string of the first specified number of digits.
In an embodiment of the present invention, the identification module 430 is further adapted to:
For each detection digit count, cut the telephone number string that follows the number string of the first specified number of digits in the target telephone number string to be identified using that detection digit count, to obtain a first cutting number and a second cutting number;
Compare the first cutting number and the second cutting number, and determine the number of positions at which the two have identical digits, as the cutting result for that detection digit count.
In an embodiment of the present invention, the identification module 430 is further adapted to:
Compare the numbers of identical digits corresponding to the detection digit counts;
From the detection digit counts, choose the one with the largest number of identical digits as the optimal detection digit count;
Complete the number string of the first specified number of digits with the optimal detection digit count of digits.
In an embodiment of the present invention, the division module 420 is further adapted to, after the identification module has judged whether the number string of the first specified number of digits conforms to the attribute features of the first-category telephone number, and if the number string of the first specified number of digits does not conform to those attribute features, choose a new division rule that conforms to the telephone number format and divide the target telephone number string to be identified again, to obtain the number string of the second specified number of digits;
The identification module 430 is further adapted to judge whether the number string of the second specified number of digits conforms to the attribute features of a second-category telephone number and, if so, complete the number string of the second specified number of digits according to the attribute features of the second-category telephone number.
In an embodiment of the present invention, as shown in Fig. 5, the device shown in Fig. 4 may further include an acquisition module 450, coupled with the preprocessing module 410 and adapted to obtain the original telephone number string to be identified through the following steps:
Obtain point of interest (POI) information from a web page;
Extract the original telephone number string to be identified from the POI information.
According to any one of the above preferred embodiments or a combination of several of them, the embodiments of the present invention can achieve the following advantageous effects:
In the embodiment of the present invention, a preprocessing operation related to the telephone number format is first performed on the original telephone number string to be identified, so that the target telephone number string to be identified obtained after preprocessing conforms to the telephone number format. This makes it easier to subsequently recognize telephone numbers in the preprocessed target string and improves the recognition rate. In addition, the embodiment takes into account the features of different categories of telephone number (such as fixed-line or mobile): the target telephone number string to be identified is divided using the division rules of the telephone number formats of the different categories, and the category of the corresponding telephone number is identified from the number string of the first specified number of digits obtained by the division, achieving effective recognition of different categories of telephone number. Further, after the category of the telephone number corresponding to the number string of the first specified number of digits has been identified, if a remaining telephone number string to be identified still exists, the embodiment applies the recursive operation to it repeatedly until all of the remaining string has been identified.
In addition, the embodiment makes use of the fact that two fixed-line or mobile telephone numbers of the same unit tend to be very similar: it probes and identifies the target telephone number string to be identified using the backward detection-digit judgment scheme, further improving the accuracy of telephone number recognition.
In the description provided here, numerous specific details are set forth. It should be understood, however, that embodiments of the present invention can be practiced without these specific details. In some instances, well-known methods, structures and techniques are not shown in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline the disclosure and aid understanding of one or more of the inventive aspects, features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof in the above description of exemplary embodiments. However, the disclosed method should not be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in fewer than all features of a single embodiment disclosed above. Therefore, the claims following the detailed description are hereby expressly incorporated into the detailed description, with each claim standing on its own as a separate embodiment of the present invention.
Those skilled in the art will appreciate that the modules in the device of an embodiment can be adaptively changed and arranged in one or more devices different from that embodiment. The modules, units or components of an embodiment can be combined into one module, unit or component, and can furthermore be divided into multiple sub-modules, sub-units or sub-components. Except where at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings) and all processes or units of any method or device so disclosed may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by an alternative feature serving the same, an equivalent or a similar purpose.
In addition, those skilled in the art will understand that, although some embodiments described herein include certain features that are included in other embodiments and not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the claims, any of the claimed embodiments can be used in any combination.
The component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will understand that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the recursion-based telephone number recognition device according to embodiments of the present invention. The present invention may also be implemented as a device or apparatus program (for example, a computer program and a computer program product) for performing part or all of the method described herein. Such a program implementing the present invention may be stored on a computer-readable medium, or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
It should be noted that the above embodiments illustrate rather than limit the invention, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference sign placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The use of the words first, second and third does not indicate any order; these words can be interpreted as names.
Thus, those skilled in the art will recognize that, although multiple exemplary embodiments of the present invention have been shown and described in detail herein, many other variations or modifications consistent with the principles of the invention can still be directly determined or derived from the present disclosure without departing from the spirit and scope of the invention. Therefore, the scope of the invention should be understood and deemed to cover all such other variations or modifications.
The embodiments of the present invention further disclose: A1. A recursion-based telephone number recognition method, comprising:
A preprocessing operation: performing preprocessing related to the telephone number format on an original telephone number string to be identified, to obtain a processed target telephone number string to be identified;
A division operation: dividing the target telephone number string to be identified, starting from the initial position and according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits;
An identification operation: identifying the category of the telephone number corresponding to the number string of the first specified number of digits;
A recursive operation: if a remaining telephone number string to be identified still exists, repeatedly applying recursion to the remaining telephone number string to be identified until all of it has been identified.
A2. The method according to A1, wherein repeatedly applying recursion to the remaining telephone number string to be identified comprises:
Performing the preprocessing operation, the division operation and the identification operation on the remaining telephone number string to be identified.
A3. The method according to A1 or A2, wherein performing the preprocessing operation related to the telephone number format on the original telephone number string to be identified, to obtain the processed target telephone number string to be identified, comprises:
Determining whether the original telephone number string to be identified contains a specified separator;
If the original telephone number string to be identified contains a specified separator, cutting the original telephone number string to be identified at the separator, to obtain at least two target telephone number strings to be identified.
A4. The method according to any one of A1 to A3, wherein the specified separator includes at least one of the following: pause mark (、), comma, semicolon, slash, backslash, vertical bar.
A5. The method according to any one of A1 to A4, wherein, after the at least two target telephone number strings to be identified have been obtained by cutting, the method further comprises:
For each target telephone number string to be identified, determining whether its head carries a national area code;
If so, removing the national area code from the head of the target telephone number string to be identified.
A6. The method according to any one of A1 to A5, wherein, after the national area code has been removed from the head of the target telephone number string to be identified, the method further comprises:
Analyzing the target telephone number string to be identified from which the national area code has been removed;
If the head of the target telephone number string to be identified carries a regional area code and the regional area code is incomplete, supplementing the regional area code so that it is complete;
If the head of the target telephone number string to be identified carries a regional area code and the regional area code is repeated, de-duplicating the regional area code.
A7. The method according to any one of A1 to A6, wherein identifying the category of the telephone number corresponding to the number string of the first specified number of digits comprises:
Judging whether the number string of the first specified number of digits conforms to the attribute features of a first-category telephone number;
If so, determining at least two detection digit counts according to the attribute features of the first-category telephone number;
Cutting the target telephone number string to be identified using each detection digit count in turn, to obtain cutting results;
According to the cutting results, choosing the optimal detection digit count from the at least two detection digit counts and using it to complete the number string of the first specified number of digits.
A8. The method according to any one of A1 to A7, wherein cutting the target telephone number string to be identified using each detection digit count in turn, to obtain cutting results, comprises:
For each detection digit count, cutting the telephone number string that follows the number string of the first specified number of digits in the target telephone number string to be identified using that detection digit count, to obtain a first cutting number and a second cutting number;
Comparing the first cutting number and the second cutting number, and determining the number of positions at which the two have identical digits, as the cutting result for that detection digit count.
A9. The method according to any one of A1 to A8, wherein choosing, according to the cutting results, the optimal detection digit count from the at least two detection digit counts and using it to complete the number string of the first specified number of digits comprises:
Comparing the numbers of identical digits corresponding to the detection digit counts;
From the detection digit counts, choosing the one with the largest number of identical digits as the optimal detection digit count;
Completing the number string of the first specified number of digits with the optimal detection digit count of digits.
A10. The method according to any one of A1 to A9, wherein, after judging whether the number string of the first specified number of digits conforms to the attribute features of the first-category telephone number, the method further comprises:
If the number string of the first specified number of digits does not conform to the attribute features of the first-category telephone number, choosing a new division rule that conforms to the telephone number format and dividing the target telephone number string to be identified again, to obtain a number string of a second specified number of digits;
Judging whether the number string of the second specified number of digits conforms to the attribute features of a second-category telephone number;
If so, completing the number string of the second specified number of digits according to the attribute features of the second-category telephone number.
A11. The method according to any one of A1 to A10, wherein the original telephone number string to be identified is obtained through the following steps:
Obtaining point of interest (POI) information from a web page;
Extracting the original telephone number string to be identified from the POI information.
B12. A recursion-based telephone number recognition device, comprising:
A preprocessing module, adapted to perform a preprocessing operation related to the telephone number format on an original telephone number string to be identified, to obtain a processed target telephone number string to be identified;
A division module, adapted to divide the target telephone number string to be identified, starting from the initial position and according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits;
An identification module, adapted to identify the category of the telephone number corresponding to the number string of the first specified number of digits;
A recursion module, adapted to, if a remaining telephone number string to be identified still exists, repeatedly apply recursion to the remaining telephone number string to be identified until all of it has been identified.
B13. The device according to B12, wherein the recursion module is further adapted to:
For the remaining telephone number string to be identified, trigger the preprocessing module to perform the preprocessing operation, the division module to perform the division operation and the identification module to perform the identification operation, until all of the remaining telephone number string to be identified has been identified.
B14. The device according to B12 or B13, wherein the preprocessing module is further adapted to:
Determine whether the original telephone number string to be identified contains a specified separator;
If the original telephone number string to be identified contains a specified separator, cut the original telephone number string to be identified at the separator, to obtain at least two target telephone number strings to be identified.
B15. The device according to any one of B12 to B14, wherein the specified separator includes at least one of the following: pause mark (、), comma, semicolon, slash, backslash, vertical bar.
B16. The device according to any one of B12 to B15, wherein the preprocessing module is further adapted to:
After the at least two target telephone number strings to be identified have been obtained by cutting, determine, for each target telephone number string to be identified, whether its head carries a national area code;
If so, remove the national area code from the head of the target telephone number string to be identified.
B17. The device according to any one of B12 to B16, wherein the preprocessing module is further adapted to:
After the national area code has been removed from the head of the target telephone number string to be identified, analyze the target telephone number string to be identified from which the national area code has been removed;
If the head of the target telephone number string to be identified carries a regional area code and the regional area code is incomplete, supplement the regional area code so that it is complete;
If the head of the target telephone number string to be identified carries a regional area code and the regional area code is repeated, de-duplicate the regional area code.
B18. The device according to any one of B12 to B17, wherein the identification module is further adapted to:
Judge whether the number string of the first specified number of digits conforms to the attribute features of a first-category telephone number;
If so, determine at least two detection digit counts according to the attribute features of the first-category telephone number;
Cut the target telephone number string to be identified using each detection digit count in turn, to obtain cutting results;
According to the cutting results, choose the optimal detection digit count from the at least two detection digit counts and use it to complete the number string of the first specified number of digits.
B19. The device according to any one of B12 to B18, wherein the identification module is further adapted to:
For each detection digit count, cut the telephone number string that follows the number string of the first specified number of digits in the target telephone number string to be identified using that detection digit count, to obtain a first cutting number and a second cutting number;
Compare the first cutting number and the second cutting number, and determine the number of positions at which the two have identical digits, as the cutting result for that detection digit count.
B20. The device according to any one of B12 to B19, wherein the identification module is further adapted to:
Compare the numbers of identical digits corresponding to the detection digit counts;
From the detection digit counts, choose the one with the largest number of identical digits as the optimal detection digit count;
Complete the number string of the first specified number of digits with the optimal detection digit count of digits.
B21. The device according to any one of B12 to B20, wherein:
The division module is further adapted to, after the identification module has judged whether the number string of the first specified number of digits conforms to the attribute features of the first-category telephone number, and if the number string of the first specified number of digits does not conform to those attribute features, choose a new division rule that conforms to the telephone number format and divide the target telephone number string to be identified again, to obtain a number string of a second specified number of digits;
The identification module is further adapted to judge whether the number string of the second specified number of digits conforms to the attribute features of a second-category telephone number and, if so, complete the number string of the second specified number of digits according to the attribute features of the second-category telephone number.
B22. The device according to any one of B12 to B21, further comprising an acquisition module adapted to obtain the original telephone number string to be identified through the following steps:
Obtain point of interest (POI) information from a web page;
Extract the original telephone number string to be identified from the POI information.

Claims (20)

1. A recursion-based telephone number recognition method, comprising:
A preprocessing operation: performing preprocessing related to the telephone number format on an original telephone number string to be identified, to obtain a processed target telephone number string to be identified;
A division operation: starting from the initial position, dividing the target telephone number string to be identified according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits;
An identification operation: identifying the category of the telephone number corresponding to the number string of the first specified number of digits;
A recursive operation: if a remaining telephone number string to be identified still exists, performing the preprocessing operation, the division operation and the identification operation on the remaining string, until all remaining telephone number strings to be identified have been identified;
Wherein the preprocessing operation includes at least one of the following:
Pre-splitting by separator; identifying and removing the country code; supplementing and de-duplicating the area code.
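As a rough illustration of how the four operations of claim 1 fit together, the following Python sketch may help. It is not the patented implementation: the function names (`preprocess`, `divide`, `identify`, `recognize`), the 11-digit division width, the `+86`/`0086` country-code prefixes and the mobile-number pattern are all assumptions chosen for this example.

```python
import re
from typing import List, Tuple

# Assumed attribute feature of a mobile number: 11 digits, starting with 1,
# second digit 3-9. The claim itself does not fix any concrete pattern.
MOBILE_RE = re.compile(r"1[3-9]\d{9}")


def preprocess(raw: str) -> str:
    """Preprocessing operation (simplified): drop whitespace and a leading
    country code written as +86 or 0086 (assumed spellings)."""
    digits = re.sub(r"\s+", "", raw)
    return re.sub(r"^(?:\+86|0086)", "", digits)


def divide(target: str, width: int = 11) -> Tuple[str, str]:
    """Division operation: take `width` characters from the initial position
    and return them together with the remaining string."""
    return target[:width], target[width:]


def identify(number: str) -> str:
    """Identification operation: classify the divided number string."""
    return "mobile" if MOBILE_RE.fullmatch(number) else "unknown"


def recognize(raw: str) -> List[Tuple[str, str]]:
    """Recursive operation: repeat preprocess/divide/identify on whatever
    remains until the whole string has been consumed."""
    target = preprocess(raw)
    if not target:
        return []
    head, rest = divide(target)
    return [(head, identify(head))] + recognize(rest)
```

Under these assumptions, `recognize("+861381234567813912345678")` yields two entries, one for each 11-digit number. The recursion terminates because every call consumes at least one character of the remaining string.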
2. The method according to claim 1, wherein performing the preprocessing related to the telephone number format on the original telephone number string to be identified, to obtain the processed target telephone number string to be identified, comprises:
Determining whether the original telephone number string to be identified contains a specified separator;
If the original telephone number string to be identified contains a specified separator, splitting the original string at the separator to obtain at least two target telephone number strings to be identified.
3. The method according to claim 2, wherein the specified separator includes at least one of the following: enumeration comma (、), comma, semicolon, slash, backslash, vertical bar.
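The separator pre-splitting of claims 2 and 3 can be sketched in a few lines of Python. The name `pre_split` and the exact character set are assumptions for illustration; real data may also need full-width variants of the comma and semicolon, which the claim does not spell out.

```python
import re
from typing import List

# Separators listed in claim 3: enumeration comma, comma, semicolon,
# slash, backslash, vertical bar (ASCII forms assumed where applicable).
SEPARATORS = "、,;/\\|"
_SPLIT_RE = re.compile("[" + re.escape(SEPARATORS) + "]")


def pre_split(raw: str) -> List[str]:
    """Split the original string at any specified separator and drop
    empty fragments; a string without separators comes back unchanged
    as a single-element list."""
    return [part.strip() for part in _SPLIT_RE.split(raw) if part.strip()]
```

For example, `pre_split("010-12345678/13812345678、13912345678")` returns three target strings. The hyphen is deliberately not treated as a separator here, because it commonly sits between an area code and a local number.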
4. The method according to claim 2, further comprising, after obtaining the at least two target telephone number strings to be identified:
For each target telephone number string to be identified, determining whether the head of the string carries a country code;
If so, removing the country code from the head of the target telephone number string to be identified.
5. The method according to claim 4, further comprising, after removing the country code from the head of the target telephone number string to be identified:
Analyzing the target telephone number string to be identified from which the country code has been removed;
If the head of the target telephone number string to be identified carries an area code and that area code is incomplete, supplementing the area code to make it complete;
If the head of the target telephone number string to be identified carries an area code and that area code is repeated, de-duplicating the area code.
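A minimal Python sketch of the country-code removal and area-code repair described in claims 4 and 5 follows. Everything concrete in it is an assumption: the `+86`/`0086` prefixes, the tiny `KNOWN_AREA_CODES` sample table, the reading of "incomplete" as "missing its leading 0", and the function names.

```python
import re

# Assumed spellings of the country code at the head of a string.
COUNTRY_CODE_RE = re.compile(r"^(?:\+86|0086)")

# Hypothetical sample table; a real system would need a full area-code list.
KNOWN_AREA_CODES = ("010", "021", "0755")


def strip_country_code(number: str) -> str:
    """Remove a country code from the head of the string, if present."""
    return COUNTRY_CODE_RE.sub("", number, count=1)


def normalize_area_code(number: str) -> str:
    """De-duplicate a repeated area code, or restore a missing leading 0."""
    for code in KNOWN_AREA_CODES:
        # De-duplication: "010010..." collapses to "010...".
        while number.startswith(code + code):
            number = number[len(code):]
        if number.startswith(code):
            return number
    for code in KNOWN_AREA_CODES:
        # Supplement: "10..." is treated as "010..." with the 0 dropped.
        if number.startswith(code[1:]):
            return "0" + number
    return number
```

With this sketch, `normalize_area_code(strip_country_code("+861012345678"))` gives `"01012345678"`. A local number that happens to begin with the same digits as its area code would be wrongly shortened, so a production version would need additional length checks.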
6. The method according to claim 1, wherein identifying the category of the telephone number corresponding to the number string of the first specified number of digits comprises:
Judging whether the number string of the first specified number of digits matches the attribute features of a first category of telephone number;
If so, determining at least two detection digit counts according to the attribute features of the first category of telephone number;
Splitting the target telephone number string to be identified with each detection digit count in turn, to obtain splitting results;
According to the splitting results, selecting an optimal detection digit count from the at least two detection digit counts and using it to complete the number string of the first specified number of digits.
7. The method according to claim 6, wherein splitting the target telephone number string to be identified with each detection digit count in turn, to obtain splitting results, comprises:
For each detection digit count, using that detection digit count to split the part of the target telephone number string to be identified that follows the number string of the first specified number of digits, obtaining a first split number and a second split number;
Comparing the first split number with the second split number and determining the number of corresponding positions at which the two carry the same digit, as the splitting result for that detection digit count.
8. The method according to claim 7, wherein selecting, according to the splitting results, an optimal detection digit count from the at least two detection digit counts and using it to complete the number string of the first specified number of digits comprises:
Comparing the numbers of identical digits obtained for the respective detection digit counts;
Selecting, from the detection digit counts, the one with the largest number of identical digits as the optimal detection digit count;
Completing the number string of the first specified number of digits by the optimal detection digit count.
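One way to read claims 6 to 8 is sketched below in Python. The interpretation is an assumption: the number string of the first specified number of digits is taken to be a landline area code, the candidate detection digit counts are taken to be 7 and 8 (possible local-number lengths), and consecutive numbers in one string are assumed to share most of their digits. The names `matching_positions`, `best_detection_length` and `complete` are hypothetical.

```python
from typing import Sequence


def matching_positions(first: str, second: str) -> int:
    """Count corresponding positions where both split numbers hold the
    same digit (the splitting result of claim 7)."""
    return sum(1 for a, b in zip(first, second) if a == b)


def best_detection_length(tail: str, candidates: Sequence[int] = (7, 8)) -> int:
    """Try each candidate length on the text after the area code and keep
    the one whose first and second split numbers agree in most positions."""
    best_len, best_score = candidates[0], -1
    for n in candidates:
        first, second = tail[:n], tail[n:2 * n]
        score = matching_positions(first, second)
        if score > best_score:
            best_len, best_score = n, score
    return best_len


def complete(area_code: str, tail: str) -> str:
    """Complete the area code with as many digits as the optimal
    detection length indicates."""
    return area_code + tail[:best_detection_length(tail)]
```

For a tail such as `"8765432187654322"`, splitting at 8 digits gives two numbers that agree in seven positions, while splitting at 7 gives none, so 8 is chosen. When only one number follows the area code there is no second split number to compare, and this sketch simply falls back to the first candidate.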
9. The method according to claim 6, further comprising, after judging whether the number string of the first specified number of digits matches the attribute features of a first category of telephone number:
If the number string of the first specified number of digits does not match the attribute features of the first category of telephone number, selecting a new division rule that conforms to the telephone number format and re-dividing the target telephone number string to be identified, to obtain a number string of a second specified number of digits;
Judging whether the number string of the second specified number of digits matches the attribute features of a second category of telephone number;
If so, completing the number string of the second specified number of digits according to the attribute features of the second category of telephone number.
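The fallback from one division rule to another in claim 9 might look like the following Python sketch. The concrete choices are assumptions made only for illustration: the first rule takes three digits and tests them against a three-digit area-code pattern (first category, landline), and the second rule takes two digits and tests them against a mobile prefix (second category), completing to 11 digits. The claim does not say which categories or widths are used.

```python
import re
from typing import Tuple

# Assumed first-category feature: three-digit area codes of the form 010 or 02x.
AREA_CODE_RE = re.compile(r"0(?:10|2\d)")
# Assumed second-category feature: mobile prefixes 13 to 19.
MOBILE_PREFIX_RE = re.compile(r"1[3-9]")
MOBILE_LENGTH = 11  # assumed full length of a mobile number


def classify_and_complete(target: str) -> Tuple[str, str]:
    """Apply the first division rule; if its result does not match the
    first category, re-divide with a second rule and test the second
    category, completing the number when it matches."""
    first = target[:3]  # first specified number of digits
    if AREA_CODE_RE.fullmatch(first):
        # Completion of a landline number is left to a separate step.
        return "landline", first
    second = target[:2]  # second specified number of digits
    if MOBILE_PREFIX_RE.fullmatch(second) and len(target) >= MOBILE_LENGTH:
        return "mobile", target[:MOBILE_LENGTH]
    return "unknown", target
```

For instance, `classify_and_complete("13812345678139")` fails the area-code test on `"138"`, is re-divided to `"13"`, matches the mobile prefix and is completed to `"13812345678"`.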
10. The method according to claim 1, wherein the original telephone number string to be identified is obtained through the following steps:
Obtaining point of interest (POI) information from a web page;
Extracting the original telephone number string to be identified from the POI information.
11. A recursion-based telephone number recognition device, comprising:
A preprocessing module, adapted to perform a preprocessing operation related to the telephone number format on an original telephone number string to be identified, to obtain a processed target telephone number string to be identified;
A division module, adapted to divide the target telephone number string to be identified, starting from the initial position and according to a division rule that conforms to the telephone number format, to obtain a number string of a first specified number of digits;
An identification module, adapted to identify the category of the telephone number corresponding to the number string of the first specified number of digits;
A recursion module, adapted to, if a remaining telephone number string to be identified still exists, trigger the preprocessing module to perform the preprocessing operation, the division module to perform the division operation and the identification module to perform the identification operation on the remaining string, until all remaining telephone number strings to be identified have been identified;
Wherein the preprocessing operation includes at least one of the following:
Pre-splitting by separator; identifying and removing the country code; supplementing and de-duplicating the area code.
12. The device according to claim 11, wherein the preprocessing module is further adapted to:
Determine whether the original telephone number string to be identified contains a specified separator;
If the original telephone number string to be identified contains a specified separator, split the original string at the separator to obtain at least two target telephone number strings to be identified.
13. The device according to claim 12, wherein the specified separator includes at least one of the following: enumeration comma (、), comma, semicolon, slash, backslash, vertical bar.
14. The device according to claim 12, wherein the preprocessing module is further adapted to:
After the at least two target telephone number strings to be identified are obtained, determine, for each target telephone number string to be identified, whether the head of the string carries a country code;
If so, remove the country code from the head of the target telephone number string to be identified.
15. The device according to claim 14, wherein the preprocessing module is further adapted to:
After the country code is removed from the head of the target telephone number string to be identified, analyze the target telephone number string from which the country code has been removed;
If the head of the target telephone number string to be identified carries an area code and that area code is incomplete, supplement the area code to make it complete;
If the head of the target telephone number string to be identified carries an area code and that area code is repeated, de-duplicate the area code.
16. The device according to claim 11, wherein the identification module is further adapted to:
Judge whether the number string of the first specified number of digits matches the attribute features of a first category of telephone number;
If so, determine at least two detection digit counts according to the attribute features of the first category of telephone number;
Split the target telephone number string to be identified with each detection digit count in turn, to obtain splitting results;
According to the splitting results, select an optimal detection digit count from the at least two detection digit counts and use it to complete the number string of the first specified number of digits.
17. The device according to claim 16, wherein the identification module is further adapted to:
For each detection digit count, use that detection digit count to split the part of the target telephone number string to be identified that follows the number string of the first specified number of digits, obtaining a first split number and a second split number;
Compare the first split number with the second split number and determine the number of corresponding positions at which the two carry the same digit, as the splitting result for that detection digit count.
18. The device according to claim 17, wherein the identification module is further adapted to:
Compare the numbers of identical digits obtained for the respective detection digit counts;
Select, from the detection digit counts, the one with the largest number of identical digits as the optimal detection digit count;
Complete the number string of the first specified number of digits by the optimal detection digit count.
19. The device according to claim 16, wherein:
The division module is further adapted to: after the identification module judges whether the number string of the first specified number of digits matches the attribute features of a first category of telephone number, if it does not match, select a new division rule that conforms to the telephone number format and re-divide the target telephone number string to be identified, obtaining a number string of a second specified number of digits;
The identification module is further adapted to judge whether the number string of the second specified number of digits matches the attribute features of a second category of telephone number and, if so, to complete that number string according to the attribute features of the second category of telephone number.
20. The device according to claim 11, further comprising an acquisition module adapted to obtain the original telephone number string to be identified through the following steps:
Obtaining point of interest (POI) information from a web page;
Extracting the original telephone number string to be identified from the POI information.
CN201510643026.2A 2015-09-30 2015-09-30 Recognition methods based on recursive telephone number and device Active CN105187600B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510643026.2A CN105187600B (en) 2015-09-30 2015-09-30 Recognition methods based on recursive telephone number and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510643026.2A CN105187600B (en) 2015-09-30 2015-09-30 Recognition methods based on recursive telephone number and device

Publications (2)

Publication Number Publication Date
CN105187600A CN105187600A (en) 2015-12-23
CN105187600B true CN105187600B (en) 2018-09-07

Family

ID=54909438

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510643026.2A Active CN105187600B (en) 2015-09-30 2015-09-30 Recognition methods based on recursive telephone number and device

Country Status (1)

Country Link
CN (1) CN105187600B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106921776B (en) * 2015-12-24 2020-03-17 北京四维图新科技股份有限公司 Method and device for optimizing telephone number in POI (Point of interest) data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102088697A (en) * 2010-12-17 2011-06-08 北京华中融合科技有限公司 Method and system for processing spam
CN104731977A (en) * 2015-04-14 2015-06-24 海量云图(北京)数据技术有限公司 Phone number data search and classification method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4163138B2 (en) * 2004-04-05 2008-10-08 松下電器産業株式会社 Mobile phone equipment

Also Published As

Publication number Publication date
CN105187600A (en) 2015-12-23

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220715

Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.
