CN101807184A - Method for searching character string with wildcard character and system thereof - Google Patents

Method for searching character string with wildcard character and system thereof Download PDF

Info

Publication number
CN101807184A
CN101807184A CN 200910007724 CN200910007724A CN101807184A CN 101807184 A CN101807184 A CN 101807184A CN 200910007724 CN200910007724 CN 200910007724 CN 200910007724 A CN200910007724 A CN 200910007724A CN 101807184 A CN101807184 A CN 101807184A
Authority
CN
China
Prior art keywords
character string
asterisk wildcard
character
string
specific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200910007724
Other languages
Chinese (zh)
Other versions
CN101807184B (en
Inventor
董琪
陆海涛
任成波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alcatel Lucent SAS
Alcatel Optical Networks Israel Ltd
Original Assignee
Alcatel Optical Networks Israel Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alcatel Optical Networks Israel Ltd filed Critical Alcatel Optical Networks Israel Ltd
Priority to CN 200910007724 priority Critical patent/CN101807184B/en
Publication of CN101807184A publication Critical patent/CN101807184A/en
Application granted granted Critical
Publication of CN101807184B publication Critical patent/CN101807184B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a novel search scheme of character string with wildcard character, wherein an information storage device maintains and uses an identification information for the search of the character string with wildcard characters, the identification information indicates whether the alternative character strings containing wildcard character can be used for showing one or a plurality of certain characters, and the identification information is saved associated with the alternative character strings containing wildcard characters and the one or the plurality of certain characters. The technical scheme of the invention can realize the search of the character string containing wildcard character based on the specific character string and reduce the complexity of the search process and the pre-stored information content for the search, thereby achieving fast efficiency and less occupation of hardware resource such as the processor of the information storage device and the like.

Description

Be used to retrieve the method and system of the character string that comprises asterisk wildcard
Technical field
The present invention relates to information retrieval, relate in particular to the method and system that is used to retrieve the character string that comprises asterisk wildcard.
Background technology
At electronic information field, constantly all need to carry out the retrieval of various information.Wherein just comprise retrieval to character string.
Wherein, memory device often need go to retrieve the concrete character string that is complementary with this character according to the character string that comprises asterisk wildcard of an input.In actual applications, memory device comprises the character string of asterisk wildcard with each and the concrete character string that is complementary is with it preserved explicitly.So when a character string that comprises asterisk wildcard as access entry arrived, memory device can be inquired about those concrete character strings of preserving with this character string that comprises asterisk wildcard before this, thereby obtains result for retrieval.
Nowadays, increasing standard, agreement have proposed new Search Requirement in the electronic applications, for example, according to the concrete character string that the user provides, inquire about the character string that comprises asterisk wildcard that is complementary with it.Such standard is 3GPP TS 23.003 for example.
But, there is no the prior art existence that can be used for realizing the reverse retrieval of this kind in this area.
Summary of the invention
For solving the aforementioned problems in the prior, the present invention proposes a kind of new technical scheme, wherein, a kind of identification information is safeguarded and utilized to information storing device, in order to the character string that comprises asterisk wildcard is retrieved, this identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
According to a specific embodiment of the present invention, a kind of method that is used to retrieve the character string that comprises asterisk wildcard in information storing device is provided, wherein, may further comprise the steps: obtain first character string; Retrieve the character string that comprises asterisk wildcard that is complementary with first character string based on the identification information that prestores, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
According to another specific embodiment of the present invention, a kind of first indexing unit that is used to retrieve the character string that comprises asterisk wildcard in information storing device is provided, wherein, comprising: first obtains device, is used to obtain first character string; Second indexing unit, be used for retrieving the character string that comprises asterisk wildcard that is complementary with this first character string based on the identification information that prestores, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
Adopt technical scheme of the present invention, can be when realization be retrieved the character string this purpose that comprises asterisk wildcard based on concrete character string, the prestored information amount that reduces the complexity of retrieving as much as possible and be used to retrieve, thereby the guarantee information memory device is when carrying out above-mentioned retrieval, have efficient faster, take less the hardware resources such as processor of information storing device.
Description of drawings
By reading the description of indefiniteness embodiment being done below in conjunction with accompanying drawing, other purpose of the present invention, feature and advantage will be more obvious.Wherein, same or analogous Reference numeral is represented same or analogous technical characterictic.
Fig. 1 shows a network structure based on IMS;
Fig. 2 a shows the process flow diagram that is used to retrieve the method for the character string that comprises asterisk wildcard according to a specific embodiment of the present invention in information storing device;
Fig. 2 b is the method flow diagram that is used to retrieve the character string that comprises asterisk wildcard in information storing device according to a specific embodiment of the present invention;
Fig. 3 shows the structured flowchart according to first indexing unit that is used to retrieve the character string that comprises asterisk wildcard in information storing device of the specific embodiment of the present invention.
Embodiment
As a reference, the network structure based on IMS of one of a plurality of applied environments of the present invention is briefly introduced as follows, as shown in Figure 1.Wherein, shown in the bottom of network be bearing bed, be used to provide the access and the transmission of IMS-SIP session.The major equipment of this layer has SGSN (Serving GPRS Support Node), GGSN (gateway GPRS business support node) and MGW (media gateway).
The centre is the signaling control layer, and the signaling control of all IP multimedia service is all finished at this one deck.The major function entity of this layer comprises CSCF, HSS (Home Subscriber Server, home subscriber server), MGCF etc., these functional entitys are taken on different roles, as signaling control server, database, media gateway server etc., the collaborative processing capacity of finishing the signaling aspect, for example foundation of SIP session, release.
What be positioned at this network top is application layer, is responsible for the user IMS is provided value-added service, and the network element of this layer mainly is a series of application servers (AS) that multimedia service is provided by Camel, OSA/Parlay and sip technique.
In the IMS network, public service identity (PSI) is used for identifying the business of IMS network, is used for the business in IMS territory is routed to corresponding server.Each public service identity is stored among the HSS, and by application management server.Application server is carried out logical controlling according to public service identity.The form of public service identity can be SIP URI or Tel URI.Wherein, public service identity can be that a definite PSI also can be an asterisk wildcard PSI, as " SIP:Chatlist*@Example.com ", the PSI that can be complementary with this asterisk wildcard PSI is as " SIP:Chatlist1@Example.com ", and " SIP:ChatlistA*@Example.com " reaches " SIP:Chatlistabc*@Example.com " etc. all can trigger this business.
It will be appreciated by those skilled in the art that the example of an indefiniteness of the character string that comprises asterisk wildcard that asterisk wildcard PSI promptly will mention herein.
When IMS user shown in Figure 1 (hereinafter to be referred as the user) 1 request triggers one when professional, promptly in request message, load a different PSI (distinct PSI, in network shown in Figure 1, has uniqueness), in the HSS2 of signaling control layer, this different PSI will be as access entry, and the pairing business of asterisk wildcard PSI that is complementary with this different PSI of inquiry gained can be triggered.
The purpose of proposition of the present invention is to provide a kind of effective method and device, realizes retrieving for example asterisk wildcard PSI of the regular expression that is complementary with it according to for example different PSI of concrete character string.It will be appreciated by those skilled in the art that to be that example only is the clear and easy of explanation with different PSI and asterisk wildcard PSI herein, this example does not constitute any limitation protection scope of the present invention.
Contribution of the present invention is, writes down the character string that comprises asterisk wildcard and the relation between the specific character in the mode of identification information (as, array), thereby realizes retrieving comparatively fast, and in theory, complexity of the present invention is O (1).
Fig. 2 a shows the process flow diagram that is used to retrieve the method for the character string that comprises asterisk wildcard according to a specific embodiment of the present invention in information storing device.Graphic technique comprises that following two step S23. obtain first character string; S24. retrieve the character string that comprises asterisk wildcard that is complementary with first character string based on the identification information that prestores, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
As follows to the some of them concept explanation:
● ' information storing device ': at this equipment that refers to HSS and similar functions is arranged with it, be IMS owing to do not limit applied environment in claim 1, therefore, we use this upperseat concept of HSS to be limited.
● ' character string that comprises asterisk wildcard ': refer to regular expression, import automatically and stored by system manager's input in advance or by information storing device.
● ' first character string ': promptly as the concrete character string of access entry, for example, 123456.
● ' can be used to illustrate ': a character string that comprises asterisk wildcard can be used to illustrate the just expression of a specific character, it or directly comprised this specific character, for example ' 123* ' just directly comprised ' 1 '; Perhaps comprise the asterisk wildcard that can represent this specific character, for example ' 123* ' comprised and can represent ' 4 ' asterisk wildcard ' * '.
Be to be understood that, information storing device for example HSS is retrieved the character string that comprises asterisk wildcard if desired, may be because an operator has moved search program on this equipment, also may be that this equipment is received the indication that miscellaneous equipment is sent, and also may be that this equipment spontaneously produces Search Requirement.Therefore, ' obtaining operation ' among the step S23 has multiple implementation, can be to be received by other equipment places, also can be spontaneously to produce, and perhaps gathers user's input.
Describe in detail hereinafter with reference to process flow diagram each non-limiting examples method provided by the invention.
Fig. 2 b is the method flow diagram that is used to retrieve the character string that comprises asterisk wildcard in information storing device according to a specific embodiment of the present invention.Wherein, in order to be introduced, also show the step S21 before the step S23: obtain identification information, comprise the character string of asterisk wildcard from whole angle; And step S22: preserve identification information that obtains and the character string that comprises asterisk wildcard.
First embodiment
Should be appreciated that the step S21 shown in Fig. 2 b, S22 are not all is necessary at every turn, when needs upgrade identification information or comprise the character string of asterisk wildcard, preferably carries out this two steps when retrieving as required.If character string that comprises asterisk wildcard that is used to retrieve in the information storing device and respective identification information are by static configuration and can not upgrade, then step S21 and step S22 can economize.Scene in this example is a data no initializtion still of retrieving the character string that comprises asterisk wildcard being used in the information storing device, so, order execution in step S21 and S22.
Step S21
At first, HSS2 obtains one or more alternative character strings that comprise asterisk wildcard and corresponding identification information, and this identification information indicates these alternative character strings that comprise asterisk wildcard whether can be used to illustrate one or more specific characters.Wherein, the mode of obtaining the alternative character string that comprises asterisk wildcard includes but not limited to be received by the miscellaneous equipment place, and the input of gathering the user.
In this example, the ground that is without loss of generality, the character string that comprises asterisk wildcard that HSS2 obtains has: 12[a-c], 2? and 1! , and HSS2 uses following asterisk wildcard:
'. ' represents 1 numeral (0-9) or letter (a-z) arbitrarily;
'! ' expression 1 arbitrarily the numeral;
'? ' expression 1 arbitrarily the letter;
' [x-y] ' expression is from x (comprising x), in integer (when x, y are integer) that finishes to y or the letter (when x, y are letter) any 1, for example, any numeral in [2-6] expression 2,3,4,5,6.
After having obtained above-mentioned three character strings (hereinafter also will be called for short character string under the unlikely situation about obscuring) that comprise asterisk wildcard, information storing device is resolved these character strings, is used to represent with generation whether these character strings can be used to illustrate the identification information of specific character.Wherein, preferably, during parsing at specific character set can be pre-defined.
In this example, above-mentioned parsing at specific character set for 1,2,3, a, b, c}.
Wherein, the ground that is without loss of generality, described identification information are the some arrays that correspond respectively to a specific character in this set, and a unit in each array is then corresponding to one in above-mentioned three character strings.
Preferably, described each array is the scale-of-two array, and when a certain character string can be used to illustrate a specific character, promptly puts 1 with the corresponding binary digit of this character string in the array of generation, otherwise put 0.
Based on this kind rule, table 1 has exemplarily been represented the identification information that generates in this example:
Table 1: the identification information example that generates among first embodiment
Specific character Array
??1 ??0101
??2 ??0111
??3 ??0100
??a ??0011
??b ??0011
??c ??0011
Wherein, the length of the alternative character string that comprises asterisk wildcard of definition is no more than 3.Therefore the number that defines the alternative character string that comprises asterisk wildcard is no more than 4, is 4 bit array shown in the table.
As seen, in each row of form, comprise a specific character and a corresponding array respectively.Wherein, dextrosinistral first three binary digit corresponds respectively to 12[a-c in each array], 2? and 1!
Also promptly, 0101 ' 1 ' expression 12[a-c on right several first of second row] can be used to illustrate 1 this specific character, because this character string itself has just directly comprised 1.
0101 ' 0 ' expression 2 on right several second of second row? can't be used to illustrate 1, because? definition be 1 letter arbitrarily, so this character string both directly do not comprise 1, do not comprise the asterisk wildcard that can be used in expression 1 yet.
Second row 0101 on right several the 3rd ' 1 ' expression 1! Can be used to illustrate 1, because it has directly comprised 1.
See fifth line again, ' 1 ' expression 12[a-c on wherein 0011 right several first] can be used to illustrate this specific character of a, because asterisk wildcard [a-c] expression a or b or c, nature can illustrate a.
' 1 ' expression 2 on 0011 right several second of fifth line? can be used to illustrate a, because asterisk wildcard? can represent the arbitrary letter among a to z, nature can illustrate a.
' 0 ' expression on 0011 right several the 3rd of fifth line 1! A can't be shown, because asterisk wildcard! Only can represent one arbitrarily the numeral, so 1! Neither directly comprise a, also do not comprise the asterisk wildcard that to represent a.
The content of other row can be analogized with reference to above-mentioned introduction in the form, repeats no more.
Step S22
The above-mentioned identification information that obtains among the step S21 will be with character string 12[a-c], 2? and 1! In step S22, be kept at the information storing device place together, so that deal with the query task that may arrive.Certainly, identification information and each comprise between the character string of asterisk wildcard the incidence relation of expressing or hinting, so that whether can represent that to character string specific character represents.
Step S23
After this in a certain moment, the task of retrieving the character string that comprises asterisk wildcard that is complementary according to concrete character string produces, and wherein, order is that first character string is 12a as the concrete character string of access entry, and character 1,2 and a are wherein arranged.
Step S24
So based on alternative character string that comprises asterisk wildcard of obtaining in step S21 and preserving in step S22 and corresponding identification information, information storing device is retrieved the character string that comprises asterisk wildcard that is complementary with 12a in step S24.
Concrete, in step S24, in table 1, obtain information at each character among the first character string 12a, detailed process is as follows:
First character among the 12a is 1, in table 1, is considered as specific character with 1, and then corresponding array is ' 0101 ';
Second character among the 12a is 2, in table 1, is considered as specific character with 2, and then corresponding array is ' 0111 ';
The 3rd character among the 12a is a, in table 1, a is considered as specific character, and then corresponding array is ' 0011 '.
Certainly, because general information storing device all can carry out the multi-process computing, therefore, searching of above-mentioned three arrays can be carried out synchronously, and visible time complexity is very low.
Then, three scale-of-two arrays that obtain are carried out the logical and operation,
Figure B2009100077248D0000081
Obtain a new scale-of-two array 0001, and as can be known according to the corresponding relation between binary digit in each array in the table 1 and the alternative character string that comprises asterisk wildcard, in this new scale-of-two array 1 is corresponding to 12[a-c], also promptly, 12[a-c] be unique character string that is complementary with 12a in above-mentioned three character strings.
In first embodiment, identification information represents more roughly whether a character string that comprises asterisk wildcard can be used to illustrate a specific character, also promptly, 12[a-c] be regarded as can being used to illustrate 1,2, a, b, these five characters of c, and ignore 12[a-c] in position concept.Therefore, the situation among this embodiment is applicable to that more retrieval concerns insensitive situation for character in the character string and the position between the character.
And when the alternative character string quantity that comprises asterisk wildcard is big, according to the scheme shown in first embodiment, the precision of resulting result for retrieval may be not very good in step S24, for example, with 12a during as access entry, if preserved 12[a-c in the information storing device], 1[a-c] 2 and [a-c] 12 as the alternative character string that comprises asterisk wildcard, then these three character strings will all be regarded as being complementary with 12a.But people may wish that information searching device can be only with 12[a-c] provide as final result for retrieval.And this will be achieved in more preferred second embodiment of the present invention.
Second embodiment
Below, with reference to Fig. 2 a, 2b and in conjunction with Fig. 1 the second embodiment of the present invention is introduced, wherein, with a HSS2 shown in Figure 1 example as information storing device.
In a second embodiment, consider the position concept of character in character string, wherein each position is called an ad-hoc location.For example, in ' 12[a-c] ', ' 1 ' at first ad-hoc location, and ' 2 ' is the second place at second ad-hoc location, and ' [a-c] ' is at the 3rd ad-hoc location.
Step S21
In step S21, HSS2 at first obtains alternative character string that comprises asterisk wildcard and respective identification information.Wherein, the character string that order is obtained comprises 12[a-c], 2*, 1.3.Wherein the asterisk wildcard meaning of Chu Xianing is as follows:
[x-y] expression is from x (comprising x), in integer (when x, y are integer) that finishes to y or the letter (when x, y are letter) any 1, for example, any numeral in [2-6] expression 2,3,4,5,6.
'. ' represents 1 numeral (0-9) or letter (a-z) arbitrarily.
' * ' expression length is 0 to N any alphabetic string or the character string of numeric string or mixing, as, 123,134b, afgh8a2 etc.
Then, three character strings that comprise asterisk wildcard that get access to are resolved, obtaining corresponding identification information, precedingly address, the identification information of this moment will be relevant with the position concept in the character string.Preferably, during parsing at the specific character set can be pre-defined.
In this example, above-mentioned parsing at specific character set for 1,2,3, a, b, c}.
Wherein, the ground that is without loss of generality, described identification information are a plurality of arrays, and each array wherein meets the following conditions:
-corresponding to a specific character in the described specific character set;
-corresponding to an ad-hoc location;
Whether-wherein each unit can be used to illustrate the described specific character that is positioned at described ad-hoc location corresponding to an alternative character string that comprises asterisk wildcard to represent this character string.
Preferably, described each array is the scale-of-two array, and when a certain character string can be used to illustrate certain specific character that is positioned at an ad-hoc location, generate with this specific character and the corresponding array of this ad-hoc location in, with the corresponding binary location 1 of this character string, otherwise put 0.
Based on this kind rule, the identification information that generates in this example has exemplarily been represented in table 21~23:
Table 2_1: the first of the identification information that generates among second embodiment
Specific character Array Finish array
??1 ??0101 ??0000
??2 ??0010 ??0010
Specific character Array Finish array
??3 ??0000 ??0000
??a ??0000 ??0000
??b ??0000 ??0000
??c ??0000 ??0000
Table 2_2: the second portion of the identification information that generates among second embodiment
Specific character Array Finish array
??1 ??0110 ??0010
??2 ??0111 ??0010
??3 ??0110 ??0010
??a ??0110 ??0010
??b ??0110 ??0010
??c ??0110 ??0010
Table 2_3: the third part of the identification information that generates among second embodiment
Specific character Array Finish array
??1 ??0000 ??0010
??2 ??0000 ??0010
??3 ??0000 ??0110
??a ??0000 ??0011
??b ??0000 ??0011
??c ??0000 ??0011
Wherein, the number of the alternative character string that comprises asterisk wildcard of definition is no more than 3.Each length that comprises the character string of asterisk wildcard is no more than 4, so the scale-of-two array in the form is 4.As seen, in each row of form, comprise a specific character and a corresponding array respectively.Wherein, dextrosinistral first three binary digit corresponds respectively to 12[a-c in each array], 2? and 1!
Table 2_1 shown each array also is 12[a-c corresponding to first ad-hoc location in the character string that comprises asterisk wildcard] in the position at 1 place, i.e. the position at 2 places among the 2*, the i.e. position at 1 place in 1.3.
So only when certain character string that comprises asterisk wildcard can illustrate certain specific character that is positioned on a certain ad-hoc location, the corresponding positions of corresponding array just can put 1, otherwise puts 0.
An example, each character string of array representation that second ranks in table 21 go out and the relation of the expression between the specific character 1.0101 dextrosinistral each binary digit wherein is successively corresponding to 12[a-c], 2*, 1.3 these three character strings.
Table 2_1 second row 0101 in 1 on first binary digit of right number corresponding to 12[a-c], expression 12[a-c] can be used to illustrate the specific character 1 that is positioned at first ad-hoc location.Because, apparently, 12[a-c] directly comprised specific character 1, and this specific character exactly is positioned at first ad-hoc location wherein.
Table 2_1 second row 0101 in 0 on right several second binary digit corresponding to 2* because first ad-hoc location of 2* is occupied by 2, therefore the specific character 1 that is positioned on first ad-hoc location can't be shown, therefore put 0.
Table 2_1 second row 0101 in 1 on right several the 3rd binary digits corresponding to 1.3, expression 1.3 can illustrate and be positioned at 1 on first ad-hoc location.Because apparently, 1.3 have directly comprised specific character 1, and this specific character exactly is positioned at first ad-hoc location wherein.
Because it is 0 to some arbitrary strings that this asterisk wildcard of * can be represented length, and '. ' can represent that length is 1 any character (numeral or letter), therefore, the secondary series in the table 22 corresponding with second ad-hoc location can be seen, all puts 1 with 2* and 1.3 corresponding binary digits.And only at the third line because specific character is 2, thus in the array 0111 with 12[a-c] corresponding binary digit also puts 1.
As can be seen, also comprise the end array in the form, its comprise with each form secondary series in the identical implication of array, represent promptly whether each character string can represent to be positioned at the specific character of respective specific position.In addition, these finish array and represent also whether corresponding specific character is last character of this character string that comprises asterisk wildcard, if, then put 1, otherwise zero setting.
By finish the effect of array with the introduction of next example:
The character string that comprises asterisk wildcard " 2* " is above become " * 2 ".So, when not finishing array, when input " 23 " during as concrete character string, in table 21, will obtain 0010, and in table 2_2, will obtain 0110, through behind the logic and operation, will obtain 0010, be " * 2 " so draw the character string that comprises asterisk wildcard of coupling, and in fact they are unmatched.Having when finishing array, equally will " 23 " as access entry, in table 2_1, obtain 0010, and in table 2_2, obtain 0000 according to finishing array, through logic and operation, finally obtain 0000, so know there is not the characters matched string.
So in table 2_2, because the * among the 2* can represent that length is 1 character string, and value can be 1,2,3, a, b, among the c any, so these six specific characters last character that all may be 2* are so each finishes all to put 1 with the corresponding binary digit of 2* in array.
Because 3 is predefined maximum lengths that comprise the character string of asterisk wildcard, therefore in table 2_3, only represent that with the end array representing between each character string and the specific character concerns.
Step S22
After having obtained each above-mentioned identification information, this method enters step S22, character string that comprises asterisk wildcard and above-mentioned identification information that each is alternative are preserved, certainly, between each array and the specific character, and the corresponding relation between the ad-hoc location, and the corresponding relation between wherein each unit (binary digit) and each character string is also to determine and explicit or preservation implicitly.
Step S23
In this example, first character string of obtaining is ' 123 '.
Step S24
Based on alternative character string that comprises asterisk wildcard of obtaining in step S21 and preserving in step S22 and corresponding identification information, HSS2 retrieves in step S24 and 123 character strings that comprise asterisk wildcard that are complementary.
Concrete, in step S24, in table 21~23, obtaining information respectively at each character in first character string 123, detailed process is as follows:
The character that is positioned at first ad-hoc location in 123 is 1, in table 21, is considered as specific character with 1, and then corresponding array is ' 0101 ';
The character that is positioned at second ad-hoc location in 123 is 2, in table 22, is considered as specific character with 2, and then corresponding array is ' 0111 ';
The character that is positioned at the 3rd ad-hoc location in 123 is 3, in table 23, is considered as specific character with 3, and then corresponding array is ' 0110 '.
Certainly, because general information storing device all can carry out the multi-process computing, therefore, searching of above-mentioned three arrays can be carried out synchronously, and visible time complexity is very low.
Then, three scale-of-two arrays that obtain are carried out the logical and operation,
Figure B2009100077248D0000141
Obtain a new scale-of-two array 0100, and as can be known according to the corresponding relation between binary digit in each array among table 21~2-3 and the alternative character string that comprises asterisk wildcard, in this new scale-of-two array 1 is corresponding to 1.3, also promptly, the 1.3rd, unique and 123 character strings that are complementary in above-mentioned three character strings.
Miscellaneous equipment be used or be offered to the character string that comprises asterisk wildcard that retrieval obtains promptly can at local (in information storing device).In the IMS system, asterisk wildcard PSI promptly is a kind of character string that comprises asterisk wildcard, behind the asterisk wildcard PSI that HS S2 inquiry obtains being complementary with different PSI, is about to it and offers the application corresponding server, to trigger corresponding business.
For the follow-up use of asterisk wildcard PSI, existing more scheme also can repeat no more referring to the TS23.003 in the 3GPP standard in this area.
Carrying out when it will be appreciated by those skilled in the art that step S21 and S22, can also when upgrading this partial data or revise, carry out by needs except can be in information storing device relevant data initialization with the string search that comprises asterisk wildcard.
Refer again to the device block diagram below device provided by the invention is introduced, wherein, obtained sufficient elaboration hereinbefore owing to install corresponding method with this, introduction hereinafter will be comparatively simple.Fig. 3 shows first indexing unit 30 that is used to retrieve the character string that comprises asterisk wildcard in information storing device according to the specific embodiment of the present invention.
Illustrated first indexing unit 30 comprises that first obtains device 300, is used to obtain first character string; Second indexing unit 301, be used for retrieving the character string that comprises asterisk wildcard that is complementary with this first character string based on the identification information that prestores, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
First deriving means 302 is used to obtain described one or more alternative character strings that comprise asterisk wildcard and described identification information; Save set 303 is used to preserve described one or more alternative character strings that comprise asterisk wildcard and described identification information.
Concrete, described first deriving means 302 comprises: second deriving means 3020 is used to obtain described one or more alternative character string that comprises asterisk wildcard; Resolver 3021 is used for described at least one character string that comprises asterisk wildcard is resolved, to generate described identification information.
More specifically, described resolver 3021 comprises the following device that is used to each described alternative character string that comprises asterisk wildcard to carry out corresponding operating:
Judgment means 30210 is used for judging whether this character string can be used to illustrate each specific character of a specific character set;
Generating apparatus 30211 is used for generating respectively the information whether this character string of expression can be used to illustrate described each specific character.
Described resolver 3021 also comprises:
Second obtains device 30212, is used to be based upon described each and alternative comprise the information whether expression respective symbols string that wild card string generates can be used to illustrate described each specific character and obtain described identification information.
Wherein, described second obtains device 3020 is used for: be based upon described each and alternative comprise the information whether expression respective symbols string that wild card string generates can be used to illustrate described each specific character and generate a plurality of arrays, with as described identification information, wherein, each array is corresponding to a specific character in the described specific character set, and whether the arbitrary unit in each array can be used to illustrate this specific character corresponding to a described alternative character string that comprises asterisk wildcard to represent this character string.
Wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate the one or more specific characters that are positioned at each ad-hoc location, and described judgment means 30210 also is used for: judge whether this character string can be used for illustrating described each specific character when being positioned at each ad-hoc location.
Described generating apparatus 30211 also is used for: generate the information whether this character string of expression can be used to illustrate described each specific character that is positioned at each described ad-hoc location respectively.
Described second obtains device 30212 also is used for: whether this character string of expression that is based upon described each alternative character string generation that comprises asterisk wildcard can be used to illustrate the information of described each specific character that is positioned at each described ad-hoc location, obtains described identification information.
Further, described second obtains device 30212 also is used for: whether this character string of expression that is based upon described each alternative character string generation that comprises asterisk wildcard can be used to illustrate the information of described each specific character that is positioned at each described ad-hoc location, generate a plurality of arrays, with as described identification information, wherein, each array meets the following conditions:
-corresponding to a specific character in the described specific character set;
-corresponding to a described ad-hoc location; And
Whether-wherein each unit can be used to illustrate the described specific character that is positioned at described ad-hoc location corresponding to an alternative character string that comprises asterisk wildcard to represent this character string.
Wherein, described second indexing unit 301 comprises:
Inquiry unit 3010 is used in described a plurality of arrays, respectively each specific character and this specific character corresponding array of residing ad-hoc location in described first character string of inquiry and described first character string;
Determine device 3011, be used for, determine the character string that comprises asterisk wildcard that is complementary with this first character string based on each array that inquires.
Preferably, described each array is the scale-of-two array, arbitrary unit in each array is a bit, described definite device 3011 is used for: each array that inquires is carried out step-by-step and operation, generating the string of binary characters of the character string that comprises asterisk wildcard that indication and this first character string be complementary, and the character string that comprises asterisk wildcard that described string of binary characters is indicated is defined as the character string that comprises asterisk wildcard that is complementary with this first character string.
More preferably, the character string that comprises asterisk wildcard that described string of binary characters is indicated is that described string of binary characters intermediate value is 1 the pairing character string that comprises asterisk wildcard in position.
More than specific embodiments of the invention are described.It will be appreciated that the present invention is not limited to above-mentioned specific implementations, those skilled in the art can make various distortion or modification within the scope of the appended claims.

Claims (24)

1. method that is used to retrieve the character string that comprises asterisk wildcard in information storing device wherein, may further comprise the steps:
A. obtain first character string;
B. retrieve the character string that comprises asterisk wildcard that is complementary with first character string based on the identification information that prestores, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
2. method according to claim 1, wherein, further comprising the steps of before described step a:
I. obtain described one or more alternative character strings that comprise asterisk wildcard and described identification information;
II. preserve described one or more alternative character strings that comprise asterisk wildcard and described identification information.
3. method according to claim 2, wherein, described step I comprises:
I1. obtain described one or more alternative character string that comprises asterisk wildcard;
I2. described at least one character string that comprises asterisk wildcard is resolved, to generate described identification information.
4. method according to claim 3, wherein, described step I2 comprises, for each described alternative character string that comprises asterisk wildcard is carried out following steps:
I21. judge whether this character string can be used for illustrating each specific character of a specific character set;
I22. generate the information whether this character string of expression can be used to illustrate described each specific character respectively,
Described step I2 also comprises:
I. being based upon described each alternative comprise the information whether expression respective symbols string that wild card string generates can be used to illustrate described each specific character and obtains described identification information.
5. method according to claim 4, wherein, described step I comprises:
Being based upon described each alternative comprise the information whether expression respective symbols string that wild card string generates can be used to illustrate described each specific character and generates a plurality of arrays, with as described identification information, wherein, each array is corresponding to a specific character in the described specific character set, and whether each unit in each array can be used to illustrate this specific character corresponding to a described alternative character string that comprises asterisk wildcard to represent this character string.
6. method according to claim 4, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate the one or more specific characters that are positioned at each ad-hoc location, and described determining step I21 comprises,
Judge whether this character string can be used for illustrating described each specific character when being positioned at each ad-hoc location;
Described step I22 comprises:
Generate the information whether this character string of expression can be used to illustrate described each specific character that is positioned at each described ad-hoc location respectively,
Described step I comprises:
Whether this character string of expression that is based upon described each alternative character string generation that comprises asterisk wildcard can be used to illustrate the information of described each specific character that is positioned at each described ad-hoc location, obtains described identification information.
7. method according to claim 6, wherein, described step I comprises:
Whether this character string of expression that is based upon described each alternative character string generation that comprises asterisk wildcard can be used to illustrate the information of described each specific character that is positioned at each described ad-hoc location, generate a plurality of arrays, with as described identification information, wherein, each array meets the following conditions:
-corresponding to a specific character in the described specific character set;
-corresponding to a described ad-hoc location; And
-wherein each unit is corresponding to an alternative character string that comprises asterisk wildcard, with table
Show whether this character string can be used to illustrate the described specific character that is positioned at described ad-hoc location.
8. search method according to claim 7, wherein, described step b may further comprise the steps:
B1. in described a plurality of arrays, inquire about each specific character and this specific character corresponding array of residing ad-hoc location in described first character string respectively with described first character string;
B2. based on each array that inquires, determine the character string that comprises asterisk wildcard that is complementary with this first character string.
9. search method according to claim 8, wherein, described each array is the scale-of-two array, and the arbitrary unit in each array is a bit, and described step b2 comprises:
-each array that inquires is carried out step-by-step and operation, to generate the string of binary characters of indicating the character string that comprises asterisk wildcard that is complementary with this first character string;
-the character string that comprises asterisk wildcard that described string of binary characters is indicated is defined as the character string that comprises asterisk wildcard that is complementary with this first character string.
10. search method according to claim 9, wherein, the character string that comprises asterisk wildcard that described string of binary characters is indicated is that described string of binary characters intermediate value is 1 the pairing character string that comprises asterisk wildcard in position.
11. according to each described method in the claim 1 to 10, wherein, described information storing device is an attribution server, the described character string that comprises asterisk wildcard is a public service identity.
12. a method that is used to provide public service identity in the attribution server based on the IP multimedia system wherein, may further comprise the steps:
Use is retrieved one or more public service identity that comprise asterisk wildcard according to each method among the claim 1-10;
The one or more public service identity that comprise asterisk wildcard that retrieve are offered respective application server.
13. first indexing unit that is used to retrieve the character string that comprises asterisk wildcard in information storing device wherein, comprising:
First obtains device, is used to obtain first character string;
Second indexing unit, be used for retrieving the character string that comprises asterisk wildcard that is complementary with this first character string based on the identification information that prestores, wherein, described identification information indicates each alternative character string that comprises asterisk wildcard whether can be used to illustrate one or more specific characters, and preserves explicitly with described each alternative character string that comprises asterisk wildcard and described one or more specific character.
14. first indexing unit according to claim 13 wherein, also comprises:
First deriving means is used to obtain described one or more alternative character strings that comprise asterisk wildcard and described identification information;
Save set is used to preserve described one or more alternative character strings that comprise asterisk wildcard and described identification information.
15. first indexing unit according to claim 14, wherein, described first deriving means comprises:
Second deriving means is used to obtain described one or more alternative character string that comprises asterisk wildcard;
Resolver is used for described at least one character string that comprises asterisk wildcard is resolved, to generate described identification information.
16. first indexing unit according to claim 15, wherein, described resolver comprises the following device that is used to each described alternative character string that comprises asterisk wildcard to carry out corresponding operating:
Judgment means is used for judging whether this character string can be used to illustrate each specific character of a specific character set;
Generating apparatus is used for generating respectively the information whether this character string of expression can be used to illustrate described each specific character,
Described resolver also comprises:
Second obtains device, is used to be based upon described each and alternative comprise the information whether expression respective symbols string that wild card string generates can be used to illustrate described each specific character and obtain described identification information.
17. first indexing unit according to claim 16, wherein, described second obtains device is used for:
Being based upon described each alternative comprise the information whether expression respective symbols string that wild card string generates can be used to illustrate described each specific character and generates a plurality of arrays, with as described identification information, wherein, each array is corresponding to a specific character in the described specific character set, and whether the arbitrary unit in each array can be used to illustrate this specific character corresponding to a described alternative character string that comprises asterisk wildcard to represent this character string.
18. method according to claim 17, wherein, whether described identification information indicates each alternative character string that comprises asterisk wildcard can be used to the one or more specific characters that are positioned at each ad-hoc location are shown, and described judgment means is also used hand:
Judge whether this character string can be used for illustrating described each specific character when being positioned at each ad-hoc location;
Described generating apparatus also is used for:
Generate the information whether this character string of expression can be used to illustrate described each specific character that is positioned at each described ad-hoc location respectively,
Described second obtains device also is used for:
Whether this character string of expression that is based upon described each alternative character string generation that comprises asterisk wildcard can be used to illustrate the information of described each specific character that is positioned at each described ad-hoc location, obtains described identification information.
19. first indexing unit according to claim 18, wherein, described second obtains device also is used for:
Whether this character string of expression that is based upon described each alternative character string generation that comprises asterisk wildcard can be used to illustrate the information of described each specific character that is positioned at each described ad-hoc location, generate a plurality of arrays, with as described identification information, wherein, each array meets the following conditions:
-corresponding to a specific character in the described specific character set;
-corresponding to a described ad-hoc location; And
-wherein each unit is corresponding to an alternative character string that comprises asterisk wildcard, with table
Show whether this character string can be used to illustrate the described specific character that is positioned at described ad-hoc location.
20. first indexing unit according to claim 19, wherein, described second indexing unit comprises:
Inquiry unit is used in described a plurality of arrays, respectively each specific character and this specific character corresponding array of residing ad-hoc location in described first character string of inquiry and described first character string;
Determine device, be used for, determine the character string that comprises asterisk wildcard that is complementary with this first character string based on each array that inquires.
21. first indexing unit according to claim 20, wherein, described each array is the scale-of-two array, and the arbitrary unit in each array is a bit, and described definite device is used for:
Be used for each array that inquires is carried out step-by-step and operation, generating the string of binary characters of the character string that comprises asterisk wildcard that indication and this first character string be complementary, and the character string that comprises asterisk wildcard that described string of binary characters is indicated is defined as the character string that comprises asterisk wildcard that is complementary with this first character string.
22. first indexing unit according to claim 21, wherein, the character string that comprises asterisk wildcard that described string of binary characters is indicated is that described string of binary characters intermediate value is 1 the pairing character string that comprises asterisk wildcard in position.
23. according to each described method in the claim 13 to 22, wherein, described information storing device is an attribution server, the described character string that comprises asterisk wildcard is a public service identity.
24. first generator that is used to provide public service identity in the attribution server based on the IP multimedia system wherein, comprising:
According to each described first indexing unit among the claim 13-23, be used to retrieve one or more public service identity that comprise asterisk wildcard;
Second generator, the one or more public service identity that comprise asterisk wildcard that are used for retrieving offer second equipment.
CN 200910007724 2009-02-16 2009-02-16 Method for searching character string with wildcard character and system thereof Expired - Fee Related CN101807184B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200910007724 CN101807184B (en) 2009-02-16 2009-02-16 Method for searching character string with wildcard character and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910007724 CN101807184B (en) 2009-02-16 2009-02-16 Method for searching character string with wildcard character and system thereof

Publications (2)

Publication Number Publication Date
CN101807184A true CN101807184A (en) 2010-08-18
CN101807184B CN101807184B (en) 2013-05-01

Family

ID=42608983

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910007724 Expired - Fee Related CN101807184B (en) 2009-02-16 2009-02-16 Method for searching character string with wildcard character and system thereof

Country Status (1)

Country Link
CN (1) CN101807184B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104765741A (en) * 2014-01-06 2015-07-08 中国银联股份有限公司 Data processing method
WO2017161749A1 (en) * 2016-03-21 2017-09-28 乐视控股(北京)有限公司 Method and device for information matching
CN108430045A (en) * 2018-03-14 2018-08-21 北京思特奇信息技术股份有限公司 A kind of character string pruning method and device
CN108536713A (en) * 2017-03-03 2018-09-14 广东神马搜索科技有限公司 Character string checking method, device and electronic equipment
CN110008385A (en) * 2018-04-20 2019-07-12 武汉绿色网络信息服务有限责任公司 A kind of quick matching and recognition method and device based on character string
CN111158500A (en) * 2019-12-18 2020-05-15 河南芯盾网安科技发展有限公司 Method and device for improving input efficiency by using wildcard
CN112732796A (en) * 2021-01-23 2021-04-30 河北省科学院应用数学研究所 Fuzzy query matching method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1265307C (en) * 2002-12-12 2006-07-19 华为技术有限公司 Characteristic character string extracting and substituting method in language localization
CN100530182C (en) * 2006-10-17 2009-08-19 中兴通讯股份有限公司 Character string matching information processing method in communication system

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104765741A (en) * 2014-01-06 2015-07-08 中国银联股份有限公司 Data processing method
WO2017161749A1 (en) * 2016-03-21 2017-09-28 乐视控股(北京)有限公司 Method and device for information matching
CN108536713A (en) * 2017-03-03 2018-09-14 广东神马搜索科技有限公司 Character string checking method, device and electronic equipment
CN108536713B (en) * 2017-03-03 2021-05-18 阿里巴巴(中国)有限公司 Character string auditing method and device and electronic equipment
CN108430045A (en) * 2018-03-14 2018-08-21 北京思特奇信息技术股份有限公司 A kind of character string pruning method and device
CN108430045B (en) * 2018-03-14 2020-11-24 北京思特奇信息技术股份有限公司 Character string trimming method and device
CN110008385A (en) * 2018-04-20 2019-07-12 武汉绿色网络信息服务有限责任公司 A kind of quick matching and recognition method and device based on character string
CN110083746A (en) * 2018-04-20 2019-08-02 武汉绿色网络信息服务有限责任公司 A kind of quick matching and recognition method and device based on character string
CN110008385B (en) * 2018-04-20 2020-12-22 武汉绿色网络信息服务有限责任公司 Quick matching identification method and device based on character strings
CN110083746B (en) * 2018-04-20 2021-01-22 武汉绿色网络信息服务有限责任公司 Quick matching identification method and device based on character strings
CN111158500A (en) * 2019-12-18 2020-05-15 河南芯盾网安科技发展有限公司 Method and device for improving input efficiency by using wildcard
CN112732796A (en) * 2021-01-23 2021-04-30 河北省科学院应用数学研究所 Fuzzy query matching method

Also Published As

Publication number Publication date
CN101807184B (en) 2013-05-01

Similar Documents

Publication Publication Date Title
CN101807184B (en) Method for searching character string with wildcard character and system thereof
US9928251B2 (en) System and method for distributed categorization
CN111859470B (en) Business data chaining method and device
CN107391758A (en) Database switching method, device and equipment
CN103166911B (en) A kind of version management server right management method and equipment
CN104125208A (en) Data transmission method and data transmission device
TW201800967A (en) Method and device for processing distributed streaming data
CN102761628B (en) Pan-domain name identification and processing device and method
CN104239508B (en) Data query method and data query device
CN103455335A (en) Multilevel classification Web implementation method
CN108829753A (en) A kind of information processing method and device
CN105207881A (en) Message sending method and equipment
CN107577787A (en) The method and system of associated data information storage
CN110851663B (en) Method and device for managing metadata
CN101902347B (en) Anonymous meeting terminal enrollment method and device
US8150942B2 (en) Conveying access to digital content using a physical token
CN101605301A (en) A kind of group system and request message distribution method that carries out the multinode transaction
US20030200210A1 (en) Method of searching an email address by means of a numerical code including a combination of specific phone numbers
CN100487697C (en) Searching method by using modified hash method
CN106959975B (en) Transcoding resource cache processing method, device and equipment
CN106469166B (en) A kind of information processing method and device
CN110837499B (en) Data access processing method, device, electronic equipment and storage medium
CN102143126A (en) Converged IP messaging (CPM) conversation history accessing method and message storage server
CN111178965A (en) Resource delivery method and server
CN110866085A (en) Data feedback method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130501

Termination date: 20170216