Summary of the Invention
In view of the defects in the prior art, the present invention provides a method for establishing a service index database based on a social network, and a service search method, which solve the prior-art problem that the services a user needs cannot be provided to the user in a reasonable manner.
In a first aspect, the present invention provides a method for establishing a service index database based on a social network, comprising:
collecting service information, the service information including: the category of a service, the name of the service, description information of the service, the price of the service, and/or service provider information;
generating a service data set from the collected service information, extracting the labels of each service in the service data set, and generating a service label library from the labels of all services;
obtaining transaction data of all services, and obtaining from the transaction data the service user information of all services and the transaction price of each service;
obtaining, according to the service user information, the transaction price of the service, and the labels of the service, the score of the label corresponding to each service in the service label library;
generating the service index database according to the labels corresponding to the services and the scores of the labels.
In a second aspect, the present invention provides a service search method, comprising:
receiving a service query request input by a user, the service query request including keyword information of a service to be queried;
determining, according to the service query request, the category of the service to be queried and the label of the service to be queried that corresponds to the keyword information;
obtaining from a service index database, according to the category and the label of the service to be queried, all information of the services under that category that match the label;
sorting all information of the services that match the label according to the scores of the labels, and displaying the sorted information;
wherein the service index database is an index database obtained in the manner described above, and includes service categories, the labels of the services in each service category, and the scores of the labels.
As can be seen from the above technical solutions, the method for establishing a service index database based on a social network and the service search method of the present invention collect service information, obtain the labels of the services, and obtain, from the real transaction data and the labels of the services, a service index database that includes the service labels and the label scores. Services are then searched according to the service index database, so that the services a user needs can be provided to the user accurately and reasonably.
Detailed Description of the Embodiments
The specific embodiments of the present invention are described in further detail below with reference to the accompanying drawings and examples. The following embodiments are intended to illustrate the present invention, not to limit its scope.
Fig. 1 shows a schematic flowchart of the method for establishing a service index database based on a social network provided by an embodiment of the present invention. As shown in Fig. 1, the method of this embodiment is as follows.
101. Collect service information, the service information including: the category of a service, the name of the service, description information of the service, the price of the service, and/or service provider information.
The system obtains user service information from the transaction data of an e-commerce platform, or crawls user service information from sites such as Weibo, Taobao, Ganji, and JD.com and from product forums, either through an application programming interface (API) or with a web crawler.
An example of service information is as follows. When the service "Beijing So-and-so Children's Photography and Videography" is collected, the collected service information includes: service name: Beijing So-and-so Children's Photography and Videography; service category: life services; service price: 1000 yuan; service provider: Beijing So-and-so Company; service provider contact: 186xxxxxxxx; service provider location: Zhongguancun, Haidian, Beijing; and similar information.
This embodiment may also convert the service information of all collected services into service data in a unified format, and then form the service data set from the service data in the unified format.
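The unified format mentioned above can be pictured as a fixed record type. The following is a minimal sketch only; the class name `ServiceRecord`, the function `normalize`, and the raw dictionary keys are hypothetical and do not appear in the original text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceRecord:
    """One service in the unified format (field names are assumptions)."""
    name: str
    category: str
    price_yuan: float
    provider: str
    provider_contact: str
    provider_location: str


def normalize(raw: dict) -> ServiceRecord:
    """Map one collected item (hypothetical keys) to the unified format."""
    return ServiceRecord(
        name=raw["name"].strip(),
        category=raw["category"].strip(),
        price_yuan=float(raw["price"]),
        provider=raw["provider"].strip(),
        provider_contact=raw.get("contact", "").strip(),
        provider_location=raw.get("location", "").strip(),
    )


# The example service from the text, as it might arrive from a crawler.
raw_items = [
    {
        "name": " Beijing So-and-so Children's Photography and Videography ",
        "category": "life services",
        "price": "1000",
        "provider": "Beijing So-and-so Company",
        "contact": "186xxxxxxxx",
        "location": "Zhongguancun, Haidian, Beijing",
    }
]

# The service data set is simply the list of normalized records.
service_data_set = [normalize(item) for item in raw_items]
```

Normalizing at collection time means the later steps (label extraction, scoring, indexing) can rely on every record having the same fields regardless of which platform it came from.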
102. Generate a service data set from the collected service information, extract the labels of each service in the service data set, and generate a service label library from the labels of all services.
For example, the relevant labels of each service in the service data set are extracted by word segmentation.
For example, the relevant labels of each service in the service data set are extracted by Chinese word segmentation. The word segmentation system for Chinese word segmentation needs to call a custom dictionary. The custom dictionary contains specific words and their part-of-speech tags and acts as a submodule of the word segmentation system; only on the basis of the custom dictionary can the segmentation algorithm cut a sentence into separate words. The comprehensiveness of the custom dictionary affects the accuracy of segmentation, so the custom dictionary must be updatable, must be able to accumulate entries, and must fit the service information/forum concerned.
That is, meaningless words in the service data of the service data set are removed through word segmentation.
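Dictionary-driven label extraction can be sketched as follows. This is an illustration under stated assumptions, not the segmentation system of the embodiment: it uses greedy forward maximum matching over whitespace-separated words as a stand-in for Chinese word segmentation (which works on characters), and the dictionary contents, the part-of-speech tags "n" and "stop", and the function name `extract_labels` are all hypothetical.

```python
# Custom dictionary: entry -> part-of-speech tag. Entries tagged "stop"
# are treated as meaningless words and dropped. (Hypothetical contents.)
CUSTOM_DICTIONARY = {
    "android": "n",
    "app development": "n",
    "service": "stop",
}

# Longest dictionary entry, measured in words.
MAX_WORDS = max(len(entry.split()) for entry in CUSTOM_DICTIONARY)


def extract_labels(text: str) -> list[str]:
    """Greedy forward maximum matching against the custom dictionary."""
    words = text.split()
    labels = []
    i = 0
    while i < len(words):
        # Try the longest candidate first, then shrink.
        for size in range(min(MAX_WORDS, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + size])
            tag = CUSTOM_DICTIONARY.get(candidate.lower())
            if tag is not None:
                if tag != "stop":
                    labels.append(candidate)
                i += size
                break
        else:
            # Word not in the dictionary: skip it.
            i += 1
    return labels


labels = extract_labels("Android APP development service")
# labels == ["Android", "APP development"]
```

The sketch also shows why the dictionary must be kept up to date: a word that is missing from it is silently skipped and can never become a label.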
103. Obtain the transaction data of all services, and obtain from the transaction data the service user information of all services and the transaction price of each service.
For example, the transaction data includes all transactions in which a service was purchased, recorded one by one. If user X purchases user Y's Android APP development service, this indicates that user X recognizes user Y's ability in Android APP development. In step 102 above, word segmentation of the Android APP development service yields two labels: "Android" and "APP development". Two records are therefore stored: (1) service buyer: X, service provider: Y, label: Android; (2) service buyer: X, service provider: Y, label: APP development. These records are used to generate the matrix A in step 104.
When the score for APP development is calculated, a record can be found in which X is the service buyer, Y is the service provider, and the label is APP development, so $A_{xy} = 1$.
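The per-label records described above can be sketched as follows. The names `LabelRecord`, `expand_transaction`, and `adjacency_entry` are hypothetical helpers introduced only for illustration.

```python
from typing import NamedTuple


class LabelRecord(NamedTuple):
    """One stored record: who bought from whom, under which label."""
    buyer: str
    provider: str
    label: str


def expand_transaction(buyer: str, provider: str, labels: list[str]) -> list[LabelRecord]:
    """Store one record per label of the purchased service."""
    return [LabelRecord(buyer, provider, label) for label in labels]


def adjacency_entry(records: list[LabelRecord], label: str, buyer: str, provider: str) -> int:
    """Value of the matrix element for (buyer, provider) under one label."""
    return 1 if LabelRecord(buyer, provider, label) in records else 0


# User X buys user Y's Android APP development service.
records = expand_transaction("X", "Y", ["Android", "APP development"])

a_xy = adjacency_entry(records, "APP development", "X", "Y")  # 1
a_yx = adjacency_entry(records, "APP development", "Y", "X")  # 0
```

Because one record is kept per label, a separate matrix can be built for each label from the same transaction history.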
It should be noted that this embodiment may obtain the transaction data of all services within a period of time, for example within the last month, the last half year, or the last year.
104. Obtain, according to the service user information, the transaction price of the service, and the labels of the service, the score of the label corresponding to each service in the service label library.
It will be understood that the score of each user (including both providers and users of services) and of each service for the different labels can be calculated from the service providers, the transaction data, the service users, and so on.
105. Generate the service index database according to the labels corresponding to the services and the scores of the labels.
That is, the weight of each label of a service is the score of the service's provider for that label; in other words, each row of the finally calculated matrix of n rows and 1 column is the score of the corresponding user.
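One possible in-memory layout for such an index is category, then label, then a list of (service name, score) pairs. The sketch below is an assumption about layout only; the function name `build_index`, the dictionary keys, the category "software", and the numeric scores are hypothetical and are not taken from the original text.

```python
from collections import defaultdict


def build_index(services: list[dict], provider_scores: dict) -> dict:
    """Build category -> label -> [(service name, score)].

    The score attached to each label of a service is the score of the
    service's provider for that label; 0.0 is used when no score exists.
    """
    index = defaultdict(lambda: defaultdict(list))
    for service in services:
        for label in service["labels"]:
            score = provider_scores.get((service["provider"], label), 0.0)
            index[service["category"]][label].append((service["name"], score))
    return index


# Hypothetical input: one service and made-up provider scores.
services = [
    {
        "name": "Android APP development",
        "category": "software",
        "provider": "Y",
        "labels": ["Android", "APP development"],
    }
]
provider_scores = {("Y", "Android"): 0.6, ("Y", "APP development"): 0.4}

index = build_index(services, provider_scores)
# index["software"]["Android"] == [("Android APP development", 0.6)]
```

Keying the scores by (provider, label) rather than by service reflects the statement that a label's weight comes from the provider, so a new service by the same provider inherits the provider's existing scores.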
The method for establishing a service index database of this embodiment collects service information, obtains the labels of the services, and obtains, from the real transaction data and the labels of the services, a service index database that includes the service labels and the label scores. Services are then searched according to the service index database, so that the services a user needs can be provided to the user accurately and reasonably.
It should be noted that the service index database established in this embodiment may be updated periodically, for example once a week, or once every two days according to the method shown in Fig. 1; this embodiment is given by way of example only.
For example, step 104 above may specifically include the following sub-steps, which are not shown in the figures:
1041. Obtain the number n of all users related to the label of the service, and generate an n-order matrix $A$;
1042. Determine the value of each element $A_{ij}$ of the n-order matrix $A$: if user i has used a service of user j, then $A_{ij} = 1$; otherwise $A_{ij} = 0$;
1043. Convert the n-order matrix $A$ into another matrix $A'$ according to a preset damping coefficient m;
Let $X = (1, 1, \ldots, 1)^{T}$ and calculate the convergence value of the matrix iteration, $\lim_{k \to \infty} A'^{k} X$, where k is the number of iterations. The value of $\lim_{k \to \infty} A'^{k} X$ is the ranking score of the service for the specific label, and X is the n-dimensional column vector of ones, written as the transpose of a row vector. When performing the matrix multiplication, a distributed system may be used to speed up the calculation.
For example, the damping coefficient m ranges from 0.15 to 0.25, and its value is adjusted according to the number of users/scale and the social network structure/social network characteristics. Preferably, the damping coefficient m is 0.2, in which case, for example, $A'_{ij} = 0.8 A_{ij} + 0.2$.
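Sub-steps 1041 to 1043 and the iteration can be sketched in plain Python as follows. Several points are assumptions rather than statements of the original text: the iterate is rescaled to sum to 1 at every step (without some normalization the repeated product does not stay bounded in general); the damped matrix is computed as $A'_{ij} = (1 - m) A_{ij} + m$, which matches the example for m = 0.2; and the hypothetical flag `credit_provider` selects whether $A'$ or its transpose is iterated. Read literally, $A' X$ sums each row and therefore accumulates score on the user of a service, while iterating the transpose accumulates it on the provider. All function names are hypothetical.

```python
def build_matrix(users: list[str], pairs: list[tuple[str, str]]) -> list[list[float]]:
    """A[i][j] = 1 if user i has used a service of user j, else 0."""
    position = {user: k for k, user in enumerate(users)}
    n = len(users)
    a = [[0.0] * n for _ in range(n)]
    for buyer, provider in pairs:
        a[position[buyer]][position[provider]] = 1.0
    return a


def damp(a: list[list[float]], m: float = 0.2) -> list[list[float]]:
    """A'[i][j] = (1 - m) * A[i][j] + m (assumed general form)."""
    return [[(1.0 - m) * value + m for value in row] for row in a]


def iterate_scores(a_prime, credit_provider=False, tol=1e-9, max_iter=1000):
    """Repeatedly apply A' (or its transpose) to X = (1, ..., 1)^T.

    Each iterate is rescaled to sum to 1; this is an assumption added so
    that the sequence converges.
    """
    n = len(a_prime)
    x = [1.0] * n
    for _ in range(max_iter):
        if credit_provider:
            y = [sum(a_prime[j][i] * x[j] for j in range(n)) for i in range(n)]
        else:
            y = [sum(a_prime[i][j] * x[j] for j in range(n)) for i in range(n)]
        total = sum(y)
        y = [value / total for value in y]
        if max(abs(p - q) for p, q in zip(x, y)) < tol:
            return y
        x = y
    return x


# Hypothetical data for one label: X and Z have both bought from Y.
users = ["X", "Y", "Z"]
a_prime = damp(build_matrix(users, [("X", "Y"), ("Z", "Y")]), m=0.2)

literal_scores = iterate_scores(a_prime)
provider_scores = iterate_scores(a_prime, credit_provider=True)
```

With this data, the transposed variant gives Y the highest score, which agrees with the stated aim that a person who serves more people has a stronger service ability; the literal variant is kept so that both readings of the formula can be compared.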
Fig. 2 shows a schematic flowchart of the service search method provided by an embodiment of the present invention. As shown in Fig. 2, the service search method of this embodiment is as follows.
201. Receive a service query request input by a user, the service query request including keyword information of the service to be queried.
202. Determine, according to the service query request, the category of the service to be queried and the label of the service to be queried that corresponds to the keyword information.
For example, the labels of the service to be queried may be extracted from the keyword information by word segmentation, and the category of the service to be queried may be determined from the extracted labels. For example, word segmentation for a specific language is used. Taking Chinese word segmentation as an example, the word segmentation system needs to call a custom dictionary. The custom dictionary contains specific words and their part-of-speech tags and acts as a submodule of the word segmentation system; only on the basis of the custom dictionary can the segmentation algorithm cut a sentence into separate words. The comprehensiveness of the custom dictionary affects the accuracy of segmentation, so the custom dictionary must be updatable, must be able to accumulate entries, and must fit the service information/forum concerned.
It will be understood that information such as the category, region, distance, and price range of the service the user wants to query is obtained, word segmentation is performed on the keyword information, meaningless information is removed, and the remaining words form the search labels of the service to be queried.
203. Obtain from the service index database, according to the category and the label of the service to be queried, all information of the services under that category that match the label.
It will be understood that the service index database may be the index database established as shown in Fig. 1 above, and may include service information such as the service categories, the labels of the services in each service category, and the scores of the labels.
For example, the services that satisfy the search conditions are selected from the service index database (within the category to which the service to be queried belongs). A service must first satisfy the requirements on category, region, distance, and so on, and must then contain the search labels. The results that satisfy the requirements form a service set.
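The selection described above can be sketched as a two-stage filter. The entry layout (keys `name`, `category`, `region`, `label_scores`), the function name `select_candidates`, and the sample entries are hypothetical; a distance or price-range condition would be added in the same way.

```python
def select_candidates(entries: list[dict], category: str, region: str,
                      search_labels: list[str]) -> list[dict]:
    """Keep entries that match category and region and contain every search label."""
    wanted = set(search_labels)
    return [
        entry for entry in entries
        if entry["category"] == category
        and entry["region"] == region
        and wanted <= set(entry["label_scores"])  # all search labels present
    ]


# Hypothetical index entries with made-up scores.
entries = [
    {"name": "tutoring by A", "category": "education", "region": "Beijing",
     "label_scores": {"Beijing": 20, "high school": 50, "tutor": 25}},
    {"name": "piano lessons", "category": "education", "region": "Beijing",
     "label_scores": {"Beijing": 10, "piano": 30}},
    {"name": "tutoring in Shanghai", "category": "education", "region": "Shanghai",
     "label_scores": {"Shanghai": 15, "high school": 40, "tutor": 20}},
]

service_set = select_candidates(
    entries, "education", "Beijing", ["Beijing", "high school", "tutor"]
)
# Only "tutoring by A" satisfies every condition.
```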
204. Sort all information of the services that match the label according to the scores of the labels, and display the sorted information.
That is, the results in the service set that satisfy the requirements are sorted according to the scores of the labels, and the sorted information is displayed. Specifically, for each service among the results that satisfy the requirements, the scores of its matching labels may be added together, the services are ranked by the resulting sums, and the ranked services are presented.
In general, the ranked services presented to the user may be displayed in whatever form the user needs or prefers.
For example, a user wants to search for a high-school tutor in Beijing. The service query request input by the user is first processed, and word segmentation yields three labels: "Beijing", "high school", and "tutor". The index database is then searched, yielding 325 services that have these three labels.
Next, the 325 services are sorted by the sum of their three label scores. Suppose, for example, that there are three services. Service 1 is a Beijing high-school tutoring service provided by an ordinary user A, with label scores: Beijing 20, high school 50, tutor 25. Service 2 is a Beijing high-school tutoring service provided by a tutoring institution B, with label scores: Beijing 22, high school 40, tutor 40. Service 3 is a Beijing high-school tutoring service provided by a star user C, with label scores: Beijing 25, high school 70, tutor 35. The final order is then Service 3, Service 2, Service 1. Because institution B provides many tutoring services, its tutor score is relatively high; its high-school score is lower than that of user A, but its higher tutor score places it ahead of A. User C ranks first because C's own ability is higher. Finally, the results are arranged and output.
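The ranking in this example can be reproduced with a few lines of code. The scores are those given in the text; the function name `rank_services` and the dictionary layout are hypothetical.

```python
def rank_services(label_scores_by_service: dict, search_labels: list[str]) -> list[tuple[str, int]]:
    """Sum the scores of the search labels per service and sort in descending order."""
    totals = {
        name: sum(scores.get(label, 0) for label in search_labels)
        for name, scores in label_scores_by_service.items()
    }
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


# Label scores of the three example services.
services = {
    "Service 1 (ordinary user A)": {"Beijing": 20, "high school": 50, "tutor": 25},
    "Service 2 (tutoring institution B)": {"Beijing": 22, "high school": 40, "tutor": 40},
    "Service 3 (star user C)": {"Beijing": 25, "high school": 70, "tutor": 35},
}

ranking = rank_services(services, ["Beijing", "high school", "tutor"])
# Totals: Service 3 = 130, Service 2 = 102, Service 1 = 95
```

The sums make the explanation in the text concrete: B's tutor score of 40 outweighs A's advantage of 10 points on "high school", so B (102) ranks ahead of A (95).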
Of course, in a specific application, if the service index database contains no service that matches the label, all services in the service index database are sorted according to the scores of their labels and the sorted information is displayed; alternatively, a prompt indicating that there is no matching content is displayed to the user.
In addition, it should be noted that if the service index database contains no service that matches the label, the service index database may be updated with the category and the label of the service to be queried. The service index database may be updated in the manner shown in Fig. 1 above, and this embodiment does not limit this.
That is, for the service index database, the relevant labels of each service are extracted from information such as the name, category, and description of the service, and each person's service ability for each label is calculated from the service transaction data, for example from the transaction price and the service abilities of the two parties to the transaction. Thus, when a user performs a search, labels are matched according to the keyword information, the results are sorted by the service ability for the corresponding labels, and the processed results are returned.
Fig. 3 shows a terminal display interaction diagram provided by an embodiment of the present invention. As shown in Fig. 3, the terminal display of this embodiment is as follows.
As in the example given for step 204 above, if C, B, and A are the top three of all results after the search, their service names and the scores of each label are displayed for the user to choose from.
The present invention extracts multiple service labels from the service information and calculates each person's service ability for the different labels from the transaction data. Under this calculation, the more people a person serves, and the stronger the service ability of those people themselves, the stronger that person's own service ability. At the same time, each service is associated with its service provider, so that when a service search is performed, the factors that affect the search results and their ranking extend from the information of the service itself to the service ability of the service provider, which makes the search results more accurate.
The service search method of this embodiment associates the search for services with the real service ability of the service provider, and to a certain extent prevents ranking fraud and artificial inflation of rankings. For example, when a user purchases his or her own service through several secondary accounts, the service ability of those accounts is itself weak, so the service ability they can confer is also weak, and is weaker for that label than the ability conferred by users with genuine demand.
Meanwhile for new service, if ISP have accumulated certain point in original some labels in itself
Number, then new service can also lift certain ranking, without being constantly in backmost.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art will understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be replaced with equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope defined by the claims of the present invention.