CN104331429B - The method and device of multiple features dimension quantization is carried out to network object - Google Patents

The method and device of multiple features dimension quantization is carried out to network object Download PDF

Info

Publication number
CN104331429B
CN104331429B CN201410564598.7A CN201410564598A CN104331429B CN 104331429 B CN104331429 B CN 104331429B CN 201410564598 A CN201410564598 A CN 201410564598A CN 104331429 B CN104331429 B CN 104331429B
Authority
CN
China
Prior art keywords
data
characteristic dimension
network object
feature
weight
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410564598.7A
Other languages
Chinese (zh)
Other versions
CN104331429A (en
Inventor
彭可飞
尹红光
刘佳良
詹金林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201410564598.7A priority Critical patent/CN104331429B/en
Publication of CN104331429A publication Critical patent/CN104331429A/en
Application granted granted Critical
Publication of CN104331429B publication Critical patent/CN104331429B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a kind of method and device that multiple features dimension quantization is carried out to network object, this method includes:Determine a network object, obtain and describe data with the corresponding feature of the network object;Different characteristic according to pre-establishing describes the correspondence between data and different characteristic dimension, the feature corresponding with the network object is described data to be matched with characteristic dimension, and each characteristic dimension weight corresponding with the network object is obtained according to matching result;According to each characteristic dimension weight, the quantized value of each characteristic dimension of the network object is obtained.Method using the present invention, is handled by describing data to the feature of network object, forms the quantized value of each characteristic dimension of the network object, intuitively the feature of each dimension of reaction network object.

Description

The method and device of multiple features dimension quantization is carried out to network object
Technical field
This application involves Internet technical field, more particularly to a kind of side that multiple features dimension quantization is carried out to network object Method and device.
Background technology
With the rapid development of Internet technology, there is more and more network objects, such as video text on network Part, audio file, text etc., go to browse or download for users.On the other hand, skill is searched for by web crawlers etc. Art, for different network objects, can search on network and get a large amount of relevant information, such as these networks pair Comment, recommendation or label data of elephant etc., these information are much all based on UGC (User Generated Content, user Generation content) mode generate.
However, for the great deal of related information of network object, particularly UGC data messages, since different web sites are to difference The subjective division of network object, and structural input limitation of the user on different web sites, causing these data can only Isolated is present in different web sites, and is scattered in the different web pages of different web sites, can not produce between each other very big Reference significance, although some websites may also can be integrated and be quantified to some of which information data, in view of different nets The information standard and disunity stood to consolidated network object, therefore be generally also limited only to carry out the relevant information in our station Integrate, and the quantized data after this integration is very single in structure and is difficult to extend, while also result in and measuring Change and be very limited system on the exhibition method of data.
The content of the invention
In view of the above problems, it is proposed that the present invention overcomes the above problem in order to provide one kind or solves at least in part That states problem carries out the method for multiple features dimension quantization and corresponding device to network object.
According to an aspect of the invention, there is provided a kind of method that multiple features dimension quantization is carried out to network object, bag Include:Determine a network object, obtain and describe data with the corresponding feature of the network object;According to the different characteristic pre-established The correspondence between data and different characteristic dimension is described, the feature corresponding with the network object is described into data and spy Sign dimension is matched, and obtains each characteristic dimension weight corresponding with the network object according to matching result;According to described each Characteristic dimension weight, obtains the quantized value of each characteristic dimension of the network object.
Wherein, the step of describing data with the corresponding feature of the network object is obtained, including:According to the network object Type, predefine one or more web search addresses;Obtained by the web search address opposite with the network object The feature answered describes data.
Wherein, the step of describing data with the corresponding feature of the network object is obtained, including:Heterogeneous networks will be come from The feature of address describes data and carries out registration matching;Description data of the registration less than threshold value are deleted, generate final spy Descriptor data set is levied to close.
Wherein, the step of describing data with the corresponding feature of the network object is obtained, including:Heterogeneous networks will be come from The feature of address describes data and carries out registration matching;Data distribution is described accordingly for the feature according to the height of registration Weight, the more high then weight of registration are higher;The feature repeated is described data to delete, generates the feature descriptor data set of Weight Close.
Wherein, the step of describing data with the corresponding feature of the network object is obtained, is further comprised:According to true in advance The fixed weight set for heterogeneous networks address, further describes the feature for coming from heterogeneous networks address the weight of data It is adjusted.
Wherein, the characteristic dimension is determined according to the type of the network object.
Wherein, correspondence between data and different characteristic dimension is described according to the different characteristic pre-established, by institute State feature corresponding with the network object and describe data and matched with characteristic dimension, and obtained and the network according to matching result The step of object corresponding each characteristic dimension weight, including:According to the correspondence, by whole features describe data respectively with Characteristic dimension is matched, and repeated matching is then carried out if the feature repeated describes data;Determined according to matching result to each The ratio of the hit-count of characteristic dimension;According to the ratio of the hit-count, each feature corresponding with the network object is obtained Dimension weight.
Wherein, correspondence between data and different characteristic dimension is described according to the different characteristic pre-established, by institute State feature corresponding with the network object and describe data and matched with characteristic dimension, and obtained and the network according to matching result The step of object corresponding each characteristic dimension weight, including:According to the correspondence, data difference is described into different features Matched with characteristic dimension;The ratio of the hit-count to each characteristic dimension is determined according to matching result;According to different characteristic The weight of data is described, the hit-count ratio of characteristic dimension will be adjusted;According to the ratio of the hit-count after adjustment, Obtain each characteristic dimension weight corresponding with the network object.
Wherein, according to each characteristic dimension weight, the step of obtaining the quantized value of each characteristic dimension of the network object, Including:Obtain score value corresponding with the network object and describe data;To be described with the corresponding score value of the network object data with Each characteristic dimension weight is multiplied respectively, obtains the quantized value of each characteristic dimension of the network object.
Wherein, the step of score value corresponding with the network object describes data is obtained, including:Acquisition comes from heterogeneous networks The score value corresponding with the network object of address describes data, and according to the predetermined power set for heterogeneous networks address Weight, describes data by the score value for coming from heterogeneous networks address and is weighted summation, data are described as final score value.
Wherein, the description data include label data, comment data, and/or key data.
Wherein, further comprise:From described with choosing at least partly spy in the corresponding each characteristic dimension of the network object Levy dimension;According to obtaining quantized value corresponding with the characteristic dimension of the selection, in a manner of multi-dimensional map by the quantized value into Row displaying.
Wherein, further comprise:According to the quantized value of each characteristic dimension of the obtained network object, therefrom choose at least Partial Feature dimension is shown.
According to another aspect of the present invention, there is provided a kind of device that multiple features dimension quantization is carried out to network object, is obtained Module is obtained, for determining a network object, obtains and describes data with the corresponding feature of the network object;Matching module, is used for Different characteristic according to pre-establishing describes the correspondence between data and different characteristic dimension, will the described and network object Corresponding feature describes data and is matched with characteristic dimension, and obtains each spy corresponding with the network object according to matching result Levy dimension weight;Quantization modules, for according to each characteristic dimension weight, obtaining the amount of each characteristic dimension of the network object Change value.
Wherein, the acquisition module includes:Determining module, for the type according to the network object, predefines one A or multiple web search addresses;Acquisition module, it is corresponding with the network object for being obtained by the web search address Feature describes data.
Wherein, the acquisition module includes:First registration matching module, for the spy of heterogeneous networks address will to be come from Sign description data carry out registration matching;First generation module, for description data of the registration less than threshold value to be deleted, generation Final feature descriptor data set closes.
Wherein, the acquisition module includes:Second registration matching module, for the spy of heterogeneous networks address will to be come from Sign description data carry out registration matching;Weight distribution module, for describing data according to the height of registration for the feature Corresponding weight is distributed, the more high then weight of registration is higher;Second generation module, is deleted for the feature repeated to be described data Remove, the feature descriptor data set for generating Weight closes.
Wherein, the acquisition module further comprises:Weight adjusts module, for being directed to different nets according to predetermined The weight that network address is set, the weight that data are further described to the feature for coming from heterogeneous networks address are adjusted.
Wherein, the characteristic dimension is determined according to the type of the network object.
Wherein, the matching module includes:First matching module, for according to the correspondence, whole features to be retouched State data to be matched with characteristic dimension respectively, repeated matching is then carried out if the feature repeated describes data;First ratio Determining module, for determining the ratio of the hit-count to each characteristic dimension according to matching result;First weight obtains module, uses In the ratio according to the hit-count, each characteristic dimension weight corresponding with the network object is obtained.
Wherein, the matching module includes:Second matching module, for according to the correspondence, by different features Description data are matched with characteristic dimension respectively;Second ratio-dependent module, for being determined according to matching result to each feature The ratio of the hit-count of dimension;Ratio adjusts module, will be to characteristic dimension for describing the weight of data according to different characteristic Hit-count ratio be adjusted;Second weight obtains module, for the ratio according to the hit-count after adjustment, obtain with The corresponding each characteristic dimension weight of the network object.
Wherein, the quantization modules, including:Score value acquisition module, is retouched for obtaining score value corresponding with the network object State data;Quantification treatment module, is weighed for that will describe data with the corresponding score value of the network object with each characteristic dimension It is multiplied respectively again, obtains the quantized value of each characteristic dimension of the network object.
Wherein, the score value acquisition module, is further used for:Obtain come from heterogeneous networks address with the network object Corresponding score value describes data, and according to the predetermined weight set for heterogeneous networks address, will come from different nets The score value of network address describes data and is weighted summation, and data are described as final score value.
Wherein, the description data include label data, comment data, and/or key data.
Wherein, further comprise:Module is chosen, for being selected from corresponding each characteristic dimension with the network object Take at least part characteristic dimension;First display module, the quantized value corresponding with the characteristic dimension of the selection obtained for basis, The quantized value is shown in a manner of multi-dimensional map.
Wherein, the second display module, for the quantized value of each characteristic dimension according to the obtained network object, Cong Zhongxuan At least part characteristic dimension is taken to be shown.
Technique according to the invention scheme, there are following technique effect:
The present invention describes the correspondence between data and different characteristic dimension according to the different characteristic pre-established, by net The corresponding feature of network object describes data and is matched with characteristic dimension, obtains each characteristic dimension power corresponding with the network object Weight, and then the quantized value of each characteristic dimension of network object is obtained according to the weight of each characteristic dimension, it can intuitively react net The feature of each characteristic dimension of network object, simultaneously as can according to different network object types and to characteristic dimension into Row increase or decrease, and and then to description data and characteristic dimension between correspondence be adjusted at any time, therefore, the present invention It is as rich as Croesus that the technical solution of offer, which is directed in the structure of quantized data of the relevant information of heterogeneous networks object after integration, Elasticity simultaneously easily extends, this also provides possibility for the more diversified exhibition method of offer.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the embodiment of the present invention.
Brief description of the drawings
Attached drawing described herein is used for providing further understanding of the present application, forms the part of the application, this Shen Schematic description and description please is used to explain the application, does not form the improper restriction to the application.In the accompanying drawings:
Fig. 1 is the main-process stream of the method according to an embodiment of the invention that multiple features dimension quantization is carried out to network object Figure;
Fig. 2 is the step of acquisition according to an embodiment of the invention describes data with the corresponding feature of the network object Flow chart;
Fig. 3 is that the different characteristic that basis according to an embodiment of the invention pre-establishes describes data and different characteristic dimension Correspondence between degree, describes data by the feature corresponding with the network object and is matched with characteristic dimension, and root The flow chart for the step of obtaining each characteristic dimension weight corresponding with the network object according to matching result;
Fig. 4 is according to an embodiment of the invention according to each characteristic dimension weight, obtains each of the network object The flow chart of the step of quantized value of characteristic dimension;
Fig. 5 is the step of acquisition according to another embodiment of the present invention describes data with the corresponding feature of the network object Flow chart;
Fig. 6 is that the different characteristic that basis according to another embodiment of the present invention pre-establishes describes data and different characteristic dimension Correspondence between degree, describes data by the feature corresponding with the network object and is matched with characteristic dimension, and root The flow chart for the step of obtaining each characteristic dimension weight corresponding with the network object according to matching result;
Fig. 7 is the structural frames of the device according to an embodiment of the invention that multiple features dimension quantization is carried out to network object Figure;And
Fig. 8 is the quantized value of each characteristic dimension according to an embodiment of the invention to a film with multi-dimensional map Displaying examples of interfaces figure when mode is shown.
Embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although the disclosure is shown in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure Completely it is communicated to those skilled in the art.
As shown in Figure 1, Fig. 1 is the side according to an embodiment of the invention that multiple features dimension quantization is carried out to network object The general flow chart of method.
At step S110, a network object is determined, obtain and describe data with the corresponding feature of the network object.
Specifically, network object can be determined according to the search of user, e.g., determines to use according to searching request input by user The network object of family search;Or network object can be determined to the access of preset link according to user, e.g., user is to preset chain Click connect etc..
Wherein, network object can include:Resource on internet, can include video file, audio file, text text Part etc., e.g., film, TV, music, article, books etc..Describing data with the corresponding feature of the network object can include: The label data of the network object, comment data, and/or key data etc..Wherein, label data for example can be that user gives birth to Into for label that the network object is recommended, introduces, commented on, for example, substantial amounts of electricity can be searched on network Shadow label;Comment data can be the comment that user carries out the network object;Key data can be to network object Recommendation, comment data carry out the pass for the network object that specifically processing (e.g., carrying out word segmentation processing to comment word) is got Key word.It will be appreciated by those skilled in the art that the embodiment of the present invention not limited to this.
According to one embodiment of present invention, can be obtained using following sub-step corresponding with the network object Feature describes data.
First, according to the type of the network object, one or more web search address is predefined.For example, interconnection The special sub- column for having many special websites either number of site on the net issues or searches for certain types of net for user The feature of network object describes data, e.g., website or the net of information is issued or searched for for offers such as film, TV, books and periodicals, magazines Stand sub- column etc., therefore, the website of data can be described according to the type of network object, predetermined search characteristics, that is, determine One or more web search addresses.
Then, obtained by the web search address and describe data with the corresponding feature of the network object.That is, by true The feature that the fixed one or more web search address obtains the network object describes data.It is pre- according to the type of network object First determine web search address, search can be made to have more directionality, avoid scanning for from incoherent website, improve search effect Rate.Also, the feature that network object is obtained from different network address describes data, can be by from different web sites and network The corresponding feature of object describes data and is compared comprehensive, unified integration, and the number between data is described according to different characteristic The correspondence that magnitude relation and feature are described between data and characteristic dimension quantifies the feature of network object.
In a specific embodiment, above-mentioned sub-step can be realized by web crawlers technology, i.e. by program or Script obtains feature corresponding with the network object and describes data automatically.
With reference to figure 2, Fig. 2 is that acquisition according to an embodiment of the invention is described with the corresponding feature of the network object The flow chart of the step of data.
As shown in Fig. 2, obtaining the step of describing data with the corresponding feature of the network object, following step can be included Suddenly:
Step S210, describes data by the feature for coming from heterogeneous networks address and carries out registration matching.
The feature for coming from heterogeneous networks address describes in data that there may be (identical) data of many repetitions, i.e. Same feature, which describes data, to be occurred repeatedly, wherein, similar data (the same or like data of meaning) can also regard repetition as (identical) data.Specifically, a certain feature describe Data duplication appearance number it is more, or this feature describes data And described with its similar feature data appearance number (overlapping number) it is more, then it is believed that this feature describes the weights of data It is right higher.For example, being directed to a certain portion's film, there is the substantial amounts of label (tag) from each different web sites, in these labels It may include many identical labels or similar label, then the label of the film got can be subjected to registration Match somebody with somebody, the number (coincidence number) that a certain label repeats is more, or time that the label and the label similar to its occur Number is more, then its registration is higher.
Step S220, describes data less than the feature of threshold value by registration and deletes, generate final feature descriptor data set Close.
Specifically, feature of the registration less than the threshold value can be described by data according to pre-set registration threshold value Delete, generate final feature descriptor data set and close.For example, can be according to pre-set coincidence frequency threshold value.Delete and be somebody's turn to do The corresponding feature of network object describes to overlap the data that number is less than the coincidence frequency threshold value in data, generates final feature and retouches State data acquisition system.Data are described less than the feature of threshold value by deleting registration, subsequent process number to be treated can be reduced Data bulk, makes the data processing in subsequent process easier.
It should be noted that during the final feature descriptor data set of generation closes, deletion registration can be included and be less than the threshold value Feature describe whole features of the remaining network object after data and describe data, in other words, including repeating or Similar feature describes data.
Fig. 1 is returned to, after obtaining and describing data with the corresponding feature of the network object, next, in step At S120, the correspondence between data and different characteristic dimension is described according to the different characteristic pre-established, will be described with being somebody's turn to do The corresponding feature of network object describes data and is matched with characteristic dimension, and is obtained and the network object pair according to matching result Each characteristic dimension weight answered.
Specifically, the corresponding characteristic dimension of the network object first can be determined according to the type of the network object, then The correspondence that different characteristic according to pre-establishing is described between data and different characteristic dimension is corresponding by the network object Feature describes data and is matched with characteristic dimension.If for example, the network object is film, the relevant spy of film can be determined Sign dimension is the corresponding characteristic dimension of the network object, for example, the corresponding characteristic dimension of film can include:Audiovisual, plot, think of Think, the characteristic dimension such as atmosphere, or the characteristic dimension of other attributes can also be included, e.g., the dimension such as spectators' gender, age, constellation Degree.Also, can also be according to the type of different network objects, the demand of user etc., feature corresponding to network object Dimension is increased or decreased, great flexibility.
Wherein, the correspondence that different characteristic is described between data and different characteristic dimension can be advance by following steps Establish:
First, multiple features are obtained and describe data, and determine multiple characteristic dimensions.Wherein, the multiple characteristic dimension can To include each multiple characteristic dimension included in one or more features dimension classification respectively.In other words, it is described Multiple characteristic dimensions include one or more features dimension classification, and each of which characteristic dimension classification includes multiple feature dimensions Degree.
Then, it is determined that the multiple feature describes the correspondence of data and the multiple characteristic dimension, for example, can root Determine that feature describes whether data and characteristic dimension have correspondence according to the result that a large amount of historical datas are carried out with statistical analysis. Also, the correspondence that can also be described different characteristic between data and different characteristic dimension is configured to a mapping table.
By taking film label as an example, configure corresponding between each film label (feature describes data) and each characteristic dimension Relation table can be as shown in table 1:
Table 1
Wherein, for each label (feature describes data), pair it is marked with its characteristic dimension with correspondence, That is, marking the label and this feature dimension has correspondence, for example, by the use of " 1 " as the mark with correspondence in table 1, For example, label " comedy " has corresponding relation with characteristic dimension " plot ", then " plot " dimension of " comedy " is marked. It should be noted that the correspondence that Partial Feature describes data and Partial Feature dimension is merely illustrated in table 1.
It should be noted that different characteristic is pre-established in the present invention describes correspondence between data and different characteristic dimension Mode not limited to this.Also, since the correspondence that the different characteristic describes between data and different characteristic dimension is advance Configured, therefore, in actual application, dimension extension can be carried out according to different demands, i.e. increase new feature Dimension, also, during progress dimension extension, it is not necessary to collect relevant feature for newly-increased dimension and describe data, only need to be according to Some features describe data, and the correspondence that complementary features describe data and newly-increased feature dimension is adjusted, the extension of dimension Quite flexible.
With reference to figure 3, Fig. 3 be according to one embodiment of present invention according to the different characteristic that pre-establishes describe data with not With the correspondence between characteristic dimension, data and characteristic dimension progress are described into the feature corresponding with the network object The flow of the step of matching somebody with somebody, and each characteristic dimension weight corresponding with the network object is obtained according to matching result (step S120) Figure.
As shown in figure 3, step S120 may further include following steps:
Whole features according to the correspondence, are described data and are matched respectively with characteristic dimension, such as by step S310 The feature that fruit has repetition describes data and then carries out repeated matching.
Specifically, the different characteristic that can be pre-established according to this describes corresponding between data and different characteristic dimension Acquisition whole features corresponding with the network object are described data and are matched respectively with characteristic dimension by relation, that is, Each feature during the feature descriptor data set of the network object is closed describes data and is matched respectively with characteristic dimension.Wherein, If the feature that whole features describes to there is repetition in data describes data (that is, the same or similar feature describes data), Then carry out repeated matching, that is to say, that no matter describing data in the presence of how much the same or similar features, all respectively with characteristic dimension Matched so that the number of repetition that different characteristic describes data has an impact final quantized value result.
More specifically, it may be determined that whole features of the network object describe to retouch in the feature pre-established in data State feature present in the correspondence of data and different characteristic dimension time and describe data, and there will be feature data are described Matched respectively with each characteristic dimension in the correspondence, if a feature describes data in the feature description pre-established Data then determine to describe data tool with this feature with the correspondence of different characteristic dimension time existing according to the correspondence There is the one or more features dimension of correspondence, and record the network object and the one or more features dimension is hit respectively Once, if the feature repeated describes data, then the feature repeated is described into data and carries out repeated matching with each characteristic dimension.
Step S320, the ratio of the hit-count to each characteristic dimension is determined according to matching result.
Specifically, hit that can be according to each characteristic dimension in the corresponding whole features of the network object describe data Number, determines the hit-count ratio of each characteristic dimension, that is to say, that the whole features for accumulating the network object describe data pair The hit-count of each characteristic dimension, and determine the network object to each characteristic dimension according to the hit-count of each characteristic dimension Hit-count ratio.
Step S330, according to the ratio of the hit-count, obtains each characteristic dimension weight corresponding with the network object.
That is, according to the hit-count ratio of each characteristic dimension, the corresponding each characteristic dimension of the network object is determined Weight.Specifically, how corresponding the weight of hit-count characteristic dimension more be bigger, thus can according to hit-count ratio from Small order is arrived greatly, and the weight of each characteristic dimension is determined in a manner of weight is successively decreased successively, or can be directly by each feature dimensions Weight of the hit-count ratio of degree as each characteristic dimension.And wherein, if the plurality of characteristic dimension includes multiple features Dimension classification, then can determine weight of each characteristic dimension in each affiliated classification respectively according to characteristic dimension classification, Wherein, the sum of weight of each characteristic dimension is equal to 1 in any one characteristic dimension classification.
The corresponding weight of corresponding with the network object each characteristic dimension can be obtained by above-mentioned steps S310~S330, After each characteristic dimension weight corresponding with the network object is obtained, Fig. 1 is next returned, carries out step S130.
At step S130, according to each characteristic dimension weight, the quantization of each characteristic dimension of the network object is obtained Value.
In order to more clearly explain the step, a kind of optional specific embodiment of the step is described below with reference to Fig. 4. Fig. 4 is foundation each characteristic dimension weight according to one embodiment of the invention, obtains each characteristic dimension of the network object Quantized value the step of flow chart.
Step S410, obtains score value corresponding with the network object and describes data.
Specifically, it can obtain and come from the score value corresponding with the network object of heterogeneous networks address and describe data, and According to it is predetermined for heterogeneous networks address set weight, by the score value for coming from heterogeneous networks address describe data into Row weighted sum, data are described as final score value.That is, data are described into the score value from heterogeneous networks address The weighted average score value final as the network object describes data.For example, for a certain portion's film, it can obtain and come from User's scoring score value of the film of heterogeneous networks address, and will be come from according to the weight set in advance for heterogeneous networks address User's scoring score value of heterogeneous networks address is weighted summation, using point that obtained weighted average is final as the film Value.
The above-mentioned weight according to network address describes data to the score value from heterogeneous networks address and is weighted, and obtains Final score value data are described will be more objective and accurate.Wherein, the weight set for heterogeneous networks address, can be with It is configured in advance, for example, the weight of the website is set according to user to the fancy grade or degree of concern of website, that is, Say, the weight of the network address of the website is set, for the network address that user preferences degree is higher or degree of concern is higher, its Corresponding weight can set higher.
Step S420, data will be described with the corresponding score value of the network object and will be multiplied respectively with each characteristic dimension weight, Obtain the quantized value of each characteristic dimension of the network object.
That is, based on the score value of the network object describes data, the corresponding score value of the network object is retouched State each characteristic dimension multiplied by weight of the data respectively with the network object, quantization of the obtained result respectively as each characteristic dimension Value, i.e. the corresponding score value of each characteristic dimension of the network object.
Detailed above in association with Fig. 1~Fig. 4 according to embodiments of the present invention to network object progress multiple features dimension quantization Method idiographic flow.
Multiple features dimension is carried out to network object to according to another embodiment of the present invention with reference to Fig. 1, Fig. 5 and Fig. 6 The method of quantization is described in detail.
As shown in Figure 1, at step S110, a network object is determined, acquisition is retouched with the corresponding feature of the network object State data.
Specifically, network object can be determined according to the search of user, e.g., according in search query input by user Keyword determine network object;Or network object can be determined to the access of preset link according to user, e.g., user is to pre- Click for the access buttons put etc..After determining network object, obtain and describe data, example with the corresponding feature of the network object Such as, it can automatically be obtained using crawlers or script and retouched with the corresponding feature of the network object by web crawlers technology State data.
Fig. 5 is the step of describing data with the corresponding feature of the network object according to the acquisition of another embodiment of the present invention Flow chart.As shown in figure 5, in the present embodiment, obtaining the step of describing data with the corresponding feature of the network object can To further comprise the steps:
Step S510, describes data by the feature for coming from heterogeneous networks address and carries out registration matching.
Specifically, the feature for coming from heterogeneous networks address that can be according to corresponding to the network object is described in data The number that each Data duplication occurs, determines that each feature describes the registration height of data, i.e. comes from the network object is corresponding Being described in the feature of heterogeneous networks in data, identical feature describes the number of Data duplication appearance, wherein, similar data (the same or like data of meaning) can also regard (identical) data of repetition as.A certain feature describes Data duplication appearance Number is more, or this feature describes data and the feature similar with its describes the number (overlapping number) of data appearance more It is more, then it is believed that this feature describe data registration it is higher.
Step S520, describes data for the feature according to the height of registration and distributes corresponding weight, registration is higher Then weight is higher.
That is, describing the registration height of data according to feature, the corresponding power of data distribution is described for each feature Weight, the higher feature of registration describe data, then the weight for its distribution is higher.In a specific embodiment, can basis Each feature that feature corresponding with the network object is described in data describes the registrations of data ranking from high to low, with weight The mode successively decreased successively describes data for each feature and distributes corresponding weight, wherein, identical or similar feature describes number According to same data are considered as, identical weight is distributed for it.
Step S530, describes data by the feature repeated and deletes, the feature descriptor data set for generating Weight closes.
Specifically, the corresponding feature of the network object can be described in data repeat feature describe data (including Identical or similar feature describes data) one of them is only stayed, other data repeated are deleted, the feature for generating Weight is retouched State data acquisition system, that is to say, that the corresponding feature of the network object is described into the same or similar feature in data and describes data One of them is only taken, remaining deletion, i.e. there is no the same or similar feature in the feature descriptor data set of generation to describe number According to.In this way, different features just to be described to the number of Data duplication appearance, the weight that different characteristic describes data is converted into, letter Follow-up data handling procedure is changed.
Alternatively, step S540 can also be included after step S530, according to predetermined for heterogeneous networks The weight that location is set, the weight that data are further described to the feature for coming from heterogeneous networks address are adjusted.
Specifically, weight can be set for heterogeneous networks address in advance, for example, hobby that can be according to user to website Degree or degree of concern set the weight of different websites, that is to say, that the weight of different network address are set, for user The network address that fancy grade is higher or degree of concern is higher, the weight corresponding to it can set higher.According to different nets The weight that the weight of network address describes each feature data is adjusted, and has considered fancy grade or pass of the user to website Influence of the note degree to quantized result, enables to the quantized value of each characteristic dimension that finally obtains more accurate.
, can be further to coming from heterogeneous networks according to the predetermined weight set for heterogeneous networks address The weight that the feature of location describes data is adjusted, in other words, according to each feature describe data from different networks Weight corresponding to address, the weight that data are described to each feature are adjusted, and specifically, can be retouched according to each feature Ratio/ratio relation between the weight corresponding to the heterogeneous networks address of data is stated, adjusts the power that each feature describes data Weight.
At step S120, the corresponding pass between data and different characteristic dimension is described according to the different characteristic pre-established System, describes data by the feature corresponding with the network object and is matched with characteristic dimension, and obtained according to matching result Each characteristic dimension weight corresponding with the network object.
Based on the above embodiments, step S120 may further include following step:
As shown in fig. 6, the different characteristic to be pre-established according to the basis of another embodiment of the present invention describe data with Correspondence between different characteristic dimension, describes data by the feature corresponding with the network object and is carried out with characteristic dimension The flow of the step of matching, and each characteristic dimension weight corresponding with the network object is obtained according to matching result (step S120) Figure.
Step S610, according to the correspondence, describes data by different features and is matched respectively with characteristic dimension.
Specifically, the different characteristic that can be pre-established according to this describes corresponding between data and different characteristic dimension Relation, describes data by acquisition different feature corresponding from the network object and is matched respectively with characteristic dimension.More For body, it may be determined that the feature descriptor data set of the network object close in the feature pre-established describe data from it is different Feature present in the correspondence of characteristic dimension time describes data, and there will be feature to describe data corresponding with this respectively Each characteristic dimension in relation is matched, if a feature describes data describes data and different spies in the feature pre-established Levy and exist in the correspondence of dimension time, then determine that describing data with this feature has correspondence according to the correspondence One or more features dimension, and record the network object and the one or more features dimension is hit once respectively, so that really The fixed corresponding different feature of the network object describes hit-count of the data to each characteristic dimension.
Step S620, the ratio of the hit-count to each characteristic dimension is determined according to matching result.
That is, hit of the data to each characteristic dimension time is described according to the corresponding different feature of the network object Number, determines hit-count ratio of the network object to each characteristic dimension.
Step S630, the weight of data is described according to different characteristic, and the hit-count ratio of characteristic dimension will be adjusted It is whole.
In step S530 or step S530~S540 above, the feature repeated data are described into and has deleted generation band The feature descriptor data set of weight closes, that is to say, that there is no the number repeated during the feature descriptor data set of the network object closes According to, and each feature describes data and has corresponding weight, therefore, the corresponding power of data can be described according to different characteristic Weight, will be adjusted the ratio of the hit-count of characteristic dimension.Specifically, data can be described according to each different characteristic Weight between proportionate relationship/ratio relation, adjust the hit-count ratio to each characteristic dimension.
Step S640, according to the ratio of the hit-count after adjustment, obtains each characteristic dimension corresponding with the network object Weight.
That is, according to the hit-count ratio to each characteristic dimension after adjustment, determine that the network object is corresponding The weight of each characteristic dimension.Specifically, can according to after adjustment to each characteristic dimension hit-count ratio from big to small Sequentially, the weight of each characteristic dimension is determined in a manner of weight is successively decreased successively, or can be directly by each characteristic dimension after adjustment Weight of the hit-count ratio as each characteristic dimension.And wherein, if the corresponding characteristic dimension of the network object includes Multiple characteristic dimension classifications, then can determine each characteristic dimension in each affiliated classification respectively according to characteristic dimension classification Weight.
At step S130, according to each characteristic dimension weight, the quantization of each characteristic dimension of the network object is obtained Value.
Specifically, it can obtain and describe data with the corresponding score value of the network object, and will be opposite with the network object The score value answered describes data and is multiplied respectively with each characteristic dimension weight, obtains the quantized value of each characteristic dimension of the network object. That is, based on the score value of the network object describes data, data point are described into the network object corresponding score value Not with each characteristic dimension multiplied by weight of the network object, quantized value of the obtained result respectively as each characteristic dimension.
By above-mentioned steps S110~S130, the quantized value of each characteristic dimension of network object can be obtained, so that more straight The feature of some dimension of the reaction network object of sight.Meanwhile can be according to different network object types and to characteristic dimension Increased or decreased, and and then the correspondence between description data and characteristic dimension is adjusted at any time.Therefore, for High resilience and easily extended very much in the structure of quantized data of the relevant information of heterogeneous networks object after integration, be More diversified exhibition method is provided and provides possibility.
According to one embodiment of present invention, which can also be into one Step includes the quantized value of each characteristic dimension according to the obtained network object, therefrom chooses at least part characteristic dimension and is opened up The step of showing.
Specifically, can according to the quantized value of each characteristic dimension of the obtained network object, therefrom selected part compared with Excellent characteristic dimension (for example, quantized value higher characteristic dimension) is shown, for example, to Mr. Yu portion film, its " plot " dimension The score value of degree is higher, then " plot " dimension can be carried out protruding displaying, for example, illustrating with word, image etc..
According to another embodiment of the invention, which can also be into One step comprises the following steps:
First, from described with choosing at least part characteristic dimension in the corresponding each characteristic dimension of the network object.For example, If the corresponding each characteristic dimension of the network object includes one or more features dimension classification, can therefrom choose at least one Characteristic dimension classification, i.e. choose each characteristic dimension that at least one characteristic dimension classification is included.
Then, according to quantized value corresponding with the characteristic dimension of the selection is obtained, by the amount in a manner of multi-dimensional map Change value is shown.
According to one embodiment of present invention, can according to the quantity of the characteristic dimension of selection, generate respective numbers from The ray that same origin is sent but direction is different, further according to the corresponding quantized value of each characteristic dimension of selection, determines each Position of the quantized value on corresponding ray, and the corresponding position of quantized value between adjacent ray is connected, generation shows the net The multi-dimensional map of the quantized value of each characteristic dimension of network object.Wherein, the angle between adjacent ray is equal, so that generation is more Dimension figure more intuitively reflects the magnitude relationship of each quantized value.
Specifically, the quantized value of each characteristic dimension according to unit length and the correspondence of quantized value, can be determined The corresponding position on corresponding ray, reconnects the corresponding position of quantized value between adjacent ray, and the net is shown with generation The multi-dimensional map of the quantized value of each characteristic dimension of network object.Wherein, the correspondence of unit length and quantized value can be set in advance Put, if for example, quantized value is score value, can be with the corresponding score value of setting unit length.
More specifically, the quantization of each characteristic dimension can be determined according to unit length and the correspondence of quantized value It is worth the quantity of corresponding unit length, i.e. determine that the quantized value of each characteristic dimension corresponds to how many a unit lengths respectively, so Afterwards, according to the quantity of the corresponding unit length of quantized value of definite each characteristic dimension, determine each quantized value corresponding Position on ray.Alternatively, multiple points can also be set on every ray, wherein, on every ray, each two is adjacent Point between distance it is equal, in other words, the line segment length all same on every ray between two neighboring point, and each two phase The length of line segment between adjacent point is equal to the unit length.It is corresponding according to the quantized value of definite each characteristic dimension The quantity of unit length, you can determine the quantized value of each characteristic dimension point corresponding on its corresponding ray, the correspondence Point be position of the quantized value of this feature dimension on corresponding ray.Determine each quantized value on corresponding ray Behind position, the position corresponding to the quantized value between adjacent ray is connected, you can generation shows each feature dimensions of the network object The multi-dimensional map of the quantized value of degree.Wherein, one can be encircled into after connecting the position corresponding to the quantized value between adjacent ray Polygon, the quantity of characteristic dimension of the quantity on the side of polygon with choosing is identical, if for example, the characteristic dimension quantity chosen is N, the then amount of radiation generated are N, if connecting and can enclosing the position corresponding to quantized value on every two adjacent rays Form a N sides shape.The polygon being encircled into by this, is capable of the feature of each characteristic dimension of reaction network object.
It is alternatively possible to the region for connecting and being enclosed is set to be shown with foreground, that is to say, that by N bars The N sides shape that position line on ray corresponding to quantized value is enclosed is shown with foreground.Pass through this display side Formula, can be distinct represent the magnitude relationship of each characteristic dimension quantized value, and reflect some dimension of the network object Feature.Fig. 8 is the quantized value of each characteristic dimension according to an embodiment of the invention to a film with the side of multi-dimensional map Displaying examples of interfaces figure when formula is shown, as can be seen from Figure 8, the plot of the film, which is deduced, more to be protruded.
According to one embodiment of present invention, M unit length can also be chosen in every ray of generation, will be adjacent The corresponding position of same units length between ray connects, and is shown with background colour.
Specifically, if the amount of radiation of generation is N, M unit length is chosen respectively on every ray of N bar rays Degree, from the common origin of the ray of generation, every a unit length, by same units length pair on every two adjacent rays The position answered connects, formed area gradually increases from the inside to the outside M N while shape (while quantity and the characteristic dimension of selection Quantity it is identical), the M N sides shape, formed a net structure figure, the figure of formation can be shown with background colour, Enable a user to more intuitively judge that each characteristic dimension corresponds to the magnitude relationship of quantized value, so as to know the network pair The more prominent characteristic dimension of elephant.
According to one embodiment of present invention, can also be opened up in the side of the ray of generation (including upper and lower, left and right side) Multiple figures are shown with, wherein each figure corresponds to a characteristic dimension not characterized by the ray.That is, penetrated unused The characteristic dimension of line displaying, is shown with other figures, wherein each figure corresponds to a characteristic dimension.So that user is also It will appreciate that the network object others characteristic dimension.
Alternatively, the corresponding characteristic dimension of each figure can be more excellent in the characteristic dimension classification belonging to this feature dimension Characteristic dimension, i.e. each figure corresponds in one characteristic dimension classification of displaying preferably dimension, for example, can be such as Fig. 8 institutes Show, in the figure shown below the ray of generation, the figure of " Libra favorite " represents be used as characteristic dimension using constellation Dimension classification in, the user of Libra may be the user for most liking the film, and the figure of " male audience is in the majority " represents to exist In dimension classification using gender as characteristic dimension, liking may be in the majority with male audience in the user of the film.This mode More prominent dimension of the network object in other each different dimensions classifications can be embodied, so as to from a variety of differences Angle, provides the decision information of selection network object to the user.
, can also be according to the control instruction received, by the multiple figure along orientation at displaying interface based on this It is middle to roll displaying.That is, according to the control instruction received, by multiple other characteristic dimensions of displaying in interface is shown Figure is scrolled along the direction (such as direction or from left to right direction from top to bottom) of arrangement, wherein, the control received System instruction, can be instruction input by user, or pre-set control instruction, for example, pre-setting when user selects When selecting the multiple features dimension present graphical for checking network object, that is, scrolled, alternatively, carrying out once at predetermined time intervals Scrolling display.
The quantized value of each characteristic dimension of network object is shown by the exhibition method of above-mentioned multi-dimensional map, Neng Gouzhi The feature of each characteristic dimension of ground reaction network object is seen, so as to obtain the decision information of network object as user.
Present invention also offers a kind of device that multiple features dimension quantization is carried out to network object.As shown in fig. 7, Fig. 7 is The structure diagram of the device 700 according to an embodiment of the invention that multiple features dimension quantization is carried out to network object.
Device 700 includes:Obtain module 710, matching module 720 and quantization modules 730.
Obtain module 710 and be determined for a network object, obtain and describe number with the corresponding feature of the network object According to.Matching module 720 can be used for being described according to the different characteristic pre-established corresponding between data and different characteristic dimension Relation, describes data by the feature corresponding with the network object and is matched with characteristic dimension, and is obtained according to matching result Obtain each characteristic dimension weight corresponding with the network object.Quantization modules 730 can be used for according to each characteristic dimension weight, Obtain the quantized value of each characteristic dimension of the network object.
In the apparatus according to the invention 700, the acquisition module 710 can include:Determining module and acquisition module.
Wherein it is determined that module can be used for the type according to the network object, predefine one or more network and search Rope address.Acquisition module can be used for obtaining by the web search address describes number with the corresponding feature of the network object According to.
According to another embodiment of the invention, the acquisition module 710 can include:First registration matching module and First generation module.
Wherein, the first registration matching module can be used for the feature for coming from heterogeneous networks address describing data progress Registration matches.First generation module can be used for deleting description data of the registration less than threshold value, generate final feature Descriptor data set closes.
According to one embodiment of present invention, the matching module 720 can include:First matching module, the first ratio Determining module and the first weight obtain module.
Specifically, the first matching module can be used for according to the correspondence, by whole features describe data respectively with Characteristic dimension is matched, and repeated matching is then carried out if the feature repeated describes data.First ratio-dependent module can be with For determining the ratio of the hit-count to each characteristic dimension according to matching result.First weight obtains module and can be used for basis The ratio of the hit-count, obtains each characteristic dimension weight corresponding with the network object.
According to another embodiment of the invention, the acquisition module 710 can include:Second registration matching module is weighed Reallocation module and the second generation module.
Wherein, the second registration matching module can be used for the feature for coming from heterogeneous networks address describing data progress Registration matches.Weight distribution module can be used for describing the corresponding power of data distribution according to the height of registration for the feature Weight, the more high then weight of registration are higher.Second generation module can be used for the feature repeated describing data deletion, generate cum rights The feature descriptor data set of weight closes.
Further, in the apparatus according to the invention 700, the acquisition module 710 can further include:Weight Adjust module, the module can be used for according to it is predetermined for heterogeneous networks address set weight, further to from The weight that feature in heterogeneous networks address describes data is adjusted.
According to another embodiment of the invention, the matching module can include:Second matching module, the second ratio are true Cover half block, ratio adjustment module and the second weight obtain module.
Wherein, the second matching module can be used for according to the correspondence, by different features describe data respectively with Characteristic dimension is matched.Second ratio-dependent module can be used for determining the hit time to each characteristic dimension according to matching result Several ratios.Ratio adjustment module can be used for the weight that data are described according to different characteristic, by the hit time to characteristic dimension Number ratio is adjusted.Second weight obtains module, for the ratio according to the hit-count after adjustment, obtains and the network pair As corresponding each characteristic dimension weight.
In the apparatus according to the invention 700, the characteristic dimension is determined according to the type of the network object.
In the apparatus according to the invention 700, the quantization modules 730 can include:At score value acquisition module and quantization Manage module.
Wherein, score value acquisition module, which can be used for obtaining corresponding with network object score value, describes data;Quantification treatment Module can be used for that data will be described with the corresponding score value of the network object being multiplied respectively with each characteristic dimension weight, obtain To the quantized value of each characteristic dimension of the network object.
Score value acquisition module, which can be further used for obtaining, comes from the corresponding with the network object of heterogeneous networks address Score value describes data, and according to the predetermined weight set for heterogeneous networks address, will come from heterogeneous networks address Score value describe data and be weighted summation, data are described as final score value.
In the apparatus according to the invention 700, the feature describe data can include label data, comment data, and/ Or key data.
In the apparatus according to the invention 700, it can further include:Choose module and the first display module.
Specifically, module is chosen to can be used for from described with choosing at least in the corresponding each characteristic dimension of the network object Partial Feature dimension.First display module can be used for according to obtained quantized value corresponding with the characteristic dimension of the selection, with The quantized value is shown by the mode of multi-dimensional map.
In the apparatus according to the invention 700, it can further include:Second display module, obtains for basis The quantized value of each characteristic dimension of the network object, therefrom chooses at least part characteristic dimension and is shown.
In the apparatus according to the invention 700, the second display module can be further used for:Setting connects enclosing shape Into region be shown with foreground.
In the apparatus according to the invention 700, the second display module may further include:Choose submodule be connected and Show submodule.Wherein, submodule is chosen to can be used for choosing M unit length in every ray of generation.Connection and exhibition Show that module can be used for connecting the corresponding position of same units length between adjacent ray, and opened up with background colour Show.
In the apparatus according to the invention 700, the angle between the adjacent ray is identical.
In the apparatus according to the invention 700, the second display module can further include:Figure shows submodule, For the side of the ray in generation, displaying has multiple figures, wherein what each figure correspondence one was not characterized by the ray Characteristic dimension.
According to one embodiment of present invention, the figure displaying submodule can also be further used for:According to receiving Control instruction, the multiple figure rolls displaying along orientation in interface is shown.
The device described above that multiple features dimension quantization is carried out to network object with before describe to network object into The processing for the method that row multiple features dimension quantifies is corresponding, accordingly, with respect to more detailed ins and outs, is retouched before may refer to The method stated.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with teaching based on this.As described above, required by constructing this kind of system Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that it can utilize various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair Bright preferred forms.
In the specification that this place provides, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice in the case of these no details.In some instances, known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help to understand one or more of each inventive aspect, Above in the description to the exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor The application claims of shield features more more than the feature being expressly recited in each claim.It is more precisely, such as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following embodiment are expressly incorporated in the embodiment, wherein each claim is in itself Separate embodiments all as the present invention.
Those skilled in the art, which are appreciated that, to carry out adaptively the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment Member or component be combined into a module or unit or component, and can be divided into addition multiple submodule or subelement or Sub-component.In addition at least some in such feature and/or process or unit exclude each other, it can use any Combination is disclosed to all features disclosed in this specification (including adjoint claim, summary and attached drawing) and so to appoint Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification (including adjoint power Profit requires, summary and attached drawing) disclosed in each feature can be by providing the alternative features of identical, equivalent or similar purpose come generation Replace.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included some features rather than further feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed is appointed One of meaning mode can use in any combination.
The all parts embodiment of the present invention can be with hardware realization, or to be run on one or more processor Software module realize, or realized with combinations thereof.It will be understood by those of skill in the art that it can use in practice Microprocessor or digital signal processor (DSP) are according to embodiments of the present invention to network object progress multiple features dimension to realize The some or all functions of some or all components in the device of metrization.The present invention is also implemented as being used to perform The some or all equipment or program of device of method as described herein are (for example, computer program and computer journey Sequence product).It is such realize the present invention program can store on a computer-readable medium, either can have one or The form of multiple signals.Such signal can be downloaded from internet website and obtained, either provide on carrier signal or There is provided in the form of any other.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims, Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" does not exclude the presence of not Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can be by means of including the hardware of some different elements and being come by means of properly programmed computer real It is existing.In if the unit claim of equipment for drying is listed, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and run after fame Claim.

Claims (26)

1. a kind of method that multiple features dimension quantization is carried out to network object, including:
Determine a network object, obtain and describe data with the corresponding feature of the network object;
Different characteristic according to pre-establishing describes the correspondence between data and different characteristic dimension, will the described network pair Matched as corresponding feature describes data characteristic dimension corresponding with the described network object, and obtained according to matching result Each characteristic dimension weight corresponding with the network object;
According to each characteristic dimension weight, the quantized value of each characteristic dimension of the network object is obtained.
2. the method as described in claim 1, it is characterised in that obtain and describe data with the corresponding feature of the network object Step, including:
According to the type of the network object, one or more web search address is predefined;
Obtained by the web search address and describe data with the corresponding feature of the network object.
3. the method as described in claim 1, it is characterised in that obtain and describe data with the corresponding feature of the network object Step, including:
The feature for coming from heterogeneous networks address is described into data and carries out registration matching;
Description data of the registration less than threshold value are deleted, final feature descriptor data set is generated and closes.
4. the method as described in claim 1, it is characterised in that obtain and describe data with the corresponding feature of the network object Step, including:
The feature for coming from heterogeneous networks address is described into data and carries out registration matching;
Data are described for the feature according to the height of registration and distribute corresponding weight, the more high then weight of registration is higher;
The feature repeated is described data to delete, the feature descriptor data set for generating Weight closes.
5. method as claimed in claim 4, it is characterised in that obtain and describe data with the corresponding feature of the network object Step, further comprises:
According to the predetermined weight set for heterogeneous networks address, further to coming from the feature of heterogeneous networks address The weight of description data is adjusted.
6. the method as described in claim 1, it is characterised in that the characteristic dimension is true according to the type of the network object Fixed.
7. the method as described in claim 1 or 3, it is characterised in that
Different characteristic according to pre-establishing describes the correspondence between data and different characteristic dimension, will the described network pair Matched as corresponding feature describes data characteristic dimension corresponding with the described network object, and obtained according to matching result The step of each characteristic dimension weight corresponding with the network object, including:
According to the correspondence, whole features are described into data and are matched respectively with characteristic dimension, if the spy repeated Sign description data then carry out repeated matching;
The ratio of the hit-count to each characteristic dimension is determined according to matching result;
According to the ratio of the hit-count, each characteristic dimension weight corresponding with the network object is obtained.
8. method as claimed in claim 4, it is characterised in that
Different characteristic according to pre-establishing describes the correspondence between data and different characteristic dimension, will the described network pair Matched as corresponding feature describes data characteristic dimension corresponding with the described network object, and obtained according to matching result The step of each characteristic dimension weight corresponding with the network object, including:
According to the correspondence, different features is described into data and is matched respectively with characteristic dimension;
The ratio of the hit-count to each characteristic dimension is determined according to matching result;
The weight of data is described according to different characteristic, the hit-count ratio of characteristic dimension will be adjusted;
According to the ratio of the hit-count after adjustment, each characteristic dimension weight corresponding with the network object is obtained.
9. the method as described in claim 1, it is characterised in that according to each characteristic dimension weight, obtain the network object Each characteristic dimension quantized value the step of, including:
Obtain score value corresponding with the network object and describe data;
Data will be described with the corresponding score value of the network object to be multiplied respectively with each characteristic dimension weight, obtain the network The quantized value of each characteristic dimension of object.
10. method as claimed in claim 9, it is characterised in that obtain score value corresponding with the network object and describe data Step, including:
The score value corresponding with the network object that acquisition comes from heterogeneous networks address describes data, and according to predetermined pin The weight set to heterogeneous networks address, describes data by the score value for coming from heterogeneous networks address and is weighted summation, as Final score value describes data.
11. the method as described in claim 1, it is characterised in that it is described description data include label data, comment data, and/ Or key data.
12. the method as described in claim 1, it is characterised in that further comprise:
From described with choosing at least part characteristic dimension in the corresponding each characteristic dimension of the network object;
According to quantized value corresponding with the characteristic dimension of the selection is obtained, the quantized value is opened up in a manner of multi-dimensional map Show.
13. the method as described in claim 1, it is characterised in that further comprise:
According to the quantized value of each characteristic dimension of the obtained network object, therefrom choose at least part characteristic dimension and opened up Show.
14. a kind of device that multiple features dimension quantization is carried out to network object, including:
Module is obtained, for determining a network object, obtains and describes data with the corresponding feature of the network object;
Matching module, for describing the correspondence between data and different characteristic dimension according to the different characteristic pre-established, The corresponding feature of the described network object is described data characteristic dimension corresponding with the described network object to be matched, and root Each characteristic dimension weight corresponding with the network object is obtained according to matching result;
Quantization modules, for according to each characteristic dimension weight, obtaining the quantized value of each characteristic dimension of the network object.
15. device as claimed in claim 14, it is characterised in that the acquisition module includes:
Determining module, for the type according to the network object, predefines one or more web search address;
Acquisition module, data are described for being obtained by the web search address with the corresponding feature of the network object.
16. device as claimed in claim 14, it is characterised in that the acquisition module includes:
First registration matching module, registration matching is carried out for the feature for coming from heterogeneous networks address to be described data;
First generation module, for description data of the registration less than threshold value to be deleted, generates final feature descriptor data set Close.
17. device as claimed in claim 14, it is characterised in that the acquisition module includes:
Second registration matching module, registration matching is carried out for the feature for coming from heterogeneous networks address to be described data;
Weight distribution module, corresponding weight, registration are distributed for describing data according to the height of registration for the feature More high then weight is higher;
Second generation module, is deleted for the feature repeated to be described data, and the feature descriptor data set for generating Weight closes.
18. device as claimed in claim 17, it is characterised in that the acquisition module further comprises:
Weight adjust module, for according to it is predetermined for heterogeneous networks address set weight, further to coming from The weight that the feature of heterogeneous networks address describes data is adjusted.
19. device as claimed in claim 14, it is characterised in that the characteristic dimension is the type according to the network object Definite.
20. the device as described in claim 14 or 16, it is characterised in that the matching module includes:
Whole features for according to the correspondence, are described data and are carried out respectively with characteristic dimension by the first matching module Match somebody with somebody, repeated matching is then carried out if the feature repeated describes data;
First ratio-dependent module, for determining the ratio of the hit-count to each characteristic dimension according to matching result;
First weight obtains module, for the ratio according to the hit-count, obtains each feature corresponding with the network object Dimension weight.
21. device as claimed in claim 17, it is characterised in that the matching module includes:
Second matching module, for according to the correspondence, different features being described data and is carried out respectively with characteristic dimension Matching;
Second ratio-dependent module, for determining the ratio of the hit-count to each characteristic dimension according to matching result;
Ratio adjust module, for describing the weight of data according to different characteristic, by the hit-count ratio of characteristic dimension into Row adjustment;
Second weight obtains module, for the ratio according to the hit-count after adjustment, obtains corresponding with the network object each Characteristic dimension weight.
22. device as claimed in claim 14, it is characterised in that the quantization modules, including:
Score value acquisition module, data are described for obtaining score value corresponding with the network object;
Quantification treatment module, for data and each characteristic dimension weight point will to be described with the corresponding score value of the network object It is not multiplied, obtains the quantized value of each characteristic dimension of the network object.
23. device as claimed in claim 22, it is characterised in that the score value acquisition module, is further used for:
The score value corresponding with the network object that acquisition comes from heterogeneous networks address describes data, and according to predetermined pin The weight set to heterogeneous networks address, describes data by the score value for coming from heterogeneous networks address and is weighted summation, as Final score value describes data.
24. device as claimed in claim 14, it is characterised in that the description data include label data, comment data, And/or key data.
25. device as claimed in claim 14, it is characterised in that further comprise:
Module is chosen, for choosing at least part characteristic dimension from the described and corresponding each characteristic dimension of the network object;
First display module, the quantized value corresponding with the characteristic dimension of the selection obtained for basis, in a manner of multi-dimensional map The quantized value is shown.
26. device as claimed in claim 14, it is characterised in that further comprise:
Second display module, for the quantized value of each characteristic dimension according to the obtained network object, therefrom chooses at least portion Point characteristic dimension is shown.
CN201410564598.7A 2014-10-21 2014-10-21 The method and device of multiple features dimension quantization is carried out to network object Active CN104331429B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410564598.7A CN104331429B (en) 2014-10-21 2014-10-21 The method and device of multiple features dimension quantization is carried out to network object

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410564598.7A CN104331429B (en) 2014-10-21 2014-10-21 The method and device of multiple features dimension quantization is carried out to network object

Publications (2)

Publication Number Publication Date
CN104331429A CN104331429A (en) 2015-02-04
CN104331429B true CN104331429B (en) 2018-04-27

Family

ID=52406156

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410564598.7A Active CN104331429B (en) 2014-10-21 2014-10-21 The method and device of multiple features dimension quantization is carried out to network object

Country Status (1)

Country Link
CN (1) CN104331429B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106033457B (en) * 2015-03-18 2019-10-18 阿里巴巴集团控股有限公司 A kind of method and apparatus of the attribute information of determining fruit objective attribute target attribute
CN106897208A (en) * 2015-12-18 2017-06-27 阿里巴巴集团控股有限公司 A kind of quantization method and device of application software Consumer's Experience
CN107749006A (en) * 2017-11-01 2018-03-02 广州爱九游信息技术有限公司 Game appraisal procedure, device and equipment
CN111177515B (en) * 2019-12-25 2023-07-21 中国平安财产保险股份有限公司 Product label matching method, device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101211339A (en) * 2006-12-29 2008-07-02 上海芯盛电子科技有限公司 Intelligent web page classifier based on user behaviors
WO2010044780A1 (en) * 2008-10-14 2010-04-22 Hewlett-Packard Development Company, L.P. Dynamic content sorting using tags
CN102902790A (en) * 2012-09-29 2013-01-30 北京奇虎科技有限公司 Web page classification system and method
CN102955813A (en) * 2011-08-29 2013-03-06 中国移动通信集团四川有限公司 Information searching method and information searching system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101551806B (en) * 2008-04-03 2012-04-18 北京搜狗科技发展有限公司 Personalized website navigation method and system
CN103886090B (en) * 2014-03-31 2018-01-02 北京搜狗科技发展有限公司 Content recommendation method and device based on user preferences

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101211339A (en) * 2006-12-29 2008-07-02 上海芯盛电子科技有限公司 Intelligent web page classifier based on user behaviors
WO2010044780A1 (en) * 2008-10-14 2010-04-22 Hewlett-Packard Development Company, L.P. Dynamic content sorting using tags
CN102955813A (en) * 2011-08-29 2013-03-06 中国移动通信集团四川有限公司 Information searching method and information searching system
CN102902790A (en) * 2012-09-29 2013-01-30 北京奇虎科技有限公司 Web page classification system and method

Also Published As

Publication number Publication date
CN104331429A (en) 2015-02-04

Similar Documents

Publication Publication Date Title
CN110020094A (en) A kind of methods of exhibiting and relevant apparatus of search result
CN102306171B (en) A kind of for providing network to access suggestion and the method and apparatus of web search suggestion
CN107423355A (en) Information recommendation method and device, electronic equipment
CN102968413B (en) A kind of method and apparatus for being used to provide search result
CN107229666B (en) A kind of interest heuristic approach and device based on recommender system
CN104331429B (en) The method and device of multiple features dimension quantization is carried out to network object
CN106688215A (en) Automated click type selection for content performance optimization
CN102354315A (en) Generation method of site navigation page and device thereof
CN102831117B (en) Select font, the determination of font, recommendation, generation method and equipment thereof
CN105874449A (en) Systems and methods for extracting and generating images for display content
CN106484766A (en) Searching method based on artificial intelligence and device
CN110267097A (en) Video pushing method, device and electronic equipment based on characteristic of division
CN103514282A (en) Method and device for displaying search results of videos
CN105224657B (en) A kind of information recommendation method and electronic equipment based on search engine
CN110110201A (en) A kind of content recommendation method and system
CN103473282B (en) A kind of apparatus and method generating the Hot Contents page
CN106354855A (en) Recommendation method and system
CN104699844B (en) The method and device of video tab is determined for advertisement
CN104199865B (en) Searching method, client and the system of the customization result of content providers are provided
CN102073670A (en) Method, equipment and system for debugging online webpage template
CN110278447A (en) Video pushing method, device and electronic equipment based on continuous feature
CN105139233A (en) Advertisement putting method, device, and system
CN109075987A (en) Optimize digital assembly analysis system
CN110688578A (en) Screen locking wallpaper recommendation method and device and electronic equipment
CN109509051A (en) A kind of item recommendation method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220803

Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.

TR01 Transfer of patent right