CN104462448B - A kind of packet name classification method and device - Google Patents

A kind of packet name classification method and device Download PDF

Info

Publication number
CN104462448B
CN104462448B CN201410779559.9A CN201410779559A CN104462448B CN 104462448 B CN104462448 B CN 104462448B CN 201410779559 A CN201410779559 A CN 201410779559A CN 104462448 B CN104462448 B CN 104462448B
Authority
CN
China
Prior art keywords
packet name
sorted
packet
name
grouping
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410779559.9A
Other languages
Chinese (zh)
Other versions
CN104462448A (en
Inventor
李金奎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Weimeng Chuangke Network Technology China Co Ltd
Original Assignee
Weimeng Chuangke Network Technology China Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Weimeng Chuangke Network Technology China Co Ltd filed Critical Weimeng Chuangke Network Technology China Co Ltd
Priority to CN201410779559.9A priority Critical patent/CN104462448B/en
Publication of CN104462448A publication Critical patent/CN104462448A/en
Application granted granted Critical
Publication of CN104462448B publication Critical patent/CN104462448B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Abstract

This application discloses a kind of packet name classification methods, may cause information inaccuracy recommended to the user according to packet mode in the prior art for solving the problems, such as.It specifically includes that and obtains packet name to be sorted;According to the feature for the perpetual object that the grouping that packet name to be sorted indicates is included, the characteristic value of packet name to be sorted is determined;According to the characteristic value of packet name to be sorted, treats classified packets name and classify.Also disclose a kind of packet name sorter.

Description

A kind of packet name classification method and device
Technical field
This application involves field of computer technology more particularly to a kind of packet name classification method and devices.
Background technique
Packet name generally refers to: user in social networks according to oneself and perpetual object relationship, or it is right according to oneself Perpetual object generates the reason of interest, after perpetual object is divided into different grouping, to be grouped the title taken.Since packet name is past Toward the hobby and social networks that can embody individual subscriber, therefore packet name can be generally divided into two classifications, respectively " relationship classification " and " category of interest ", the packet name in the two classifications are respectively grouping related with the social networks of user Name, and packet name related with the hobby of user.
In the prior art, in order to user recommend related with the hobby of user information or with the social networks of user Related information first has to determine the packet name generic that user is arranged on social networks, Jin Ergen by packet name semanteme According to the category, recommend relevant information to user.Which has the disadvantage that for example for the grouping for being grouped entitled " colleague " For, if the perpetual object in the grouping is all video display star, if that only with semanteme parse, may by " colleague " this Packet name is directly divided into relationship classification, so as to cause according to category information inaccuracy recommended to the user.
Summary of the invention
The embodiment of the present application provides a kind of packet name classification method, to solve can according to packet mode in the prior art It can lead to the problem of information inaccuracy recommended to the user.
The embodiment of the present application also provides a kind of packet name sorter, to solve according to packet mode in the prior art It may cause the problem of information inaccuracy recommended to the user.
The embodiment of the present application adopts the following technical solutions:
A kind of packet name classification method, specifically includes that
Obtain packet name to be sorted;
According to the feature for the perpetual object that the grouping that the packet name to be sorted indicates is included, described to be sorted point is determined The characteristic value of group name;
According to the characteristic value of the packet name to be sorted, classify to the packet name to be sorted.
A kind of packet name sorter, comprising:
Acquiring unit, for obtaining packet name to be sorted;
Determination unit, the feature for the perpetual object that the grouping for being indicated according to the packet name to be sorted is included, really The characteristic value of the fixed packet name to be sorted;
Taxon classifies to the packet name to be sorted for the characteristic value according to the packet name to be sorted.
The embodiment of the present application use at least one above-mentioned technical solution can reach it is following the utility model has the advantages that
By the feature for the perpetual object that the grouping indicated thus according to packet name to be sorted is included, grouping to be sorted is determined The characteristic value of name, and this feature value treats classified packets name and classifies, so that the pass that classification results and grouping are included The feature of note object matches, and solves and determines packet name generic to be sorted with semanteme in the prior art, will lead to The problem for the information inaccuracy that family is recommended.
Detailed description of the invention
The drawings described herein are used to provide a further understanding of the present application, constitutes part of this application, this Shen Illustrative embodiments and their description please are not constituted an undue limitation on the present application for explaining the application.In the accompanying drawings:
Fig. 1 is a kind of flow chart of packet name classification method provided by the embodiments of the present application;
Fig. 2 treats the method flow that classified packets name is classified using decision-tree model to be provided by the embodiments of the present application Figure;
Fig. 3 is a kind of structural block diagram of packet name sorter provided by the embodiments of the present application.
Specific embodiment
To keep the purposes, technical schemes and advantages of the application clearer, below in conjunction with the application specific embodiment and Technical scheme is clearly and completely described in corresponding attached drawing.Obviously, described embodiment is only the application one Section Example, instead of all the embodiments.Based on the embodiment in the application, those of ordinary skill in the art are not doing Every other embodiment obtained under the premise of creative work out, shall fall in the protection scope of this application.
Below in conjunction with attached drawing, the technical scheme provided by various embodiments of the present application will be described in detail.
Embodiment 1
In order to solve the problems, such as to may cause information inaccuracy recommended to the user according to packet mode in the prior art, The application proposes a kind of packet name classification method, and the implementation flow chart of this method is as shown in Figure 1, mainly include the following steps:
Step 11 obtains packet name to be sorted;
The feature of step 12, the perpetual object for being included according to the grouping that packet name to be sorted indicates, determines to be sorted point The characteristic value of group name;
Step 13, according to the characteristic value of packet name to be sorted, treat classified packets name and classify.
Using the above method provided by the embodiments of the present application, by thus according to the grouping for getting packet name expression to be sorted The feature for the perpetual object for being included determines the characteristic value of packet name to be sorted;And then according to the characteristic value pair of the packet name Packet name to be sorted is classified, so that the feature for the perpetual object that classification results are included with grouping matches, is solved Packet name generic to be sorted is determined with semanteme in the prior art, will lead to asking for information inaccuracy recommended to the user Topic.
Optional embodiments some to the embodiment of the present application are described in detail below.
In one embodiment, in order to avoid before step 12, can will acquire by the interference of character lack of standardization Packet name to be sorted is pre-processed.
Specifically, the frequency of available packet name to be sorted being used by a user;Then, extracting is made by user It is greater than the packet name to be sorted of the default frequency with the frequency;Finally, the packet name to be sorted of extraction is carried out format normalization.Its In, format normalization, can be by the complex form of Chinese characters be converted into simplified Chinese character, removal punctuation mark and/or upper case character be converted to small letter Character etc..
In one embodiment, step 12 can be realized by following step A1- step A2:
Step A1, determine that at least one for having packet name to be sorted is grouped corresponding correlation degree value as perpetual object Feature;
Step A2, according to correlation degree value, the characteristic value of packet name to be sorted is determined.
Wherein, each is grouped the perpetual object and this point is arranged that corresponding correlation degree value indicates that current group is included Correlation degree between the user of group.
In the embodiment of the present application, step A2 can be, but not limited to following two implementation:
The first embodiment, step A2 can be illustrated by mathematic(al) representation shown in formula [1]:
Wherein, AVGfIndicate the characteristic value of packet name to be sorted;N indicates the sum that the user of the packet name to be sorted is arranged Amount;I indicates Customs Assigned Number, and i ∈ [1, N];CountiIt indicates to number and has this set by the user for being i and the user wait divide The correlation degree value for the perpetual object that the grouping of class packet name includes.
It further illustrates for formula [1]: assuming that grouping entitled " junior middle school good friend " to be sorted, is arranged the user of the packet name There are U1 and U2, wherein 100 people are had by the U1 perpetual object being divided into " junior middle school good friend " grouping, there are 50 people also same in this 100 people When U1 is paid close attention to;U2, which is grouped included perpetual object " junior middle school good friend ", 30 people, has 20 people also same in this 30 people When U2 is paid close attention to.It can then be determined by above-mentioned formula [1]:
N=2, Count1=50, Count2=20;And then the characteristic value of packet name to be sorted can be obtained by formula [1] AVGf=35.
Second of embodiment: step A2 may include: by corresponding according to the correlation degree value for being greater than first threshold Have the first grouping number of packet name to be sorted, and corresponding less than the correlation degree value of second threshold has to be sorted point The second packet number of group name, determines the characteristic value of packet name to be sorted.Wherein, first threshold is greater than second threshold.
Specifically, the embodiment can shown in formula [2] mathematic(al) representation illustrate:
Wherein, Sub indicates the characteristic value of packet name to be sorted;High indicates that the correlation degree value for being greater than first threshold is corresponding The first grouping number for having packet name to be sorted;Low represent less than second threshold correlation degree value it is corresponding have to The second packet number of classified packets name.
For formula [2], further following citings are assumed:
Grouping entitled " six classes of the junior three " to be sorted, " machine learning " and " video display artist ";
Have the total number packets mesh of group name to be sorted as shown in following table one:
Table one:
Packet name to be sorted The total number packets mesh for having packet name to be sorted
Six classes of the junior three 110
Machine learning 837
Video display artist 204
Further, it is assumed that correlation degree value is mutual powder rate;In addition, having " six classes of the junior three " this packet name to be sorted In 110 groupings, mutual powder rate distribution is as shown in Table 2:
Table two:
Similarly, grouping entitled " machine learning " to be sorted as mutual powder rate distributional class shown in also available and table two The mutual powder rate of " video display artist " is distributed, and the application does not repeat one by one herein.
Further, if assuming, first threshold is 60%, second threshold 30%, then by data in the right column of table two Statistics, available: packet name " six classes of the junior three " to be sorted corresponding High=99, Low=0, and then can be according to formula [2], Sub=100% is calculated.Similarly, other available packet names " machine learning " to be sorted and " video display artist " Sub value.
Based on the Sub value determined, a kind of embodiment of step 13 may include:
According to Sub value, and the Sub threshold value of setting, packet name generic to be sorted is determined.
In one embodiment, which can be arranged according to sample packet name gathered in advance.For example assume: To be grouped entitled " six classes of the junior three ", " friend relative ", " siblings " and " machine learning ", " video display are related ", " video display artist " For the sample packet set constituted, if " six classes of the junior three " in the sample packet set, " friend relative ", " siblings " Relation belonging to classification, and obtain " six classes of the junior three ", " friend relative ", " siblings " total number packets in grouping set Accounting is 39.67%;" machine learning ", " video display are related " in the sample packet set, " video display artist " belong to category of interest, And it obtains the accounting of " machine learning ", " video display correlation ", the total number packets of " video display artist " in grouping set and is 60.33%;So, it by the training of the sample packet name set, can determine when the Sub threshold value=20%, Neng Gouda To best classifying quality.
Above-mentioned hypothesis content is specifically see following table three:
Table three:
Based on above-mentioned table three, after determining the Sub value of packet name to be sorted, if Sub > 20%, it is determined that be sorted It is grouped entitled relation group, if Sub≤20%, it is determined that the entitled interest grouping of grouping to be sorted.
In one embodiment, step 12 can be realized by following step B1- step B3:
Step B1, the quantity of the user using packet name to be sorted is determined;
Step B2, feature of the quantity as packet name to be sorted that the perpetual object of the grouping is divided by user is obtained;
Step B3, according to the quantity of the quantity of user and perpetual object, the characteristic value of packet name to be sorted is determined.
In the embodiment of the present application, step B3 can be, but not limited to following two implementation:
The first implementation: step B3 can be illustrated by mathematic(al) representation shown in formula [3]:
Wherein, AVGuIndicate the characteristic value of packet name to be sorted;N indicates the sum that the user of the packet name to be sorted is arranged Amount;J indicates Customs Assigned Number, and j ∈ [1, N];CountjIndicate that the user for being numbered as j is divided into represented by packet name to be sorted Grouping in perpetual object quantity.
It further illustrates for formula [3]: assuming that grouping entitled " famous person star " to be sorted, is arranged the user of the packet name There are U1, U2, U3,100 people are had by the U1 perpetual object being divided into " famous person star " grouping, are divided into " famous person star " by U2 point Perpetual object in group has 50 and has 30 by the U3 perpetual object being divided into " famous person star " grouping.Then by above-mentioned formula [3] It can determine:
N=3, Count1=100, Count2=50, Count3=30, and then grouping to be sorted can be obtained by formula [3] The characteristic value AVG of nameu=60.
Second of embodiment: step B3 may include: by obtain by user be divided into the grouping, have specific mark The quantity of the perpetual object of knowledge is as grouping feature to be sorted.
Specifically, can be obtained if user U1 and U2 are the users with specific identifier:
N=2, Count1=100, Count2=50, and then by formula [3] available AVGu=50.
Based on the AVG determineduValue, a kind of embodiment of step 13 may include:
According to AVGuValue, and the AVG of settinguThreshold value determines packet name generic to be sorted.
In one embodiment, which can be arranged according to sample packet name gathered in advanceuThreshold value.Such as it is false If: to be grouped entitled " information muffler ", " shop automobile 4s ", " law court colleague ", " which father goes ", " investment in gold " and " university For the sample packet set that classmate " is constituted, if " information muffler " in the sample packet set, " shop automobile 4s ", " law court is same Row " belong to category of interest, and obtain " information muffler ", " shop automobile 4s ", " law court colleague " total number packets in grouping set Middle accounting is 38.25%, " which father goes ", " investment in gold " and " university classmate " relation belonging to class in the sample packet set Not, it and obtains accounting of the total number packets of " which father goes ", " investment in gold " and " university classmate " in grouping set and is 61.75%, then, by the training of the sample packet name set, it can determine the AVGuThreshold value=1.
Above-mentioned hypothesis content is specifically see following table four:
Table four:
Based on above-mentioned table four, work as AVGuWhen > 1, determine that the entitled category of interest of grouping to be sorted i.e. accounting is 38.25%, when AVGuWhen≤1, determine that i.e. accounting is not 61.75% to the entitled relation object of grouping to be sorted, but since formula [3] is not examined Consider correlation degree between user and perpetual object, so formula [3] is not high to the fine granularity for obtaining packet name to be sorted, because This, can by formula [3] obtained characteristic value in such a way that formula [2] obtained characteristic value is used in combination come to be sorted The characteristic value of packet name is classified.
Described above is the several ways of the characteristic value of determining packet name to be sorted, introduced below a kind of according to determining The method that characteristic value is classified:
Firstly, setting respectively obtains the feature of packet name to be sorted " colleague ", " famous person star " in the way of formula [2] Value 10%, 60%, respectively as the First Eigenvalue of " colleague ", " famous person star ";It is arranged and is obtained respectively in the way of formula [3] To the characteristic value 10,4 of packet name to be sorted, respectively as the Second Eigenvalue of " colleague ", " famous person star ";Setting utilizes formula [1] mode respectively obtains the characteristic value 100,70 of packet name to be sorted, special respectively as the third of " colleague ", " famous person star " Value indicative.
Then, it performs the following operations:
Using decision-tree model as shown in Figure 2, treats classified packets name and classify.Detailed process are as follows:
Whether the First Eigenvalue of judgement " colleague " is greater than 20%;After obtaining the judging result of "No", the second spy is judged Whether value indicative is greater than 20;After the judging result for obtaining " Second Eigenvalue of colleague is not more than 20 ", judge that " colleague " belongs to Relationship classification.
Whether the First Eigenvalue of judgement " famous person star " is greater than 20%;After obtaining the judging result of "Yes", is judged [0,5) whether two characteristic values are in;After the judging result for obtaining " Second Eigenvalue be in [0,5) ", third feature value is judged Whether in [0,90), after the judging result for obtaining " the third feature value of famous person star be in [0,90) ", judge " famous person Star " belongs to category of interest.
It in one embodiment, can grouping determining packet name generic and then to classification is had determined that Name carries out planningization processing.Concrete norm mode can be with are as follows: in such a way that part of speech filters, will determine the packet name of classification It is divided into two parts, respectively planningization packet name and packet name to be modified.
Specifically, due to the packet name in category of interest, usually by relatively common noun, verb, adjective etc. Composition, therefore white list mechanism can be used;And the other packet name of relation object, usually part of speech are inherently very complicated changeable, because We use blacklist mechanism for this.Filtering rule can be as shown in following table three:
After completing the planningization processing of packet name, for each packet name to be modified, it can execute respectively: from planning Change in packet name, determines that the feature of corresponding perpetual object is identical as the feature of the corresponding perpetual object of packet name to be modified Planningization packet name, and then the packet name to be modified is revised as to the planningization packet name determined.
It should be noted that the executing subject of each step of 1 providing method of embodiment may each be same equipment, or Person, this method is also by distinct device as executing subject.For example, the executing subject of step 11 and step 12 can be equipment 1, step Rapid 13 executing subject can be equipment 2;For another example, the executing subject of step 11 can be equipment 1, step 12 and step 13 Executing subject can be equipment 2;Etc..
Embodiment 2
In order to solve the problems, such as to may cause information inaccuracy recommended to the user according to packet mode in the prior art, The application proposes a kind of packet name sorter, the implementation flow chart of this method as shown in figure 3, specifically include that acquiring unit 31, Determination unit 32 and taxon 33, specific as follows:
Acquiring unit 31, for obtaining packet name to be sorted;
Determination unit 32, the feature for the perpetual object that the grouping for being indicated according to packet name to be sorted is included determine The characteristic value of the packet name to be sorted;
Taxon 33 treats classified packets name and classifies for the characteristic value according to packet name to be sorted.
In one embodiment, determination unit 32 are determined at least one point for having packet name to be sorted The corresponding correlation degree value of group is as the feature;Wherein, each, which is grouped corresponding correlation degree value, indicates current group institute Correlation degree between the perpetual object for including and the user that the grouping is set;According to correlation degree value, to be sorted point is determined The characteristic value of group name.
In a kind of embodiment, determination unit 32 can be used for corresponding according to the correlation degree value for being greater than first threshold First grouping number, and second packet number corresponding less than the correlation degree value of second threshold, determine grouping to be sorted The characteristic value of name;Wherein, first threshold is greater than second threshold.
In one embodiment, determination unit 32 can be also used for the number for determining the user using packet name to be sorted Amount;Obtain by user be divided into grouping perpetual object quantity as this feature;According to the quantity of user and perpetual object Quantity determines the characteristic value of packet name to be sorted.
In one embodiment, determination unit 32, can be used for obtaining by user be divided into grouping, have specific mark Feature of the quantity of the perpetual object of knowledge as packet name to be sorted.
Using device provided by above-described embodiment 2, by thus according to the grouping institute for getting packet name expression to be sorted The feature for the perpetual object for including determines the characteristic value of packet name to be sorted;And then it is treated according to the characteristic value of the packet name Classified packets name is classified.So that the feature for the perpetual object that classification results are included with grouping matches, and then solve Packet mode in the prior art of having determined may cause the problem of information inaccuracy recommended to the user.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
In a typical configuration, calculating equipment includes one or more processors (CPU), input/output interface, net Network interface and memory.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/or The forms such as Nonvolatile memory, such as read-only memory (ROM) or flash memory (flash RAM).Memory is computer-readable medium Example.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be by any method Or technology come realize information store.Information can be computer readable instructions, data structure, the module of program or other data. The example of the storage medium of computer includes, but are not limited to phase change memory (PRAM), static random access memory (SRAM), moves State random access memory (DRAM), other kinds of random access memory (RAM), read-only memory (ROM), electric erasable Programmable read only memory (EEPROM), flash memory or other memory techniques, read-only disc read only memory (CD-ROM) (CD-ROM), Digital versatile disc (DVD) or other optical storage, magnetic cassettes, tape magnetic disk storage or other magnetic storage devices Or any other non-transmission medium, can be used for storage can be accessed by a computing device information.As defined in this article, it calculates Machine readable medium does not include temporary computer readable media (transitory media), such as the data-signal and carrier wave of modulation.
It should also be noted that, the terms "include", "comprise" or its any other variant are intended to nonexcludability It include so that the process, method, commodity or the equipment that include a series of elements not only include those elements, but also to wrap Include other elements that are not explicitly listed, or further include for this process, method, commodity or equipment intrinsic want Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including described want There is also other identical elements in the process, method of element, commodity or equipment.
It will be understood by those skilled in the art that embodiments herein can provide as method, system or computer program product. Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the application Form.It is deposited moreover, the application can be used to can be used in the computer that one or more wherein includes computer usable program code The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) Formula.
The above description is only an example of the present application, is not intended to limit this application.For those skilled in the art For, various changes and changes are possible in this application.All any modifications made within the spirit and principles of the present application are equal Replacement, improvement etc., should be included within the scope of the claims of this application.

Claims (10)

1. a kind of packet name classification method characterized by comprising
Obtain packet name to be sorted;
Acquired packet name to be sorted is pre-processed;
According to the feature for the perpetual object that the grouping that the packet name to be sorted indicates is included, the packet name to be sorted is determined Characteristic value;
According to the characteristic value of the packet name to be sorted, classify to the packet name to be sorted;
To the packet name progress standardization processing for having determined that classification, the standardization processing includes: the side filtered by part of speech The packet name for determining classification is divided into two parts by formula, respectively standardization packet name and packet name to be modified;
The packet name to be modified is executed: from the standardization packet name, determine corresponding perpetual object feature and The identical standardization packet name of feature of the corresponding perpetual object of the packet name to be modified, and then by point to be modified Group name is revised as the standardization packet name determined.
2. the method as described in claim 1, which is characterized in that included according to the grouping that the packet name to be sorted indicates The feature of perpetual object determines the characteristic value of the packet name to be sorted, comprising:
Determine that at least one for having the packet name to be sorted is grouped corresponding correlation degree value as the feature;Wherein, Each is grouped corresponding correlation degree value and indicates between the current group perpetual object for being included and the user that the grouping is arranged Correlation degree;
According to the correlation degree value, the characteristic value of packet name to be sorted is determined.
3. method according to claim 2, which is characterized in that according to the correlation degree value, determine packet name to be sorted Characteristic value, specifically include:
According to corresponding first grouping number of correlation degree value for being greater than first threshold, and the correlation degree less than second threshold It is worth corresponding second packet number, determines the characteristic value of packet name to be sorted;
Wherein, first threshold is greater than second threshold.
4. the method as described in claim 1, which is characterized in that included according to the grouping that the packet name to be sorted indicates The feature of perpetual object determines the characteristic value of the packet name to be sorted, comprising:
Determine the quantity of the user using the packet name to be sorted;
Obtain by the user be divided into the grouping perpetual object quantity as the feature;
According to the quantity of the quantity of the user and the perpetual object, the characteristic value of packet name to be sorted is determined.
5. method as claimed in claim 4, which is characterized in that obtain the perpetual object for being divided into the grouping by the user Quantity as the feature, comprising:
The quantity of perpetual object that the grouping is divided by the user, having specific identifier is obtained as the feature.
6. a kind of packet name sorter characterized by comprising
Acquiring unit, for obtaining packet name to be sorted;
Determination unit, the feature for the perpetual object that the grouping for being indicated according to the packet name to be sorted is included, determines institute State the characteristic value of packet name to be sorted;
Taxon classifies to the packet name to be sorted for the characteristic value according to the packet name to be sorted;
Wherein, the determination unit is also used to the concern for being included in the grouping indicated according to the packet name to be sorted The feature of object before the characteristic value for determining the packet name to be sorted, pre-processes acquired packet name to be sorted;
The taxon is also used to the packet name progress standardization processing for having determined that classification, the standardization processing packet Include: part of speech filter by way of, the packet name for determining classification is divided into two parts, respectively standardization packet name and Packet name to be modified, and the packet name to be modified is executed: from the standardization packet name, determine corresponding pass The identical standardization packet name of feature of the feature perpetual object corresponding with the packet name to be modified of object is infused, and then will The packet name to be modified is revised as the standardization packet name determined.
7. device as claimed in claim 6, which is characterized in that the determination unit is specifically used for:
Determine that at least one for having the packet name to be sorted is grouped corresponding correlation degree value as the feature;Wherein, Each is grouped corresponding correlation degree value and indicates between the current group perpetual object for being included and the user that the grouping is arranged Correlation degree;
According to the correlation degree value, the characteristic value of packet name to be sorted is determined.
8. device as claimed in claim 7, which is characterized in that the determination unit is specifically used for:
According to corresponding first grouping number of correlation degree value for being greater than first threshold, and the correlation degree less than second threshold It is worth corresponding second packet number, determines the characteristic value of packet name to be sorted;
Wherein, first threshold is greater than second threshold.
9. device as claimed in claim 6, which is characterized in that the determination unit is specifically used for:
Determine the quantity of the user using the packet name to be sorted;
Obtain by the user be divided into the grouping perpetual object quantity as the feature;
According to the quantity of the quantity of the user and the perpetual object, the characteristic value of packet name to be sorted is determined.
10. device as claimed in claim 9, which is characterized in that the determination unit is specifically used for:
The quantity of perpetual object that the grouping is divided by the user, having specific identifier is obtained as the feature.
CN201410779559.9A 2014-12-15 2014-12-15 A kind of packet name classification method and device Active CN104462448B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410779559.9A CN104462448B (en) 2014-12-15 2014-12-15 A kind of packet name classification method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410779559.9A CN104462448B (en) 2014-12-15 2014-12-15 A kind of packet name classification method and device

Publications (2)

Publication Number Publication Date
CN104462448A CN104462448A (en) 2015-03-25
CN104462448B true CN104462448B (en) 2019-03-12

Family

ID=52908483

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410779559.9A Active CN104462448B (en) 2014-12-15 2014-12-15 A kind of packet name classification method and device

Country Status (1)

Country Link
CN (1) CN104462448B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106354719B (en) * 2015-07-13 2019-09-13 阿里巴巴集团控股有限公司 POI service providing method, POI data processing method and processing device
CN107256231B (en) * 2017-05-04 2022-04-22 腾讯科技(深圳)有限公司 Team member identification device, method and system
JP2019159553A (en) * 2018-03-09 2019-09-19 株式会社東芝 Information processing apparatus, information processing method, and program

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101090346A (en) * 2006-06-16 2007-12-19 腾讯科技(深圳)有限公司 Group management method based on immediate communication and immediate communication customer end
CN102968430A (en) * 2011-09-01 2013-03-13 三星电子株式会社 Method and apparatus for automatically generating and managing groups in address book
CN103365947A (en) * 2012-03-30 2013-10-23 卡西欧计算机株式会社 Social network service system and image display method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101090346A (en) * 2006-06-16 2007-12-19 腾讯科技(深圳)有限公司 Group management method based on immediate communication and immediate communication customer end
CN102968430A (en) * 2011-09-01 2013-03-13 三星电子株式会社 Method and apparatus for automatically generating and managing groups in address book
CN103365947A (en) * 2012-03-30 2013-10-23 卡西欧计算机株式会社 Social network service system and image display method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《可怕的圈子》;何菲;《IT经理世界》;20120420(第338期);30-32

Also Published As

Publication number Publication date
CN104462448A (en) 2015-03-25

Similar Documents

Publication Publication Date Title
CN105183731B (en) Recommendation information generation method, device and system
CN104035968B (en) The construction method and device of training corpus collection based on social networks
CN105095179B (en) The method and device that user's evaluation is handled
CN106951571A (en) A kind of method and apparatus for giving application mark label
CN108833139B (en) OSSEC alarm data aggregation method based on category attribute division
CN106803039B (en) A kind of homologous determination method and device of malicious file
CN110287313A (en) A kind of the determination method and server of risk subject
CN110134792A (en) Text recognition method, device, electronic equipment and storage medium
CN104462448B (en) A kind of packet name classification method and device
CN106598999A (en) Method and device for calculating text theme membership degree
CN106960040A (en) A kind of URL classification determines method and device
CN107679069A (en) Method is found based on a kind of special group of news data and related commentary information
CN104536663B (en) A kind of page turning method
CN108304509A (en) A kind of comment spam filter method for indicating mutually to learn based on the multidirectional amount of text
CN106897282B (en) User group classification method and device
CN104102662B (en) A kind of user interest preference similarity determines method and device
CN109918658A (en) A kind of method and system obtaining target vocabulary from text
CN103870541A (en) Social network user interest mining method and system
CN109902157A (en) A kind of training sample validation checking method and device
CN110263817B (en) Risk grade classification method and device based on user account
CN106897267A (en) Article breviary text meaning generation method, device and server
CN110472742A (en) A kind of model variable determines method, device and equipment
CN106844748A (en) Text Clustering Method, device and electronic equipment
CN108804676A (en) A kind of model sort method, device, equipment and computer readable storage medium
CN108076032B (en) Abnormal behavior user identification method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant