CN109144831A - A kind of acquisition methods and device of APP recognition rule - Google Patents

A kind of acquisition methods and device of APP recognition rule Download PDF

Info

Publication number
CN109144831A
CN109144831A CN201710453676.XA CN201710453676A CN109144831A CN 109144831 A CN109144831 A CN 109144831A CN 201710453676 A CN201710453676 A CN 201710453676A CN 109144831 A CN109144831 A CN 109144831A
Authority
CN
China
Prior art keywords
app
participle unit
unit
field
participle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710453676.XA
Other languages
Chinese (zh)
Other versions
CN109144831B (en
Inventor
储晶星
邓圆
傅平
傅一平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile Group Zhejiang Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile Group Zhejiang Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile Group Zhejiang Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN201710453676.XA priority Critical patent/CN109144831B/en
Publication of CN109144831A publication Critical patent/CN109144831A/en
Application granted granted Critical
Publication of CN109144831B publication Critical patent/CN109144831B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3476Data logging
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3604Software analysis for verifying properties of programs
    • G06F11/3612Software analysis for verifying properties of programs by runtime analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/865Monitoring of software

Abstract

The embodiment of the present invention provides the acquisition methods and device of a kind of APP recognition rule.The described method includes: obtain multiple APP to be obtained runs the feature field for including in the access log of generation within a preset period of time, the feature field includes url field and UA field;The participle unit for including in the feature field is obtained, and calculates each described participle unit corresponding feature score value in each APP to be obtained;The corresponding recognition rule of each APP to be obtained is generated according to the feature score value.Described device is for executing the above method.Method and device provided by the invention improves the acquisition efficiency of APP recognition rule.

Description

A kind of acquisition methods and device of APP recognition rule
Technical field
The present embodiments relate to Internet technical field more particularly to a kind of acquisition methods and dress of APP recognition rule It sets.
Background technique
With the rapid development of mobile Internet, people increasingly like installing types of applications program in mobile phone (Application, APP), APP become the important entrance of behavior on line, carry out signature analysis and stream to all kinds of APP in network-side There is important value therefore the research of the acquisition methods of APP recognition rule is had been to be concerned by more and more people for amount identification.
Under the conditions of the prior art, the method for obtaining APP recognition rule mainly has decompiling method and artificial packet capturing method.Wherein, Decompiling method is to carry out decompiling to APP installation kit, obtains the static natures such as Apply Names, the version number in installation kit as knowledge Not rule;Artificial packet capturing method is manually installed and runs APP to be obtained, passes through APP to be obtained described in packet capturing software grabs The uniform resource locator (Uniform Resource Locator, URL) and user agent (User generated in use process Agent, UA) information, then the characteristic feature summed up by way of manual analysis in the APP use process to be obtained makees For recognition rule.But what decompiling method can obtain is often the static natures such as Apply Names, version number, cannot be obtained described Behavioral characteristics in APP use process to be obtained can only parse the apparent APP of feature of a small amount of installation kit;And artificial packet capturing Method needs artificial downloading and installation, artificial packet capturing analysis, identifies that a APP needs 10~20 minutes, height relies on solution The experience of analysis personnel is easy to cause the misjudgement of part recognition rule or fails to judge;In addition, decompiling method and artificial packet capturing method all can only Recognition rule is obtained for single APP, the above problem leverages the efficiency for obtaining APP recognition rule.
Therefore, the problem of how proposing a kind of acquisition efficiency of method to improve APP recognition rule be current industry urgently The important topic of solution.
Summary of the invention
For the defects in the prior art, the embodiment of the present invention provides the acquisition methods and device of a kind of APP recognition rule.
On the one hand, the embodiment of the present invention provides a kind of acquisition methods of APP recognition rule, comprising:
It obtains multiple APP to be obtained and runs the feature field for including in the access log of generation, institute within a preset period of time Stating feature field includes url field and UA field;
Obtain the participle unit for including in the feature field, and calculate each described participle unit it is each it is described to Obtain corresponding feature score value in APP;
The corresponding recognition rule of each APP to be obtained is generated according to the feature score value.
On the other hand, the embodiment of the present invention provides a kind of acquisition device of APP recognition rule, comprising:
Acquiring unit, runs in the access log of generation within a preset period of time for obtaining multiple APP to be obtained and includes Feature field, the feature field includes url field and UA field;
Computing unit, for obtaining the participle unit for including in the feature field, and it is single to calculate each described participle Member corresponding feature score value in each APP to be obtained;
Processing unit, for generating the corresponding recognition rule of each APP to be obtained according to the feature score value.
Another aspect, the embodiment of the present invention provide a kind of electronic equipment, including processor, memory and bus, in which:
The processor, the memory complete mutual communication by bus;
The processor can call the computer program in memory, the step of to execute the above method.
In another aspect, the embodiment of the present invention provides a kind of computer readable storage medium, it is stored thereon with computer program, The step of above method is realized when the program is executed by processor.
The acquisition methods and device of APP recognition rule provided in an embodiment of the present invention are existed by obtaining multiple APP to be obtained The feature field for including in the access log that operation generates in preset time period, and obtain the participle list for including in feature field Member calculates each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each The corresponding recognition rule of APP to be obtained, improves the acquisition efficiency of APP recognition rule.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is this hair Bright some embodiments for those of ordinary skill in the art without creative efforts, can be with root Other attached drawings are obtained according to these attached drawings.
Fig. 1 is the flow diagram of the acquisition methods of APP recognition rule provided in an embodiment of the present invention;
Fig. 2 is the overall flow schematic diagram of the acquisition methods of APP recognition rule provided in an embodiment of the present invention;
Fig. 3 is the structural schematic diagram of the acquisition device for the APP recognition rule that one embodiment of the invention provides;
Fig. 4 be another embodiment of the present invention provides APP recognition rule acquisition device structural schematic diagram;
Fig. 5 is electronic equipment entity apparatus structural schematic diagram provided in an embodiment of the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical solution in the embodiment of the present invention is explicitly described, it is clear that described embodiment is the present invention A part of the embodiment, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not having Every other embodiment obtained under the premise of creative work is made, shall fall within the protection scope of the present invention.
Fig. 1 is the flow diagram of the acquisition methods of APP recognition rule provided in an embodiment of the present invention, as shown in Figure 1, this Embodiment provides a kind of acquisition methods of APP recognition rule, comprising:
S101, it multiple APP to be obtained is obtained runs the tagged word for including in the access log of generation within a preset period of time Section, the feature field includes url field and UA field;
Specifically, the acquisition device of APP recognition rule obtains multiple APP to be obtained and runs generation within a preset period of time Access log, and extract from the access log url field and UA field as feature field, the feature field can be with It including other fields, specifically can be adjusted according to the actual situation, be not especially limited herein.
S102, the participle unit for including in the feature field is obtained, and calculates each described participle unit each Corresponding feature score value in the APP to be obtained;
Specifically, described device segments the corresponding url field of each described APP to be obtained and UA field After going stop words etc. to pre-process, the participle unit for including in the feature field is obtained, then, described device counts each institute State the number and each participle unit that participle unit occurs in the corresponding feature field of each APP to be obtained The number of corresponding target APP to be obtained, described device calculate each described participle unit and exist according to the number and number Corresponding feature score value in each APP to be obtained.Wherein, target APP to be obtained is the corresponding feature field In include the participle unit the APP to be obtained.
S103, the corresponding recognition rule of each APP to be obtained is generated according to the feature score value.
Specifically, described device is by the corresponding participle unit of each APP to be obtained, according to the feature score value from height It is ranked up to low, using the participle unit for the forward preset quantity that sorts as the characteristic key words of the APP to be obtained, The recognition rule of each APP to be obtained is generated according to the characteristic key words.
The acquisition methods of APP recognition rule provided in an embodiment of the present invention, by obtaining multiple APP to be obtained when default Between include in operation generates in section access log feature field, and obtain the participle unit for including in feature field, calculate Each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each to be obtained The corresponding recognition rule of APP improves the acquisition efficiency of APP recognition rule.
On the basis of the above embodiments, further, the participle unit for including in the acquisition feature field, and Calculate each described participle unit corresponding feature score value in each APP to be obtained, comprising:
The participle unit for including in the corresponding feature field of each APP to be obtained is obtained respectively;
The number that each participle unit occurs in the corresponding feature field of each APP to be obtained is counted, with And the number of the corresponding target APP to be obtained of each participle unit, the target APP to be obtained are the corresponding feature It include the APP to be obtained of the participle unit in field;
According to the number and number, it is corresponding in each APP to be obtained to calculate each described participle unit Feature score value.
Specifically, described device obtains the participle for including in the corresponding feature field of each APP to be obtained respectively Unit counts the number that each participle unit occurs in the corresponding feature field of each APP to be obtained, and each The number of the corresponding target APP to be obtained of a participle unit, described device is according to each participle unit in each institute State the number occurred in the corresponding feature field of APP to be obtained and the corresponding target APP to be obtained of each participle unit Number calculates each described participle unit corresponding feature score value in each APP to be obtained.Wherein, the target APP to be obtained is the APP to be obtained in the corresponding feature field including the participle unit.For example, table 1 is wait obtain The number for taking APP and corresponding participle unit and each participle unit to occur in each APP to be obtained, as shown in table 1, with For participle unit a, number that participle unit a occurs in APP1 is 2, the participle unit a respectively in APP1, APP2 and Occur in APP4, then the number of the corresponding target APP to be obtained of the participle unit a is 3, is 2 and described according to the number Number 3 calculates the participle unit a corresponding feature score value in APP1, calculates separately participle unit b in the same way and exists Corresponding feature score value and participle unit the c corresponding feature score value in APP1 in APP1.
Table 1
On the basis of the above embodiments, further, described according to the number and number, calculate each described point Word unit corresponding feature score value in each APP to be obtained, comprising:
According to the number that each participle unit occurs in the corresponding feature field of each APP to be obtained, press According to formula:
Calculate each described participle unit corresponding number characteristic value in each APP to be obtained;Wherein, Fi,j J-th of participle unit is in the corresponding number characteristic value of i-th of APP to be obtained, Ni,jIt is j-th of participle unit i-th The number occurred in the corresponding feature field of a APP to be obtained, NiIt is each participle unit described in i-th The total degree occurred in the corresponding feature field of APP to be obtained;
According to the number of the corresponding target APP to be obtained of each participle unit, according to formula:
Calculate the number characteristic value of the corresponding target APP to be obtained of each described participle unit;MjFor the jth The number characteristic value of the corresponding target APP to be obtained of a participle unit, P are the total number of the APP to be obtained, PjFor institute State the number of the corresponding target of j-th of participle unit APP to be obtained;
According to formula:
Δi,j=Fi,j×Mj
Calculate each described participle unit corresponding feature score value in each APP to be obtained;Wherein, Δi,jFor J-th of participle unit corresponding feature score value, F in i-th of APPi,jJ-th of participle unit described in i-th to Obtain the corresponding number characteristic value of APP, MjIt is special for the number of the corresponding target APP to be obtained of j-th of participle unit Value indicative.
Specifically, described device is directed to each described APP to be obtained, counts include in its corresponding feature field each The total degree that the number and each participle unit that a participle unit occurs respectively occur, according to formula:It calculates every One participle unit corresponding number characteristic value in each APP to be obtained;Wherein, Fi,jJ-th of participle unit exists The corresponding number characteristic value of i-th of APP to be obtained, Ni,jIt is j-th of participle unit in i-th of APP to be obtained The number occurred in the corresponding feature field, NiIt is corresponding in i-th of APP to be obtained for each participle unit The total degree occurred in the feature field.
Described device obtains the number of the corresponding target of each participle unit APP to be obtained and described to be obtained The total number of APP, and according to formula:The corresponding target of each described participle unit is calculated to wait obtaining Take the number characteristic value of APP;MjFor the number characteristic value of the corresponding target APP to be obtained of j-th of participle unit, P For the total number of the APP to be obtained, PjFor the number of the corresponding target APP to be obtained of j-th of participle unit.
Described device corresponding number characteristic value in each APP to be obtained according to participle unit described in each, And the number characteristic value of the corresponding target APP to be obtained of each described participle unit, according to formula: Δi,j=Fi,j ×MjCalculate each described participle unit corresponding feature score value in each APP to be obtained;Wherein, Δi,jIt is described J-th of participle unit corresponding feature score value, F in i-th of APPi,jJ-th of participle unit is described to be obtained at i-th The corresponding number characteristic value of APP, MjFor the number characteristic value of the corresponding target APP to be obtained of j-th of participle unit.
For example, to calculate participle unit a in APP1 for corresponding feature score value, APP1 is corresponding with continued reference to table 1 The total degree that participle unit a, participle unit b, the participle unit c for including in feature field occur is 2+1+3=6, participle unit a The number occurred in APP1 is 2, then participle unit a corresponding number characteristic value in APP1 is 2/ (2+1+3)=1/3.Institute The total number for stating APP to be obtained is 5, and wherein in corresponding feature field including the APP to be obtained of the participle unit a Number is 3, that is, the number of the corresponding target APP to be obtained of the participle unit a is 3, then a pairs of the participle unit The number characteristic value of the target APP to be obtained answered is log [5/ (3+1)]=log1.25, then the participle unit a exists Corresponding feature score value is (1/3) × log1.25=0.0291 in APP1.Described device can also calculate often in the same way One participle unit corresponding feature score value in each APP to be obtained, details are not described herein again.
On the basis of the above embodiments, further, described each described to be obtained according to feature score value generation The recognition rule of APP, comprising:
The participle unit that will include in the corresponding feature field of each APP to be obtained, according to the feature score value from It is high to Low to be ranked up, using the participle unit for the forward preset quantity that sorts as the feature critical of the APP to be obtained Word;
The recognition rule of each APP to be obtained is generated according to the characteristic key words.
Specifically, described device is directed to each described APP to be obtained, the participle that will include in its corresponding feature field Unit is ranked up from high to low according to the feature score value, using the participle unit for the forward preset quantity that sorts as The characteristic key words of the APP to be obtained generate the recognition rule of each APP to be obtained according to the characteristic key words. Wherein, the preset quantity can be adjusted according to the actual situation and be arranged, and be not specifically limited herein.
For example, by taking APP1 as an example, calculating separately out the corresponding feature of the APP1 by above-mentioned process with continued reference to table 1 The feature score value for the participle unit a for including in field is 0.0291, the feature score value of participle unit b is 0.0162, participle unit c Feature score value be 0.0485, the participle unit is ranked up according to the feature score value as participle unit c by described device > participle unit a > participle unit b, described device can take sequence in the participle unit c and participle unit a of the first two as institute The characteristic key words of APP1 are stated, and APP1 recognition rule can be generated according to the characteristic key words are as follows: will be in running log Feature field in include participle unit and the matched APP of the participle unit c and participle unit a be identified as it is described APP1.Described device can also generate the recognition rule of other APP to be obtained respectively according to the method described above, and detailed process is herein not It repeats again.
In the above embodiments, the method also includes:
The installation kit of the APP to be obtained is obtained, APP to be obtained described in simultaneously dry run is installed;
Obtain the APP to be obtained access log that dry run generates within a preset period of time;
The feature field is obtained according to the access log, and the feature field is stored.
Specifically, described device can be acquired from application market the APP to be obtained information (such as title, website classification, Download link), the installation kit of the APP to be obtained is downloaded, the APP to be obtained is installed to simulator, and by the simulation The networking IP of device is set as the IP of proxy server, starts the APP to be obtained, then log oracle listener snoop agents server The access log of output stops the receipts APP to be obtained, then collects the APP to be obtained after the preset time period All access logs that generation is run in the preset time period extract url field and UA field from the access log As the feature field, and it is saved to specified storage location, the identification of the APP to be obtained is advised in described device When then being obtained, the feature field can be obtained from the specified storage location.Described device obtains in the same way The feature field of multiple APP to be obtained is taken and stores, detailed process, details are not described herein again.
The acquisition methods of APP recognition rule provided in an embodiment of the present invention, by obtaining multiple APP to be obtained when default Between include in operation generates in section access log feature field, and obtain the participle unit for including in feature field, calculate Each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each to be obtained The corresponding recognition rule of APP improves the acquisition efficiency of APP recognition rule.
Fig. 2 is the overall flow schematic diagram of the acquisition methods of APP recognition rule provided in an embodiment of the present invention, such as Fig. 2 institute Show, the acquisition methods of APP recognition rule provided in an embodiment of the present invention specifically includes the following steps:
The information of S201, acquisition APP to be obtained;Described device can acquire the letter of the APP to be obtained from application market Breath, the information include title, website classification, download link, can also include other information;Then step S202 is executed;
S202, downloading are simultaneously installed the APP to be obtained and are installed to simulator;Described device downloads the APP's to be obtained Installation kit installs the APP to be obtained to simulator, and sets proxy server for the networking IP of the simulator IP;Then step S202 is executed;
APP to be obtained described in S203, dry run;Start the APP to be obtained, then log oracle listener snoop agents take The access log of business device output;Then step is executed
S204, judge whether runing time reaches preset time period;If described device judgement knows that runing time reaches institute Preset time period is stated, then stops the APP to be obtained, then executes step S205, otherwise, return step S203;
S205, access log is obtained;Described device collects the APP to be obtained and runs generation in the preset time period All access logs, then execute step S206;
S206, extraction and the feature field for storing the APP to be obtained;Described device is extracted from the access log Url field and UA field are saved to specified storage location as the feature field, then execute step S207;
S207, dry run the quantity of APP to be obtained whether reach threshold value;It has been simulated if described device judgement is known The quantity of the APP to be obtained of operation reaches threshold value, thens follow the steps S208, otherwise returns to step S201;
S208, the feature field for obtaining multiple APP to be obtained;Described device obtains multiple from the specified storage location Then the feature field of APP to be obtained executes step S209;
S209, the participle unit for including in the corresponding feature field of each APP to be obtained is obtained;Described device pair After the corresponding url field of each described APP to be obtained and UA field are segmented and gone stop words etc. to pre-process, obtain Then the participle unit for including in the feature field executes step S210;
Time that S210, each participle unit of statistics occur in the corresponding feature field of each APP to be obtained Number;Then step S202 is executed;
S210, the number for counting the corresponding target APP to be obtained of each participle unit;Then step S202 is executed;
S211, each described participle unit corresponding feature score value in each APP to be obtained is calculated;The dress Set the number occurred in the corresponding feature field of each APP to be obtained according to each participle unit and each described The number of the corresponding target of participle unit APP to be obtained calculates each described participle unit in each APP to be obtained Then corresponding feature score value executes step S212;
S212, the recognition rule that each APP to be obtained is generated according to the feature score value;Described device is by each institute The corresponding participle unit of APP to be obtained is stated, is ranked up from high to low according to the feature score value, by the forward present count that sorts Characteristic key words of the participle unit of amount as the APP to be obtained generate each described according to the characteristic key words The recognition rule of APP to be obtained
The acquisition methods of APP recognition rule provided in an embodiment of the present invention, by obtaining multiple APP to be obtained when default Between include in operation generates in section access log feature field, and obtain the participle unit for including in feature field, calculate Each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each to be obtained The corresponding recognition rule of APP improves the acquisition efficiency of APP recognition rule.
Fig. 3 is the structural schematic diagram of the acquisition device for the APP recognition rule that one embodiment of the invention provides, as shown in figure 3, The embodiment of the present invention provides a kind of acquisition device of APP recognition rule, including acquiring unit 301, computing unit 302 and processing list Member 303, in which:
Acquiring unit 301, which is used to obtain multiple APP to be obtained and is run in the access log of generation within a preset period of time, wraps The feature field included, the feature field include url field and UA field;Computing unit 302 is for obtaining the feature field In include participle unit, and calculate each described participle unit corresponding feature score value in each APP to be obtained; Processing unit 303 is used to generate the corresponding recognition rule of each APP to be obtained according to the feature score value.
Specifically, acquiring unit 301 obtains the access log that multiple APP to be obtained run generation within a preset period of time, And url field and UA field are extracted from the access log as feature field, the feature field can also include other Field specifically can be adjusted according to the actual situation, and be not especially limited herein.Computing unit 302 described in each to After the corresponding url field of acquisition APP and UA field are segmented and gone stop words etc. to pre-process, the feature field is obtained In include participle unit, then, it is corresponding in each APP to be obtained that computing unit 302 counts each participle unit Feature field in the number of number and the corresponding target APP to be obtained of each participle unit that occurs, computing unit 302 according to the number and number, calculates each described participle unit corresponding feature point in each APP to be obtained Value.Wherein, target APP to be obtained is described to be obtained including the participle unit in the corresponding feature field APP.Processing unit 303 carries out the corresponding participle unit of each APP to be obtained according to the feature score value from high to low Sequence, using the participle unit for the forward preset quantity that sorts as the characteristic key words of the APP to be obtained, according to described Characteristic key words generate the recognition rule of each APP to be obtained.
The acquisition device of APP recognition rule provided in an embodiment of the present invention, by obtaining multiple APP to be obtained when default Between include in operation generates in section access log feature field, and obtain the participle unit for including in feature field, calculate Each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each to be obtained The corresponding recognition rule of APP improves the acquisition efficiency of APP recognition rule.
Fig. 4 be another embodiment of the present invention provides APP recognition rule acquisition device structural schematic diagram, such as Fig. 4 institute Show, the embodiment of the present invention provides a kind of acquisition device of APP recognition rule, including acquiring unit 401, computing unit 402 and place Manage the acquiring unit 401 and processing unit 403 1 in unit 403, acquiring unit 401 and processing unit 403 and above-described embodiment It causes, computing unit 402 includes obtaining subelement 404, statistics subelement 405 and computation subunit 406, in which:
It obtains subelement 404 and is used to obtain point in the corresponding feature field of each APP to be obtained included respectively Word unit;Statistics subelement 405 is for counting each participle unit in the corresponding feature field of each APP to be obtained The number of the number of middle appearance and the corresponding target APP to be obtained of each participle unit, the target APP to be obtained For the APP to be obtained in the corresponding feature field including the participle unit;Computation subunit 406 is used for according to institute Number and number are stated, each described participle unit corresponding feature score value in each APP to be obtained is calculated.
Specifically, it obtains subelement 404 and is obtained in the corresponding feature field of each APP to be obtained respectively and include Participle unit, statistics subelement 405 count each participle unit in the corresponding feature field of each APP to be obtained The number of the number of appearance and the corresponding target APP to be obtained of each participle unit, computation subunit 406 is according to each The number and each participle unit that a participle unit occurs in the corresponding feature field of each APP to be obtained The number of corresponding target APP to be obtained calculates each described participle unit corresponding spy in each APP to be obtained Levy score value.Wherein, target APP to be obtained is described wait obtain including the participle unit in the corresponding feature field Take APP.
The acquisition device of APP recognition rule provided in an embodiment of the present invention, by obtaining multiple APP to be obtained when default Between include in operation generates in section access log feature field, and obtain the participle unit for including in feature field, calculate Each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each to be obtained The corresponding recognition rule of APP improves the acquisition efficiency of APP recognition rule.
On the basis of the above embodiments, further, computation subunit 406 is specifically used for:
According to the number that each participle unit occurs in the corresponding feature field of each APP to be obtained, press According to formula:
Calculate each described participle unit corresponding number characteristic value in each APP to be obtained;Wherein, Fi,j J-th of participle unit is in the corresponding number characteristic value of i-th of APP to be obtained, Ni,jIt is j-th of participle unit i-th The number occurred in the corresponding feature field of a APP to be obtained, NiIt is each participle unit described in i-th The total degree occurred in the corresponding feature field of APP to be obtained;
According to the number of the corresponding target APP to be obtained of each participle unit, according to formula:
Calculate the number characteristic value of the corresponding target APP to be obtained of each described participle unit;MjFor the jth The number characteristic value of the corresponding target APP to be obtained of a participle unit, P are the total number of the APP to be obtained, PjFor institute State the number of the corresponding target of j-th of participle unit APP to be obtained;
According to formula:
Δi,j=Fi,j×Mj
Calculate each described participle unit corresponding feature score value in each APP to be obtained;Wherein, Δi,jFor J-th of participle unit corresponding feature score value, F in i-th of APPi,jJ-th of participle unit described in i-th to Obtain the corresponding number characteristic value of APP, MjIt is special for the number of the corresponding target APP to be obtained of j-th of participle unit Value indicative.
Specifically, computation subunit 406 is directed to each described APP to be obtained, counts and wraps in its corresponding feature field The total degree that the number and each participle unit that each participle unit included occurs respectively occur, according to formula: Calculate each described participle unit corresponding number characteristic value in each APP to be obtained;Wherein, Fi,jJ-th of participle Unit is in the corresponding number characteristic value of i-th of APP to be obtained, Ni,jFor j-th of participle unit described in i-th to Obtain the number occurred in the corresponding feature field of APP, NiIt is described to be obtained at i-th for each participle unit The total degree occurred in the corresponding feature field of APP.
Computation subunit 406 obtain the corresponding target of each participle unit APP to be obtained number and it is described to The total number of APP is obtained, and according to formula:Calculate the corresponding target of each described participle unit The number characteristic value of APP to be obtained;MjFor the number feature of the corresponding target APP to be obtained of j-th of participle unit Value, P are the total number of the APP to be obtained, PjFor the number of the corresponding target APP to be obtained of j-th of participle unit.
The corresponding number in each APP to be obtained is special according to participle unit described in each for computation subunit 406 The number characteristic value of value indicative and the corresponding target APP to be obtained of each described participle unit, according to formula: Δi,j =Fi,j×MjCalculate each described participle unit corresponding feature score value in each APP to be obtained;Wherein, Δi,j For j-th of participle unit in i-th of APP corresponding feature score value, Fi,jJ-th of participle unit is described in i-th The corresponding number characteristic value of APP to be obtained, MjFor the number of the corresponding target APP to be obtained of j-th of participle unit Characteristic value.
The acquisition device of APP recognition rule provided in an embodiment of the present invention, by obtaining multiple APP to be obtained when default Between include in operation generates in section access log feature field, and obtain the participle unit for including in feature field, calculate Each participle unit corresponding feature score value in each APP to be obtained, to be generated according to feature score value each to be obtained The corresponding recognition rule of APP improves the acquisition efficiency of APP recognition rule.
The embodiment of device provided by the invention specifically can be used for executing the process flow of above-mentioned each method embodiment, Details are not described herein for function, is referred to the detailed description of above method embodiment.
Fig. 5 is electronic equipment entity apparatus structural schematic diagram provided in an embodiment of the present invention, as shown in figure 5, the electronics is set Standby may include: processor (processor) 501, memory (memory) 502 and bus 503, wherein processor 501 is deposited Reservoir 502 completes mutual communication by bus 503.Processor 501 can call the computer program in memory 502, To execute following method: obtaining multiple APP to be obtained and run the feature for including in the access log of generation within a preset period of time Field, the feature field include url field and UA field;The participle unit for including in the feature field is obtained, and is calculated Each described participle unit corresponding feature score value in each APP to be obtained;It is generated according to the feature score value each The corresponding recognition rule of a APP to be obtained.
The embodiment of the present invention discloses a kind of computer program product, and the computer program product is non-transient including being stored in Computer program on computer readable storage medium, the computer program include program instruction, when described program instructs quilt When computer executes, computer is able to carry out method provided by above-mentioned each method embodiment, for example, obtains multiple wait obtain APP is taken to run the feature field for including in the access log of generation within a preset period of time, the feature field includes url field With UA field;The participle unit for including in the feature field is obtained, and calculates each described participle unit each described Corresponding feature score value in APP to be obtained;The corresponding identification of each APP to be obtained is generated according to the feature score value to advise Then.
The embodiment of the present invention provides a kind of non-transient computer readable storage medium, the non-transient computer readable storage Medium storing computer program, the computer program make the computer execute side provided by above-mentioned each method embodiment Method, for example, obtain multiple APP to be obtained and run the tagged word for including in the access log of generation within a preset period of time Section, the feature field includes url field and UA field;The participle unit for including in the feature field is obtained, and is calculated every One participle unit corresponding feature score value in each APP to be obtained;It is generated according to the feature score value each The corresponding recognition rule of the APP to be obtained.
In addition, the logical order in above-mentioned memory 503 can be realized by way of SFU software functional unit and conduct Independent product when selling or using, can store in a computer readable storage medium.Based on this understanding, originally Substantially the part of the part that contributes to existing technology or the technical solution can be in other words for the technical solution of invention The form of software product embodies, which is stored in a storage medium, including some instructions to So that a computer equipment (can be personal computer, server or the network equipment etc.) executes each implementation of the present invention The all or part of the steps of example the method.And storage medium above-mentioned include: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. it is various It can store the medium of program code.
The apparatus embodiments described above are merely exemplary, wherein described, unit can as illustrated by the separation member It is physically separated with being or may not be, component shown as a unit may or may not be physics list Member, it can it is in one place, or may be distributed over multiple network units.It can be selected according to the actual needs In some or all of the modules achieve the purpose of the solution of this embodiment.Those of ordinary skill in the art are not paying creativeness Labour in the case where, it can understand and implement.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment can It realizes by means of software and necessary general hardware platform, naturally it is also possible to pass through hardware.Based on this understanding, on Stating technical solution, substantially the part that contributes to existing technology can be embodied in the form of software products in other words, should Computer software product may be stored in a computer readable storage medium, such as ROM/RAM, magnetic disk, CD, including several fingers It enables and using so that a computer equipment (can be personal computer, server or the network equipment etc.) executes each implementation Method described in certain parts of example or embodiment.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used To modify the technical solutions described in the foregoing embodiments or equivalent replacement of some of the technical features; And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution spirit and Range.

Claims (10)

1. a kind of acquisition methods of APP recognition rule characterized by comprising
It obtains multiple APP to be obtained and runs the feature field for including in the access log of generation, the spy within a preset period of time Levying field includes url field and UA field;
The participle unit for including in the feature field is obtained, and calculates each described participle unit each described to be obtained Corresponding feature score value in APP;
The corresponding recognition rule of each APP to be obtained is generated according to the feature score value.
2. the method according to claim 1, wherein described obtain the participle list for including in the feature field Member, and calculate each described participle unit corresponding feature score value in each APP to be obtained, comprising:
The participle unit for including in the corresponding feature field of each APP to be obtained is obtained respectively;
The number that each participle unit occurs in the corresponding feature field of each APP to be obtained is counted, and each The number of the corresponding target APP to be obtained of a participle unit, the target APP to be obtained are the corresponding feature field In include the participle unit the APP to be obtained;
According to the number and number, each described participle unit corresponding feature in each APP to be obtained is calculated Score value.
3. according to the method described in claim 2, calculating each institute it is characterized in that, described according to the number and number State participle unit corresponding feature score value in each APP to be obtained, comprising:
According to the number that each participle unit occurs in the corresponding feature field of each APP to be obtained, according to public affairs Formula:
Calculate each described participle unit corresponding number characteristic value in each APP to be obtained;Wherein, Fi,jJ-th Participle unit is in the corresponding number characteristic value of i-th of APP to be obtained, Ni,jIt is j-th of participle unit in i-th of institute State the number occurred in the corresponding feature field of APP to be obtained, NiFor each participle unit wait obtain described in i-th Take the total degree occurred in the corresponding feature field of APP;
According to the number of the corresponding target APP to be obtained of each participle unit, according to formula:
Calculate the number characteristic value of the corresponding target APP to be obtained of each described participle unit;MjIt is described j-th point The number characteristic value of the corresponding target APP to be obtained of word unit, P are the total number of the APP to be obtained, PjIt is described The number of the corresponding target of j participle unit APP to be obtained;
According to formula:
Δi,j=Fi,j×Mj
Calculate each described participle unit corresponding feature score value in each APP to be obtained;Wherein, Δi,jIt is described J-th of participle unit corresponding feature score value, F in i-th of APPi,jJ-th of participle unit is described to be obtained at i-th The corresponding number characteristic value of APP, MjFor the number characteristic value of the corresponding target APP to be obtained of j-th of participle unit.
4. the method according to claim 1, wherein described each described wait obtain according to feature score value generation Take the recognition rule of APP, comprising:
The participle unit that will include in the corresponding feature field of each APP to be obtained, according to the feature score value from height to It is low to be ranked up, using the participle unit for the forward preset quantity that sorts as the characteristic key words of the APP to be obtained;
The recognition rule of each APP to be obtained is generated according to the characteristic key words.
5. method according to any of claims 1-4, which is characterized in that the method also includes:
The installation kit of the APP to be obtained is obtained, APP to be obtained described in simultaneously dry run is installed;
Obtain the APP to be obtained access log that dry run generates within a preset period of time;
The feature field is obtained according to the access log, and the feature field is stored.
6. a kind of acquisition device of APP recognition rule characterized by comprising
Acquiring unit runs the spy for including in the access log of generation for obtaining multiple APP to be obtained within a preset period of time Field is levied, the feature field includes url field and UA field;
Computing unit for obtaining the participle unit for including in the feature field, and calculates each described participle unit and exists Corresponding feature score value in each APP to be obtained;
Processing unit, for generating the corresponding recognition rule of each APP to be obtained according to the feature score value.
7. device according to claim 6, which is characterized in that the computing unit includes:
Subelement is obtained, for obtaining the participle unit for including in the corresponding feature field of each APP to be obtained respectively;
Subelement is counted, is gone out in the corresponding feature field of each APP to be obtained for counting each participle unit The number of existing number and the corresponding target APP to be obtained of each participle unit, the target APP to be obtained are pair It include the APP to be obtained of the participle unit in the feature field answered;
Computation subunit, for calculating each described participle unit each described to be obtained according to the number and number Corresponding feature score value in APP.
8. device according to claim 7, which is characterized in that the computation subunit is specifically used for:
According to the number that each participle unit occurs in the corresponding feature field of each APP to be obtained, according to public affairs Formula:
Calculate each described participle unit corresponding number characteristic value in each APP to be obtained;Wherein, Fi,jJ-th Participle unit is in the corresponding number characteristic value of i-th of APP to be obtained, Ni,jIt is j-th of participle unit in i-th of institute State the number occurred in the corresponding feature field of APP to be obtained, NiFor each participle unit wait obtain described in i-th Take the total degree occurred in the corresponding feature field of APP;
According to the number of the corresponding target APP to be obtained of each participle unit, according to formula:
Calculate the number characteristic value of the corresponding target APP to be obtained of each described participle unit;MjIt is described j-th point The number characteristic value of the corresponding target APP to be obtained of word unit, P are the total number of the APP to be obtained, PjIt is described The number of the corresponding target of j participle unit APP to be obtained;
According to formula:
Δi,j=Fi,j×Mj
Calculate each described participle unit corresponding feature score value in each APP to be obtained;Wherein, Δi,jIt is described J-th of participle unit corresponding feature score value, F in i-th of APPi,jJ-th of participle unit is described to be obtained at i-th The corresponding number characteristic value of APP, MjFor the number characteristic value of the corresponding target APP to be obtained of j-th of participle unit.
9. a kind of electronic equipment, which is characterized in that including processor, memory and bus, in which:
The processor, the memory complete mutual communication by bus;
The processor can call the computer program in memory, to execute as described in claim 1-5 any one The step of method.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is by processor It realizes when execution such as the step of claim 1-5 any one the method.
CN201710453676.XA 2017-06-15 2017-06-15 Method and device for acquiring APP identification rule Active CN109144831B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710453676.XA CN109144831B (en) 2017-06-15 2017-06-15 Method and device for acquiring APP identification rule

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710453676.XA CN109144831B (en) 2017-06-15 2017-06-15 Method and device for acquiring APP identification rule

Publications (2)

Publication Number Publication Date
CN109144831A true CN109144831A (en) 2019-01-04
CN109144831B CN109144831B (en) 2021-10-29

Family

ID=64830160

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710453676.XA Active CN109144831B (en) 2017-06-15 2017-06-15 Method and device for acquiring APP identification rule

Country Status (1)

Country Link
CN (1) CN109144831B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110245273A (en) * 2019-06-21 2019-09-17 武汉绿色网络信息服务有限责任公司 A kind of method obtaining APP service feature library and corresponding device
CN111740923A (en) * 2020-06-22 2020-10-02 北京神州泰岳智能数据技术有限公司 Method and device for generating application identification rule, electronic equipment and storage medium
CN112839004A (en) * 2019-11-22 2021-05-25 中国电信股份有限公司 Application identification method and device
CN114500309A (en) * 2022-04-13 2022-05-13 南京华飞数据技术有限公司 Network application flow automatic configuration recognition system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130290319A1 (en) * 2012-04-27 2013-10-31 Eric Glover Performing application searches
CN104298735A (en) * 2014-09-30 2015-01-21 北京金山安全软件有限公司 Method and device for identifying application program type
CN104331662A (en) * 2013-07-22 2015-02-04 深圳市腾讯计算机系统有限公司 Method and device for detecting Android malicious application
CN104618132A (en) * 2014-12-16 2015-05-13 北京神州绿盟信息安全科技股份有限公司 Generation method and generation device for application program recognition rule

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130290319A1 (en) * 2012-04-27 2013-10-31 Eric Glover Performing application searches
CN104331662A (en) * 2013-07-22 2015-02-04 深圳市腾讯计算机系统有限公司 Method and device for detecting Android malicious application
CN104298735A (en) * 2014-09-30 2015-01-21 北京金山安全软件有限公司 Method and device for identifying application program type
CN104618132A (en) * 2014-12-16 2015-05-13 北京神州绿盟信息安全科技股份有限公司 Generation method and generation device for application program recognition rule

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
程骏: "面向移动互联网的文本分类技术应用研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110245273A (en) * 2019-06-21 2019-09-17 武汉绿色网络信息服务有限责任公司 A kind of method obtaining APP service feature library and corresponding device
CN110245273B (en) * 2019-06-21 2021-04-30 武汉绿色网络信息服务有限责任公司 Method for acquiring APP service feature library and corresponding device
CN112839004A (en) * 2019-11-22 2021-05-25 中国电信股份有限公司 Application identification method and device
CN112839004B (en) * 2019-11-22 2022-09-06 中国电信股份有限公司 Application identification method and device
CN111740923A (en) * 2020-06-22 2020-10-02 北京神州泰岳智能数据技术有限公司 Method and device for generating application identification rule, electronic equipment and storage medium
CN114500309A (en) * 2022-04-13 2022-05-13 南京华飞数据技术有限公司 Network application flow automatic configuration recognition system
CN114500309B (en) * 2022-04-13 2022-07-08 南京华飞数据技术有限公司 Network application flow automatic configuration recognition system

Also Published As

Publication number Publication date
CN109144831B (en) 2021-10-29

Similar Documents

Publication Publication Date Title
CN109144831A (en) A kind of acquisition methods and device of APP recognition rule
CN105389722B (en) Malicious order identification method and device
US20210035126A1 (en) Data processing method, system and computer device based on electronic payment behaviors
WO2018166113A1 (en) Random forest model training method, electronic apparatus and storage medium
US8751184B2 (en) Transaction based workload modeling for effective performance test strategies
CN109669795B (en) Crash information processing method and device
CN109918554A (en) Web data crawling method, device, system and computer readable storage medium
CN105577799B (en) A kind of fault detection method and device of data-base cluster
CN105577528B (en) A kind of wechat public platform collecting method and device based on virtual machine
CN110795697B (en) Method and device for acquiring logic expression, storage medium and electronic device
CN103248677A (en) Internet behavior analysis system and working method thereof
WO2020257993A1 (en) Content pushing method and apparatus, server, and storage medium
CN113412607A (en) Content pushing method and device, mobile terminal and storage medium
CN108600270A (en) A kind of abnormal user detection method and system based on network log
CN106998336B (en) Method and device for detecting user in channel
CN108985048A (en) Simulator recognition methods and relevant apparatus
Du et al. Hawkeye: Adaptive straggler identification on heterogeneous spark cluster with reinforcement learning
CN111126071A (en) Method and device for determining questioning text data and data processing method of customer service group
CN109144834A (en) Acquisition method and device, the Android system and terminal device of user behavior data
CN110245059A (en) A kind of data processing method, equipment and storage medium
CN110990352A (en) Method and device for determining data extraction rule, computer equipment and medium
CN108520438B (en) Behavior type determination method and device
CN110781410A (en) Community detection method and device
CN106294457A (en) Network information push method and device
CN109587248A (en) User identification method, device, server and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant