CN110515654A - A kind of Android application management system and method based on deep learning - Google Patents
A kind of Android application management system and method based on deep learning Download PDFInfo
- Publication number
- CN110515654A CN110515654A CN201910794990.3A CN201910794990A CN110515654A CN 110515654 A CN110515654 A CN 110515654A CN 201910794990 A CN201910794990 A CN 201910794990A CN 110515654 A CN110515654 A CN 110515654A
- Authority
- CN
- China
- Prior art keywords
- android
- feature
- deep learning
- module
- android application
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000013135 deep learning Methods 0.000 title claims abstract description 22
- 238000000034 method Methods 0.000 title abstract description 10
- 230000003068 static effect Effects 0.000 claims abstract description 26
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 15
- 238000007726 management method Methods 0.000 claims description 42
- 239000000284 extract Substances 0.000 claims description 3
- 239000000203 mixture Substances 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 2
- 230000035800 maturation Effects 0.000 claims description 2
- 238000004458 analytical method Methods 0.000 description 5
- 244000062793 Sorghum vulgare Species 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 235000019713 millet Nutrition 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 238000012550 audit Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 238000009533 lab test Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/70—Software maintenance or management
- G06F8/75—Structural analysis for program understanding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention relates to a kind of Android application management system and method based on deep learning, including characteristic extracting module, deep learning module and Classification Management module.Characteristic extracting module is applied with Android as input, is extracted feature using static extracting mode and is formed feature documents.Deep learning module uses convolutional neural networks as learning model.On the one hand the sample set that feature documents are formed with ident value is trained and is learnt, obtain the mature model of study;On the other hand using feature documents as input, using the model that study is mature, accurate Classification Management Android application improves the efficiency of management of application shop.
Description
Technical field
The present invention relates to mobile terminal safety technical field, in particular to a kind of Android application management based on deep learning
System and method.
Background technique
For Android as presently most popular intelligent movable operating system, equipment and number of users are huge, rich using type
Richness, safety have received widespread attention.Classification that Android application shop is specified generally according to application developer or by point
The description that analysis application developer provides classifies to application.But this process is easy to by the developer of malicious application
Manipulation applies addition to be easier to the information class application interface by audit to escape detection, such as by no qualification finance class.Furthermore with
The explosive growth of Android number of applications, how rapidly and accurately automatically to be classified to Android application, the raising efficiency of management
It is vital.But it is less for the research of Android application class management at present.In maximally related work, Shabtai
Et al. Classification Management is carried out to tool-class and the application of game class Android using static analysis and machine learning algorithm;Wang Wei etc.
People carries out Classification Management to the application of 24 class Androids also with Static Analysis Technology, using a variety of machine learning algorithms.But machine
Device learning algorithm is expressed complicated function and classification problem and is limited, and generalization ability is restricted, and accuracy rate is lower.
Summary of the invention
Technology of the invention solves the problems, such as: overcoming the deficiencies of the prior art and provide a kind of Android based on deep learning and answers
With management system and method, Android application feature is extracted using static extracting mode, to ensure Classification Management efficiency;Android is answered
Feature documents processing is two-dimensional eigenmatrix, and form turns to " picture ", using what is done well in picture recognition field
Convolutional neural networks algorithm carries out accurate Classification Management to the application of a large amount of Androids.
Technical solution of the invention: Android application management system and method based on deep learning are wrapped as shown in Figure 1
It includes:
Characteristic extracting module, the characteristic extracting module are configured as answering with the multiclass Android with different classes of ident value
With to input, uses static extracting mode to extract multiclass feature and form feature documents;
Deep learning module, the deep learning module are configured with convolutional neural networks to by the feature documents
It is trained with the sample set of classification logotype value composition, obtains the mature model of study;Simultaneously using the feature documents as defeated
Enter, uses the model of the study maturation as final Classification Management model;
Classification Management module, the Classification Management module use browser/server framework, and wherein server end is configured
For what is applied by the mature model of the study for calling the characteristic extracting module and deep learning module to obtain as Android
Classification Management model;Browser end is configured as carrying out uploading a large amount of new Android applications towards Android application shop, passes through clothes
The model at business device end carries out Classification Management to Android application.
The present invention is based on the Android application management systems of deep learning to realize that steps are as follows:
1. acquisition largely has the Android application of classification logotype, batch extracting static nature forms feature documents;
1.1. Android application is decompressed, file AndroidManifest.xml is obtained;
1.2. AndroidManifest.xml file is parsed using AXMLPrinter2 and TinyXml tool, obtains Android
Permission feature, software and hardware feature and the intent features of application are as static nature;For each Android application, it is wrapped
The static nature contained extracts in text document as feature documents;
2. being trained using convolutional neural networks to the sample set being made of feature documents and classification logotype value, learned
Mature model is practised, the convolutional neural networks are made of embeding layer, convolutional layer, pond layer and full articulamentum;
2.1. embeding layer: all feature documents are traversed and obtain non-duplicate static nature, map that low-dimensional vector table
Show, insertion dimension is 300, is indicated thus to obtain the corresponding vector of each static nature.The feature documents that each Android is applied
And its ident value is input to the embeding layer of convolutional neural networks, the static nature for being included by feature documents is converted to vector table
Show, obtains two-dimensional eigenmatrix.
2.2. 4 convolutional layers conv1, conv2, conv3 and conv4, convolution convolutional layer: are connected in parallel after embeding layer
The size of core is respectively 2x300,3x300,4x300 and 5x300, and the number of convolution kernel is 64.
2.3. pond layer: be separately connected after four convolutional layers maximum pond layer pooling1, pooling2, pooling3 and
Pooling4, each pond layer obtain 64 optimal characteristics.
2.4. full articulamentum: being sent into softmax classifier by full articulamentum after the layer of pond, carries out Classification Management.
3. Android is uploaded onto the server using batch by browser, the mature model of server calls study is divided
Class management.
The advantages of the present invention over the prior art are that:
(1) present invention is one kind effectively supplement to Android application management method.It is answered using static extracting mode from Android
It is permission feature, software and hardware feature and intent features respectively, compared to dynamic analysis, consumption is more with three category features of middle extraction
Few resource and time.Because application scenarios of the invention are mainly Android application shop, merely with Static Analysis Method
Feature is extracted, to ensure Classification Management efficiency.In addition, the system has certain expansion, it can be according to Android application shop
Free adjustment is carried out for the division of management category, while more features can also be added, more accurately to carry out Android
Good basis is laid in application class management.
(2) two-dimensional eigenmatrix is converted by the feature that each Android is applied, is visualized as " picture ", is incorporated in image
Identification aspect show outstanding convolutional neural networks model can more efficient learning characteristic, it is quasi- to reach higher Classification Management
True rate and faster Classification Management speed.
Detailed description of the invention
Fig. 1 is the flow chart of present system;
Fig. 2 is the structure of the convolutional neural networks in present system.
Specific embodiment
The present invention is described in detail with reference to the accompanying drawing
As shown in Figure 1, the present invention is based on the Android application management systems of deep learning by characteristic extracting module, deep learning
Module and Classification Management module composition.
Illustrate so that millet application shop application class manages as an example below, entire to realize that process is as follows:
(1) application of multiclass Android is downloaded from millet application shop crawler, including game class is applied, books reading class is applied,
Audio-visual audiovisual class application, the application of chat social category, the application of sports class, the application of Domestic News class, the application of fashionable shopping class, gold
Melt the application of financing class and photography and vedio recording class application, and manual correction is carried out to the application of wherein classification error, by above nine class
Application order number is 1~9, as classification logotype value;
(2) decompress Android application, obtain file AndroidManifest.xml, using AXMLPrinter2 and
TinyXml tool parse AndroidManifest.xml file, obtain Android application permission feature, software and hardware feature and
Intent features are as static nature, i.e. label<uses-permission>,<uses-feature>,<intent-filter>in
Content.For each Android application, the static nature for being included by it is extracted in text document as feature documents;
(3) sample set being made of feature documents and classification logotype value is trained using convolutional neural networks, is obtained
Learn mature model.Convolutional neural networks include embeding layer, convolutional layer, pond layer and full articulamentum.
(3.1) embeding layer: all feature documents are traversed and obtain non-duplicate static nature, map that low-dimensional vector table
Show, insertion dimension is 300, is indicated thus to obtain the corresponding vector of each static nature.The feature documents that each Android is applied
And its ident value is input to the embeding layer of convolutional neural networks, the static nature for being included by feature documents is converted to vector table
Show, obtains two-dimensional eigenmatrix.
(3.2) 4 convolutional layers conv1, conv2, conv3 and conv4, volume convolutional layer: are connected in parallel after embeding layer
The size of product core is respectively 2x300,3x300,4x300 and 5x300, and the number of convolution kernel is 64.
(3.3) maximum pond layer pooling1, pooling2, pooling3 pond layer: are separately connected after four convolutional layers
And pooling4, each pond layer obtain 64 optimal characteristics.
(3.4) full articulamentum: being sent into softmax classifier by full articulamentum after the layer of pond, carries out Classification Management.
(3.5) 80% is randomly selected in the application of all Androids as training sample, wherein 10% is used as verifying sample,
20% is remaininged as test sample.In each training process, the accuracy rate and penalty values of observation verifying sample, when penalty values no longer drop
Deconditioning when low obtains the mature model of study, as final Classification Management model.
(3.6) laboratory test results of the invention are as shown in table 1, the Classification Management model obtained using deep learning module
20% test sample is tested, six seed type Android application all classifications are correct as the result is shown, three types Android application
In respectively have an application class mistake.Demonstrate the classification validity of this system.Subsequent a large amount of Android applications can be carried out quick
Accurate Classification Management.
Table 1
(4) Android application shop can be uploaded onto the server Android using batch by browser, server calls
It practises mature model and carries out Classification Management.
Claims (4)
1. a kind of Android application management system based on deep learning characterized by comprising characteristic extracting module, depth
Practise module and Classification Management module, in which:
Characteristic extracting module, the characteristic extracting module, which is configured as applying with the multiclass Android with different classes of ident value, is
Input extracts multiclass feature using static extracting mode and forms feature documents;
Deep learning module, the deep learning module are configured with convolutional neural networks to by the feature documents and class
The sample set of other ident value composition is trained, and obtains the mature model of study;Simultaneously using the feature documents as input, make
Use the model of the study maturation as final Classification Management model;
Classification Management module, the Classification Management module use browser/server framework, and wherein server end is configured as leading to
Cross the classification that the mature model of the study for calling the characteristic extracting module and deep learning module to obtain is applied as Android
Administrative model;Browser end is configured as carrying out uploading a large amount of new Android applications towards Android application shop, passes through server
The model at end carries out Classification Management to Android application.
2. a kind of Android application management method based on deep learning, it is characterised in that: the following steps are included:
The first step, acquisition largely have the Android application of classification logotype, and batch extracting static nature forms feature documents;
Second step is trained the sample set being made of feature documents and classification logotype value using convolutional neural networks, obtains
Learn mature model;
Third step is uploaded onto the server Android using batch by browser, and the mature model of server calls study carries out
Classification Management.
3. the Android application management method according to claim 2 based on deep learning, it is characterised in that: in the first step,
The extraction static nature and form feature documents the following steps are included:
3.1. Android application is decompressed, file AndroidManifest.xml is obtained;
3.2. AndroidManifest.xml file is parsed using AXMLPrinter2 and TinyXml tool, obtains Android application
Permission feature, software and hardware feature and intent features as static nature;For each Android application, included by it
Static nature extracts in text document as feature documents.
4. the Android application management method according to claim 2 based on deep learning, it is characterised in that: in second step,
The convolutional neural networks are made of embeding layer, convolutional layer, pond layer and full articulamentum, are had the feature that
4.1. embeding layer: traversing all feature documents and obtain non-duplicate static nature, maps that the expression of low-dimensional vector, embedding
Entering dimension is 300, is indicated thus to obtain the corresponding vector of each static nature;The feature documents and its mark that each Android is applied
Knowledge value is input to the embeding layer of convolutional neural networks, and the static nature for being included by feature documents is converted to vector expression, obtains
Two-dimensional eigenmatrix;
4.2. convolutional layer: being connected in parallel 4 convolutional layers conv1, conv2, conv3 and conv4 after embeding layer, convolution kernel
Size is respectively 2x300,3x300,4x300 and 5x300, and the number of convolution kernel is 64;
4.3. pond layer: be separately connected after four convolutional layers maximum pond layer pooling1, pooling2, pooling3 and
Pooling4, each pond layer obtain 64 optimal characteristics;
4.4. full articulamentum: being sent into softmax classifier by full articulamentum after the layer of pond, carries out Classification Management.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910794990.3A CN110515654A (en) | 2019-08-27 | 2019-08-27 | A kind of Android application management system and method based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910794990.3A CN110515654A (en) | 2019-08-27 | 2019-08-27 | A kind of Android application management system and method based on deep learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110515654A true CN110515654A (en) | 2019-11-29 |
Family
ID=68627946
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910794990.3A Pending CN110515654A (en) | 2019-08-27 | 2019-08-27 | A kind of Android application management system and method based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110515654A (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105205396A (en) * | 2015-10-15 | 2015-12-30 | 上海交通大学 | Detecting system for Android malicious code based on deep learning and method thereof |
CN108304720A (en) * | 2018-02-06 | 2018-07-20 | 恒安嘉新(北京)科技股份公司 | A kind of Android malware detection methods based on machine learning |
-
2019
- 2019-08-27 CN CN201910794990.3A patent/CN110515654A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105205396A (en) * | 2015-10-15 | 2015-12-30 | 上海交通大学 | Detecting system for Android malicious code based on deep learning and method thereof |
CN108304720A (en) * | 2018-02-06 | 2018-07-20 | 恒安嘉新(北京)科技股份公司 | A kind of Android malware detection methods based on machine learning |
Non-Patent Citations (1)
Title |
---|
赵梦雪: "《基于深度学习的安卓恶意应用检测方法研究》", 《中国优秀硕士学位论文全文数据库信息科技辑(月刊)2018年第06期》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Dubey et al. | Multichannel decoded local binary patterns for content-based image retrieval | |
CN109800761A (en) | Method and terminal based on deep learning model creation paper document structural data | |
CN107742107A (en) | Facial image sorting technique, device and server | |
CN103258157B (en) | A kind of online handwriting authentication method based on finger information and system | |
CN110879963B (en) | Sensitive expression package detection method and device and electronic equipment | |
CN106228129A (en) | A kind of human face in-vivo detection method based on MATV feature | |
CN112597495B (en) | Malicious code detection method, system, equipment and storage medium | |
CN106980817A (en) | A kind of terrified video frequency identifying method based on Caffe frameworks | |
CN110457677A (en) | Entity-relationship recognition method and device, storage medium, computer equipment | |
CN107943811A (en) | The dissemination method and device of content | |
CN110223202A (en) | A kind of method and system of teaching stage property identification and scoring | |
CN112990268A (en) | Machine learning-oriented method and system for evaluating contribution degree of visual crowd sensing data | |
CN102567738A (en) | Rapid detection method for pornographic videos based on Gaussian distribution | |
CN110399547A (en) | For updating the method, apparatus, equipment and storage medium of model parameter | |
Zhou et al. | Pre-training-free image manipulation localization through non-mutually exclusive contrastive learning | |
Hung et al. | Automatic vietnamese passport recognition on android phones | |
CN107070702B (en) | User account correlation method and device based on cooperative game support vector machine | |
CN110515654A (en) | A kind of Android application management system and method based on deep learning | |
CN107103327A (en) | Image detecting method is forged in a kind of dyeing based on Color Statistical difference | |
CN114579796B (en) | Machine reading understanding method and device | |
CN116094971A (en) | Industrial control protocol identification method and device, electronic equipment and storage medium | |
CN114743148A (en) | Multi-scale feature fusion tampering video detection method, system, medium, and device | |
CN104657349B (en) | A kind of forum postings characteristic recognition method and device | |
CN114329050A (en) | Visual media data deduplication processing method, device, equipment and storage medium | |
CN108304838A (en) | A kind of pictorial information recognition methods and terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191129 |