CN110515654A - A kind of Android application management system and method based on deep learning - Google Patents

A kind of Android application management system and method based on deep learning Download PDF

Info

Publication number
CN110515654A
CN110515654A CN201910794990.3A CN201910794990A CN110515654A CN 110515654 A CN110515654 A CN 110515654A CN 201910794990 A CN201910794990 A CN 201910794990A CN 110515654 A CN110515654 A CN 110515654A
Authority
CN
China
Prior art keywords
android
feature
deep learning
module
android application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910794990.3A
Other languages
Chinese (zh)
Inventor
王志强
李格菲
池亚平
张健毅
吕欣
邹潇湘
钱榕
张克君
卓子寒
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE
Original Assignee
BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE filed Critical BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE
Priority to CN201910794990.3A priority Critical patent/CN110515654A/en
Publication of CN110515654A publication Critical patent/CN110515654A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/70Software maintenance or management
    • G06F8/75Structural analysis for program understanding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a kind of Android application management system and method based on deep learning, including characteristic extracting module, deep learning module and Classification Management module.Characteristic extracting module is applied with Android as input, is extracted feature using static extracting mode and is formed feature documents.Deep learning module uses convolutional neural networks as learning model.On the one hand the sample set that feature documents are formed with ident value is trained and is learnt, obtain the mature model of study;On the other hand using feature documents as input, using the model that study is mature, accurate Classification Management Android application improves the efficiency of management of application shop.

Description

A kind of Android application management system and method based on deep learning
Technical field
The present invention relates to mobile terminal safety technical field, in particular to a kind of Android application management based on deep learning System and method.
Background technique
For Android as presently most popular intelligent movable operating system, equipment and number of users are huge, rich using type Richness, safety have received widespread attention.Classification that Android application shop is specified generally according to application developer or by point The description that analysis application developer provides classifies to application.But this process is easy to by the developer of malicious application Manipulation applies addition to be easier to the information class application interface by audit to escape detection, such as by no qualification finance class.Furthermore with The explosive growth of Android number of applications, how rapidly and accurately automatically to be classified to Android application, the raising efficiency of management It is vital.But it is less for the research of Android application class management at present.In maximally related work, Shabtai Et al. Classification Management is carried out to tool-class and the application of game class Android using static analysis and machine learning algorithm;Wang Wei etc. People carries out Classification Management to the application of 24 class Androids also with Static Analysis Technology, using a variety of machine learning algorithms.But machine Device learning algorithm is expressed complicated function and classification problem and is limited, and generalization ability is restricted, and accuracy rate is lower.
Summary of the invention
Technology of the invention solves the problems, such as: overcoming the deficiencies of the prior art and provide a kind of Android based on deep learning and answers With management system and method, Android application feature is extracted using static extracting mode, to ensure Classification Management efficiency;Android is answered Feature documents processing is two-dimensional eigenmatrix, and form turns to " picture ", using what is done well in picture recognition field Convolutional neural networks algorithm carries out accurate Classification Management to the application of a large amount of Androids.
Technical solution of the invention: Android application management system and method based on deep learning are wrapped as shown in Figure 1 It includes:
Characteristic extracting module, the characteristic extracting module are configured as answering with the multiclass Android with different classes of ident value With to input, uses static extracting mode to extract multiclass feature and form feature documents;
Deep learning module, the deep learning module are configured with convolutional neural networks to by the feature documents It is trained with the sample set of classification logotype value composition, obtains the mature model of study;Simultaneously using the feature documents as defeated Enter, uses the model of the study maturation as final Classification Management model;
Classification Management module, the Classification Management module use browser/server framework, and wherein server end is configured For what is applied by the mature model of the study for calling the characteristic extracting module and deep learning module to obtain as Android Classification Management model;Browser end is configured as carrying out uploading a large amount of new Android applications towards Android application shop, passes through clothes The model at business device end carries out Classification Management to Android application.
The present invention is based on the Android application management systems of deep learning to realize that steps are as follows:
1. acquisition largely has the Android application of classification logotype, batch extracting static nature forms feature documents;
1.1. Android application is decompressed, file AndroidManifest.xml is obtained;
1.2. AndroidManifest.xml file is parsed using AXMLPrinter2 and TinyXml tool, obtains Android Permission feature, software and hardware feature and the intent features of application are as static nature;For each Android application, it is wrapped The static nature contained extracts in text document as feature documents;
2. being trained using convolutional neural networks to the sample set being made of feature documents and classification logotype value, learned Mature model is practised, the convolutional neural networks are made of embeding layer, convolutional layer, pond layer and full articulamentum;
2.1. embeding layer: all feature documents are traversed and obtain non-duplicate static nature, map that low-dimensional vector table Show, insertion dimension is 300, is indicated thus to obtain the corresponding vector of each static nature.The feature documents that each Android is applied And its ident value is input to the embeding layer of convolutional neural networks, the static nature for being included by feature documents is converted to vector table Show, obtains two-dimensional eigenmatrix.
2.2. 4 convolutional layers conv1, conv2, conv3 and conv4, convolution convolutional layer: are connected in parallel after embeding layer The size of core is respectively 2x300,3x300,4x300 and 5x300, and the number of convolution kernel is 64.
2.3. pond layer: be separately connected after four convolutional layers maximum pond layer pooling1, pooling2, pooling3 and Pooling4, each pond layer obtain 64 optimal characteristics.
2.4. full articulamentum: being sent into softmax classifier by full articulamentum after the layer of pond, carries out Classification Management.
3. Android is uploaded onto the server using batch by browser, the mature model of server calls study is divided Class management.
The advantages of the present invention over the prior art are that:
(1) present invention is one kind effectively supplement to Android application management method.It is answered using static extracting mode from Android It is permission feature, software and hardware feature and intent features respectively, compared to dynamic analysis, consumption is more with three category features of middle extraction Few resource and time.Because application scenarios of the invention are mainly Android application shop, merely with Static Analysis Method Feature is extracted, to ensure Classification Management efficiency.In addition, the system has certain expansion, it can be according to Android application shop Free adjustment is carried out for the division of management category, while more features can also be added, more accurately to carry out Android Good basis is laid in application class management.
(2) two-dimensional eigenmatrix is converted by the feature that each Android is applied, is visualized as " picture ", is incorporated in image Identification aspect show outstanding convolutional neural networks model can more efficient learning characteristic, it is quasi- to reach higher Classification Management True rate and faster Classification Management speed.
Detailed description of the invention
Fig. 1 is the flow chart of present system;
Fig. 2 is the structure of the convolutional neural networks in present system.
Specific embodiment
The present invention is described in detail with reference to the accompanying drawing
As shown in Figure 1, the present invention is based on the Android application management systems of deep learning by characteristic extracting module, deep learning Module and Classification Management module composition.
Illustrate so that millet application shop application class manages as an example below, entire to realize that process is as follows:
(1) application of multiclass Android is downloaded from millet application shop crawler, including game class is applied, books reading class is applied, Audio-visual audiovisual class application, the application of chat social category, the application of sports class, the application of Domestic News class, the application of fashionable shopping class, gold Melt the application of financing class and photography and vedio recording class application, and manual correction is carried out to the application of wherein classification error, by above nine class Application order number is 1~9, as classification logotype value;
(2) decompress Android application, obtain file AndroidManifest.xml, using AXMLPrinter2 and TinyXml tool parse AndroidManifest.xml file, obtain Android application permission feature, software and hardware feature and Intent features are as static nature, i.e. label<uses-permission>,<uses-feature>,<intent-filter>in Content.For each Android application, the static nature for being included by it is extracted in text document as feature documents;
(3) sample set being made of feature documents and classification logotype value is trained using convolutional neural networks, is obtained Learn mature model.Convolutional neural networks include embeding layer, convolutional layer, pond layer and full articulamentum.
(3.1) embeding layer: all feature documents are traversed and obtain non-duplicate static nature, map that low-dimensional vector table Show, insertion dimension is 300, is indicated thus to obtain the corresponding vector of each static nature.The feature documents that each Android is applied And its ident value is input to the embeding layer of convolutional neural networks, the static nature for being included by feature documents is converted to vector table Show, obtains two-dimensional eigenmatrix.
(3.2) 4 convolutional layers conv1, conv2, conv3 and conv4, volume convolutional layer: are connected in parallel after embeding layer The size of product core is respectively 2x300,3x300,4x300 and 5x300, and the number of convolution kernel is 64.
(3.3) maximum pond layer pooling1, pooling2, pooling3 pond layer: are separately connected after four convolutional layers And pooling4, each pond layer obtain 64 optimal characteristics.
(3.4) full articulamentum: being sent into softmax classifier by full articulamentum after the layer of pond, carries out Classification Management.
(3.5) 80% is randomly selected in the application of all Androids as training sample, wherein 10% is used as verifying sample, 20% is remaininged as test sample.In each training process, the accuracy rate and penalty values of observation verifying sample, when penalty values no longer drop Deconditioning when low obtains the mature model of study, as final Classification Management model.
(3.6) laboratory test results of the invention are as shown in table 1, the Classification Management model obtained using deep learning module 20% test sample is tested, six seed type Android application all classifications are correct as the result is shown, three types Android application In respectively have an application class mistake.Demonstrate the classification validity of this system.Subsequent a large amount of Android applications can be carried out quick Accurate Classification Management.
Table 1
(4) Android application shop can be uploaded onto the server Android using batch by browser, server calls It practises mature model and carries out Classification Management.

Claims (4)

1. a kind of Android application management system based on deep learning characterized by comprising characteristic extracting module, depth Practise module and Classification Management module, in which:
Characteristic extracting module, the characteristic extracting module, which is configured as applying with the multiclass Android with different classes of ident value, is Input extracts multiclass feature using static extracting mode and forms feature documents;
Deep learning module, the deep learning module are configured with convolutional neural networks to by the feature documents and class The sample set of other ident value composition is trained, and obtains the mature model of study;Simultaneously using the feature documents as input, make Use the model of the study maturation as final Classification Management model;
Classification Management module, the Classification Management module use browser/server framework, and wherein server end is configured as leading to Cross the classification that the mature model of the study for calling the characteristic extracting module and deep learning module to obtain is applied as Android Administrative model;Browser end is configured as carrying out uploading a large amount of new Android applications towards Android application shop, passes through server The model at end carries out Classification Management to Android application.
2. a kind of Android application management method based on deep learning, it is characterised in that: the following steps are included:
The first step, acquisition largely have the Android application of classification logotype, and batch extracting static nature forms feature documents;
Second step is trained the sample set being made of feature documents and classification logotype value using convolutional neural networks, obtains Learn mature model;
Third step is uploaded onto the server Android using batch by browser, and the mature model of server calls study carries out Classification Management.
3. the Android application management method according to claim 2 based on deep learning, it is characterised in that: in the first step, The extraction static nature and form feature documents the following steps are included:
3.1. Android application is decompressed, file AndroidManifest.xml is obtained;
3.2. AndroidManifest.xml file is parsed using AXMLPrinter2 and TinyXml tool, obtains Android application Permission feature, software and hardware feature and intent features as static nature;For each Android application, included by it Static nature extracts in text document as feature documents.
4. the Android application management method according to claim 2 based on deep learning, it is characterised in that: in second step, The convolutional neural networks are made of embeding layer, convolutional layer, pond layer and full articulamentum, are had the feature that
4.1. embeding layer: traversing all feature documents and obtain non-duplicate static nature, maps that the expression of low-dimensional vector, embedding Entering dimension is 300, is indicated thus to obtain the corresponding vector of each static nature;The feature documents and its mark that each Android is applied Knowledge value is input to the embeding layer of convolutional neural networks, and the static nature for being included by feature documents is converted to vector expression, obtains Two-dimensional eigenmatrix;
4.2. convolutional layer: being connected in parallel 4 convolutional layers conv1, conv2, conv3 and conv4 after embeding layer, convolution kernel Size is respectively 2x300,3x300,4x300 and 5x300, and the number of convolution kernel is 64;
4.3. pond layer: be separately connected after four convolutional layers maximum pond layer pooling1, pooling2, pooling3 and Pooling4, each pond layer obtain 64 optimal characteristics;
4.4. full articulamentum: being sent into softmax classifier by full articulamentum after the layer of pond, carries out Classification Management.
CN201910794990.3A 2019-08-27 2019-08-27 A kind of Android application management system and method based on deep learning Pending CN110515654A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910794990.3A CN110515654A (en) 2019-08-27 2019-08-27 A kind of Android application management system and method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910794990.3A CN110515654A (en) 2019-08-27 2019-08-27 A kind of Android application management system and method based on deep learning

Publications (1)

Publication Number Publication Date
CN110515654A true CN110515654A (en) 2019-11-29

Family

ID=68627946

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910794990.3A Pending CN110515654A (en) 2019-08-27 2019-08-27 A kind of Android application management system and method based on deep learning

Country Status (1)

Country Link
CN (1) CN110515654A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105205396A (en) * 2015-10-15 2015-12-30 上海交通大学 Detecting system for Android malicious code based on deep learning and method thereof
CN108304720A (en) * 2018-02-06 2018-07-20 恒安嘉新(北京)科技股份公司 A kind of Android malware detection methods based on machine learning

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105205396A (en) * 2015-10-15 2015-12-30 上海交通大学 Detecting system for Android malicious code based on deep learning and method thereof
CN108304720A (en) * 2018-02-06 2018-07-20 恒安嘉新(北京)科技股份公司 A kind of Android malware detection methods based on machine learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵梦雪: "《基于深度学习的安卓恶意应用检测方法研究》", 《中国优秀硕士学位论文全文数据库信息科技辑(月刊)2018年第06期》 *

Similar Documents

Publication Publication Date Title
Dubey et al. Multichannel decoded local binary patterns for content-based image retrieval
CN109800761A (en) Method and terminal based on deep learning model creation paper document structural data
CN107742107A (en) Facial image sorting technique, device and server
CN103258157B (en) A kind of online handwriting authentication method based on finger information and system
CN110879963B (en) Sensitive expression package detection method and device and electronic equipment
CN106228129A (en) A kind of human face in-vivo detection method based on MATV feature
CN112597495B (en) Malicious code detection method, system, equipment and storage medium
CN106980817A (en) A kind of terrified video frequency identifying method based on Caffe frameworks
CN110457677A (en) Entity-relationship recognition method and device, storage medium, computer equipment
CN107943811A (en) The dissemination method and device of content
CN110223202A (en) A kind of method and system of teaching stage property identification and scoring
CN112990268A (en) Machine learning-oriented method and system for evaluating contribution degree of visual crowd sensing data
CN102567738A (en) Rapid detection method for pornographic videos based on Gaussian distribution
CN110399547A (en) For updating the method, apparatus, equipment and storage medium of model parameter
Zhou et al. Pre-training-free image manipulation localization through non-mutually exclusive contrastive learning
Hung et al. Automatic vietnamese passport recognition on android phones
CN107070702B (en) User account correlation method and device based on cooperative game support vector machine
CN110515654A (en) A kind of Android application management system and method based on deep learning
CN107103327A (en) Image detecting method is forged in a kind of dyeing based on Color Statistical difference
CN114579796B (en) Machine reading understanding method and device
CN116094971A (en) Industrial control protocol identification method and device, electronic equipment and storage medium
CN114743148A (en) Multi-scale feature fusion tampering video detection method, system, medium, and device
CN104657349B (en) A kind of forum postings characteristic recognition method and device
CN114329050A (en) Visual media data deduplication processing method, device, equipment and storage medium
CN108304838A (en) A kind of pictorial information recognition methods and terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20191129