CN113034331A - Android gambling application identification method and system based on multi-mode fusion - Google Patents
- Publication number
- CN113034331A, CN202110490157.7A
- Authority
- CN
- China
- Prior art keywords
- android
- gambling
- application
- model
- formula
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
- G06Q50/34—Betting or bookmaking, e.g. Internet betting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Abstract
The invention discloses an android gambling application identification method and system based on multi-mode fusion, belonging to the technical field of android application security. The identification method comprises the following specific process: (1) obtaining, in batches, website comments and download clue information for android gambling applications from gambling websites; (2) finding application download links ending in APK on a target website, and extracting suspected android applications; (3) extracting the package name, icon, certificate, IP addresses, URL domain names and email addresses from the android application installation package; (4) judging whether the application is a gambling application by means of a multi-mode fused android gambling application recognition model, which comprises an image model, a text model and a Multihead Attention fusion model; (5) storing the basic information and the installation package of each discovered android application. The multi-mode recognition model can accurately recognize gambling applications on the android platform, which helps to reduce illegal online gambling activity.
Description
Technical Field
The invention relates to the technical field of android application security, in particular to an android gambling application identification method and system based on multi-mode fusion.
Background
A search shows that Chinese patent No. CN108052523A discloses a gambling website identification method and system based on a convolutional neural network. That method identifies a screenshot of the website to be predicted with a convolutional neural network model and judges whether the website is a gambling website, but it trains on and identifies only the image features of the website. In recent years, with the rapid development of the internet and the mobile communication industry, online gambling, an illegal criminal activity with a large number of participants, wide propagation channels and large case sizes, has continued to spread worldwide, causing large outflows of funds, and the crimes derived from it seriously threaten the social security of China. Android mobile applications are one of the important carriers for spreading online gambling information: the platform is open and applications can be installed without an official or third-party application store, so a large number of android gambling applications are spread by directly providing an installation package or by providing download links on official websites or other information channels. Gambling applications are one kind of harmful application, yet analysis of harmful android applications currently focuses mainly on traditional network security topics such as malicious code and behavior security, and there is little research on discovering harmful application content or on content security. It is therefore important to provide an android gambling application identification method and system based on multi-mode fusion.
Existing android application identification methods usually focus on network security problems within the application, such as malicious code and behavior security. There is little research on discovering and discriminating harmful application content or on content information security, and there are few public training and test data sets related to content security. In addition, an android gambling application installation package contains a large amount of multi-modal information such as text, pictures and certificates, and existing multi-modal fusion methods are not targeted at this information. An android gambling application identification method and system based on multi-mode fusion is therefore provided.
Disclosure of Invention
The invention aims to solve the defects in the prior art, and provides an android gambling application identification method and system based on multi-mode fusion.
In order to achieve the purpose, the invention adopts the following technical scheme:
an android gambling application identification method based on multi-mode fusion comprises the following specific process:
(1) obtaining, in batches, website comments and download clue information for android gambling applications from gambling websites;
(2) finding application download links ending in APK on a target website, and extracting suspected android applications;
(3) extracting the package name, icon, certificate, IP addresses, URL domain names and email addresses from the android application installation package;
(4) judging whether the application is a gambling application by means of a multi-mode fused android gambling application recognition model, which comprises an image model, a text model and a Multihead Attention fusion model;
(5) storing the basic information and the installation package of each discovered android application;
(6) displaying, at the front end, the overall discovery status of android gambling applications and the newly discovered android gambling applications.
Preferably, the image model determination process is specifically as follows:
S1: the icon picture $I$ is mapped to a feature $f_I$ using VGGNet, by the formula:
$f_I = \mathrm{CNN}_{vgg}(I)$ (1)
S2: the icon picture is scaled to 448 × 448 pixels, and the feature $f_I$ of the last pooling layer is then obtained; its output dimension is 512 × 14 × 14, where 14 × 14 is the number of picture regions and 512 is the dimension of each region's feature vector;
S3: each region feature vector is converted by a single-layer perceptron into a new vector with the same dimension as the text vector, by the formula:
$v_I = \tanh(W_I f_I + b_I)$ (2)
in the formula: $v_I$ is a matrix whose i-th column is the feature vector of picture region i;
S4: an 18-layer or 34-layer residual neural network is used to address the vanishing-gradient problem.
Preferably, the residual neural network is built from building blocks of two stacked layers, structured as follows:
$H(x) = F(x, \{W_i\}) + x$ (3)
in the formula: $x$ and $H(x)$ are the input and output vectors of the building block; $F(x, \{W_i\})$ represents the residual mapping to be learned;
wherein:
$F(x) = W_2 \delta(W_1 x)$ (4)
in the formula: $\delta$ represents the activation function, $W_1$ represents the first connection weight, and $W_2$ represents the second connection weight;
if the dimensions of $x$ and $F$ do not match, they can be matched using a linear mapping $W_s$:
$H(x) = F(x, \{W_i\}) + W_s x$ (5)
the formula $F(x) + x$ is implemented by a feedforward neural network with a shortcut connection; the shortcut connection performs an identity mapping, and its output is added to the output of the stacked layers.
Preferably, the text model is specifically an LSTM. The basic structure of the LSTM is a memory unit that retains the sequence state. At each step, the LSTM unit takes an input word vector $x_t$, updates the memory cell $c_t$, and then outputs a hidden state $h_t$. A gate mechanism is used in the updating process: the forget gate $f_t$ controls how much information from the previous state $c_{t-1}$ is kept; the input gate $i_t$ controls how much information from the current input $x_t$ is written to the memory unit; and the output gate $o_t$ controls how much information enters the output, namely the hidden state. The detailed updating process is as follows:
$i_t = \sigma(W_{xi} x_t + W_{hi} h_{t-1} + b_i)$ (6)
$f_t = \sigma(W_{xf} x_t + W_{hf} h_{t-1} + b_f)$ (7)
$o_t = \sigma(W_{xo} x_t + W_{ho} h_{t-1} + b_o)$ (8)
$c_t = f_t c_{t-1} + i_t \tanh(W_{xc} x_t + W_{hc} h_{t-1} + b_c)$ (9)
$h_t = o_t \tanh(c_t)$ (10)
in the formula: i, f, o and c are respectively the input gate, the forget gate, the output gate and the memory unit;
the word vector $x_t$ serves as the input to the LSTM, by the formulas:
$x_t = W_e q_t,\ t \in \{1, 2, \dots, T\}$ (11)
$h_t = \mathrm{LSTM}(x_t),\ t \in \{1, 2, \dots, T\}$ (12)
in the formula: $q = [q_1, \dots, q_T]$ represents the text, and $q_t$ is the one-hot vector representation of the word at position t.
Preferably, the Multihead Attention fusion model is specifically a multi-modal fusion model based on an attention mechanism, which fuses the outputs of the picture model and the text model. The attention mechanism is calculated as follows:
(1) the weight between the query and each key is calculated from their similarity, with the dot product used as the similarity function;
(2) the dot product is divided by the factor $\sqrt{d_k}$, which acts as a scaling adjustment so that the dot product does not become too large;
(3) the obtained weights are normalized by a softmax function;
(4) the normalized weights are applied to the values of the corresponding keys to obtain their weighted sum;
based on the above steps, the following formula is obtained:
$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{Q K^{T}}{\sqrt{d_k}}\right) V$ (13)
in the formula: $Q$ is the feature vector of the picture, $V$ and $K$ are the outputs of the text model, and $d_k$ is the dimension of the key vectors.
Preferably, the multi-modal fusion result is passed through a global average pooling layer, by the formula:
$v_{gap} = \mathrm{Global}(v_1, v_2, \dots, v_n)$ (14)
finally, the obtained vector $v_{gap}$ is input directly into the softmax layer for classification prediction, and the prediction result is:
$\hat{y} = \mathrm{softmax}(v_{gap})$ (15)
a cross-entropy function is introduced to evaluate the model; it reflects the difference between the true class $y$ and the predicted class $\hat{y}$:
$L = -\sum_{i} y_i \log \hat{y}_i$ (16)
in the formula: i is an index number.
An android gambling application recognition system based on multi-mode fusion comprises a web crawler layer, an extraction and discrimination layer, a data storage layer and a result display layer;
the web crawler layer is used for crawling android gambling application clues in website content and comment information, and for discovering and capturing android application download addresses;
the extraction and discrimination layer is used for extracting basic information of the android application and performing android platform gambling application identification based on multi-mode fusion;
the data storage layer is used for storing the identified android application installation packages and the android application information;
the result display layer is used for displaying an overall view of the android gambling applications and the newly discovered android gambling applications.
Compared with the prior art, the invention has the beneficial effects that:
1. in the android gambling application identification method based on multi-mode fusion, resource files of many different modalities contained in an android application installation package, such as package names, icons, certificates and character strings (IP addresses, URL domain names and e-mail addresses), are extracted by a data crawler method, and the picture and text feature vectors of the application are then obtained through a picture model and a text model, which lays a foundation for building the subsequent multi-mode recognition model;
2. in the android gambling application identification method based on multi-mode fusion, the feature vectors obtained by the picture model and the text model are fused and trained to form a multi-mode recognition model; compared with a traditional single-feature android gambling application recognition model, the multi-mode recognition model covers a wider recognition range and achieves higher recognition precision, so that gambling applications on the android platform can be identified accurately and automatically, which helps to reduce illegal online gambling activity.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.
FIG. 1 is an overall flowchart of the android gambling application identification method based on multi-mode fusion according to the present invention;
FIG. 2 is a schematic overall structure diagram of an android gambling application recognition system based on multi-mode fusion, provided by the invention;
FIG. 3 is a schematic diagram illustrating a process of determining a picture model according to the present invention;
FIG. 4 is a schematic diagram of a residual neural network structure according to the present invention;
FIG. 5 is a schematic diagram of a fusion process of the Multihead Attention fusion model of the present invention;
FIG. 6 is a schematic illustration of a calculation process for the attention mechanism of the present invention;
FIG. 7 is a diagram of the ResNet-18 model prediction results of the present invention;
FIG. 8 is a diagram illustrating the model prediction results after ResNet-18 pre-training in accordance with the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments.
In the description of the present invention, it is to be understood that the terms "upper", "lower", "front", "rear", "left", "right", "top", "bottom", "inner", "outer", and the like, indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, are merely for convenience in describing the present invention and simplifying the description, and do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention.
Referring to FIGS. 1-8, 495 gambling applications were collected, of which 395 form the training set, 50 the validation set and 50 the test set; 446 normal non-gambling applications were crawled from the application market, of which 346 form the training set, 50 the validation set and 50 the test set, as shown in the following table:
Class | Training set | Validation set | Test set | Total |
Gambling | 395 | 50 | 50 | 495 |
Non-gambling | 346 | 50 | 50 | 446 |
The text information extracted from the applications is illustrated in the following tables:
Serial number | Domain name |
1 | www.qhc25.com |
2 | agmbet.com |
3 | api.383game7a1.com |
4 | api.yjgame1.com |
5 | api.kgky8372.com |
Serial number | Partial in-application text examples |
1 | Full international, red-envelope fishing and AG video … |
2 | All-season color, dragon and tiger war, Shenlongbao Tibetan and fried golden flower … |
3 | Lebo cash network, lottery 25, Wuwan … |
4 | Venice entertainment, Baijiale … |
5 | Yongli international entertainment city … |
10994 collected short texts were used as pre-training data for the text model, of which 3425 were gambling texts and 7569 were normal non-gambling texts, as shown in the following table:
During training, each picture is first preprocessed by scaling and cropping, and its pixel values are then mapped from the range 0 to 255 to the range -1 to 1 for normalization;
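As a minimal sketch of the pixel normalization described above, the following assumes a linear mapping from [0, 255] to [-1, 1]; the source does not state the exact formula, and the function name `normalize_pixels` is hypothetical.

```python
def normalize_pixels(pixels):
    """Map 8-bit pixel values in [0, 255] linearly to [-1.0, 1.0].

    Assumption: a linear mapping p / 127.5 - 1; the source only gives
    the input and output ranges.
    """
    return [p / 127.5 - 1.0 for p in pixels]


# One row of grayscale pixel values (illustrative data).
row = [0, 51, 255]
normalized = normalize_pixels(row)
```

The scaling and cropping steps are omitted here because they depend on the image library in use.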
The embodiment provides an android gambling application identification method based on multi-mode fusion, which specifically comprises the following process:
(1) obtaining, in batches, website comments and download clue information for android gambling applications from gambling websites;
(2) finding application download links ending in APK on a target website, and extracting suspected android applications;
(3) extracting information such as the package name, icon, certificate and character strings (IP addresses, URL domain names and e-mail addresses) from the android application installation package; some examples of extracted android application information are as follows:
(4) judging whether the application is a gambling application by means of a multi-mode fused android gambling application recognition model, which comprises an image model, a text model and a Multihead Attention fusion model;
(5) storing the basic information and the installation package of each discovered android application;
(6) displaying, at the front end, the overall discovery status of android gambling applications and the newly discovered android gambling applications.
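Steps (2) and (3) can be sketched roughly as follows. This is only an illustration under stated assumptions: the regular expressions, the function names `find_apk_links` and `extract_strings`, and the sample inputs are all hypothetical, and unpacking the installation package to obtain its strings is not shown.

```python
import re
from urllib.parse import urlparse

# Illustrative patterns (assumptions, not taken from the patent).
APK_LINK = re.compile(r'href=["\']([^"\']+\.apk)["\']', re.IGNORECASE)
IPV4 = re.compile(r'\b(?:\d{1,3}\.){3}\d{1,3}\b')  # does not range-check octets
EMAIL = re.compile(r'\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b')
URL = re.compile(r'https?://[^\s"\'<>]+')


def find_apk_links(html):
    """Step (2): return href targets that end in .apk."""
    return APK_LINK.findall(html)


def extract_strings(strings):
    """Step (3), strings only: collect IP addresses, URL domains and emails."""
    ips, domains, emails = set(), set(), set()
    for s in strings:
        ips.update(IPV4.findall(s))
        emails.update(EMAIL.findall(s))
        for url in URL.findall(s):
            host = urlparse(url).hostname
            # Hosts that are literal IPs are already counted as IP addresses.
            if host and not IPV4.fullmatch(host):
                domains.add(host)
    return {"ips": sorted(ips), "domains": sorted(domains), "emails": sorted(emails)}


page = ('<a href="http://example.com/app/game.apk">Download</a> '
        '<a href="/about.html">About</a>')
apk_links = find_apk_links(page)
found = extract_strings([
    "http://api.example.com/v1",
    "connect 10.0.0.1:8080",
    "mail support@example.org",
])
```

A regular-expression scan is a simple choice for a sketch; a production crawler would more likely parse the HTML and the package manifest with dedicated tools.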
The image model determination process is specifically as follows:
S1: the icon picture $I$ is mapped to a feature $f_I$ using VGGNet, by the formula:
$f_I = \mathrm{CNN}_{vgg}(I)$ (1)
S2: the icon picture is scaled to 448 × 448 pixels, and the feature $f_I$ of the last pooling layer is then obtained; its output dimension is 512 × 14 × 14, where 14 × 14 is the number of picture regions and 512 is the dimension of each region's feature vector;
S3: each region feature vector is converted by a single-layer perceptron into a new vector with the same dimension as the text vector, by the formula:
$v_I = \tanh(W_I f_I + b_I)$ (2)
in the formula: $v_I$ is a matrix whose i-th column is the feature vector of picture region i;
S4: an 18-layer or 34-layer residual neural network is used to address the vanishing-gradient problem.
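The projection in formula (2) can be illustrated with a small pure-Python sketch. The tiny dimensions (2 channels, 2 regions, target dimension 3) stand in for the 512 channels and 14 × 14 regions of the text, and the function name `project_regions` and all numeric values are hypothetical.

```python
import math


def project_regions(f_I, W_I, b_I):
    """Compute v_I = tanh(W_I f_I + b_I).

    f_I: channels x regions (one column per picture region)
    W_I: d x channels, b_I: length d, where d is the text-vector dimension
    Returns a d x regions matrix; column i is the projected region i.
    """
    regions = len(f_I[0])
    v_I = []
    for w_row, b in zip(W_I, b_I):
        v_I.append([
            math.tanh(sum(w * f_I[c][r] for c, w in enumerate(w_row)) + b)
            for r in range(regions)
        ])
    return v_I


f_I = [[1.0, 0.0],
       [0.0, 2.0]]            # 2 channels x 2 regions (illustrative)
W_I = [[1.0, 0.0],
       [0.0, 1.0],
       [0.5, 0.5]]            # projects 2 channels to dimension 3
b_I = [0.0, 0.0, 0.0]
v_I = project_regions(f_I, W_I, b_I)
```

The same weights are applied to every region, so the number of regions is preserved and only the per-region dimension changes.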
The residual neural network is built from building blocks of two stacked layers, structured as follows:
$H(x) = F(x, \{W_i\}) + x$ (3)
in the formula: $x$ and $H(x)$ are the input and output vectors of the building block; $F(x, \{W_i\})$ represents the residual mapping to be learned;
wherein:
$F(x) = W_2 \delta(W_1 x)$ (4)
in the formula: $\delta$ represents the activation function, $W_1$ represents the first connection weight, and $W_2$ represents the second connection weight;
if the dimensions of $x$ and $F$ do not match, they can be matched using a linear mapping $W_s$:
$H(x) = F(x, \{W_i\}) + W_s x$ (5)
the formula $F(x) + x$ is implemented by a feedforward neural network with a shortcut connection; the shortcut connection performs an identity mapping, and its output is added to the output of the stacked layers.
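Formulas (3) to (5) can be sketched as a single fully connected residual block. This is an illustration only: it assumes ReLU as the activation function δ, which the text does not specify, and it uses plain matrix-vector products in place of the convolutions of a real ResNet. The names `matvec`, `relu` and `residual_block` are hypothetical.

```python
def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def relu(v):
    """Assumed activation function (delta in formula (4))."""
    return [max(0.0, a) for a in v]


def residual_block(x, W1, W2, Ws=None):
    """H(x) = W2 * relu(W1 * x) + shortcut(x).

    The shortcut is the identity (formula (3)) unless a projection Ws is
    given to match dimensions (formula (5)).
    """
    fx = matvec(W2, relu(matvec(W1, x)))
    shortcut = x if Ws is None else matvec(Ws, x)
    return [f + s for f, s in zip(fx, shortcut)]


x = [1.0, -2.0]
identity = [[1.0, 0.0], [0.0, 1.0]]
zeros = [[0.0, 0.0], [0.0, 0.0]]

h_same = residual_block(x, identity, [[2.0, 0.0], [0.0, 2.0]])
h_zero = residual_block(x, zeros, zeros)             # F(x) = 0, so H(x) = x
h_proj = residual_block(x, identity, [[1.0, 1.0]], Ws=[[1.0, 1.0]])
```

With all residual weights at zero the block reduces to the identity, which is the property that makes very deep stacks easier to train.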
It should be noted that the text model is specifically an LSTM. The basic structure of the LSTM is a memory unit that retains the sequence state. At each step, the LSTM unit takes an input word vector $x_t$, updates the memory cell $c_t$, and then outputs a hidden state $h_t$. A gate mechanism is used in the updating process: the forget gate $f_t$ controls how much information from the previous state $c_{t-1}$ is kept; the input gate $i_t$ controls how much information from the current input $x_t$ is written to the memory unit; and the output gate $o_t$ controls how much information enters the output, namely the hidden state. The detailed updating process is as follows:
$i_t = \sigma(W_{xi} x_t + W_{hi} h_{t-1} + b_i)$ (6)
$f_t = \sigma(W_{xf} x_t + W_{hf} h_{t-1} + b_f)$ (7)
$o_t = \sigma(W_{xo} x_t + W_{ho} h_{t-1} + b_o)$ (8)
$c_t = f_t c_{t-1} + i_t \tanh(W_{xc} x_t + W_{hc} h_{t-1} + b_c)$ (9)
$h_t = o_t \tanh(c_t)$ (10)
in the formula: i, f, o and c are respectively the input gate, the forget gate, the output gate and the memory unit;
the word vector $x_t$ serves as the input to the LSTM, by the formulas:
$x_t = W_e q_t,\ t \in \{1, 2, \dots, T\}$ (11)
$h_t = \mathrm{LSTM}(x_t),\ t \in \{1, 2, \dots, T\}$ (12)
in the formula: $q = [q_1, \dots, q_T]$ represents the text, and $q_t$ is the one-hot vector representation of the word at position t.
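One update step of formulas (6) to (10) can be sketched in pure Python as follows. The parameter dictionary layout, the helper names and the all-zero demo weights are hypothetical; products between gate vectors and state vectors are taken element-wise.

```python
import math


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def affine(Wx, x, Wh, h, b):
    """Compute Wx x + Wh h + b row by row."""
    return [
        sum(w * xi for w, xi in zip(rx, x))
        + sum(w * hi for w, hi in zip(rh, h))
        + bi
        for rx, rh, bi in zip(Wx, Wh, b)
    ]


def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM step following formulas (6)-(10)."""
    i_t = [sigmoid(a) for a in affine(p["Wxi"], x_t, p["Whi"], h_prev, p["bi"])]
    f_t = [sigmoid(a) for a in affine(p["Wxf"], x_t, p["Whf"], h_prev, p["bf"])]
    o_t = [sigmoid(a) for a in affine(p["Wxo"], x_t, p["Who"], h_prev, p["bo"])]
    g_t = [math.tanh(a) for a in affine(p["Wxc"], x_t, p["Whc"], h_prev, p["bc"])]
    c_t = [f * c + i * g for f, c, i, g in zip(f_t, c_prev, i_t, g_t)]
    h_t = [o * math.tanh(c) for o, c in zip(o_t, c_t)]
    return h_t, c_t


# Demo with all-zero parameters: every gate equals sigmoid(0) = 0.5 and the
# candidate is tanh(0) = 0, so the cell state is simply halved.
Z = [[0.0, 0.0], [0.0, 0.0]]
params = {k: Z for k in ("Wxi", "Whi", "Wxf", "Whf", "Wxo", "Who", "Wxc", "Whc")}
params.update({k: [0.0, 0.0] for k in ("bi", "bf", "bo", "bc")})
h_t, c_t = lstm_step([1.0, 1.0], [0.0, 0.0], [1.0, -2.0], params)
```

Running `lstm_step` over t = 1, …, T, feeding each returned `h_t` and `c_t` into the next call, gives the sequence of hidden states in formula (12).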
It should be noted that the Multihead Attention fusion model is specifically a multi-modal fusion model based on an attention mechanism, which fuses the outputs of the picture model and the text model. The attention mechanism is calculated as follows:
(1) the weight between the query and each key is calculated from their similarity, with the dot product used as the similarity function;
(2) the dot product is divided by the factor $\sqrt{d_k}$, which acts as a scaling adjustment so that the dot product does not become too large;
(3) the obtained weights are normalized by a softmax function;
(4) the normalized weights are applied to the values of the corresponding keys to obtain their weighted sum;
based on the above steps, the following formula is obtained:
$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{Q K^{T}}{\sqrt{d_k}}\right) V$ (13)
in the formula: $Q$ is the feature vector of the picture, $V$ and $K$ are the outputs of the text model, and $d_k$ is the dimension of the key vectors.
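The four calculation steps above can be sketched as scaled dot-product attention for a single head. This is a simplified illustration, not the full multi-head model: the learned per-head projections are omitted, the scaling factor is assumed to be the square root of the key dimension, and the function names and demo values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(Q, K, V):
    """Single-head scaled dot-product attention.

    Q: queries (here, picture region features), one row per query
    K, V: keys and values (here, text model outputs), one row per position
    """
    d_k = len(K[0])
    out = []
    for q in Q:
        # Steps (1) and (2): dot-product similarity, scaled by sqrt(d_k).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k) for k in K]
        # Step (3): normalize the weights.
        weights = softmax(scores)
        # Step (4): weighted sum of the values.
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out


Q = [[0.0, 0.0]]                 # a zero query attends to all keys equally
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[2.0, 0.0], [0.0, 4.0]]
fused = attention(Q, K, V)
```

A multi-head variant would run several such heads on differently projected copies of Q, K and V and concatenate the results.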
The multi-modal fusion result is passed through a global average pooling layer, by the formula:
$v_{gap} = \mathrm{Global}(v_1, v_2, \dots, v_n)$ (14)
finally, the obtained vector $v_{gap}$ is input directly into the softmax layer for classification prediction, and the prediction result is:
$\hat{y} = \mathrm{softmax}(v_{gap})$ (15)
a cross-entropy function is introduced to evaluate the model; it reflects the difference between the true class $y$ and the predicted class $\hat{y}$:
$L = -\sum_{i} y_i \log \hat{y}_i$ (16)
in the formula: i is an index number.
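The pooling, softmax and cross-entropy steps can be sketched together as below. The sketch assumes that global average pooling is an element-wise mean over the n fused vectors, that the pooled vector has one entry per class, and that the loss is the standard categorical cross-entropy; these details and all names are assumptions rather than the source's stated implementation.

```python
import math


def global_average_pool(vectors):
    """Element-wise mean of the vectors v_1, ..., v_n."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(y_true, y_pred):
    """Categorical cross-entropy between a one-hot label and predictions."""
    return -sum(t * math.log(p) for t, p in zip(y_true, y_pred) if t > 0)


# Two fused vectors with two entries each (gambling, non-gambling); values
# are illustrative.
fused_vectors = [[1.0, 3.0], [3.0, 1.0]]
v_gap = global_average_pool(fused_vectors)
y_hat = softmax(v_gap)
loss = cross_entropy([1.0, 0.0], y_hat)
```

In this demo the pooled scores are equal, so the prediction is an even split and the loss equals log 2.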
The embodiment provides an android gambling application recognition system based on multi-mode fusion, which comprises a web crawler layer, an extraction and discrimination layer, a data storage layer and a result display layer;
the web crawler layer is used for crawling android gambling application clues in website content and comment information, and for discovering and capturing android application download addresses;
the extraction and discrimination layer is used for extracting basic information of the android application and performing android platform gambling application identification based on multi-mode fusion;
the data storage layer is used for storing the identified android application installation packages and the android application information;
the result display layer is used for displaying an overall view of the android gambling applications and the newly discovered android gambling applications.
The results of the picture model on the test set are shown in the following table:
Model | Precision | Recall | F1 |
CNN | 0.61 | 0.86 | 0.71 |
ResNet-18 | 0.82 | 0.79 | 0.80 |
ResNet-34 | 0.78 | 0.77 | 0.77 |
CNN(Pre-training) | 0.69 | 0.78 | 0.73 |
ResNet-18(Pre-training) | 0.84 | 0.80 | 0.82 |
ResNet-34(Pre-training) | 0.82 | 0.78 | 0.80 |
Specifically, the icon recognition results show that the ResNet networks clearly outperform the basic CNN network, and that pre-training improves the basic CNN network only slightly (F1 rises from 0.71 to 0.73). Notably, ResNet-34 performs worse than ResNet-18; pre-training improves ResNet-34 (F1 rises from 0.77 to 0.80), but it still performs worse than the pre-trained ResNet-18 (F1 of 0.82);
as shown in FIGS. 7 and 8, in terms of the per-picture prediction results, the basic CNN model predicts most pictures correctly, but its prediction probability rises only a few percentage points above the roughly fifty percent seen before training, whereas the prediction probability of the ResNet model can reach ninety percent.
The results of the text model on the test set are shown in the following table:
Specifically, the LSTM clearly improves on the basic RNN model. Pre-training on other gambling-related short texts also greatly improves the basic RNN model, but it remains inferior to the LSTM model, so the pre-trained LSTM text model is selected for the final model. The overall effect of the text model is, however, noticeably worse than that of the picture model.
The results of the fusion model on the test set are shown in the following table:
Model | Precision | Recall | F1 |
ResNet | 0.84 | 0.80 | 0.82 |
LSTM | 0.80 | 0.81 | 0.80 |
LSTM-ResNet-Concat | 0.88 | 0.92 | 0.90 |
LSTM-ResNet-MHAT | 0.90 | 0.93 | 0.91 |
For the final multi-modal feature fusion model, a baseline is also built for comparison: the image feature vectors and the text feature vectors are concatenated and then passed through a fully connected layer, a global average pooling layer and a softmax layer for classification prediction. The experimental results show that with single-modality features the model can already learn how harmful gambling applications differ from normal applications and identify them effectively (F1 of 0.82 for ResNet and 0.80 for LSTM), and that with multi-modal features the effect improves markedly (F1 of 0.90 for concatenation and 0.91 for Multihead Attention). The Multihead Attention mechanism obtains the better result by attending to partial regions of the picture rather than to the whole picture, which would introduce interference; this is of great significance for identifying more harmful gambling applications.
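The F1 column of the fusion table can be cross-checked against its precision and recall columns, assuming F1 is the usual harmonic mean of the two (the source does not state its definition). The precision, recall and F1 values below are copied from the table; the function name and the 0.01 rounding tolerance are assumptions.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) from the fusion-model results table.
reported = {
    "ResNet": (0.84, 0.80, 0.82),
    "LSTM": (0.80, 0.81, 0.80),
    "LSTM-ResNet-Concat": (0.88, 0.92, 0.90),
    "LSTM-ResNet-MHAT": (0.90, 0.93, 0.91),
}

# A row is consistent if the recomputed F1 matches the reported value to
# within two-decimal rounding.
consistent = {
    name: abs(f1_score(p, r) - f1) < 0.01
    for name, (p, r, f1) in reported.items()
}
```

Under this definition all four reported F1 values agree with their precision and recall to two decimal places.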
The working principle and the using process of the invention are as follows: when the multi-mode fusion-based android gambling application identification system is used, website comments and download clue information for android gambling applications are first obtained in batches from gambling websites; application download links ending in APK are then found on the target website, and suspected android applications are extracted; next, the package name, icon, certificate, IP addresses, URL domain names and email addresses are extracted from the android application installation package; the multi-mode fused android gambling application recognition model, which comprises an image model, a text model and a Multihead Attention fusion model, then judges whether the application is a gambling application; the basic information and the installation package of each discovered android application are stored; finally, the front end displays the overall discovery status of android gambling applications and the newly discovered android gambling applications. In the invention, static resources of the application, such as package names, icons, certificates and character strings (IP addresses, URL domain names and e-mail addresses), are extracted through crawler technology, and features of different modalities such as text and pictures are then combined using the Multihead Attention multi-modal fusion technique to construct a multi-mode fused android gambling application recognition model, so that gambling applications on the android platform can be identified accurately and automatically.
The above description covers only preferred embodiments of the present invention, but the scope of protection of the present invention is not limited thereto. Any equivalent substitution or modification made by a person skilled in the art, within the technical scope disclosed by the present invention and according to its technical solutions and inventive concept, shall fall within the scope of protection of the present invention.
Claims (7)
1. An android gambling application identification method based on multi-mode fusion, characterized by comprising the following specific steps:
(1) obtaining, in batches, website comments and download clue information for android gambling applications from gambling websites;
(2) finding application download links ending in APK on a target website, and extracting suspected android applications;
(3) extracting the package name, icon, certificate, IP addresses, URL domain names and email addresses from the android application installation package;
(4) judging whether the application is a gambling application by means of a multi-mode fused android gambling application recognition model, which comprises an image model, a text model and a Multihead Attention fusion model;
(5) storing the basic information and the installation package of each discovered android application;
(6) displaying, at the front end, the overall discovery status of android gambling applications and the newly discovered android gambling applications.
2. The method for identifying the android gambling application based on multi-modal fusion as claimed in claim 1, wherein the image model determination process is specifically as follows:
S1: the icon picture $I$ is mapped to a feature $f_I$ using VGGNet, by the formula:
$f_I = \mathrm{CNN}_{vgg}(I)$ (1)
S2: the icon picture is scaled to 448 × 448 pixels, and the feature $f_I$ of the last pooling layer is then obtained; its output dimension is 512 × 14 × 14, where 14 × 14 is the number of picture regions and 512 is the dimension of each region's feature vector;
S3: each region feature vector is converted by a single-layer perceptron into a new vector with the same dimension as the text vector, by the formula:
$v_I = \tanh(W_I f_I + b_I)$ (2)
in the formula: $v_I$ is a matrix whose i-th column is the feature vector of picture region i;
S4: an 18-layer or 34-layer residual neural network is used to address the vanishing-gradient problem.
3. The method for identifying the android gambling application based on multi-modal fusion, characterized in that the residual neural network is composed of building blocks of two stacked layers, each with the following structure:
$H(x) = F(x, \{W_i\}) + x$ (3)
in the formula: $x$ and $H(x)$ are the input and output vectors of the building block; $F(x, \{W_i\})$ represents the learned residual mapping;
wherein:
$F(x) = W_2 \delta(W_1 x)$ (4)
in the formula: $\delta$ represents the activation function, $W_1$ represents the first connection weight, and $W_2$ represents the second connection weight;
if the dimensions of $x$ and $F$ do not match, they can be matched using a linear mapping $W_s$:
$H(x) = F(x, \{W_i\}) + W_s x$ (5)
the formula $F(x) + x$ is implemented by a feedforward neural network with a shortcut connection; the shortcut connection performs an identity mapping, and its output is added to the output of the stacked layers.
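Formulas (3) to (5) can be written out as a short sketch of a single building block. It assumes that the activation $\delta$ is ReLU, which the claim does not specify, and it uses plain Python lists instead of a deep learning library. The function names are hypothetical.

```python
def matvec(W, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def relu(v):
    """Element-wise ReLU, used here as an assumed choice for the activation delta."""
    return [max(0.0, a) for a in v]


def residual_block(x, W1, W2, W_s=None):
    """Compute H(x) = W2 * relu(W1 x) + shortcut(x).

    The shortcut is the identity (formula 3) unless a projection W_s is
    supplied to match dimensions (formula 5).
    """
    F = matvec(W2, relu(matvec(W1, x)))              # formula (4)
    shortcut = x if W_s is None else matvec(W_s, x)
    if len(shortcut) != len(F):
        raise ValueError("dimensions of x and F differ; pass a projection W_s")
    return [f + s for f, s in zip(F, shortcut)]


x = [1.0, -2.0]
identity = [[1.0, 0.0], [0.0, 1.0]]

# Same input and output dimension: identity shortcut.
same_dim = residual_block(x, identity, [[0.5, 0.0], [0.0, 0.5]])

# Output dimension 3: a projection W_s maps x to the new dimension.
W2_wide = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
W_s = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
projected = residual_block(x, identity, W2_wide, W_s)
print(same_dim, projected)
```

When the weights in $F$ are zero the block returns its input unchanged, which is the property that lets gradients pass through deep stacks of such blocks.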
4. The method as claimed in claim 1, wherein the text model is an LSTM; the basic structure of the LSTM is a memory unit that preserves the sequence state; at each step, the LSTM unit takes an input word vector $x_t$, updates the memory cell $c_t$, and then outputs a hidden state $h_t$; a gate mechanism is used in the updating process: the forget gate $f_t$ controls how much information from the previous state $c_{t-1}$ is kept; the input gate $i_t$ controls how much of the current input $x_t$ is written into the memory unit; the output gate $o_t$ controls how much information enters the output, namely the hidden state; the detailed updating process is as follows:
$i_t = \sigma(W_{xi} x_t + W_{hi} h_{t-1} + b_i)$ (6)
$f_t = \sigma(W_{xf} x_t + W_{hf} h_{t-1} + b_f)$ (7)
$o_t = \sigma(W_{xo} x_t + W_{ho} h_{t-1} + b_o)$ (8)
$c_t = f_t c_{t-1} + i_t \tanh(W_{xc} x_t + W_{hc} h_{t-1} + b_c)$ (9)
$h_t = o_t \tanh(c_t)$ (10)
in the formula: $i$, $f$, $o$ and $c$ are the input gate, forget gate, output gate and memory unit respectively;
the word vector $x_t$ serves as the input to the LSTM, according to the formulas:
$x_t = W_e q_t, \quad t \in \{1, 2, \ldots, T\}$ (11)
$h_t = \mathrm{LSTM}(x_t), \quad t \in \{1, 2, \ldots, T\}$ (12)
in the formula: $q = [q_1, \ldots, q_T]$ represents the text, and $q_t$ is the one-hot vector representation of the word at position $t$.
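The update in formulas (6) to (10) and the embedding lookup in formula (11) can be traced with a pure-Python sketch of one LSTM step. The products between gates and states are taken element-wise, as is standard for LSTMs. The parameter dictionary layout, the helper `constant_params`, the toy sizes and all weight values are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import math


def matvec(W, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(W_x, W_h, b, x_t, h_prev):
    """W_x x_t + W_h h_{t-1} + b, the pre-activation shared by formulas (6)-(9)."""
    return [a + c + d for a, c, d in zip(matvec(W_x, x_t), matvec(W_h, h_prev), b)]


def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM update; p maps names such as 'W_xi' to weights (hypothetical layout)."""
    i_t = [sigmoid(z) for z in affine(p["W_xi"], p["W_hi"], p["b_i"], x_t, h_prev)]    # (6)
    f_t = [sigmoid(z) for z in affine(p["W_xf"], p["W_hf"], p["b_f"], x_t, h_prev)]    # (7)
    o_t = [sigmoid(z) for z in affine(p["W_xo"], p["W_ho"], p["b_o"], x_t, h_prev)]    # (8)
    g_t = [math.tanh(z) for z in affine(p["W_xc"], p["W_hc"], p["b_c"], x_t, h_prev)]
    c_t = [f * c + i * g for f, c, i, g in zip(f_t, c_prev, i_t, g_t)]                # (9)
    h_t = [o * math.tanh(c) for o, c in zip(o_t, c_t)]                                # (10)
    return h_t, c_t


def embed(W_e, word_index):
    """x_t = W_e q_t: with a one-hot q_t this selects column word_index of W_e."""
    return [row[word_index] for row in W_e]


def constant_params(d_in, d_hidden, value):
    """Hypothetical helper: every weight equals `value`, every bias is zero."""
    p = {}
    for gate in "ifoc":
        p["W_x" + gate] = [[value] * d_in for _ in range(d_hidden)]
        p["W_h" + gate] = [[value] * d_hidden for _ in range(d_hidden)]
        p["b_" + gate] = [0.0] * d_hidden
    return p


# Toy setup: vocabulary of 3 words, embedding size 2, hidden size 2.
W_e = [[0.1, 0.2, 0.3],
       [0.4, 0.5, 0.6]]
params = constant_params(d_in=2, d_hidden=2, value=0.1)
h, c = [0.0, 0.0], [0.0, 0.0]
for word_index in [0, 2, 1]:          # a three-word "text"
    h, c = lstm_step(embed(W_e, word_index), h, c, params)
print(h, c)
```

With all weights set to zero every gate equals 0.5 and the candidate value is 0, so one step simply halves the memory cell; this makes a convenient sanity check for an implementation.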
5. The method for identifying an android gambling application based on multi-modal fusion as claimed in claim 1, wherein the Multihead Attention fusion model is a multi-modal fusion model based on an attention mechanism; the results output by the image model and the text model are fused by the attention mechanism, which is calculated as follows:
(1) calculating the weight between the query and each key from their similarity, with the dot product used as the similarity function;
(2) dividing the dot product by $\sqrt{d_k}$, which acts as a scaling factor so that the dot product does not become too large;
(3) normalizing the obtained weights with a softmax function;
(4) computing the weighted sum of the values, each weighted by the normalized weight of its corresponding key;
based on the above steps, the following formula is obtained:
$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{Q K^T}{\sqrt{d_k}}\right) V$ (13)
in the formula: $Q$ is the feature vector of the picture, $V$ and $K$ are the output of the text model, and $d_k$ is the dimension of the keys.
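The four attention steps listed above can be sketched for a single attention head as follows. This is a minimal illustration of standard scaled dot-product attention, not the patent's full multi-head model: the learned per-head projections are omitted, and the function names and toy vectors are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(Q, K, V):
    """Single-head attention over lists of vectors.

    Q: queries (here: projected image region vectors)
    K, V: keys and values (here: text model hidden states)
    Returns (outputs, weights), one row per query.
    """
    d_k = len(K[0])
    outputs, weights = [], []
    for q in Q:
        # Steps (1) and (2): dot-product similarity, scaled by sqrt(d_k).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k) for k in K]
        # Step (3): normalize the weights.
        w = softmax(scores)
        # Step (4): weighted sum of the values.
        out = [sum(w_j * v[c] for w_j, v in zip(w, V)) for c in range(len(V[0]))]
        weights.append(w)
        outputs.append(out)
    return outputs, weights


# Toy data: two image-region queries attending over three text states.
Q = [[1.0, 0.0], [0.0, 1.0]]
text_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused, attn = scaled_dot_product_attention(Q, text_states, text_states)
print(fused)
print(attn)
```

A multi-head variant would run several such heads on separately projected copies of the queries, keys and values and concatenate the results.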
6. The method for identifying an android gambling application based on multi-modal fusion as claimed in claim 5, wherein the multi-modal fusion result passes through a global average pooling layer, according to the formula:
$v_{gap} = \mathrm{Global}(v_1, v_2, \ldots, v_n)$ (14)
the resulting vector $v_{gap}$ is input directly into the softmax layer for classification prediction, and the prediction result is as follows:
$\hat{y} = \mathrm{softmax}(v_{gap})$ (15)
the cross-entropy function is introduced to evaluate the model; it reflects the difference between the true class $y$ and the predicted class $\hat{y}$:
$L = -\sum_i y_i \log \hat{y}_i$ (16)
in the formula: $i$ is an index number.
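The pooling, prediction and evaluation chain of this claim can be followed numerically with a short sketch. It assumes that global average pooling takes the mean over the $n$ fused vectors, that the pooled vector has one component per class, and that the loss is the standard cross entropy for a one-hot label; the two-class toy numbers are hypothetical.

```python
import math


def global_average_pool(vectors):
    """Average a list of equal-length vectors component by component."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(y_true, y_pred, eps=1e-12):
    """Cross entropy between a one-hot label and predicted probabilities.

    eps guards against log(0) and is an implementation detail of this sketch.
    """
    return -sum(t * math.log(p + eps) for t, p in zip(y_true, y_pred))


# Two fused vectors, two classes (gambling, not gambling): made-up values.
fused_vectors = [[2.0, 0.0], [4.0, 2.0]]
v_gap = global_average_pool(fused_vectors)
y_hat = softmax(v_gap)
loss = cross_entropy([1.0, 0.0], y_hat)
print(v_gap, y_hat, loss)
```

The loss shrinks toward zero as the probability assigned to the true class approaches 1, which is why it serves as a measure of the gap between the true and predicted classes.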
7. An android gambling application identification system based on multi-modal fusion, characterized by comprising a web crawler layer, an extraction discrimination layer, a data storage layer and a result display layer;
the web crawler layer is used for crawling clues to android gambling applications in website content and comment information, and for capturing the android application download addresses that are found;
the extraction discrimination layer is used for extracting basic information of the android application and carrying out android platform gambling application identification based on multi-modal fusion;
the data storage layer is used for storing the identified android application installation packages and the android application information;
the result display layer is used for displaying the overall set of android gambling applications and the newly discovered android gambling applications.
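How the four layers hand data to one another can be sketched with plain functions and a small in-memory store. Everything here is a hypothetical stand-in: the crawler receives pre-parsed links, the package name is derived from the file name rather than from the installation package, and a keyword check replaces the multi-modal recognition model.

```python
from dataclasses import dataclass, field


@dataclass
class AppRecord:
    download_url: str
    package_name: str
    is_gambling: bool


@dataclass
class DataStorageLayer:
    """In-memory stand-in for the data storage layer."""
    records: list = field(default_factory=list)

    def save(self, record):
        self.records.append(record)


def crawler_layer(pages):
    """Return download addresses ending in .apk from already-parsed page links."""
    return [url for page in pages for url in page["links"] if url.lower().endswith(".apk")]


def extraction_discrimination_layer(url, classify):
    """Extract basic information and classify the application.

    The package name is taken from the file name as a placeholder for real
    installation package parsing; `classify` stands in for the fusion model.
    """
    package_name = url.rsplit("/", 1)[-1][:-4]
    return AppRecord(url, package_name, classify(package_name))


def result_display_layer(storage, already_seen):
    """Return all gambling applications and the subset not seen before."""
    gambling = [r for r in storage.records if r.is_gambling]
    new = [r for r in gambling if r.package_name not in already_seen]
    return {"all": gambling, "new": new}


# Made-up input for illustration only.
pages = [{"links": [
    "https://example.com/a/lucky_slots.apk",
    "https://example.com/help.html",
    "https://example.com/b/notes.apk",
]}]
storage = DataStorageLayer()
for url in crawler_layer(pages):
    storage.save(extraction_discrimination_layer(url, lambda name: "slots" in name))
view = result_display_layer(storage, already_seen={"old_casino"})
print([r.package_name for r in view["all"]], [r.package_name for r in view["new"]])
```

Keeping the classifier as an injected function means the storage and display code does not depend on how the recognition model is implemented.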
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110490157.7A CN113034331A (en) | 2021-05-06 | 2021-05-06 | Android gambling application identification method and system based on multi-mode fusion |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113034331A (en) | 2021-06-25 |
Family
ID=76455431
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110490157.7A Pending CN113034331A (en) | 2021-05-06 | 2021-05-06 | Android gambling application identification method and system based on multi-mode fusion |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113034331A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113918949A (en) * | 2021-12-13 | 2022-01-11 | 北京赋乐科技有限公司 | Recognition method of fraud APP based on multi-mode fusion |
CN114021651A (en) * | 2021-11-04 | 2022-02-08 | 桂林电子科技大学 | Block chain violation information perception method based on deep learning |
CN114140673A (en) * | 2022-02-07 | 2022-03-04 | 人民中科(济南)智能技术有限公司 | Illegal image identification method, system and equipment |
CN114491529A (en) * | 2021-12-20 | 2022-05-13 | 西安电子科技大学 | Android malicious application program identification method based on multi-modal neural network |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170102929A1 (en) * | 2015-10-12 | 2017-04-13 | Samsung Electronics Co., Ltd. | System and Method to Reduce Storage Area Usage of Android Application |
CN108052523A (en) * | 2017-11-03 | 2018-05-18 | 中国互联网络信息中心 | Gambling site recognition methods and system based on convolutional neural networks |
CN110619075A (en) * | 2018-06-04 | 2019-12-27 | 阿里巴巴集团控股有限公司 | Webpage identification method and equipment |
CN111310026A (en) * | 2020-01-17 | 2020-06-19 | 南京邮电大学 | Artificial intelligence-based yellow-related terrorism monitoring method |
CN111563551A (en) * | 2020-04-30 | 2020-08-21 | 支付宝(杭州)信息技术有限公司 | Multi-mode information fusion method and device and electronic equipment |
CN112347244A (en) * | 2019-08-08 | 2021-02-09 | 四川大学 | Method for detecting website involved in yellow and gambling based on mixed feature analysis |
CN112600834A (en) * | 2020-12-10 | 2021-04-02 | 同盾控股有限公司 | Content security identification method and device, storage medium and electronic equipment |
Non-Patent Citations (1)
Title |
---|
小崔: "Attention 之 Multi-Head Attention", 《HTTPS://ZHUANLAN.ZHIHU.COM/P/172400219》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114021651A (en) * | 2021-11-04 | 2022-02-08 | 桂林电子科技大学 | Block chain violation information perception method based on deep learning |
CN114021651B (en) * | 2021-11-04 | 2024-03-29 | 桂林电子科技大学 | Block chain illegal information sensing method based on deep learning |
CN113918949A (en) * | 2021-12-13 | 2022-01-11 | 北京赋乐科技有限公司 | Recognition method of fraud APP based on multi-mode fusion |
CN114491529A (en) * | 2021-12-20 | 2022-05-13 | 西安电子科技大学 | Android malicious application program identification method based on multi-modal neural network |
CN114140673A (en) * | 2022-02-07 | 2022-03-04 | 人民中科(济南)智能技术有限公司 | Illegal image identification method, system and equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113034331A (en) | Android gambling application identification method and system based on multi-mode fusion | |
CN111680176B (en) | Remote sensing image retrieval method and system based on attention and bidirectional feature fusion | |
CN113486981B (en) | RGB image classification method based on multi-scale feature attention fusion network | |
CN111582397B (en) | CNN-RNN image emotion analysis method based on attention mechanism | |
CN107273416B (en) | Webpage hidden link detection method and device and computer readable storage medium | |
Mohan et al. | Spoof net: syntactic patterns for identification of ominous online factors | |
CN113158554B (en) | Model optimization method and device, computer equipment and storage medium | |
CN113011889B (en) | Account anomaly identification method, system, device, equipment and medium | |
CN113422761B (en) | Malicious social user detection method based on counterstudy | |
CN109918648B (en) | Rumor depth detection method based on dynamic sliding window feature score | |
CN112819024B (en) | Model processing method, user data processing method and device and computer equipment | |
CN111783903A (en) | Text processing method, text model processing method and device and computer equipment | |
CN115757991A (en) | Webpage identification method and device, electronic equipment and storage medium | |
CN114978624A (en) | Phishing webpage detection method, device and equipment and storage medium | |
CN114282258A (en) | Screen capture data desensitization method and device, computer equipment and storage medium | |
CN113627550A (en) | Image-text emotion analysis method based on multi-mode fusion | |
CN114638984B (en) | Malicious website URL detection method based on capsule network | |
CN117112814A (en) | False media content mining and identification system and identification method thereof | |
CN111144453A (en) | Method and equipment for constructing multi-model fusion calculation model and method and equipment for identifying website data | |
CN115129902B (en) | Media data processing method, device, equipment and storage medium | |
CN111523034B (en) | Application processing method, device, equipment and medium | |
CN114707068A (en) | Method, device, equipment and medium for recommending intelligence base knowledge | |
CN115129863A (en) | Intention recognition method, device, equipment, storage medium and computer program product | |
CN113312622A (en) | Method and device for detecting URL (Uniform resource locator) | |
CN113704400A (en) | False news identification method, device, equipment and chip |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20210625 |