CN110264311B - Business promotion information accurate recommendation method and system based on deep learning - Google Patents

Business promotion information accurate recommendation method and system based on deep learning Download PDF

Info

Publication number
CN110264311B
CN110264311B CN201910461767.7A CN201910461767A
Authority
CN
China
Prior art keywords
data set
word
neural network
promotion information
lstm neural
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910461767.7A
Other languages
Chinese (zh)
Other versions
CN110264311A (en)
Inventor
苏俊健
王东
麦志领
何佳奋
纪淇纯
叶新华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Foshan University
Original Assignee
Foshan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Foshan University filed Critical Foshan University
Priority to CN201910461767.7A priority Critical patent/CN110264311B/en
Publication of CN110264311A publication Critical patent/CN110264311A/en
Application granted granted Critical
Publication of CN110264311B publication Critical patent/CN110264311B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9536 Search customisation based on social or collaborative filtering
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/06 Buying, selling or leasing transactions
    • G06Q30/0601 Electronic shopping [e-shopping]
    • G06Q30/0631 Item recommendations
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Data Mining & Analysis (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a method and a system for accurately recommending business promotion information based on deep learning. Together they form a complete, efficient recommendation system oriented toward regional industries. In each region, many enterprises are concentrated in the same area and offline client development is close to saturation, while mining potential customers offline requires considerable manpower and material cost and has a low success rate. Such enterprises urgently need a mature system for guidance, which saves the cost of developing customers.

Description

Business promotion information accurate recommendation method and system based on deep learning
Technical Field
The disclosure relates to the technical field of machine learning recommendation algorithms and deep learning, in particular to a method and a system for accurately recommending business promotion information based on deep learning.
Background
Traditional recommendation systems are generally used for recommending commodities to customers, recommending different commodities for different customers. Traditional recommendation algorithms typically include content-based recommendation (CB), collaborative filtering (CF), hybrid recommendation methods, and the like. The present disclosure abandons the traditional recommendation mode, achieves recommendation between upstream and downstream enterprises, and uses a deep learning-based recommendation algorithm to model the complex relationships between upstream and downstream enterprises.
Conventional recommendation systems currently have the following two problems:
1) Domestic research on recommendation algorithms at the current stage mainly focuses on accurate recommendation of commodities, with little research on customer recommendation. Domestic recommendation systems generally adopt conventional machine learning algorithms, but for increasingly complex enterprise relationships and diversified data relationships, the learning efficiency of such algorithms can no longer meet the requirements.
2) Departing from the traditional data mining framework, data need to be automatically captured and screened from web pages in order to reduce the cost of collecting data and to resolve the dilemma of lacking data.
Disclosure of Invention
In order to solve the above problems, the invention provides a deep learning-based business promotion information accurate recommendation method and system. An LSTM neural network is trained on a training sample data set and its accuracy is tested on a test sample data set; the trained LSTM neural network is used as the recommendation system, yielding a fully functional classifier for that system. Based on deep learning and network technology, the cost an enterprise incurs in developing clients is reduced, the time required to develop clients is shortened, and the accuracy of finding clients is improved.
In order to achieve the above object, according to an aspect of the present disclosure, there is provided a deep learning-based business promotion information accurate recommendation method, including the steps of:
step 1, collecting commercial promotion information data;
step 2, preprocessing and cleaning the collected commercial promotion information data to obtain a commercial promotion information data set;
step 3, performing dimensionality reduction and feature selection on the commercial promotion information data set;
step 4, dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set;
step 5, obtaining word vectors for training from the training sample data set and the test sample data set through a word2vec model;
step 6, training an LSTM (Long Short-Term Memory) neural network on the training sample data set and testing its accuracy on the test sample data set to obtain the trained LSTM neural network as a recommendation system.
Further, in step 1, the methods for collecting business promotion information data include, but are not limited to: collecting data sets from open-source data set websites such as Kaggle as business promotion information data; capturing data sets from Taobao-style shop web pages or same-city transaction information networks with a crawler, using web crawler technology after secondary development, to obtain business promotion information data; and acquiring the required information from retained webpage backups in txt format as business promotion information data.
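As an illustration of the crawler-based collection route, the following Python sketch fetches one listing page and stores each entry as a raw record. The URL, the CSS selector, and the output file name are assumptions for illustration only; they are not specified by this disclosure.

```python
import csv
import requests
from bs4 import BeautifulSoup

def collect_promotion_pages(listing_url, out_path="promotion_raw.csv"):
    """Fetch one listing page and store the text of each entry as a raw record."""
    resp = requests.get(listing_url, timeout=10)
    resp.raise_for_status()
    soup = BeautifulSoup(resp.text, "html.parser")
    rows = []
    for item in soup.select("div.item"):          # assumed selector for one promotion entry
        rows.append({"text": item.get_text(" ", strip=True), "source": listing_url})
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["text", "source"])
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)

# Usage with a hypothetical listing URL:
# collect_promotion_pages("https://example.com/same-city-trade?page=1")
```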
Further, in step 2, the collected business promotion information data are preprocessed and cleaned to obtain the business promotion information data set. Because the obtained business promotion information data are extremely large, mixed, and partly useless, preprocessing is required. Preprocessing covers the necessary handling before classifying or grouping the collected data: auditing the data for completeness and accuracy, screening the data, and sorting the data, that is, data cleaning, data integration, data transformation, and data reduction. Cleaning the collected business promotion information data refers to data cleaning, the final procedure for finding and correcting recognizable errors in the data files, which includes checking data consistency and handling invalid and missing values. After the business promotion information data are preprocessed and cleaned, the dirty data are converted, by mathematical statistics, data mining, or predefined cleaning rules, into data that meet the data quality requirements, namely the cleaned business promotion information data set.
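A minimal sketch of the preprocessing and cleaning step is given below, assuming the raw records were stored in a CSV file with a "text" column as in the previous sketch; the concrete auditing and screening rules shown here (dropping empty, duplicate, and very short records) are illustrative assumptions.

```python
import pandas as pd

def clean_promotion_data(path="promotion_raw.csv"):
    df = pd.read_csv(path)
    df = df.dropna(subset=["text"])                  # auditing completeness: drop empty records
    df["text"] = df["text"].str.strip()              # consistency: normalise whitespace
    df = df.drop_duplicates(subset=["text"])         # screening: remove exact duplicates
    df = df[df["text"].str.len() >= 10]              # screening: drop records too short to be useful
    return df.reset_index(drop=True)                 # the cleaned business promotion information data set
```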
Further, in step 3, the methods for performing dimensionality reduction and feature selection on the business promotion information data set include, but are not limited to, any one of Principal Component Analysis (PCA), Independent Component Analysis (ICA), Linear Discriminant Analysis (LDA), Locally Linear Embedding (LLE), Laplacian Eigenmaps, Multidimensional Scaling (MDS), and Isometric Mapping (Isomap). Feature selection uses an improved algorithm based on the individual optimal feature selection method; the features are the vectors obtained by text vectorization of information such as the location, business scope, and registration information of the companies in the business promotion information data set. The individual optimal feature selection algorithm computes the separability criterion value of each feature when used alone, sorts the features from large to small by that value, and takes the first 30 features with the largest separability criterion values as the feature combination. However, in practice the collected business promotion information data set suffers from missing information, so the degree of missing information of each individual feature must be considered when selecting features. The improved algorithm based on the individual optimal feature selection method is the following formula:
J(X) = Σ_{i=1}^{n} [N(x(i)) / M] · J(x(i))
where X = (x(1), x(2), x(3), …, x(n)), x(i) denotes the i-th feature, n is the number of features, J(X) denotes the separability criterion of the feature set, N(x(i)) denotes the number of non-missing data entries for the i-th feature, M denotes the total amount of data, and N(x(i))/M denotes the degree to which the i-th feature is present (non-missing) in the data, thereby compensating for missing information.
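The following sketch illustrates one way the missing-aware criterion could be computed, assuming the features have already been text-vectorized into a numeric matrix with NaN marking missing values and that a Fisher-style ratio is used for the per-feature separability J(x(i)); the disclosure does not fix a particular separability measure, so that choice is an assumption.

```python
import numpy as np

def select_features(X, y, top_k=30):
    """X: (M, n) feature matrix, np.nan marks missing values; y: (M,) binary labels."""
    M, n = X.shape
    scores = np.zeros(n)
    for i in range(n):
        col = X[:, i]
        present = ~np.isnan(col)
        completeness = present.sum() / M              # N(x(i)) / M, the non-missing proportion
        xi, yi = col[present], y[present]
        if (yi == 0).sum() == 0 or (yi == 1).sum() == 0:
            continue                                  # cannot judge separability without both classes
        mu0, mu1 = xi[yi == 0].mean(), xi[yi == 1].mean()
        var = xi[yi == 0].var() + xi[yi == 1].var() + 1e-12
        j_i = (mu0 - mu1) ** 2 / var                  # assumed per-feature criterion J(x(i))
        scores[i] = completeness * j_i                # weight by completeness, as in the improved formula
    order = np.argsort(scores)[::-1]
    return order[:top_k], scores                      # indices of the selected feature combination
```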
Further, in step 4, the method for dividing the business promotion information data set obtained by feature selection into a training sample data set and a test sample data set is any one of the hold-out method, the cross-validation method, and the bootstrap method.
The hold-out method directly divides the business promotion information data set into two mutually exclusive sets, one of which serves as the training sample data set while the remaining set serves as the test sample data set.
The cross-validation method divides the business promotion information data set into k mutually exclusive subsets of equal size; each subset preserves the consistency of the data distribution as far as possible, i.e. each subset is obtained by stratified sampling; each time, the union of k-1 subsets is used as the training sample data set and the remaining subset is used as the test sample data set.
The bootstrap method generates the training set by sampling the business promotion information data set with replacement: each time, one sample is selected at random from the data set and a copy of it is placed into the training sample data set, and this process is repeated a number of times. Some samples thus appear multiple times in the training sample data set, while the samples that are never selected form the test sample data set.
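The three splitting strategies can be sketched as follows, using scikit-learn for the hold-out and cross-validation cases and plain NumPy for the bootstrap case; the split ratio, fold count, and random seeds are illustrative assumptions.

```python
import numpy as np
from sklearn.model_selection import train_test_split, StratifiedKFold

def holdout_split(X, y, test_size=0.3, seed=0):
    # Hold-out: one stratified split into two mutually exclusive sets.
    return train_test_split(X, y, test_size=test_size, stratify=y, random_state=seed)

def cross_validation_splits(X, y, k=5, seed=0):
    # k-fold cross-validation with stratified (layered) sampling.
    skf = StratifiedKFold(n_splits=k, shuffle=True, random_state=seed)
    return list(skf.split(X, y))          # each element: (train_indices, test_indices)

def bootstrap_split(X, y, seed=0):
    # Bootstrap: sample with replacement; never-selected samples form the test set.
    rng = np.random.default_rng(seed)
    m = len(X)
    train_idx = rng.integers(0, m, size=m)
    test_idx = np.setdiff1d(np.arange(m), train_idx)
    return (X[train_idx], y[train_idx]), (X[test_idx], y[test_idx])
```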
Because the system needs to output a class of customers with similar attributes who have purchased a particular enterprise's service or product, and because the number of attributes in the data set may vary, the model is trained with an LSTM as the core network. LSTM is a special type of RNN (Recurrent Neural Network) that can learn long-term dependency information. The LSTM uses gates to discard useless information or increase the proportion of beneficial information, and a memory cell is added to the model to accumulate relevant earlier information, realizing the functions of forgetting and memorizing.
Further, in step 5, obtaining the word vectors for training from the training sample data set and the test sample data set through the word2vec model includes the following steps:
step 5.1, word segmentation: owing to the particularities of Chinese, the sentences in the business promotion information are segmented into words through a word segmentation library to obtain a vocabulary, where the word segmentation library includes, but is not limited to, the Jieba, IK, and mmseg segmentation libraries;
step 5.2, counting word frequency: traversing the vocabulary formed after word segmentation in step 5.1, counting the frequency of each word, and numbering the words;
step 5.3, constructing a tree structure: constructing a Huffman tree according to the occurrence probability of each word obtained in step 5.2;
step 5.4, generating the binary code of each node: converting the occurrence probability of each word into a binary code that identifies the corresponding node in the Huffman tree of step 5.3;
step 5.5, initializing the intermediate vectors of all non-leaf nodes and the word vectors in the leaf nodes: each node in the Huffman tree stores a vector of length m, but the vectors in leaf nodes and non-leaf nodes have different meanings; the word vector of each word is stored in a leaf node and serves as the input of the neural network, whereas the intermediate vectors are stored in the non-leaf nodes, correspond to the parameters of the hidden layer of the neural network, and determine the classification result together with the input;
step 5.6, training the intermediate vectors and the word vectors: using a CBOW (Continuous Bag-of-Words) model or a Skip-Gram model, the word vectors of the n-1 words surrounding word A are summed and used as the system input; the input is classified node by node according to the binary code generated for word A in step 5.4, the intermediate vectors and word vectors are trained according to the classification results, and the word vectors corresponding to the business promotion information are finally obtained. Word A denotes an individual word, and the training process mainly comprises three stages: the input layer, the mapping layer (projection), and the output layer. The input layer consists of the word vectors of the n-1 words around word A. If n is 5 and word A is denoted w(t), then the two preceding and two following words are w(t-2), w(t-1), w(t+1), and w(t+2), and correspondingly their word vectors are denoted v(w(t-2)), v(w(t-1)), v(w(t+1)), and v(w(t+2)). Going from the input layer to the mapping layer is relatively simple: those n-1 word vectors are summed.
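For orientation, the following sketch reproduces step 5 with the jieba segmenter and gensim's word2vec, whose hierarchical-softmax mode internally builds the Huffman tree of steps 5.2 to 5.5; the hyper-parameter values are assumptions, not values fixed by the disclosure.

```python
import jieba
from gensim.models import Word2Vec

def train_word_vectors(texts, dim=100, window=2):
    # Step 5.1: segment each Chinese text into a word list.
    sentences = [jieba.lcut(t) for t in texts]
    # Steps 5.2-5.6: gensim counts word frequencies, builds the Huffman tree
    # (hs=1) and trains leaf word vectors and non-leaf intermediate vectors.
    model = Word2Vec(
        sentences,
        vector_size=dim,   # length m of each node vector
        window=window,     # context words around word A (n - 1 = 2 * window)
        min_count=1,
        sg=0,              # CBOW; use sg=1 for Skip-Gram
        hs=1,              # hierarchical softmax over the Huffman tree
        negative=0,
    )
    return model

# Example: model = train_word_vectors(["佛山某公司经营电子产品批发", "..."])
# model.wv["公司"] would then return the trained word vector for that word.
```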
Further, in step 6, training the LSTM neural network on the training sample data set and testing its accuracy on the test sample data set to obtain the trained LSTM neural network as the recommendation system includes the following steps:
step 6.1, training the LSTM neural network with the word vectors: the training sample data set passes through the forget gate of the LSTM neural network, which starts the information-discarding action of the LSTM neural network; this action is realized by a sigmoid layer in the forget gate, which examines the previous output and the current word-vector input and decides whether the information learned in the previous state is retained; the LSTM neural network comprises an input gate, a forget gate, and an output gate;
step 6.2, the training sample data set, after information discarding, passes through the input gate of the LSTM neural network, which starts the information-updating action; this action is realized by a sigmoid layer in the input gate, after which a tanh layer changes each cell state of the LSTM neural network so that new knowledge is learned;
step 6.3, the training sample data set, after information updating, passes through the output gate of the LSTM neural network and a vector is output that depends on the cell state of step 6.2; first a sigmoid layer is run to obtain a vector that determines which part of the cell state is output, then the cell state is processed by a tanh layer and multiplied by the output vector of the sigmoid gate to obtain the output information of the LSTM network;
step 6.4, the test data are fed into the LSTM network obtained in step 6.3 to produce an output, the output is compared with the labels in the test data to verify the accuracy of the network, and if the accuracy meets the requirement, training is finished, yielding the trained LSTM neural network, which is used as the recommendation system.
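A minimal Keras sketch of step 6 is given below; the layer sizes, sequence length, training schedule, and the accuracy threshold used as the acceptance requirement are all assumptions for illustration.

```python
import numpy as np
from tensorflow import keras

def build_and_train(X_train, y_train, X_test, y_test, seq_len=50, dim=100):
    # X_*: arrays of shape (samples, seq_len, dim) holding word-vector sequences;
    # y_*: binary labels marking whether an enterprise is a matching customer.
    model = keras.Sequential([
        keras.layers.Input(shape=(seq_len, dim)),
        keras.layers.LSTM(64),                         # forget, input and output gates inside
        keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    model.fit(X_train, y_train, epochs=10, batch_size=32, verbose=0)
    _, acc = model.evaluate(X_test, y_test, verbose=0)  # step 6.4: accuracy on the test set
    if acc >= 0.8:                                       # assumed acceptance threshold
        return model, acc                                # trained network used as the recommender
    return None, acc
```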
The invention also provides a deep learning-based accurate commercial promotion information recommendation system, which comprises: a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor executing the computer program to run in the units of the following system:
the data collection and collection unit is used for collecting commercial promotion information data;
the data preprocessing unit is used for preprocessing and cleaning the collected commercial promotion information data to obtain a commercial promotion information data set;
the feature selection unit is used for performing dimensionality reduction and feature selection on the commercial promotion information data set;
the training sample dividing unit is used for dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set;
the vectorization unit is used for obtaining a word vector used for training through a word2vec model by a training sample data set and a test sample data set;
and the recommendation system obtaining unit is used for training the LSTM neural network through the training sample data set and testing the accuracy of the LSTM neural network through the test sample data set to obtain the trained LSTM neural network as a recommendation system.
Advantageous effects of the disclosure
The invention provides a deep learning-based accurate business promotion information recommendation method and system, with the following advantageous effects:
1) It targets a field in which no mature online service system exists for mining potential customers, while offline, social-network-based customer recommendation has essentially reached its limit;
2) Although some enterprises have developed systems for such situations, those simple systems cannot adapt to the various complex situations found in reality and have little practical effect. According to surveys, essentially no domestic company specializes in information recommendation for companies or enterprises, let alone offers a complete, efficient recommendation system oriented toward regional industries;
3) In each region, many regional enterprises are concentrated in the same area and offline client development is close to saturation; mining potential customers offline requires considerable manpower and material cost, and the success rate is not high. These enterprises urgently need a mature system for guidance, which saves the cost of developing customers;
4) The LSTM performs well in natural language processing, exhibits high accuracy and adaptive capability when applied to language translators, can adapt to the various complex conditions found in reality, and is therefore suitable as the core network of this product.
Drawings
The foregoing and other features of the present disclosure will become more apparent from the following detailed description of the embodiments, taken in conjunction with the accompanying drawings, in which like reference numerals refer to the same or similar elements. It will be apparent that the drawings described below are merely some examples of the disclosure and that other drawings may be derived from them by those skilled in the art without inventive effort. In the drawings:
FIG. 1 is a flow chart of a deep learning-based method for accurately recommending business promotion information;
FIG. 2 is a diagram of the deep learning-based accurate commercial promotion information recommendation system.
Detailed Description
The conception, specific structure and technical effects of the present disclosure will be clearly and completely described below in conjunction with the embodiments and the accompanying drawings to fully understand the objects, aspects and effects of the present disclosure. It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
FIG. 1 is a flowchart of the deep learning-based business promotion information accurate recommendation method of the present disclosure; with reference to FIG. 1, a deep learning-based business promotion information accurate recommendation method according to an embodiment of the present disclosure is described below.
The present disclosure provides a deep learning-based business promotion information accurate recommendation method, which specifically includes the following steps:
step 1, collecting commercial promotion information data;
step 2, preprocessing and cleaning the collected commercial promotion information data to obtain a commercial promotion information data set;
step 3, performing dimensionality reduction and feature selection on the commercial promotion information data set;
step 4, dividing a commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set;
step 5, obtaining word vectors for training from the training sample data set and the test sample data set through a word2vec model;
step 6, training an LSTM (Long Short-Term Memory) neural network on the training sample data set and testing its accuracy on the test sample data set to obtain the trained LSTM neural network as a recommendation system.
Further, in step 1, the methods for collecting business promotion information data include, but are not limited to: collecting data sets from open-source data set websites such as Kaggle as business promotion information data; capturing data sets from Taobao-style shop web pages or same-city transaction information networks with a crawler, using web crawler technology after secondary development, to obtain business promotion information data; and acquiring the required information from retained webpage backups in txt format as business promotion information data.
Further, in step 2, the collected business promotion information data are preprocessed and cleaned to obtain the business promotion information data set. Because the obtained business promotion information data are extremely large, mixed, and partly useless, preprocessing is required. Preprocessing covers the necessary handling before classifying or grouping the collected data: auditing the data for completeness and accuracy, screening the data, and sorting the data, that is, data cleaning, data integration, data transformation, and data reduction. Cleaning the collected business promotion information data refers to data cleaning, the final procedure for finding and correcting recognizable errors in the data files, which includes checking data consistency and handling invalid and missing values. After the business promotion information data are preprocessed and cleaned, the dirty data are converted, by mathematical statistics, data mining, or predefined cleaning rules, into data that meet the data quality requirements, namely the cleaned business promotion information data set.
Further, in step 3, the methods for performing dimensionality reduction and feature selection on the business promotion information data set include, but are not limited to, any one of Principal Component Analysis (PCA), Independent Component Analysis (ICA), Linear Discriminant Analysis (LDA), Locally Linear Embedding (LLE), Laplacian Eigenmaps, Multidimensional Scaling (MDS), and Isometric Mapping (Isomap). Feature selection uses an improved algorithm based on the individual optimal feature selection method; the features are the vectors obtained by text vectorization of information such as the location, business scope, and registration information of the companies in the business promotion information data set. The individual optimal feature selection algorithm computes the separability criterion value of each feature when used alone, sorts the values from large to small, and takes the first 30 features with the largest separability criterion values as the feature combination. However, in practice the collected business promotion information data set suffers from missing information, so the degree of missing information of each individual feature must be considered when selecting features. The improved algorithm based on the individual optimal feature selection method is the following formula:
J(X) = Σ_{i=1}^{n} [N(x(i)) / M] · J(x(i))
where X = (x(1), x(2), x(3), …, x(n)), x(i) denotes the i-th feature, n is the number of features, J(X) denotes the separability criterion of the feature set, N(x(i)) denotes the number of non-missing data entries for the i-th feature, M denotes the total amount of data, and N(x(i))/M denotes the degree to which the i-th feature is present (non-missing) in the data.
Further, in step 4, the method for dividing the business promotion information data set obtained by feature selection into a training sample data set and a test sample data set is any one of the hold-out method, the cross-validation method, and the bootstrap method.
The hold-out method directly divides the business promotion information data set into two mutually exclusive sets, one of which serves as the training sample data set while the remaining set serves as the test sample data set.
The cross-validation method divides the business promotion information data set into k mutually exclusive subsets of equal size; each subset preserves the consistency of the data distribution as far as possible, i.e. each subset is obtained by stratified sampling; each time, the union of k-1 subsets is used as the training sample data set and the remaining subset is used as the test sample data set.
The bootstrap method generates the training set by sampling the business promotion information data set with replacement: each time, one sample is selected at random from the data set and a copy of it is placed into the training sample data set, and this process is repeated a number of times. Some samples thus appear multiple times in the training sample data set, while the samples that are never selected form the test sample data set.
Because the system needs to output a class of customers with similar attributes who have purchased a particular enterprise's service or product, and because the number of attributes in the data set may vary, the model is trained with the LSTM network as the core network. LSTM is a special type of RNN that can learn long-term dependency information. The LSTM uses gates to discard useless information or increase the proportion of beneficial information, and memory cells are added to the model to accumulate relevant earlier information, realizing the functions of forgetting and memorizing.
Further, in step 5, obtaining the word vectors for training from the training sample data set and the test sample data set through the word2vec model includes the following steps:
step 5.1, word segmentation: owing to the particularities of Chinese, the sentences in the business promotion information are segmented into words through a word segmentation library to obtain a vocabulary, where the word segmentation library includes, but is not limited to, the Jieba, IK, and mmseg segmentation libraries;
step 5.2, counting word frequency: traversing the vocabulary formed after word segmentation in step 5.1, counting the frequency of each word, and numbering the words;
step 5.3, constructing a tree structure: constructing a Huffman tree according to the occurrence probability of each word obtained in step 5.2;
step 5.4, generating the binary code of each node: converting the occurrence probability of each word into a binary code that identifies the corresponding node in the Huffman tree of step 5.3;
step 5.5, initializing the intermediate vectors of all non-leaf nodes and the word vectors in the leaf nodes: each node in the Huffman tree stores a vector of length m, but the vectors in leaf nodes and non-leaf nodes have different meanings; the word vector of each word is stored in a leaf node and serves as the input of the neural network, whereas the intermediate vectors are stored in the non-leaf nodes, correspond to the parameters of the hidden layer of the neural network, and determine the classification result together with the input;
step 5.6, training the intermediate vectors and the word vectors: using a CBOW (Continuous Bag-of-Words) model or a Skip-Gram model, the word vectors of the n-1 words surrounding word A are summed and used as the system input; the input is classified node by node according to the binary code generated for word A in step 5.4, the intermediate vectors and word vectors are trained according to the classification results, and the word vectors corresponding to the business promotion information are finally obtained. Word A denotes an individual word, and the training process mainly comprises three stages: the input layer, the mapping layer (projection), and the output layer. The input layer consists of the word vectors of the n-1 words around word A. If n is 5 and word A is denoted w(t), then the two preceding and two following words are w(t-2), w(t-1), w(t+1), and w(t+2), and correspondingly their word vectors are denoted v(w(t-2)), v(w(t-1)), v(w(t+1)), and v(w(t+2)). Going from the input layer to the mapping layer is relatively simple: those n-1 word vectors are summed.
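To complement the gensim sketch given earlier, the following fragment shows how the Huffman tree and the per-word binary codes of steps 5.2 to 5.4 can be built directly from word frequencies; it is an illustrative sketch rather than the disclosure's reference implementation.

```python
import heapq
import itertools
from collections import Counter

def huffman_codes(words):
    freq = Counter(words)                        # step 5.2: count word frequencies
    tie = itertools.count()                      # tie-breaker so heap tuples always compare
    heap = [(f, next(tie), w, None, None) for w, f in freq.items()]
    heapq.heapify(heap)
    while len(heap) > 1:                         # step 5.3: repeatedly merge the two rarest nodes
        a = heapq.heappop(heap)
        b = heapq.heappop(heap)
        heapq.heappush(heap, (a[0] + b[0], next(tie), None, a, b))
    codes = {}

    def walk(node, code):                        # step 5.4: the root-to-leaf path is the binary code
        _, _, word, left, right = node
        if word is not None:
            codes[word] = code or "0"
            return
        walk(left, code + "0")
        walk(right, code + "1")

    walk(heap[0], "")
    return codes

# huffman_codes(["公司", "公司", "批发", "电子", "公司"])
# -> e.g. {"批发": "00", "电子": "01", "公司": "1"} (exact codes depend on tie-breaking)
```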
Further, in step 6, training the LSTM neural network on the training sample data set and testing its accuracy on the test sample data set to obtain the trained LSTM neural network as the recommendation system includes the following steps:
step 6.1, training the LSTM neural network with the word vectors: the training sample data set passes through the forget gate of the LSTM neural network, which starts the information-discarding action of the LSTM neural network; this action is realized by a sigmoid layer in the forget gate, which examines the previous output and the current word-vector input and decides whether the information learned in the previous state is retained; the LSTM neural network comprises an input gate, a forget gate, and an output gate;
step 6.2, the training sample data set, after information discarding, passes through the input gate of the LSTM neural network, which starts the information-updating action; this action is realized by a sigmoid layer in the input gate, after which a tanh layer changes each cell state of the LSTM neural network so that new knowledge is learned;
step 6.3, the training sample data set, after information updating, passes through the output gate of the LSTM neural network and a vector is output that depends on the cell state of step 6.2; first a sigmoid layer is run to obtain a vector that determines which part of the cell state is output, then the cell state is processed by a tanh layer and multiplied by the output vector of the sigmoid gate to obtain the output information of the LSTM network;
step 6.4, the test data are fed into the LSTM network obtained in step 6.3 to produce an output, the output is compared with the labels in the test data to verify the accuracy of the network, and if the accuracy meets the requirement, training is finished, yielding the trained LSTM neural network, which is used as the recommendation system.
By means of this recommendation system, the method can accurately recommend clients to an enterprise, helping the enterprise develop a large number of accurate potential clients more efficiently. Using the "Internet + finance" approach and building on deep learning and network technology, the cost an enterprise incurs in developing clients is reduced, the time required to develop clients is shortened, and the accuracy of finding clients is improved.
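A hypothetical usage sketch of the trained network acting as the recommendation system is shown below: candidate enterprises' promotion texts are converted to word-vector sequences and ranked by the LSTM's output score. The function names, the fixed sequence length, and the top-k cutoff are assumptions for illustration.

```python
import numpy as np

def recommend(model, w2v, candidates, seq_len=50, dim=100, top_k=10):
    # candidates: list of (enterprise_name, segmented_word_list) pairs.
    batch = np.zeros((len(candidates), seq_len, dim))
    for row, (_, words) in enumerate(candidates):
        vecs = [w2v.wv[w] for w in words[:seq_len] if w in w2v.wv]
        if vecs:
            batch[row, :len(vecs)] = vecs            # pad/truncate to the fixed sequence length
    scores = model.predict(batch, verbose=0).ravel() # LSTM output: match probability per enterprise
    ranked = sorted(zip([name for name, _ in candidates], scores),
                    key=lambda p: p[1], reverse=True)
    return ranked[:top_k]                            # most promising potential customers first
```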
The accuracy (0.12) of the data mined by the deep learning-based accurate business promotion information recommendation system is far higher than the accuracy (0.067) of the traditional content-based method, the retention rate is also clearly improved, and popular business promotion information can be found more easily. Applying the LSTM deep neural network to business promotion information mining forgets irrelevant noise information and deepens the memory of strongly relevant information, giving the algorithm a preferential-selection characteristic that weighs advantages against disadvantages. Adopting the Huffman tree for the related business promotion information word segmentation greatly improves the segmentation speed, with a computation time of roughly 1/20 of that of a general exhaustive representation.
An embodiment of the present disclosure provides a deep learning-based accurate business promotion information recommendation system, shown in FIG. 2. The system of this embodiment comprises a processor, a memory, and a computer program stored in the memory and capable of running on the processor, and the processor executes the computer program to realize the steps of the deep learning-based accurate business promotion information recommendation method embodiment described above.
The system comprises: a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor executing the computer program to run in the units of the system:
the data collection and collection unit is used for collecting commercial promotion information data;
the data preprocessing unit is used for preprocessing and cleaning the collected commercial promotion information data to obtain a commercial promotion information data set;
the feature selection unit is used for performing dimensionality reduction and feature selection on the commercial promotion information data set;
the training sample dividing unit is used for dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set;
the vectorization unit is used for obtaining a word vector used for training through a word2vec model by a training sample data set and a test sample data set;
and the recommendation system obtaining unit is used for training the LSTM neural network through the training sample data set and testing the accuracy of the LSTM neural network through the test sample data set to obtain the trained LSTM neural network as a recommendation system.
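Purely as an illustration of how the units listed above could be composed in software, the following sketch wires them into a single pipeline object; the unit functions are placeholders (for example, the earlier sketches) and the class itself is an assumption, not the disclosure's reference implementation.

```python
class PromotionRecommendationSystem:
    """Wires the units together; each unit is a callable supplied by the caller."""

    def __init__(self, collect, preprocess, select, split, vectorize, train):
        self.collect = collect        # data collection unit
        self.preprocess = preprocess  # data preprocessing unit
        self.select = select          # feature selection unit
        self.split = split            # training sample dividing unit
        self.vectorize = vectorize    # vectorization unit
        self.train = train            # recommendation system obtaining unit
        self.model = None

    def build(self, source):
        raw = self.collect(source)
        dataset = self.preprocess(raw)
        features, labels = self.select(dataset)
        (x_tr, y_tr), (x_te, y_te) = self.split(features, labels)
        v_tr, v_te = self.vectorize(x_tr), self.vectorize(x_te)
        self.model = self.train(v_tr, y_tr, v_te, y_te)
        return self.model
```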
The deep learning-based accurate business promotion information recommendation system can run on computing equipment such as desktop computers, notebooks, palmtop computers, and cloud servers. The system may run on hardware including, but not limited to, a processor and a memory. Those skilled in the art will understand that this example is merely illustrative of the deep learning-based accurate business promotion information recommendation system and does not limit it; the system may include more or fewer components than shown, combine certain components, or use different components; for example, it may further include input/output devices, network access devices, a bus, and the like.
The processor may be a Central Processing Unit (CPU), another general-purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, etc. The general-purpose processor may be a microprocessor or any conventional processor; the processor is the control center of the deep learning-based accurate recommendation system for business promotion information, and various interfaces and lines connect all parts of the whole system on which the deep learning-based accurate recommendation system for business promotion information can run.
The memory can be used for storing the computer program and/or the modules, and the processor realizes the various functions of the deep learning-based accurate business promotion information recommendation system by running or executing the computer program and/or the modules stored in the memory and by calling the data stored in the memory. The memory may mainly include a program storage area and a data storage area, wherein the program storage area may store the operating system and the application programs required by at least one function (such as a sound playing function, an image playing function, etc.), and the data storage area may store data created according to the use of the device (such as audio data, a phone book, etc.). In addition, the memory may include high-speed random access memory, and may also include non-volatile memory, such as a hard disk, an internal memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, a flash memory card (Flash Card), at least one magnetic disk storage device, a flash memory device, or another non-volatile solid-state storage device.
While the present disclosure has been described in considerable detail and with particular reference to a few illustrative embodiments, it is not intended to be limited to any such details or embodiments or to any particular embodiment; rather, it is to be construed, with reference to the appended claims, as providing the broadest reasonable interpretation of those claims in view of the prior art and thereby effectively covering the intended scope of the disclosure. Furthermore, the foregoing describes the disclosure in terms of embodiments foreseen by the inventor for which an enabling description was available, notwithstanding that insubstantial modifications of the disclosure not presently foreseen may nonetheless represent equivalents thereto.

Claims (3)

1. A deep learning-based accurate business promotion information recommendation method is characterized by comprising the following steps:
step 1, collecting commercial promotion information data;
step 2, preprocessing and cleaning the collected commercial promotion information data to obtain a commercial promotion information data set;
step 3, performing dimensionality reduction and feature selection on the commercial promotion information data set;
step 4, dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set;
step 5, obtaining a word vector for training by a training sample data set and a test sample data set through a word2vec model;
in step 5, the method for obtaining the word vector for training by the training sample data set and the test sample data set through the word2vec model comprises the following steps:
step 5.1, word segmentation: owing to the particularities of Chinese, the sentences in the commercial promotion information are segmented into words through a word segmentation library to obtain a vocabulary, wherein the word segmentation library comprises a Jieba segmentation library, an IK segmentation library, and an mmseg segmentation library;
step 5.2, counting word frequency: traversing the word bank formed after word segmentation in the step 5.1, counting the frequency of the appeared words and numbering the words;
step 5.3, constructing a tree structure: constructing a Huffman tree according to the occurrence probability of each word obtained in step 5.2;
step 5.4, generating a binary code where the node is: converting the occurrence probability of each word into binary codes to represent each node in the Huffman tree in the step 5.3;
step 5.5, initializing the intermediate vectors of all non-leaf nodes and the word vectors in the leaf nodes: each node in the Huffman tree stores a vector of length m, but the vectors in leaf nodes and non-leaf nodes have different meanings; the word vector of each word is stored in a leaf node and serves as the input of the neural network, whereas the intermediate vectors are stored in the non-leaf nodes, correspond to the parameters of the hidden layer of the neural network, and determine the classification result together with the input;
step 5.6, training the intermediate vector and the word vector: training intermediate vectors and word vectors by using a CBOW model or a Skip-Gram model, and finally obtaining corresponding word vectors of the commercial promotion information;
step 6, training the LSTM neural network through the training sample data set and testing the accuracy of the LSTM neural network through the test sample data set to obtain the trained LSTM neural network as a recommendation system;
in step 3, the method for performing dimensionality reduction and feature selection on the commercial promotion information data set comprises any one dimensionality reduction method of principal component analysis, independent component analysis, linear discriminant analysis, locally linear embedding, Laplacian eigenmaps, multidimensional scaling, and isometric mapping; the feature selection uses an improved algorithm based on the individual optimal feature selection method, and the features are the vectors obtained by text vectorization of the locations, business scopes, and registration information of the companies in the commercial promotion information data set; the individual optimal feature selection algorithm calculates the separability criterion value of each feature when the feature is used alone, the separability criterion values are then sorted from large to small, and the first 30 features with the largest separability criterion values are taken as the feature combination; the improved algorithm based on the individual optimal feature selection method is as follows:
J(X) = Σ_{i=1}^{n} [N(x(i)) / M] · J(x(i))
wherein X = (x(1), x(2), x(3), …, x(n)), x(i) represents the i-th feature, n is the number of features, J(X) represents the separability criterion of the feature set, N(x(i)) represents the number of non-missing data entries for the i-th feature, M represents the total amount of data, and N(x(i))/M represents the degree to which the i-th feature is present (non-missing) in the data;
in step 4, the method for dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set comprises: any one of a hold-out method, a cross-validation method, and a bootstrap method;
in step 6, the method for training the LSTM neural network through the training sample data set and testing the accuracy of the LSTM neural network through the testing sample data set to obtain the trained LSTM neural network as the recommendation system comprises the following steps:
step 6.1, training the LSTM neural network by using the word vectors: the training sample data set passes through a forgetting gate of an LSTM neural network, information discarding action of the LSTM neural network is started, the information discarding action is realized by a sigmoid layer in the forgetting gate, previous output of the sigmoid layer and input of a current word vector are checked, whether information learned in a previous state is reserved is determined, and the LSTM neural network comprises an input gate, a forgetting gate and an output gate;
step 6.2, the training sample data set after information discarding passes through an input gate of the LSTM neural network, information updating action of the LSTM neural network is started, the information updating action is realized by a sigmoid layer in the input gate, and then each cell state of the LSTM neural network is changed by a tanh layer, and new knowledge is learned;
step 6.3, the training sample data set after the information updating passes through an output gate of the LSTM neural network, and a vector is output, wherein the vector depends on the cell state in the step 6.2; firstly, operating a sigmoid layer to obtain a vector to determine an output part of a cell state, processing the cell state through a tanh layer, and multiplying the cell state by the output of a sigmoid gate to obtain output information of an LSTM network;
and 6.4, inputting the LSTM network obtained in the step 6.3 by using the test data to obtain an output result, comparing the output result with the label in the test data to verify the accuracy of the network, finishing training to obtain a trained LSTM neural network if the accuracy meets the requirement, and taking the trained LSTM neural network as a recommendation system.
2. The method for accurately recommending business promotion information based on deep learning according to claim 1, wherein in step 2, the method for preprocessing and cleaning the collected business promotion information data to obtain the business promotion information data set comprises preprocessing, wherein the preprocessing is the processing of auditing, screening and sequencing before classifying or grouping the collected business promotion information data, namely data auditing and accuracy, data screening and data sequencing, namely data cleaning, data integration, data transformation and data reduction; the step of cleaning the collected commercial promotion information data refers to data cleaning, and a last program for finding and correcting recognizable errors in the data file comprises the steps of checking data consistency and processing invalid values and missing values; the business promotion information data is preprocessed and cleaned, and the dirty data is converted into data meeting the data quality requirement by using mathematical statistics, data mining or predefined cleaning rules, namely a cleaned business promotion information data set.
3. A system for accurately recommending business promotion information based on deep learning, the system comprising: a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor executing the computer program to run in the units of the following system:
the data collection and collection unit is used for collecting commercial promotion information data;
the data preprocessing unit is used for preprocessing and cleaning the collected commercial promotion information data to obtain a commercial promotion information data set;
the feature selection unit is used for performing dimensionality reduction and feature selection on the commercial promotion information data set;
the training sample dividing unit is used for dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set;
the vectorization unit is used for obtaining a word vector used for training through a word2vec model by a training sample data set and a test sample data set;
the method for obtaining the word vector for training by the training sample data set and the test sample data set through the word2vec model comprises the following steps:
step 5.1, word segmentation: owing to the particularities of Chinese, the sentences in the commercial promotion information are segmented into words through a word segmentation library to obtain a vocabulary, wherein the word segmentation library comprises a Jieba segmentation library, an IK segmentation library, and an mmseg segmentation library;
step 5.2, word frequency statistics: traversing the word bank formed after word segmentation in the step 5.1, counting the frequency of the appeared words and numbering the words;
step 5.3, constructing a tree structure: constructing a Huffman tree according to the occurrence probability of each word obtained in step 5.2;
step 5.4, generating a binary code where the node is: converting the occurrence probability of each word into binary codes to represent each node in the Huffman tree in the step 5.3;
step 5.5, initializing the intermediate vectors of all non-leaf nodes and the word vectors in the leaf nodes: each node in the Huffman tree stores a vector of length m, but the vectors in leaf nodes and non-leaf nodes have different meanings; the word vector of each word is stored in a leaf node and serves as the input of the neural network, whereas the intermediate vectors are stored in the non-leaf nodes, correspond to the parameters of the hidden layer of the neural network, and determine the classification result together with the input;
step 5.6, training the intermediate vector and the word vector: training intermediate vectors and word vectors by using a CBOW model or a Skip-Gram model, and finally obtaining corresponding word vectors of the commercial promotion information;
the recommendation system obtaining unit is used for training the LSTM neural network through the training sample data set and testing the accuracy of the LSTM neural network through the test sample data set to obtain the trained LSTM neural network as a recommendation system;
the method for performing dimensionality reduction and feature selection on the commercial promotion information data set comprises any dimensionality reduction method of principal component analysis, independent component analysis, linear discriminant analysis, locally linear embedding, Laplacian eigenmaps, multidimensional scaling, and isometric mapping; the feature selection uses an improved algorithm based on the individual optimal feature selection method, and the features are the vectors obtained by text vectorization of information such as the locations, business scopes, and registration information of the companies in the commercial promotion information data set; the individual optimal feature selection algorithm calculates the separability criterion value of each feature when the feature is used alone, the separability criterion values are then sorted from large to small, and the first 30 features with the largest separability criterion values are taken as the feature combination; however, in practice the collected commercial promotion information data set suffers from missing information, so the degree of missing information of each individual feature needs to be considered when selecting features, and the improved algorithm based on the individual optimal feature selection method is the following formula:
J(X) = Σ_{i=1}^{n} [N(x(i)) / M] · J(x(i))
wherein X = (x(1), x(2), x(3), …, x(n)), x(i) represents the i-th feature, n is the number of features, J(X) represents the separability criterion of the feature set, N(x(i)) represents the number of non-missing data entries for the i-th feature, M represents the total amount of data, and N(x(i))/M represents the degree to which the i-th feature is present (non-missing) in the data;
the method for dividing the commercial promotion information data set obtained by feature selection into a training sample data set and a test sample data set comprises: any one of a hold-out method, a cross-validation method, and a bootstrap method;
the method for training the LSTM neural network through the training sample data set and testing the accuracy of the LSTM neural network through the testing sample data set to obtain the trained LSTM neural network as the recommendation system comprises the following steps:
step 6.1, training the LSTM neural network with the word vectors: the training sample data set first passes through the forget gate of the LSTM neural network, which starts the information-discarding action of the network; this action is realized by a sigmoid layer in the forget gate, which examines the previous output and the current word-vector input and decides whether the information learned in the previous state is retained; the LSTM neural network comprises an input gate, a forget gate and an output gate;
step 6.2, the training sample data set, after information discarding, passes through the input gate of the LSTM neural network, which starts the information-updating action of the network; this action is realized by a sigmoid layer in the input gate, after which a tanh layer changes each cell state of the LSTM neural network so that new knowledge is learned;
step 6.3, the training sample data set, after information updating, passes through the output gate of the LSTM neural network and a vector is output which depends on the cell state of step 6.2; a sigmoid layer is first run to obtain a vector that determines which part of the cell state is output, the cell state is then processed by a tanh layer, and the result is multiplied by the output of the sigmoid gate to obtain the output information of the LSTM network;
step 6.4, feeding the test data into the LSTM network obtained in step 6.3 to obtain an output result, and comparing the output result with the labels in the test data to verify the accuracy of the network; if the accuracy meets the requirement, training is finished, the trained LSTM neural network is obtained, and the trained LSTM neural network is used as the recommendation system (an end-to-end sketch of steps 6.1 to 6.4 is given after this list).
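A minimal end-to-end sketch of steps 6.1 to 6.4 is given below using Keras; the layer sizes, sequence length, number of epochs and binary relevance labels are illustrative assumptions, and the random arrays stand in for the real word-vector sequences of the training and test sample data sets.

# Sketch (assumption): an LSTM classifier trained on sequences of word vectors and
# evaluated on the held-out test set; the forget, input and output gates of steps
# 6.1-6.3 are internal to the LSTM layer.
import numpy as np
import tensorflow as tf

seq_len, vec_dim = 50, 100                       # words per record, word-vector length
X_train = np.random.rand(800, seq_len, vec_dim)  # hypothetical training word-vector sequences
y_train = np.random.randint(0, 2, size=800)      # hypothetical training labels
X_test = np.random.rand(200, seq_len, vec_dim)   # hypothetical test sequences
y_test = np.random.randint(0, 2, size=200)       # hypothetical test labels

model = tf.keras.Sequential([
    tf.keras.layers.LSTM(64, input_shape=(seq_len, vec_dim)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X_train, y_train, epochs=5, batch_size=32, verbose=0)

loss, accuracy = model.evaluate(X_test, y_test, verbose=0)  # step 6.4: test-set accuracy check
print(f"test accuracy: {accuracy:.3f}")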
CN201910461767.7A 2019-05-30 2019-05-30 Business promotion information accurate recommendation method and system based on deep learning Active CN110264311B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910461767.7A CN110264311B (en) 2019-05-30 2019-05-30 Business promotion information accurate recommendation method and system based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910461767.7A CN110264311B (en) 2019-05-30 2019-05-30 Business promotion information accurate recommendation method and system based on deep learning

Publications (2)

Publication Number Publication Date
CN110264311A CN110264311A (en) 2019-09-20
CN110264311B true CN110264311B (en) 2023-04-18

Family

ID=67915965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910461767.7A Active CN110264311B (en) 2019-05-30 2019-05-30 Business promotion information accurate recommendation method and system based on deep learning

Country Status (1)

Country Link
CN (1) CN110264311B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111860981B (en) * 2020-07-03 2024-01-19 航天信息(山东)科技有限公司 Enterprise national industry category prediction method and system based on LSTM deep learning
CN112465389A (en) * 2020-12-12 2021-03-09 广东电力信息科技有限公司 Word frequency-based similar provider recommendation method and device
CN113806626A (en) * 2021-01-25 2021-12-17 北京沃东天骏信息技术有限公司 Method and system for sending push message
CN118379116A (en) * 2024-06-24 2024-07-23 南京信息工程大学 Deep learning-based Internet data deep mining method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105869024A (en) * 2016-04-20 2016-08-17 北京小米移动软件有限公司 Commodity recommending method and device
CN107291693A (en) * 2017-06-15 2017-10-24 广州赫炎大数据科技有限公司 A kind of semantic computation method for improving term vector model
CN108427670A (en) * 2018-04-08 2018-08-21 重庆邮电大学 A kind of sentiment analysis method based on context word vector sum deep learning
CN109635204A (en) * 2018-12-21 2019-04-16 上海交通大学 Online recommender system based on collaborative filtering and length memory network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9626682B2 (en) * 2012-02-08 2017-04-18 Ebay Inc. Systems and methods for reseller discovery and analysis
CN107153642A (en) * 2017-05-16 2017-09-12 华北电力大学 A kind of analysis method based on neural network recognization text comments Sentiment orientation
CN108256052B (en) * 2018-01-15 2023-07-11 成都达拓智通科技有限公司 Tri-tracking-based potential customer identification method for automobile industry

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105869024A (en) * 2016-04-20 2016-08-17 北京小米移动软件有限公司 Commodity recommending method and device
CN107291693A (en) * 2017-06-15 2017-10-24 广州赫炎大数据科技有限公司 A kind of semantic computation method for improving term vector model
CN108427670A (en) * 2018-04-08 2018-08-21 重庆邮电大学 A kind of sentiment analysis method based on context word vector sum deep learning
CN109635204A (en) * 2018-12-21 2019-04-16 上海交通大学 Online recommender system based on collaborative filtering and length memory network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
数据预处理 (Data Preprocessing); Li Aiguo, Ku Xiangyang; 《数据挖掘原理、算法及应用》 (Principles, Algorithms and Applications of Data Mining); 2012-01-31; pp. 25-27 *

Also Published As

Publication number Publication date
CN110264311A (en) 2019-09-20

Similar Documents

Publication Publication Date Title
CN112785397B (en) Product recommendation method, device and storage medium
CN110264311B (en) Business promotion information accurate recommendation method and system based on deep learning
US11741361B2 (en) Machine learning-based network model building method and apparatus
US20230102337A1 (en) Method and apparatus for training recommendation model, computer device, and storage medium
US11373117B1 (en) Artificial intelligence service for scalable classification using features of unlabeled data and class descriptors
CN106447066A (en) Big data feature extraction method and device
CN109582956A (en) text representation method and device applied to sentence embedding
CN108021651B (en) Network public opinion risk assessment method and device
Manovich Can we think without categories?
CN110795568A (en) Risk assessment method and device based on user information knowledge graph and electronic equipment
CN109690581B (en) User guidance system and method
CN108549909B (en) Object classification method and object classification system based on crowdsourcing
US20230004979A1 (en) Abnormal behavior detection method and apparatus, electronic device, and computer-readable storage medium
CN112734569A (en) Stock risk prediction method and system based on user portrait and knowledge graph
CN112418320A (en) Enterprise association relation identification method and device and storage medium
CN114416998A (en) Text label identification method and device, electronic equipment and storage medium
CN113159213A (en) Service distribution method, device and equipment
CN113327132A (en) Multimedia recommendation method, device, equipment and storage medium
Miao et al. Informative core identification in complex networks
CN116739649A (en) User response potential evaluation method and device
CN107315807B (en) Talent recommendation method and device
CN111767710B (en) Indonesia emotion classification method, device, equipment and medium
CN115168609A (en) Text matching method and device, computer equipment and storage medium
Batiuk et al. Intelligent System for Socialization of Individual's with Shared Interests based on NLP, Machine Learning and SEO Technologies.
Zhou et al. Spectral clustering with distinction and consensus learning on multiple views data

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CP03 Change of name, title or address

Address after: No.33 Guangyun Road, Shishan town, Nanhai District, Foshan City, Guangdong Province

Patentee after: Foshan University

Country or region after: China

Address before: No.33 Guangyun Road, Shishan town, Nanhai District, Foshan City, Guangdong Province

Patentee before: FOSHAN University

Country or region before: China