CN110599234A - Product sales prediction method - Google Patents
Product sales prediction method Download PDFInfo
- Publication number
- CN110599234A CN110599234A CN201910745299.6A CN201910745299A CN110599234A CN 110599234 A CN110599234 A CN 110599234A CN 201910745299 A CN201910745299 A CN 201910745299A CN 110599234 A CN110599234 A CN 110599234A
- Authority
- CN
- China
- Prior art keywords
- model
- data
- wide
- features
- sales
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 48
- 230000006870 function Effects 0.000 claims description 24
- 238000012549 training Methods 0.000 claims description 16
- 230000002159 abnormal effect Effects 0.000 claims description 10
- 238000012795 verification Methods 0.000 claims description 10
- 230000008569 process Effects 0.000 claims description 9
- 238000012417 linear regression Methods 0.000 claims description 8
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 238000004364 calculation method Methods 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 5
- 238000013277 forecasting method Methods 0.000 claims description 4
- 238000013527 convolutional neural network Methods 0.000 claims description 3
- 238000009499 grossing Methods 0.000 claims description 3
- 238000013473 artificial intelligence Methods 0.000 abstract description 2
- 230000008859 change Effects 0.000 description 5
- 238000011161 development Methods 0.000 description 4
- 230000004913 activation Effects 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000011478 gradient descent method Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000013178 mathematical model Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- YHXISWVBGDMDLQ-UHFFFAOYSA-N moclobemide Chemical compound C1=CC(Cl)=CC=C1C(=O)NCCN1CCOCC1 YHXISWVBGDMDLQ-UHFFFAOYSA-N 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 238000007500 overflow downdraw method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0201—Market modelling; Market analysis; Collecting market data
- G06Q30/0202—Market predictions or forecasting for commercial activities
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Data Mining & Analysis (AREA)
- Finance (AREA)
- Development Economics (AREA)
- Accounting & Taxation (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Biophysics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Entrepreneurship & Innovation (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Marketing (AREA)
- Game Theory and Decision Science (AREA)
- Economics (AREA)
- General Business, Economics & Management (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention relates to the field of artificial intelligence, in particular to a product sales prediction method. Compared with the traditional sales prediction method, the method is carried out on the basis of the wide and deep model, and can fully excavate the correlation among products by utilizing the strong space-time sequence prediction capability of a long-time convolution memory network and obtain the structure information of the products. The method well combines the influence characteristics of external factors on the consumption rule of consumers by fusing external characteristics such as regional characteristics, weather characteristics and the like, and realizes sales prediction aiming at different regions.
Description
Technical Field
The invention relates to the field of artificial intelligence, in particular to a product sales prediction method.
Background
In recent years, with the rise of new retail, the progress and change of enterprises are accelerated and concentrated, and the enterprises become faster and more explosive. In the process of realizing social informatization and digitization by the internet, the retail industry is developed rapidly by depending on the development and change of science and technology, and brings more challenges. In the high-speed development and change, the industry cost is reduced, the benefit is increased, meanwhile, new challenges are brought to the new retail industry, and the multiple varieties of the multi-industry and multi-platform dimension reduction impact and the variety of the demand change of consumers bring great influences on the decision of enterprises. Therefore, whether the demands of the consumers can be predicted more accurately or not plays an important role in the new retail industry.
The traditional sales prediction method mainly comprises qualitative prediction and quantitative prediction, wherein the qualitative prediction mainly depends on personal experience of managers to make decisions, and the judgment of the future development of things on the nature and degree is taken as the main basis for predicting the future, so that the method has greater flexibility, but the qualitative prediction method has poor transportability and strong subjective limitation; the quantitative prediction is to quantitatively reveal objective rules of a certain phenomenon, divide the objective rules into an periscopic level and a macroscopic level, reveal the relation between change and time from a certain angle, discover the rules in the objective rules, mine the intrinsic information of the rules, mainly adopt a time sequence analysis method to model historical sales data, including a moving average method, ARIMA, Kalman filtering and a grey theory, and send algorithms including Support Vector Regression (SVR), tree model (XGboot) and the like through machine learning. In addition, with the increasing data volume and the development of neural network models, some time series data also start to be processed by adopting deep learning technology. The models can obtain better prediction accuracy by means of strong learning capability of machine learning. However, the existing prediction method is usually suitable for a single commodity, only one commodity can be considered in one sample, the mutual influence among the commodities is ignored, and the prediction performance cannot be improved even if complex feature engineering and model fusion are carried out on historical data.
The defects of the prior method and the invention are as follows: 1) the existing method is usually suitable for a single commodity, only one commodity can be considered in one sample, and the mutual influence among the commodities is ignored; 2) the existing method only focuses on the memory capacity of the model, and needs more artificial characteristic engineering to enhance the generalization capacity of the model and mine the potential association in the data; 3) the existing method does not well deal with the influence of external factors on the consumption rule of people, like the completely opposite influence of rainstorm weather on consumers in different areas, and the model has poor portability.
Disclosure of Invention
In order to solve the prior art:
1) the existing method is usually suitable for a single commodity, only one commodity can be considered in one sample, and the mutual influence among the commodities is ignored;
2) the existing method only focuses on the memory capacity of the model, and needs more artificial characteristic engineering to enhance the generalization capacity of the model and mine the potential association in the data;
3) the existing method does not well deal with the influence of external factors on the consumption rule of people, like the completely opposite influence of rainstorm weather on consumers in different areas, and the model has poor portability.
In order to solve the technical problems, the technical scheme of the invention is as follows:
a product sales forecasting method comprising the steps of:
step S1: acquiring external data and internal data which influence the product sales volume;
step S2: processing the acquired data, removing abnormal values and filling missing values, and dividing the processed data into a training set and a verification set;
step S3: constructing a wide and deep model;
step S4: inputting the training set into a wide and deep model to train the model to obtain an optimized wide and deep model;
step S5: and inputting the verification set into the optimized wide and deep model to verify the accuracy of the model.
Preferably, the external data in step S1 includes weather attributes, time characteristics and area characteristics corresponding to the time of sale, and the weather attributes include temperature, humidity, wind speed, rain and snow; the time characteristics comprise holidays, single and double break, weeks and months; regional characteristics include surrounding households, number of cells, number of competitors, and surrounding rates.
Preferably, the internal data in step S1 is historical sales data of the product, including sales information, discount information, and price information;
preferably, the specific process of rejecting the abnormal value in step S2 is as follows:
the method for eliminating the abnormal value is to smooth the sales data by using a lowss method, and the specific process of lowss smoothing is as follows:
taking a point x as a center, intercepting a section of data with the length of frac forward and backward, performing weighted linear regression on the section of data by using a weight function w, and recordingIs the central value of the regression line, whereinFor the fitted curve corresponding values, n weighted regression lines can be made for all n data points, and the connecting line of the central value of each regression line is the Lowess curve of the data.
Preferably, the specific process of filling the missing value in step S2 is as follows: taking the average value (x) of historical n-day sales data x1+x2+...+xn) Filling is carried out on the data/n, corresponding conversion is carried out on the data format of external data according to model requirements, the conversion method comprises one hot encoding of a oneHotEncoder, digital encoding of a LabelEncoder and the like, and meanwhile, equal-frequency box-dividing encoding is carried out on numerical characteristics such as temperature, room price and the like.
Preferably, the specific steps of constructing the wide and deep model in step S3 are as follows:
the wide and deep model comprises a wide model and a deep model, wherein the wide model meets the memorialization characteristic, the deep model meets the Generalization characteristic, and the memorialization characteristic is a rule learned by the model into historical data; the Generalization characteristic is based on historical data, a new feature combination which never appears in the past is explored, the historical data is memorized, and then the historical data is generalized to the features which do not appear before, so that the Generalization capability of the model is improved; fusing the memorisation characteristic of the wide model and the generalisation characteristic of the deep model, and simultaneously playing the roles of the memorisation and the generalisation;
the wide model adopts a linear regression linear model, input features comprise continuous features such as sales information and discount information and sparse discrete features such as weather features and regional features, the discrete features can form higher-dimensional discrete features after being crossed, and the discrete features can be converged into an effective feature combination by introducing L1 regularization in linear model training;
the Deep network model is constructed by the Deep model in the convLSTM mode, and the Deep model can realize the function of time information of the LSTM considering the sales volume and the function of learning the space information of the data by the convolutional neural network by introducing the convLSTM mode, namely the mutual influence among different commodities; the ConvLSTM model has the same core essence as the LSTM, and by adding convolution operation on the basis of the LSTM, the LSTM can not only obtain a time sequence relation, but also extract spatial features like a convolution layer, obtain space-time features by fully fusing the features among commodities, and convert the switching between states into convolution calculation.
Compared with the prior art, the technical scheme of the invention has the beneficial effects that:
compared with the traditional sales prediction method, the method has the advantages that historical sales data and external data such as crawled weather are utilized, the strong space-time sequence prediction capability of a memory network in long and short convolution times is utilized, the correlation among products is fully mined, and the structure information of the products is obtained. The method well combines the influence characteristics of external factors on the consumption rule of consumers by fusing external characteristics such as regional characteristics, weather characteristics and the like, and realizes sales prediction aiming at different regions.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
FIG. 2 is the Wide and Deep model framework.
FIG. 3 is a single convLSTM unit.
FIG. 4 is a schematic diagram of the deep output result full connected.
Detailed Description
The drawings are for illustrative purposes only and are not to be construed as limiting the patent;
for the purpose of better illustrating the embodiments, certain features of the drawings may be omitted, enlarged or reduced, and do not represent the size of an actual product;
it will be understood by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted.
The technical solution of the present invention is further described below with reference to the accompanying drawings and examples.
Example 1
As shown in fig. 1, a product sales prediction method includes the steps of:
step S1: acquiring external data and internal data which influence the product sales volume;
step S2: processing the acquired data, removing abnormal values and filling missing values, and dividing the processed data into a training set and a verification set;
step S3: constructing a wide and deep model;
step S4: inputting the training set into a wide and deep model to train the model to obtain an optimized wide and deep model;
step S5: and inputting the verification set into the optimized wide and deep model to verify the accuracy of the model.
As a preferred embodiment, the external data described in step S1 includes weather attributes, time characteristics, and regional characteristics corresponding to the time of sale, the weather attributes including temperature, humidity, wind speed, rain and snow; the time characteristics comprise holidays, single and double break, weeks and months; regional characteristics include surrounding households, number of cells, number of competitors, and surrounding rates.
As a preferred embodiment, the internal data described in step S1 is historical sales data of the product, including sales amount information, discount information, and price information;
as a preferred embodiment, the specific process of rejecting the abnormal value in step S2 is as follows:
the method for eliminating the abnormal value is to smooth the sales data by using a lowss method, and the specific process of lowss smoothing is as follows:
taking a point x as a center, and intercepting a section of data with the length of frac forwards and backwards, forThe data is weighted by a weight function w to make a weighted linear regression, and the weighted linear regression is recordedIs the central value of the regression line, whereinFor the fitted curve corresponding values, n weighted regression lines can be made for all n data points, and the connecting line of the central value of each regression line is the Lowess curve of the data.
As a preferred embodiment, the specific process of filling the missing value in step S2 is as follows: taking the average value (x) of historical n-day sales data x1+x2+...+xn) Filling is carried out on the data/n, corresponding conversion is carried out on the data format of external data according to model requirements, the conversion method comprises one hot encoding of a oneHotEncoder, digital encoding of a LabelEncoder and the like, and meanwhile, equal-frequency box-dividing encoding is carried out on numerical characteristics such as temperature, room price and the like.
As a preferred embodiment, the specific steps of constructing the wide and deep model in step S3 are as follows:
the wide and deep model comprises a wide model and a deep model, wherein the wide model meets the memorialization characteristic, the deep model meets the Generalization characteristic, and the memorialization characteristic is a rule learned by the model into historical data; the Generalization characteristic is based on historical data, a new feature combination which never appears in the past is explored, the historical data is memorized, and then the historical data is generalized to the features which do not appear before, so that the Generalization capability of the model is improved; fusing the memorisation characteristic of the wide model and the generalisation characteristic of the deep model, and simultaneously playing the roles of the memorisation and the generalisation;
the wide model adopts a linear regression linear model, input features comprise continuous features such as sales information and discount information and sparse discrete features such as weather features and regional features, the discrete features can form higher-dimensional discrete features after being crossed, and the discrete features can be converged into an effective feature combination by introducing L1 regularization in linear model training;
the Deep network model is constructed by the Deep model in the convLSTM mode, and the Deep model can realize the function of time information of the LSTM considering the sales volume and the function of learning the space information of the data by the convolutional neural network by introducing the convLSTM mode, namely the mutual influence among different commodities; the ConvLSTM model has the same core essence as the LSTM, and by adding convolution operation on the basis of the LSTM, the LSTM can not only obtain a time sequence relation, but also extract spatial features like a convolution layer, obtain space-time features by fully fusing the features among commodities, and convert the switching between states into convolution calculation.
Example 2
As shown in fig. 1 to 4, the specific process of this embodiment is as follows:
data processing:
obtaining historical sales data of a product to be predicted and external data of a time period corresponding to the historical sales data, wherein the external data comprises weather attributes, time characteristics and regional characteristics.
Characteristic engineering:
and processing the acquired data into a format capable of being applied to model input through characteristic engineering, and segmenting the data into a training set and a verification set for model training.
Specifically, the method comprises the following steps:
1: abnormal value elimination and missing value filling are carried out on numerical characteristics such as sales data, price characteristics and the like, normalization is needed at the same time, the speed of solving the optimal solution by model gradient reduction is accelerated, model convergence is facilitated,
the normalization method comprises the following steps:
2: performing OneHotEncoder one-hot coding, LabelEncoder digital coding and the like on the category type characteristics of weather attributes and time characteristics, and performing equal-frequency box coding on the numerical characteristics of temperature, room price and the like;
3: in order to ensure the prediction of the model on the time sequence, the processed data needs to be sorted according to time, and a determined time point is taken as a dividing point, the data is divided into a training set and a verification set, and it is ensured that no training set exists in the verification set, such as: the time span of the data set is 2018.01.01-2018.12.30, 2018.01.01-2018.10.01 can be used as a training set, and 2018.10.01-2018.12.30 can be used as a verification set for construction.
wide end linear regression model
1. By x1,x2,xnDescribing components within features, e.g. x1Product historical sales, x2Temperature, xnWhether weekend or not, an estimation function is obtained:
h(x)=hθ(x)=θ0+θ1x1+θ2x2
θ is a parameter indicating the degree of importance of the influence of the sales volume of each of the adjustment features, such as whether weather or holidays have a greater influence on the sales volume.
2. To facilitate the calculation, let x01, the estimation function is expressed in a vector manner:
hθ(x)=θT X
3. a mechanism is involved to evaluate how good the value of θ is, and whether the model prediction result is close to the target, so that the h function is needed to be evaluated, generally, this function is called loss function (loss function) or error function (error function) to describe the degree of excellence of the h function, and the defined loss function is as follows:
4. and solving the loss function of the model by using a gradient descent method to obtain the importance of each sales characteristic when the model best meets the prediction target.
And 5, corresponding importance degrees to all the features of the wide-end model training, wherein the output value of the estimation function is the predicted value in prediction.
Constructing a deep end convLSTM depth model:
the deep end adopts a convLSTM depth model, and the input and output elements of the convLSTM depth model are 3D tensors for retaining all spatial information. Since the network has multiple stacked ConvLSTM layers, it has powerful representation capabilities, making it suitable for prediction in complex dynamic systems.
1. The input feature data is constructed into a 3D tensor in time series,
the specific method comprises the following steps: the sales characteristics of each product are regarded as a 1D tensor, the products are arranged according to a certain relation sequence, the correlation among the products is found out through clustering, meanwhile, the 1D tensors corresponding to the sequenced products are spliced according to the positions of the products, the sales characteristics of all the products are spliced every day to correspond to one 2D tensor, and the obtained 2D tensors are spliced according to the time sequence to obtain the 3D tensor needed by the input end of the convLSTM depth model
2. Constructing a convLSTM depth model network,
with a single convLSTM unit as shown in figure 3. ConvLSTM can learn long-term dependence information of product sales in time through the gate control unit, potential relation between product sales data can be learned through convolution operation of the input sales subdivision matrix Xt, namely, association relation between products can be learned through convolution, and a mathematical model of ConvLSTM is as follows:
in the graph, a convolution calculation is represented, o represents a Hadamard product, Xt represents an input variable at the time t, Ot represents a final output variable, it, ft and Ot respectively represent an input gate, a forgetting gate and an output gate, Ct represents a state after the t time of the hidden layer memory unit is updated, Ht represents a final state at the time t of the hidden layer memory unit, Wi, Wf, Wo, Wc, Ui, Uf, UO and Uc represent connection weights, bi, bf, bo and bc represent activation biases, and the activation functions select sigmoid and tanh activation functions.
It is noted here that X, C, H, i, f, o are all three-dimensional sensors whose last two dimensions represent the spatial information of the rows and columns, i.e. ConvLSTM is conceived as a model handling feature vectors in a two-dimensional grid that can predict features of the central grid from features of surrounding points in the grid.
Output of 3D tensor result full connected conversion at deep end
The output 3D tensor of ConvLSTM is as shown in fig. 4, and is subjected to full connected transform to output the predicted sales of the corresponding product, calculate the loss function of the predicted sales, and perform back propagation training on the connection weights Wi, Wf, Wo, Wc, Ui, Uf, Uo and Uc by using a gradient descent method.
And fusing the forecast results of the wide end model and the deep end model:
fusing the predicted values of the two basic models, and obtaining a final Model by adopting a weight fusion method, wherein the formula is as follows:
prediction=w1f1+w2f2
wherein w1Is the weight occupied by the output value of the wide terminal, w2For the weight occupied by the output value of the deep end, the weight occupied by the two basic models can be calculated according to the Model evaluation index map, and the formula is as follows:
where map1Map evaluation value, map for wide end2The estimated value of map at the deep end is shown.
The same or similar reference numerals correspond to the same or similar parts;
the terms describing positional relationships in the drawings are for illustrative purposes only and are not to be construed as limiting the patent;
it should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.
Claims (6)
1. A product sales forecasting method, comprising the steps of:
step S1: acquiring external data and internal data which influence the product sales volume;
step S2: processing the acquired data, removing abnormal values and filling missing values, and dividing the processed data into a training set and a verification set;
step S3: constructing a wide and deep model;
step S4: inputting the training set into a wide and deep model to train the model to obtain an optimized wide and deep model;
step S5: and inputting the verification set into the optimized wide and deep model to verify the accuracy of the model.
2. The product sales forecasting method according to claim 1, wherein the external data in step S1 includes weather attributes, time characteristics, and regional characteristics corresponding to the sales time period, the weather attributes including temperature, humidity, wind speed, rain and snow; the time characteristics comprise holidays, single and double break, weeks and months; regional characteristics include surrounding households, number of cells, number of competitors, and surrounding rates.
3. The method of claim 1, wherein the internal data in step S1 is historical sales data of the product, including sales information, discount information and price information.
4. The product sales prediction method of claim 3, wherein the step S2 of eliminating the abnormal value comprises the following steps:
the method for eliminating the abnormal value is to smooth the sales data by using a lowss method, and the specific process of lowss smoothing is as follows:
taking a point x as a center, intercepting a section of data with the length of frac forward and backward, performing weighted linear regression on the section of data by using a weight function w, and recordingIs the central value of the regression line, whereinFor the fitted curve corresponding values, n weighted regression lines can be made for all n data points, and the connecting line of the central value of each regression line is the Lowess curve of the data.
5. The product sales prediction method of claim 3, wherein the missing values in step S2 are filled as follows: taking the average value (x) of historical n-day sales data x1+x2+...+xn) Filling is carried out according to/n, and external data needs to be according to the modeThe data format is correspondingly converted according to the type requirement, the conversion method comprises one HotEncoder one-hot coding, LabelEncoder digital coding and the like, and meanwhile, the equal-frequency box-dividing coding is carried out on the numerical characteristics of temperature, room price and the like.
6. The product sales forecasting method according to claim 3, wherein the step S3 of constructing the wide and deep model comprises the following steps:
the wide and deep model comprises a wide model and a deep model, wherein the wide model meets the memorialization characteristic, the deep model meets the Generalization characteristic, and the memorialization characteristic is a rule learned by the model into historical data; the Generalization characteristic is based on historical data, a new feature combination which never appears in the past is explored, the historical data is memorized, and then the historical data is generalized to the features which do not appear before, so that the Generalization capability of the model is improved; fusing the memorisation characteristic of the wide model and the generalisation characteristic of the deep model, and simultaneously playing the roles of the memorisation and the generalisation;
the wide model adopts a linear regression linear model, input features comprise continuous features such as sales information and discount information and sparse discrete features such as weather features and regional features, the discrete features can form higher-dimensional discrete features after being crossed, and the discrete features can be converged into an effective feature combination by introducing L1 regularization in linear model training;
the Deep network model is constructed by the Deep model in the convLSTM mode, and the Deep model can realize the function of time information of the LSTM considering the sales volume and the function of learning the space information of the data by the convolutional neural network by introducing the convLSTM mode, namely the mutual influence among different commodities; the ConvLSTM model has the same core essence as the LSTM, and by adding convolution operation on the basis of the LSTM, the LSTM can not only obtain a time sequence relation, but also extract spatial features like a convolution layer, obtain space-time features by fully fusing the features among commodities, and convert the switching between states into convolution calculation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910745299.6A CN110599234A (en) | 2019-08-13 | 2019-08-13 | Product sales prediction method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910745299.6A CN110599234A (en) | 2019-08-13 | 2019-08-13 | Product sales prediction method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110599234A true CN110599234A (en) | 2019-12-20 |
Family
ID=68854279
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910745299.6A Pending CN110599234A (en) | 2019-08-13 | 2019-08-13 | Product sales prediction method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110599234A (en) |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111401941A (en) * | 2020-03-06 | 2020-07-10 | 武汉大学 | Vehicle sales prediction method based on XGboost recommendation algorithm |
CN111401409A (en) * | 2020-02-28 | 2020-07-10 | 创新奇智(青岛)科技有限公司 | Commodity brand feature acquisition method, sales volume prediction method, device and electronic equipment |
CN112053195A (en) * | 2020-09-09 | 2020-12-08 | 创优数字科技(广东)有限公司 | Retail industry demand prediction method and device |
CN112150209A (en) * | 2020-06-19 | 2020-12-29 | 南京理工大学 | Construction method of CNN-LSTM time sequence prediction model based on clustering center |
CN112598457A (en) * | 2020-12-02 | 2021-04-02 | 新石器慧义知行智驰(北京)科技有限公司 | Control method and device for unmanned vending vehicle and unmanned vending vehicle |
CN113393041A (en) * | 2021-06-21 | 2021-09-14 | 湖南大学 | Retail field supply chain inventory optimization method based on sales prediction |
CN113553540A (en) * | 2020-04-24 | 2021-10-26 | 株式会社日立制作所 | Commodity sales prediction method |
CN113723985A (en) * | 2021-03-04 | 2021-11-30 | 京东城市(北京)数字科技有限公司 | Training method and device for sales prediction model, electronic equipment and storage medium |
CN113743969A (en) * | 2020-05-29 | 2021-12-03 | 北京京东智能城市大数据研究院 | Method, device, equipment and storage medium for sales data prediction |
CN113837794A (en) * | 2021-08-26 | 2021-12-24 | 润联软件系统(深圳)有限公司 | Chain retail store sales prediction method based on space-time graph convolutional network |
CN113869938A (en) * | 2021-09-09 | 2021-12-31 | 杭州铭信信息科技有限公司 | Intelligent ordering method for Nisshink fresh stores |
CN113947015A (en) * | 2021-09-27 | 2022-01-18 | 上海维智卓新信息科技有限公司 | Method and device for determining goods on shelves |
CN114239946A (en) * | 2021-12-09 | 2022-03-25 | 上海维智卓新信息科技有限公司 | Shop sales effect prediction method and device based on regional division |
CN114792257A (en) * | 2022-06-24 | 2022-07-26 | 北京京东振世信息技术有限公司 | Article circulation information generation method, circulation prediction information generation method and device |
CN115994788A (en) * | 2023-03-20 | 2023-04-21 | 北京永辉科技有限公司 | Data processing analysis method and device |
CN111178624B (en) * | 2019-12-26 | 2023-10-20 | 浙江大学 | New product demand prediction method |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107292672A (en) * | 2017-07-05 | 2017-10-24 | 上海数道信息科技有限公司 | System and method for is realized in a kind of catering industry sales forecast |
KR20180071906A (en) * | 2016-12-20 | 2018-06-28 | 한동대학교 산학협력단 | Method of providing t-commerce electronic program guide based on bigdata deep learning |
CN108416625A (en) * | 2018-02-28 | 2018-08-17 | 阿里巴巴集团控股有限公司 | The recommendation method and apparatus of marketing product |
CN108596645A (en) * | 2018-03-13 | 2018-09-28 | 阿里巴巴集团控股有限公司 | A kind of method, apparatus and equipment of information recommendation |
CN108965938A (en) * | 2018-08-03 | 2018-12-07 | 山东大学 | Potential paying customer prediction technique and system in smart television |
CN109509030A (en) * | 2018-11-15 | 2019-03-22 | 北京旷视科技有限公司 | Method for Sales Forecast method and its training method of model, device and electronic system |
CN109785003A (en) * | 2019-01-17 | 2019-05-21 | 四川骏逸富顿科技有限公司 | A kind of Pharmaceutical retail industry medicine sales forecasting system and method |
-
2019
- 2019-08-13 CN CN201910745299.6A patent/CN110599234A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20180071906A (en) * | 2016-12-20 | 2018-06-28 | 한동대학교 산학협력단 | Method of providing t-commerce electronic program guide based on bigdata deep learning |
CN107292672A (en) * | 2017-07-05 | 2017-10-24 | 上海数道信息科技有限公司 | System and method for is realized in a kind of catering industry sales forecast |
CN108416625A (en) * | 2018-02-28 | 2018-08-17 | 阿里巴巴集团控股有限公司 | The recommendation method and apparatus of marketing product |
CN108596645A (en) * | 2018-03-13 | 2018-09-28 | 阿里巴巴集团控股有限公司 | A kind of method, apparatus and equipment of information recommendation |
CN108965938A (en) * | 2018-08-03 | 2018-12-07 | 山东大学 | Potential paying customer prediction technique and system in smart television |
CN109509030A (en) * | 2018-11-15 | 2019-03-22 | 北京旷视科技有限公司 | Method for Sales Forecast method and its training method of model, device and electronic system |
CN109785003A (en) * | 2019-01-17 | 2019-05-21 | 四川骏逸富顿科技有限公司 | A kind of Pharmaceutical retail industry medicine sales forecasting system and method |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111178624B (en) * | 2019-12-26 | 2023-10-20 | 浙江大学 | New product demand prediction method |
CN111401409A (en) * | 2020-02-28 | 2020-07-10 | 创新奇智(青岛)科技有限公司 | Commodity brand feature acquisition method, sales volume prediction method, device and electronic equipment |
CN111401409B (en) * | 2020-02-28 | 2023-04-18 | 创新奇智(青岛)科技有限公司 | Commodity brand feature acquisition method, sales volume prediction method, device and electronic equipment |
CN111401941A (en) * | 2020-03-06 | 2020-07-10 | 武汉大学 | Vehicle sales prediction method based on XGboost recommendation algorithm |
CN113553540A (en) * | 2020-04-24 | 2021-10-26 | 株式会社日立制作所 | Commodity sales prediction method |
CN113743969A (en) * | 2020-05-29 | 2021-12-03 | 北京京东智能城市大数据研究院 | Method, device, equipment and storage medium for sales data prediction |
CN113743969B (en) * | 2020-05-29 | 2024-08-20 | 北京京东智能城市大数据研究院 | Method, apparatus, device and storage medium for sales data prediction |
CN112150209A (en) * | 2020-06-19 | 2020-12-29 | 南京理工大学 | Construction method of CNN-LSTM time sequence prediction model based on clustering center |
CN112150209B (en) * | 2020-06-19 | 2022-10-18 | 南京理工大学 | Construction method of CNN-LSTM time sequence prediction model based on clustering center |
CN112053195A (en) * | 2020-09-09 | 2020-12-08 | 创优数字科技(广东)有限公司 | Retail industry demand prediction method and device |
CN112598457A (en) * | 2020-12-02 | 2021-04-02 | 新石器慧义知行智驰(北京)科技有限公司 | Control method and device for unmanned vending vehicle and unmanned vending vehicle |
CN113723985A (en) * | 2021-03-04 | 2021-11-30 | 京东城市(北京)数字科技有限公司 | Training method and device for sales prediction model, electronic equipment and storage medium |
CN113393041A (en) * | 2021-06-21 | 2021-09-14 | 湖南大学 | Retail field supply chain inventory optimization method based on sales prediction |
CN113837794A (en) * | 2021-08-26 | 2021-12-24 | 润联软件系统(深圳)有限公司 | Chain retail store sales prediction method based on space-time graph convolutional network |
CN113869938A (en) * | 2021-09-09 | 2021-12-31 | 杭州铭信信息科技有限公司 | Intelligent ordering method for Nisshink fresh stores |
CN113947015A (en) * | 2021-09-27 | 2022-01-18 | 上海维智卓新信息科技有限公司 | Method and device for determining goods on shelves |
CN114239946A (en) * | 2021-12-09 | 2022-03-25 | 上海维智卓新信息科技有限公司 | Shop sales effect prediction method and device based on regional division |
CN114792257B (en) * | 2022-06-24 | 2022-11-08 | 北京京东振世信息技术有限公司 | Article circulation information generation method, circulation prediction information generation method and device |
CN114792257A (en) * | 2022-06-24 | 2022-07-26 | 北京京东振世信息技术有限公司 | Article circulation information generation method, circulation prediction information generation method and device |
CN115994788A (en) * | 2023-03-20 | 2023-04-21 | 北京永辉科技有限公司 | Data processing analysis method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110599234A (en) | Product sales prediction method | |
Bi et al. | Daily tourism volume forecasting for tourist attractions | |
CN113537600B (en) | Medium-long-term precipitation prediction modeling method for whole-process coupling machine learning | |
CN109284866B (en) | Commodity order prediction method and device, storage medium and terminal | |
CN110348624B (en) | Sand storm grade prediction method based on Stacking integration strategy | |
CN109389151B (en) | Knowledge graph processing method and device based on semi-supervised embedded representation model | |
CN108009674A (en) | Air PM2.5 concentration prediction methods based on CNN and LSTM fused neural networks | |
CN109657156A (en) | A kind of personalized recommendation method generating confrontation network based on circulation | |
CN110473592B (en) | Multi-view human synthetic lethal gene prediction method | |
US20150254554A1 (en) | Information processing device and learning method | |
CN109214863B (en) | Method for predicting urban house demand based on express delivery data | |
CN110110372B (en) | Automatic segmentation prediction method for user time sequence behavior | |
CN114584406B (en) | Industrial big data privacy protection system and method for federated learning | |
CN116434538B (en) | Urban traffic flow prediction model construction method based on heterogeneous data fusion | |
CN113780584A (en) | Label prediction method, apparatus, storage medium and program product | |
CN103840985A (en) | Web service quality prediction method and device based on user neighborhoods | |
CN115907122A (en) | Regional electric vehicle charging load prediction method | |
US11468352B2 (en) | Method and system for predictive modeling of geographic income distribution | |
Chen | Correlation analysis of financial indicators and stock price fluctuations based on artificial intelligence system | |
CN117217788A (en) | Cigarette demand prediction method and system based on deep learning | |
CN116960962A (en) | Mid-long term area load prediction method for cross-area data fusion | |
KR102474974B1 (en) | Method, device and system for analyzing brand based on artificial intelligence | |
CN116204723A (en) | Social recommendation method based on dynamic hypergraph representation learning | |
CN113240359B (en) | Demand prediction method for coping with external major changes | |
CN113869990A (en) | Shop lease pricing method based on space-time representation learning model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191220 |
|
RJ01 | Rejection of invention patent application after publication |