CN116977035A - Agricultural product recommendation method based on LightGBM and deep learning - Google Patents

Agricultural product recommendation method based on LightGBM and deep learning Download PDF

Info

Publication number
CN116977035A
CN116977035A CN202311235697.6A CN202311235697A CN116977035A CN 116977035 A CN116977035 A CN 116977035A CN 202311235697 A CN202311235697 A CN 202311235697A CN 116977035 A CN116977035 A CN 116977035A
Authority
CN
China
Prior art keywords
agricultural product
data set
product data
lightgbm
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311235697.6A
Other languages
Chinese (zh)
Inventor
赵斌
李斌
高一龙
张继群
曹伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Linyi University
Original Assignee
Linyi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Linyi University filed Critical Linyi University
Priority to CN202311235697.6A priority Critical patent/CN116977035A/en
Publication of CN116977035A publication Critical patent/CN116977035A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211Selection of the most significant subset of features
    • G06F18/2113Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Accounting & Taxation (AREA)
  • Computing Systems (AREA)
  • Finance (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an agricultural product recommendation method based on LightGBM and deep learning, wherein the method comprises the following steps: preprocessing agricultural product data information through a linear normalization method, tag coding and the like; the characteristics in the agricultural product data information are fused and extracted by utilizing the LightGBM, and the characteristics with highest classification accuracy are selected as the input of the neural network layer; then, respectively acquiring a linear cross combination characteristic relation and a nonlinear association relation between the agricultural product data characteristics by using a cross network and a deep neural network; and finally, carrying out linear weighted combination on the results obtained by the preamble network to obtain the final agricultural product recommendation result. The invention adopts the lightGBM, the cross network and the deep neural network to mine the characteristic relation among the agricultural product data, so as to solve the sparsity problem of the agricultural product data; and valuable information is mined from the agricultural product information according to user preference to conduct intelligent recommendation, so that consumers can select agricultural products meeting own requirements more conveniently.

Description

Agricultural product recommendation method based on LightGBM and deep learning
Technical Field
The invention belongs to the field of data analysis and intelligent recommendation, and particularly relates to an agricultural product recommendation method based on LightGBM and deep learning.
Background
With the improvement of the living standard of people and the change of consumption concepts, more and more people begin to pay attention to the health, quality and safety of foods, and the requirements on agricultural products are also higher and higher. However, the conventional shopping manner often requires a great deal of time and effort for consumers to find suitable agricultural products, and often cannot guarantee the quality and safety of the agricultural products. Therefore, the agricultural product recommendation method can help consumers to select the agricultural products meeting the demands more conveniently, and can help agricultural product producers and sellers to promote the products more accurately, so that sales and customer satisfaction are improved.
The LightGBM is a high-efficiency and distributed gradient lifting frame and has the characteristics of high efficiency, high accuracy, low memory occupation and the like; in the agricultural product recommendation method, the LightGBM can be used for extracting the characteristics of users and commodities, constructing characteristic engineering and carrying out quick and efficient recommendation calculation. The deep learning technology has strong learning ability and is widely applied to the fields of computer vision, voice recognition, natural language processing and the like, the deep learning technology is applied to a recommendation method, so that the deep learning technology has more efficient learning ability, hidden relations among features are mined by using a neural network, and the recommendation accuracy can be improved.
Disclosure of Invention
The invention aims to provide an agricultural product recommendation method based on LightGBM and deep learning, which better recommends interesting agricultural products for users, thereby improving the purchase willingness of the users and further improving the sales of the agricultural products.
The invention provides an agricultural product recommendation method based on LightGBM and deep learning, which comprises the following steps:
step S1, preprocessing data in an agricultural product data set to enable a recommendation model to obtain basic data for feature processing to obtain an agricultural product data set B;
s2, taking the agricultural product data set B as input data of a LightGBM module, performing fusion optimization on features of the agricultural product data B through the LightGBM module, selecting features with highest classification accuracy as feature results, and obtaining effective integer result leaf vectors;
step S3, splicing all leaf node index values generated in the LightGBM module to form a new discrete agricultural product data set T, and splicing the new discrete agricultural product data set T with the agricultural product data set B to obtain a latest agricultural product data set V;
s4, taking the agricultural product data set V as input data of a neural network layer, and mining hidden relations of higher-order features in the data set through a cross network and a deep neural network;
and S5, carrying out linear weighted combination on the results obtained by the neural network layer, calculating by the full-connection layer, and obtaining the agricultural product recommendation result after the function is activated.
Further, in the step S1, in the input layer of the agricultural product recommendation model, a reasonable numerical filling process is performed on the blank value part in the agricultural product data set as follows:
considering that the data set is subjected to desensitization treatment, filling sparse data and blank values of the discrete data in the data set to be numerical values or characters which do not exist in the current data column;
carrying out normalization processing on sparse data in the agricultural product data set without the null value at a model input layer, and carrying out tag coding on the discrete data;
the sparse data normalization processing in the data set adopts a linear normalization method to perform linear transformation on the sparse data, and the result is mapped into the [0,1] interval after linear normalization.
Wherein the linear normalization method is expressed as follows:
(1)
wherein, the liquid crystal display device comprises a liquid crystal display device,as a result of the data maximum value,as a result of the minimum value of the data,in the event of a data value being a data value,data values obtained for linear normalization;
and performing discretization processing on the discrete data, and performing label conversion through a LabelEncoder to finally form a full-value agricultural product data set B without a blank value.
Further, the specific implementation manner of step S2 is as follows:
inputting the full-digital agricultural product data set B obtained after preprocessing of an input layer into a LightGBM module of a model, and performing fusion optimization on the characteristic relevance of the agricultural product data set B by utilizing a histogram algorithm in the LightGBM module to convert each characteristic column in the agricultural product data set B into a histogram; the characteristic data in the agricultural product data set B are equally divided into K bins, an integer is allocated to each bin, then the data value in the bin is replaced by the integer allocated to the bin, meanwhile, a histogram with the width of K is formed according to the integer of the bin, statistics and accumulation are carried out on the histogram when the data are traversed, after the data are traversed once, corresponding statistics are accumulated on the histogram, and then the optimal segmentation points are traversed and found according to the discrete value of the histogram.
The method comprises the steps of reducing the number of samples in a data set B by using a single-side gradient sampling algorithm GOSS in a LightGBM module, removing most small gradient samples in a farm data set B, selecting a feature with highest classification accuracy as a feature result, and obtaining an effective integer result leaf vector; all values of features to be split in the agricultural product data set B are ordered in a descending order according to the absolute value, m ∗ data with the largest absolute value are selected, and n ∗ data are randomly selected from the rest smaller gradient data; then randomly selecting n ∗% of data multiplied by a constant (1-m)/n, so that the algorithm can pay more attention to the undertraining samples and cannot generate excessive change on the distribution of the data set B; finally, using (m+n) ∗% data to calculate information gain;
wherein the LightGBM model objective function formula is as follows:
(2)
(3)
the formula of the second-order Taylor expansion result of the function is as follows:
(4)
wherein, the liquid crystal display device comprises a liquid crystal display device,as the true value of the tag,as a result of the k-1 th learning,for the regularized term sums of the first k-1 trees,is a regular term of the kth tree;as a function of the object to be processed,training errors for samples, meaning finding a suitable treeThe value of the function is minimized.
Further, the specific representation of step S3 is as follows:
after the processing of the LightGBM module is completed, the agricultural product data set B generates leaf node index values, and then all the generated leaf node index values are spliced to form a new discrete data set T, so that the sparsity problem of the agricultural product data set B is relieved by the data set T;
combining and splicing the agricultural product data set B and the agricultural product data set T to obtain an agricultural product data set V;
the agricultural product data set V enhances the effectiveness and the interpretability of the original data set, improves the utilization rate of the data characteristics, and enhances the accuracy of the recommendation model.
Further, the neural network layer processing procedure in the agricultural product recommendation model in step S4 is as follows:
the neural network layer in the agricultural product recommendation model is formed by combining a cross network and a deep neural network in parallel, an input part adopts an input sharing principle, and each neural network part processes the same data set so that an output part has relevance and connectivity;
the cross network of the agricultural product recommendation model is responsible for mining the linear cross combination relation between the higher-order features in the agricultural product data set V; the cross network consists of cross layers, wherein each layer of input data can cross with the current layer of data, and linear cross combination characteristics in the data are extracted; to be used forLayer for example, the agricultural product dataset V is characterized by crossing at the crossing layer, after one is completedThe layer will add its input back again and solve the problem of network performance degradation by using the residual idea; the crossover network can automatically learn the crossover characteristics of limited high orders and the weight parameters corresponding to the crossover characteristics, the degree of the crossover characteristics increases with the deepening of the layer depth, the time complexity and the space complexity of the crossover network linearly increase with the increase of the input dimension, and the crossover network has better generalization capability.
First, theLayer outputThe result formula is shown below:
(5)
the output result formula of the Cross network is as follows:
(6)
where x represents input data, b represents Bias,andrepresent the firstLayer and the firstThe output of the layer Cross layer,andrepresenting the connection parameters between the two layers, d representing the dimension of the feature,representing the vector formed by the superposition of the embedded vector and the continuous feature vector.
The deep neural network of the agricultural product recommendation model is responsible for mining nonlinear association relations between higher-order features in the agricultural product data set V; the deep neural network consists of an input layer, an output layer and a hidden layer, is a fully connected neural network, each data characteristic can implicitly interact with other characteristics, and can extract nonlinear high-order characteristics in a data set.
In the deep neural network, the input layers take discrete feature vectors in the agricultural product data set V as input, the neuron of each input layer receives the features in one discrete feature vector, and in the fully-connected hidden layer, the feature information is continuously transmitted; and each characteristic is interactively combined, and finally, nonlinear high-order characteristics are extracted by an output layer.
The output formula is as follows:
(7)
wherein, the liquid crystal display device comprises a liquid crystal display device,representing a non-linear transfer function and,and b is a parameter, and x is an output value of the previous layer.
Further, the specific representation of step S5 is as follows:
the agricultural product recommendation model adopts equal weight weighting, the output results of the cross network and the deep neural network are subjected to linear weighting combination, the spliced result is calculated through a full-connection layer, and the recommendation result of the agricultural product is obtained after the Sigmoid activation function acts.
The output formula is shown in the following 8.
(8)
Compared with the prior art, the invention has the advantages that: the agricultural product recommendation method based on the LightGBM and the deep learning can further excavate the characteristic relation between the agricultural product data information and solve the data sparsity problem, transform, fuse and extract the characteristics of the agricultural products by utilizing the gradient lifting decision tree, and select the characteristics to obtain effective integer result leaf vectors; and then, acquiring a linear cross combination relation of the high-order features and a nonlinear high-order feature association relation by using a cross network and a deep neural network, and fully mining a hidden relation among the high-order features, so that the recommendation accuracy is improved. Meanwhile, consumers are helped to select the agricultural products meeting the demands more conveniently, and meanwhile, producers and sellers of the agricultural products can be helped to promote the products more accurately, so that sales and customer satisfaction are improved.
Drawings
The following describes the embodiments of the present invention in further detail with reference to the accompanying drawings.
FIG. 1 is a flow chart of a method of recommending agricultural products according to the present invention;
fig. 2 is a model diagram of an agricultural product recommendation method of the present invention.
Detailed Description
It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
The example provides a farm product recommendation method based on LightGBM and deep learning, comprising the following steps:
step S1, preprocessing data in an agricultural product data set to enable a recommendation model to obtain basic data for feature processing to obtain an agricultural product data set B;
s2, taking the agricultural product data set B as input data of a LightGBM module, performing fusion optimization on features of the agricultural product data B through the LightGBM module, selecting features with highest classification accuracy as feature results, and obtaining effective integer result leaf vectors;
step S3, splicing all leaf node index values generated in the LightGBM module to form a new discrete agricultural product data set T, and splicing the new discrete agricultural product data set T with the agricultural product data set B to obtain a latest agricultural product data set V;
s4, taking the agricultural product data set V as input data of a neural network layer, and mining hidden relations of higher-order features in the data set through a cross network and a deep neural network;
and S5, carrying out linear weighted combination on the results obtained by the neural network layer, calculating by the full-connection layer, and obtaining the agricultural product recommendation result after the function is activated.
In this example, the data information such as the agricultural product information, the user purchase information, the agricultural product sales information and the like obtained in the step S1 are integrated into an agricultural product data set, and the unoccupied value part existing in the agricultural product data set is subjected to reasonable numerical filling in the input layer of the agricultural product recommendation model.
The filling data is sparse data and the null value of discrete data in the filling agricultural product data set is a numerical value or character which does not exist in the current data column, so that the sparse data and the null value of the discrete data in the filling agricultural product data set become an agricultural product data set without the null value;
and then carrying out normalization processing on sparse data in the agricultural product data set without the vacancy value, carrying out linear transformation on the sparse data by adopting a linear normalization method, and mapping a result into a [0,1] interval after linear normalization.
Wherein the linear normalization method is expressed as follows:
(1)
wherein, the liquid crystal display device comprises a liquid crystal display device,as a result of the data maximum value,as a result of the minimum value of the data,in the event of a data value being a data value,data values obtained for linear normalization;
and performing Label encoding processing on the discrete data in the agricultural product data set without the vacancy value, and performing discretization processing on the data to finally form a full-value data set B without the vacancy value.
In this example, the specific implementation manner of step S2 is as follows:
and inputting the full-value data set B without the vacancy value into a LightGBM module of the model, performing fusion optimization on the feature relevance of the agricultural product data set B by utilizing a histogram algorithm in the LightGBM module, taking the feature set with the highest classification accuracy as a feature selection result, and obtaining an effective integer result leaf vector.
Firstly, converting each characteristic column in an agricultural product data set B into a histogram, dividing the characteristic data in the agricultural product data set B into K bins uniformly, allocating an integer to each bin, replacing the data value in the bin with the integer allocated to the bin, and forming a histogram with the width of K according to the integer of the bin. And when traversing the data, counting and accumulating the histograms, after the data is traversed once, accumulating corresponding statistics by the histograms, and then traversing according to the discrete values of the histograms to find the optimal segmentation points.
Then, a single-side gradient sampling algorithm GOSS is adopted, the number of samples in an agricultural product data set B is reduced, most small gradient samples in the agricultural product data set B are eliminated, all values of features to be split in the agricultural product data set B are ordered in descending order according to the absolute value, m ∗% of data with the largest absolute value are selected, and n ∗% of data are randomly selected from the rest small gradient data; then randomly selecting n ∗% of data multiplied by a constant (1-m)/n, so that the algorithm can pay more attention to the undertraining samples and cannot generate excessive change on the distribution of the data set B; finally the information gain was calculated using (m+n) ∗ data by 100%.
Wherein the LightGBM model objective function formula is as follows:
(2)
(3)
the formula of the second-order Taylor expansion result of the function is as follows:
(4)
wherein, the liquid crystal display device comprises a liquid crystal display device,as the true value of the tag,as a result of the k-1 th learning,for the regularized term sums of the first k-1 trees,is a regular term of the kth tree;as a function of the object to be processed,training errors for samples, meaning finding a suitable treeThe value of the function is minimized.
In this example, the step S3 is specifically shown as follows:
after the processing of the LightGBM module is completed, the agricultural product data set B generates leaf node index values, and then all the generated leaf node index values are spliced to form a new discrete data set T, so that the sparsity problem of the agricultural product data set B is relieved by the data set T;
and combining and splicing the agricultural product data set B and the data set T to obtain an agricultural product data set V, wherein the agricultural product data set V enhances the effectiveness and the interpretability of the original data set, improves the utilization rate of the data characteristics and enhances the accuracy of the recommendation model.
In this example, the specific implementation manner of step S4 is as follows:
the agricultural product data set V is input into a neural network layer of a model, and training is carried out through a cross network and a deep neural network in the neural network layer, wherein the neural network layer in the model is formed by combining the cross network and the deep neural network in parallel, the input part adopts an input sharing principle, and each neural network part processes the same data set, so that the output part has relevance and connectivity.
The cross network of the agricultural product recommendation model is responsible for mining the linear cross combination relation between the higher-order features in the agricultural product data set V; the cross network consists of cross layers, wherein each layer of input data can cross with the current layer of data, and linear cross combination characteristics in the data are extracted; to be used forLayer for example, the agricultural product dataset V is characterized by crossing at the crossing layer, after one is completedThe layer will add its input back again and solve the problem of network performance degradation by using the residual idea; the crossover network can automatically learn limited high-order crossover features and corresponding weight parameters in the agricultural product data set, the crossover feature degree increases with the deepening of the layer depth, the time complexity and the space complexity of the crossover network linearly increase with the increase of the input dimension, and the crossover network has better generalization capability.
First, theThe layer output result formula is as follows:
(5)
the output result formula of the Cross network is as follows:
(6)
where x represents input data, b represents Bias,andrepresent the firstLayer and the firstThe output of the layer Cross layer,andrepresenting the connection parameters between the two layers, d representing the dimension of the feature,representing the vector formed by the superposition of the embedded vector and the continuous feature vector.
The deep neural network of the agricultural product recommendation model is responsible for mining nonlinear association relations between higher-order features in the agricultural product data set V; the deep neural network is a fully-connected neural network, each data feature can implicitly interact with other features, and nonlinear high-order features in a data set can be extracted. The agricultural product data set V is arranged in an input layer of the deep neural network, wherein discrete feature vectors in the data set V are used as input, the neuron of each input layer receives features in one discrete feature vector, and feature information is continuously transmitted in the fully-connected hidden layer; the characteristics are interactively combined, and finally, nonlinear high-order characteristics are extracted by an output layer; implicit interaction can be carried out between the data features of the agricultural product data set V, and nonlinear association relations between the features are mined.
The output formula is as follows:
(7)
wherein, the liquid crystal display device comprises a liquid crystal display device,representing a non-linear transfer function and,and b is ginsengThe number x is the output value of the previous layer.
In this example, the specific representation of step S5 is as follows:
the agricultural product recommendation model adopts equal weight weighting, the output results of the cross network and the deep neural network are subjected to linear weighting combination, the spliced result is calculated through a full-connection layer, and the recommendation result of the agricultural product is obtained after the Sigmoid activation function acts.
The output formula is shown as the following 8:
(8)
the foregoing is merely illustrative of specific embodiments of the present invention, and the scope of the present invention is not limited thereto, but any changes or substitutions within the technical scope of the present invention should be covered by the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. The agricultural product recommending method based on the LightGBM and the deep learning is characterized by comprising the following steps of:
step S1, preprocessing data in an agricultural product data set, reasonably filling numerical values in a blank value part in the data set at an input layer, normalizing sparse data, and performing tag coding on discrete data to enable a recommendation model to obtain basic data for feature processing to obtain an agricultural product data set B;
step S2, taking the agricultural product data set B as input data of a LightGBM module, performing feature fusion optimization on the agricultural product data set B through a histogram algorithm, a unilateral gradient sampling algorithm GOSS and a reciprocal feature binding algorithm in the LightGBM module, selecting features with highest classification accuracy as feature results, and obtaining effective integer result leaf vectors;
step S3, splicing all leaf node index values generated in the LightGBM module to form a new discrete agricultural product data set T, and splicing the new discrete agricultural product data set T with the agricultural product data set B to obtain a latest agricultural product data set V;
s4, taking the agricultural product data set V as input data of a neural network layer, and mining hidden relations of higher-order features in the data set through a cross network and a deep neural network; the method comprises the steps of utilizing a cross network to excavate a linear cross combination relation between high-order features in an agricultural product data set V, and utilizing a deep neural network to excavate a nonlinear association relation between the high-order features in the agricultural product data set V;
and S5, carrying out linear weighted combination on the results obtained by the neural network layer, calculating by the full-connection layer, and obtaining the agricultural product recommendation result after the function is activated.
2. The agricultural product recommendation method based on LightGBM and deep learning as claimed in claim 1, wherein the step S1 of preprocessing the agricultural product data set at the agricultural product recommendation model input layer comprises the following steps:
filling the blank value of the agricultural product data set at the input layer, wherein the filling value is a numerical value or character which does not exist in the current data column when the blank value is filled because the data set is subjected to desensitization treatment;
normalizing sparse data in the agricultural product data set without the vacancy value, and performing tag coding on the discrete data;
the sparse data normalization processing adopts a linear normalization method to perform linear transformation on sparse data, and the result is mapped into a [0,1] interval after linear normalization;
and performing discretization processing on the discrete data, and performing label conversion by using a LabelEncoder to finally form a full-value agricultural product data set B without a blank value.
3. The agricultural product recommendation method based on LightGBM and deep learning as claimed in claim 1, wherein step S2 comprises the following steps:
carrying out fusion optimization on the feature relevance of the agricultural product data set B by utilizing a histogram algorithm in the LightGBM module, converting each feature column in the agricultural product data set B into a histogram, carrying out statistics and accumulation on the histogram when traversing data, accumulating corresponding statistics on the histogram after traversing the data once, and then traversing according to discrete values of the histogram to find optimal segmentation points;
and (3) reducing the number of samples in the data set B by utilizing a single-side gradient sampling algorithm GOSS in the LightGBM module, removing most small gradient samples in the agricultural product data set B, selecting the feature with the highest classification accuracy as a feature result, and obtaining an effective integer result leaf vector.
4. The agricultural product recommendation method based on LightGBM and deep learning as claimed in claim 1, wherein step S3 comprises the following steps:
after the processing of the LightGBM module is completed, the agricultural product data set B generates leaf node index values, and then all the generated leaf node index values are spliced to form a new discrete data set T, so that the sparsity problem of the agricultural product data set B is relieved by the data set T; and simultaneously combining and splicing the agricultural product data set B and the agricultural product data set T to obtain an agricultural product data set V.
5. The agricultural product recommendation method based on LightGBM and deep learning as claimed in claim 1, wherein the neural network layer in step S4 comprises:
the neural network layer is formed by parallel combination of a cross network and a deep neural network, an input part of the neural network layer adopts an input sharing principle, and each neural network part processes the same data set so that an output part has relevance and connectivity;
the cross network digs a linear cross combination relation between higher-order features in the agricultural product data set V;
the deep neural network digs nonlinear association relations between higher-order features in the agricultural product data set V.
6. The agricultural product recommendation method based on LightGBM and deep learning as claimed in claim 1, wherein step S5 specifically comprises the steps of:
the agricultural product recommendation model adopts equal weight weighting, the output results of the cross network and the deep neural network are subjected to linear weighting combination, the spliced result is calculated through a full-connection layer, and the recommendation result of the agricultural product is obtained after the Sigmoid activation function acts.
CN202311235697.6A 2023-09-25 2023-09-25 Agricultural product recommendation method based on LightGBM and deep learning Pending CN116977035A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311235697.6A CN116977035A (en) 2023-09-25 2023-09-25 Agricultural product recommendation method based on LightGBM and deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311235697.6A CN116977035A (en) 2023-09-25 2023-09-25 Agricultural product recommendation method based on LightGBM and deep learning

Publications (1)

Publication Number Publication Date
CN116977035A true CN116977035A (en) 2023-10-31

Family

ID=88479974

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311235697.6A Pending CN116977035A (en) 2023-09-25 2023-09-25 Agricultural product recommendation method based on LightGBM and deep learning

Country Status (1)

Country Link
CN (1) CN116977035A (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112487291A (en) * 2020-11-28 2021-03-12 重庆邮电大学 Big data-based personalized news recommendation method and device
WO2021147568A1 (en) * 2020-08-27 2021-07-29 平安科技(深圳)有限公司 Gbdt high-order feature combination-based recommendation method, apparatus, and storage medium
CN113344615A (en) * 2021-05-27 2021-09-03 上海数鸣人工智能科技有限公司 Marketing activity prediction method based on GBDT and DL fusion model
JP6980958B1 (en) * 2021-06-23 2021-12-15 中国科学院西北生態環境資源研究院 Rural area classification garbage identification method based on deep learning
CN114677200A (en) * 2022-04-01 2022-06-28 重庆邮电大学 Business information recommendation method and device based on multi-party high-dimensional data longitudinal federal learning
CN114782114A (en) * 2022-05-26 2022-07-22 郑州航空港经济综合实验区空港人工智能研究院 Air ticket price prediction method based on CNN-GRU and SSA-GWO-LightGBM
WO2022175970A1 (en) * 2021-02-17 2022-08-25 Waycool Foods And Products Private Limited System and method for determining one or more agri-measures
CN115203264A (en) * 2022-05-13 2022-10-18 青岛文达通科技股份有限公司 Urban tourism route recommendation method and system based on LightGBM algorithm
CN115578137A (en) * 2022-10-31 2023-01-06 华南农业大学 Agricultural product future price prediction method and system based on text mining and deep learning model
CN116186378A (en) * 2022-07-29 2023-05-30 号百信息服务有限公司 Package recommendation method, package recommendation device, electronic equipment and storage medium
CN116304299A (en) * 2023-01-03 2023-06-23 电子科技大学 Personalized recommendation method integrating user interest evolution and gradient promotion algorithm
CN116362836A (en) * 2023-03-30 2023-06-30 东北林业大学 Agricultural product recommendation algorithm based on user behavior sequence
CN116452293A (en) * 2023-04-18 2023-07-18 重庆邮电大学 Deep learning recommendation method and system integrating audience characteristics of articles

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021147568A1 (en) * 2020-08-27 2021-07-29 平安科技(深圳)有限公司 Gbdt high-order feature combination-based recommendation method, apparatus, and storage medium
CN112487291A (en) * 2020-11-28 2021-03-12 重庆邮电大学 Big data-based personalized news recommendation method and device
WO2022175970A1 (en) * 2021-02-17 2022-08-25 Waycool Foods And Products Private Limited System and method for determining one or more agri-measures
CN113344615A (en) * 2021-05-27 2021-09-03 上海数鸣人工智能科技有限公司 Marketing activity prediction method based on GBDT and DL fusion model
JP6980958B1 (en) * 2021-06-23 2021-12-15 中国科学院西北生態環境資源研究院 Rural area classification garbage identification method based on deep learning
CN114677200A (en) * 2022-04-01 2022-06-28 重庆邮电大学 Business information recommendation method and device based on multi-party high-dimensional data longitudinal federal learning
CN115203264A (en) * 2022-05-13 2022-10-18 青岛文达通科技股份有限公司 Urban tourism route recommendation method and system based on LightGBM algorithm
CN114782114A (en) * 2022-05-26 2022-07-22 郑州航空港经济综合实验区空港人工智能研究院 Air ticket price prediction method based on CNN-GRU and SSA-GWO-LightGBM
CN116186378A (en) * 2022-07-29 2023-05-30 号百信息服务有限公司 Package recommendation method, package recommendation device, electronic equipment and storage medium
CN115578137A (en) * 2022-10-31 2023-01-06 华南农业大学 Agricultural product future price prediction method and system based on text mining and deep learning model
CN116304299A (en) * 2023-01-03 2023-06-23 电子科技大学 Personalized recommendation method integrating user interest evolution and gradient promotion algorithm
CN116362836A (en) * 2023-03-30 2023-06-30 东北林业大学 Agricultural product recommendation algorithm based on user behavior sequence
CN116452293A (en) * 2023-04-18 2023-07-18 重庆邮电大学 Deep learning recommendation method and system integrating audience characteristics of articles

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HOUCHOU HAN: "LFDNN: A Novel Hybrid Recommendation Model Based on DeepFM and LightGBM", 《ENTROPY》, vol. 25, no. 4, pages 1 - 15 *
方孝荣;: "基于深度学习BP人工神经网络的农产品质量检测", 金华职业技术学院学报, no. 03, pages 73 - 75 *
杨正成: "基于 LightGBM 的广告商品平台推荐系统设计与应用", 《科技创新与应用》, vol. 12, no. 30, pages 1 - 6 *

Similar Documents

Publication Publication Date Title
Jiajun et al. Ultra-short term wind prediction with wavelet transform, deep belief network and ensemble learning
Windarto Implementation of data mining on rice imports by major country of origin using algorithm using k-means clustering method
CN109903117B (en) Knowledge graph processing method and device for commodity recommendation
CN110648163B (en) Recommendation algorithm based on user comments
Zhang et al. An improved deep computation model based on canonical polyadic decomposition
CN111667022A (en) User data processing method and device, computer equipment and storage medium
Li et al. Hierarchical graph attention networks for semi-supervised node classification
CN112819171B (en) Data searching method, system and computer storage medium based on table function
CN107247753A (en) A kind of similar users choosing method and device
CN111539444A (en) Gaussian mixture model method for modified mode recognition and statistical modeling
CN114693397A (en) Multi-view multi-modal commodity recommendation method based on attention neural network
CN110781401A (en) Top-n project recommendation method based on collaborative autoregressive flow
CN115131698A (en) Video attribute determination method, device, equipment and storage medium
Du et al. Multi-feature based network for multivariate time series classification
CN113159892B (en) Commodity recommendation method based on multi-mode commodity feature fusion
Bhadoria et al. Bunch graph based dimensionality reduction using auto-encoder for character recognition
Raikar et al. Efficiency comparison of supervised and unsupervised classifier on content based classification using shape, color, texture
CN112905906A (en) Recommendation method and system fusing local collaboration and feature intersection
CN116977035A (en) Agricultural product recommendation method based on LightGBM and deep learning
CN113033090B (en) Push model training method, data push device and storage medium
CN114187546A (en) Combined action recognition method and system
CN108805628B (en) Electronic commerce recommendation method
CN112561599A (en) Click rate prediction method based on attention network learning and fusing domain feature interaction
Wang et al. Stock Price Forecasting Based on Wavelet Filtering and Ensembled Machine Learning Model
Patel An Innovative Approach for Association Rule Mining In Grocery Dataset Based On Non-Negative Matrix Factorization And Autoencoder

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination