CN116343944A - Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method - Google Patents

Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method Download PDF

Info

Publication number
CN116343944A
CN116343944A CN202310281731.7A CN202310281731A CN116343944A CN 116343944 A CN116343944 A CN 116343944A CN 202310281731 A CN202310281731 A CN 202310281731A CN 116343944 A CN116343944 A CN 116343944A
Authority
CN
China
Prior art keywords
cigarette
main stream
stream smoke
parameters
cigarettes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310281731.7A
Other languages
Chinese (zh)
Inventor
李海锋
王辉
李炳华
林建南
杜芳琪
宣润泉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Tobacco Zhejiang Industrial Co Ltd
Original Assignee
China Tobacco Zhejiang Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Tobacco Zhejiang Industrial Co Ltd filed Critical China Tobacco Zhejiang Industrial Co Ltd
Priority to CN202310281731.7A priority Critical patent/CN116343944A/en
Publication of CN116343944A publication Critical patent/CN116343944A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/30Prediction of properties of chemical compounds, compositions or mixtures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/70Machine learning, data mining or chemometrics
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Optimization (AREA)
  • Chemical & Material Sciences (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Evolutionary Biology (AREA)
  • Artificial Intelligence (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Health & Medical Sciences (AREA)
  • Algebra (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Manufacturing Of Cigar And Cigarette Tobacco (AREA)

Abstract

The invention discloses a method for predicting the influence of auxiliary parameters, physical indexes and main stream smoke components of cigarettes, which comprises the steps of obtaining the auxiliary parameters, the physical indexes and the main stream smoke data of known cigarette samples in advance, obtaining the influence degree of the auxiliary parameters, the physical indexes and the main stream smoke data of the cigarette samples in specific component content regression of the main stream smoke of the cigarettes through random forest analysis, obtaining a corresponding random forest model regression result, and predicting the specific component data of the main stream smoke of unknown cigarette samples by using the established random forest model. The method utilizes the characteristics that the random forest method is not easy to fall into overfitting, has remarkable noise resistance and can process higher dimensional data, effectively reveals the correlation between the parameters and physical indexes of the auxiliary materials of the cigarettes and preset components in main stream smoke of the cigarettes, accurately predicts the component data in the main stream smoke of unknown cigarette samples, and is sufficient to provide reliable reference for design, research and development of cigarette products and quality control.

Description

Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method
Technical Field
The invention relates to the field of tobacco detection, in particular to a method for predicting influences of auxiliary parameters, physical indexes and main stream smoke components of cigarettes.
Background
The release amount of conventional components (tar, nicotine and CO) of the smoke is an important index for the design of the cigarette, and the release amount of the tar, the nicotine and the CO in the main stream smoke of the cigarette can be accurately predicted, so that the method has important significance for guiding the digital design of cigarette products. Related work has been carried out on the prediction of tar, nicotine and CO emissions in the mainstream smoke of cigarettes: if the quantitative relation between the gram weight of the cigarette paper, the air permeability of the forming paper, the air permeability of the tipping paper and the suction resistance of the filter stick and the release amount of tar, nicotine and CO in the main stream smoke is inspected by using a chemometric method, a multi-element linear prediction model is established; for another example, a mathematical model for predicting the release amount of tar, nicotine and CO in main stream smoke of cigarettes is established by multiple linear regression analysis of the quantitative of the cigarette paper, the air permeability of the forming paper, the air permeability of the tipping paper and the suction resistance of the filter stick; for another example, a multi-element linear regression equation prediction model of filter ventilation, filter rod pressure drop, cigarette paper quantification, cigarette paper ventilation, cigarette paper combustion improver mass fraction and potassium-sodium ratio in the cigarette paper combustion improver and release amounts of tar, nicotine and CO in main stream smoke of cigarettes is established; in addition, for example, a linear regression method and a stepwise regression method are adopted to construct a multi-element linear prediction model of the release amount of tar, nicotine and CO of the medium cigarette based on parameters such as the air permeability, the quantification, the combustion improver consumption, the combustion improver potassium-sodium ratio, the filter rod pressure drop, the tow specification, the air permeability of the forming paper, the air permeability of the tipping paper and the like of the cigarette paper.
However, most of existing algorithms for predicting the release amounts of tar, nicotine and CO in main stream smoke of cigarettes are based on linear fitting models, and the linear fitting models have the defects of low accuracy, easiness in overfitting, large influence of noise and the like.
Disclosure of Invention
In view of the above, the present invention aims to provide a method for predicting the influence of auxiliary parameters and physical indexes of cigarettes and main stream smoke components, so as to solve the defects in the current prediction process of the release amount of specific components of main stream smoke of cigarettes based on the auxiliary parameters and physical indexes of cigarettes.
The technical scheme adopted by the invention is as follows:
the invention provides a method for predicting the influence of auxiliary parameters, physical indexes and main stream smoke components of cigarettes, which comprises the following steps:
inputting cigarette auxiliary material parameters, physical indexes and preset component data in main stream smoke of a known cigarette sample in advance;
determining a random forest method from a system, and setting corresponding algorithm parameters;
obtaining importance ordering results of cigarette auxiliary material parameters and physical indexes of known cigarette samples in preset component regression of cigarette main stream smoke and corresponding random forest model fitting results by utilizing the random forest method;
based on a random forest model fitting result, the cigarette auxiliary material parameters and physical indexes of the unknown cigarette sample are combined, and the preset components in the main stream smoke of the unknown cigarette sample are predicted.
In at least one possible implementation manner, the obtaining the importance ranking result of the cigarette auxiliary parameters and the physical indexes of the known cigarette sample in the preset component regression of the main stream smoke of the cigarette includes:
sampling according to the set training sample size from the data of the known cigarette samples in a random and repeatable mode to form a training set;
combining the optimal segmentation modes of the given auxiliary parameters and physical indexes of the cigarettes in the training set to obtain a known cigarette sample which is not extracted;
when the preset components of the main stream smoke of the cigarettes are regressed, the known cigarette samples which are not extracted are taken as test samples, and the importance of the cigarette auxiliary material parameters and the physical indexes of each cigarette sample in the preset components is evaluated by adopting a random sampling mode.
In at least one possible implementation manner, the importance of the cigarette auxiliary material parameters and physical indexes is represented by adopting standardized data with poor inter-tree prediction precision.
In at least one possible implementation manner, the algorithm parameters include the number of models to be constructed, a sample size, a maximum node number, a maximum tree depth and a minimum child node number.
In at least one possible implementation manner, the cigarette auxiliary material parameters and physical indexes comprise a plurality of combinations of the following various parameter indexes: the cigarette paper ventilation degree, the cigarette paper ration, the cigarette paper citrate content, the cigarette paper phosphate content, the cigarette paper oxalate content, the cigarette paper potassium ion content, the cigarette paper sodium ion content, the tipping paper ventilation degree, the cigarette weight, the standard smoking resistance, the closed smoking resistance, the filter rod ventilation rate, the cigarette paper ventilation rate and the total ventilation rate.
In at least one possible implementation manner, the preset component data in the main stream smoke of the cigarette are obtained by actual measurement according to the established standard in the industry.
In at least one possible implementation thereof, the random forest method includes a random forest classification method or a random forest regression method.
Compared with the prior art, the main design concept of the invention is that the cigarette auxiliary material parameters and the physical indexes of the known cigarette samples and the data of the main stream smoke and the preset components are detected in advance, and then the influence degree of the cigarette auxiliary material parameters and the physical indexes of the cigarette samples in the regression of the content of the specific components in the main stream smoke of the cigarette and the regression result of the corresponding random forest model are obtained after random forest analysis, and the prediction result is carried out on the data of the specific components in the main stream smoke of the unknown cigarette samples by using the established random forest model. The method utilizes the characteristics that the random forest method is not easy to fall into overfitting, has remarkable noise resistance and can process higher dimensional data, effectively reveals the correlation strength between the parameters and physical indexes of auxiliary materials of cigarettes and preset components in main stream smoke of cigarettes, accurately predicts the component data in main stream smoke of unknown cigarette samples, and is sufficient to provide reliable reference for design, research and development of cigarette products and quality control.
Drawings
For the purpose of making the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described with reference to the accompanying drawings, in which:
FIG. 1 is a flow chart of a method for predicting the influence of parameters and physical indexes of auxiliary materials and main stream smoke components of cigarettes provided by an embodiment of the invention;
FIG. 2 is a schematic diagram of linear fitting of predicted and actual values of tar release in mainstream smoke of 8 unknown cigarette samples provided by an embodiment of the present invention;
FIG. 3 is a schematic diagram of linear fitting of predicted and actual values of nicotine release in mainstream smoke of 12 unknown cigarette samples provided by an embodiment of the present invention;
fig. 4 is a schematic linear fitting diagram of predicted values and actual values of CO release amounts in mainstream smoke of 12 unknown cigarette samples provided in an embodiment of the present invention.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative only and are not to be construed as limiting the invention.
The invention provides an embodiment of a method for predicting the influence of auxiliary parameters, physical indexes and main stream smoke components of cigarettes, specifically, as shown in fig. 1, the method comprises the following steps:
step S1, inputting cigarette auxiliary material parameters of known cigarette samples, physical indexes and preset component data in main stream smoke of cigarettes in advance;
step S2, determining a random forest method (comprising a random forest classification method or a random forest regression method, which can be selected as the random forest regression method) from a system, and setting corresponding algorithm parameters, wherein the algorithm parameters can comprise the number of models to be constructed, the sample size, the number of maximum nodes, the maximum tree depth and the minimum number of child nodes; for example, the number of models is set to 100, the sample size is set to 1.0, and other construction parameters can be preset default values.
S3, obtaining an importance ordering result of the cigarette auxiliary material parameters and the physical indexes of the known cigarette samples in the regression of the preset components of the main stream smoke of the cigarette and a corresponding random forest model fitting result by utilizing the random forest method;
and S4, based on a random forest model fitting result, predicting preset components in main stream smoke of the unknown cigarette sample by combining the cigarette auxiliary parameters and physical indexes of the unknown cigarette sample.
It should be noted here that the present invention is primarily directed to these three main stream smoke constituent objects, particularly with respect to tar, nicotine and CO emissions, but the present invention is not limited to these three elements, either necessarily or only.
In some preferred embodiments, the obtaining the importance ranking result of the cigarette accessory parameters and the physical indexes of the known cigarette sample in the preset component regression of the main stream smoke of the cigarette includes:
step S31, sampling is carried out in a random and repeatable mode according to the set training sample size from the data of the known cigarette samples to form a training set;
step S32, combining the optimal segmentation modes of the given auxiliary parameters and physical indexes of the cigarettes in the training set to obtain a known cigarette sample which is not extracted;
and step S33, when the preset components of the main stream smoke of the cigarettes are regressed, taking the known cigarette samples which are not extracted as test samples, and evaluating the importance of the cigarette auxiliary material parameters and physical indexes of each cigarette sample in the preset components by adopting a random sampling mode.
The cigarette auxiliary material parameters and physical indexes specifically can comprise a plurality of combinations of the following various parameter indexes: the cigarette paper ventilation degree, the cigarette paper ration, the cigarette paper citrate content, the cigarette paper phosphate content, the cigarette paper oxalate content, the cigarette paper potassium ion content, the cigarette paper sodium ion content, the tipping paper ventilation degree, the cigarette weight, the standard smoking resistance, the closed smoking resistance, the filter rod ventilation rate, the cigarette paper ventilation rate and the total ventilation rate. And the data of preset components (such as the release amounts of tar, nicotine and CO) in the main stream smoke of the cigarettes are obtained by actual measurement according to the established standard in the industry.
Here, taking an analysis process of a random forest method of the release amount of CO in main stream smoke of cigarettes as an example: the total number of known cigarette samples is N, the size of the training sample is y, and the number of constructed models is N tress The number of cigarette auxiliary parameters and physical index variables of the cigarette samples randomly extracted at the nodes is m; then, sampling is preferably performed by adopting a bootstrap sampling method, namely sampling is performed by adopting a random and repeatable sampling mode according to the size of a set training sample from N pieces of data of known cigarette samples to form a group of training sets; then, carrying out regression on the known cigarette samples which are not taken by using an optimal segmentation mode of m variables in the training set, wherein each time the samples which are not taken are N× (1-y), and the samples which are not taken can be collectively called as out-of-bag data; when the release amounts of tar, nicotine and CO are regressed, the out-of-bag data can be used as a test sample, and a random sampling method is adopted to evaluate the importance of the material parameters of each cigarette sample in the prediction of the release amounts of tar, nicotine and CO. In the concrete calculation, the importance of the cigarette auxiliary parameters and physical indexes of the cigarette sample is preferably represented by standardized data with poor inter-tree prediction precision.
The invention is further specifically described with reference to the following examples, wherein: the method is characterized in that 14 auxiliary parameters and physical indexes of a cigarette sample with known tar release amount are used as variables, a random forest method is adopted to analyze the importance of the cigarette auxiliary parameters and physical indexes of the cigarette sample in the regression of the tar release amount in the main stream smoke of the cigarette, and the tar release amount in the main stream smoke of an unknown cigarette sample is predicted.
The tar release amounts of the main stream smoke of the produced 55 cigarette samples were subjected to fitting analysis (table 1, table 2) by taking 14 cigarette auxiliary parameters and physical indexes of the ventilation degree of the cigarette paper, the quantification of the cigarette paper, the citrate content of the cigarette paper, the phosphate content of the cigarette paper, the oxalate content of the cigarette paper, the potassium ion content of the cigarette paper, the sodium ion content of the cigarette paper, the ventilation degree of the tipping paper, the weight of the cigarette, the standard smoking resistance, the closed smoking resistance, the ventilation rate of the filter stick, the ventilation rate of the cigarette paper and the total ventilation rate as variables. The results show that the tar release amount in the main stream smoke of the cigarette samples can be accurately fitted by preferably but not exclusively adopting a random forest regression method configured in an IBM SPSS model 18.0 data processing system according to the cigarette auxiliary parameters and physical indexes of each cigarette sample.
TABLE 1 random forest regression analysis results of tar release from 55 cigarette samples
Figure BDA0004138255770000061
Figure BDA0004138255770000071
TABLE 2 analysis of Tar Release random forest method fitting results for 55 cigarette samples
Figure BDA0004138255770000072
Next, the importance of each cigarette accessory parameter and physical index is shown in Table 3. As can be seen from table 3, the importance ranking among the 14 detected cigarette auxiliary parameters and physical indexes is: cigarette paper potassium ion content & gt filter stick ventilation rate & gt standard smoke resistance & gt total ventilation rate & gt tipping paper air permeability & gt cigarette paper ventilation rate & gt cigarette paper citrate content & gt cigarette paper quantitative & gt cigarette paper sodium ion content, cigarette paper phosphate content, cigarette paper oxalate content, cigarette weight and closed smoke resistance, because the minimum field variation is less than 0.05, the cigarette paper phosphate is automatically removed by a random forest regression method in a data processing system, and the cigarette paper phosphate is not listed in importance ranking.
TABLE 3 importance ordering of 14 auxiliary parameters and physical indicators in tar release fitting
Figure BDA0004138255770000073
Figure BDA0004138255770000081
Then, random forest predictions were performed on 8 unknown cigarette samples according to the fitting results obtained by the foregoing method (see table 4). The result shows that according to the cigarette auxiliary material parameters and physical indexes of the cigarette samples and the established random forest model of the tar release amount in the main stream smoke, the tar release amount in the main stream smoke of unknown cigarette samples can be accurately predicted by adopting a random forest regression method (see table 5 and figure 2), and R2 is 0.959.
Tab 4 8 unknown cigarette sample main stream smoke tar release prediction results
Figure BDA0004138255770000082
Table 5 8 random forest method prediction result analysis of tar release amount of unknown cigarette samples
Figure BDA0004138255770000083
In addition to the above examples, the method further uses 14 auxiliary parameters and physical indexes of the cigarette sample with known nicotine release amount as variables, adopts a random forest method to analyze the importance of the cigarette auxiliary parameters and physical indexes of the cigarette sample in the regression of the nicotine release amount in the main stream smoke of the cigarette, and predicts the nicotine release amount in the main stream smoke of the unknown cigarette sample.
The tobacco release amounts of the produced main stream smoke of 63 cigarette samples were subjected to fitting analysis (table 6, table 7) by taking 14 cigarette auxiliary parameters and physical indexes of the ventilation degree of the cigarette paper, the quantification of the cigarette paper, the citrate content of the cigarette paper, the phosphate content of the cigarette paper, the oxalate content of the cigarette paper, the potassium ion content of the cigarette paper, the sodium ion content of the cigarette paper, the ventilation degree of the tipping paper, the weight of the cigarette, the standard smoking resistance, the closed smoking resistance, the ventilation rate of the filter stick, the ventilation rate of the cigarette paper and the total ventilation rate as variables. The result shows that the release amount of nicotine in main stream smoke of the cigarette samples can be accurately fitted by adopting a random forest regression method according to the cigarette auxiliary material parameters and physical indexes of each cigarette sample.
TABLE 6 random forest regression analysis of nicotine Release from 63 cigarette samples
Figure BDA0004138255770000091
Figure BDA0004138255770000101
TABLE 7 analysis of random forest method fitting results for nicotine Release from 63 cigarette samples
Figure BDA0004138255770000102
Next, the importance of 14 auxiliary parameters and physical indexes of the cigarette samples on the release amount of nicotine in the main stream smoke is analyzed, and the importance of each auxiliary parameter and physical index of each cigarette is shown in table 8. As can be seen from table 8, the importance ranking of the detected 14 cigarette auxiliary parameters and physical indexes is: the cigarette paper potassium ion content is larger than the cigarette paper ventilation rate is larger than the cigarette paper quantification is larger than the cigarette paper ventilation rate is larger than the filter stick ventilation rate is larger than the tipping paper air permeability is larger than the cigarette paper citrate content is larger than the standard smoke resistance is larger than the total ventilation rate is larger than the cigarette paper air permeability is larger than the cigarette paper sodium ion content, and the cigarette paper phosphate radical content, the cigarette paper oxalate radical content, the cigarette weight and the closed smoke resistance are automatically removed by a random forest regression method in the system because the minimum field variation is smaller than 0.05, so that the cigarette paper phosphate radical content, the cigarette paper oxalate radical content, the cigarette weight and the closed smoke resistance are not listed in the importance ranking.
TABLE 8 importance ranking of 14 auxiliary parameters and physical indicators in Nicotine Release amount fitting
Figure BDA0004138255770000111
Then, random forest predictions were made for 12 unknown cigarette samples according to the random forest model that had been established (table 9). The results show that the random forest regression method can accurately predict the release amount of nicotine in the main stream smoke of an unknown cigarette sample according to the cigarette auxiliary material parameters and physical indexes of the cigarette sample and the established random forest model of the release amount of nicotine in the main stream smoke (see table 10 and figure 3), and R2 is 0.965.
TABLE 9 prediction results of nicotine Release in mainstream Smoke from 12 unknown cigarette samples
Figure BDA0004138255770000121
TABLE 10 analysis of prediction results of nicotine Release from 12 unknown cigarette samples by random forest method
Figure BDA0004138255770000122
Finally, the method also takes 14 auxiliary parameters and physical indexes of the cigarette sample with known CO release as variables, adopts a random forest method to analyze the importance of the cigarette auxiliary parameters and physical indexes of the cigarette sample in the regression of the CO release in the main stream smoke of the cigarette and predicts the CO release in the main stream smoke of the unknown cigarette sample.
The CO release amounts in the main stream smoke of the produced 63 cigarette samples were subjected to fitting analysis (table 11, table 12) by taking 14 cigarette auxiliary parameters and physical indexes of the air permeability of the cigarette paper, the quantification of the cigarette paper, the citrate content of the cigarette paper, the phosphate content of the cigarette paper, the oxalate content of the cigarette paper, the potassium ion content of the cigarette paper, the sodium ion content of the cigarette paper, the air permeability of the tipping paper, the weight of the cigarette, the standard smoking resistance, the closed smoking resistance, the ventilation rate of the filter stick, the ventilation rate of the cigarette paper and the total ventilation rate as variables. The result shows that the CO release amount in the main stream smoke of the cigarette samples can be accurately fitted by adopting a random forest regression method according to the cigarette auxiliary material parameters and physical indexes of each cigarette sample.
TABLE 11 random forest regression analysis results of CO Release from 63 cigarette samples
Figure BDA0004138255770000123
/>
Figure BDA0004138255770000131
/>
Figure BDA0004138255770000141
TABLE 12 random forest method fitting result analysis of CO Release amount for 63 cigarette samples
Figure BDA0004138255770000142
Next, the importance of the 14 auxiliary parameters and physical indexes of the cigarette sample on the release amount of CO in the main stream smoke is analyzed, and the importance of each auxiliary parameter and physical index of the cigarette is shown in table 13. As can be seen from table 13, the importance ranking of the detected 14 cigarette auxiliary parameters and physical indexes is: cigarette paper potassium ion content & gt, filter stick ventilation rate & gt, standard smoke resistance & gt, cigarette paper ventilation rate & gt, cigarette paper quantitative & gt, cigarette paper air permeability & gt, total ventilation rate & gt, cigarette paper citrate content & gt, tipping paper air permeability & gt, cigarette paper sodium ion content, and cigarette paper phosphate content, cigarette paper oxalate content, cigarette weight and closed smoke resistance are automatically removed by a random forest regression method in a system due to the fact that the minimum field variation is smaller than 0.05, so that importance ranking is not included.
Table 13 importance ranking of 14 auxiliary parameters and physical indicators in CO release fitting
Figure BDA0004138255770000143
/>
Figure BDA0004138255770000151
Then, random forest predictions were performed on 12 unknown cigarette samples according to the random forest model method that was already established (table 14). The result shows that according to the cigarette auxiliary material parameters and physical indexes of the cigarette samples and the established random forest model of the CO release amount in the main stream smoke, the CO release amount in the main stream smoke of the unknown cigarette samples can be accurately predicted by adopting a random forest regression method (see table 15 and figure 4), and R2 is 0.950.
TABLE 14 prediction results of CO Release in mainstream smoke of 12 unknown cigarette samples
Figure BDA0004138255770000152
TABLE 15 analysis of CO Release random forest prediction results for 12 unknown cigarette samples
Figure BDA0004138255770000153
In summary, the main design concept of the invention is to detect and obtain the cigarette auxiliary parameters and physical indexes of the known cigarette samples and the data of the main stream smoke and the preset components in advance, then obtain the influence degree of the cigarette auxiliary parameters and physical indexes of the cigarette samples in the regression of the specific component content in the main stream smoke of the cigarette and the regression result of the corresponding random forest model after random forest analysis, and use the established random forest model to predict the specific component data in the main stream smoke of the unknown cigarette samples. The method utilizes the characteristics that the random forest method is not easy to fall into overfitting, has remarkable noise resistance and can process higher dimensional data, effectively reveals the correlation strength between the parameters and physical indexes of auxiliary materials of cigarettes and preset components in main stream smoke of cigarettes, accurately predicts the component data in main stream smoke of unknown cigarette samples, and is sufficient to provide reliable reference for design, research and development of cigarette products and quality control.
In the embodiments of the present invention, "at least one" means one or more, and "a plurality" means two or more. "and/or", describes an association relation of association objects, and indicates that there may be three kinds of relations, for example, a and/or B, and may indicate that a alone exists, a and B together, and B alone exists. Wherein A, B may be singular or plural. The character "/" generally indicates that the context-dependent object is an "or" relationship. "at least one of the following" and the like means any combination of these items, including any combination of single or plural items. For example, at least one of a, b and c may represent: a, b, c, a and b, a and c, b and c or a and b and c, wherein a, b and c can be single or multiple.
The construction, features and effects of the present invention are described in detail according to the embodiments shown in the drawings, but the above is only a preferred embodiment of the present invention, and it should be understood that the technical features of the above embodiment and the preferred mode thereof can be reasonably combined and matched into various equivalent schemes by those skilled in the art without departing from or changing the design concept and technical effects of the present invention; therefore, the invention is not limited to the embodiments shown in the drawings, but is intended to be within the scope of the invention as long as changes made in the concept of the invention or modifications to the equivalent embodiments do not depart from the spirit of the invention as covered by the specification and drawings.

Claims (7)

1. A method for predicting the influence of auxiliary parameters, physical indexes and main stream smoke components of cigarettes is characterized by comprising the following steps:
inputting cigarette auxiliary material parameters, physical indexes and preset component data in main stream smoke of a known cigarette sample in advance;
determining a random forest method from a system, and setting corresponding algorithm parameters;
obtaining importance ordering results of cigarette auxiliary material parameters and physical indexes of known cigarette samples in preset component regression of cigarette main stream smoke and corresponding random forest model fitting results by utilizing the random forest method;
based on a random forest model fitting result, the cigarette auxiliary material parameters and physical indexes of the unknown cigarette sample are combined, and the preset components in the main stream smoke of the unknown cigarette sample are predicted.
2. The method for predicting the influence of parameters and physical indexes of auxiliary materials on main stream smoke components of cigarettes according to claim 1, wherein the obtaining the importance sequencing result of the parameters and physical indexes of auxiliary materials of known cigarettes in the regression of the preset components of main stream smoke of cigarettes comprises:
sampling according to the set training sample size from the data of the known cigarette samples in a random and repeatable mode to form a training set;
combining the optimal segmentation modes of the given auxiliary parameters and physical indexes of the cigarettes in the training set to obtain a known cigarette sample which is not extracted;
when the preset components of the main stream smoke of the cigarettes are regressed, the known cigarette samples which are not extracted are taken as test samples, and the importance of the cigarette auxiliary material parameters and the physical indexes of each cigarette sample in the preset components is evaluated by adopting a random sampling mode.
3. The method for predicting the influence of the parameters and the physical indexes of the auxiliary materials of the cigarettes and the components of the main stream smoke according to claim 2, wherein the importance of the parameters and the physical indexes of the auxiliary materials of the cigarettes is represented by using standardized data with poor inter-tree prediction precision.
4. The method for predicting the effects of auxiliary parameters, physical indexes and main stream smoke components of a cigarette according to claim 1, wherein the algorithm parameters comprise the number of models to be constructed, the sample size, the maximum node number, the maximum tree depth and the minimum child node number.
5. The method for predicting the effect of cigarette accessory parameters and physical indicators and main stream smoke components according to claim 1, wherein the cigarette accessory parameters and physical indicators comprise a plurality of combinations of the following parameter indicators: the cigarette paper ventilation degree, the cigarette paper ration, the cigarette paper citrate content, the cigarette paper phosphate content, the cigarette paper oxalate content, the cigarette paper potassium ion content, the cigarette paper sodium ion content, the tipping paper ventilation degree, the cigarette weight, the standard smoking resistance, the closed smoking resistance, the filter rod ventilation rate, the cigarette paper ventilation rate and the total ventilation rate.
6. The method for predicting the influence of auxiliary parameters, physical indexes and main stream smoke components of cigarettes according to claim 1, wherein the preset component data in the main stream smoke of the cigarettes are obtained by actual measurement according to established standards in the industry.
7. The method for predicting the influence of cigarette auxiliary parameters, physical indexes and main stream smoke components according to any one of claims 1 to 6, wherein the random forest method comprises a random forest classification method or a random forest regression method.
CN202310281731.7A 2023-03-17 2023-03-17 Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method Pending CN116343944A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310281731.7A CN116343944A (en) 2023-03-17 2023-03-17 Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310281731.7A CN116343944A (en) 2023-03-17 2023-03-17 Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method

Publications (1)

Publication Number Publication Date
CN116343944A true CN116343944A (en) 2023-06-27

Family

ID=86889002

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310281731.7A Pending CN116343944A (en) 2023-03-17 2023-03-17 Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method

Country Status (1)

Country Link
CN (1) CN116343944A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117493945A (en) * 2023-12-25 2024-02-02 山东产研青创信息科技有限公司 Flue gas analysis early warning method and system based on computer vision

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117493945A (en) * 2023-12-25 2024-02-02 山东产研青创信息科技有限公司 Flue gas analysis early warning method and system based on computer vision
CN117493945B (en) * 2023-12-25 2024-03-29 山东产研青创信息科技有限公司 Flue gas analysis early warning method and system based on computer vision

Similar Documents

Publication Publication Date Title
EP1860961B1 (en) Smoking machine
CN107796782B (en) Redrying quality stability evaluation method based on tobacco leaf characteristic spectrum consistency measurement
CN116343944A (en) Cigarette auxiliary material parameter and physical index and main stream smoke component influence prediction method
CN105445421B (en) A kind of method of aesthetic quality during predictive slice nicotinic alcohol by appearance index
CN108732127B (en) Method for detecting mixing proportion of each component in cut tobacco
CN101949862A (en) Method for measuring wrapped ash state after burning of cigarette
CN103344572A (en) Method for evaluating blending homogeneity of cut rolled stems and regenerated cut tobaccos in cigarettes
CN113190967B (en) Multi-objective screening-based fine cigarette material combination method
CN110274848A (en) Cigarette shreds based on density of tobacco rod are distributed characterizing method
CN104899424B (en) A kind of cured tobacco leaf maturity and blade construction objective measure based on microscopic feature
CN105628646B (en) A kind of online tar prediction of cigarette and method for early warning
CN101147623A (en) Cigarette supplementary lean combined assembling method
CN103892435A (en) Method for preparing tobacco powder raw materials and stabilizing quality of reconstituted tobacco
CN105223140A (en) The method for quickly identifying of homology material
CN108398520B (en) Method for predicting cigarette combustion intensity based on detection of flow field distribution
CN104090082B (en) The Forecasting Methodology of the full smoke pH of a kind of cigarette
CN114563425A (en) Tobacco leaf potassium content determination method based on tobacco leaf microstructure
CN110333177B (en) Cigarette segmented suction resistance nondestructive rapid detection evaluation method based on cigarette paper air permeability stability
CN113092689B (en) Quantitative determination method for tobacco stem penetration degree
CN101701885B (en) Gas collecting bottle for connecting smoking machine with electronic nose and use method thereof
CN117272239A (en) Cigarette ash packing performance prediction method based on random forest algorithm
CN104598752B (en) A kind of CO burst size forecast model construction methods in cigarette smoke based on combustion adjuvant
CN113222268A (en) Multi-mode reasoning-based tobacco baking quality prediction model establishing method
CN102823943B (en) Cigarette design adjusting method based on NNK release amount in deep smoking case
CN112526009A (en) Method for measuring water content of heated cigarette core material based on water activity

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination