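CN113516189B - Method for predicting malicious website users based on a two-stage random forest algorithm - Google Patents
Method for predicting malicious website users based on a two-stage random forest algorithm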
- Publication number
- CN113516189B
- Authority
- CN
- China
- Prior art keywords
- feature
- training data
- user
- random forest
- current
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Concepts
Concept ID | Concept | Type | Query match | Sections | Count |
---|---|---|---|---|---|
238000007637 | random forest analysis | Methods | 0.000 | title, claims, abstract, description | 65 |
238000000034 | method | Methods | 0.000 | title, claims, abstract, description | 29 |
238000004422 | calculation algorithm | Methods | 0.000 | title, claims, abstract, description | 23 |
238000003066 | decision tree | Methods | 0.000 | claims, abstract, description | 69 |
230000003993 | interaction | Effects | 0.000 | claims, abstract, description | 27 |
238000012549 | training | Methods | 0.000 | claims, description | 99 |
230000009467 | reduction | Effects | 0.000 | claims, description | 15 |
238000005070 | sampling | Methods | 0.000 | claims, description | 7 |
230000008569 | process | Effects | 0.000 | claims, description | 6 |
238000012217 | deletion | Methods | 0.000 | claims, description | 5 |
230000037430 | deletion | Effects | 0.000 | claims, description | 5 |
238000004458 | analytical method | Methods | 0.000 | claims, description | 4 |
238000012360 | testing method | Methods | 0.000 | claims, description | 4 |
239000011159 | matrix material | Substances | 0.000 | claims, description | 3 |
230000000717 | retained effect | Effects | 0.000 | claims, description | 3 |
238000001514 | detection method | Methods | 0.000 | abstract, description | 13 |
238000005192 | partition | Methods | 0.000 | description | 4 |
230000000694 | effects | Effects | 0.000 | description | 3 |
238000012216 | screening | Methods | 0.000 | description | 2 |
238000010187 | selection method | Methods | 0.000 | description | 2 |
238000004364 | calculation method | Methods | 0.000 | description | 1 |
238000010276 | construction | Methods | 0.000 | description | 1 |
238000011161 | development | Methods | 0.000 | description | 1 |
230000018109 | developmental process | Effects | 0.000 | description | 1 |
238000010586 | diagram | Methods | 0.000 | description | 1 |
230000008030 | elimination | Effects | 0.000 | description | 1 |
238000003379 | elimination reaction | Methods | 0.000 | description | 1 |
230000006872 | improvement | Effects | 0.000 | description | 1 |
238000010801 | machine learning | Methods | 0.000 | description | 1 |
238000012423 | maintenance | Methods | 0.000 | description | 1 |
238000012545 | processing | Methods | 0.000 | description | 1 |
230000011218 | segmentation | Effects | 0.000 | description | 1 |
238000010998 | test method | Methods | 0.000 | description | 1 |
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2132—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on discrimination criteria, e.g. discriminant analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/245—Classification techniques relating to the decision surface
- G06F18/2451—Classification techniques relating to the decision surface linear, e.g. hyperplane
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/259—Fusion by voting
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110805134.0A (CN113516189B, zh) | 2021-07-16 | 2021-07-16 | Method for predicting malicious website users based on a two-stage random forest algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110805134.0A (CN113516189B, zh) | 2021-07-16 | 2021-07-16 | Method for predicting malicious website users based on a two-stage random forest algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113516189A (zh) | 2021-10-19 |
CN113516189B (zh) | 2022-08-26 |
Family
ID=78067829
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110805134.0A (Active; CN113516189B, zh) | 2021-07-16 | 2021-07-16 | Method for predicting malicious website users based on a two-stage random forest algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113516189B (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
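CN114925759B (zh) * | 2022-05-12 | 2024-07-12 | Northeastern University (东北大学) | A feature analysis method for blockchain phishing accounts |
CN117527369B (zh) * | 2023-11-13 | 2024-06-04 | Wuxi Vocational Institute of Commerce (无锡商业职业技术学院) | Hash-function-based method and system for monitoring Android malicious attacks |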
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9684870B2 (en) * | 2013-01-02 | 2017-06-20 | Qualcomm Incorporated | Methods and systems of using boosted decision stumps and joint feature selection and culling algorithms for the efficient classification of mobile device behaviors |
US10230747B2 (en) * | 2014-07-15 | 2019-03-12 | Cisco Technology, Inc. | Explaining network anomalies using decision trees |
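CN104794192B (zh) * | 2015-04-17 | 2018-06-08 | Nanjing University (南京大学) | Multi-level anomaly detection method based on exponential smoothing and ensemble learning models |
CN106709336A (zh) * | 2015-11-18 | 2017-05-24 | Tencent Technology (Shenzhen) Co., Ltd. (腾讯科技(深圳)有限公司) | Method and apparatus for identifying malware |
CN106778836A (zh) * | 2016-11-29 | 2017-05-31 | Tianjin University (天津大学) | A constraint-based random forest recommendation algorithm |
US10885469B2 (en) * | 2017-10-02 | 2021-01-05 | Cisco Technology, Inc. | Scalable training of random forests for high precise malware detection |
CN108289104B (zh) * | 2018-02-05 | 2020-07-17 | Chongqing University of Posts and Telecommunications (重庆邮电大学) | A DDoS attack detection and mitigation method for industrial SDN networks |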
2021
- 2021-07-16: CN application CN202110805134.0A, patent CN113516189B (zh), status Active
Also Published As
Publication number | Publication date |
---|---|
CN113516189A (zh) | 2021-10-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111612041B (zh) | | Abnormal user identification method and apparatus, storage medium, and electronic device |
US20070094216A1 (en) | | Uncertainty management in a decision-making system |
CN111143838B (zh) | | Method for detecting abnormal behavior of database users |
CN107292097B (zh) | | Feature-group-based method for selecting main symptoms in traditional Chinese medicine |
CN113516189B (zh) | | Method for predicting malicious website users based on a two-stage random forest algorithm |
US11615361B2 (en) | | Machine learning model for predicting litigation risk in correspondence and identifying severity levels |
US20210081899A1 (en) | | Machine learning model for predicting litigation risk on construction and engineering projects |
JP2023546021A (ja) | | System and method for counterfactual explanations in machine learning models |
Ma et al. | | A hybrid methodologies for intrusion detection based deep neural network with support vector machine and clustering technique |
Vivekanandan et al. | | Mining data streams with concept drifts using genetic algorithm |
CN116958622A (zh) | | Data classification method, apparatus, device, medium, and program product |
US20170220665A1 (en) | | Systems and methods for merging electronic data collections |
CN108304568B (zh) | | A big data processing method and system for real estate public expectations |
CN113743453A (zh) | | A random-forest-based population prediction method |
CN117675691A (zh) | | Remote fault monitoring method, apparatus, device, and storage medium for routers |
Liu et al. | | The design of error-correcting output codes algorithm for the open-set recognition |
US20200142910A1 (en) | | Data clustering apparatus and method based on range query using cf tree |
Mansouri et al. | | Generating fuzzy rules for protein classification |
Liang et al. | | Incremental deep forest for multi-label data streams learning |
Alharbi | | Classification Performance Analysis of Decision Tree‐Based Algorithms with Noisy Class Variable |
CN113159976A (zh) | | A method for identifying important users in microblog networks |
Li et al. | | Research on listed companies’ credit ratings, considering classification performance and interpretability |
Marinakos et al. | | Viability prediction for retail business units using data mining techniques: a practical application in the Greek pharmaceutical sector |
Liu et al. | | Using improved feature extraction combined with RF-KNN classifier to predict coal and gas outburst |
CN116932487B (zh) | | A quantitative data analysis method and system based on data paragraph division |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |
| | TR01 | Transfer of patent right | Effective date of registration: 2023-12-13. Address after: Building 27, No. 1882, Yan'an west road, Changning District, Shanghai, 200050. Patentee after: Shanghai Sanfen Sugar Technology Co.,Ltd. Address before: 230000 floor 1, building 2, phase I, e-commerce Park, Jinggang Road, Shushan Economic Development Zone, Hefei City, Anhui Province. Patentee before: Dragon totem Technology (Hefei) Co.,Ltd. |
| | TR01 | Transfer of patent right | Effective date of registration: 2023-12-13. Address after: 230000 floor 1, building 2, phase I, e-commerce Park, Jinggang Road, Shushan Economic Development Zone, Hefei City, Anhui Province. Patentee after: Dragon totem Technology (Hefei) Co.,Ltd. Address before: 541004 No. 15 Yucai Road, Qixing District, Guilin, the Guangxi Zhuang Autonomous Region. Patentee before: Guangxi Normal University |