CN112182226B - 一种基于主成分分析和密度峰值聚类的垃圾邮件检测方法 - Google Patents
一种基于主成分分析和密度峰值聚类的垃圾邮件检测方法 Download PDFInfo
- Publication number
- CN112182226B CN112182226B CN202011114698.1A CN202011114698A CN112182226B CN 112182226 B CN112182226 B CN 112182226B CN 202011114698 A CN202011114698 A CN 202011114698A CN 112182226 B CN112182226 B CN 112182226B
- Authority
- CN
- China
- Prior art keywords
- entry
- entries
- significant
- weight
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 14
- 238000000513 principal component analysis Methods 0.000 title claims abstract description 10
- 238000004458 analytical method Methods 0.000 claims abstract description 5
- 238000000034 method Methods 0.000 claims description 14
- 238000007621 cluster analysis Methods 0.000 claims description 6
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000012216 screening Methods 0.000 abstract description 4
- 230000009467 reduction Effects 0.000 abstract description 2
- 238000001914 filtration Methods 0.000 description 8
- 230000006872 improvement Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 230000007547 defect Effects 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
- G06Q10/107—Computer-aided management of electronic mailing [e-mailing]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L51/00—User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
- H04L51/21—Monitoring or handling of messages
- H04L51/212—Monitoring or handling of messages using filtering or selective blocking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L51/00—User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
- H04L51/42—Mailbox-related aspects, e.g. synchronisation of mailboxes
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Entrepreneurship & Innovation (AREA)
- Computer Hardware Design (AREA)
- Economics (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011114698.1A CN112182226B (zh) | 2020-10-16 | 2020-10-16 | 一种基于主成分分析和密度峰值聚类的垃圾邮件检测方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011114698.1A CN112182226B (zh) | 2020-10-16 | 2020-10-16 | 一种基于主成分分析和密度峰值聚类的垃圾邮件检测方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112182226A CN112182226A (zh) | 2021-01-05 |
CN112182226B true CN112182226B (zh) | 2022-09-30 |
Family
ID=73950838
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011114698.1A Active CN112182226B (zh) | 2020-10-16 | 2020-10-16 | 一种基于主成分分析和密度峰值聚类的垃圾邮件检测方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112182226B (zh) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105630904A (zh) * | 2015-12-21 | 2016-06-01 | 中国电子科技集团公司第十五研究所 | 一种互联网账户信息挖掘的方法和装置 |
CN108462624A (zh) * | 2017-02-17 | 2018-08-28 | 阿里巴巴集团控股有限公司 | 一种垃圾邮件的识别方法、装置以及电子设备 |
CN109947936A (zh) * | 2018-08-21 | 2019-06-28 | 北京大学 | 一种基于机器学习动态检测垃圾邮件的方法 |
-
2020
- 2020-10-16 CN CN202011114698.1A patent/CN112182226B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105630904A (zh) * | 2015-12-21 | 2016-06-01 | 中国电子科技集团公司第十五研究所 | 一种互联网账户信息挖掘的方法和装置 |
CN108462624A (zh) * | 2017-02-17 | 2018-08-28 | 阿里巴巴集团控股有限公司 | 一种垃圾邮件的识别方法、装置以及电子设备 |
CN109947936A (zh) * | 2018-08-21 | 2019-06-28 | 北京大学 | 一种基于机器学习动态检测垃圾邮件的方法 |
Non-Patent Citations (1)
Title |
---|
基于聚类分析算法的垃圾邮件识别;盖璇;《计算机与现代化》;20201015(第10期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN112182226A (zh) | 2021-01-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109165284B (zh) | 一种基于大数据的金融领域人机对话意图识别方法 | |
CN103024746B (zh) | 一种电信运营商垃圾短信处理系统及处理方法 | |
CN107169001A (zh) | 一种基于众包反馈和主动学习的文本分类模型优化方法 | |
CN103729474B (zh) | 用于识别论坛用户马甲账号的方法和系统 | |
CN110826320A (zh) | 一种基于文本识别的敏感数据发现方法及系统 | |
CN109034194A (zh) | 基于特征分化的交易欺诈行为深度检测方法 | |
CN107704512A (zh) | 基于社交数据的金融产品推荐方法、电子装置及介质 | |
CN108363717B (zh) | 一种数据安全级别的识别检测方法及装置 | |
CN107657286B (zh) | 一种广告识别方法及计算机可读存储介质 | |
CN105426441B (zh) | 一种时间序列自动预处理方法 | |
CN107145573A (zh) | 人工智能客服机器人的问题解答方法及系统 | |
CN110750978A (zh) | 情感倾向分析方法、装置、电子设备及存储介质 | |
CN115186654B (zh) | 一种公文文本摘要生成方法 | |
CN109657063A (zh) | 一种海量环保人工上报事件数据的处理方法及存储介质 | |
CN106649338B (zh) | 信息过滤策略生成方法及装置 | |
CN115222303A (zh) | 基于大数据的行业风险数据分析方法、系统及存储介质 | |
CN107818173B (zh) | 一种基于向量空间模型的中文虚假评论过滤方法 | |
CN112182226B (zh) | 一种基于主成分分析和密度峰值聚类的垃圾邮件检测方法 | |
CN115186095B (zh) | 一种未成年人文本识别方法及装置 | |
CN114417821B (zh) | 基于云平台的金融文本核查分析系统 | |
CN113158669B (zh) | 一种用工平台正负面评论识别的方法及系统 | |
CN112507115B (zh) | 一种弹幕文本中情感词的分类方法、装置及存储介质 | |
CN114860931A (zh) | 一种基于Voting Classifier模型的继电保护缺陷文本定级方法 | |
CN114443930A (zh) | 一种新闻舆情智能监测分析方法、系统及计算机存储介质 | |
CN114881130A (zh) | 一种基于Bagging模型的继电保护缺陷文本定级方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231213 Address after: 430000, Room 05, 27th Floor, Building 1, Phase 3, Guannan Fuxing Pharmaceutical Park, No. 58 Guanggu Avenue, Wuhan Donghu New Technology Development Zone, Wuhan, Hubei Province (Wuhan Area of Free Trade Zone) Patentee after: Wuhan Tianzhiran Intellectual Property Operation Co.,Ltd. Address before: 325000 Wenzhou City National University Science Park incubator, No. 38 Dongfang South Road, Ouhai District, Wenzhou, Zhejiang Patentee before: WENZHOU VOCATIONAL & TECHNICAL College |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231228 Address after: 4-1210, 12th Floor, No. 28 Chengfu Road, Haidian District, Beijing, 100080 Patentee after: Beijing Yunche Yigou Technology Co.,Ltd. Address before: 430000, Room 05, 27th Floor, Building 1, Phase 3, Guannan Fuxing Pharmaceutical Park, No. 58 Guanggu Avenue, Wuhan Donghu New Technology Development Zone, Wuhan, Hubei Province (Wuhan Area of Free Trade Zone) Patentee before: Wuhan Tianzhiran Intellectual Property Operation Co.,Ltd. |