CN105095382B - Distributed sample clustering computation method and device - Google Patents
- Publication number
- CN105095382B
- Authority
- CN
- China
- Prior art keywords
- characteristic value
- computing device
- value
- similarity
- characteristic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (18)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510375182.5A | 2015-06-30 | 2015-06-30 | Distributed sample clustering computation method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510375182.5A | 2015-06-30 | 2015-06-30 | Distributed sample clustering computation method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105095382A (zh) | 2015-11-25 |
CN105095382B (zh) | 2018-09-14 |
Family
ID=54575819
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510375182.5A (Active) | Distributed sample clustering computation method and device | 2015-06-30 | 2015-06-30 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105095382B (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108460049B (zh) * | 2017-02-21 | 2021-10-19 | 阿里巴巴集团控股有限公司 | Method and system for determining an information category |
CN110472055B (zh) * | 2019-08-21 | 2021-09-14 | 北京百度网讯科技有限公司 | Method and device for labeling data |
CN112487432A (zh) * | 2020-12-10 | 2021-03-12 | 杭州安恒信息技术股份有限公司 | Method, system, and device for malicious file detection based on icon matching |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101178720A (zh) * | 2007-10-23 | 2008-05-14 | 浙江大学 | Distributed clustering method for Internet micro-content |
CN102930206A (zh) * | 2011-08-09 | 2013-02-13 | 腾讯科技(深圳)有限公司 | Clustering and partitioning method and device for virus files |
CN103218233A (zh) * | 2013-05-09 | 2013-07-24 | 福州大学 | Data allocation strategy in Hadoop heterogeneous clusters |
US8655878B1 (en) * | 2010-05-06 | 2014-02-18 | Zeitera, Llc | Scalable, adaptable, and manageable system for multimedia identification |
CN103595805A (zh) * | 2013-11-22 | 2014-02-19 | 浪潮电子信息产业股份有限公司 | Data placement method based on distributed clusters |
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101178720A (zh) * | 2007-10-23 | 2008-05-14 | 浙江大学 | Distributed clustering method for Internet micro-content |
US8655878B1 (en) * | 2010-05-06 | 2014-02-18 | Zeitera, Llc | Scalable, adaptable, and manageable system for multimedia identification |
CN102930206A (zh) * | 2011-08-09 | 2013-02-13 | 腾讯科技(深圳)有限公司 | Clustering and partitioning method and device for virus files |
CN103218233A (zh) * | 2013-05-09 | 2013-07-24 | 福州大学 | Data allocation strategy in Hadoop heterogeneous clusters |
CN103595805A (zh) * | 2013-11-22 | 2014-02-19 | 浪潮电子信息产业股份有限公司 | Data placement method based on distributed clusters |
Non-Patent Citations (1)
Title |
---|
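Cloud computing: building the core computing platform for future power systems; Zhao Junhua et al.; Automation of Electric Power Systems; 2010-08-10; Vol. 34 (No. 15); full text * |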
Also Published As
Publication number | Publication date |
---|---|
CN105095382A (zh) | 2015-11-25 |
Similar Documents
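Publication | Title |
---|---|
CN104978526B (zh) | Method and device for extracting virus features |
WO2016101628A1 (zh) | Data processing method and device in data modeling |
CN107220261B (zh) | Real-time mining method and device based on distributed data |
CN106803799B (zh) | Performance testing method and device |
JP7069173B2 (ja) | System for preparing network traffic for fast analysis |
CN105095382B (zh) | Distributed sample clustering computation method and device |
CN106062751A (zh) | Management of data profiling operations related to data type |
CN108805174A (zh) | Clustering method and device |
CN108628675A (zh) | Data processing method, apparatus, device, and computer-readable storage medium |
CN104881322A (zh) | Cluster resource scheduling method and device based on a bin-packing model |
CN106708738A (zh) | Software test defect prediction method and system |
CN111626311B (zh) | Heterogeneous graph data processing method and device |
CN112241494A (zh) | Key information push method and device based on user behavior data |
CN106453320A (zh) | Method and device for identifying malicious samples |
CN105630797B (zh) | Data processing method and system |
CN109828790A (zh) | Data processing method and system based on the Sunway heterogeneous many-core processor |
CN109564569A (zh) | Reducing memory usage for long-running computations |
CN109416688B (zh) | Method and system for flexible, high-performance structured data processing |
CN101495978B (zh) | Reducing message flow between bus-connected consumers and producers |
CN103699653A (zh) | Data clustering method and device |
CN103530369A (zh) | Deduplication method and system |
CN110209656B (zh) | Data processing method and device |
CN110807159B (zh) | Data labeling method, device, storage medium, and electronic device |
CN104778088A (zh) | Parallel I/O optimization method and system based on reducing inter-process communication overhead |
Yan et al. | Automatic virtual network embedding based on deep reinforcement learning |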
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2022-07-27. Patentee after: 3600 Technology Group Co.,Ltd. (address after: 300450 No. 9-3-401, No. 39, Gaoxin 6th Road, Binhai Science Park, Binhai New Area, Tianjin). Patentees before: BEIJING QIHOO TECHNOLOGY Co.,Ltd. and Qizhi software (Beijing) Co.,Ltd. (address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)). |
TR01 | Transfer of patent right | Effective date of registration: 2023-07-14. Patentee after: Beijing Hongxiang Technical Service Co.,Ltd. (address after: 1765, floor 17, floor 15, building 3, No. 10 Jiuxianqiao Road, Chaoyang District, Beijing 100015). Patentee before: 3600 Technology Group Co.,Ltd. (address before: 300450 No. 9-3-401, No. 39, Gaoxin 6th Road, Binhai Science Park, Binhai New Area, Tianjin). |
CP03 | Change of name, title or address | Patentee after: Beijing 360 Zhiling Technology Co.,Ltd. (address after: 1765, floor 17, floor 15, building 3, No. 10 Jiuxianqiao Road, Chaoyang District, Beijing 100015; country or region after: China). Patentee before: Beijing Hongxiang Technical Service Co.,Ltd. (address before: 1765, floor 17, floor 15, building 3, No. 10 Jiuxianqiao Road, Chaoyang District, Beijing 100015; country or region before: China). |