CN110955732B - 一种用于在Spark环境中实现分区负载均衡的方法和系统 - Google Patents
一种用于在Spark环境中实现分区负载均衡的方法和系统 Download PDFInfo
- Publication number
- CN110955732B CN110955732B CN201911294970.6A CN201911294970A CN110955732B CN 110955732 B CN110955732 B CN 110955732B CN 201911294970 A CN201911294970 A CN 201911294970A CN 110955732 B CN110955732 B CN 110955732B
- Authority
- CN
- China
- Prior art keywords
- partition
- key
- module
- hash table
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/27—Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
- G06F16/278—Data partitioning, e.g. horizontal or vertical partitioning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2255—Hash tables
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911294970.6A CN110955732B (zh) | 2019-12-16 | 2019-12-16 | 一种用于在Spark环境中实现分区负载均衡的方法和系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911294970.6A CN110955732B (zh) | 2019-12-16 | 2019-12-16 | 一种用于在Spark环境中实现分区负载均衡的方法和系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110955732A CN110955732A (zh) | 2020-04-03 |
CN110955732B true CN110955732B (zh) | 2020-12-01 |
Family
ID=69981885
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911294970.6A Active CN110955732B (zh) | 2019-12-16 | 2019-12-16 | 一种用于在Spark环境中实现分区负载均衡的方法和系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110955732B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112000467A (zh) * | 2020-07-24 | 2020-11-27 | 广东技术师范大学 | 一种数据倾斜处理方法、装置、终端设备及存储介质 |
CN111966490A (zh) * | 2020-07-24 | 2020-11-20 | 广东技术师范大学 | 一种Spark分区负载均衡方法 |
CN113778657B (zh) * | 2020-09-24 | 2024-04-16 | 北京沃东天骏信息技术有限公司 | 一种数据处理方法和装置 |
CN114780541B (zh) * | 2022-04-01 | 2024-04-12 | 港珠澳大桥管理局 | 微批流处理系统中的数据分区方法、装置、设备和介质 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10043009B2 (en) * | 2014-09-24 | 2018-08-07 | Intel Corporation | Technologies for software basic block similarity analysis |
CN108536808B (zh) * | 2018-04-04 | 2022-04-29 | 国家计算机网络与信息安全管理中心 | 一种基于Spark计算框架的数据获取方法和装置 |
CN109388615B (zh) * | 2018-09-28 | 2022-04-01 | 智器云南京信息科技有限公司 | 基于Spark的任务处理方法及系统 |
-
2019
- 2019-12-16 CN CN201911294970.6A patent/CN110955732B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN110955732A (zh) | 2020-04-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110955732B (zh) | 一种用于在Spark环境中实现分区负载均衡的方法和系统 | |
Cheng et al. | Network-aware locality scheduling for distributed data operators in data centers | |
Rödiger et al. | Locality-sensitive operators for parallel main-memory database clusters | |
US11475006B2 (en) | Query and change propagation scheduling for heterogeneous database systems | |
WO2018054221A1 (en) | Pipeline dependent tree query optimizer and scheduler | |
CN106599091B (zh) | 基于键值存储的rdf图结构存储和索引方法 | |
CN110659278A (zh) | 基于cpu-gpu异构架构的图数据分布式处理系统 | |
Cederman et al. | Concurrent data structures for efficient streaming aggregation | |
CN108536824B (zh) | 一种数据处理方法及装置 | |
Kang et al. | The processing-in-memory model | |
CN111464451B (zh) | 一种数据流等值连接优化方法、系统及电子设备 | |
CN108319604B (zh) | 一种hive中大小表关联的优化方法 | |
CN116756150B (zh) | 一种Mpp数据库大表关联加速方法 | |
CN112445776B (zh) | 基于Presto的动态分桶方法、系统、设备及可读存储介质 | |
KR101914784B1 (ko) | 쿼드 트리에 기반한 스카이라인 질의 방법 | |
Gabert et al. | Elga: elastic and scalable dynamic graph analysis | |
CN114443236A (zh) | 一种任务处理方法、装置、系统、设备及介质 | |
Kalnis et al. | Mizan: Optimizing graph mining in large parallel systems | |
CN111831425B (zh) | 一种数据处理方法、装置及设备 | |
Salah et al. | Lazy-Merge: A Novel Implementation for Indexed Parallel $ K $-Way In-Place Merging | |
RU2490702C1 (ru) | Способ ускорения обработки множественных запросов типа select к rdf базе данных с помощью графического процессора | |
Lu et al. | Improving mapreduce performance by using a new partitioner in yarn | |
Gong et al. | Accelerating large-scale prioritized graph computations by hotness balanced partition | |
Hamidzadeb et al. | Dynamic scheduling of real-time tasks, by assignment | |
Zhang et al. | Improving performance for geo-distributed data process in wide-area |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Li Kenli Inventor after: Liu Chubo Inventor after: Cao Ronghui Inventor after: Liu Xiang Inventor after: Tang Zhuo Inventor after: Du Lifan Inventor after: He Kailin Inventor after: Li Wen Inventor after: Zhang Xuedong Inventor after: Yang Wangdong Inventor after: Zhou Xu Inventor before: Tang Zhuo Inventor before: Liu Chubo Inventor before: Cao Ronghui Inventor before: Liu Xiang Inventor before: Li Kenli Inventor before: Du Lifan Inventor before: He Kailin Inventor before: Li Wen Inventor before: Zhang Xuedong Inventor before: Yang Wangdong Inventor before: Zhou Xu |
|
CB03 | Change of inventor or designer information |