CN105550318A - 一种基于Spark大数据处理平台的查询方法 - Google Patents
一种基于Spark大数据处理平台的查询方法 Download PDFInfo
- Publication number
- CN105550318A CN105550318A CN201510930909.1A CN201510930909A CN105550318A CN 105550318 A CN105550318 A CN 105550318A CN 201510930909 A CN201510930909 A CN 201510930909A CN 105550318 A CN105550318 A CN 105550318A
- Authority
- CN
- China
- Prior art keywords
- result
- spark
- task
- data processing
- processing platform
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 58
- 238000004364 calculation method Methods 0.000 claims abstract description 27
- 230000008569 process Effects 0.000 claims description 24
- 238000012216 screening Methods 0.000 claims description 2
- 230000002452 interceptive effect Effects 0.000 claims 1
- 230000004044 response Effects 0.000 abstract description 12
- 230000006870 function Effects 0.000 description 13
- 238000010586 diagram Methods 0.000 description 4
- 101000932776 Homo sapiens Uncharacterized protein C1orf115 Proteins 0.000 description 3
- 101150039208 KCNK3 gene Proteins 0.000 description 3
- 102100025480 Uncharacterized protein C1orf115 Human genes 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000007599 discharging Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- WSZPRLKJOJINEP-UHFFFAOYSA-N 1-fluoro-2-[(2-fluoro-2,2-dinitroethoxy)methoxy]-1,1-dinitroethane Chemical compound [O-][N+](=O)C(F)([N+]([O-])=O)COCOCC(F)([N+]([O-])=O)[N+]([O-])=O WSZPRLKJOJINEP-UHFFFAOYSA-N 0.000 description 1
- 101150083764 KCNK9 gene Proteins 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2453—Query optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2471—Distributed queries
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Software Systems (AREA)
Abstract
Description
Claims (6)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510930909.1A CN105550318B (zh) | 2015-12-15 | 2015-12-15 | 一种基于Spark大数据处理平台的查询方法 |
PCT/CN2016/095353 WO2017101475A1 (zh) | 2015-12-15 | 2016-08-15 | 一种基于Spark大数据处理平台的查询方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510930909.1A CN105550318B (zh) | 2015-12-15 | 2015-12-15 | 一种基于Spark大数据处理平台的查询方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105550318A true CN105550318A (zh) | 2016-05-04 |
CN105550318B CN105550318B (zh) | 2017-12-26 |
Family
ID=55829507
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510930909.1A Expired - Fee Related CN105550318B (zh) | 2015-12-15 | 2015-12-15 | 一种基于Spark大数据处理平台的查询方法 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105550318B (zh) |
WO (1) | WO2017101475A1 (zh) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106372127A (zh) * | 2016-08-24 | 2017-02-01 | 云南大学 | 基于Spark的大规模图数据的多样性图排序方法 |
WO2017101475A1 (zh) * | 2015-12-15 | 2017-06-22 | 深圳市华讯方舟软件技术有限公司 | 一种基于Spark大数据处理平台的查询方法 |
CN106909621A (zh) * | 2017-01-17 | 2017-06-30 | 中国科学院信息工程研究所 | 一种提速的基于ipc编码的查询处理方法 |
CN107480202A (zh) * | 2017-07-18 | 2017-12-15 | 湖南大学 | 一种用于多并行处理框架的数据处理方法及装置 |
CN107609130A (zh) * | 2017-09-18 | 2018-01-19 | 链家网(北京)科技有限公司 | 一种选择数据查询引擎的方法及服务器 |
CN108062251A (zh) * | 2018-01-09 | 2018-05-22 | 福建星瑞格软件有限公司 | 一种服务器资源回收方法以及计算机设备 |
CN108536727A (zh) * | 2018-02-24 | 2018-09-14 | 国家计算机网络与信息安全管理中心 | 一种数据检索方法和装置 |
CN108874897A (zh) * | 2018-05-23 | 2018-11-23 | 新华三大数据技术有限公司 | 数据查询方法及装置 |
CN110019497A (zh) * | 2017-08-07 | 2019-07-16 | 北京国双科技有限公司 | 一种数据读取方法及装置 |
CN110109747A (zh) * | 2019-05-21 | 2019-08-09 | 北京百度网讯科技有限公司 | 基于Apache Spark的数据交换方法及系统、服务器 |
CN110659292A (zh) * | 2019-09-21 | 2020-01-07 | 北京海致星图科技有限公司 | 一种基于Spark和Ignite的分布式实时图构建和查询的方法及系统 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109582706A (zh) * | 2018-11-14 | 2019-04-05 | 重庆邮电大学 | 基于Spark大数据平台的邻域密度不平衡数据混合采样方法 |
CN112612584A (zh) * | 2020-12-16 | 2021-04-06 | 远光软件股份有限公司 | 任务调度方法、装置、存储介质和电子设备 |
CN113392140B (zh) * | 2021-06-11 | 2023-05-09 | 上海达梦数据库有限公司 | 一种数据排序方法、装置、电子设备及存储介质 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102799622A (zh) * | 2012-06-19 | 2012-11-28 | 北京大学 | 基于MapReduce扩展框架的分布式SQL查询方法 |
US20130346988A1 (en) * | 2012-06-22 | 2013-12-26 | Microsoft Corporation | Parallel data computing optimization |
CN103995827A (zh) * | 2014-04-10 | 2014-08-20 | 北京大学 | MapReduce计算框架中的高性能排序方法 |
CN104239501A (zh) * | 2014-09-10 | 2014-12-24 | 中国电子科技集团公司第二十八研究所 | 一种基于Spark的海量视频语义标注方法 |
US9135559B1 (en) * | 2015-03-20 | 2015-09-15 | TappingStone Inc. | Methods and systems for predictive engine evaluation, tuning, and replay of engine performance |
CN104951509A (zh) * | 2015-05-25 | 2015-09-30 | 中国科学院信息工程研究所 | 一种大数据在线交互式查询方法及系统 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105550318B (zh) * | 2015-12-15 | 2017-12-26 | 深圳市华讯方舟软件技术有限公司 | 一种基于Spark大数据处理平台的查询方法 |
-
2015
- 2015-12-15 CN CN201510930909.1A patent/CN105550318B/zh not_active Expired - Fee Related
-
2016
- 2016-08-15 WO PCT/CN2016/095353 patent/WO2017101475A1/zh active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102799622A (zh) * | 2012-06-19 | 2012-11-28 | 北京大学 | 基于MapReduce扩展框架的分布式SQL查询方法 |
US20130346988A1 (en) * | 2012-06-22 | 2013-12-26 | Microsoft Corporation | Parallel data computing optimization |
CN103995827A (zh) * | 2014-04-10 | 2014-08-20 | 北京大学 | MapReduce计算框架中的高性能排序方法 |
CN104239501A (zh) * | 2014-09-10 | 2014-12-24 | 中国电子科技集团公司第二十八研究所 | 一种基于Spark的海量视频语义标注方法 |
US9135559B1 (en) * | 2015-03-20 | 2015-09-15 | TappingStone Inc. | Methods and systems for predictive engine evaluation, tuning, and replay of engine performance |
CN104951509A (zh) * | 2015-05-25 | 2015-09-30 | 中国科学院信息工程研究所 | 一种大数据在线交互式查询方法及系统 |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017101475A1 (zh) * | 2015-12-15 | 2017-06-22 | 深圳市华讯方舟软件技术有限公司 | 一种基于Spark大数据处理平台的查询方法 |
CN106372127B (zh) * | 2016-08-24 | 2019-05-03 | 云南大学 | 基于Spark的大规模图数据的多样性图排序方法 |
CN106372127A (zh) * | 2016-08-24 | 2017-02-01 | 云南大学 | 基于Spark的大规模图数据的多样性图排序方法 |
CN106909621A (zh) * | 2017-01-17 | 2017-06-30 | 中国科学院信息工程研究所 | 一种提速的基于ipc编码的查询处理方法 |
CN107480202A (zh) * | 2017-07-18 | 2017-12-15 | 湖南大学 | 一种用于多并行处理框架的数据处理方法及装置 |
CN107480202B (zh) * | 2017-07-18 | 2020-06-02 | 湖南大学 | 一种用于多并行处理框架的数据处理方法及装置 |
CN110019497A (zh) * | 2017-08-07 | 2019-07-16 | 北京国双科技有限公司 | 一种数据读取方法及装置 |
CN107609130A (zh) * | 2017-09-18 | 2018-01-19 | 链家网(北京)科技有限公司 | 一种选择数据查询引擎的方法及服务器 |
CN108062251A (zh) * | 2018-01-09 | 2018-05-22 | 福建星瑞格软件有限公司 | 一种服务器资源回收方法以及计算机设备 |
CN108062251B (zh) * | 2018-01-09 | 2023-02-28 | 福建星瑞格软件有限公司 | 一种服务器资源回收方法以及计算机设备 |
CN108536727A (zh) * | 2018-02-24 | 2018-09-14 | 国家计算机网络与信息安全管理中心 | 一种数据检索方法和装置 |
CN108874897A (zh) * | 2018-05-23 | 2018-11-23 | 新华三大数据技术有限公司 | 数据查询方法及装置 |
CN108874897B (zh) * | 2018-05-23 | 2019-09-13 | 新华三大数据技术有限公司 | 数据查询方法及装置 |
CN110109747A (zh) * | 2019-05-21 | 2019-08-09 | 北京百度网讯科技有限公司 | 基于Apache Spark的数据交换方法及系统、服务器 |
CN110109747B (zh) * | 2019-05-21 | 2021-05-14 | 北京百度网讯科技有限公司 | 基于Apache Spark的数据交换方法及系统、服务器 |
CN110659292A (zh) * | 2019-09-21 | 2020-01-07 | 北京海致星图科技有限公司 | 一种基于Spark和Ignite的分布式实时图构建和查询的方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN105550318B (zh) | 2017-12-26 |
WO2017101475A1 (zh) | 2017-06-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105550318A (zh) | 一种基于Spark大数据处理平台的查询方法 | |
US11681547B2 (en) | File operation task optimization | |
Yan et al. | Blogel: A block-centric framework for distributed computation on real-world graphs | |
US8209703B2 (en) | Apparatus and method for dataflow execution in a distributed environment using directed acyclic graph and prioritization of sub-dataflow tasks | |
He et al. | Comet: batched stream processing for data intensive distributed computing | |
CN110908788B (zh) | 基于Spark Streaming的数据处理方法、装置、计算机设备及存储介质 | |
CN103631870B (zh) | 一种用于大规模分布式数据处理的系统及其方法 | |
CN105573840B (zh) | 工作流运行期的事件处理方法和装置 | |
US20200026788A1 (en) | Adaptive granule generation for parallel queries with run-time data pruning | |
CN104685499A (zh) | 过滤/投影操作的硬件实现 | |
CN111324606B (zh) | 数据分片的方法及装置 | |
CN108509280B (zh) | 一种基于推送模型的分布式计算集群本地性调度方法 | |
US11487555B2 (en) | Running PBS jobs in kubernetes | |
CN106202092A (zh) | 数据处理的方法及系统 | |
CN112163048A (zh) | 基于ClickHouse实现OLAP分析的方法、装置 | |
CN106383746A (zh) | 大数据处理系统的配置参数确定方法和装置 | |
CN106874109A (zh) | 一种分布式作业分发处理方法及系统 | |
US20240061712A1 (en) | Method, apparatus, and system for creating training task on ai training platform, and medium | |
CN105138405A (zh) | 基于待释放资源列表的MapReduce任务推测执行方法和装置 | |
CN110134646B (zh) | 知识平台服务数据存储与集成方法及系统 | |
Bardhan et al. | The Anatomy of MapReduce Jobs, Scheduling, and Performance Challenges. | |
CN117056303B (zh) | 适用于军事行动大数据的数据存储方法及装置 | |
CN111858739A (zh) | 一种基于Mapreduce的数据汇聚方法及系统 | |
US20230205770A1 (en) | Opportunistic cloud data platform pipeline scheduler | |
Pu et al. | MPEFT: A novel task scheduling method for workflows |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 518102 Guangdong Province, Baoan District Xixiang street Shenzhen City Tian Yi Lu Chen Tian Bao Industrial District thirty-seventh building 3 floor Applicant after: SHENZHEN HUAXUN FANGZHOU SOFTWARE TECHNOLOGY Co.,Ltd. Applicant after: CHINA COMMUNICATION TECHNOLOGY Co.,Ltd. Address before: 518102 Guangdong Province, Baoan District Xixiang street Shenzhen City Tian Yi Lu Chen Tian Bao Industrial District thirty-seventh building 3 floor Applicant before: SHENZHEN HUAXUN FANGZHOU SOFTWARE TECHNOLOGY Co.,Ltd. Applicant before: CHINA COMMUNICATION TECHNOLOGY Co.,Ltd. |
|
COR | Change of bibliographic data | ||
CB03 | Change of inventor or designer information | ||
CB03 | Change of inventor or designer information |
Inventor after: Wan Xiuyuan Inventor after: Zhao Shukai Inventor after: Fan Congming Inventor before: Wan Xiuyuan |
|
GR01 | Patent grant | ||
PP01 | Preservation of patent right | ||
PP01 | Preservation of patent right |
Effective date of registration: 20210630 Granted publication date: 20171226 |
|
PD01 | Discharge of preservation of patent | ||
PD01 | Discharge of preservation of patent |
Date of cancellation: 20230421 Granted publication date: 20171226 |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230606 Address after: 518102 room 404, building 37, chentian Industrial Zone, chentian community, Xixiang street, Bao'an District, Shenzhen City, Guangdong Province Patentee after: Shenzhen Huaxun ark Photoelectric Technology Co.,Ltd. Patentee after: SHENZHEN HUAXUN FANGZHOU SOFTWARE TECHNOLOGY Co.,Ltd. Address before: 518102 3rd floor, building 37, chentian Industrial Zone, Baotian 1st Road, Xixiang street, Bao'an District, Shenzhen City, Guangdong Province Patentee before: SHENZHEN HUAXUN FANGZHOU SOFTWARE TECHNOLOGY Co.,Ltd. Patentee before: CHINA COMMUNICATION TECHNOLOGY Co.,Ltd. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20171226 |