CN101739281B - 用于机器集群的并行编程的方法和设备 - Google Patents
用于机器集群的并行编程的方法和设备 Download PDFInfo
- Publication number
- CN101739281B CN101739281B CN200910205412.8A CN200910205412A CN101739281B CN 101739281 B CN101739281 B CN 101739281B CN 200910205412 A CN200910205412 A CN 200910205412A CN 101739281 B CN101739281 B CN 101739281B
- Authority
- CN
- China
- Prior art keywords
- function
- user
- node
- chunk
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000012545 processing Methods 0.000 claims abstract description 75
- 238000005192 partition Methods 0.000 claims abstract description 21
- 239000013598 vector Substances 0.000 claims description 320
- 238000000034 method Methods 0.000 claims description 53
- 238000013507 mapping Methods 0.000 claims description 51
- 238000009826 distribution Methods 0.000 claims description 17
- 230000000977 initiatory effect Effects 0.000 claims description 9
- 230000006870 function Effects 0.000 description 186
- 230000000875 corresponding effect Effects 0.000 description 35
- 230000008569 process Effects 0.000 description 32
- 239000000047 product Substances 0.000 description 21
- 230000009467 reduction Effects 0.000 description 16
- 230000009466 transformation Effects 0.000 description 12
- 238000005315 distribution function Methods 0.000 description 10
- 238000005267 amalgamation Methods 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 9
- 238000004364 calculation method Methods 0.000 description 9
- 238000006073 displacement reaction Methods 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 238000011156 evaluation Methods 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000013480 data collection Methods 0.000 description 4
- 238000003860 storage Methods 0.000 description 4
- 239000006185 dispersion Substances 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 238000013439 planning Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 235000008694 Humulus lupulus Nutrition 0.000 description 2
- 101000855863 Viola biflora Cyclotide vibi-E Proteins 0.000 description 2
- 101000855868 Viola biflora Cyclotide vibi-I Proteins 0.000 description 2
- 101000855869 Viola biflora Cyclotide vibi-J Proteins 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000007726 management method Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000012367 process mapping Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 101000957313 Acanthamoeba polyphaga mimivirus Mitochondrial carrier-like protein L276 Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 244000144992 flock Species 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000013468 resource allocation Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5066—Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformation of program code
- G06F8/41—Compilation
- G06F8/45—Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Multi Processors (AREA)
Abstract
Description
表1-用于排序的用户定义函数 |
Record mergeSort(Record Z1,Record Z2) {new Record Z; //来自记录Z1的下一字符串 String a=Z1.next(); //来自记录Z2的下一字符串 |
String b=Z2.next(); do{ if(a<b){ Z.append(a); a=Z1.next(); } else{ Z.append(b); b=Z2.next(); } }while(!Z1.empty()&&!Z2.empty()); return x; } |
表2-用于对划分的记录数量进行计数的用户定义的函数 |
bloFunc(Iterator records)/**向量分组的记录的列表**/ int count=0; for each record x in records count++ EmitResult(Vector Z,count) |
表格3-用于计算多个中值的用户定义的函数 |
bloFunc(list of records X): p=partition(X) for each x in X for each bracket b if(x in b) cp,b++/**针对等级b,对划分p中的记录进行计数**/ for each bracket b EmitResult(b;p,cp,b) |
表4-用于确定中值的用户定义的函数 |
bloFunc(list of records X): |
p=partition(X)/**p划分**/ sort X by account balance for each bracket b/**b等级**/ if(p==pb)/**划分p的pb等级**/ find rbth value in bracket b/**找到等级b内的排名后的值,该值对应于中值**/ EmitResult(b,rbth balance) |
Claims (24)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/267,142 | 2008-11-07 | ||
US12/267,142 US7970872B2 (en) | 2007-10-01 | 2008-11-07 | Infrastructure for parallel programming of clusters of machines |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101739281A CN101739281A (zh) | 2010-06-16 |
CN101739281B true CN101739281B (zh) | 2015-04-22 |
Family
ID=41664570
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200910205412.8A Expired - Fee Related CN101739281B (zh) | 2008-11-07 | 2009-10-23 | 用于机器集群的并行编程的方法和设备 |
Country Status (4)
Country | Link |
---|---|
US (1) | US7970872B2 (zh) |
EP (1) | EP2184680B1 (zh) |
CN (1) | CN101739281B (zh) |
CA (1) | CA2681154C (zh) |
Families Citing this family (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7970872B2 (en) * | 2007-10-01 | 2011-06-28 | Accenture Global Services Limited | Infrastructure for parallel programming of clusters of machines |
CN102667761B (zh) | 2009-06-19 | 2015-05-27 | 布雷克公司 | 可扩展的集群数据库 |
US8918365B2 (en) * | 2009-06-19 | 2014-12-23 | Blekko, Inc. | Dedicating disks to reading or writing |
US8543596B1 (en) * | 2009-12-17 | 2013-09-24 | Teradata Us, Inc. | Assigning blocks of a file of a distributed file system to processing units of a parallel database management system |
US8392463B2 (en) * | 2010-04-22 | 2013-03-05 | International Business Machines Corporation | GPU enabled database systems |
EP2580633A4 (en) * | 2010-06-11 | 2015-12-30 | Optuminsight Inc | APPARATUSES AND METHODS FOR PARALLEL ANALYTICAL PROCESSING |
US8839214B2 (en) * | 2010-06-30 | 2014-09-16 | Microsoft Corporation | Indexable type transformations |
CN102314460B (zh) | 2010-07-07 | 2014-05-14 | 阿里巴巴集团控股有限公司 | 数据分析方法、系统及服务器 |
US8370316B2 (en) | 2010-07-12 | 2013-02-05 | Sap Ag | Hash-join in parallel computation environments |
US9246914B2 (en) * | 2010-07-16 | 2016-01-26 | Nokia Technologies Oy | Method and apparatus for processing biometric information using distributed computation |
US9489183B2 (en) | 2010-10-12 | 2016-11-08 | Microsoft Technology Licensing, Llc | Tile communication operator |
US9430204B2 (en) | 2010-11-19 | 2016-08-30 | Microsoft Technology Licensing, Llc | Read-only communication operator |
US9507568B2 (en) | 2010-12-09 | 2016-11-29 | Microsoft Technology Licensing, Llc | Nested communication operator |
US9395957B2 (en) | 2010-12-22 | 2016-07-19 | Microsoft Technology Licensing, Llc | Agile communication operator |
US8713039B2 (en) * | 2010-12-23 | 2014-04-29 | Microsoft Corporation | Co-map communication operator |
JP5178852B2 (ja) * | 2011-01-12 | 2013-04-10 | 株式会社東芝 | 情報処理装置およびプログラム |
US9355145B2 (en) * | 2011-01-25 | 2016-05-31 | Hewlett Packard Enterprise Development Lp | User defined function classification in analytical data processing systems |
JP6138701B2 (ja) * | 2011-03-04 | 2017-05-31 | 富士通株式会社 | 分散計算方法及び分散計算システム |
US8589480B2 (en) * | 2011-05-24 | 2013-11-19 | Sony Computer Entertainment America Llc | Automatic performance and capacity measurement for networked servers |
US20120321202A1 (en) * | 2011-06-20 | 2012-12-20 | Michael Benjamin Selkowe Fertik | Identifying information related to a particular entity from electronic sources, using dimensional reduction and quantum clustering |
US9619495B2 (en) * | 2011-07-01 | 2017-04-11 | Hewlett Packard Enterprise Development Lp | Surrogate key generation |
US9053067B2 (en) * | 2011-09-30 | 2015-06-09 | International Business Machines Corporation | Distributed data scalable adaptive map-reduce framework |
US9201690B2 (en) | 2011-10-21 | 2015-12-01 | International Business Machines Corporation | Resource aware scheduling in a distributed computing environment |
US8886651B1 (en) | 2011-12-22 | 2014-11-11 | Reputation.Com, Inc. | Thematic clustering |
CN103186416B (zh) * | 2011-12-29 | 2016-06-22 | 比亚迪股份有限公司 | 构建多任务多分支过程的方法、状态机及执行方法 |
US10636041B1 (en) | 2012-03-05 | 2020-04-28 | Reputation.Com, Inc. | Enterprise reputation evaluation |
US9697490B1 (en) | 2012-03-05 | 2017-07-04 | Reputation.Com, Inc. | Industry review benchmarking |
US20130297624A1 (en) * | 2012-05-07 | 2013-11-07 | Microsoft Corporation | Interoperability between Map-Reduce and Distributed Array Runtimes |
US8924977B2 (en) | 2012-06-18 | 2014-12-30 | International Business Machines Corporation | Sequential cooperation between map and reduce phases to improve data locality |
US11093984B1 (en) | 2012-06-29 | 2021-08-17 | Reputation.Com, Inc. | Determining themes |
US9710357B2 (en) * | 2012-08-04 | 2017-07-18 | Microsoft Technology Licensing, Llc | Function evaluation using lightweight process snapshots |
US9373074B2 (en) * | 2012-10-09 | 2016-06-21 | Qualcomm Incorporated | Method and apparatus for time management and scheduling for sychronous processing on a cluster of processing nodes |
US8892599B2 (en) * | 2012-10-24 | 2014-11-18 | Marklogic Corporation | Apparatus and method for securing preliminary information about database fragments for utilization in mapreduce processing |
US8744866B1 (en) | 2012-12-21 | 2014-06-03 | Reputation.Com, Inc. | Reputation report with recommendation |
US8805699B1 (en) | 2012-12-21 | 2014-08-12 | Reputation.Com, Inc. | Reputation report with score |
CN103176903B (zh) * | 2013-03-12 | 2019-03-29 | 百度在线网络技术(北京)有限公司 | MapReduce分布式系统程序的测试方法及设备 |
US8925099B1 (en) | 2013-03-14 | 2014-12-30 | Reputation.Com, Inc. | Privacy scoring |
CN104077218B (zh) * | 2013-03-29 | 2018-12-14 | 百度在线网络技术(北京)有限公司 | MapReduce分布式系统的测试方法及设备 |
US9354938B2 (en) | 2013-04-10 | 2016-05-31 | International Business Machines Corporation | Sequential cooperation between map and reduce phases to improve data locality |
US9342355B2 (en) | 2013-06-20 | 2016-05-17 | International Business Machines Corporation | Joint optimization of multiple phases in large data processing |
US9703925B1 (en) * | 2013-10-22 | 2017-07-11 | Pivotal Software, Inc. | Rapid processing of biological sequence data |
US10613941B1 (en) * | 2015-09-30 | 2020-04-07 | EMC IP Holding Company LLC | Hybrid NVRAM logging in filesystem namespace |
US10262390B1 (en) * | 2017-04-14 | 2019-04-16 | EMC IP Holding Company LLC | Managing access to a resource pool of graphics processing units under fine grain control |
US10275851B1 (en) | 2017-04-25 | 2019-04-30 | EMC IP Holding Company LLC | Checkpointing for GPU-as-a-service in cloud computing environment |
US10325343B1 (en) | 2017-08-04 | 2019-06-18 | EMC IP Holding Company LLC | Topology aware grouping and provisioning of GPU resources in GPU-as-a-Service platform |
US10698766B2 (en) | 2018-04-18 | 2020-06-30 | EMC IP Holding Company LLC | Optimization of checkpoint operations for deep learning computing |
CN110795228B (zh) | 2018-08-03 | 2023-08-25 | 伊姆西Ip控股有限责任公司 | 用于训练深度学习模型的方法和制品、以及计算系统 |
US10776164B2 (en) | 2018-11-30 | 2020-09-15 | EMC IP Holding Company LLC | Dynamic composition of data pipeline in accelerator-as-a-service computing environment |
CN110209488B (zh) * | 2019-06-10 | 2021-12-07 | 北京达佳互联信息技术有限公司 | 任务执行方法、装置、设备、系统及存储介质 |
US11113064B2 (en) * | 2019-11-27 | 2021-09-07 | Sas Institute Inc. | Automated concurrency and repetition with minimal syntax |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080091842A1 (en) * | 2001-02-24 | 2008-04-17 | International Business Machines Corporation | Optimized scalable network switch |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3269849B2 (ja) | 1992-05-29 | 2002-04-02 | 株式会社日立製作所 | 並列データベース処理システムとその検索方法 |
JP3266351B2 (ja) | 1993-01-20 | 2002-03-18 | 株式会社日立製作所 | データベース管理システムおよび問合せの処理方法 |
JP3747525B2 (ja) | 1996-08-28 | 2006-02-22 | 株式会社日立製作所 | 並列データベースシステム検索方法 |
US6081801A (en) | 1997-06-30 | 2000-06-27 | International Business Machines Corporation | Shared nothing parallel execution of procedural constructs in SQL |
US7197505B2 (en) * | 2000-12-22 | 2007-03-27 | Star Bridge Systems, Inc. | Multi-dimensional recursive wavefront behavioral synthesis |
EP1454492A4 (en) * | 2001-07-18 | 2005-01-05 | Polycom Inc | SYSTEM AND METHOD FOR IMPROVING THE QUALITY OF VIDEO COMMUNICATION VIA A PACKET BASED NETWORK |
US6968335B2 (en) | 2002-11-14 | 2005-11-22 | Sesint, Inc. | Method and system for parallel processing of database queries |
US6874708B2 (en) | 2003-02-13 | 2005-04-05 | Illinois Tool Works Inc. | Automatic air-assisted manifold mounted gun |
US8630973B2 (en) | 2004-05-03 | 2014-01-14 | Sap Ag | Distributed processing system for calculations based on objects from massive databases |
US20070174290A1 (en) | 2006-01-19 | 2007-07-26 | International Business Machines Corporation | System and architecture for enterprise-scale, parallel data mining |
US7865898B2 (en) * | 2006-01-27 | 2011-01-04 | Oracle America, Inc. | Repartitioning parallel SVM computations using dynamic timeout |
US7970872B2 (en) * | 2007-10-01 | 2011-06-28 | Accenture Global Services Limited | Infrastructure for parallel programming of clusters of machines |
US8214325B2 (en) * | 2008-11-20 | 2012-07-03 | Sap Ag | Federating business event data within an enterprise network |
-
2008
- 2008-11-07 US US12/267,142 patent/US7970872B2/en not_active Expired - Fee Related
-
2009
- 2009-10-05 CA CA2681154A patent/CA2681154C/en active Active
- 2009-10-23 CN CN200910205412.8A patent/CN101739281B/zh not_active Expired - Fee Related
- 2009-11-05 EP EP09252559.1A patent/EP2184680B1/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080091842A1 (en) * | 2001-02-24 | 2008-04-17 | International Business Machines Corporation | Optimized scalable network switch |
Non-Patent Citations (1)
Title |
---|
GridBatch:Cloud Computing for Large-Scale Data-Intensive Batch Application;HUAN LIU 等;《CLUSTER COMPUTING AND THE GRID》;20080519;295-303 * |
Also Published As
Publication number | Publication date |
---|---|
EP2184680B1 (en) | 2018-09-26 |
US7970872B2 (en) | 2011-06-28 |
CN101739281A (zh) | 2010-06-16 |
CA2681154A1 (en) | 2010-05-07 |
CA2681154C (en) | 2014-07-15 |
EP2184680A2 (en) | 2010-05-12 |
US20090089560A1 (en) | 2009-04-02 |
EP2184680A3 (en) | 2010-06-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101739281B (zh) | 用于机器集群的并行编程的方法和设备 | |
CN101403978B (zh) | 用于机器集群的并行编程的基础构造 | |
Böse et al. | Probabilistic demand forecasting at scale | |
US20220066772A1 (en) | System and Method for Code and Data Versioning in Computerized Data Modeling and Analysis | |
US7680765B2 (en) | Iterate-aggregate query parallelization | |
CN103177057B (zh) | 用于内存列存储数据库的多核算法 | |
US7849114B2 (en) | Method, system, and program product for generating a virtual database | |
CN103177059B (zh) | 用于数据库计算引擎的分离处理路径 | |
US11120361B1 (en) | Training data routing and prediction ensembling at time series prediction system | |
JP6376865B2 (ja) | 並列ツリー・ベースの予測のための、コンピュータにより実行される方法、ストレージ媒体、およびコンピュータ・システム | |
US8060544B2 (en) | Representation of data transformation processes for parallelization | |
US20090077011A1 (en) | System and method for executing compute-intensive database user-defined programs on an attached high-performance parallel computer | |
US20120317059A1 (en) | System and method for space and resource optimization | |
JP6296442B2 (ja) | インメモリデータベースにおける高効率のゲノムリードアラインメント | |
CN107077480A (zh) | 基于查询需求自适应地从当前时间的行存储数据库中构建列存储数据库的方法和系统 | |
CN104205039A (zh) | 使用兴趣驱动数据管线进行数据分析的兴趣驱动商业智能系统和方法 | |
CN101506804A (zh) | 用于在大数据集分析期间维持一致性的方法和装置 | |
CN104823185A (zh) | 用于兴趣驱动的商业智能系统中的兴趣驱动的数据共享的系统和方法 | |
Huynh et al. | An efficient approach for mining sequential patterns using multiple threads on very large databases | |
CN102426582A (zh) | 数据操作管理装置和数据操作管理方法 | |
US20160203409A1 (en) | Framework for calculating grouped optimization algorithms within a distributed data store | |
Furtado | A survey of parallel and distributed data warehouses | |
Böhm et al. | Demaq/Transscale: automated distribution and scalability for declarative applications | |
Moukhi et al. | Towards a new method for designing multidimensional models | |
Ma et al. | dmapply: A functional primitive to express distributed machine learning algorithms in R |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: ACCENTURE INTERNATIONAL GMBH Free format text: FORMER OWNER: ACCENTURE GLOBAL SERVICES GMBH Effective date: 20101208 Owner name: ACCENTURE GLOBAL SERVICES GMBH Free format text: FORMER OWNER: ACCENTURE INTERNATIONAL GMBH Effective date: 20101208 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: SCHAFFHAUSEN, SWITZERLAND TO: LUXEMBOURG, LUXEMBOURG Free format text: CORRECT: ADDRESS; FROM: LUXEMBOURG, LUXEMBOURG TO: DUBLIN, IRELAND |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20101208 Address after: Dublin, Ireland Applicant after: ACCENTURE GLOBAL SERVICES Ltd. Address before: Luxemburg Luxemburg Applicant before: Accenture international LLC Effective date of registration: 20101208 Address after: Luxemburg Luxemburg Applicant after: Accenture international LLC Address before: Schaffhausen Applicant before: ACCENTURE GLOBAL SERVICES Ltd. |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20150422 |