CN103403700B - 用于在mapreduce环境中处理机器学习算法的系统和方法 - Google Patents

用于在mapreduce环境中处理机器学习算法的系统和方法 Download PDF

Info

Publication number
CN103403700B
CN103403700B CN201280010904.1A CN201280010904A CN103403700B CN 103403700 B CN103403700 B CN 103403700B CN 201280010904 A CN201280010904 A CN 201280010904A CN 103403700 B CN103403700 B CN 103403700B
Authority
CN
China
Prior art keywords
execution plan
statement
mapreduce
determining
computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201280010904.1A
Other languages
English (en)
Chinese (zh)
Other versions
CN103403700A (zh
Inventor
S·沃伊特亚娜桑
田媛媛
A·戈汀
D·R·博迪克
E·P·佩德纳特
B·赖因瓦尔德
V·辛杜瓦纳
S·塔提孔达
R·克里希纳穆尔塞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oriental Concept Ltd
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN103403700A publication Critical patent/CN103403700A/zh
Application granted granted Critical
Publication of CN103403700B publication Critical patent/CN103403700B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/42Syntactic analysis
    • G06F8/427Parsing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Devices For Executing Special Programs (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Complex Calculations (AREA)
CN201280010904.1A 2011-03-01 2012-02-29 用于在mapreduce环境中处理机器学习算法的系统和方法 Active CN103403700B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/038,086 2011-03-01
US13/038,086 US8612368B2 (en) 2011-03-01 2011-03-01 Systems and methods for processing machine learning algorithms in a MapReduce environment
PCT/CA2012/050123 WO2012116449A1 (en) 2011-03-01 2012-02-29 Systems and methods for processing machine learning algorithms in a mapreduce environment

Publications (2)

Publication Number Publication Date
CN103403700A CN103403700A (zh) 2013-11-20
CN103403700B true CN103403700B (zh) 2016-02-03

Family

ID=46753913

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280010904.1A Active CN103403700B (zh) 2011-03-01 2012-02-29 用于在mapreduce环境中处理机器学习算法的系统和方法

Country Status (6)

Country Link
US (1) US8612368B2 (https=)
JP (1) JP5705338B2 (https=)
CN (1) CN103403700B (https=)
DE (1) DE112012000628T5 (https=)
GB (1) GB2502020A (https=)
WO (1) WO2012116449A1 (https=)

Families Citing this family (87)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8677366B2 (en) * 2011-05-31 2014-03-18 International Business Machines Corporation Systems and methods for processing hierarchical data in a map-reduce framework
US9361323B2 (en) * 2011-10-04 2016-06-07 International Business Machines Corporation Declarative specification of data integration workflows for execution on parallel processing platforms
US9201690B2 (en) 2011-10-21 2015-12-01 International Business Machines Corporation Resource aware scheduling in a distributed computing environment
US8924977B2 (en) 2012-06-18 2014-12-30 International Business Machines Corporation Sequential cooperation between map and reduce phases to improve data locality
US9471390B2 (en) 2013-01-16 2016-10-18 International Business Machines Corporation Scheduling mapreduce jobs in a cluster of dynamically available servers
US9152469B2 (en) 2013-01-28 2015-10-06 Hewlett-Packard Development Company, L.P. Optimizing execution and resource usage in large scale computing
CN103106183A (zh) * 2013-01-29 2013-05-15 福建天晴数码有限公司 基于mapreduce的大规模稀疏矩阵乘法运算的方法
US9339188B2 (en) 2013-03-04 2016-05-17 James Proud Methods from monitoring health, wellness and fitness with feedback
US9149189B2 (en) 2013-03-04 2015-10-06 Hello, Inc. User or patient monitoring methods using one or more analysis tools
US9392939B2 (en) 2013-03-04 2016-07-19 Hello Inc. Methods using a monitoring device to monitor individual activities, behaviors or habit information and communicate with a database with corresponding individual base information for comparison
US9345404B2 (en) 2013-03-04 2016-05-24 Hello Inc. Mobile device that monitors an individuals activities, behaviors, habits or health parameters
US9424508B2 (en) 2013-03-04 2016-08-23 Hello Inc. Wearable device with magnets having first and second polarities
US9298882B2 (en) 2013-03-04 2016-03-29 Hello Inc. Methods using patient monitoring devices with unique patient IDs and a telemetry system
US9367793B2 (en) 2013-03-04 2016-06-14 Hello Inc. Wearable device with magnets distanced from exterior surfaces of the wearable device
US9320434B2 (en) 2013-03-04 2016-04-26 Hello Inc. Patient monitoring systems and messages that send alerts to patients only when the patient is awake
US9704209B2 (en) 2013-03-04 2017-07-11 Hello Inc. Monitoring system and device with sensors and user profiles based on biometric user information
US9420856B2 (en) 2013-03-04 2016-08-23 Hello Inc. Wearable device with adjacent magnets magnetized in different directions
US20130281801A1 (en) 2013-03-04 2013-10-24 Hello Inc. System using patient monitoring devices with unique patient ID's and a telemetry system
US9526422B2 (en) 2013-03-04 2016-12-27 Hello Inc. System for monitoring individuals with a monitoring device, telemetry system, activity manager and a feedback system
US9532716B2 (en) 2013-03-04 2017-01-03 Hello Inc. Systems using lifestyle database analysis to provide feedback
US9427160B2 (en) 2013-03-04 2016-08-30 Hello Inc. Wearable device with overlapping ends coupled by magnets positioned in the wearable device by an undercut
US9427189B2 (en) 2013-03-04 2016-08-30 Hello Inc. Monitoring system and device with sensors that are responsive to skin pigmentation
US9420857B2 (en) 2013-03-04 2016-08-23 Hello Inc. Wearable device with interior frame
US9445651B2 (en) 2013-03-04 2016-09-20 Hello Inc. Wearable device with overlapping ends coupled by magnets
US9398854B2 (en) 2013-03-04 2016-07-26 Hello Inc. System with a monitoring device that monitors individual activities, behaviors or habit information and communicates with a database with corresponding individual base information for comparison
US9553486B2 (en) 2013-03-04 2017-01-24 Hello Inc. Monitoring system and device with sensors that is remotely powered
US9432091B2 (en) 2013-03-04 2016-08-30 Hello Inc. Telemetry system with wireless power receiver and monitoring devices
US9430938B2 (en) 2013-03-04 2016-08-30 Hello Inc. Monitoring device with selectable wireless communication
US9436903B2 (en) 2013-03-04 2016-09-06 Hello Inc. Wearable device with magnets with a defined distance between adjacent magnets
US9357922B2 (en) 2013-03-04 2016-06-07 Hello Inc. User or patient monitoring systems with one or more analysis tools
US9414651B2 (en) 2013-03-04 2016-08-16 Hello Inc. Wearable device with overlapping ends coupled by magnets operating in a temperature range of 200° F. to 400° F.
US9361572B2 (en) 2013-03-04 2016-06-07 Hello Inc. Wearable device with magnets positioned at opposing ends and overlapped from one side to another
US9848776B2 (en) 2013-03-04 2017-12-26 Hello Inc. Methods using activity manager for monitoring user activity
US9634921B2 (en) 2013-03-04 2017-04-25 Hello Inc. Wearable device coupled by magnets positioned in a frame in an interior of the wearable device with at least one electronic circuit
US9330561B2 (en) 2013-03-04 2016-05-03 Hello Inc. Remote communication systems and methods for communicating with a building gateway control to control building systems and elements
US9345403B2 (en) 2013-03-04 2016-05-24 Hello Inc. Wireless monitoring system with activity manager for monitoring user activity
US9159223B2 (en) 2013-03-04 2015-10-13 Hello, Inc. User monitoring device configured to be in communication with an emergency response system or team
US9737214B2 (en) 2013-03-04 2017-08-22 Hello Inc. Wireless monitoring of patient exercise and lifestyle
US9662015B2 (en) 2013-03-04 2017-05-30 Hello Inc. System or device with wearable devices having one or more sensors with assignment of a wearable device user identifier to a wearable device user
US9582748B2 (en) 2013-03-04 2017-02-28 Hello Inc. Base charging station for monitoring device
US9204798B2 (en) 2013-03-04 2015-12-08 Hello, Inc. System for monitoring health, wellness and fitness with feedback
US9530089B2 (en) 2013-03-04 2016-12-27 Hello Inc. Wearable device with overlapping ends coupled by magnets of a selected width, length and depth
US9406220B2 (en) 2013-03-04 2016-08-02 Hello Inc. Telemetry system with tracking receiver devices
US9354938B2 (en) 2013-04-10 2016-05-31 International Business Machines Corporation Sequential cooperation between map and reduce phases to improve data locality
US9342355B2 (en) 2013-06-20 2016-05-17 International Business Machines Corporation Joint optimization of multiple phases in large data processing
US10009581B2 (en) 2015-01-02 2018-06-26 Fitbit, Inc. Room monitoring device
US9610030B2 (en) 2015-01-23 2017-04-04 Hello Inc. Room monitoring device and sleep analysis methods
US9993197B2 (en) 2013-06-21 2018-06-12 Fitbit, Inc. Patient monitoring systems and messages that send alerts to patients only when the patient is awake
US10058290B1 (en) 2013-06-21 2018-08-28 Fitbit, Inc. Monitoring device with voice interaction
US9993166B1 (en) 2013-06-21 2018-06-12 Fitbit, Inc. Monitoring device using radar and measuring motion with a non-contact device
US10004451B1 (en) 2013-06-21 2018-06-26 Fitbit, Inc. User monitoring system
US9965512B2 (en) 2013-06-25 2018-05-08 Sap Se Operators for constants in aggregated formulas
KR20150033453A (ko) 2013-09-24 2015-04-01 주식회사 엘지씨엔에스 빅데이터 처리 방법, 이를 수행하는 빅데이터 처리 장치 및 이를 저장하는 기록매체
IN2013CH05422A (https=) * 2013-11-26 2015-05-29 Inmobi Pte Ltd
US9697475B1 (en) 2013-12-12 2017-07-04 Google Inc. Additive context model for entity resolution
US9910860B2 (en) 2014-02-06 2018-03-06 International Business Machines Corporation Split elimination in MapReduce systems
US20150278907A1 (en) * 2014-03-27 2015-10-01 Microsoft Corporation User Inactivity Aware Recommendation System
US9684493B2 (en) 2014-06-02 2017-06-20 International Business Machines Corporation R-language integration with a declarative machine learning language
US11094015B2 (en) 2014-07-11 2021-08-17 BMLL Technologies, Ltd. Data access and processing system
CN105302536A (zh) * 2014-07-31 2016-02-03 国际商业机器公司 MapReduce应用的相关参数的配置方法和装置
US10475290B2 (en) 2014-08-06 2019-11-12 Mido Play Inc. System for multiple jurisdiction lotteries with fraud detection
US9640028B2 (en) 2015-07-29 2017-05-02 Mido Play, Inc. Single platform system for multiple jurisdiction lotteries
US11244533B2 (en) 2014-08-06 2022-02-08 Lottery Now, Inc. Systems for multiple legal game providers and multiple jurisdictions with asynchronous meta games
US9734659B2 (en) 2014-08-06 2017-08-15 Mido Play Inc. Single platform system for multiple jurisdiction lotteries and social media
US12154413B2 (en) 2014-08-06 2024-11-26 Lottery Now, Inc. Systems for multiple legal game providers with digital ledger
US9659460B2 (en) 2015-06-03 2017-05-23 Mido Play Inc. Methods for multiple legal game providers and multiple jurisdictions with a single platform
US9836701B2 (en) 2014-08-13 2017-12-05 Microsoft Technology Licensing, Llc Distributed stage-wise parallel machine learning
CN105446896B (zh) * 2014-08-29 2018-05-04 国际商业机器公司 映射化简应用的缓存管理方法和装置
WO2016118036A1 (en) * 2015-01-19 2016-07-28 Huawei Technologies Co., Ltd. Systems and methods for selection of program implementation
US10540608B1 (en) 2015-05-22 2020-01-21 Amazon Technologies, Inc. Dynamically scaled training fleets for machine learning
US10496528B2 (en) 2015-08-31 2019-12-03 Microsoft Technology Licensing, Llc User directed partial graph execution
US10402469B2 (en) 2015-10-16 2019-09-03 Google Llc Systems and methods of distributed optimization
US10268461B2 (en) * 2015-11-23 2019-04-23 International Business Machines Corporation Global data flow optimization for machine learning programs
US10860947B2 (en) 2015-12-17 2020-12-08 Microsoft Technology Licensing, Llc Variations in experiment graphs for machine learning
US9715373B2 (en) * 2015-12-18 2017-07-25 International Business Machines Corporation Dynamic recompilation techniques for machine learning programs
US9916344B2 (en) 2016-01-04 2018-03-13 International Business Machines Corporation Computation of composite functions in a map-reduce framework
US20180089587A1 (en) 2016-09-26 2018-03-29 Google Inc. Systems and Methods for Communication Efficient Distributed Mean Estimation
US11196800B2 (en) 2016-09-26 2021-12-07 Google Llc Systems and methods for communication efficient distributed mean estimation
US10769549B2 (en) 2016-11-21 2020-09-08 Google Llc Management and evaluation of machine-learned models based on locally logged data
US10198291B2 (en) 2017-03-07 2019-02-05 International Business Machines Corporation Runtime piggybacking of concurrent jobs in task-parallel machine learning programs
WO2019032123A1 (en) 2017-08-11 2019-02-14 Visa International Service Association SYSTEMS AND METHODS FOR GENERATING DISTRIBUTED SOFTWARE USING AN UNREGRIBUTED SOURCE CODE
CN108960433B (zh) * 2018-06-26 2022-04-05 第四范式(北京)技术有限公司 用于运行机器学习建模过程的方法及系统
CN109657247B (zh) * 2018-12-19 2023-05-23 中科曙光国际信息产业有限公司 机器学习的自定义语法实现方法及装置
US11200238B2 (en) * 2019-01-28 2021-12-14 Roblox Corporation Computing cross products using map reduce
US11436533B2 (en) * 2020-04-10 2022-09-06 Capital One Services, Llc Techniques for parallel model training
US12236370B2 (en) 2020-08-24 2025-02-25 Samsung Electronics Co., Ltd Method and apparatus for federated learning
CA3214385C (en) 2021-06-11 2025-01-21 Iterate Studio, Inc. DATA PIPELINE AND ACCESS TO MULTIPLE MACHINE-LEARNED MODELS

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110047172A1 (en) * 2009-08-20 2011-02-24 Qiming Chen Map-reduce and parallel processing in databases

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7231399B1 (en) * 2003-11-14 2007-06-12 Google Inc. Ranking documents based on large data sets
US7650331B1 (en) * 2004-06-18 2010-01-19 Google Inc. System and method for efficient large-scale data processing
US7844959B2 (en) 2006-09-29 2010-11-30 Microsoft Corporation Runtime optimization of distributed execution graph
US8190610B2 (en) * 2006-10-05 2012-05-29 Yahoo! Inc. MapReduce for distributed database processing
US7886046B1 (en) * 2008-05-16 2011-02-08 Google Inc. Methods and apparatus for predicting impact of proposed changes and implementations in distributed networks
US9110706B2 (en) * 2009-02-09 2015-08-18 Microsoft Technology Licensing, Llc General purpose distributed data parallel computing using a high level language
US8239847B2 (en) * 2009-03-18 2012-08-07 Microsoft Corporation General distributed reduction for data parallel computing
US20100241893A1 (en) * 2009-03-18 2010-09-23 Eric Friedman Interpretation and execution of a customizable database request using an extensible computer process and an available computing environment
JPWO2011090032A1 (ja) * 2010-01-20 2013-05-23 株式会社日立製作所 並列処理プログラム生成方法、並列処理プログラム生成プログラム、及び並列処理プログラム生成装置
US8356086B2 (en) * 2010-03-31 2013-01-15 Microsoft Corporation Distributed non-negative matrix factorization
JP5584914B2 (ja) * 2010-07-15 2014-09-10 株式会社日立製作所 分散計算システム
US9600250B2 (en) * 2010-10-08 2017-03-21 Microsoft Technology Licensing, Llc Declarative programming model with a native programming language

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110047172A1 (en) * 2009-08-20 2011-02-24 Qiming Chen Map-reduce and parallel processing in databases

Also Published As

Publication number Publication date
JP5705338B2 (ja) 2015-04-22
WO2012116449A1 (en) 2012-09-07
US20120226639A1 (en) 2012-09-06
US8612368B2 (en) 2013-12-17
GB2502020A (en) 2013-11-13
GB201314958D0 (en) 2013-10-02
JP2014510342A (ja) 2014-04-24
CN103403700A (zh) 2013-11-20
DE112012000628T5 (de) 2013-11-14

Similar Documents

Publication Publication Date Title
CN103403700B (zh) 用于在mapreduce环境中处理机器学习算法的系统和方法
Bik et al. Compiler support for sparse tensor computations in MLIR
Ghoting et al. SystemML: Declarative machine learning on MapReduce
US8239847B2 (en) General distributed reduction for data parallel computing
Chen et al. Towards linear algebra over normalized data
Wang et al. The Myria Big Data Management and Analytics System and Cloud Services.
US10228922B2 (en) Hybrid parallelization strategies for machine learning programs on top of mapreduce
Wu et al. Red fox: An execution environment for relational query processing on gpus
US8095515B2 (en) Approximating relation sizes using field dependencies
US7877370B2 (en) Systems and methods for data storage and retrieval using algebraic relations composed from query language statements
US7720806B2 (en) Systems and methods for data manipulation using multiple storage formats
Mutlu et al. Comet: A domain-specific compilation of high-performance computational chemistry
Boehm et al. SystemML's Optimizer: Plan Generation for Large-Scale Machine Learning Programs.
JP5113157B2 (ja) データの記憶及び検索を行うためのシステム及び方法
US11941001B2 (en) Optimizing cursor loops in relational database systems using custom aggregates
US7613734B2 (en) Systems and methods for providing data sets using a store of albegraic relations
Genaud et al. Load-balancing scatter operations for grid computing
US20070266000A1 (en) Systems and Methods for Data Storage and Retrieval Using Virtual Data Sets
Liu et al. From Datalog rules to efficient programs with time and space guarantees
US7769754B2 (en) Systems and methods for data storage and retrieval using algebraic optimization
Nassar et al. Chi squared feature selection over Apache Spark
Müller Engineering Aggregation Operators for Relational In-Memory Database Systems
Rohrmann et al. Gilbert: Declarative sparse linear algebra on massively parallel dataflow systems
Kurapov et al. Analytical Queries: A Comprehensive Survey
US20240354604A1 (en) Non-linear multi-dimensional cost function for artificial intelligence inference

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20170425

Address after: Room 3, Pacific Plaza, 1 Queen's Road East, Wan Chai, Hongkong,, China

Patentee after: Oriental concept Limited

Address before: American New York

Patentee before: International Business Machines Corp.

TR01 Transfer of patent right