TW201913404A - Method of executing tuple graphics program across the network

Method of executing tuple graphics program across the network

Info

Publication number
TW201913404A
TW201913404A (application TW107116380A)
Authority
TW
Taiwan
Prior art keywords
partitions
partition
program
stream
operations
Prior art date
Application number
TW107116380A
Other languages
Chinese (zh)
Other versions
TWI710913B (en)
Inventor
高森 珊比多萊
馬修 羅森克蘭茨
桑傑 葛馬瓦
斯爾詹 佩卓維奇
伊凡 波什瓦
Original Assignee
Google LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google LLC
Publication of TW201913404A publication Critical patent/TW201913404A/en
Application granted granted Critical
Publication of TWI710913B publication Critical patent/TWI710913B/en

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5038Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00Traffic control in data switching networks
    • H04L47/10Flow control; Congestion control
    • H04L47/12Avoiding congestion; Recovering from congestion
    • H04L47/125Avoiding congestion; Recovering from congestion by balancing the load, e.g. traffic engineering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/448Execution paradigms, e.g. implementations of programming paradigms
    • G06F9/4494Execution paradigms, e.g. implementations of programming paradigms data driven
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5066Algorithms for mapping a plurality of inter-dependent sub-tasks onto a plurality of physical CPUs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A programming model provides a method for executing a program in a distributed architecture. One or more first shards of the distributed architecture execute one or more operations and send tuples to at least one second shard, the tuples being part of a stream and being based on the one or more operations. The one or more first shards send a token value to the at least one second shard when the sending of the tuples in the stream is complete. The at least one second shard determines whether a total of the token values matches a number of the one or more first shards, and takes a first action in response to determining that the total of the token values matches the number of the one or more first shards. The first action may include marking the stream as complete and/or generating a message indicating that the stream is complete.

Description

Method for executing a tuple graph program across a network

Cloud computing allows users to draw on various computing capabilities that use shared pools of configurable resources to store and process data, achieving cost and computational efficiency. Current programming models for cloud computing include MapReduce, Dryad, and Bulk Synchronous Parallel processing. One of the problems facing distributed computing is performance. Performance in a distributed computation is related to the proximity of the data to the computing units and to the cost of transferring data between the computing units.

The present invention describes a new programming model for cloud computing. The new programming model can be used to write distributed, low-latency, non-batch programs. An application constructs a program according to the model and then submits the program for execution. The program consists of a directed acyclic graph of operators. Streams of values flow from one operator to another along the edges of the graph. Each value sent through a stream is a tuple. Different operators in the same program may run on different machines. The programming model coordinates the execution of these operators on the different machines and propagates data from one operator to another.

One aspect of the programming model provides a method for executing a program in a distributed architecture, the method comprising: executing one or more operations by one or more first partitions of the distributed architecture; sending tuples from the one or more first partitions to at least one second partition, the tuples being part of a stream and being based on the one or more operations; and sending a token value from each of the one or more first partitions to the at least one second partition when the sending of the tuples in the stream is complete. The method further comprises determining, by the second partition, whether a total of the token values matches a number of the one or more first partitions, and taking a first action in response to determining that the total of the token values matches the number of the one or more first partitions. The first action may include marking the stream as complete and/or generating a message indicating that the stream is complete.
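
As a purely illustrative sketch of the token-counting protocol described above, the following shows only the counting logic at a receiving partition; the names StreamState and OnToken are assumptions, not part of the filing. Each sending partition contributes one token when it finishes the stream, and the stream is marked complete when the token total equals the number of senders.

    #include <cstdint>
    #include <iostream>

    // Hypothetical per-stream state kept by a receiving (second) partition.
    struct StreamState {
      int64_t expected_senders;     // number of first partitions feeding this stream
      int64_t tokens_received = 0;  // token values received so far
      bool complete = false;
    };

    // Called when a sending partition reports that it has finished the stream.
    // Each sender contributes one token; once the total matches the number of
    // senders, the stream is marked complete (the "first action" above).
    void OnToken(StreamState& s) {
      ++s.tokens_received;
      if (s.tokens_received == s.expected_senders) {
        s.complete = true;
        std::cout << "stream complete\n";  // or emit a completion message downstream
      }
    }

    int main() {
      StreamState s{/*expected_senders=*/3};
      OnToken(s);
      OnToken(s);
      OnToken(s);  // third token: stream marked complete
      return s.complete ? 0 : 1;
    }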

The at least one second partition is a receiving partition of one of the one or more first partitions. The method may further include generating, by the one of the one or more first partitions, a list of the receiving partitions with which it communicates, and transmitting the list from that first partition to a controller. In addition, the controller may track all receiving partitions that have started processing; determine whether one or more of the receiving partitions that have started processing is absent from the list; and, for each receiving partition that has started processing and is absent from the list, send a token value to that receiving partition on behalf of the one of the one or more first partitions. In some examples, the method may further include determining, by a controller, whether any partitions have not started processing; determining, by the controller, whether the partitions that have not started processing were intentionally skipped by the design of the program; and sending, by the controller, a token value to the second partition on behalf of any intentionally skipped partition that has not started processing.

Another aspect of the present invention provides a system comprising one or more first partitions in a distributed computing environment and at least one second partition in the distributed computing environment, the at least one second partition being remote from the one or more first partitions. The one or more first partitions are configured to execute one or more operations; send tuples to the at least one second partition, the tuples being part of a stream and being based on the one or more operations; and send a token value to the at least one second partition when the sending of the tuples in the stream is complete. The at least one second partition is configured to determine whether a total of the token values matches a number of the one or more first partitions, and to take a first action in response to determining that the total of the token values matches the number of the one or more first partitions.

The system may further include a client device in communication with at least one of the one or more first partitions, the at least one second partition, or a controller. The client device may be configured to construct a graph in which each node represents a partition, and to verify, based on the graph, whether the program will execute correctly across the distributed architecture. The client device may further be configured to dynamically build activations of the graph while the program is executing.

In some examples, a dynamic send operation is executed on a computing device in the distributed architecture. The dynamic send operation sends a data input stream to all activations of a destination graph, and receives new tuples from the controller, the new tuples being received when additional activations of the destination graph are detected.

I. Overview

A new programming model can be used to write distributed, low-latency, non-batch programs. An application constructs a program according to the new model and then submits the program for execution. The program consists of a directed acyclic graph of operators. Streams of values flow along the edges of the graph. Each value sent through a stream is a tuple. Different operators in the same program may run on different machines. The programming model coordinates the execution of these operators on the different machines and propagates data from one operator to another.

Constructing a program involves defining the operations that form the nodes of the graph. An operation receives streams of values as input and sends streams of values as output. Each stream has a tuple type, and all tuples flowing through the stream must match that type. A tuple type is defined by fields comprising a name identifier and a field-type identifier. When operations are defined, type inference provides a standardized way for operations to interact with one another. For example, as part of its definition, an operation may refer to its inputs and outputs and place various constraints on them. One example of such a constraint is that an output type may be constrained to include every field of the input.

The operations in the graph may be executed at various locations in the distributed architecture. Although some operator locations may be defined at the programming stage, others may be left undefined, and operators may be assigned to those locations automatically during graph building and partitioning. In this regard, locations are assigned automatically in a way that reduces overall network traffic.

The graph is built based on the operations defined at the programming stage. Partitioning of the graph may be performed in two phases, including a main phase and a local phase. Each phase is performed according to a set of constraints; the first set of constraints for the main partitioning may differ from the second set of constraints for the local partitioning.

In the main phase, a first step merges subgraphs according to the first set of constraints, minimizing the total number of subgraphs in the program. The subgraphs are then grown by absorbing adjacent unassigned nodes. Candidate operations are first checked to determine whether they have been marked as splittable, meaning that they can be split into separate operations without changing the functionality of the operation. If not, the candidate operations are not absorbed into adjacent subgraphs. If the candidate operations are splittable, they are placed into adjacent subgraphs subject to the constraints. Locations are assigned to all unassigned operations by copying the location from an assigned node to its neighbors. Pairs of unpartitioned subgraphs that will run at the same location are merged where possible to minimize the total number of subgraphs. At some point, further merging becomes impossible.

In the local partitioning phase, subgraphs that need to be split (for example, to prevent inefficiencies during execution) are identified. These may be only the subgraphs that contain blocking operations, which can hold a thread while performing I/O and thereby prevent other operations from running. The graph is prepared for splitting; this may include modifying the subgraph to impose local partitioning constraints. A merged graph is built in which every operation starts in its own subgraph. These subgraphs are then merged together repeatedly. Specifically, all operations with external incoming edges are merged into the same subgraph. In addition, all possible pairs of subgraphs containing non-blocking operations are merged.
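
A minimal sketch of these two local merge rules, using a union-find over operations, is given below. The structures and names (Op, UnionFind, LocalPartition) are assumptions made for illustration, and the sketch deliberately omits the acyclicity, location, and splitting constraints that the real partitioning must also respect.

    #include <numeric>
    #include <vector>

    // Hypothetical operation descriptor for the local partitioning phase.
    struct Op {
      bool blocking;            // performs I/O while holding its thread
      bool has_external_input;  // receives an edge from outside the subgraph
    };

    // Union-find over operation indices; every operation starts in its own subgraph.
    struct UnionFind {
      std::vector<int> parent;
      explicit UnionFind(int n) : parent(n) { std::iota(parent.begin(), parent.end(), 0); }
      int Find(int x) { return parent[x] == x ? x : parent[x] = Find(parent[x]); }
      void Merge(int a, int b) { parent[Find(a)] = Find(b); }
    };

    // Sketch of the two local merge rules: (1) all operations with external
    // incoming edges end up in the same subgraph, and (2) pairs of non-blocking
    // operations are merged where possible.
    UnionFind LocalPartition(const std::vector<Op>& ops) {
      UnionFind uf(static_cast<int>(ops.size()));
      int first_external = -1;
      for (int i = 0; i < static_cast<int>(ops.size()); ++i) {
        if (!ops[i].has_external_input) continue;
        if (first_external < 0) first_external = i;
        else uf.Merge(i, first_external);
      }
      for (int i = 0; i < static_cast<int>(ops.size()); ++i)
        for (int j = i + 1; j < static_cast<int>(ops.size()); ++j)
          if (!ops[i].blocking && !ops[j].blocking) uf.Merge(i, j);
      return uf;
    }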

If an operator is implemented by a partitioned service, the new programming model automatically partitions the computation by instantiating the subgraph multiple times. Partitioning provides a latency benefit, because the partitions execute in parallel, as well as a data-efficiency benefit. As one example of the data-efficiency benefit, an operator placed after a partitioned operator can often run on the same partitioned instance, filtering and reducing the final output and thereby minimizing network traffic.

Once partitioned, the graph can be executed. Each subgraph is executed at its respective location, and each subgraph is executed in its own single thread. Data transfers along edges within a subgraph are optimized based on their execution in a single-threaded environment.

Various aspects of the programming model allow efficient execution of a program. These aspects include, by way of example and not limitation, pipelining and the partitioning described above. Pipelining provides very low latency. For example, for a computation consisting of a series of five operations that each take 10 ms but involve hundreds of thousands of independent values, processing the operations one after another would take 50 ms, whereas a properly pipelined solution can finish in as little as 10 ms. To achieve this, tuples are streamed between operators during execution, which results in better pipelining across the entire program. This tuple-streaming format provides efficient serialization and deserialization across network boundaries. To let the pipeline start early while still achieving high throughput, the new programming model uses dynamic buffer growth: small messages are sent early in the computation and grow larger later, since larger messages are more efficient.
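
The dynamic buffer growth described above might, for example, be realized with a batching policy along the following lines. This is an illustrative sketch rather than the model's actual implementation; the class name and the thresholds are assumptions.

    #include <algorithm>
    #include <cstddef>

    // Hypothetical batching policy: flush small batches early so the pipeline
    // starts quickly, then grow the batch size toward a cap so later messages
    // amortize per-message overhead.
    class GrowingBatcher {
     public:
      explicit GrowingBatcher(std::size_t initial = 1, std::size_t max = 4096)
          : limit_(initial), max_(max) {}

      // True when the caller should flush the tuples buffered so far.
      bool ShouldFlush(std::size_t buffered_tuples) const { return buffered_tuples >= limit_; }

      // After each flush, double the threshold up to the cap.
      void OnFlush() { limit_ = std::min(limit_ * 2, max_); }

     private:
      std::size_t limit_;
      std::size_t max_;
    };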

The new programming model also provides low buffering, for example by applying flow control between network nodes. For example, a sending node determines whether a receiver is busy and, if so, holds back transmission. Within a subgraph, the new programming model can deliver data between operations efficiently via local procedure calls. The new programming model efficiently determines when a computation is complete, and provides lower latency by determining completion more quickly.

II. Example System

Figure 1 illustrates an example system including a distributed computing environment. A plurality of data centers 160, 170, 180 may be communicatively coupled, for example, via a network 150. The data centers 160, 170, 180 may further communicate, via the network 150, with one or more client devices, such as client 110. Thus, for example, the client 110 may execute operations in the "cloud." In some examples, the data centers 160, 170, 180 may further communicate with a controller 190.

The client 110 may execute one or more applications for building programs with the new programming model. Each client 110 may be a personal computer intended for use by a person, having all the internal components normally found in a personal computer, such as a central processing unit (CPU), CD-ROM, hard drive, and a display device (for example, a monitor having a screen, a projector, a touch screen, a small LCD screen, a television, or another device such as an electrical device operable to display information processed by the processor 120), speakers, a modem and/or network interface device, user input devices (such as a mouse, keyboard, touch screen, or microphone), and all of the components used for connecting these elements to one another. Moreover, computers in accordance with the systems and methods described herein may include devices capable of processing instructions and transmitting data to and from humans and other computers, including general-purpose computers, PDAs, tablets, mobile phones, smartwatches, network computers lacking local storage capability, set-top boxes for televisions, and other networked devices.

The client 110 may contain a processor 120, memory 130, and other components typically present in general-purpose computers. The memory 130 can store information accessible by the processor 120, including instructions 132 that can be executed by the processor 120. The memory can also include data 134 that can be retrieved, manipulated, or stored by the processor 120. The memory 130 may be a type of non-transitory computer-readable medium capable of storing information accessible by the processor 120, such as a hard drive, solid-state drive, tape drive, optical storage, memory card, ROM, RAM, DVD, CD-ROM, write-capable memory, or read-only memory. The processor 120 may be a well-known processor or another lesser-known type of processor. Alternatively, the processor 120 may be a dedicated controller, such as an ASIC.

The instructions 132 may be a set of instructions executed directly (such as machine code) or indirectly (such as scripts) by the processor 120. In that regard, the terms "instructions," "steps," and "programs" may be used interchangeably herein. The instructions 132 may be stored in object-code format for direct processing by the processor 120, or in another type of computer language, including scripts or collections of independent source-code modules that are interpreted on demand or compiled in advance.

The data 134 may be retrieved, stored, or modified by the processor 120 in accordance with the instructions 132. For instance, although the system and method are not limited by a particular data structure, the data 134 may be stored in computer registers, or in a relational database as a table having a plurality of different fields and records, or as an XML document. The data 134 may also be formatted in a computer-readable format such as, but not limited to, binary values, ASCII, or Unicode. Moreover, the data 134 may comprise information sufficient to identify the relevant information, such as numbers, descriptive text, proprietary codes, pointers, references to data stored in other memories (including other network locations), or information that is used by a function to calculate the relevant data.

The applications 136 may be used to construct programs in accordance with the new programming model. For example, the applications 136 may be downloaded, executed from the instructions 132, or accessed remotely. In some examples, the applications may be executed remotely; for example, the client 110 may compile a program and send the program to the cloud for execution. The applications 136 may perform different functions, such as type inference, graph building, graph partitioning, and the like. For example, one application may perform a number of different functions, or various applications may each perform one or more different functions.

For the type inference function, an application may be configured to receive information that defines attributes of an operation by field names and type specifiers. The application may further receive information that defines a behavior of the operation with respect to those attributes. Constraints on the operation are determined based on the attributes and the behavior. Information defining an input of the operation may also be received and used, together with the constraints, to determine a type of an output of the operation. The determined output type may be associated with the output of the operation.

For graph building, a plurality of nodes may be generated, where each node corresponds to an operation of the program. The nodes are connected by edges, or vertices, that represent the streams sent between the nodes. Locations may be assigned to particular nodes, for example automatically based on the requirements of the program and the capabilities of the computing devices, or manually by programmer selection.

For graph partitioning, the graph is optimized to reduce overall network traffic. For example, where possible, locations for executing one or more operations are assigned together automatically. In doing so, nodes are merged and split according to a number of predefined constraints. Partitioning may further be performed at a local level, for example so that operations execute at a partitioned location. This local partitioning may be performed according to a second, separate set of constraints. Both the main partitioning and the local partitioning may be performed when the program is compiled. As a result of the partitioning, the program is ready to be executed by computing devices in one or more of the data centers 160, 170, 180, and may be sent for execution.

Although Figure 1 functionally illustrates the processor 120 and memory 130 as being within the same block, the processor 120 and memory 130 may actually comprise multiple processors and memories that may or may not be stored within the same physical housing. For example, some of the instructions 132 and data 134 may be stored on a removable CD-ROM and others within a read-only computer chip. Some or all of the instructions and data may be stored in a location physically remote from, yet still accessible by, the processor 120. Similarly, the processor 120 may actually comprise a collection of processors that may or may not operate in parallel.

The data centers 160-180 may be positioned a considerable distance from one another. For example, the data centers may be located in various countries around the world. Each data center 160, 170, 180 may include one or more computing devices, such as processors, servers, partitions, or the like. For example, as shown in Figure 1, data center 160 includes computing devices 162, 164, data center 170 includes computing device 172, and data center 180 includes computing devices 181-186. Programs may be executed across these computing devices, for example such that some operations are executed by one or more computing devices of a first data center while other operations are executed by one or more computing devices of a second data center. In some examples, the computing devices in the various data centers may have different capabilities; for example, different computing devices may have different processing speeds, workloads, and so on. While only a few of these computing devices are shown, it should be understood that each data center 160, 170, 180 may include any number of computing devices, and that the number of computing devices in a first data center may differ from the number in a second data center. Moreover, it should be understood that the number of computing devices in each data center 160-180 may vary over time, for example as hardware is removed, replaced, upgraded, or expanded.

In some examples, each data center 160-180 may also include a number of storage devices (not shown), such as hard drives, random access memory, disks, disk arrays, tape drives, or any other types of storage devices. The data centers 160, 170, 180 may implement any of a number of architectures and technologies, including, but not limited to, direct attached storage (DAS), network attached storage (NAS), storage area networks (SAN), fibre channel (FC), fibre channel over Ethernet (FCoE), mixed-architecture networks, or the like. The data centers may include a number of other devices in addition to the storage devices, such as cabling, routers, and the like. Further, in some examples the data centers 160-180 may be virtualized environments. Moreover, while only a few data centers 160-180 are shown, numerous data centers may be coupled via the network 150 and/or additional networks.

In some examples, the controller 190 may communicate with the computing devices in the data centers 160-180 and may facilitate the execution of programs. For example, the controller 190 may track the capability, status, workload, or other information of each computing device, and use this information to assign tasks. The controller 190 may also assist in determining whether a stream sent over the network has completed. For example, in some cases the controller 190 may send tokens on behalf of partitioned operators, and such tokens are used by downstream nodes to determine that the stream is complete. The controller 190 may include a processor 198 and memory 192, including data 194 and instructions 196, similar to the client 110 described above.

The client 110, data centers 160-180, and controller 190 can be capable of direct and indirect communication, such as over the network 150. For example, using an Internet socket, a client 110 can connect to a service operating on remote servers through an Internet protocol suite. Servers can set up listening sockets that may accept an initiating connection for sending and receiving information. The network 150, and intervening nodes, may comprise various configurations and protocols, including the Internet, World Wide Web, intranets, virtual private networks, wide area networks, local networks, private networks using communication protocols proprietary to one or more companies, Ethernet, WiFi (e.g., 702.71, 702.71b, g, n, or other such standards), and HTTP, and various combinations of the foregoing. Such communication may be facilitated by a device capable of transmitting data to and from other computers, such as modems (e.g., dial-up, cable, or fiber optic) and wireless interfaces.

III. Constructing a Program

Figures 2A-2B illustrate an example of a program built using the programming model. In this program, one goal is to fetch all of the images in an album and generate a thumbnail for each image. In code, this goal can be expressed as:
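
The code listing referenced here appears only as a figure in the original filing and is not reproduced in this text. As an illustrative stand-in, a straight-line, single-machine version of the goal might look like the following, where the helper names Images, LookupImage, and MakeThumbnail are assumptions rather than part of the filing:

    #include <string>
    #include <vector>

    // Hypothetical helpers; names and signatures are assumptions, not part of the filing.
    std::vector<std::string> Images(const std::string& album) {   // list image names in an album
      return {album + "/a.jpg", album + "/b.jpg"};
    }
    std::string LookupImage(const std::string& name) {            // fetch image data by name
      return "<bytes of " + name + ">";
    }
    std::string MakeThumbnail(const std::string& image_data) {    // produce a thumbnail
      return "thumb(" + image_data + ")";
    }

    // Straight-line, single-machine version of the goal: list the album's images
    // and generate a thumbnail for each one.
    std::vector<std::string> ThumbnailsForAlbum(const std::string& album) {
      std::vector<std::string> thumbnails;
      for (const std::string& name : Images(album))
        thumbnails.push_back(MakeThumbnail(LookupImage(name)));
      return thumbnails;
    }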

However, if the album data is stored on a different server, the Images(album_name) call requires a remote access. The Lookup calls should be sent in parallel to a partitioned service that returns image data given an image name. The Thumbnail calls should be sent in parallel to a separate set of thumbnail servers. A program constructed and executed in accordance with the programming model described herein achieves this distributed execution as follows:
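
The original code listing is again a figure that is not reproduced here. The following sketch suggests how the same goal might be wired with the model's operator constructors, following the operators named in Figures 2A-2B; the constructor signatures and the Stream and Program stand-ins are assumptions made so that the sketch is self-contained:

    #include <string>

    // Stand-in types and constructors so the sketch compiles on its own; in the
    // real model each constructor adds a node to the program graph and returns
    // its output stream rather than computing anything eagerly.
    struct Stream { std::string debug_name; };
    struct Program {};

    Stream Arg(Program&, const std::string& name) { return {name}; }
    Stream ListImages(const Stream& s) { return {"images(" + s.debug_name + ")"}; }
    Stream Lookup(const Stream& s) { return {"lookup(" + s.debug_name + ")"}; }
    Stream Thumbnail(const Stream& s) { return {"thumbnail(" + s.debug_name + ")"}; }
    void Result(Program&, const std::string&, const Stream&) {}

    int main() {
      Program program;
      Stream album  = Arg(program, "album_name");  // stream 215: album names
      Stream names  = ListImages(album);           // stream 225: image names (remote, partitioned)
      Stream images = Lookup(names);               // stream 235: image data (partitioned service)
      Stream thumbs = Thumbnail(images);           // stream 245: thumbnails (thumbnail servers)
      Result(program, "thumbnails", thumbs);       // output 250
      return 0;
    }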

This program construction produces the graph of Figure 2A. As shown, an Input operation 210 produces a stream 215 that feeds an album name to a ListImages operation 220. ListImages 220 has associated metadata that tells the programming model that it must run on a different server, perhaps on a particular partition based on the album name. The programming model internally sets up a remote procedure call (RPC) to the appropriate partition of this service and sends the album name to that partition. ListImages 220 produces a stream 225 of image names. The programming model again finds the appropriate partition of the lookup service for each image name and sends the names to those partitions. The Lookup operator 230 produces a stream 235 of images. These images are in turn passed to yet another service, the Thumbnail operation 240, which produces the thumbnails. The programming model finds the appropriate partition of the thumbnail service for each image and sends each image to the appropriate partition. The resulting thumbnails 245 are saved as the output 250 of the program. Although the computation touches servers in three different partitioned services, the application code does not have to initiate or manage any remote communication.

Figure 2B shows an updated graph in which the program has been refined to add a cache of thumbnails keyed by image name. Accordingly, a cache Lookup operation 226 is shown. Image thumbnails that hit in the cache are passed directly to the output via stream 228. Image names that miss are sent, as before, to Lookup 230 via stream 227. The application is agnostic to the location or implementation of the cache. As long as the cache implements a Lookup operator in accordance with the programming model, which knows the location of the cache, the program of Figure 2B suffices.

According to some examples, the programming model may provide a set of built-in operators. Examples of these built-in operators are provided in the chart of Figure 3, with each operator described alongside a corresponding function. Although only a few operators are shown, it should be understood that any number of operators may be built into the programming model. Additionally, other operators may be created. A program may be built, for example, by writing the operators together with streams according to the following code:
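
The code listing for this example is likewise a figure that is not reproduced in this text. Based on the description in the next paragraph, a sketch of such a program, with stand-ins for the Constant and Interleave constructors so that it runs on its own, might be:

    #include <string>
    #include <vector>

    // Stand-ins for the Constant and Interleave operator constructors named in
    // the text; the real constructors add nodes to the program graph rather than
    // computing values eagerly, so this is illustrative only.
    struct Stream { std::vector<std::string> values; };

    Stream Constant(const std::string& v) { return {{v}}; }
    Stream Interleave(const Stream& a, const Stream& b) {
      Stream out = a;
      out.values.insert(out.values.end(), b.values.begin(), b.values.end());
      return out;
    }

    int main() {
      // Combine two constants into a single stream, as described below.
      Stream merged = Interleave(Constant("hello"), Constant("world"));
      return merged.values.size() == 2 ? 0 : 1;
    }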

This program combines two constants into a single stream. Operators are added to the graph by calling operator-specific constructors (Constant and Interleave). Every operator has such a constructor, which returns one or more streams. These streams can be used as arguments to later operator constructors.

When a program is built, operators may be defined by fields including a name and a type specifier. The name may be chosen by the programmer. Choosing a unique name provides the most helpful output, but this is not required.

Once the operators are wired together, the program may be built by constructing a program object and compiling it. In addition, a compiler object is constructed, and the library of built-in operators is added to the compiler object. A Binding object may be constructed to supply arguments and results to the program at a later stage.

Providing inputs to the program and receiving outputs from it may be accomplished by adding results and arguments. A result may be added with an input stream and a name for the result, and the name of the result is then associated with an actual instance. This separation allows a program to be reused with different outputs. Arguments may be added by a similar process: an argument is added with a name, and the name is associated with an instance, allowing the program to be used with different input data.

A. Type Inference

Each value sent through a stream is a tuple. Each stream has a tuple type, and all tuples flowing in that stream must match that type. A tuple type is defined by a fixed set of fields of the form <name: field type>. One example of a tuple type is the structure <city string, population int64>. This tuple has two fields: a city field holding a string, and a population field holding a number (a 64-bit integer). Some values matching this type are {city:'Tokyo', population:13350000} and {city:'Ithaca', population:30515}. Examples of field types that may be supported in a tuple include, but are not limited to: bool, bytes, string (which must be valid UTF-8 bytes), double, float, int32, uint32, int64, and uint64.

To allow operations to be implemented in a way that is applicable across various platforms and programming languages, an inference algorithm allows the inputs to an operation to change or parameterize field names, field types, or the shape of the structure. To allow distributed graph execution, the specification of the type inference rules and constraints is separated from the actual implementations of the operators. In this way, the flow of types is expressed through the operations without committing to a particular implementation.

As part of its definition, an operation may refer to its inputs and outputs and place various constraints on them. An output may be constrained in terms of the input. For example, an output type may be constrained to include every field of the input. The output type may also be constrained to include one or more additional fields beyond every field of the input. As another example, the output type may be constrained simply to include one or more particular fields.

An example operation is as follows:
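
The operation definition referenced here is a figure in the original filing and is not reproduced in this text. Based on the description in the following paragraphs, a purely declarative sketch of such an operation might look as follows; the struct layout and the operation name CacheLookup are assumptions, while the field names, stream names, and option names mirror the description below:

    #include <string>
    #include <vector>

    // Hypothetical declarative description of the operation discussed below.
    struct FieldSpec { std::string name; std::string type; };

    struct OpSpec {
      std::string name;
      std::vector<std::string> attributes;      // programmer-supplied field names
      std::vector<FieldSpec> required_inputs;   // constraints on the input tuple type
      std::vector<std::string> input_streams;
      std::vector<std::string> output_streams;
      std::string partition_by;                 // attribute used to choose a partition
      bool end_when_outputs_done;
      bool skip_on_empty_inputs;
    };

    OpSpec CacheLookupSpec() {
      return {"CacheLookup",
              {"key_field", "fp_field", "val_field"},
              {{"key", "bytes"}, {"fp", "uint64"}},  // input must contain these fields
              {"In"},
              {"Hits", "Misses"},  // Hits = input type + <val_field bytes>; Misses = input type
              "fp_field",
              /*end_when_outputs_done=*/true,
              /*skip_on_empty_inputs=*/true};
    }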

In this example, the operation is parameterized by attributes defined by the field names key_field, fp_field, and val_field. When creating the operation, a programmer may specify these field names and configure the behavior of the operation with reference to them. That behavior determines the type constraints. For example, the type constraints may specify that an input to the operation should contain the fields <key: bytes> (name: key, type: bytes) and <fp: uint64>, and that output values should contain the field <val: bytes>.

The example operation may also specify other properties, such as a number of input streams, a number of output streams, how partitioning should be performed, and so on. For example, the operation in the example above also specifies that fp_field is used for partitioning purposes. By way of example only, the operation may be scaled across 100 replicas, and if the input is evenly distributed, each replica will receive 1% of it. The fp_field is consulted, via modular arithmetic, to determine which partition should receive the input data.
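
As a minimal illustration of the partition choice just described, and assuming the fingerprint is available as an integer, the replica index might be computed along these lines:

    #include <cstdint>

    // Minimal sketch of the partition choice described above: the fingerprint
    // field selects one of N replicas via modular arithmetic.
    int PartitionFor(uint64_t fp, int num_replicas) {
      return static_cast<int>(fp % static_cast<uint64_t>(num_replicas));
    }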

The operation defines that it receives a single input stream called In, and that it creates two output streams called Hits and Misses. Misses is defined to have the same type as the input, while Hits is constrained to be a new type consisting of the input type concatenated with <val_field bytes>. Operations may have other properties that are not used for type inference purposes but are important for graph execution purposes. Examples of such other properties in the example operation above include end_when_outputs_done and skip_on_empty_inputs.

The types of all operations are determined at compile time and checked for correctness. For example, it is determined at compile time whether the output of one operation matches the input of another operation. The system performs type inference to turn the type constraints into concrete types. This may be implemented as a forward pass.

The operator constructors mentioned above return a stream object. For example: Stream s = ZipConst(input, {"label", 100});

An operator constructor is used to add type-related information to a statement associated with the stream returned by that constructor. Following the example above, ZipConst may add type-related information such as +<label:int64>. In this example annotation, "+" indicates that all fields of the input type should be added to the output type. "<label:int64>" indicates that a 64-bit integer field called "label" should also be added to the output type. "<label:int64>" may be referred to as a type specifier, which more generally may specify a list of field-name and field-type pairs. The type inference code of the programming model interprets these annotations and produces an output type. In some instances, the inference may produce an error, for example if a programmer attempts to define an output that is inconsistent with the constraints. For example, if the input type already contains a field called "label", type inference will fail, because each field name may appear only once in a valid type. When such an error occurs, the attempted output definition may be rejected, and the programmer may be prompted to enter a different definition consistent with the constraints. In other examples, attempted output definitions that produce errors may be automatically flagged by the programming model for further review by the programmer.
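
A sketch of one such inference step, covering the "+<label:int64>" style of annotation and the duplicate-field error just described, is given below. Representing tuple types as name-to-type maps and the function name InferOutput are assumptions made for illustration:

    #include <map>
    #include <optional>
    #include <string>

    using TupleType = std::map<std::string, std::string>;  // field name -> field type (an assumption)

    // One inference step for an annotation such as "+<label:int64>": start from
    // the input type when the '+' flag is set, then add each annotated field,
    // failing when a field name would appear twice, since each name may occur
    // only once in a valid type. A full program applies such steps node by node
    // in a single forward pass.
    std::optional<TupleType> InferOutput(const TupleType& input, bool copy_input,
                                         const TupleType& added_fields) {
      TupleType out = copy_input ? input : TupleType{};
      for (const auto& [name, type] : added_fields) {
        if (!out.emplace(name, type).second) return std::nullopt;  // duplicate field: error
      }
      return out;
    }

    int main() {
      TupleType input = {{"city", "string"}};
      auto ok = InferOutput(input, /*copy_input=*/true, {{"label", "int64"}});   // succeeds
      auto bad = InferOutput(input, /*copy_input=*/true, {{"city", "string"}});  // duplicate, fails
      return (ok.has_value() && !bad.has_value()) ? 0 : 1;
    }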

Figure 4 provides a chart listing examples of output type annotations for existing operators. It should be understood that the chart is not exhaustive, and other example output type annotations may also be used in the programming model.

Certain operators, such as Receive and Interleave, are marked as "special." Type inference provides special handling for these operators. For example, for Receive, the output type is the same as the input type of the Send node associated with the annotation. For Interleave, all input types are the same, and the output type is the same as the input type. Although this may preclude writing operators that do very complex type manipulation, such operators are beneficial because they provide greater consistency among operators. Moreover, if the type inference code does not need to run any operator-specific code, it can run in a place where the operator implementations are not available. For example, in a distributed setting, type inference can be performed at the controller without all of the operators having to be linked into the controller. Type inference may be performed as a forward pass.

The programming model may further provide type checking. An operator constructor may add an annotation to a statement, where the annotation is used to type-check the inputs to the operator. For example, the Sum operator requires an input containing a numeric field, and places the following input type annotation on its statement: <n:int64>. The programming model will verify that any input fed to this operator contains a superset of the specified fields.
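
A sketch of this superset check, with an assumed representation of tuple types as name-to-type maps, is shown below; it is illustrative only and not the model's actual implementation:

    #include <map>
    #include <string>

    // Tuple types represented (as an assumption) as maps from field name to field
    // type. The check verifies that the input contains a superset of the fields
    // the operator's input annotation demands, e.g. <n:int64> for Sum.
    using TupleType = std::map<std::string, std::string>;

    bool InputSatisfies(const TupleType& input, const TupleType& required) {
      for (const auto& [name, type] : required) {
        auto it = input.find(name);
        if (it == input.end() || it->second != type) return false;
      }
      return true;
    }

    int main() {
      TupleType input = {{"n", "int64"}, {"city", "string"}};
      TupleType sum_requires = {{"n", "int64"}};
      return InputSatisfies(input, sum_requires) ? 0 : 1;
    }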

Figure 5 provides a flow diagram illustrating an example type inference method 500. The method may be performed, for example, by a client computing device, a controller, or another network computing device. Although the method is described below in a particular order, it should be understood that the sub-parts may be performed in a different order or simultaneously. Moreover, sub-parts may be added or removed.

In block 510, information defining attributes by field names and field-type identifiers is received. For example, referring to the example operation produced by the code above, the attributes {key_field, bytes}, {val_field, bytes}, and {fp_field, bytes} are defined. This information is used to define the types of the input and output streams of the operation.

In block 520, information defining a behavior of an operation with respect to those attributes is received. For example, referring to the example above, the input, output, and partitioning fields determine how the operation will behave.

In block 530, constraints on the operation are determined based on the attributes and the behavior. In some examples, the constraints may be determined automatically by the programming model. In other examples, the constraints may be defined by a user.

In block 540, information defining an input of the operation is received. The input may include, for example, a field that includes a name and a type. This information may also be referred to as type information, and it is provided for one or more input streams of the operation. The type inference method determines the type information for one or more output streams of an operator based on the type information associated with the one or more input streams and an output annotation associated with the operator. The type information may include constraints restricting the tuples included in the stream with which the type information is associated. The type should correspond to one of the types defined in the attributes.

In block 550, an output type is determined based on the constraints and the defined input. For example, the output type may be limited to a type specified in the constraints, and may correspond to the received information defining the input. This determination can be performed in one forward pass through the graph, without backtracking.

In block 560, the output type is associated with an output of the operation. For example, when a user is defining an output of the operation, the output type field may be automatically populated. In other examples, the user may be prevented from attempting to enter a different output type.

Although the foregoing example describes determining an output type based on an input type and the defined operator, type inference may also be used in reverse. For example, an output type may be received as input, and the input type may be determined based on the defined output type and other information.

Type inference and constraint validation as described above ensure accurate and fast query execution. They also allow operations to be implemented in a very generic way. The inference allows the inputs to an operation, such as attributes, to change or parameterize field names, field types, or even the shape of the structure. To allow distributed graph execution, the specification of the type inference rules and constraints is separated from the actual implementations of the operators. The result is a format that is fully abstracted from any particular implementation while still expressing the flow of types through the operations. Type inference and constraint validation are part of a critical path of query execution, which creates a requirement for fast execution. A single-pass inference and validation algorithm, without backtracking, further provides fast execution.

B. Location Assignment

Location assignment takes place during program construction. An operation in a graph may have a location restriction indicating one or more locations at which the operation may be executed. The location restriction may be defined by a user, or may be determined based on the capabilities of the computing devices in the distributed system. For example, if the data to be fetched by a lookup operation is stored in a particular data center, the lookup operation is restricted to executing at that particular data center.

For an operation that does not have a location restriction, the programming model assigns a location to the operation. These locations may be chosen so as to optimize the computation in some way. For example, one node may produce a large amount of data but then be followed by a filter node that filters out 99% of that data. In such a case, it is particularly beneficial to place the filter node at the same location as the data-producing node. Location assignment may take place as part of graph building and partitioning, discussed further below.

i. Graph Construction

Figures 6A-6C illustrate an example of location assignment during program construction. Figure 6A provides an example graph having nodes that represent operations and, between the nodes, edges connecting a source operation to a destination operation, where the output of the source operation is the input of the destination operation. In this program, an argument 610, such as a key, is sent to a remote location that performs a lookup 620. The lookup results are sent through a filter 630, which removes some of the results and outputs the remainder to a result operation 640.

Figure 6B illustrates an example of user-assigned locations for each of the operations. The argument 610 is sent from location C to location L for the lookup 620. The results of the lookup 620 are then sent back to location C for filtering and output. This causes a large amount of data to be sent from location L to location C, only for much of that data to be filtered out by the filter operation 630. Figure 6C illustrates a more efficient example of possible user-assigned locations, in which both the lookup and the filter are executed at the same location L. This assignment optimizes execution time and reduces network traffic. However, relying on users to anticipate potential inefficiencies in location assignment and to adapt to them places a significant burden on users.

Figures 7A to 7B illustrate an example of automatic location assignment during graph construction. When the program is constructed, the operations argument 710, lookup 720, and result 740 come with pre-assigned locations. These locations may be pre-assigned automatically by the programming model based on, for example, the capabilities of the computing devices used to execute the operations, restrictions associated with the operation definitions, or any other information. The filter operation 730 does not come with any location assignment. Therefore, when the graph is constructed, it may appear as shown in Figure 7A. When the program is submitted for execution, the programming model recognizes that the filter 730 is a data-reducing operation and assigns the filter 730 to location L. The program therefore appears as in Figure 7B.

Since location assignment is automated, partitioning the program into graphs that preserve the location constraints should also be automated. This partitioning is done in a way that maximizes performance.

Figures 8A to 8C illustrate an example of partitioning a graph to minimize the number of subgraphs while observing constraints. For example, the graph must remain acyclic. As shown in Figure 8A, each operation runs in a separate graph G0, G1, G2, G3. As such, data needs to be serialized and deserialized on every edge 815, 825, 835. For edges 815 and 835, which span the two locations C and L, network transmission requires this data serialization. However, edge 825 does not require data serialization. Yet simply minimizing the number of graph partitions would produce the graph of Figure 8B, which introduces a cycle between graphs G0 to G1 and G1 to G0. This can lead to deadlock and should therefore be avoided. Minimizing the number of graphs while preventing cycles produces the graph of Figure 8C, which is the optimal partitioning for the program.
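A merge such as the one rejected in Figure 8B can be detected mechanically: before fusing two subgraphs, check whether one can already reach the other through some third subgraph, because such a path would re-enter the merged node and form a cycle. The following Python sketch is only an illustration under an assumed subgraph representation (a dictionary of subgraph-to-subgraph edges), not the patent's actual implementation.

```python
# Sketch: only merge two subgraphs if doing so keeps the subgraph graph acyclic.
from typing import Dict, Set

def path_through_others(start: str, goal: str, edges: Dict[str, Set[str]]) -> bool:
    """True if goal is reachable from start via at least one intermediate subgraph."""
    stack = [n for n in edges.get(start, ()) if n != goal]   # skip the direct edge
    seen: Set[str] = set()
    while stack:
        node = stack.pop()
        if node == goal:
            return True
        if node in seen or node == start:
            continue
        seen.add(node)
        stack.extend(edges.get(node, ()))
    return False

def can_merge(a: str, b: str, edges: Dict[str, Set[str]]) -> bool:
    # Fusing a and b creates a cycle exactly when one can already reach the
    # other through some third subgraph: that path would re-enter the merged node.
    return not (path_through_others(a, b, edges) or path_through_others(b, a, edges))

# Figure 8-style example: four single-operation subgraphs chained G0 -> G1 -> G2 -> G3.
edges = {"G0": {"G1"}, "G1": {"G2"}, "G2": {"G3"}}
print(can_merge("G0", "G2", edges))  # False: fusing them would cycle through G1
print(can_merge("G2", "G3", edges))  # True: only a direct edge joins them
```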

Partitioned locations present additional considerations for graph partitioning. Figures 9A to 9D provide an example of graph partitioning for partitioned locations. A partitioned location may include any location with multiple computing partitions for executing operations. As shown in Figure 9A, two lookup operations 920, 940 are placed at the same location, partitioned by the field "index key". Assigning the processing operation 930 to the same partitioned location produces the graph of Figure 9B, which might be partitioned as shown in Figure 9C. However, this partitioning is incorrect. If the processing operation 930 modifies the index key field in any way, its output should not be passed directly to the second lookup operation 940 without repartitioning the input. To prevent this, all partitioned locations are treated as unique. For example, even if both lookup operations 920, 940 are assigned to the same location, the lookup operations are treated as being at different locations. Therefore, as shown in Figure 9D, the processing operation 930 is assigned to one of the unique lookup locations.

As demonstrated by the above example of partitioned locations, assigning the same location to two operations does not guarantee that the two operations will run together in the same graph. Two operations assigned to the same partitioned location are guaranteed not to run in the same graph. Although this behavior does not affect the correctness of the program, it can affect its performance. Therefore, the programming model provides a way for a user to co-locate a set of operations together at a particular location. Those operations are then guaranteed to end up in the same graph at that location, to which other operations may be added. Whenever operations are specified as co-located, the data sent between those operations is not repartitioned.

Figures 10A to 10B provide an example of co-location. In the program of Figure 10A, the user has specified that the lookup operations 1020, 1030 are to run together at a given location. After location assignment and graph construction, the program is partitioned as shown in Figure 10B.

To summarize the automatic location assignment and graph partitioning described above, location is a property of an operation rather than of a graph. Some operations come with assigned locations or location constraints, some have their locations assigned by the user, and some come without any assigned location. The user writes a program as a single graph of operations without needing to worry about subgraphs. Each operation provides hints to the programming model. For example, for each output edge, the operation reports the percentage of the total input data that will flow on that output edge. This hint helps the programming model decide which location to assign to an operator. In other examples, this information can be computed automatically for a given program during earlier runs. The programming model automatically assigns locations to operations and automatically partitions the program into a minimum number of graphs while preserving the property that there are no cycles between graphs.

According to one example, output hints specified in the operations, or data collected from previous graph runs, can be used to weight each edge of the program graph with the expected number of tuples that will flow on that edge. Locations can then be assigned to graph nodes in a way that minimizes the total number of tuples flowing between locations. This location assignment can be performed by sorting all edges in the program in decreasing order of tuple count and iterating over the edges in sorted order. For each edge, the source operator and destination operator are identified, and if neither the source operator nor the destination operator has a location assigned to it, the two are grouped together and assigned the same location. If one operator has an assigned location, the same location is assigned to the other operator and to all other operators that may already have been grouped with it. This algorithm removes the most expensive edge from the total tuple count, then the next most expensive edge, and so on.
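The greedy pass described above can be pictured with the following Python sketch. The data structures (a grouping table and a location dictionary) and the toy edge weights are assumptions made for this illustration, not the patent's own bookkeeping.

```python
# Sketch of the greedy location assignment: visit edges in decreasing order of
# expected tuple count and pull the operators on the heaviest edges together.
from typing import Dict, List, Optional, Tuple

Edge = Tuple[str, str, int]   # (source op, destination op, expected tuple count)

def assign_locations(edges: List[Edge],
                     fixed: Dict[str, str]) -> Dict[str, Optional[str]]:
    group: Dict[str, List[str]] = {}              # op -> members of its group
    loc: Dict[str, Optional[str]] = dict(fixed)   # op -> assigned location

    def members(op: str) -> List[str]:
        return group.setdefault(op, [op])

    def set_location(op: str, location: str) -> None:
        for m in members(op):                     # spread to the whole group
            loc[m] = location

    for src, dst, _count in sorted(edges, key=lambda e: e[2], reverse=True):
        loc.setdefault(src, None)
        loc.setdefault(dst, None)
        if loc[src] is None and loc[dst] is None:
            if members(src) is not members(dst):  # not already grouped together
                merged = members(src) + members(dst)
                for m in merged:
                    group[m] = merged
        elif loc[src] is not None and loc[dst] is None:
            set_location(dst, loc[src])
        elif loc[dst] is not None and loc[src] is None:
            set_location(src, loc[dst])
        # if both already have locations, the edge stays cross-location
    return loc

# Toy version of Figure 6: the heavy lookup -> filter edge pulls the filter to
# the lookup's location L, while the light edges leave the other ops at C.
edges = [("args", "lookup", 10), ("lookup", "filter", 1_000_000),
         ("filter", "result", 3)]
print(assign_locations(edges, {"args": "C", "lookup": "L", "result": "C"}))
```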

Figures 11A to 11C illustrate an example of a program with multiple-input operations. As shown in Figure 11A, the program includes a first input argument 1110 fed via edge 1115 into a first lookup operation 1130. The program also includes a second input argument 1120 that provides input via edge 1125 to a second lookup operation 1140. The lookup operations 1130, 1140 provide streams to a ZipAny operation 1150 via edges 1135 and 1145, respectively, and the ZipAny operation 1150 provides a stream to a select operation 1160 via edge 1155. An output is provided to the result 1170 via edge 1165. The edge weights represent the estimated number of tuples flowing along each edge. For example, edges 1115, 1135, and 1155 have an edge weight of 1M. Edges 1125 and 1145 have an edge weight of 1, and edge 1165 has a weight of 3. Location SL is partitioned, while location L is not partitioned.

Automatic partitioning of this program may produce the graph of Figure 11B, in which ZipAny 1150 and select 1160 are both assigned to location SL. This location assignment will work, provided that the second lookup operation 1140 running at location L broadcasts its tuples to all partitioned instances of ZipAny 1150.

As shown in Figure 11C, the ZipAny operation is replaced with an interleave operation 1180. The program will work, provided that the second lookup operation 1140 running at location L sends its tuples to only one of the interleave's partitioned instances. Although this solution is operation-specific, the problem can also be solved more generally. For example, all multiple-input operations can be marked as non-splittable. An operation is splittable if it can be split into separate operations without changing the functionality of the operation. For example, if three streams S1, S2, S3 feed an operation OP, then OP is splittable if OP(UNION(S1, S2, S3)) == UNION(OP(S1), OP(S2), OP(S3)). One example of a splittable operation is the doubling operator mentioned in Figure 4, which doubles every input value. However, this can cause performance degradation. Another example of a general solution is to require the program writer to explicitly specify how multiple-input operations are partitioned. However, this would place a burden on the program writer and would eliminate the possibility of dynamically optimizing the program. Yet another example of a general solution is to provide a way for an operation writer to customize how a multiple-input operation is split. In the above example, ZipAny will always want its other input to be broadcast, while interleave will always want its other input to be sent to only one location. Although this places an extra burden on the operation writer, it is insignificant compared with the potential burden on the program writer, and it preserves the correctness of the program with optimized performance.
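The splittability test stated above can be pictured as a property check on sample data: apply the operation to the union of several streams and compare the result with the union of the per-stream results. The Python sketch below uses toy operations chosen for this example and treats a stream as a multiset so that ordering does not matter; it only illustrates the property, not how the model actually marks operations.

```python
# Property sketch: OP(UNION(S1, S2, S3)) == UNION(OP(S1), OP(S2), OP(S3)).
from collections import Counter
from itertools import chain
from typing import Callable, Iterable, List

def looks_splittable(op: Callable[[List[int]], List[int]],
                     streams: Iterable[List[int]]) -> bool:
    streams = list(streams)
    combined = op(list(chain.from_iterable(streams)))         # OP over the union
    per_stream = list(chain.from_iterable(op(s) for s in streams))
    return Counter(combined) == Counter(per_stream)           # compare as multisets

double = lambda s: [2 * x for x in s]      # per-tuple operation, splittable
def running_sum(s):                        # depends on the whole stream, not splittable
    total, out = 0, []
    for x in s:
        total += x
        out.append(total)
    return out

streams = [[1, 2], [3], [4, 5]]
print(looks_splittable(double, streams))        # True
print(looks_splittable(running_sum, streams))   # False
```

ii. Graph partitioning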

In addition to location assignment, the new programming model also performs automatic partitioning of the graph in a way that is optimized to reduce overall network traffic and to increase the speed and performance of executing the program. The operations in the graph are partitioned into a plurality of subgraphs. Every operation in a subgraph must be assigned to the same location. When performing the partitioning, the possibility of creating a cycle among candidate subgraphs is detected, and the cycle is eliminated by further splitting one of the candidate subgraphs.

Graph partitioning begins by placing every operation that has an assigned location into its own subgraph. For example, locations may be assigned based on location restrictions associated with the operations. Some operations may have particular requirements, whereby only certain computing devices at particular locations in the distributed architecture are able to execute the operations in accordance with those restrictions. These restrictions and capabilities can be recognized by the programming model, which can therefore automatically assign a location to the operation. Partitioning may include reducing the number of subgraphs that have operations assigned to a particular location.

Over the course of the partitioning algorithm, unassigned operations are placed into the location-assigned subgraphs, and those subgraphs are merged together wherever possible. A number of primary constraints are imposed while the algorithm executes; these constraints ensure that the final graph and the assignment of locations to operations are such that, when the program represented by the graph is executed, communication among the operations in the graph is efficient. In particular, every operation must be placed into a subgraph; the algorithm must assign a location to each subgraph; an operation with a programmer-assigned location must keep that assignment; and an unassigned operation may be placed into a partitioned location only if it has the splittable property. In addition, if a location is partitioned, all edges in the program whose destination operation is assigned to that location remain unchanged throughout the algorithm. Furthermore, the graph must be acyclic.

Graph partitioning can be performed in two phases. In a first phase, primary partitioning is performed, and in a second phase, local partitioning is performed. One purpose of the first phase is to determine an assignment of the nodes in the graph to locations such that communication among the locations is minimized when the program represented by the graph is executed. One purpose of local partitioning is to improve and optimize the implementation of a program and of the operations assigned to the same location. Primary partitioning may include merging partitioned subgraphs, expanding partitioned subgraphs, assigning unpartitioned locations, and merging unpartitioned subgraphs. A subgraph is partitioned if all of its nodes are assigned to the same partitioned location. As a result of primary partitioning, each subgraph is assigned either a partitioned or an unpartitioned location, and all nodes in the same subgraph have the same location as the subgraph. Local partitioning may include identifying subgraphs that need to be split, preparing the graph for splitting, building a merge graph, merging subgraphs with external incoming edges, and merging subgraphs with non-blocking operations.

In the first phase, primary partitioning, a first step merges as many partitioned subgraphs as possible. This minimizes the total number of subgraphs in the program. The next step is to expand the partitioned subgraphs by folding adjacent unassigned nodes into them. Candidate operations are first checked to determine whether they have been marked as splittable. If not, the candidate operation is not folded. If the candidate operations are splittable, placing them into adjacent subgraphs is subject to the primary partitioning constraints. In a next step, assigning unpartitioned locations, locations are assigned to all unassigned operations by copying the location from an assigned node to its neighbors. A following step involves merging unpartitioned subgraphs, attempting to minimize the total number of subgraphs by merging every possible pair of unpartitioned subgraphs running at the same location. At some point, further merging becomes impossible. For example, once every operation has been assigned to a subgraph and the number of subgraphs has been minimized, any further merge would introduce a cycle into the graph or break one of the constraints. At this point, blocking operations can be split into their own subgraphs, creating local graphs that execute on the same machine. A blocking operation is an operation that may have to perform input/output and can therefore hold a thread while performing I/O, preventing other operations from running.

In the second phase, local partitioning, locations have already been assigned. Furthermore, partitioned locations may be split just like unpartitioned locations. For example, a location may be partitioned if it includes multiple partitions. However, splitting into multiple local subgraphs must satisfy a set of local constraints: every blocking operation must end up in its own subgraph, the split may produce only one subgraph with external (non-local) inputs, and the subgraphs and the edges between them must remain acyclic. Requiring the split to produce only one subgraph with external inputs ensures that external graphs communicate with a single local graph, which enables more send/receive optimizations and simplifies the protocol.

A first step of local partitioning is to identify the subgraphs that need to be split. These may be only the subgraphs that contain blocking operations. In a next step, the graph is prepared for splitting. This may include modifying the subgraph to enforce the local partitioning constraints. For example, the modification may insert no-op operations before and after each blocking operation. Inserting a no-op before a blocking operation ensures that there is no blocking operation with an external input in the subgraph. Inserting a no-op after a blocking operation ensures that there is no blocking operation with an external output in the subgraph.
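To make this preparation step concrete, the sketch below inserts a no-op in front of every blocking operation that has an external (cross-subgraph) input and after every blocking operation that has an external output. The graph representation and names are guesses made for this illustration only.

```python
# Sketch: prepare a subgraph for local splitting by wrapping blocking
# operations with no-ops, so no blocking operation keeps an external edge.
from dataclasses import dataclass, field
from typing import Dict, List, Set

@dataclass
class Node:
    blocking: bool = False
    inputs: List[str] = field(default_factory=list)   # upstream node names
    outputs: List[str] = field(default_factory=list)  # downstream node names

def prepare_for_split(nodes: Dict[str, Node], local: Set[str]) -> None:
    for name in list(nodes):
        node = nodes[name]
        if not node.blocking or name not in local:
            continue
        for src in list(node.inputs):
            if src not in local:                       # external input
                noop = f"noop_before_{name}"
                nodes[noop] = Node(inputs=[src], outputs=[name])
                node.inputs[node.inputs.index(src)] = noop
                if src in nodes:
                    outs = nodes[src].outputs
                    outs[outs.index(name)] = noop
                local.add(noop)
        for dst in list(node.outputs):
            if dst not in local:                       # external output
                noop = f"noop_after_{name}"
                nodes[noop] = Node(inputs=[name], outputs=[dst])
                node.outputs[node.outputs.index(dst)] = noop
                if dst in nodes:
                    ins = nodes[dst].inputs
                    ins[ins.index(name)] = noop
                local.add(noop)

# Figure 14-style example: A -> B (blocking) -> C -> D (blocking) -> D', where
# A and D' live outside the subgraph being split.
nodes = {
    "A":  Node(outputs=["B"]),
    "B":  Node(blocking=True, inputs=["A"], outputs=["C"]),
    "C":  Node(inputs=["B"], outputs=["D"]),
    "D":  Node(blocking=True, inputs=["C"], outputs=["D'"]),
    "D'": Node(inputs=["D"]),
}
prepare_for_split(nodes, local={"B", "C", "D"})
print(sorted(nodes))   # now includes noop_before_B and noop_after_D
```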

In a next step of local partitioning, a merge graph is built in which every operation ends up in its own subgraph. These subgraphs are then repeatedly merged together. Specifically, all operations with external incoming edges are merged together into the same subgraph. In addition, every possible pair of subgraphs with non-blocking operations is merged.

Once partitioned, the graph can be executed. Each of the subgraphs is executed at its respective location, and each subgraph is executed in a respective single thread. Data transfer along the edges within a subgraph is optimized based on its execution in a single-threaded environment.

Figure 12 illustrates an example program. Figures 13A to 13F illustrate primary partitioning of the program, and Figures 14A to 14E illustrate an example of local partitioning. The resulting program is shown in Figure 15.

As shown in Figure 12, an initial graph of the program is created. The graph contains a plurality of nodes A to K representing various operations, with edges 1211 to 1222 representing the data streams flowing between those nodes. Some of the nodes have predefined locations. For example, the operation of node A is assigned to location C, while the operation of node I is assigned to location L. The operations of each of nodes B, C, E, and F are assigned to the partitioned location SL. During partitioning, locations are automatically assigned to the remaining nodes D, J, G, H, and K.

In Figure 13A, each node with a predefined location is placed into its own subgraph. For example, node A is placed in subgraph 1310, node B in subgraph 1312, node C in subgraph 1314, node E in subgraph 1316, node F in subgraph 1318, and node I in subgraph 1320. During partitioning, nodes with unassigned locations are placed into these subgraphs 1310 to 1320, and the subgraphs are merged as far as possible. This is performed according to the primary partitioning constraints mentioned above.

For example, Figure 13A is transformed into Figure 13B by merging the partitioned subgraphs as far as possible while observing the primary partitioning constraints. The candidates for merging include nodes B, C, E, and F. Nodes A and I are not candidates, because those nodes are not assigned to partitioned locations. Neither node B nor node C can be merged with node E or F, because doing so would introduce a cycle into the graph. Subgraphs 1316 and 1318 also cannot be merged together, because a sending node at a partitioned location cannot be merged with its destination if the destination node is also at a partitioned location. Nodes B and C can be merged, and they are merged into the same subgraph 1313.

For example, Figure 13B is transformed into Figure 13C by expanding the partitioned subgraphs. This expansion includes adding adjacent nodes with unassigned locations to the partitioned subgraphs. Nodes D and G are candidates to be folded into the partitioned subgraphs, because those nodes have unassigned locations and also have edges coupled to nodes within the partitioned subgraphs. It is determined whether nodes D and G have been marked as splittable. If not, those nodes are discarded as candidates. If the nodes are marked as splittable, they are placed into the adjacent subgraphs. Operation D cannot be placed into subgraph 1316 with node E because of the constraint that the set of edges whose destination operation is assigned to a partitioned location must remain unchanged. Operation D is added to subgraph 1313. If nodes B and C had not previously been merged into the same subgraph, this would not be possible, because it would not obey the primary partitioning constraints.

By adding node D to subgraph 1313, the operation of node D is effectively partitioned. Therefore, a new operation D' is added to the graph to merge the results from the partitioned instances of D. Similarly, the operation of node G is placed into subgraph 1318 with node F, and a new operation G' is added to the graph. The new operations D' and G' are non-splittable. There are no other operations in Figure 13C that can be placed into partitioned locations.

Figure 13D illustrates assigning locations to all nodes with unassigned locations. This can be performed by copying the location from an assigned node to its neighbors. For example, nodes D', J, G', H, and K have unassigned locations in Figure 13C. Nodes G', H, and K are neighbors of subgraph 1320, which contains node I. Therefore, the location L assigned to node I is also assigned to nodes G', H, and K. Nodes D' and J do not have any adjacent unpartitioned subgraphs, and are therefore assigned to the controller C.

Figures 13E to 13F illustrate an example of merging unpartitioned subgraphs. The total number of subgraphs is minimized by merging together the possible pairs of unpartitioned subgraphs running at the same location. There are three subgraphs (1310, 1315, 1317) assigned to location C. Moreover, the subgraphs of nodes G', H, I, and K are all assigned to location L. All of the subgraphs assigned to location L can be merged into a new subgraph 1322 without introducing a cycle. Subgraph 1310, containing node A, cannot be merged with subgraph 1315 or 1317 without introducing a cycle. However, subgraph 1315, containing node D', and subgraph 1317, containing node J, can be merged. A resulting graph is illustrated in Figure 13F. This graph cannot be merged further. Every operation has been assigned to a subgraph, and the number of subgraphs has been minimized. Any further merge would break one of the primary partitioning constraints.

Blocking operations can be split into their own subgraphs, creating local graphs that execute locally on the same machine. During local partitioning, locations have already been assigned. Furthermore, no special consideration is needed for partitioned locations, which can be split during the local partitioning phase. Local partitioning must observe the local partitioning constraints mentioned above. These constraints require that each blocking operation ends up in its own subgraph, that the split produces only one subgraph with external/non-local inputs, and that the graph remains acyclic. Ensuring that the split produces only one subgraph with external inputs enables more send and receive optimizations and simplifies the programming protocol. In the graph, an external/non-local input is represented by an edge between nodes that have been assigned different locations. During execution of the program, an external edge results in possible communication between the nodes.

In Figure 14A, the subgraphs containing blocking operations are identified. In this example, operations B and D are the only blocking operations in the program. Therefore, subgraph 1413, containing nodes B, C, and D, will be split.

In Figures 14B to 14C, subgraph 1413 of Figure 14A is modified to enforce the local partitioning constraints. A first modification, shown in Figure 14B, ensures that there is no blocking operation with an external input in the subgraph. Blocking operations with external inputs would make it difficult or impossible to enforce the local partitioning constraints, which require that each blocking operation ends up in its own subgraph and that the graph remains acyclic. The first modification simply inserts a no-op before the blocking operation. A no-op is an operation that does not change the semantics of the program when inserted between two operations. One example of a no-op is interleave, which passes data from the node before it to the node after it. Since blocking operation B has an external input from node A, a no-op operation 1432 is inserted between nodes A and B.

A second modification, shown in Figure 14C, ensures that there is no blocking operation with an external output in the subgraph. This prepares the subgraph for a final splitting step, in which send and receive operations are inserted along those outputs, and it ensures that the send operation does not end up in the same subgraph as the blocking operation. Therefore, the second modification inserts another no-op operation 1434 between nodes D and D'.

Figure 14D illustrates building a merge graph in which every operation ends up in its own subgraph. As shown, the merge graph 1450 contains subgraphs 1452, 1453, 1454, 1456, and 1458, each of which contains one operation.

In Figure 14E, the operations with external incoming edges are identified and merged together into the same subgraph. Since node C and the first no-op operation both have external edges incoming from node A, which is outside the merge graph 1450, subgraphs 1452 and 1454 of Figure 14D are merged into subgraph 1455 of Figure 14E.

Subgraphs with non-blocking operations are merged as far as possible. In Figure 14E, there are two subgraphs 1455, 1458 containing no-op operations. However, merging those two subgraphs 1455, 1458 would introduce a cycle and is therefore not permitted. Since subgraph 1453, containing node B, and subgraph 1456, containing node D, contain blocking operations that cannot be merged with any other subgraph, local partitioning is complete.

Figure 15 illustrates the final program after primary partitioning and local partitioning. Locations have been assigned in such a way as to minimize the traffic sent across the network from one location to another. In addition, precautions have been taken to ensure efficiency, for example by preventing cycles between nodes and by optimizing the sends and receives between nodes.

Figure 16 provides a flowchart illustrating an example method 1600 of graph construction and partitioning for a constructed program. Certain portions of method 1600 are explained in further detail in conjunction with Figures 17 to 18. Each of the methods set forth herein includes portions that may be performed in a different order or simultaneously, and may include additional portions, while other portions may be omitted.

In block 1610, a directed acyclic graph is created containing nodes that represent the operations of the program. The nodes in the graph are joined by edges that represent the data streams flowing from one operation to another. Some of the operations may have predefined locations. For example, these locations may be determined by the nature of the operation, the capabilities of the computing devices in the distributed environment, a programmer assignment, or any other information.

In block 1620, locations are assigned to any operations that do not have a predefined location. Locations may be assigned based on a first set of constraints, such as the primary partitioning constraints described above. In some examples, the first set of constraints requires that all operations be placed into a subgraph, that a location be assigned to each subgraph, that an operation with a programmer-assigned location keep that assignment, and that an unassigned operation be placed into a partitioned location only if it has the splittable property. In addition, if a location is partitioned, all edges in the program whose destination operation is assigned to that location remain unchanged throughout the algorithm. Furthermore, the graph must be acyclic. Locations may be assigned, for example, based on adjacent nodes. For example, operations with unassigned locations may be added to adjacent partitioned subgraphs in accordance with the constraints. Any other operations with unassigned locations may be assigned locations matching adjacent unpartitioned nodes. Location assignment may be part of the graph partitioning.
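As a concrete illustration of checking several of these constraints, the sketch below validates a candidate partitioning against them. The data model (dictionaries mapping operations to subgraphs and subgraphs to locations) is an assumption made for this example, not the patent's actual representation.

```python
# Sketch: validate a candidate partitioning against a subset of the primary constraints.
from typing import Dict, List, Optional, Set, Tuple

def has_cycle(edges: List[Tuple[str, str]]) -> bool:
    graph: Dict[str, Set[str]] = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
    visiting: Set[str] = set()
    done: Set[str] = set()
    def visit(n: str) -> bool:
        if n in done:
            return False
        if n in visiting:
            return True
        visiting.add(n)
        found = any(visit(m) for m in graph.get(n, ()))
        visiting.discard(n)
        done.add(n)
        return found
    return any(visit(n) for n in list(graph))

def validate(ops: Set[str],
             subgraph_of: Dict[str, str],            # operation -> subgraph
             location_of: Dict[str, Optional[str]],  # subgraph -> location
             programmer_assigned: Dict[str, str],    # operation -> required location
             subgraph_edges: List[Tuple[str, str]]) -> List[str]:
    errors: List[str] = []
    for op in ops:
        if op not in subgraph_of:
            errors.append(f"{op} is not placed in any subgraph")
    for sg in set(subgraph_of.values()):
        if location_of.get(sg) is None:
            errors.append(f"subgraph {sg} has no location")
    for op, required in programmer_assigned.items():
        actual = location_of.get(subgraph_of.get(op, ""), None)
        if actual != required:
            errors.append(f"{op} must stay at {required}, found {actual}")
    if has_cycle(subgraph_edges):
        errors.append("subgraph graph contains a cycle")
    return errors

print(validate(ops={"A", "B"},
               subgraph_of={"A": "G0", "B": "G1"},
               location_of={"G0": "C", "G1": "L"},
               programmer_assigned={"A": "C"},
               subgraph_edges=[("G0", "G1")]))   # [] means the constraints hold
```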

In block 1630, the graph is partitioned into a plurality of subgraphs, where the operations in a subgraph are assigned to the same location. The partitioning of block 1630 is explained in further detail in conjunction with Figure 17.

In block 1640, local partitioning is performed for individual subgraphs based on a second set of constraints, such as the local partitioning constraints discussed above. Local partitioning is further explained in conjunction with Figure 18.

In block 1650, each subgraph is executed at its respective location. Individual subgraphs are executed in a single respective thread. Program execution is discussed more fully in Section IV below.

Figure 17 provides a flowchart illustrating a method 1700 of primary graph partitioning. In block 1710, partitioned subgraphs are merged as far as possible while observing the primary partitioning constraints. An example is discussed above in conjunction with Figure 13B.

In block 1720, nodes with unassigned locations are added, wherever possible, to adjacent partitioned subgraphs, while observing the primary partitioning constraints. An example is discussed above in conjunction with Figure 13C. In some instances, this may include creating an additional operation. For example, where adding a node with an unassigned location to an adjacent partitioned subgraph effectively partitions that node, a new operation is added to the graph outside the subgraph to merge the results from the partitioned operation.

In block 1730, locations are assigned to any remaining nodes with unassigned locations. Locations may be assigned based on locations previously assigned to adjacent nodes, while observing the primary partitioning constraints. An example is discussed above in conjunction with Figure 13D.

In block 1740, the possible pairs of unpartitioned subgraphs running at the same location are merged. An example is discussed above in conjunction with Figures 13E to 13F.

Figure 18 provides a flowchart illustrating an example method 1800 of local partitioning. In block 1810, the subgraphs that need to be split are identified. For example, subgraphs that contain one or more blocking operations may be the ones to be split.

In block 1820, the identified subgraphs are prepared for splitting. For example, preparation may include making modifications to ensure that there are no blocking operations with external inputs into the subgraph, and that there are no blocking operations with external outputs in the subgraph. These modifications may include adding operations to the subgraph, as discussed above in conjunction with Figures 14B to 14C.

In block 1830, a merge graph is built in which every operation ends up in a separate subgraph. An example is discussed above in conjunction with Figure 14D.

In block 1840, the separate subgraphs are repeatedly merged, without breaking any of the local partitioning constraints, until no further merge can be performed. This method may be repeated for each relevant subgraph in the graph.

IV. Executing a program across the network

Once partitioned, the graph can be executed. Each of the subgraphs is executed at its respective location, and each subgraph is executed in a respective single thread. Data transfer along the edges within a subgraph is optimized based on its execution in a single-threaded environment.

When the program is executed, it is determined whether a stream sent through the graph has completed. To make this determination, an end node receives a token from each other node that sends tuples to that end node, the token indicating that the other node has finished providing input. The other node may, for example, be a partitioned node, or simply a partition. The end node adds the tokens together, and when the sum of the tokens equals the number of other nodes providing input, the end node determines that the stream has completed.

When the program is submitted for execution, one or more activations of each graph are created. There is one activation for a unique graph. A unique graph contains a plurality of nodes that each run exactly once. An example of a unique graph is provided in Figure 19. In this example, each of nodes A, B*, C*, and D runs once, with the stream from A being input to B*, that stream being input to C*, and that stream being input to D.

A non-unique graph can have any number of activation replicas executing, such that the input is split and sent to any of these executions and the outputs are merged together. An example of a non-unique graph is provided in Figure 20. There are multiple replicas of operations B and C, with the input from node A being split among the replicas of node B, and so on. For example, nodes B1, B2, and B3 are activations of the same operation B. Similarly, nodes C1 and C2 are activations of the same operation C.

When an activation is initialized, each node locally tracks the number of upstream nodes (sending nodes) and downstream nodes (receiving nodes) to which it is connected. Nodes between the initial sending node and the final receiving node can act as both sending and receiving nodes, receiving information from one or more nodes, performing an operation, and transmitting the information to one or more other nodes. Every value sent through a stream is a tuple. Different operations in the same program may run on different machines. The programming model coordinates the execution of these operators on the different machines and propagates data from one operator to another.

Because the operators run on different machines and therefore at different nodes of the graph, portions of the program run in parallel. To determine whether a particular stream has completed, the destination node sums the token values received from the partition's upstream operators. For example, when the input to a sending node ends, the sending node transmits a token value (for example, 1) to every node to which it has transmitted information. When the destination node has received token values totaling the number of sending nodes to which it is connected, the destination node determines that the stream has ended. The destination node can then take an action, such as generating an end signal or marking the stream as completed. In one example, the destination node sends a completion token to other downstream nodes.

Figure 21 illustrates an example of sending token values to signal the completion of a stream. Each of nodes B0, B1, and B2 receives input from a node A. When node A has finished sending the stream, node A sends a token value (such as 1) to each of the connected downstream nodes B0, B1, B2. Nodes B0, B1, B2 each wait to receive token values equal to the number of senders (in this case, 1). Each of nodes B0, B1, B2 in turn sends a stream to a single destination node C. Destination node C knows that it receives input from three different nodes B0, B1, B2, and therefore waits to receive token values totaling 3. When nodes B0, B1, B2 finish sending their streams, they each send a token value to the connected downstream node, i.e., destination node C. Destination node C sums the received token values and compares the sum with the number of nodes from which it receives input. When the numbers are equal, node C marks itself as completed.
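The exchange in Figure 21 can be mirrored in a few lines of Python. The Receiver class and its method names below are invented for illustration; the sketch only shows a receiving node counting tokens against the number of senders it expects.

```python
# Sketch of end-of-stream detection by token counting, mirroring Figure 21.
class Receiver:
    def __init__(self, name: str, expected_senders: int):
        self.name = name
        self.expected = expected_senders   # tracked locally at activation time
        self.tokens = 0
        self.done = False

    def on_tuple(self, value) -> None:
        pass                               # normal processing would happen here

    def on_token(self, value: int = 1) -> None:
        self.tokens += value
        if self.tokens == self.expected:   # every upstream sender has finished
            self.done = True               # e.g. emit an end signal downstream

# Node C receives from B0, B1 and B2, so it waits for three tokens.
c = Receiver("C", expected_senders=3)
for sender in ("B0", "B1", "B2"):
    c.on_tuple(f"last tuple from {sender}")
    c.on_token(1)                          # each sender signals completion once
print(c.done)                              # True
```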

Figure 22 illustrates an example in which a sending node sends a stream to only a subset of the receiving nodes to which it is connected. For example, nodes B and C are partitioned, but only some of the partition pairs communicate. B0 may contact only partition C0 and not partition C1, while partition B1 contacts only partition C1 and not partition C0. In this scenario, the sending node can generate a list of all the receiving partitions with which it has communicated, and can further provide this list to a controller 2250. For example, the list may be included in a message indicating that the sending partition has completed its transmission of the stream. The controller tracks all receiving partitions that have started processing. If a particular partition has started but is not present in the list, the controller assumes responsibility for that particular partition and sends it a token value on behalf of the sending partition.

Figure 23 illustrates an example in which some sending nodes may never start processing. For example, nodes C and D may be skipped by node B, with tuples provided directly to destination node E from a sending node preceding the skipped nodes. Another possibility is that nodes B, C, and D are all skipped by node A, which provides tuples directly to destination node E. In either case, the controller takes over responsibility for delivering the tokens from all partitions of the graph. The controller simulates the execution of the skipped nodes B, C, and D, and delivers token values to the downstream receiver (node E) on behalf of these sending nodes that never started. The downstream receiver can sum the token values to determine whether the stream has completed.
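Figures 22 and 23 both reduce to the controller delivering tokens on behalf of sender partitions that never contacted a given receiver, or never started at all. The following Python sketch is a loose illustration of that bookkeeping; the Controller interface shown here is an assumption for this example, not the patent's.

```python
# Sketch: the controller fills in tokens for sender partitions that finished
# without contacting some receivers, or that never started.
from typing import Dict, List, Set

class Controller:
    def __init__(self, all_receivers: Set[str]):
        self.all_receivers = all_receivers          # receiving partitions that started
        self.token_counts: Dict[str, int] = {r: 0 for r in all_receivers}

    def deliver_token(self, receiver: str) -> None:
        self.token_counts[receiver] += 1            # stands in for a network send

    def sender_finished(self, contacted: List[str]) -> None:
        # The sender reports which receiving partitions it actually talked to;
        # it sends its own token to those, and the controller covers the rest.
        for receiver in self.all_receivers - set(contacted):
            self.deliver_token(receiver)

    def simulate_skipped_sender(self) -> None:
        # A sender that never started sends nothing at all, so the controller
        # delivers its token to every receiver on its behalf (as in Figure 23).
        self.sender_finished(contacted=[])

controller = Controller(all_receivers={"C0", "C1"})
controller.sender_finished(contacted=["C0"])   # B0 only ever talked to C0
controller.simulate_skipped_sender()           # a sender that was skipped entirely
print(controller.token_counts)                 # {'C0': 1, 'C1': 2} (order may vary)
```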

As mentioned above, the graphs used for a program may be unique or non-unique, where a partition of a unique graph runs once and a non-unique graph can have any number of replicas executing for one or more partitions. For example, in a non-unique graph, the input is split among the replicas and the outputs are merged together. In graphs where each sending node is not uniquely paired with one receiving node, the receiving node can locally track the number of sending nodes, and when the receiving node has received a number of tokens equal to the number of sending nodes, the receiving node determines that a stream has completed.

Each sending operation partition may have its own non-unique receiver. In this case, the sender sends only a single 1, and the corresponding non-unique receiver completes when it receives that 1. An optimization can be introduced in which the same execution of a non-unique receiver is shared across multiple senders. The number of senders is then tracked locally by the receiver, and the non-unique receiver completes when it has received tokens equal to the number of senders.

In other examples, a non-unique sender may send a stream to a unique receiver. Each non-unique sender sends a token value of 1 to the unique receiver, and the unique receiver waits for token values totaling the number of non-unique senders. When a non-unique sender completes, it sends the list of receiving partitions to which it has sent tokens, and the controller is responsible for delivering the remaining tokens to each partition. In other examples, the controller may deliver all tokens for the non-unique senders to each receiving partition.

In some examples, certain streams may be broadcast to all activations of a graph. However, the set of activations is not known at the time the program is constructed; the set is built up incrementally as program execution proceeds. Figure 24 illustrates an example in which a non-unique graph G receives, on partition R1, a regular input X that can cause multiple activations of G. For example, G may be partitioned (including partitions R1 and R2), or the sender S1 to R1 may have multiple activations that reach different replicas of G. Another input Y should be broadcast to every replica of G. For each such broadcast, a dynamic send operation S2 is introduced. The dynamic send operation S2 has two input streams: a data input stream from Y and an activation input stream from the controller. The data input stream is a normal tuple stream that should be sent to all activations of the destination graph G. The activation input stream contains the activations at which tuples are to arrive, as new activations of the destination graph are detected. For example, when a replica of a particular graph is executed, an identifier is sent to the controller, which routes the activation to the appropriate dynamic send operation. The dynamic send operation maintains a set of registered activations and also maintains a buffer of all input data received. When the data input ends, an end signal is sent from the dynamic send operation to all registered and new activations, and also to new activations that arrive later. When the activation input ends, the buffer of input data can be discarded.
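A rough picture of such a dynamic send operation follows. The class below buffers data tuples, forwards them to every activation registered so far, replays the buffer to activations that register later, and drops the buffer once the activation input ends. All names and interfaces are assumptions made for this sketch.

```python
# Sketch of a dynamic send operation with a data input stream and an
# activation input stream, in the spirit of Figure 24.
from typing import Callable, Dict, List

class DynamicSend:
    def __init__(self):
        self.activations: Dict[str, Callable[[object], None]] = {}
        self.buffer: List[object] = []     # everything received so far
        self.data_done = False

    def on_activation(self, name: str, deliver: Callable[[object], None]) -> None:
        # A newly detected activation of the destination graph: replay the
        # buffered tuples, and send the end signal if the data already ended.
        self.activations[name] = deliver
        for tup in self.buffer:
            deliver(tup)
        if self.data_done:
            deliver("END")

    def on_data(self, tup: object) -> None:
        self.buffer.append(tup)
        for deliver in self.activations.values():
            deliver(tup)

    def on_data_end(self) -> None:
        self.data_done = True
        for deliver in self.activations.values():
            deliver("END")

    def on_activation_end(self) -> None:
        self.buffer.clear()                # no more activations can arrive

seen: Dict[str, List[object]] = {"R1": [], "R2": []}
s2 = DynamicSend()
s2.on_activation("R1", seen["R1"].append)
s2.on_data("y1")
s2.on_data("y2")
s2.on_activation("R2", seen["R2"].append)  # a late activation gets the replay
s2.on_data_end()
s2.on_activation_end()
print(seen)   # {'R1': ['y1', 'y2', 'END'], 'R2': ['y1', 'y2', 'END']}
```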

Figure 25 provides a flowchart illustrating an example method 2500 for executing a program over a distributed network, the program being represented by a graph containing a plurality of nodes that represent operations, with edges representing the data streams interconnecting those nodes. As in the examples above, sub-portions of method 2500 may be reordered, supplemented, or reduced.

In block 2510, operations are performed by one or more first partitions. For example, the first partitions may be sending partitions in a graph.

In block 2520, tuples based on the performed operations are sent from the one or more first partitions to at least one second partition, such as a receiving partition.

In block 2530, when the one or more first partitions have finished sending tuples, each of the one or more first partitions sends a token value to the at least one second partition. For example, the token value may be 1. The one or more first partitions may further note locally that transmission of the tuples is complete.

In block 2540, the at least one second partition sums the received token values and determines whether the sum matches the number of the one or more first partitions. For example, the at least one second partition may know that it receives input from three sending partitions. It therefore waits until it has received a total of three tokens before it considers the stream complete.

In block 2550, the at least one second partition takes an action in response to determining that the sum of the token values matches the number of the one or more first partitions. The action may be, for example, making a local note, sending a message and/or a token value to another downstream node, and so on.

The techniques described above provide fast and efficient program execution. In addition, the described techniques can be adapted to various types of partitioned and pipelined programs. Furthermore, these techniques can be applied while a program is being written, and therefore adapt dynamically to changes in the program. The new programming model supports an unlimited number of partitions, with each partition transmitting tuples across the database to other partitions. Although only certain subsets of the partitions may actually run for a particular program, the controller compensates for the partitions that do not run, without imposing a significant burden on the controller.

Unless otherwise stated, the foregoing alternative examples are not mutually exclusive, but may be implemented in various combinations to achieve unique advantages. Because these and other variations and combinations of the features discussed above can be utilized without departing from the subject matter defined by the claims, the foregoing description of the embodiments should be understood by way of illustration rather than by way of limitation of the subject matter defined by the claims. In addition, the provision of the examples described herein, as well as clauses phrased as "such as", "including", and the like, should not be interpreted as limiting the subject matter of the claims to the specific examples; rather, the examples are intended to illustrate only one of many possible embodiments. Further, the same reference numerals in different drawings may identify the same or similar elements.

110‧‧‧client
120‧‧‧processor
130‧‧‧memory
132‧‧‧instructions
134‧‧‧data
136‧‧‧applications
150‧‧‧network
160‧‧‧data center
162‧‧‧computing device
164‧‧‧computing device
170‧‧‧data center
172‧‧‧computing device
180‧‧‧data center
181‧‧‧computing device
182‧‧‧computing device
183‧‧‧computing device
184‧‧‧computing device
185‧‧‧computing device
186‧‧‧computing device
190‧‧‧controller
192‧‧‧memory
194‧‧‧data
196‧‧‧instructions
198‧‧‧processor
210‧‧‧input operation
215‧‧‧stream
220‧‧‧ListImages operation/ListImages
225‧‧‧stream
226‧‧‧cache lookup operation
227‧‧‧stream
228‧‧‧stream
230‧‧‧lookup operator/lookup
235‧‧‧stream
240‧‧‧thumbnail operation
245‧‧‧generated thumbnail
250‧‧‧output
500‧‧‧type inference method
510‧‧‧block
520‧‧‧block
530‧‧‧block
540‧‧‧block
550‧‧‧block
560‧‧‧block
610‧‧‧argument
620‧‧‧lookup
630‧‧‧filter
640‧‧‧result
710‧‧‧argument
720‧‧‧lookup
730‧‧‧filter
740‧‧‧result
815‧‧‧edge
825‧‧‧edge
835‧‧‧edge
920‧‧‧lookup
930‧‧‧process
940‧‧‧lookup
1020‧‧‧lookup
1030‧‧‧lookup
1110‧‧‧first input argument
1115‧‧‧edge
1125‧‧‧edge
1130‧‧‧first lookup operation/lookup operation
1135‧‧‧edge
1140‧‧‧second lookup operation/lookup operation
1145‧‧‧edge
1150‧‧‧ZipAny operation/ZipAny
1155‧‧‧edge
1160‧‧‧select operation/select
1165‧‧‧edge
1170‧‧‧result
1211‧‧‧edge
1212‧‧‧edge
1213‧‧‧edge
1214‧‧‧edge
1215‧‧‧edge
1216‧‧‧edge
1217‧‧‧edge
1218‧‧‧edge
1219‧‧‧edge
1220‧‧‧edge
1221‧‧‧edge
1222‧‧‧edge
1310‧‧‧subgraph
1312‧‧‧subgraph
1313‧‧‧subgraph
1315‧‧‧subgraph
1316‧‧‧subgraph
1317‧‧‧subgraph
1318‧‧‧subgraph
1320‧‧‧subgraph
1322‧‧‧new subgraph
1413‧‧‧subgraph
1432‧‧‧No-op operation
1434‧‧‧No-op operation
1450‧‧‧merged graph
1452‧‧‧subgraph
1453‧‧‧subgraph
1454‧‧‧subgraph
1455‧‧‧subgraph
1456‧‧‧subgraph
1458‧‧‧subgraph
1600‧‧‧method
1610‧‧‧block
1620‧‧‧block
1630‧‧‧block
1640‧‧‧block
1650‧‧‧block
1700‧‧‧method
1710‧‧‧block
1720‧‧‧block
1730‧‧‧block
1740‧‧‧block
1800‧‧‧method
1810‧‧‧block
1820‧‧‧block
1830‧‧‧block
1840‧‧‧block
2500‧‧‧method
2510‧‧‧block
2520‧‧‧block
2530‧‧‧block
2540‧‧‧block
2550‧‧‧block
A‧‧‧node
B‧‧‧node/operation/blocking operation
B0‧‧‧node/downstream node
B1‧‧‧node/downstream node/partition
B2‧‧‧node/downstream node
B3‧‧‧node
C‧‧‧destination node/node/location/controller/operation
C0‧‧‧partition
C1‧‧‧partition/node
C2‧‧‧node
D‧‧‧node/operation
D’‧‧‧new operation/node
E‧‧‧node/destination node
F‧‧‧node
G‧‧‧node/non-unique graph/destination graph
G’‧‧‧new operation/node
G0‧‧‧separate graph/graph
G1‧‧‧separate graph/graph
G2‧‧‧separate graph
G3‧‧‧separate graph
H‧‧‧node
I‧‧‧node
J‧‧‧node
K‧‧‧node
L‧‧‧location
R1‧‧‧partition
S1‧‧‧stream/sender
S2‧‧‧stream/dynamic send operation
SL‧‧‧location/partitioned location
X‧‧‧regular input
Y‧‧‧input

FIG. 1 is a block diagram of an example system according to aspects of the present invention.
FIGS. 2A-2B illustrate an example of building a program using the programming model according to aspects of the present invention.
FIG. 3 is a chart listing examples of built-in operations of the programming model according to aspects of the present invention.
FIG. 4 is a chart listing examples of output type annotations for operations according to aspects of the present invention.
FIG. 5 provides a flow diagram illustrating an example type inference method according to aspects of the present invention.
FIGS. 6A-6C illustrate an example of location assignment during program building according to aspects of the present invention.
FIGS. 7A-7B illustrate an example of automatic location assignment during graph building according to aspects of the present invention.
FIGS. 8A-8C illustrate an example of splitting a graph so as to minimize a number of subgraphs according to aspects of the present invention.
FIGS. 9A-9D provide an example of graph splitting for partitioned locations according to aspects of the present invention.
FIGS. 10A-10B provide an example of common locations according to aspects of the present invention.
FIGS. 11A-11C illustrate an example of a program having multiple input operations according to aspects of the present invention.
FIG. 12 illustrates an example program according to aspects of the present invention.
FIGS. 13A-13F illustrate an example of main splitting of the program of FIG. 12 according to aspects of the present invention.
FIGS. 14A-14E illustrate an example of local splitting of the program of FIG. 12 according to aspects of the present invention.
FIG. 15 illustrates the program of FIG. 12 after main splitting and local splitting have been performed, according to aspects of the present invention.
FIG. 16 provides a flow diagram illustrating an example method of graph building and splitting according to aspects of the present invention.
FIG. 17 provides a flow diagram illustrating a method of main graph splitting.
FIG. 18 provides a flow diagram illustrating an example method of local splitting.
FIG. 19 is a pictorial illustration of an example of a unique graph according to aspects of the present invention.
FIG. 20 is a pictorial illustration of an example non-unique graph according to aspects of the present invention.
FIG. 21 illustrates an example of sending token values to signal the completion of a stream according to aspects of the present invention.
FIG. 22 illustrates an example in which a sending node sends a stream to only a subset of the receiving nodes to which it is connected.
FIG. 23 illustrates an example of determining the completion of a stream when there are sending nodes that have not been started.
FIG. 24 illustrates an example of determining the completion of a stream when there are multiple activations of a graph.
FIG. 25 provides a flow diagram illustrating an example method 2500 for executing a program over a distributed network.

Claims (20)

1. A method for executing a program in a distributed architecture, comprising: performing, by one or more first partitions of the distributed architecture, one or more operations; sending tuples from the one or more first partitions to at least one second partition, the tuples being part of a stream and based on the one or more operations; when the sending of the tuples in the stream is complete, sending a token value from each of the one or more first partitions to the at least one second partition; determining, by the second partition, whether a total of the token values matches a number of the one or more first partitions; and taking a first action in response to determining that the total of the token values matches the number of the one or more first partitions.

2. The method of claim 1, wherein the at least one second partition is a receiving partition of one of the one or more first partitions, the method further comprising: generating, by the one of the one or more first partitions, a list of the receiving partitions with which the one or more first partitions communicate; and transmitting, by the one of the one or more first partitions, the list to a controller.

3. The method of claim 2, further comprising: tracking, by the controller, all receiving partitions that have started processing; determining, by the controller, whether one or more of the receiving partitions that have started processing are absent from the list; and for each receiving partition that has started processing and is absent from the list, sending, by the controller, a token value representing the one of the one or more first partitions to that receiving partition.

4. The method of claim 1, further comprising: determining, by a controller, whether any partitions have not started processing; determining, by the controller, whether the partitions that have not started processing were intentionally skipped by design of the program; and sending, by the controller, a token value to the second partition on behalf of any intentionally skipped partition that has not started processing.

5. The method of claim 1, wherein taking the first action comprises at least one of marking the stream as complete or generating a message indicating that the stream is complete.

6. The method of claim 1, further comprising: constructing a graph, wherein each node of the graph represents a partition; and verifying, based on the graph, whether the program will be accurately executed across the distributed architecture.

7. The method of claim 6, further comprising dynamically constructing activations of the graph when the program is executed.

8. The method of claim 7, further comprising: sending, by a dynamic send operation, a data input stream to all activations of a destination graph; and receiving, at the dynamic send operation, new tuples from a controller, the new tuples being received when additional activations of the destination graph are detected.

9. The method of claim 6, wherein the graph is non-unique.

10. The method of claim 1, wherein performing the one or more operations is part of a pipelined data processing flow.

11. A system, comprising: one or more first partitions in a distributed computing environment; and at least one second partition in the distributed computing environment, the at least one second partition being remote from the one or more first partitions; wherein the one or more first partitions are configured to: perform one or more operations; send tuples to the at least one second partition, the tuples being part of a stream and based on the one or more operations; and when the sending of the tuples in the stream is complete, send a token value to the at least one second partition; and wherein the at least one second partition is configured to: determine whether a total of the token values matches a number of the one or more first partitions; and take a first action in response to determining that the total of the token values matches the number of the one or more first partitions.

12. The system of claim 11, further comprising a controller, wherein the at least one second partition is a receiving partition of one of the one or more first partitions, and wherein the one or more first partitions are further configured to: generate a list of the receiving partitions with which the one or more first partitions communicate; and transmit the list to the controller.

13. The system of claim 12, wherein the controller is configured to: track all receiving partitions that have started processing; determine whether one or more of the receiving partitions that have started processing are absent from the list; and for each receiving partition that has started processing and is absent from the list, send a token value representing the one of the one or more first partitions to that receiving partition.

14. The system of claim 11, wherein a controller is configured to: determine whether any partitions have not started processing; determine whether the partitions that have not started processing were intentionally skipped by design of a program; and send a token value to the second partition on behalf of any intentionally skipped partition that has not started processing.

15. The system of claim 11, wherein taking the first action comprises at least one of marking the stream as complete or generating a message indicating that the stream is complete.

16. The system of claim 11, further comprising a client device in communication with at least one of the one or more first partitions, the at least one second partition, or a controller, the client device being configured to: construct a graph, wherein each node of the graph represents a partition; and verify, based on the graph, whether a program will be accurately executed across the distributed architecture.

17. The system of claim 16, wherein the client device is further configured to dynamically construct activations of the graph when the program is executed.

18. The system of claim 17, further comprising a dynamic send operation executing on a computing device in the distributed architecture, wherein the dynamic send operation: sends a data input stream to all activations of a destination graph; and receives new tuples from the controller, the new tuples being received when additional activations of the destination graph are detected.

19. The system of claim 16, wherein the graph is non-unique.

20. The system of claim 11, wherein performing the one or more operations is part of a pipelined data processing flow.
TW107116380A 2017-08-24 2018-05-15 Method of executing a tuple graph program across a network TWI710913B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US15/685,022 US10887235B2 (en) 2017-08-24 2017-08-24 Method of executing a tuple graph program across a network
US15/685,022 2017-08-24

Publications (2)

Publication Number Publication Date
TW201913404A true TW201913404A (en) 2019-04-01
TWI710913B TWI710913B (en) 2020-11-21

Family

ID=62567816

Family Applications (1)

Application Number Title Priority Date Filing Date
TW107116380A TWI710913B (en) 2017-08-24 2018-05-15 Method of executing a tuple graph program across a network

Country Status (5)

Country Link
US (1) US10887235B2 (en)
EP (1) EP3673369B1 (en)
CN (1) CN110945481B (en)
TW (1) TWI710913B (en)
WO (1) WO2019040139A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11843656B2 (en) * 2017-09-27 2023-12-12 Iguazio Systems Ltd. Dynamic application mobilization
KR20200083048A (en) * 2018-12-31 2020-07-08 삼성전자주식회사 Neural network system predicting polling time and neural network model processing method using the same
US11256546B2 (en) * 2019-07-02 2022-02-22 Nokia Technologies Oy Methods, apparatuses and computer readable mediums for network based media processing
US11550554B2 (en) * 2021-01-07 2023-01-10 Microsoft Technology Licensing, Llc Merged machine-level intermediate representation optimizations

Family Cites Families (81)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0390563A3 (en) 1989-03-31 1992-12-02 Matsushita Electric Industrial Co., Ltd. Fuzzy multi-stage inference apparatus
US5748966A (en) 1994-12-30 1998-05-05 The Trustees Of The University Of Pennsylvania Type error checker for type-free or polymorphic computer language
US5630051A (en) 1995-03-06 1997-05-13 Motorola Inc. Method and apparatus for merging hierarchical test subsequence and finite state machine (FSM) model graphs
JPH08278890A (en) 1995-04-05 1996-10-22 Sharp Corp Evolution adaptive type inference knowledge extracting device and analyzing device for point-of sales data using the same
US6202202B1 (en) 1996-09-24 2001-03-13 Microsoft Corporation Pointer analysis by type inference for programs with structured memory objects and potentially inconsistent memory object accesses
US6014518A (en) 1997-06-26 2000-01-11 Microsoft Corporation Terminating polymorphic type inference program analysis
US6018628A (en) 1998-06-16 2000-01-25 Sun Microsystems, Inc. Method of implementing parameterized types to be compatible with existing unparameterized libraries
US6292938B1 (en) 1998-12-02 2001-09-18 International Business Machines Corporation Retargeting optimized code by matching tree patterns in directed acyclic graphs
CA2288614C (en) 1999-11-08 2004-05-11 Robert J. Blainey Loop allocation for optimizing compilers
JP2004506262A (en) 2000-08-04 2004-02-26 イントリンジック グラフィックス, インコーポレイテッド Graphic hardware and software development
US7010789B1 (en) 2000-09-29 2006-03-07 International Business Machines Corporation Independent net task identification for efficient partition and distribution
US7120902B2 (en) 2000-12-04 2006-10-10 Hewlett-Packard Development Company, L.P. Method and apparatus for automatically inferring annotations
AU2003270863A1 (en) 2002-09-30 2004-04-23 Advent Networks, Inc. Implementing request/reply programming semantics using publish/subscribe middleware
US7496892B2 (en) 2003-05-06 2009-02-24 Andrew Nuss Polymorphic regular expressions
US7636917B2 (en) 2003-06-30 2009-12-22 Microsoft Corporation Network load balancing with host status information
US7567504B2 (en) 2003-06-30 2009-07-28 Microsoft Corporation Network load balancing with traffic routing
US20050177578A1 (en) 2004-02-10 2005-08-11 Chen Yao-Ching S. Efficient type annontation of XML schema-validated XML documents without schema validation
GB2416048A (en) 2004-07-10 2006-01-11 Hewlett Packard Development Co Inferring data type in a multi stage process
US7970730B2 (en) 2005-01-27 2011-06-28 Microsoft Corporation Efficient data access via runtime type inference
US10002325B2 (en) 2005-03-30 2018-06-19 Primal Fusion Inc. Knowledge representation systems and methods incorporating inference rules
US7426503B2 (en) 2005-04-25 2008-09-16 Microsoft Corporation System and method for an improved type inference
US8473971B2 (en) 2005-09-06 2013-06-25 Microsoft Corporation Type inference and type-directed late binding
US7958493B2 (en) 2006-01-20 2011-06-07 Kevin Edward Lindsey Type inference system and method
US7720779B1 (en) 2006-01-23 2010-05-18 Quantum Leap Research, Inc. Extensible bayesian network editor with inferencing capabilities
US8429218B2 (en) 2006-04-06 2013-04-23 International Business Machines Corporation Process restart on a compute node
EP1933507A1 (en) * 2006-12-15 2008-06-18 Ubiwave Low-power multi-hop networks
US8396827B2 (en) 2006-12-29 2013-03-12 Sap Ag Relation-based hierarchy evaluation of recursive nodes
US8752032B2 (en) 2007-02-23 2014-06-10 Irdeto Canada Corporation System and method of interlocking to protect software-mediated program and device behaviours
US7856246B2 (en) 2007-03-21 2010-12-21 Nokia Corporation Multi-cell data processor
CA2706119A1 (en) 2007-11-08 2009-05-14 Antoine Blondeau Distributed network for performing complex algorithms
US8332385B2 (en) 2008-03-11 2012-12-11 Semmle Limited Approximating query results by relations over types for error detection and optimization
WO2010107327A1 (en) 2009-03-20 2010-09-23 Syl Research Limited Natural language processing method and system
US8555265B2 (en) * 2010-05-04 2013-10-08 Google Inc. Parallel processing of data
US8549502B2 (en) 2010-06-21 2013-10-01 Microsoft Corporation Compiler with user-defined type inference rules
TW201237639A (en) 2010-12-10 2012-09-16 Microsoft Corp Back-end constrained delegation model
US8832111B2 (en) * 2010-12-30 2014-09-09 Facebook, Inc. Distributed cache for graph data
US20130139164A1 (en) 2011-11-28 2013-05-30 Sap Ag Business Process Optimization
US8965921B2 (en) * 2012-06-06 2015-02-24 Rackspace Us, Inc. Data management and indexing across a distributed database
US20140058913A1 (en) * 2012-08-24 2014-02-27 International Business Machines Corporation Graph partitioning for dynamic securitization
US8843887B2 (en) 2012-08-31 2014-09-23 Oracle International Corporation Fast dispatch predicate for overloaded functions with generic type hierarchies that lack contravariance
US9411558B2 (en) * 2012-10-20 2016-08-09 Luke Hutchison Systems and methods for parallelization of program code, interactive data visualization, and graphically-augmented code editing
US9229983B2 (en) * 2012-11-30 2016-01-05 Amazon Technologies, Inc. System-wide query optimization
CN105122119B (en) * 2012-12-06 2017-06-09 E-视觉有限公司 System, device, and/or the method for image are provided
US9832068B2 (en) * 2012-12-17 2017-11-28 Microsoft Technology Licensing, Llc Reachability-based coordination for cyclic dataflow
US9311149B2 (en) 2012-12-21 2016-04-12 International Business Machines Corporation Processor provisioning by a middleware processing system
US9696974B2 (en) 2013-03-13 2017-07-04 Microsoft Technology Licensing, Llc. Graph-based model for type systems
US10055396B2 (en) 2013-04-12 2018-08-21 Microsoft Technology Licensing, Llc Binding of data source to compound control
US9147010B2 (en) * 2013-04-17 2015-09-29 International Business Machines Corporation Reconfiguring an operator graph based on attribute usage
US9182952B2 (en) 2013-06-04 2015-11-10 Qualcomm Incorporated Automated graph-based programming
US9459986B2 (en) 2013-08-28 2016-10-04 International Business Machines Corporation Automatic generation of analysis-equivalent application constructs
CN103765859B (en) * 2013-10-25 2017-04-12 华为技术有限公司 Method, control device, nodes and system of multipath auxiliary flow control
US9723054B2 (en) * 2013-12-30 2017-08-01 Microsoft Technology Licensing, Llc Hierarchical organization for scale-out cluster
US9313110B2 (en) * 2014-01-22 2016-04-12 International Business Machines Corporation Managing processing branches in an operator graph
US9189212B2 (en) * 2014-03-31 2015-11-17 International Business Machines Corporation Predicted outputs in a streaming environment
US10402453B2 (en) 2014-06-27 2019-09-03 Nuance Communications, Inc. Utilizing large-scale knowledge graphs to support inference at scale and explanation generation
US10025571B1 (en) 2014-07-17 2018-07-17 Google Llc Optimized execution of dynamic languages
US10437843B2 (en) * 2014-07-29 2019-10-08 Microsoft Technology Licensing, Llc Optimization of database queries via transformations of computation graph
US20160065498A1 (en) * 2014-08-26 2016-03-03 rift.IO, Inc. Distributed transaction subsystem
US10303796B2 (en) * 2015-01-09 2019-05-28 Ariba, Inc. Updating distributed shards without compromising on consistency
US9886441B2 (en) * 2015-04-06 2018-02-06 Sap Se Shard aware near real time indexing
US9864779B2 (en) * 2015-06-15 2018-01-09 International Business Machines Corporation Suppressing stream functionality to expedite preferred data
US9823982B1 (en) * 2015-06-19 2017-11-21 Amazon Technologies, Inc. Archiving and restoration of distributed database log records
US10037389B2 (en) * 2015-07-21 2018-07-31 International Business Machines Corporation Dynamic window adjustments in a streaming environment
US10496614B2 (en) * 2015-10-07 2019-12-03 Oracle International Corporation DDL processing in shared databases
WO2017075438A1 (en) 2015-10-28 2017-05-04 Google Inc. Processing computational graphs
US11151446B2 (en) 2015-10-28 2021-10-19 Google Llc Stream-based accelerator processing of computational graphs
US10664249B2 (en) 2015-11-20 2020-05-26 Microsoft Technology Licensing, Llc Verified compilation of reversible circuits
US20170193041A1 (en) 2016-01-05 2017-07-06 Sqrrl Data, Inc. Document-partitioned secondary indexes in a sorted, distributed key/value data store
US9928046B2 (en) * 2016-02-12 2018-03-27 International Business Machines Corporation System and method for dynamic runtime merging of real time streaming operator environments
US10896178B2 (en) * 2016-03-30 2021-01-19 Microsoft Technology Licensing, Llc High performance query processing and data analytics
US10303505B2 (en) * 2016-05-19 2019-05-28 International Business Machines Corporation Adjusting a computing environment for processing a data stream with dummy tuples
US10311158B2 (en) * 2016-06-06 2019-06-04 International Business Machines Corporation Streamlining tuple processing by delivering tuple attributes to associated operators
US10394891B2 (en) * 2016-08-05 2019-08-27 International Business Machines Corporation Distributed graph databases that facilitate streaming data insertion and queries by efficient throughput edge addition
US10380188B2 (en) * 2016-08-05 2019-08-13 International Business Machines Corporation Distributed graph databases that facilitate streaming data insertion and queries by reducing number of messages required to add a new edge by employing asynchronous communication
US10552450B2 (en) * 2016-08-05 2020-02-04 International Business Machines Corporation Distributed graph databases that facilitate streaming data insertion and low latency graph queries
US10417239B2 (en) * 2017-01-13 2019-09-17 International Business Machines Corporation Reducing flow delays in a data streaming application caused by lookup operations
US10706102B2 (en) * 2017-03-06 2020-07-07 International Business Machines Corporation Operation efficiency management with respect to application run-time
US10425313B2 (en) * 2017-04-05 2019-09-24 International Business Machines Corporation Tuple traffic management
US10628492B2 (en) * 2017-07-20 2020-04-21 Microsoft Technology Licensing, Llc Distributed graph database writes
US10642582B2 (en) * 2017-08-24 2020-05-05 Google Llc System of type inference for tuple graph programs method of executing a tuple graph program across a network
US10599482B2 (en) * 2017-08-24 2020-03-24 Google Llc Method for intra-subgraph optimization in tuple graph programs

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI714390B (en) * 2019-12-13 2020-12-21 中華電信股份有限公司 Method for controlling operation flow

Also Published As

Publication number Publication date
EP3673369B1 (en) 2024-07-03
CN110945481A (en) 2020-03-31
TWI710913B (en) 2020-11-21
US20190068504A1 (en) 2019-02-28
CN110945481B (en) 2023-08-08
EP3673369A1 (en) 2020-07-01
WO2019040139A1 (en) 2019-02-28
US10887235B2 (en) 2021-01-05

Similar Documents

Publication Publication Date Title
TWI692692B (en) Method for intra-subgraph optimization in tuple graph programs
TWI710913B (en) Method of executing a tuple graph program across a network
CN106663010B (en) Executing graph-based program specification
KR102361155B1 (en) Compilation of graph-based program specifications with automated clustering of graph components based on the identification of particular data port connections
CN106687921B (en) Specifying components in graph-based programs
CN106687920B (en) Management task invocation
CN106687919B (en) Method, system, and computer-readable medium for controlling execution of a plurality of components
US9892144B2 (en) Methods for in-place access of serialized data
TWI689946B (en) System of type inference for tuple graph programs and method of executing a tuple graph program across a network
Zhang User-friendly and Efficient Distributed Graph Processing
CN115729648A (en) Operator scheduling method, device and system based on directed acyclic graph