This is the C++ API for the Template Task Graph (TTG) programming model for flowgraph-based composition of high-performance algorithms executable on distributed heterogeneous computer platforms. The TTG API abstracts out the details of the underlying task and data flow runtime; the current realization is implemented using MADNESS and PaRSEC runtimes as backends.
- TTG marries the idea of flow programming models with the key innovations in the PARSEC runtime for compact specification of DAGs (PTG).
- TTG can efficiently compose and execute irregular computation patterns which are poorly served by the current programming and execution models.
- TTG has strong support for distributed hybrid architectures for running modern scientific algorithms efficiently on current and near-future supercomputers.
- To try out TTG in a Docker container, install Docker, then execute
bin/docker-build.sh
and follow instructions inbin/docker.md
; - See INSTALL.md to learn how to build and install TTG.
TaskId
(akaKey
): A unique identifier for each task. It should be hashable. For example, if computing a matrix multiplicaion, TaskId could be a triplet of integers identifying the tiles being operated upon.Terminal
: Input and output arguments are exposed by the runtime as terminals. Input terminal is a single assignment variable and is used by the runtime to determine when arguments of a task are available. An input terminal is programmable. For example, it could perform a reduction operation.Edge
: An output terminal is connected to the input terminal using edges. Multiple edges can connect to an input terminal enabling data to come from multiple sources and an output terminal might connect to multiple successors implying a broadcast operation.TemplateTask
: This wraps a user-defined function with informal signature void f(TaskId, Arg0, Arg1, ..., OutputTerminals). A task is marked for execution when all input arguments are received. To instantiate a TemplateTask, make_tt function is used.
- Step 1 : Include the required header files. For MADNESS runtime, add the below header file.
#include "madness/ttg.h"
For PaRSEC runtime, add the below header file.
#include "parsec/ttg.h"
Import the namespaces required for using the TTG API.
using namespace madness;
using namespace madness::ttg;
using namespace ::ttg;
-
Step 2 : Define a TaskId (Key) which represents a unique identifier for each task and which is hashable.
-
Step 3 : Define a factory that returns a TemplateTask for every function that runs the computation. Below factory function returns a TemplateTask for recursively exploring the wavefronts of the Smith Waterman algorithm. The code adopts several common design motifs of a TTG program. Complete implementation of the algorithm can be found in the examples directory.
template <typename funcT, typename T>
auto make_sw1(const funcT& func, int block_size, const std::string &a, const std::string &b,
int problem_size, Edge<Key, BlockMatrix<T>>& leftedge, Edge<Key, BlockMatrix<T>>& topedge,
Edge<Key, BlockMatrix<T>>& diagedge, Edge<Key, T>& resultedge) {
auto f = [block_size, problem_size, a, b, func](const Key& key, BlockMatrix<T>&& toporleft,
std::tuple<Out<Key, BlockMatrix<T>>, Out<Key, BlockMatrix<T>>,
Out<Key, BlockMatrix<T>>, Out<Key, BlockMatrix<T>>, Out<Key, T>>& out) {
// Getting the block coordinates
auto[i, j] = key;
int next_i = i + 1;
int next_j = j + 1;
int num_blocks = problem_size / block_size;
BlockMatrix<T> X(block_size, block_size);
if (i == 0 && j == 0) {
//No top, left or diagonal blocks
X = sw_iterative(i, j, X, X, X, X, block_size, a, b, problem_size);
}
else if (i == 0) {
//Only left block, single dependency
X = sw_iterative(i, j, X, toporleft, X, X, block_size, a, b, problem_size);
}
else if (j == 0) {
//Only top block, single dependency
X = sw_iterative(i, j, X, X, toporleft, X, block_size, a, b, problem_size);
}
//std::cout << X << std::endl;
if (next_i < num_blocks) {
//std::cout << "left " << next_i << " " << j << std::endl;
if (j == 0) // send top block for next block computation
send<0>(Key(next_i, j), X, out);
else // send top block for next block computation
send<2>(Key(next_i, j), X, out);
}
if (next_j < num_blocks) {
if (i == 0) // send left block for next block computation
send<0>(Key(i, next_j), X, out);
else // // send left block for next block computation
send<1>(Key(i, next_j), X, out);
}
if (next_i < num_blocks && next_j < num_blocks) {
send<3>(Key(next_i, next_j), X, out); //send diagonal block for next block computation
}
if (i == num_blocks - 1 && j == num_blocks - 1)
send<4>(Key(i,j), X(block_size-1, block_size-1), out);
};
Edge<Key, BlockMatrix<T>> recur("recur");
return wrap(f, edges(recur), edges(recur, leftedge, topedge, diagedge, resultedge), "sw1", {"recur"},
{"recur", "leftedge", "topedge", "diagedge", "resultedge"});
}
- Step 4 : Define the edges and verify that the graph is connected in the main program.
ttg_initialize(argc, argv, -1);
Edge<Key, BlockMatrix<int>> leftedge, topedge, diagedge;
Edge<Key, int> resultedge;
auto s = make_sw1(sw_iterative<int>, block_size, a, b, problem_size, leftedge, topedge,
diagedge, resultedge);
auto s1 = make_sw2(sw_iterative<int>, block_size, a, b, problem_size, leftedge, topedge,
diagedge, resultedge);
auto r = make_result(verify, val1, resultedge);
auto connected = make_graph_executable(s.get());
assert(connected);
TTGUNUSED(connected);
std::cout << "Graph is connected.\n";
- Step 5 : Execute the graph.
if (ttg_default_execution_context().rank() == 0)
s->in<0>()->send(Key(0,0), BlockMatrix<int>());
ttg_execute(ttg_default_execution_context());
ttg_fence(ttg_default_execution_context());
- TTG API documentation is available for the following versions:
The task graph can be dumped into a DOT format using the below code in the main program after connecting the graph. GraphViz tools can be used to visualize the task graph.
std::cout << "==== begin dot ====\n";
std::cout << Dot()(s.get()) << std::endl;
std::cout << "==== end dot ====\n";
Below is a TTG graph generated by Smith Waterman algorithm. Each operation/TemplateTask factory is denoted by a rectangle with input terminals on the top and output terminals listed on the bottom part of the rectangle.
The development of TTG was made possible by:
- The EPEXA project, currently supported by the National Science Foundation under grants 1931387 at Stony Brook University, 1931347 at Virginia Tech, and 1931384 at the University of Tennesse, Knoxville.
- The TESSE project, supported by the National Science Foundation under grants 1450344 at Stony Brook University, 1450262 at Virginia Tech, and 1450300 at the University of Tennesse, Knoxville.