-
Notifications
You must be signed in to change notification settings - Fork 1
Run Pegasus Workflow across Multiple HTCondor Pools in Flocking mode
The workflow submitter consists of HTCondor master and submitter that run in the flocking mode. Pegasus is installed on the HTCondor submitter. The workflow submitter flock workflow jobs to other HTCondor pools with available resources.
To install the workflow submitter, follow the instructions on the main page to install the HTCondor master and submitter using Docker. To enable flocking,
- Set
TCP_FORWARDING_HOST
incondor_config.local.submitter
with the public/external IP address of the host - Set
FLOCK_TO
as a comma-separated list of HTCondor pool addresses where the workflow jobs will flock to - Provide proper credentials, certificates and keys for flocking to secured HTCondor pools
- Run
docker_run_htcondor.sh -f
and wait for 60 seconds for the submitter to be ready
The HTCondor pool will offer resources for running flocking jobs. As of writing, it can be installed on a single node with Ubuntu 14.04 using condor_setup/ubuntu_install.sh <flock-from-list>
. The <flock-from-list>
should be a comma-separated list of host addresses where the workflow jobs will flock from.
Note that the default HTCondor deployment defaults to a low security level (CLAIMTOBE
) and requires no authentication for flocking jobs. The default security configuration should only be used for development and test. For more sophisticated security configuration, please refer to this page.
There is an example Pegasus workflow under the home directory of condor_pool
in the HTCondor submitter. The workflow consists of a configurable number of parallel jobs, each of which sleeps for 120 seconds and writes its finishing time and hostname as output. To submit the workflow,
- Log in the HTCondor submitter:
docker exec -it condor-submitter /bin/bash
- Switch to user
condor_pool
and enter the workflow directory:
su - condor_pool
cd ~/example