Ansible playbook for the deployment of a custom CKAN for spatial data management in different environments.
Deployments available for the following OS:
OS | Compatibility | Versions | Remarks |
---|---|---|---|
Arch Linux | 🚧 WIP | - | - |
CentOS | 🚧 WIP | CentOS 8 , CentOS Stream 9 |
- |
Debian | 🚧 WIP | 12 - Bookworm |
- |
RedHat Enterprise Linux | ✅ Yes | RHEL 8 , RHEL 9 |
- |
Ubuntu | 🚧 WIP | 20.04 - Focal Fossa |
- |
And CKAN versions:
CKAN version | Compatibility | Versions | Remarks |
---|---|---|---|
2.8 | ❌ No | - | Incompatible with Python 3 |
2.9 | ✅ Yes | ckan-2.9.11 |
- |
2.10 | ✅ Yes | ckan-2.10.5 |
- |
ansible
which adds a community-curated selection of Ansible Collections.
Warning
Do not install the ansible-core
minimalist version. Install ansible
using pip
or pipx
:
python3 -m ensurepip --upgrade
python3 -m pip install --user ansible
The full ansible
package includes a community-curated selection of Ansible Collections, which helps avoid issues with unavailable ansible-galaxy
modules in the ansible-core
version.
Check the ansible
version and collection list:
ansible --version
ansible-galaxy collection list
After installing Ansible collections using ansible-galaxy, ensure their directories are included in the collections_paths within ansible.cfg
. Follow these steps:
- List Installed Collections: Run the following command to display all installed Ansible collections and their respective paths:
ansible-galaxy collection list
-
Identify Collection Directories: From the output, note the directories where the collections are installed. Each collection entry will display its path.
-
Update
collections_paths
inansible.cfg
: If the collection directories are not already included in thecollections_paths
, add them. Edit theansible.cfg
file to include these paths, separated by colons (:
).
Replace /path/to/your/collection
with the actual paths obtained from the ansible-galaxy collection list
output.
For example:
[defaults]
########################################
# Common destinations
########################################
inventory = ./inventories/development/hosts.ini
hostfile = ./inventories/development/hosts.ini
roles_path = ./roles/common:./roles/ckan:./roles/database:./roles/webserver:./roles/solr:./roles/redis:./roles/supervisor
retry_files_save_path = ./config/tmp/retry/
log_path = ./config/ansible.log
stdout_callback = yaml
### Custom ansible-galaxy collection paths
collections_paths = ~/.ansible/collections:/usr/share/ansible/collections:/lib/python3.9/site-packages/ansible_collections:/path/to/your/collection
-
Clone this repository to your local machine and edit the
ansible.cfg
to use the env what you want, by defaultdevelopment
:git clone https://github.com/mjanez/ckan-ansible.git && cd ckan-ansible vi playbook/ansible.cfg
-
Ensure that
inventory
andhostfile
point to the desired hosts files, e.g., fordevelopment
(default):[defaults] ######################################## # Common destinations ######################################## inventory = ./inventories/development/hosts.ini hostfile = ./inventories/development/hosts.ini roles_path = ./roles/common:./roles/ckan:./roles/database:./roles/webserver:./roles/solr:./roles/redis:./roles/supervisor retry_files_save_path = ./config/tmp/retry/ log_path = ./config/ansible.log stdout_callback = yaml ### Custom ansible-galaxy collection paths collections_paths = ~/.ansible/collections:/usr/share/ansible/collections:/lib/python3.9/site-packages/ansible_collections:/path/to/your/collection
-
Edit the
hosts.ini
and add the target deployment servers IP addresses orhostname
for the specific environment.vi playbook/inventories/development/hosts.ini
[ckan_servers] development_01 ansible_connection=local
Or for local use:
[ckan_servers] development_01 ansible_connection=local
-
Modify the host variables in
playbook/inventories/*/host_vars/*_01.yml
, for instance,development_01.yml
. Check any necessary variables such as database credentials, CKAN versions, and other specific settings.
By default, a local PostgreSQL database is set up. To use an external PostgreSQL database, set use_external_postgres_database
to true
.
### Webserver #########################################
proxy_server_name: localhost
proxy_server_url: http://{{ proxy_server_name }}
proxy_local_services_url: http://localhost
...
### Database service #########################################
postgres_port: 5432
postgres_dir: "/var/lib/pgsql/data"
# Using an external PostgreSQL server or setup a local one
use_external_postgres_database: false
ckan_database: {
postgres_user: "postgres",
postgres_password: "postgres",
ckan_db_user: "ckandbuser",
ckan_db_password: "ckandbpassword",
ckan_db: "ckan_db",
postgres_host: "localhost",
}
...
Caution
The playbook/inventories/*/host_vars/*.yml
file contain customizable configuration variables for deployment. Remember to change before running the Ansible playbook. Specifically the host users/pwds info, and CKAN configuration: ckan_sysadmin_name
, ckan_sysadmin_password
and ckan_sysadmin_email
. Also the proxy_server_name
and nginx_port
for correct deployment.
Important
Also if using a SSH password authentication for private repos create a SSH key pair and copy the keys to the ./playbook/roles/common/files/keys
. The filenames of the keypair files must begin with id_ (e.g. id_rsa
+ id_rsa.pub
)
-
Last, run the Ansible playbook to deploy CKAN on the target server. The following command will deploy CKAN on the target server using the playbook configuration. The
-vvv
flag is used for verbose output:# Location of the ansible.cfg file based on the clone directory export ANSIBLE_CONFIG=$(pwd)/playbook/ansible.cfg # Location if ckan-ansible is cloned in the home directory export ANSIBLE_CONFIG=$HOME/ckan-ansible/playbook/ansible.cfg # Run the ansible playbook, Verbose with -vvv ansible-playbook $HOME/ckan-ansible/playbook/playbook.yml
The
ANSIBLE_CONFIG
environment variable is used to specify the location of theansible.cfg
file. This is useful when you have multiple Ansible configurations and you want to specify which one to use, eg.rhel-9
,ubuntu-20.04
, etc. -
You can manage the deployment of CKAN using two primary commands:
deploy
andupdate
. More info at deployment options
Tip
Use deploy
when:
- You are installing CKAN for the first time.
- You need to perform a complete reinstallation, including resetting databases.
Use update
when:
- You want to update CKAN's codebase, extensions, and associated services without affecting the existing databases.
- You need to apply patches, upgrades, or new features to your current CKAN deployment.
Once you have Vagrant, VirtualBox installed, run the following commands under your project directory:
- Start the VM:
# Change to the project directory
cd ckan-ansible
# Change to a specific OS directory
cd vagrant/rhel/rhel-9
# Edit the host_vars file: ansible_host, ansible_port, ansible_user, ansible_ssh_pass with the Vagrantfile values and .env file as needed
vi playbook/inventories/development/host_vars/development_01.yml
vi .env
# Start the virtual machine, vagrant copy the playbook to the VM
vagrant up
# Launch ansible playbook
vagrant ssh
- In the virtual machine, run the following commands to deploy CKAN with Ansible:
# ckan-ansible has been cloned into the home directory
export ANSIBLE_CONFIG=$HOME/ckan-ansible/playbook/ansible.cfg
# Verbose with -vvv
ansible-playbook $HOME/ckan-ansible/playbook/playbook.yml
Tip
ckan-ansible/*/*
are rsynced to the VM at /home/vagrant/ckan-ansible/*/*
, you can edit the playbook in your local machine and run the ansible-playbook command in the VM using VSCode.
- Once the playbook has finished, you can access CKAN at
http://192.168.56.20
from your local machine.
Once you have Docker installed, run the following commands under your project directory:
# Edit the .env file with the docker-compose values
vi .env
# Up the docker compose services
docker-compose up -d
# Test the ssh connection
ssh -p 2222 ckan@localhost
Could you https://developers.redhat.com/rhel8/install-rhel8-vbox#overview
RHEL info:
You can manage the deployment of CKAN using two primary commands: deploy
and update
. Each serves a distinct purpose depending on your current setup and desired outcome.
The deploy
command performs a fresh deployment of CKAN. This process involves:
- Setting up the environment from scratch: Configuring all necessary components and dependencies required for CKAN to run.
- Initializing databases: This can suppress or reset the CKAN and Datastore databases, effectively clearing existing data. Use this option when:
- Setting up CKAN for the first time.
- Performing a complete teardown and reinstall of your CKAN environment.
Caution: Running deploy
will erase existing data in the CKAN and Datastore databases. Ensure you have backups if you need to preserve existing data.
The update
command is designed to upgrade an existing CKAN installation without affecting the databases. This process includes:
- Updating the CKAN codebase: Pulling the latest changes and applying updates to the CKAN application.
- Refreshing extensions: Updating any CKAN extensions to their latest versions.
- Updating
ckan-pycsw
and workers/supervisors: Ensuring that auxiliary services are up-to-date. - Restarting web services: Applying the changes by restarting the necessary web services, ensuring minimal downtime.
By default, update
is the preferred method when you want to apply updates or changes to your CKAN instance without disrupting your existing data.
# To obtain info of the SSH connection
vagrant ssh-config
# To stop the virtual machine
vagrant halt
# To suspend the virtual machine
vagrant suspend
# Terminate the virtual machine
vagrant destroy --force
You can use the vagrant ssh-config
command to get the SSH configuration for your Vagrant machine, which can be simpler. Here's how:
-
Run
vagrant ssh-config
in the terminal in your Vagrant project directory. This will print the SSH configuration for your Vagrant machine. -
Copy the output of
vagrant ssh-config
into your SSH configuration file (~/.ssh/config
). -
In VS Code, open the command palette and run the
"Remote-SSH: Connect to Host..."
command. Choose your Vagrant machine from the list of hosts.
VS Code will connect to your Vagrant machine and you will be able to edit files directly on the Vagrant machine using VS Code.
The base configuration uses an NGINX image as the front-end (ie: reverse proxy). It includes HTTPS running on port number 8443. A "self-signed" SSL certificate is generated. The ENV proxy_server_name
, NGINX server_name
directive and the CN
field in the SSL certificate have been both set to 'localhost'. This should obviously not be used for production.
Creating the SSL cert and key files as follows:
openssl req -new -newkey rsa:4096 -days 365 -nodes -x509 -subj "/C=DE/ST=Berlin/L=Berlin/O=None/CN=localhost" -keyout ckan-local.key -out ckan-local.crt
The ckan-local.*
files will then need to be moved into the ckan-ansible/playbook/roles/webserver/tasks/files
directory
playbook/
├── ansible.cfg
├── playbook.yml
├── inventories/
│ ├── group_vars/
│ │ ├── all.yml
│ │ ├── production_01.yml
│ ├── os_vars/
│ │ ├── archlinux.yml
│ │ ├── centos-8.yml
│ │ ├── centos-9.yml
│ │ ├── debian-12.yml
│ │ ├── redhat-8.yml
│ │ ├── redhat-9.yml
│ ├── production/
│ │ ├── hosts.ini
│ │ └── host_vars/
│ │ ├── production_01.yml
│ ├── development/
│ ├── hosts.ini
│ └── host_vars/
│ └── development_01.yml
└── roles/
├── ckan/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
├── ckan_pycsw/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
├── common/
│ ├── files/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
├── database/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
├── redis/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
├── solr/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
├── supervisor/
│ ├── tasks/
│ │ └── main.yml
│ └── ...
└── webserver/
├── tasks/
│ └── main.yml
└── ...
This directory structure organizes the ckan-ansible
project. Here's an explanation:
ansible.cfg
: Ansible configuration file. This file sets global configurations for Ansible, such as the inventory file location, roles path, logging, and other settings. It ensures that Ansible knows where to find the necessary files and how to execute the playbooks.playbook.yml
: Ansible playbook for deploying CKAN. This file contains a series of tasks and roles that define the steps required to set up and configure CKAN and its dependencies. It orchestrates the deployment process by specifying which roles to apply and in what order.inventories/
: Inventory files for different environments (e.g., development, staging, production, etc.). These files list the servers (hosts) that Ansible will manage. Each environment has its own directory containing the relevant inventory files.group_vars/
: Group-specific variables. These files define variables that apply to groups of hosts, allowing for centralized configuration management.os_vars/
: OS-specific variables. These files define variables specific to different operating systems, ensuring that tasks are executed correctly on different OS types.production/
: Production environment inventory.hosts.ini
: Hosts file for production.host_vars/
: Host-specific variables.
development/
: Development environment inventory.hosts.ini
: Hosts file for development.host_vars/
: Host-specific variables.
roles/
: Ansible roles for different components.ckan/
: CKAN installation and configuration.ckan_pycsw/
: ckan-pycsw CSW/INSPIRE endpoint.common/
: Common tasks shared across different components. Also contains thefiles/keys/
directory for copying SSH keys to the target server as needed.database/
: PostgreSQL databases.redis/
: Redis.solr/
: Solr.supervisor/
: Supervisor configuration.webserver/
: Web server (NGINX).
-
Configuration (
ansible.cfg
):- Sets the default inventory file location (
inventories/development/hosts.ini
orinventories/production/hosts.ini
). - Defines the roles path, so Ansible knows where to find the roles.
- Configures logging and other settings to control Ansible's behavior.
- Sets the default inventory file location (
-
Inventory (
hosts.ini
):- Lists the servers that Ansible will manage.
- Specifies connection details for each server (e.g., IP address, SSH user, port).
- Can be environment-specific (e.g.,
development
,staging
,production
).
-
Playbook (
playbook.yml
):- Defines the sequence of tasks and roles to be executed on the servers listed in the inventory.
- Uses the roles defined in the
roles/
directory to perform specific tasks (e.g., installing CKAN, configuring the database).
-
Variables (
group_vars/
,host_vars/
,os_vars/
):- Provide configuration details that can be applied to groups of hosts, individual hosts, or specific operating systems.
- Ensure that tasks are executed with the correct parameters and settings.
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the GNU AFFERO GENERAL PUBLIC LICENSE - see the LICENSE file for details.