Skip to content

Latest commit

 

History

History
47 lines (32 loc) · 916 Bytes

README.md

File metadata and controls

47 lines (32 loc) · 916 Bytes

Corpus Tools Docker Image

Docker Images for the IMS Open Corpus Workbench and UCS Toolkit.

CWB

The IMS Open Corpus Workbench (CWB) is a collection of open-source tools for managing and querying large text corpora.

http://cwb.sourceforge.net/

UCS

The UCS toolkit is a collection of libraries and scripts for the statistical analysis of cooccurrence data.

http://www.collocations.de/software.html

Building the Images

How to build the images:

# Corpus Workbench
# make cwb
docker build -t cwb -f cwb/Dockerfile .

# UCS toolkit
# make ucs
docker build -t ucs -f ucs/Dockerfile .

# Corpus Workbench and CUCS toolkit
# make cwb-ucs
docker build -t cwb-ucs -f cwb-ucs/Dockerfile .

Using the Images

To run the containers:

docker run -ti cwb
docker run -ti ucs
docker run -ti cwb-ucs

Mounting a volume for persistent storage:

docker run -ti cwb -v /home/yourname/data:/data