Architecture

javild edited this page Aug 24, 2016 · 3 revisions

CellBase is implemented in a modular way, with a three-layer architecture that supports different access levels and use cases:

  1. MongoDB databases (in the current implementation). An independent MongoDB database is created for each species and assembly. Collection schemas and indexes were designed to answer the main expected queries efficiently (see https://github.com/opencb/cellbase/wiki/MongoDB-implementation for a detailed specification of the current MongoDB schema).

  2. A Java abstraction layer that deals with the peculiarities of the underlying database system and provides an API to access the data. Advanced users may use this API directly to avoid network access and web service overload, thus improving the performance of their implementations. Moreover, any advanced user who wants to use a different database system only has to implement this API as a plugin for that system.

  3. A set of REST web services that provide an efficient, high-level interface to the data regardless of programming language. Most access is expected to go through this top layer. The web services API can be queried either with the provided clients in Python, R and Java, or with user-developed code that calls the web services by building the appropriate URLs. A comprehensive RESTful API specification is provided through Swagger: http://bioinfo.hpc.cam.ac.uk/cellbase/webservices/
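
The layering described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not CellBase code: `GeneAdaptor`, `InMemoryGeneAdaptor` and `build_url` are hypothetical names, the real abstraction layer is written in Java, and the URL path layout (`rest/<version>/<species>/<category>/<subcategory>/<ids>/<resource>`) and the `v4` default are assumptions that should be checked against the Swagger specification. Only the host comes from the link above.

```python
from abc import ABC, abstractmethod
from urllib.parse import quote, urlencode

# Host taken from the Swagger link above; everything appended to it is an assumed layout.
BASE_URL = "http://bioinfo.hpc.cam.ac.uk/cellbase/webservices"


class GeneAdaptor(ABC):
    """Hypothetical storage-agnostic interface, standing in for layer 2."""

    @abstractmethod
    def get_gene(self, species: str, assembly: str, gene_id: str) -> dict:
        """Return the record for one gene from whatever backend is plugged in."""


class InMemoryGeneAdaptor(GeneAdaptor):
    """Toy backend plugin, standing in for a real database (layer 1)."""

    def __init__(self, records: dict):
        # One store per (species, assembly) pair, mirroring the
        # one-database-per-species-and-assembly design.
        self._records = records

    def get_gene(self, species: str, assembly: str, gene_id: str) -> dict:
        return self._records[(species, assembly)][gene_id]


def build_url(species, category, subcategory, ids, resource, version="v4", **params):
    """Assemble a web service URL (layer 3) under the assumed path layout."""
    # Percent-encode each ID, then join multiple IDs with commas.
    id_part = ",".join(quote(str(i), safe="") for i in ids)
    path = "/".join(
        [BASE_URL, "rest", version, species, category, subcategory, id_part, resource]
    )
    # Optional query parameters are appended as a standard query string.
    return path + ("?" + urlencode(params) if params else "")
```

Code written against `GeneAdaptor` does not need to know which backend is behind it, which is the point of the second layer. A client in any language only needs the equivalent of `build_url`, for example `build_url("hsapiens", "feature", "gene", ["BRCA2"], "info", assembly="GRCh37")`, where the species, category and resource values are again illustrative.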