Skip to content

RESTful Web Services

Nacho edited this page Aug 27, 2015 · 10 revisions

Understanding the URL

The general structure of a OpenCGA RESTful URL call is:

http://HOST_URL/webservices/rest/{version}/{resource}/{id}/{endpoint}?{options}

Where HOST_URL is as follows http://bioinfodev.hpc.cam.ac.uk/opencga

Sections in braces are web service parameters, so they can be treated as variables.

Quick example:
http://bioinfodev.hpc.cam.ac.uk:80/opencga/webservices/rest/v1/users/opencga/login?password=opencga

As is explained later in this documentation, this RESTful web service will login user opencga.

The available parameters are:

  • version: indicates OpenCGA version to retrieve information from, data models and API may change between versions.
  • resource: specifies the data type of what the user want to retrieve in the id field. This can be one of users, projects, studies, files, jobs, individuals, samples, variables, cohorts or tools, more information here.
  • id: the resource id we want to query.
  • endpoint: these parameters must be specified depending on the nature of your input data. For example, if we want to query CellBase by a genomic region (e.g. 13:32889575-32889763) we should use the genomic category and region subcategory. More information here.
  • options:
Version

Versions are numbered as v1, v2, v3, etc. At this moment the latest stable version is v3. However, the latest stable version will be always coded as latest.

Species

A list of the species available for version v3 can be found here Use the abbreviated code to indicate the species. For example, hsapiens for Homo sapiens or mmusculus for Mus musculus.

Category and subcategory

There are 4 main categories:

Category Description Subcategories
Genomic Genomic category makes reference to genomic coordinates Position, Variant, Region
Feature Feature category involve all elements which have a defined location on the genome and provides an easy way to retrieve cross references for an ID Gene, SNP, Transcript, Protein, Xref, ...
Regulatory Regulatory category refers to all regulatory interactions involving transcription factors and microRNAs TFBSs, miRNAs
Network Network category makes reference to all types of networks and pathways, including the protein interactome, the regulatory network and Reactome Pathway
Id

This is the query parameter, it is the feature or term about we want to retrieve the information (resource). Its type must correspond with the subcategory.

NOTE: In order to improve performance, ID lists can be passed together in only one REST call separated by commas. Only 200 IDs are allowed. For larger queries user the CellBase client.

Examples:

Resources

Each Category and Subcategory can have different resources and actions allowed. They specify the type of result we want to obtain from the ID.

Examples:

NOTE: Resources must always be written in singular.

Filters and extra options

Filters and extra options can be added at the end of the REST query followed by ?. They are optional and can be combined using the & sign. E.g:
bioinfo.hpc.cam.ac.uk/cellbase/webservices/rest/latest/hsapiens/feature/gene/BRCA2/info?of=json&include=id,name,biotype

These are the available filters and options:

Help and metadata

The available main categories are listed at:
bioinfo.hpc.cam.ac.uk/cellbase/webservices/rest/latest/hsapiens

And the available subcategories for a main category are listed at:

Subcategories should specify the type of the id. Subcategories can be different for each category and are described within each of the categories mentioned above.

Each subcategory usage is shown by writing the subcategory name followed by /help, for example:

bioinfo.hpc.cam.ac.uk/cellbase/webservices/rest/latest/hsapiens/feature/gene/help

Swagger

Swagger project has been set up and is available at:

http://bioinfodev.hpc.cam.ac.uk/opencga/webservices/

Clone this wiki locally