Definitions
  • Bank: Any file or set of files located on a remote server
  • Process: All transformations applied
    upstream or downstream of the bank download
    (scripts)

BioMAJ needs to know three parameters to get a new bank:

  • configuration parameters with the global.properties file
  • which database to update with the bank.properties files
  • which processes (alignment, indexation …) will be applied to the database with the process files

Workflow

The BioMAJ workflows takes place as follows, the pre-processes are the first to be applied (get files from a private bank, checking disk space, etc.). Then the workflow checks the release information, if there is already the same release in the database, the workflow stops. If there is a new release, the bank files are downloaded and uncompressed in the offline-dir directory. It is possible to choose which files to keep in the in the bank’s final directory. The workflow applies post-processes, and delete the old version of the bank (if necessary). At the end of the workflow the bank will be published, which creates a current symlink.

See how the different architectures work here.