Log in to the LSF cluster, where all data processing must take place. Switch to a common EVA production user instead of your personal account. Then adjust and execute the commands below. They will set up the environment, fetch and build the code.
Notes:
- The first five variables are installation-specific and are blanked in this repository. You can get the values for the EVA use case from the private repository.
- By modifying the
*REMOTE
and*BRANCH
variables, you can run arbitrary versions of the pipeline. This is highly useful for development and debugging. By default it fetches the master branch from the main pipeline repository. - Running these commands will overwrite any local changes you had in the repository copy on the cluster, including any changes to the Java ClinVar XML parser. Be sure to commit and push those before re-running this block of commands.
# This variable should point to the directory where the clone of this repository is located on the cluster
export CODE_ROOT=
# Location of Python installation which you configured using build instructions
export PYTHON_INSTALL_PATH=
# Location of bcftools installation path
export BCFTOOLS_INSTALL_PATH=
# The directory where subdirectories for each batch will be created
export BATCH_ROOT_BASE=
# Base path of FTP directory on the cluster
export FTP_PATH_BASE=
# Base bsub command line for all commands.
export BSUB_CMDLINE="bsub"
# Setting up Python paths
export PATH=${PYTHON_INSTALL_PATH}:${PYTHON_INSTALL_PATH}/bin:${BCFTOOLS_INSTALL_PATH}:$PATH
export PYTHONPATH=${PYTHON_INSTALL_PATH}
# External service paths
CLINVAR_PATH_BASE="ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar"
export GIT_REMOTE=origin
export GIT_BRANCH=master
cd ${CODE_ROOT}
git fetch ${GIT_REMOTE}
git checkout ${GIT_BRANCH}
git reset --hard ${GIT_REMOTE}/${GIT_BRANCH}
source env/bin/activate
python3 -m pip -q install -r requirements.txt
python3 setup.py install