DESeq2#

Add to module run order:
#BioModule biolockj.module.rnaseq.DESeq2

Description#

Determine statistically significant differences using DESeq2.

Properties#

Properties are the name=value pairs in the configuration file.

DESeq2 properties:#

Property Description
deseq2.designFactors list
A comma-separated list of metadata columns to include as factors in the design forumula used with DESeq2.
default: null
deseq2.designFormula string
The exact string to use as the design the call to DESeqDataSetFromMatrix().
default: null
deseq2.scriptPath file path
An R script to use in place of the default script to call DESeq2.
default: null

General properties applicable to this module:#

Property Description
cluster.batchCommand string
Terminal command used to submit jobs on the cluster
default: null
cluster.jobHeader string
Header written at top of worker scripts
default: null
cluster.modules list
List of cluster modules to load at start of worker scripts
default: null
cluster.prologue string
To run at the start of every script after loading cluster modules (if any)
default: null
cluster.statusCommand string
Terminal command used to check the status of jobs on the cluster
default: null
docker.saveContainerOnExit boolean
If Y, docker run command will NOT include the --rm flag
default: null
docker.verifyImage boolean
In check dependencies, run a test to verify the docker image.
default: null
exe.Rscript executable
Path for the "Rscript" executable; if not supplied, any script that needs the Rscript command will assume it is on the PATH.
default: null
script.defaultHeader string
Store default script header for MAIN script and locally run WORKER scripts.
default: #!/bin/bash
script.numThreads integer
Used to reserve cluster resources and passed to any external application call that accepts a numThreads parameter.
default: 8
script.numWorkers integer
Set number of samples to process per script (if parallel processing)
default: 1
script.permissions string
Used as chmod permission parameter (ex: 774)
default: 770
script.timeout integer
Sets # of minutes before worker scripts times out.
default: null

Details#

version: 0.0.0 The two methods of expresison the design are mutually exclusive.
deseq2.designFormula is used as an exact string to pass as the design argument to DESeqDataSetFromMatrix(); example: ~ Location:SoilType . deseq2.designFactors is a list (such as "fist,second") of one or more metadata columns to use in a formula. Using this method, the formula will take the form: ~ first + second
The following two lines are equivilent:
deseq2.designFormula = ~ treatment + batch
deseq2.designFactors = treatment,batch

Advanced users may want to make more advanced modifications to the call to the DESeq2 functions. The easiest way to do this is to run the module with the default script, and treat that as a working template (ie, see how input/outputs are passed to/from the R script). Modify the script in that first pipeline, and save the modified script to a stable location. Then run the pipeline with deseq2.scriptPath giving the path to the modified script.

Adds modules#

pre-requisite modules
none found
post-requisite modules
none found

Docker#

If running in docker, this module will run in a docker container from this image:

biolockjdevteam/r_deseq2:v1.3.18

This can be modified using the following properties:
DESeq2.imageOwner
DESeq2.imageName
DESeq2.imageTag

Citation#

R Core Team (2019). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
Love, M.I., Huber, W., Anders, S. (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology, 15:550. 10.1186/s13059-014-0550-8

Module developed by Ivory, Ke and Rosh
BioLockJ v1.4.2