| Package | Description | 
|---|---|
| biolockj.module.diversity | |
| biolockj.module.implicit | The modules in this package are implicitly added to pipelines as needed. These modules cannot be directly added to any pipeline unless overridden via pipeline.disableImplicitModules=Y. | 
| biolockj.module.implicit.parser | This package contains Parser BioModules in the r16s and WGS sub-packages that are paired with a
  ClassifierModuleviaBioModule.getPostRequisiteModules()to run immediately after the
 classifier. | 
| biolockj.module.implicit.parser.r16s | This package contains Parser  BioModules that convert the 16S taxonomy reports generated by
 16S classifiers (such as RDP and QIIME) into standardized OTU abundance tables. | 
| biolockj.module.implicit.parser.wgs | This package contains Parser  BioModules that convert the WGS taxonomy reports generated by
 WGS classifiers (such as Kraken, Kraken2 Metaphlan2, or Humann2) into standardized OTU abundance tables. | 
| biolockj.module.report | This package contains  BioModules that normalize OTU abundance tables output by Parser
 modules, merges them with the metadata, and generates various reports and notifications. | 
| biolockj.module.report.humann2 | |
| biolockj.module.report.otu | |
| biolockj.module.report.taxa | |
| biolockj.module.seq | BioModules used to prepare sequence files or update the metadata prior to classification. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ShannonDiversity | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | DemultiplexerThis BioModule splits multiplexed data into a separate file or pair of files (for paired reads) for each sample. | 
| class  | RegisterNumReadsThis BioModule parses sequence file to count the number of reads per sample. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | ParserModuleImplParser  BioModules readClassifierModuleoutput to build
 standardized OTU count tables. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | QiimeParserThis BioModules parses QiimeClassifier output reports to build standard OTU abundance tables. | 
| class  | RdpHierParser | 
| class  | RdpParserThis BioModule parses RDP output files to build standard OTU abundance tables. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | Humann2ParserThis BioModule parses Humann2Classifier output reports to build standard OTU abundance tables. Samples IDs are found in the column headers starting with the 2nd column. The count type depends on the HumanN2 config properties. | 
| class  | Kraken2ParserThis BioModules parses KrakenClassifier output reports to build standard OTU abundance tables. | 
| class  | KrakenParserThis BioModules parses KrakenClassifier output reports to build standard OTU abundance tables. | 
| class  | Metaphlan2ParserThis BioModules parses Metaphlan2Classifier output reports to build standard OTU abundance tables. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | JsonReportThis BioModule is used to build a JSON file (summary.json) compiled from all OTUs in the dataset. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AddMetadataToPathwayTablesThis BioModule is used to add metadata columns to the HumanN2 pathway abundance, pathway coverage, and gene family
 tables. | 
| class  | Humann2CountModuleThis abstract superclass is extended by all other modules in this package. Shared method implementations are defined to ensure uniform adoption of dependencies and prerequisites. | 
| class  | RemoveLowPathwayCountsThis BioModule set low Pathway counts below a configured threshold to zero. These low sample counts are assumed to be miscategorized or genomic contamination. | 
| class  | RemoveScarcePathwayCountsThis BioModule removes scarce pathways not found in enough samples. Each pathway must be found in a configurable percentage of samples to be retained. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | CompileOtuCountsThis BioModule compiles the counts from all OTU count files into a single summary OTU count file containing OTU
 counts for the entire dataset. | 
| class  | OtuCountModuleOtuCount modules reads OTU count assignment tables (1 file/sample) with 2 columns. Col1: Full OTU pathway spanning top to bottom level Col2: Count (# of reads) for the sample. | 
| class  | RarefyOtuCountsThis BioModule applies a mean iterative post-OTU classification rarefication algorithm so that each output sample
 will have approximately the same number of OTUs. | 
| class  | RemoveLowOtuCountsThis BioModule set low OTU counts below a configured threshold to zero. These low sample counts are assumed to be miscategorized or contaminents. | 
| class  | RemoveScarceOtuCountsThis BioModule removes scarce OTUs not found in enough samples. The OTU must be found in a configurable percentage of samples. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | AddMetadataToTaxaTablesThis BioModule is used to add metadata columns to the OTU abundance tables. | 
| class  | AddPseudoCount | 
| class  | BuildTaxaTablesMany R BioModules expect separate tables containing log-normalized taxa counts for each taxonomy level. | 
| class  | LogTransformTaxaTablesThis utility is used to log-transform the raw OTU counts on Log10 or Log-e scales. | 
| class  | NormalizeByReadsPerMillionNormalize each sample for sequencing depth by reporting each value as the number 
 of counts per million counts in a given sample. | 
| class  | NormalizeTaxaTablesThis utility is used to normalize and/or log-transform the raw OTU counts using the formulas:
 
 Normalized OTU count formula = (RC/n)*((SUM(x))/N)+1
 Relative abundance formula = Log(log_base) [ (RC/n)*((SUM(x))/N)+1 ]
 
 The code implementation supports (log_base = e) and (log_base = 10) which is configured via
  Constants.REPORT_LOG_BASEproperty. | 
| class  | TaxaCountModuleTBD | 
| class  | TransformTaxaTablesThis utility is used to normalize, log-transform or othewise transfrom data that is stored as taxa counts. | 
| Modifier and Type | Class and Description | 
|---|---|
| class  | MultiplexerThis BioModule will merge sequence files into a single combined sequence file, with either the sample ID or an
 identifying bar-code (if defined in the metatata) is stored in the sequence header. BioLockJ is designed to run on demultiplexed data so this must be the last module to run in its branch. | 
| class  | RarefySeqsThis BioModule imposes a minimum and/or maximum number of reads per sample. | 
| class  | SeqFileValidatorThis BioModule validates fasta/fastq file formats are valid and enforces min/max read lengths. | 
| class  | TrimPrimersThis BioModule removes sequence primers from demultiplexed files. The primers are defined using regular expressions in a separate file. |