Skip to content

Usher

Quick Facts

Workflow Type Applicable Kingdom Last Known Changes Command-line Compatibility Workflow Level
Phylogenetic Placement Viral PHB v2.1.0 Yes Sample-level, set-level

Usher_PHB

UShER (Ultrafast Sample Placement on Existing Trees) rapidly places new samples onto an existing phylogeny using maximum parsimony. This workflow uses the UCSC-maintained global trees for SARS-CoV-2, mpox, RSV-A, and RSV-B if those organisms are specified in the organism input field. However, UShER can be used on any organism as long as a mutation-annotated tree (MAT) is provided in protobuf format. Contact us if you need help generating your own mutation-annotated tree, or follow the instructions available on the UShER wiki here.

Inputs

While this workflow is technically a set-level workflow, it works on the sample-level too. When run on the set-level, the samples are placed with respect to each other.

Terra Task Name Variable Type Description Default Value Terra Status
usher_workflow assembly_fasta Array[File] The assembly files for the samples you want to place on the pre-existing; can either be a set of samples, an individual sample, or multiple individual samples Required
usher_workflow organism String What organism to run UShER on; the following organism have default global phylogenies and reference files provided: sars-cov-2, mpox, RSV-A, RSV-B. Required
usher_workflow tree_name String The output prefix for the uncondensed tree output and the clades output. Required
usher cpu Int Number of CPUs to allocate to the task 8 Optional
usher disk_size Int Amount of storage (in GB) to allocate to the task 200 Optional
usher docker String The Docker container to use for the task us-docker.pkg.dev/general-theiagen/pathogengenomics/usher:0.6.2 Optional
usher memory Int Amount of memory/RAM (in GB) to allocate to the task 32 Optional
usher mutation_annotated_tree_pb File Required for organisms other than sars-cov-2, mpox, RSV-A or RSV-B. This is the mutation-annotated global phylogeny upon which your samples will be placed Optional, Required
usher reference_genome File Required for organisms other than sars-cov-2, mpox, RSV-A or RSV-B. This is the reference genome used to determine your sequence's mutations to accurately place the sample on the phylogeny. Optional, Required
usher subtree_size Int Indicates how many of the closest-related samples you want to show in a subtree; more subtrees are made if there is more sequence diversity in the set of input samples (multiple subtrees are only generated if this workflow is run on the set level). 20 Optional
version_capture docker String The Docker container to use for the task "us-docker.pkg.dev/general-theiagen/theiagen/alpine-plus-bash:3.20.0" Optional
version_capture timezone String Set the time zone to get an accurate date of analysis (uses UTC by default) Optional

Outputs

Variable Type Description
usher_clades File The clades predicted for the samples
usher_phb_analysis_date String The date the analysis was run
usher_phb_version String The version of PHB the workflow is from
usher_protobuf_version String The version of the mutation-annotated protobuf tree (what day and what samples are included, if a default organism was used; otherwise, says it was user-provided)
usher_subtree_mutations Array[File] An array of files showing the mutations at each internal node for the subtree
usher_subtrees Array[File] An array of subtrees where your samples have been placed
usher_uncondensed_tree File The entire global tree with your samples included (warning: may be a very large file if the organism is "sars-cov-2")
usher_version String The version of UShER used