
Fetch SRR Accession Workflow

Quick Facts

| Workflow Type | Applicable Kingdom | Last Known Changes | Command-line Compatibility | Workflow Level |
|---|---|---|---|---|
| Data Import | Any Taxa | PHB v2.3.0 | Yes | Sample-level |

Fetch SRR Accession

This workflow retrieves the Sequence Read Archive (SRA) run accession (SRR) associated with a given sample accession. The primary inputs are BioSample IDs (e.g., SAMN00000000) or SRA Experiment IDs (e.g., SRX000000), which link to sequencing data in the SRA repository.
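To make the accepted input types concrete, the following minimal sketch classifies an accession by its prefix. It is an illustration only, not part of the workflow: the helper name `classify_accession` is hypothetical, and the patterns assume the standard INSDC prefix conventions (SAMN/SAME/SAMD for BioSamples, SRX/ERX/DRX for experiments, SRR/ERR/DRR for runs).

```python
import re

# Assumed INSDC accession patterns (NCBI, EBI, and DDBJ prefixes).
ACCESSION_PATTERNS = {
    "biosample": re.compile(r"^SAM[EDN][A-Z]?\d+$"),
    "experiment": re.compile(r"^[EDS]RX\d{6,}$"),
    "run": re.compile(r"^[EDS]RR\d{6,}$"),
}


def classify_accession(accession: str) -> str:
    """Return 'biosample', 'experiment', 'run', or 'unknown' (hypothetical helper)."""
    cleaned = accession.strip().upper()
    for kind, pattern in ACCESSION_PATTERNS.items():
        if pattern.match(cleaned):
            return kind
    return "unknown"


print(classify_accession("SAMN00000000"))  # biosample
print(classify_accession("SRX000000"))     # experiment
```

A check like this could be run before submitting a sample table, so that malformed accessions are caught before the workflow starts.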

The workflow uses the fastq-dl tool to fetch metadata from SRA, then parses that metadata to extract and output the associated SRR accession.
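The parsing step can be sketched as follows. This is a hedged illustration rather than the workflow's actual implementation: it assumes the metadata arrives as tab-separated text with a header row and that the run accession lives in a column named `run_accession`. Both the function name `extract_srr_accessions` and the column name are assumptions, so adjust them to match the metadata actually produced.

```python
import csv
import io


def extract_srr_accessions(tsv_text: str, column: str = "run_accession") -> str:
    """Return the unique run accessions in a metadata TSV, comma-separated.

    The default column name is an assumption about the metadata layout.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise KeyError(f"column {column!r} not found in metadata header")
    seen = []
    for row in reader:
        value = (row.get(column) or "").strip()
        # Keep first-seen order and skip blanks and duplicates.
        if value and value not in seen:
            seen.append(value)
    return ",".join(seen)


# Made-up example metadata: one experiment linked to two runs.
example = (
    "run_accession\texperiment_accession\n"
    "SRR0000001\tSRX000000\n"
    "SRR0000002\tSRX000000\n"
)
print(extract_srr_accessions(example))  # SRR0000001,SRR0000002
```

Joining multiple runs into one comma-separated string keeps the result a single `String` value, which matches the type of the `srr_accession` output.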

Inputs

| Terra Task Name | Variable | Type | Description | Default Value | Terra Status |
|---|---|---|---|---|---|
| fetch_srr_metadata | sample_accession | String | SRA-compatible accession, such as a BioSample ID (e.g., "SAMN00000000") or SRA Experiment ID (e.g., "SRX000000"), used to retrieve SRR metadata. | | Required |
| fetch_srr_metadata | cpu | Int | Number of CPUs allocated for the task. | 2 | Optional |
| fetch_srr_metadata | disk_size | Int | Disk space in GB allocated for the task. | 10 | Optional |
| fetch_srr_metadata | docker | String | Docker image for metadata retrieval. | us-docker.pkg.dev/general-theiagen/biocontainers/fastq-dl:2.0.4--pyhdfd78af_0 | Optional |
| fetch_srr_metadata | memory | Int | Memory in GB allocated for the task. | 8 | Optional |
| version_capture | docker | String | The Docker container to use for the task. | us-docker.pkg.dev/general-theiagen/theiagen/alpine-plus-bash:3.20.0 | Optional |
| version_capture | timezone | String | Set the time zone to get an accurate date of analysis (uses UTC by default). | | Optional |

Workflow Tasks

This workflow has a single task that performs metadata retrieval for the specified sample accession.

fastq-dl: Fetches SRR metadata for sample accession

When provided a BioSample accession or SRA experiment ID, 'fastq-dl' collects metadata and returns the appropriate SRR accession.

fastq-dl Technical Details

| | Links |
|---|---|
| Task | Task on GitHub |
| Software Source Code | fastq-dl Source |
| Software Documentation | fastq-dl Documentation |
| Original Publication | fastq-dl: A fast and reliable tool for downloading SRA metadata |

Outputs

| Variable | Type | Description |
|---|---|---|
| srr_accession | String | The SRR accession(s) associated with the input sample accession. |
| fetch_srr_accession_version | String | The version of the fetch_srr_accession workflow. |
| fetch_srr_accession_analysis_date | String | The date the fetch_srr_accession analysis was run. |

References

Valieris, R. et al., "fastq-dl: A fast and reliable tool for downloading SRA metadata." Bioinformatics, 2021.