Skip to content

Instantly share code, notes, and snippets.

@chasemc
Created May 14, 2021 21:27
Show Gist options
  • Save chasemc/30a129488733fcc628a21cf3e7ac5a4e to your computer and use it in GitHub Desktop.
Save chasemc/30a129488733fcc628a21cf3e7ac5a4e to your computer and use it in GitHub Desktop.
Read NCBI assembly info from R
assembly_summary_refseq <- readr::read_delim("https://ftp.ncbi.nih.gov/genomes/ASSEMBLY_REPORTS/assembly_summary_refseq.txt", quote='', skip = 2, delim = "\t", col_names = c("assembly_accession", "bioproject", "biosample", "wgs_master", "refseq_category", "taxid", "species_taxid", "organism_name", "infraspecific_name", "isolate", "version_status", "assembly_level", "release_type", "genome_rep", "seq_rel_date", "asm_name", "submitter", "gbrs_paired_asm", "paired_asm_comp", "ftp_path", "excluded_from_refseq", "relation_to_type_material"))
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment