Skip to content

Instantly share code, notes, and snippets.

View jaclynsaunders's full-sized avatar

Jaci Saunders jaclynsaunders

View GitHub Profile
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@jaclynsaunders
jaclynsaunders / emails-pull-canvas-zoom-breakouts.ipynb
Created August 19, 2020 22:49
emails-pull-canvas-zoom-breakouts
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
import pandas as pd
import re
INPUT_FILE = "CAT-taxa-out.txt"
OUTPUT_FILE = "formatted_CAT-taxa-out.txt"
ORF_list = []
taxid_list = []
no_hits = []
import pandas as pd
from Bio import SeqIO
#As written, this stores sequences in memory. For very large files will want to
#use a different Biopython parsing method.
FILENAME = "input_file_to_rename.fasta"
RENAME_PATTERN = "Project-name_ORF_"
OUT_FILENAME = "outfile_Project_ORF_renamed"