Skip to content

Instantly share code, notes, and snippets.

@iracooke
Created June 8, 2016 05:42
Show Gist options
  • Save iracooke/324198c88c159eb270463825f27f4b79 to your computer and use it in GitHub Desktop.
Save iracooke/324198c88c159eb270463825f27f4b79 to your computer and use it in GitHub Desktop.

Nucleotide fasta files sometimes encode ambiguous bases simply with an 'N'.
Many downstream tools support this but don't support the full set of IUPAC ambiguity codes

The unix tool tr can be used to get rid of these.

  tr 'RYSWKMBDHV' 'N' < input.fasta
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment