Avril Coghlan avrilcoghlan

@avrilcoghlan
avrilcoghlan / treefam_QC5.pl
Created March 1, 2013 14:46
Perl script that finds cases where a full gene set for a species was loaded into the TreeFam mysql database, but no genes from that species were added to families.
#!/usr/local/bin/perl
#
# Perl script treefam_QC5.pl
# Written by Avril Coghlan ([email protected])
# 4-Feb-09.
# Updated 26-June-09 to work with TreeFam release 8.
#
# This perl script finds cases where a full gene set for a species was
# loaded into the TreeFam mysql database, but no genes from that species
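The check described for treefam_QC5.pl can be sketched without a database. The example below is a minimal illustration, not the script's actual code: the hash `%species_of`, the array `@in_family`, the transcript IDs, and the function name are all hypothetical stand-ins for rows that the real script presumably reads from the TreeFam mysql database.

```perl
use strict;
use warnings;

# Hypothetical in-memory stand-ins for database rows (not the real TreeFam schema).
# Transcript ID => species, for every gene that was loaded.
my %species_of = (
    'T1' => 'HUMAN',
    'T2' => 'HUMAN',
    'T3' => 'CAEEL',
    'T4' => 'DROME',
);
# Transcript IDs that were placed in some family.
my @in_family = ('T1', 'T3');

# Return the sorted list of species that have loaded genes but no gene in any family.
sub species_without_family_genes {
    my ($species_of, $in_family) = @_;
    my %has_family_gene;
    for my $id (@$in_family) {
        next unless exists $species_of->{$id};
        $has_family_gene{ $species_of->{$id} } = 1;
    }
    my %seen;
    my @missing = grep { !$has_family_gene{$_} && !$seen{$_}++ } values %$species_of;
    my @sorted = sort @missing;
    return @sorted;
}

print "$_\n" for species_without_family_genes(\%species_of, \@in_family);
```

With the sample data, only DROME has loaded genes and no family members, so it is the one species reported.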
@avrilcoghlan
avrilcoghlan / treefam_QC6.pl
Created March 1, 2013 14:47
Perl script that finds cases where more than one alternative splice form from the same gene was added to a family
#!/usr/local/bin/perl
#
# Perl script treefam_QC6.pl
# Written by Avril Coghlan ([email protected])
# 4-Feb-09.
#
# This perl script finds cases where more than one alternative splice form
# from the same gene was added to a family.
#
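As a rough sketch of the treefam_QC6.pl check, the example below groups transcripts by family and gene and reports any gene that contributes more than one transcript to the same family. The row layout, the IDs, and the function name `duplicated_splice_forms` are assumptions made for illustration; the real script's queries and table columns are not shown in this listing.

```perl
use strict;
use warnings;

# Hypothetical rows: [family ID, gene ID, transcript ID].
my @rows = (
    ['TF101001', 'geneA', 'geneA.1'],
    ['TF101001', 'geneA', 'geneA.2'],
    ['TF101001', 'geneB', 'geneB.1'],
    ['TF101002', 'geneC', 'geneC.1'],
);

# Return one tab-separated report line (family, gene, transcripts) for each
# gene that has more than one distinct transcript in the same family.
sub duplicated_splice_forms {
    my (@rows) = @_;
    my %transcripts;    # "family\tgene" => { transcript => 1 }
    for my $row (@rows) {
        my ($family, $gene, $transcript) = @$row;
        $transcripts{"$family\t$gene"}{$transcript} = 1;
    }
    my @report;
    for my $key (sort keys %transcripts) {
        my @ids = sort keys %{ $transcripts{$key} };
        push @report, "$key\t" . join(',', @ids) if @ids > 1;
    }
    return @report;
}

print "$_\n" for duplicated_splice_forms(@rows);
```

Keying on the family and gene together means a gene whose splice forms sit in different families is not flagged here; that situation is the subject of treefam_QC11.pl.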
@avrilcoghlan
avrilcoghlan / treefam_QC7.pl
Created March 1, 2013 14:49
Perl script that finds cases where different alternative splice forms of the same gene do not have unique transcript ids in the 'genes' table of the TreeFam mysql database
#!/usr/local/bin/perl
#
# Perl script treefam_QC7.pl
# Written by Avril Coghlan ([email protected])
# 4-Feb-09.
#
# This perl script finds cases where different alternative splice forms of
# the same gene do not have unique transcript ids in the 'genes' table.
#
@avrilcoghlan
avrilcoghlan / treefam_QC8.pl
Created March 1, 2013 14:51
Perl script that finds cases where a transcript listed in the 'genes' table of the TreeFam mysql database lacks any amino acid sequence in the 'aa_seq' table, or lacks a DNA sequence in the 'nt_seq' table.
#!/usr/local/bin/perl
#
# Perl script treefam_QC8.pl
# Written by Avril Coghlan ([email protected])
# 4-Feb-09.
#
# This perl script finds cases where a transcript listed in the 'genes' table
# is missing an amino acid sequence in the 'aa_seq' table or missing a
# DNA sequence in the 'nt_seq' table.
@avrilcoghlan
avrilcoghlan / treefam_QC9.pl
Created March 1, 2013 14:52
Perl script that finds TreeFam transcripts that appear in the 'fam_genes' table of the TreeFam mysql database, but do not appear in the 'genes' table.
#!/usr/local/bin/perl
#
# Perl script treefam_QC9.pl
# Written by Avril Coghlan ([email protected])
# 3-Feb-09.
#
# This perl script finds TreeFam transcripts that appear in the fam_genes
# table, but do not appear in the 'genes' table.
#
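The treefam_QC9.pl check amounts to a set difference between the IDs in 'fam_genes' and the IDs in 'genes'. The sketch below shows that idea on plain arrays; the array names, sample IDs, and the function `missing_from_genes` are hypothetical, and the real script presumably obtains the IDs from the database instead.

```perl
use strict;
use warnings;

# Hypothetical transcript IDs standing in for the two tables' contents.
my @fam_genes_ids = ('T1', 'T2', 'T5', 'T5');
my @genes_ids     = ('T1', 'T2', 'T3');

# Return the distinct IDs present in the first list but absent from the second,
# in order of first appearance.
sub missing_from_genes {
    my ($fam_genes_ids, $genes_ids) = @_;
    my %in_genes = map { $_ => 1 } @$genes_ids;
    my %seen;
    my @missing = grep { !$in_genes{$_} && !$seen{$_}++ } @$fam_genes_ids;
    return @missing;
}

print "$_\n" for missing_from_genes(\@fam_genes_ids, \@genes_ids);
```

A hash lookup keeps the comparison linear in the number of IDs, which matters when both lists are large.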
@avrilcoghlan
avrilcoghlan / treefam_QC10.pl
Created March 1, 2013 14:54
Perl script that finds TreeFam proteins that were added to a particular family, but actually have a stronger hmmer match to a different family.
#!/usr/local/bin/perl
#
# Perl script treefam_QC10.pl
# Written by Avril Coghlan ([email protected])
# 4-Feb-09.
#
# This perl script finds TreeFam proteins that were added to a particular
# family, but actually have a stronger hmmer match to a different family.
#
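One way to picture the treefam_QC10.pl check is sketched below. It assumes that "stronger" means a strictly higher hmmer score, and that scores are already available per protein and family; both are assumptions, as are the data layout, the sample values, and the function name `better_matches`. How the real script obtains and compares hmmer results is not shown in this listing.

```perl
use strict;
use warnings;

# Hypothetical data: protein => assigned family, and protein => { family => score }.
my %assigned = ('P1' => 'TF101001', 'P2' => 'TF101002');
my %scores   = (
    'P1' => { 'TF101001' => 250.0, 'TF101003' => 90.5 },
    'P2' => { 'TF101002' => 40.0,  'TF101004' => 120.0 },
);

# Return [protein, assigned family, best family] for each protein whose
# best-scoring family differs from, and strictly outscores, its assigned family.
sub better_matches {
    my ($assigned, $scores) = @_;
    my @report;
    for my $protein (sort keys %$assigned) {
        my $family    = $assigned->{$protein};
        my $by_family = $scores->{$protein} or next;
        my ($best) = sort { $by_family->{$b} <=> $by_family->{$a} || $a cmp $b }
                     keys %$by_family;
        next if !defined $best || $best eq $family;
        my $own = $by_family->{$family};
        next if defined $own && $own >= $by_family->{$best};
        push @report, [$protein, $family, $best];
    }
    return @report;
}

print join("\t", @$_), "\n" for better_matches(\%assigned, \%scores);
```

In the sample data, P2 is assigned to TF101002 but scores higher against TF101004, so it is the only protein reported.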
@avrilcoghlan
avrilcoghlan / treefam_QC11.pl
Created March 1, 2013 14:56
Perl script that finds cases where different alternative splices of the same gene were put into different families, but those alternative splice forms overlap a lot at the DNA level
#!/usr/local/bin/perl
#
# Perl script treefam_QC11.pl
# Written by Avril Coghlan ([email protected])
# 5-Feb-09.
#
# This perl script finds cases where different alternative splices of the
# same gene were put into different families, but those alternative splice
# forms overlap a lot at the DNA level.
@avrilcoghlan
avrilcoghlan / treefam_QC12.pl
Created March 1, 2013 14:58
Perl script that checks for cases where a TreeFam family seems to have disappeared from a particular version of TreeFam, even though it was present in the previous version of TreeFam and has not been curated since.
#!/usr/local/bin/perl
#
# Perl script treefam_QC12.pl
# Written by Avril Coghlan ([email protected])
# 6-Feb-09.
#
# This perl script checks for cases where a TreeFam family seems to have
# disappeared from a particular version of TreeFam, even though it was
# present in the previous version of TreeFam and has not been curated since.
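The logic described for treefam_QC12.pl can be sketched as a three-way set comparison: families in the previous release, minus those in the current release, minus those curated in between. The lists, the family IDs, and the function name `disappeared_families` below are hypothetical; how the real script determines which families have been curated is not shown in this listing.

```perl
use strict;
use warnings;

# Hypothetical family ID lists for two releases, plus families curated in between.
my @previous = ('TF101001', 'TF101002', 'TF101003');
my @current  = ('TF101001');
my @curated  = ('TF101003');

# Return the sorted family IDs that were in the previous release, are absent
# from the current one, and were not curated since.
sub disappeared_families {
    my ($previous, $current, $curated) = @_;
    my %in_current  = map { $_ => 1 } @$current;
    my %was_curated = map { $_ => 1 } @$curated;
    my @gone = grep { !$in_current{$_} && !$was_curated{$_} } sort @$previous;
    return @gone;
}

print "$_\n" for disappeared_families(\@previous, \@current, \@curated);
```

Here TF101003 is also missing from the current release, but it was curated, so only TF101002 is reported as an unexplained disappearance.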
@avrilcoghlan
avrilcoghlan / treefam_release2.pl
Created March 1, 2013 15:02
Perl script that prints out the total number of genes in families, and the total number of families, in a particular TreeFam release
#!/usr/local/bin/perl
#
# Perl script treefam_release2.pl
# Written by Avril Coghlan ([email protected])
# 10-Feb-09.
#
# This perl script prints out the total number of genes in families,
# and the total number of families, in a particular TreeFam release.
#
@avrilcoghlan
avrilcoghlan / count_worm_paralogs2c.pl
Created March 1, 2013 15:12
Perl script that, given a list of Caenorhabditis elegans paralog pairs, uses the TreeFam tree that they are in to calculate information about the paralogs and the tree
#! /usr/bin/perl
#
# Perl script count_worm_paralogs2c.pl
# Written by Avril Coghlan ([email protected])
# 1-Oct-08.
#
# This perl script reads in the trees in TreeFam-6 that a pair of
# worm paralogs are in, and calculates information about the paralogs
# and the tree.