Skip to content

Instantly share code, notes, and snippets.

@radaniba
Created November 29, 2012 17:18
Show Gist options
  • Select an option

  • Save radaniba/4170510 to your computer and use it in GitHub Desktop.

Select an option

Save radaniba/4170510 to your computer and use it in GitHub Desktop.
Get all UCSC refseq and calculate first intron length and normalized first intron length
BEGIN {
FS="\t";
}
{
split($10,exonStarts,",");
split($11,exonEnds,",");
geneSize=1.0*int($6)-int($5);
exonCount=int($9);
if(exonCount<2)
{
next;
}
if($4=="+")
{
printf("%i\t%f\t%s\n",(exonStarts[2]-exonEnds[1]),(exonStarts[2]-exonEnds[1])/geneSize,$0);
}
else
{
printf("%i\t%f\t%s\n",(exonStarts[exonCount]-exonEnds[exonCount-1]),(exonStarts[exonCount]-exonEnds[exonCount-1])/geneSize,$0);
}
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment