Skip to content

Instantly share code, notes, and snippets.

View TomConlin's full-sized avatar
💭
Data Gardener

Tom Conlin TomConlin

💭
Data Gardener
View GitHub Profile
@TomConlin
TomConlin / FastFastaRead.readme
Created December 27, 2019 22:10
fasta binary formats exploration with Julia (v0.2?)
2019 - almost a decade ago now I helped mentor
a CS student writing a bioinformatics exercise in C++.
To help avoid doing his work I did not use C++.
Instead I experimented in Julia; probably v0.2 or v0.3.
The task was to compare ginormas metagenomics studies quickly.
@TomConlin
TomConlin / FriendsOfEntropy
Last active June 17, 2021 18:17
FriendsOfEntropy - enrich decidability for Variant Call Format(VCF) data across multiple cultivar/strains for Machine Learning
gvcf is not just "gzipped" vcf it is some "genomic" extension to the format
# test
vcftools --gzvcf ./data/gvcf/IDTX_PCRfree_PI562985_ATTACTCG-ATAGAGGC_Sorghum_I549_L1.bwa_pcr_free.raw.snps.indels.g.vcf.gz \
--non-ref-ac-any 1 \
--recode \
--recode-INFO-all \
--stdout > test_vcftools_filtered
@TomConlin
TomConlin / README
Last active March 2, 2021 17:09
Shrub-Datalog
placeholder
@TomConlin
TomConlin / README.md
Last active March 3, 2021 05:25
Dipper-Quality

Dipper repo pylint-quality over time.

This graph is generated with git logs and python's builtin static analysis tool 'pylint'.

Pylint is called with only the default configuration.

For each series-of-commits by an author, the first & last are checked out and static analysis of the code base is preformed. capturing the first & final score (<= 10) of the interval.