The goal of today's practical session is to get your hands dirty with bedtools. We will be studying ChIP-seq data from three different cell types. Each cell type was assayed for H3K27ac. Our research goal is to understand and explore the similarities and differences between the ChIP peaks observed in the three cell types.
http://dl.dropbox.com/u/515640/BIOCH508/H3K27a_HPC7_final.bed
http://dl.dropbox.com/u/515640/BIOCH508/H3K27a_HPC7-derived_final.bed
http://dl.dropbox.com/u/515640/BIOCH508/H3K27a_HPC7-Primed_final.bed
Annotations:
- Genes: /data/slib/quinlan-bedtools-session/data/annotations/gencode.v14.bed
- Conservation: /data/slib/quinlan-bedtools-session/data/annotations/conservation.bed
bedtools is installed on Franklin at /data/slib/quinlan-bedtools-session/bin/. Use the full path to the executable.
Examples:
bedtools --help
bedtools intersect -a a.bed -b b.bed
bedtools intersect -h
bedtools multiinter -i *.bed
bedtools jaccard -a a.bed -b b.bed
bedtools closest -a peaks.bed -b genes.bed
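Since the session asks for the full path to the executable, one convenient pattern is to store that path in a shell variable and reuse it. This is only a sketch: the directory is the one given above, but the file name `bedtools` inside it and the variable name `BEDTOOLS` are assumptions.

```shell
# Directory taken from the text above; the executable name "bedtools" is assumed.
BEDTOOLS=/data/slib/quinlan-bedtools-session/bin/bedtools

# Only call the program if it exists on this machine, so the snippet
# is harmless when pasted somewhere other than Franklin.
if [ -x "$BEDTOOLS" ]; then
    "$BEDTOOLS" --help
else
    echo "bedtools not found at $BEDTOOLS" >&2
fi
```

With the variable set, the examples above can be run as, for instance, `"$BEDTOOLS" intersect -h`.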
- Download the data (above) to your laptop
- Upload the data to UCSC.
- Explore the data.
- How similar are the peaks between the cell types?
- Can you make an estimate as to how similar (that is, how much overlap) the peaks from each cell type are?
- Hints: use bedtools; what is another word for overlap?
- http://en.wikipedia.org/wiki/Jaccard_index
- bedtools has a jaccard metric as well.
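To make the Jaccard index concrete, here is a minimal sketch of the arithmetic for a single pair of overlapping intervals: bases in the intersection divided by bases in the union. The coordinates are made up for illustration and do not come from the session data. bedtools jaccard applies the same idea across all intervals in two BED files.

```shell
# Toy example: two overlapping intervals on the same chromosome (made-up coordinates).
# BED intervals are 0-based and half-open, so length = end - start.
a_start=100; a_end=200
b_start=150; b_end=250

jaccard=$(awk -v s1="$a_start" -v e1="$a_end" -v s2="$b_start" -v e2="$b_end" 'BEGIN {
    lo = (s1 > s2) ? s1 : s2               # start of the overlap
    hi = (e1 < e2) ? e1 : e2               # end of the overlap
    inter = (hi > lo) ? hi - lo : 0        # bases shared by both intervals
    uni = (e1 - s1) + (e2 - s2) - inter    # bases covered by either interval
    printf "%.4f", inter / uni
}')
echo "Jaccard = $jaccard"
```

Here the intersection is 50 bp and the union is 150 bp, so the index is 50/150. A value of 0 means no shared bases and a value of 1 means the two sets of intervals are identical.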
- bedtools multiinter. Have a look at the -examples, -header, and -names options.
- Examples here
- Hint: the fourth column tells you how many datasets had a peak for each interval. awk '$4 == 3' will identify all intervals where all three cell types had a peak.
- The awk example above is the same as perl -ane 'if ($F[3] == 3) {print}'. Note that awk uses 1-based column numbers whereas Perl uses 0-based.
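The awk filter can be tried without running bedtools at all. The sketch below builds a small toy table shaped like the first five columns of multiinter output and filters it on the fourth column. The intervals and the dataset labels (HPC7, derived, primed) are invented for illustration; real output will also contain further columns.

```shell
# Toy table (made-up intervals): chrom, start, end,
# number of datasets with a peak, and which datasets those were.
toy=$(mktemp)
printf 'chr1\t100\t200\t1\tHPC7\n'                >  "$toy"
printf 'chr1\t200\t300\t3\tHPC7,derived,primed\n' >> "$toy"
printf 'chr1\t300\t350\t2\tderived,primed\n'      >> "$toy"
printf 'chr2\t500\t650\t3\tHPC7,derived,primed\n' >> "$toy"

# Keep only intervals where column 4 says all three datasets had a peak.
shared=$(awk '$4 == 3' "$toy")
echo "$shared"

# Count those intervals and sum their lengths (end - start) in base pairs.
summary=$(awk '$4 == 3 { n++; bp += $3 - $2 } END { print n, bp }' "$toy")
echo "$summary"

rm -f "$toy"
```

The same pattern with `$4 == 1` would pick out intervals seen in only one dataset.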
- Focus on H3K27a_HPC7-Primed_final.bed
- awk is still your friend... what column?
Genes: /data/slib/quinlan-bedtools-session/data/annotations/gencode.v14.bed
Conservation: /data/slib/quinlan-bedtools-session/data/annotations/conservation.bed