@rdmpage
rdmpage / index.html
Created February 24, 2015 12:04
BOLD website returned this HTML 2015-02-24
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<head>
<title>www.boldsystems.org [8]</title>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta http-equiv="pragma" content="no-cache">
<style>
body { margin:0; padding:0; }
</style>
</head>
rdmpage / occurrence.geojson
Last active August 29, 2015 14:15
Museu Paraense Emilio Goeldi - Ornithology Collection: GeoJSON for the first 1000 records after dividing lat/long by 10000000; see issue http://dev.gbif.org/issues/browse/POR-2649
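A minimal sketch of the rescaling step, assuming the raw records carry integer coordinates that are too large by a factor of 10000000. The field names (`id`, `decimalLatitude`, `decimalLongitude`) and the sample record are illustrative assumptions, not values taken from the dataset.

```python
import json

SCALE = 10_000_000  # divisor named in the gist description


def to_feature_collection(records, scale=SCALE):
    """Build a GeoJSON FeatureCollection from records with scaled-up coordinates."""
    features = []
    for rec in records:
        lon = rec["decimalLongitude"] / scale
        lat = rec["decimalLatitude"] / scale
        # Skip anything that is still outside the valid range after rescaling.
        if not (-180 <= lon <= 180 and -90 <= lat <= 90):
            continue
        features.append({
            "type": "Feature",
            # GeoJSON positions are ordered [longitude, latitude].
            "geometry": {"type": "Point", "coordinates": [lon, lat]},
            "properties": {"id": rec.get("id")},
        })
    return {"type": "FeatureCollection", "features": features}


# Hypothetical record, for illustration only.
sample = [{"id": "example-1",
           "decimalLatitude": -14500000,
           "decimalLongitude": -484800000}]

collection = to_feature_collection(sample)
print(json.dumps(collection, indent=2))
```

Records whose coordinates are still out of range after division are dropped rather than guessed at, so they can be inspected separately.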
"Catalog# (TTU-M)","TK#","Genus","Species","Country","State","County","Collection Date","Tissues Available","Preparator"
"84773","104001","ARTIBEUS","OBSCURUS","ECUADOR","PASTAZA DISTRICT","---","7/19/2001","Yes","HAYNIE, M L"
"Subspecies: ---Specific Locality: 5 KM E PUYO, SAFARI HOSTERIA PARKElevation: ---Zone: 18Easting: 166624Northing: 9840350Latitude: ---Longitude: ---Collector: BAKER, R J ET ALPreparation Type: SBSex: FEMALEAssociated Sequences: ---Specimen#: ---Prep#: 157Prep Date: 7/19/2001Total Length (mm): 100Tail Length (mm): 0Foot Length (mm): 11Ear Length (mm): 12Weight (g): 14.9Tail Broken: NoTragus Length (mm): 0Forearm Length (mm): 0Reproductive Notes: ---Age: ADULT"
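The third row above is a single cell in which the "Label: value" pairs run together with no separator. One way to pull it apart is to split on the known labels, as in the sketch below. The label list is copied from the row itself; the function name is an assumption, and the approach only works for labels listed in advance.

```python
import re

# Labels as they appear in the detail row above.
LABELS = [
    "Subspecies", "Specific Locality", "Elevation", "Zone", "Easting",
    "Northing", "Latitude", "Longitude", "Collector", "Preparation Type",
    "Sex", "Associated Sequences", "Specimen#", "Prep#", "Prep Date",
    "Total Length (mm)", "Tail Length (mm)", "Foot Length (mm)",
    "Ear Length (mm)", "Weight (g)", "Tail Broken", "Tragus Length (mm)",
    "Forearm Length (mm)", "Reproductive Notes", "Age",
]

# Longest labels first, so a short label never shadows a longer one.
_LABEL_RE = re.compile(
    "(" + "|".join(re.escape(label)
                   for label in sorted(LABELS, key=len, reverse=True)) + "): "
)


def split_fused_fields(text):
    """Split 'Label: valueLabel: value...' into a dict of label -> value."""
    parts = _LABEL_RE.split(text)
    # parts[0] is whatever precedes the first label; after that the list
    # alternates label, value, label, value, ...
    return {label: value.strip()
            for label, value in zip(parts[1::2], parts[2::2])}


raw = ("Subspecies: ---Specific Locality: 5 KM E PUYO, SAFARI HOSTERIA PARK"
       "Elevation: ---Zone: 18Easting: 166624Northing: 9840350Latitude: ---"
       "Longitude: ---Collector: BAKER, R J ET ALPreparation Type: SBSex: FEMALE"
       "Associated Sequences: ---Specimen#: ---Prep#: 157Prep Date: 7/19/2001"
       "Total Length (mm): 100Tail Length (mm): 0Foot Length (mm): 11"
       "Ear Length (mm): 12Weight (g): 14.9Tail Broken: No"
       "Tragus Length (mm): 0Forearm Length (mm): 0Reproductive Notes: ---"
       "Age: ADULT")

fields = split_fused_fields(raw)
print(fields["Zone"], fields["Easting"], fields["Northing"])
```

A value that happens to contain one of the labels followed by ": " would be split in the wrong place, so the result is worth checking against the field count.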
rdmpage / permissions.txt
Created January 8, 2015 11:09
MySQL database permissions
When copying MySQL database files (i.e. the binary files in /usr/local/mysql/data) to a new machine, we need to set permissions.
For the database "db" we do the following:
sudo chown -R _mysql db
sudo chgrp -R _mysql db
sudo chmod -R 644 db
sudo chmod 755 db
rdmpage / zsm.asc
Last active August 29, 2015 14:07
GenBank, BOLD, GBIF (ZSM iBOL)

// GBIF occurrence
== GBIF geotagged, sequence in GenBank not geotagged
//hide
//setup
//output
[source,cypher]
----
// GBIF occurrence
rdmpage / getag.asc
Last active August 29, 2015 14:07
Geotagged in GenBank but not GBIF
rdmpage / gb.asc
Last active August 29, 2015 14:07
GenBank specimen

Simple case of specimen and sequence both in GBIF

rdmpage / gbif.asc
Last active March 31, 2018 05:03
GenBank and GBIF

Cluster multiple occurrence records in GBIF

In this example the same sequence, GQ247641, appears in two sequence datasets ("European Molecular Biology Laboratory Australian Mirror" and "Geographically tagged INSDC sequences"), and the voucher specimen ("AM W.35546.001" or "AMS:W.35546") also occurs in GBIF (provided by "Australian Museum provider for OZCAM"). Linking the two sequence occurrences is trivial: we just link by the accession "GQ247641". Linking the sequence to the museum specimen requires matching the slightly different strings "AM W.35546.001" and "AMS:W.35546".

The graph links three records in GBIF that all refer to the same thing.
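The string matching could be sketched roughly as follows. This is an illustration only, not the method used to build the graph: the alias table, the function names, and the decision to ignore a trailing ".001"-style suffix are all assumptions made to fit the two strings quoted above.

```python
import re

# Hypothetical alias table: institution codes treated as equivalent.
INSTITUTION_ALIASES = {"AM": "AMS", "AMS": "AMS"}

# institution code, separator (space or colon), collection letters, ".",
# catalogue number, and an optional ".NNN" suffix that is ignored (assumption).
_CODE_RE = re.compile(
    r"^\s*([A-Za-z]+)[\s:]+([A-Za-z]+)\.(\d+)(?:\.\d+)?\s*$"
)


def normalise_specimen_code(code):
    """Return (institution, collection, number), or None if the code does not parse."""
    match = _CODE_RE.match(code)
    if match is None:
        return None
    institution, collection, number = match.groups()
    institution = institution.upper()
    institution = INSTITUTION_ALIASES.get(institution, institution)
    return (institution, collection.upper(), int(number))


def same_specimen(a, b):
    """True if both codes parse and normalise to the same key."""
    key_a = normalise_specimen_code(a)
    key_b = normalise_specimen_code(b)
    return key_a is not None and key_a == key_b


print(normalise_specimen_code("AM W.35546.001"))
print(same_specimen("AM W.35546.001", "AMS:W.35546"))
```

Dropping the numeric suffix is the riskiest part: if it identifies a part or lot of the specimen, two distinct records could be merged, so matches made this way are best treated as candidates.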

rdmpage / geotagging.txt
Created August 25, 2014 09:57
BioStor geotagging stats
How many articles?
SELECT COUNT(reference_id) FROM rdmp_reference WHERE PageID <> 0;
+---------------------+
| COUNT(reference_id) |
+---------------------+
|              106617 |
+---------------------+
1 row in set (0.08 sec)