Skip to content

Instantly share code, notes, and snippets.

@UncleCJ
Last active October 17, 2020 17:36
Show Gist options
  • Save UncleCJ/6f9b36f55214e1d9ca5d128a9854e9f3 to your computer and use it in GitHub Desktop.
Save UncleCJ/6f9b36f55214e1d9ca5d128a9854e9f3 to your computer and use it in GitHub Desktop.

Drafting scope for now:

  • Family trees
    • GEDCOM, new formats, proprietary eg. in FamilyTreeDNA web
    • Other genealogists
    • More or less reliable public collections
    • Higher order methods such as ancestral recombination graphs and tree sequences (combines with genetics)
    • Hard, factual relationship structure
    • Unified mesh out of the primitive mother-father-child motif
  • Genetics
    • Biological mechanism
      • Diploid genome, unphased sequence
      • Base pairs versus morgan
      • Sex chromosomes and mitochondria
        • Mutation rate
        • Relation to patronymikon
        • Relationship motif constraints ("X-matches")
      • Meiosis and fertilisation
        • Haploid gametes (half genome)
        • Crossover (probability in sperm vs egg)
    • Matches (siloed in different sites)
      • Estimated relationship distance ("what are the odds")
      • Match metrics (common SNPs, longest match)
      • Common matches
        • Clustering - geneticaffairs (improving, but originally a crude binary adjacency matrix)
        • Pitfall - population segments
        • Triangulation
          • Segments matching for three or more people ...
          • ... or estimated relationship distance
    • "DNA painting" / "Chromosome browser"
      • Plot match segments
      • Deduce ancestor contributions from match segments and vice versa (hypothetically apply other methods recursively for ancestors)
    • Phasing
      • Particularly applicable for three whole-siblings (but hypothetically useful in a "fuzzy" fashion)
      • Learn crossover loci (edges of match segments)
      • Learn grandparent "mosaic" (reconcile with matches)
      • Allen's interval algebra
    • Actual SNP sequences
      • Private!
      • Deduce their and ancestor segments
  • Public records
    • Accessibility
      • Proprietary, subscription and compilations
      • Interpretation, translation
    • Digitize
      • Scan
      • Read (OCR)
      • Index
      • Structure
        • Archives
        • Contents
    • Graves
      • Catalogues
      • Inventory
    • Legal records
    • Libraries
      • News
      • Literature
      • Portrait collections
    • Referencing, quoting and authorities
  • Private knowledge, photos, anecdotes etc.
  • History / geography
    • Locations
      • Proximity
      • Chronology (migration)
    • Events and periods
      • Chronology (biography)
      • Causality
      • Age
      • Duration
      • Allen's interval algebra
    • Entities and borders
      • Countries
      • Counties
      • Parishes
      • Estates, farms and addresses
    • Partly in family trees and public records
  • Populations
    • Location-centric genealogy
    • Migration patterns
    • Statistics
    • Coverage of knowledge (records keeping and research)
    • Ancestry and composition
    • Cultural heritage
  • Health
    • Heritable disease
    • Susceptibilities and advice
    • Other phenotypes
      • Appearance (facial quantization?)
  • Forensic
    • Need to pinpoint ancestors (and their descendants) of perpetrators at scale
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment