@vipranarayan14
Last active May 3, 2024 13:56
lttoolbox Sanskrit morphological dictionary notes

Apertium - Machine Translation

  • morphology
    • morphology = morph (form) + -ology (study)
      • study of forms = SabxarUpANam aXyayanam
      • subanwaH wifanwaH padam
      • bAla + sup = subanwa
      • bAla -- bAlaH, bAlO, bAlAH, bAle, bAlayoH, bAleRu
    • morphological analysis = SabxaviSleRaNa
      • getting grammatical info from the form
    • morphological generation = Sabxowpawwi
      • using grammatical info to create forms
  • lttoolbox
    • morphological analysis & morphological generation
    • Finite State Automata
    • More space- and time-efficient than using a database: faster and lighter
    • compile = convert human-readable file to machine-readable binary
  • xml
    • structured data
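
To make the difference between analysis and generation concrete, here is a minimal Python sketch built only from the bAla forms listed above. It is a plain table lookup, not the finite-state machinery lttoolbox actually uses, and the tag names (n, m, nom, loc, sg, du, pl) are illustrative assumptions rather than symbols from any real dictionary. The table is deliberately partial: it covers only the six forms in these notes and ignores other readings of the same forms.

```python
# Toy paradigm for the noun bAla (WX notation), limited to the six forms
# listed in the notes. Tag names are illustrative assumptions.
PARADIGM = {
    "aH":   "<n><m><nom><sg>",
    "O":    "<n><m><nom><du>",
    "AH":   "<n><m><nom><pl>",
    "e":    "<n><m><loc><sg>",
    "ayoH": "<n><m><loc><du>",
    "eRu":  "<n><m><loc><pl>",
}

# Stem (the part shared by all forms) mapped to its lemma.
STEMS = {"bAl": "bAla"}


def analyse(form):
    """Morphological analysis: surface form -> list of lemma+tags readings."""
    readings = []
    for stem, lemma in STEMS.items():
        if form.startswith(stem):
            suffix = form[len(stem):]
            if suffix in PARADIGM:
                readings.append(lemma + PARADIGM[suffix])
    return readings


def generate(analysis):
    """Morphological generation: lemma+tags -> surface form (or None)."""
    for stem, lemma in STEMS.items():
        for suffix, tags in PARADIGM.items():
            if analysis == lemma + tags:
                return stem + suffix
    return None


print(analyse("bAlaH"))                 # grammatical info from the form
print(generate("bAla<n><m><loc><pl>"))  # form from grammatical info
```

The same table drives both directions, which is the idea behind compiling one dictionary into either an analyser or a generator.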

Dix file format

  • sdefs = symbol definitions
  • sdef = symbol definition
    • n = name
  • pardef = paradigm definition
    • n = name
    • paradigm = example/standard
    • paradigm = uxAharaNam
    • e = entry
  • section
    • e = entry
    • lm = lemma = root = pratipadika/dhatu
    • p = pair = couple = xvandva
    • l = left (form)
    • r = right (grammatical information)
    • s = symbol = grammatical symbol, e.g. puM, prathama vibhakti, ekavacanam
    • par = paradigm
  • More on Monodix basics here.
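
The elements above fit together roughly as in the following sketch: a trimmed-down dix document held in a Python string, plus a small expander that combines each section entry with its paradigm to list the (form, grammatical information) pairs it describes. The paradigm name `bAl/a__n`, the section id, and the symbol names are assumptions made up for illustration, and the fragment has not been run through the real compiler. The expander is only a reading aid written with Python's standard `xml.etree.ElementTree`; it is not how lttoolbox processes the file.

```python
import xml.etree.ElementTree as ET

# A minimal dix-style document using only the elements described in the notes.
# Symbol names and the paradigm name are hypothetical.
DIX = """
<dictionary>
  <sdefs>
    <sdef n="n"/><sdef n="m"/>
    <sdef n="nom"/><sdef n="loc"/>
    <sdef n="sg"/><sdef n="du"/><sdef n="pl"/>
  </sdefs>
  <pardefs>
    <pardef n="bAl/a__n">
      <e><p><l>aH</l><r>a<s n="n"/><s n="m"/><s n="nom"/><s n="sg"/></r></p></e>
      <e><p><l>O</l><r>a<s n="n"/><s n="m"/><s n="nom"/><s n="du"/></r></p></e>
      <e><p><l>AH</l><r>a<s n="n"/><s n="m"/><s n="nom"/><s n="pl"/></r></p></e>
      <e><p><l>e</l><r>a<s n="n"/><s n="m"/><s n="loc"/><s n="sg"/></r></p></e>
      <e><p><l>ayoH</l><r>a<s n="n"/><s n="m"/><s n="loc"/><s n="du"/></r></p></e>
      <e><p><l>eRu</l><r>a<s n="n"/><s n="m"/><s n="loc"/><s n="pl"/></r></p></e>
    </pardef>
  </pardefs>
  <section id="main" type="standard">
    <e lm="bAla"><p><l>bAl</l><r>bAl</r></p><par n="bAl/a__n"/></e>
  </section>
</dictionary>
"""


def side_text(node):
    """Flatten an <l> or <r> element; each <s n="x"/> is rendered as <x>."""
    if node is None:
        return ""
    parts = [node.text or ""]
    for child in node:
        if child.tag == "s":
            parts.append("<" + child.get("n") + ">")
        parts.append(child.tail or "")
    return "".join(parts)


def expand(dix_source):
    """Return every (left, right) pair produced by section entries."""
    root = ET.fromstring(dix_source)
    # Collect each paradigm as a list of (left suffix, right suffix) pairs.
    pardefs = {}
    for pardef in root.iter("pardef"):
        pardefs[pardef.get("n")] = [
            (side_text(e.find("p/l")), side_text(e.find("p/r")))
            for e in pardef.findall("e")
        ]
    pairs = []
    for entry in root.findall("section/e"):
        left = side_text(entry.find("p/l"))
        right = side_text(entry.find("p/r"))
        # Append every paradigm suffix to the entry's stem on both sides.
        for suffix_left, suffix_right in pardefs[entry.find("par").get("n")]:
            pairs.append((left + suffix_left, right + suffix_right))
    return pairs


for form, info in expand(DIX):
    print(form, "=", info)
```

With the real toolchain, a file like this would be compiled with `lt-comp lr` to obtain an analyser or `lt-comp rl` to obtain a generator, and the resulting binary would be used through `lt-proc`.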