Skip to content

Instantly share code, notes, and snippets.

@chasemc
Created July 12, 2023 12:20
Show Gist options
  • Save chasemc/7084ba54824add1dd9fa1c17d8727ab5 to your computer and use it in GitHub Desktop.
Save chasemc/7084ba54824add1dd9fa1c17d8727ab5 to your computer and use it in GitHub Desktop.
import json
z=set()
with open("/media/socialgene_nvme/v0.2.3/refseq/socialgene_neo4j/import/antismash_results.jsonl", "r") as h:
for line in h:
for record in json.loads(line)['records']:
for k,v in record.items():
for i in v:
for ii in i['products']:
z.add(ii)
# {'siderophore', 'thioamide-NRP', 'lanthipeptide-class-ii', 'furan', 'tropodithietic-acid', 'lipolanthine', 'epipeptide', 'amglyccycl', 'other', 'pyrrolidine', 'transAT-PKS-like', 'lanthipeptide-class-iv', 'proteusin', 'terpene', 'sactipeptide', 'NRPS', 'NAPAA', 'resorcinol', 'thiopeptide', 'lassopeptide', 'phenazine', 'T1PKS', 'acyl_amino_acids', 'bottromycin', 'blactam', 'redox-cofactor', 'glycocin', 'PpyS-KS', 'thioamitides', 'prodigiosin', 'melanin', 'nucleoside', 'lanthipeptide-class-iii', 'T2PKS', 'aminocoumarin', 'arylpolyene', 'guanidinotides', 'PBDE', 'hglE-KS', 'RRE-containing', 'fungal-RiPP', 'spliceotide', 'RaS-RiPP', 'oligosaccharide', 'indole', 'hserlactone', 'NRPS-like', 'phosphoglycolipid', 'linaridin', 'microviridin', 'PKS-like', 'betalactone', 'cyclic-lactone-autoinducer', 'PUFA', 'NAGGN', 'ectoine', 'butyrolactone', 'phosphonate', 'cyanobactin', 'T3PKS', 'RiPP-like', 'ranthipeptide', 'ladderane', 'transAT-PKS', 'lanthipeptide-class-v', 'lanthipeptide-class-i', 'LAP', 'CDPS'}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment