Skip to content

Instantly share code, notes, and snippets.

@donbr
donbr / arize-phoenix-iceberg.md
Last active December 6, 2024 17:18
Integrating Arize Phoenix and Apache Iceberg for Local Telemetry Data Management and Querying

Title: Integrating Arize Phoenix and Apache Iceberg for Local Telemetry Data Management and Querying

Authors: Don Branson


Abstract

In modern data observability workflows, capturing and managing telemetry data is crucial for debugging and improving machine learning systems. This paper demonstrates the integration of Arize Phoenix, an open-source observability platform, with Apache Iceberg, a high-performance table format for data lakes, to create a scalable and efficient local telemetry data management and querying system. We present a step-by-step implementation for capturing telemetry data as Parquet files using Arize Phoenix on a local system and using Apache Iceberg to enable schema evolution, time travel, and efficient queries. This solution bridges the gap between data observability and data lake management for machine learning monitoring.

@donbr
donbr / IKG_for_Real-Time_Situational_Awareness.md
Created December 7, 2024 02:56
IKG_for_Real-Time_Situational_Awareness.md

Proposal: Enhanced Interactive Knowledge Graph (IKG) for Real-Time Situational Awareness


Objective

To design and develop a multi-agent, dynamic, and interactive knowledge graph (IKG) system that integrates real-time data fusion, ontology-based semantic layers, and advanced human-in-the-loop capabilities. This system will enhance situational awareness (SA) for Air Force operations by allowing autonomous agents to dynamically manage, analyze, and adapt knowledge graphs, ensuring timely and effective decision-making in rapidly evolving environments.


@donbr
donbr / smolagents-examples.md
Created January 21, 2025 21:23
Smolagents Examples
@donbr
donbr / biogrid_pubmed_35559673_interactions.csv
Last active January 27, 2025 01:59
biogrid_pubmed_35559673_interactions.csv
We can make this file beautiful and searchable if this error is corrected: Unclosed quoted field in line 2.
BIOGRID_INTERACTION_ID,ENTREZ_GENE_A,ENTREZ_GENE_B,BIOGRID_ID_A,BIOGRID_ID_B,SYSTEMATIC_NAME_A,SYSTEMATIC_NAME_B,OFFICIAL_SYMBOL_A,OFFICIAL_SYMBOL_B,SYNONYMS_A,SYNONYMS_B,EXPERIMENTAL_SYSTEM,EXPERIMENTAL_SYSTEM_TYPE,PUBMED_AUTHOR,PUBMED_ID,ORGANISM_A,ORGANISM_B,THROUGHPUT,QUANTITATION,MODIFICATION,ONTOLOGY_TERMS,QUALIFICATIONS,TAGS,SOURCEDB,PUBLICATION_YEAR
3410710,8289,51244,113894,119402,RP1-50O24.1,-,ARID1A,CCDC174,B120|BAF250|BAF250a|BM029|C1orf4|ELD|MRD14|OSA1|P270|SMARCF1|hELD|hOSA1,C3orf19,Negative Genetic,genetic,Feng X (2022),35559673,9606,9606,High Throughput,-,-,"{'1197135': {'ONTOLOGY_TERM_ID': 'HP:0001507', 'NAME': 'Growth abnormality', 'TYPE_ID': 4, 'TYPE_NAME': 'Unspecified', 'QUALIFIERS': {'9503': {'ONTOLOGY_TERM_ID': 'PATO:0000169', 'DESC': ""An organismal quality inhering in a bearer or a population by virtue of the bearer's disposition to survive and develop normally or the number of surviving individuals in a given population."", 'ID': 9503, 'NAME': 'viability'}}, 'FLAG': 'P', 'DESC': '-',
@donbr
donbr / smolagents-biopython-capabilityquestions.md
Created January 27, 2025 20:01
smolagents-biopython-capabilityquestions.md

╭──────────────────────────────────────────────────────────── New run ────────────────────────────────────────────────────────────╮ │ │ │ Correlate TP53BP1 protein interactions with mRNA co-expression patterns in TCGA breast cancer data. │ │ │ │ If you have the required libraries and functions to complete the task: │ │ 1. run the code │ │ 2. limit the response to the first 5 records for initial validation │ │

@donbr
donbr / knowledge-graph-construction.md
Last active February 3, 2025 18:04
knowledge-graph-construction.md
@donbr
donbr / covid19-air-cargo-disruptions-multistakeholder.md
Created January 30, 2025 18:56
covid19-air-cargo-disruptions-multistakeholder.md

🚀 Multi-Stakeholder Perspective: Air Cargo Disruptions During COVID-19 (One-Week Interval)

📌 Global Meta Event: COVID-19 Outbreak (March 2020 - Global Lockdowns, Border Closures, & Supply Chain Shocks)
📆 Time Window: March 16–22, 2020 (One-Week Interval of Major Cargo Disruptions)
🌍 Disruption: Rapid lockdowns, flight cancellations, PPE (Personal Protective Equipment) demand spikes, and re-routing bottlenecks.

Key Stakeholders & Their Perspectives:

  • 📦 Shipper (Medical Equipment Manufacturer) → Focus: Single Critical PPE Shipment
  • ✈ Airline Cargo Operations (Major Carrier) → Focus: Hundreds of disrupted flights & rerouted shipments
  • 🌏 Freight Forwarder (Global Logistics Company) → Focus: Managing demand spikes, re-bookings, and rate volatility
@donbr
donbr / crisis-management-a-personal-perspective.md
Last active February 16, 2025 04:35
crisis management insights and strategic directions

Crisis Management - a personal perspective

The COVID-19 pandemic highlighted critical gaps in our crisis response capabilities, particularly in an industry where I worked directly. This paper examines the requirements for effective training and preparedness during acute phases of the outbreak.

A fundamental challenge lies in preparing both human operators and their supporting AI systems to respond to unprecedented scenarios. Crisis events, by their nature, often fall outside of the standard validation parameters used to test AI systems' generalization capabilities.

The ongoing refinement of the Graph Simulation Pipeline and associated open-source datasets represents a crucial element in advancing crisis preparedness. These tools enhance our ability to transform overwhelming data streams into actionable intelligence during evolving situations.

The framework incorporates two key components:

  1. Replay Mode: leverages tools like NetworkX and TigerGraph to model temporal dependencies using historical crisis
@donbr
donbr / knowledge-graph-pipeline-jan31.md
Last active January 31, 2025 22:39
knowledge-graph-pipeline-jan31.md

Knowledge Graph Pipeline

Core Components:

  • Prefect 3.1 Cloud
  • Great Expectations
  • dbt
  • Hugging Face Datasets

@donbr
donbr / decentralized-intelligence.md
Last active February 3, 2025 08:32
Chaos Happens: decentralized intelligence with centralized oversight

adding critical layers of autonomy, adaptability, and trust to crisis management frameworks.

  • merging these ideas into a hybrid reference architecture and implementation blueprint
  • structured for crisis-ready IoT systems.

Reference Architecture Diagram (Textual Representation)

Layers Components Enhancements