Hi, I'm Siddhant Khare, a software engineer building developer tools, distributed systems, and fun side projects with AI.
- π¦ Twitter/X: @siddhant_K_code
- βοΈ Blog: dev.to/siddhantkcode
- π» GitHub: siddhant-k-code
- π¨π»βπΌ LinkedIn: siddhantkhare24
A collection of interesting / fun / random experiments (not in priority order).
-
A2AS Implementation
Proof-of-concept implementation of the A2AS (Agent-to-Agent Security) framework demonstrating Behavior Certificates, Authenticated Prompts, Security Boundaries, and Enforcement Gates. Inspired by the A2AS Paper. -
Agentflow
Kubernetes for AI agents | One single platform combining orchestration runtime, prompt operations, security layer, observability stack, and cost-aware scheduling. Deploy production AI workflows with confidence: DAG execution, multi-provider routing, budget enforcement, and semantic tracing built-in. -
KV-Cache Profiler
Profiler for LLM inference that analyzes and visualizes KV-cache memory usage across transformer layers. Helps identify inefficiencies, optimize memory, and understand sequence length scaling. -
TokenVM
TokenVM is a high-performance runtime that treats LLM KV cache and activations as a virtual memory working set across GPU VRAM β pinned host RAM β NVMe storage, with intelligent paging, prefetching, and compute-copy overlap. -
CloudArb
GPU arbitrage platform that optimizes cloud compute costs for AI companies through real-time multi-cloud allocation and automated deployment. -
LLMTraceFX
GPU-level LLM execution profiler on Modal. Goes beyond flame graphs with kernel breakdowns, stall attribution, cache diagnostics, and GPT-powered insights. -
Song Vector Explorer
Explore song lyrics as interactive 3D vector spaces using OpenAI embeddings + TensorBoard visualization. -
actionsec
Fast, local-first CLI to scan GitHub Actions workflows for security vulnerabilities using AI. -
SageMap Β· Live Demo
Interactive tool to map and evolve personal beliefs using OpenAI. -
ArchiFusion Β· Live Demo
Transforms architectural ideas into 3D building models with AI. -
LLM Parallelism Explorer
Proof-of-concept for exploring parallelism in LLM inference. Accompanies my blog post on execution parallelism. -
AI Agent Orchestrator
Multi-agent AI system on Cloudflare Workers + Containers that enables collaboration across specialized AI agents for research and analysis. -
MediBrief
Automates summarization of PubMed research papers into concise narrated videos, auto-published on YouTube. -
Cloud Architect AI
AI-assisted design tool for visualizing, planning, and implementing cloud infrastructure. -
Contextify
Lightweight Python script to streamline injecting project context into LLM prompts.
-
Apple Notes MCP Server
Interact with Apple Notes via natural language β create, search, and retrieve notes seamlessly with Claude or other MCP-compatible assistants. -
Smart Photo Journal MCP Server
Search and analyze your photo library with AI. Useful for reminiscing or finding specific memories. -
Devcontainer MCP Server
Manage DevContainers using AI prompts in VS Code, Cursor, or any MCP-compatible editor.
AI-related posts:
- How to Make AI Code Edits More Accurate
- An Easy Way to Stop Claude Code from Forgetting the Rules
- Claude Code Is Costly β Unless You Do This
- Extending Claude's Brain: How MCP Makes AI Actually Useful for Developers
- How Docs AI Search Works: Mintlify-Style with OpenAI Agents SDK
- Isolating AI Agents with DevContainer: A Secure and Scalable Approach
- Exploring Parallelism in Large Language Models (LLMs)
π Full archive: dev.to/siddhantkcode