Skip to content
View Siddhant-K-code's full-sized avatar
🚀
Shipping
🚀
Shipping

Sponsors

Private Sponsor
@ramkumarkb

Block or report Siddhant-K-code

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Siddhant-K-code/README.md

Hi, I'm Siddhant Khare 👋

📍 India | 🛠️ Former Ona/Gitpod engineer (acquired by OpenAI) | 🔐 OpenFGA Core Maintainer | 🎤 International Speaker | siddhantkhare.com

Building infrastructure for AI agents - context efficiency, least-privilege security, production-grade tooling.

📖 The Agentic Engineering Guide

I wrote a book: The Agentic Engineering Guide - 10 parts, 33 chapters on the engineering decisions behind AI agent infrastructure. Context engineering, authorization, cost control, evaluation, adoption playbooks.

Free to read online. PDF & EPUB on Gumroad. Source on GitHub.

Current Projects

Agent Infrastructure & Context Engineering

  • 🧬 Distill - Deterministic context deduplication for LLMs. Clean context in ~12ms, zero LLM calls.
  • 🔍 agent-trace - strace for AI agents. Capture and replay every tool call, prompt, and response. OTLP export to Datadog, Honeycomb, New Relic, Splunk.
  • 🧪 ContextLab - Open-source LLM context engineering toolkit: analyze, compress, visualize.
  • 💾 TokenVM - High-performance runtime treating LLM KV cache as virtual memory with page-based eviction.
  • 📊 KV-Cache Profiler - Profile LLM GPU memory needs before deployment, not after.
  • 🔬 LLMTraceFX - GPU-level LLM inference profiler with kernel timing and AI-powered bottleneck detection.
  • 🤖 Agentflow - Kubernetes for AI agents - orchestration runtime, prompt ops, security layer, observability, and cost-aware scheduling.
  • 🌐 AI Agent Orchestrator - Multi-agent AI system on Cloudflare Workers + Containers.
  • 🚨 OnCall Agent - AI-powered on-call triage agent. Pulls alerts, checks deploys, maps dependencies, reads runbooks, and ranks hypotheses by confidence. Built on OpenAI Agents SDK.
  • 🔢 LLM Parallelism Explorer - Research tool for optimizing parallelism strategies in LLMs (MoE focus).
  • 💰 CloudArb - GPU arbitrage platform for AI compute optimization.

Agent Security & Authorization

  • 🔐 agentic-authz - OpenFGA + MCP authorization gateway for AI agents. Fine-grained access control at team, project, and tool levels.
  • 🔏 Agentic Authorization - Authorization patterns for autonomous AI agent systems using ReBAC with OpenFGA.
  • 🔗 A2AS Implementation - Proof-of-concept of Agent-to-Agent Security framework.
  • 🛡️ Sentinel AI - Hermetic CLI for security scanning and dead-code detection with LLM-powered triage.
  • 🔒 actionsec - Fast, local-first CLI for GitHub Actions security analysis.

Developer Tools & MCP Servers

Applications

  • 🍔 Swiggy Party Agent - AI agent that orders food for your entire team.
  • 👁️ Gemma Vision - Local image intelligence CLI powered by gemma4:12b via Ollama.
  • 🎬 Lipsync Check - Detects audio-visual desync in video files using Gemma 4 via Ollama.
  • 🎵 Song Vector Explorer - Explore song lyrics as interactive 3D vector spaces.
  • 🗺️ SageMap - Interactive tool to map and evolve personal beliefs.
  • 🏥 Radiology Copilot - Gemini-powered multimodal radiology assistant.
  • 🏛️ ArchiFusion - Transforms architectural ideas into 3D building models.
  • 💊 MediSearchAI - A smarter way to search for medicines.
  • 📄 MedBrief - Automated PubMed research paper summarization into narrated videos.
  • 📚 LangChain x OpenAI: Bring Your Own Data - Train with custom markdown data using LangChain.
  • 🖥️ GPT-CLI - GPT in your terminal.

Maintainer Roles

  • 🔑 OpenFGA - Core Maintainer of Google Zanzibar-style fine-grained authorization system (CNCF Incubating). First independent maintainer.
  • 🔍 GitHub1s - Maintainer of one-second code reading for GitHub repositories.

Legacy / Earlier Work

What I'm Doing

  • Engineering @ Ona - Building infrastructure for AI agents (formerly Gitpod).
  • Open Source - Maintaining OpenFGA, GitHub1s, and shipping agent infrastructure tools.
  • Writing - The Agentic Engineering Guide (book) + 40+ technical articles on AI infrastructure, security, and context engineering on siddhantkhare.com/writing.
  • Speaking - KubeCon India 2025: Beyond Productivity: Scaling Cloud Dev Environments for Faster Feedback & Sustainable Engineering. Available for conferences on AI agent security and authorization.
  • Mentoring - Book a session on MentorCruise.

Latest Blog Posts

Connect

Twitter LinkedIn YouTube Dev.to Website GitHub Sponsors Buy Me a Coffee


Recognition

Videos

Research Focus

Building at the intersection of LLM efficiency and agent security:

  1. Context Efficiency & Reliability - Deterministic algorithms for context deduplication and optimization (Distill, ContextLab, TokenVM)
  2. Agent Authorization & Audit Trails - Google Zanzibar-style authorization for agent-tool interactions (agentic-authz, OpenFGA)
  3. Adversarial Robustness & Observability - Detecting and mitigating attacks on agent tool-use pipelines (Sentinel AI, agent-trace, LLMTraceFX)

Pinned Loading

  1. conwnet/github1s conwnet/github1s Public

    One second to read GitHub code with VS Code.

    TypeScript 23.3k 906

  2. openfga/openfga openfga/openfga Public

    A high performance and flexible authorization/permission engine built for developers and inspired by Google Zanzibar

    Go 5.4k 428

  3. distill distill Public

    Context intelligence layer for LLM agents: persistent memory with write-time dedup, sensitivity tagging, conflict detection, and hierarchical decay. ~12ms. No LLM calls. MIT.

    Go 171 16

  4. agentic-authz agentic-authz Public

    Fine-grained authorization for AI agents using OpenFGA.

    TypeScript 65 4

  5. memory-journal-mcp-server memory-journal-mcp-server Public

    MCP server designed to help you search and analyze your photo library (iCloud - Apple photos)

    Python 24 7

  6. agent-trace agent-trace Public

    Observability for AI agents. See what your agent did, why it cost that much, and what to fix.

    Python 74 6