Optimize token usage for Claude API calls
-
Updated
Jun 28, 2026 - JavaScript
Optimize token usage for Claude API calls
Honey (I Shrunk the AI) by GreenPT: a cross-tool coding skill that cuts AI coding-agent token usage and LLM API costs — write less code, less prose, and denser agent-to-agent handoffs (−53%, lossless in benchmarks) with no loss of quality. Works with Claude Code, Cursor, GitHub Copilot, Codex, Gemini CLI, Windsurf, Cline & Kiro.
Claude Code plugin that tracks token usage, identifies wasted context, and saves 30-50% on API costs. Heatmaps, ROI reports, budget alerts, efficiency scores, git-aware suggestions — all local, zero config.
Claude Code skills for developers who code like cats — never more effort than the problem requires.
45% cost reduction measured. The only Claude Code plugin built from CC source analysis — cache expiry prevention, SubTask auto-delegation, zero-cost context restoration, real-time dashboard. Max Plan + API pay-per-use.
Local-first context compression for AI coding tools. One binary saves 85-93% of redundant tokens across every LLM call.
Token-efficient Claude Code workspace with parallel agents and persistent memory. Research → Plan → Implement → Validate workflow.
Chrome Extension that lets you continue any AI conversation anywhere—without losing context. No more copy-paste—move full context across AI tools.
Verdict-first output for AI coding agents. Tiny prompt + installer for Claude Code, Codex, Gemini, Cursor, opencode, and 30+ agents.
reShapr website
Open-source library of token-efficient prompts — 18 prompts, 14 categories, 3 variants each (Lean/Balanced/Max Quality). Covers code, research, creative writing, career, mental health, and more.
Claude Code CLI skill that delegates complex tasks to an OpenCode subagent via ACP protocol, saving 50-90% tokens.
50%+ fewer input tokens. 20%+ shorter output. Do more work in the same context window.
Token-safe C++/C# code search via clangd/Roslyn index instead of grep. Local-only, no IDE required. Claude Code plugin + vts CLI.
See cache health, context fill, token burn, rate limits, and peak hours in Claude Code CLI. The status line for tokenminning - cache rules everything around me - dolla, dolla bill, y'all.
Reduce noisy shell, CI, diff, and MCP-adjacent output into compact answers your coding agent can actually use. Alembic is a local, skill-first tool for Codex and Claude that cuts context waste without adding a network dependency.
🧠 Knowledge Graph Memory for AI Coding Agents & Openclaw - Full offline mode with Docker. Integrates with Claude, Cline, Cursor, Windsurf, and more. Auto-extracts entities & relationships. No API keys required.
serena MCP (38% less tokens). Quick setup: npx serena-slim --setup
CLI for diagnosing Claude Code context burn and generating practical cleanup guidance.
The missing Middleware for reducing LLM API costs through TOON format by converting JSON to TOON automatically with 30-60% token savings with no code changes.
Add a description, image, and links to the token-optimization topic page so that developers can more easily learn about it.
To associate your repository with the token-optimization topic, visit your repo's landing page and select "manage topics."