DEV Community

# localllm

Posts

πŸ‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
prima.cpp local llm benchmark: 15% Faster Than llama.cpp

prima.cpp local llm benchmark: 15% Faster Than llama.cpp

8 min read
How I Run My Content Tooling on a Local Model for $0

How I Run My Content Tooling on a Local Model for $0

5 min read
Local AI Agent Browser Extension: Hermes in 120ms

Local AI Agent Browser Extension: Hermes in 120ms

9 min read
Cool AI Projects That Failed: The File Integrity Gap

Cool AI Projects That Failed: The File Integrity Gap

5 min read
Free Local AI Coding Agent: Cut Dev Costs 90%

Free Local AI Coding Agent: Cut Dev Costs 90%

11 min read
My 2-Month local llm daily coding replacement: Real Benchmarks

My 2-Month local llm daily coding replacement: Real Benchmarks

7 min read
Book Library: A Local RAG That Answers From My Own PDFs

Book Library: A Local RAG That Answers From My Own PDFs

5 min read
Cline + LM Studio 2026: complete setup guide, the 32k context trap, and which coding models actually hold up

Cline + LM Studio 2026: complete setup guide, the 32k context trap, and which coding models actually hold up

5 min read
Kimi K2.6 for Local AI in 2026: What VRAM and System RAM You Need to Actually Run the 1T-Parameter MoE Coding Leader

Kimi K2.6 for Local AI in 2026: What VRAM and System RAM You Need to Actually Run the 1T-Parameter MoE Coding Leader

6 min read
Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s

Qwen 3.6 35B-A3B for Local AI in 2026: The 24GB VRAM Line That Gets You 120 tok/s

6 min read
Open-LLM-VTuber Review: Offline AI Companion with Live2D

Open-LLM-VTuber Review: Offline AI Companion with Live2D

10 min read
Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]

Local LLM Hardware Requirements in 2026: What You Actually Need for Every Model Tier [Guide]

8 min read
Hermes Agent Desktop Free With Local LLMs: The Claude Code Alternative Nobody's Billing You For [2026]

Hermes Agent Desktop Free With Local LLMs: The Claude Code Alternative Nobody's Billing You For [2026]

8 min read
Two Qwen3 Models on One DGX Spark: The Residency Math for Local LLM Coding

Two Qwen3 Models on One DGX Spark: The Residency Math for Local LLM Coding

5 min read
[Day 11] I turned my cat into anime art β€” and the AI drew a human girl instead. One photo through IPAdapter pulls it back to a cat

[Day 11] I turned my cat into anime art β€” and the AI drew a human girl instead. One photo through IPAdapter pulls it back to a cat

5 min read
πŸ‘‹ Sign in for the ability to sort posts by relevant, latest, or top.