DEV Community

# benchmark

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
GLM Is the New Hotness, So Let's Test It On the Homelab

GLM Is the New Hotness, So Let's Test It On the Homelab

2
11 min read
Debugging Deployments with Gemma 12B, TPU v6e-4, MCP, and Antigravity CLI

Debugging Deployments with Gemma 12B, TPU v6e-4, MCP, and Antigravity CLI

5
16 min read
Populating a Java POJO with Reflection vs. with the ClassFile API - small benchmark

Populating a Java POJO with Reflection vs. with the ClassFile API - small benchmark

1
1 min read
DiffusionGemma 26B 登陸 M2 Max:MLX 吞吐量實測與 Context 極限挑戰

DiffusionGemma 26B 登陸 M2 Max:MLX 吞吐量實測與 Context 極限挑戰

3 min read
DiffusionGemma 26B 挑戰 GH200 效能極限

DiffusionGemma 26B 挑戰 GH200 效能極限

1
2 min read
Portrait Generation Benchmark Q1 2026: Flux.2 vs SDXL vs Proprietary

Portrait Generation Benchmark Q1 2026: Flux.2 vs SDXL vs Proprietary

3 min read
Model Showdown Round 7: Five Local Models vs. One Cloud Model on a Real Coding Task

Model Showdown Round 7: Five Local Models vs. One Cloud Model on a Real Coding Task

1
9 min read
A UMAP With Arrows Is Not a Benchmark. This Is

A UMAP With Arrows Is Not a Benchmark. This Is

7 min read
Engineering CellFateBench: A Reproducible Python Benchmark for Single-Cell Genomics Reasoning

Engineering CellFateBench: A Reproducible Python Benchmark for Single-Cell Genomics Reasoning

8 min read
PostAll vs Manual Content Creation: A Developer's Performance Breakdown

PostAll vs Manual Content Creation: A Developer's Performance Breakdown

9 min read
Frontier Bakeoff: We Benchmarked Fable 5 Hours Before the Shutdown

Frontier Bakeoff: We Benchmarked Fable 5 Hours Before the Shutdown

6 min read
Too cheap to be good? Think again.

Replacing bloated panels with Caddy and scripts

Too cheap to be good? Think again.

85
115
14 min read
Ideogram 4.0 is Good. Just Good.

Ideogram 4.0 is Good. Just Good.

2 min read
I Tested CodeGraph on Hono. The Tool-Call Savings Reproduce — the Cost Savings Don't.

I Tested CodeGraph on Hono. The Tool-Call Savings Reproduce — the Cost Savings Don't.

13 min read
We Benchmarked the Most Popular Code Search Tools. We Beat All of Them.

We Benchmarked the Most Popular Code Search Tools. We Beat All of Them.

11 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.