ollama (@ollama) / X

ollama

8,421 posts

ollama

@ollama

ollama.com

California, USA

github.com/ollama/ollama

Joined August 2023

Following

168.5K

Followers

Pinned
ollama
@ollama
Jun 16
🤯 GLM-5.2 is here — built for long-horizon coding and agentic tasks, now with a solid 1M-token context. The strongest open-source coding model yet! Available now on Ollama's cloud, hosted in the US on the latest @NVIDIAAI Blackwell datacenter GPUs. Privacy policy and zero
Z.ai
@Zai_org
Jun 16
Introducing GLM-5.2: Frontier Intelligence, Open Weights - Significant improvements in coding and agentic tasks - Strong long-horizon capabilities with a 1M context window - Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong
223K
ollama reposted
Todd Dailey
@twid
5h
Article
I tested Ollama's "90% faster Gemma 4" claim on an M5 Max MacBook Pro
Bottom line up front: Gemma 4 now runs on Ollama at a blazing 152 tok/s on an M5 Max MacBook Pro using the new MLX-optimized 26B MoE model. You should run it! "ollama pull gemma4:31b-mlx" and you're...
5.2K
ollama
@ollama
5h
Congratulations to our friends at @togethercompute. Exciting moment for open models!!
Vipul Ved Prakash
@vipulved
11h
We @togethercompute believe intelligence should be abundant, not expensive. Today we announced our Series C funding of $800m @ $8.3B valuation, to continue to build the world's most efficient platform for generative AI. Thanks @nikogallogly for telling our story in @nytimes!
12K
ollama
@ollama
Jul 1
Gemma 4 is now nearly 90% faster on Apple Silicon with Ollama using MLX! The speedup comes from improved multi-token prediction (MTP), now on by default for Gemma 4, with more models to come. Ollama automatically tunes how many tokens to draft as it runs, so it never slows
164K
ollama
@ollama
Jul 1
Read more:
Faster Gemma 4 on MLX with multi-token prediction· Ollama Blog
From ollama.com
10K
ollama
@ollama
Jun 27
Run Ornith with Ollama: ollama run ornith For coding, use it with Claude or Pi: ollama launch claude --model ornith ollama launch pi --model ornith For the more capable 35B model, use: ollama launch claude --model ornith:35b
Ornith
@ornith_
Jun 25
Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on
132K
ollama
@ollama
Jun 27
Model page:
ollama.com
ornith
A self-improving family of open-source models for agentic coding
7.3K
ollama reposted
jietang
@jietang
Jun 24
5.2 could be better with more RL ...
青龍聖者
@bdsqlsz
Jun 24
Deepswe's benchmark results are my own experience. I've used all models, GLM 5.2 ≈ Claude Opus 4.6–4.7. Kimi 2.7 code more like inference optimization. Looking forward to K3. Doubao-seed 2.1 Pro around 37% ≈ Gemini 3.5 Flash. code are quite weak, but visual are strong.
180K
ollama reposted
LanceDB
@lancedb
Jun 23
At @aiDotEngineer World's Fair next week? Come join us Tuesday night 🏓 We're co-hosting an evening at SPIN with @Theoryvc and @ollama. Talk shop with the people building the next-generation of local and cloud AI infrastructure, grab a drink, and get a few games in. 📅 Tuesday,
Local Serve · Luma
From luma.com
5.3K
ollama reposted
Evan Boyle
@_Evan_Boyle
Jun 23
BYOK is now live in the GitHub Copilot App! Works with @ollama, foundry, and any OAI completions or Anthropic compatible messages endpoint. Give it a try today!
29K
ollama reposted
Ankit Gupta
@agupta
Jun 23
.@opencode and @ollama if you ask me
Suhail
@Suhail
Jun 22
Which means, somebody will make a lot of money commoditizing it vs charging a subscription for it.
14K
ollama reposted
Tomasz Tunguz
@ttunguz
Jun 22
The sharpest questions in AI live at the local–cloud boundary : where should inference run, & where should your data live? In town for AIE? Come hash it out with @Theoryvc, @lancedb & @ollama. June 30 · 6–9PM · SPIN SF 🏓
Local Serve · Luma
From luma.com
9.6K
ollama reposted
Ray Fernando
@RayFernando1337
Jun 22
I found the Ollama!!
10K
ollama
@ollama
Jun 21
Let’s go open models! ❤️
Guillermo Rauch
@rauchg
Jun 21
Genuinely impressed, almost shocked, at how good GLM-5.2 by @Zai_org is at coding. This changes things.
80K
ollama reposted
Matt Furnari
@matthewfurnari
Jun 20
Replying to @ItakGol
Yeah I agree. The big winner is going to be Ollama: I've offloaded all my supervisory, code review, and ontology learning agents to my $20 a month Ollama subscription, because I keep getting weight limited on my triple Chat GPT Pro accounts. It's the first model that I can do
12K