Log inSign up
ollama
8,421 posts
user avatar
ollama
@ollama
ollama.com
California, USA
github.com/ollama/ollama
Joined August 2023
11
Following
168.5K
Followers
  • Pinned
    user avatar
    ollama
    @ollama
    Jun 16
    🤯 GLM-5.2 is here — built for long-horizon coding and agentic tasks, now with a solid 1M-token context. The strongest open-source coding model yet! Available now on Ollama's cloud, hosted in the US on the latest @NVIDIAAI Blackwell datacenter GPUs. Privacy policy and zero
    GLM 5.2
    user avatar
    Z.ai
    @Zai_org
    Jun 16
    Introducing GLM-5.2: Frontier Intelligence, Open Weights - Significant improvements in coding and agentic tasks - Strong long-horizon capabilities with a 1M context window - Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong
    223K
  • ollama reposted
    user avatar
    Todd Dailey
    @twid
    5h
    Article cover image
    Article
    I tested Ollama's "90% faster Gemma 4" claim on an M5 Max MacBook Pro
    Bottom line up front: Gemma 4 now runs on Ollama at a blazing 152 tok/s on an M5 Max MacBook Pro using the new MLX-optimized 26B MoE model. You should run it! "ollama pull gemma4:31b-mlx" and you're...
    5.2K
  • user avatar
    ollama
    @ollama
    5h
    Congratulations to our friends at @togethercompute. Exciting moment for open models!!
    user avatar
    Vipul Ved Prakash
    Together AI
    @vipulved
    11h
    We @togethercompute believe intelligence should be abundant, not expensive. Today we announced our Series C funding of $800m @ $8.3B valuation, to continue to build the world's most efficient platform for generative AI. Thanks @nikogallogly for telling our story in @nytimes!
    12K
  • user avatar
    ollama
    @ollama
    Jul 1
    Gemma 4 is now nearly 90% faster on Apple Silicon with Ollama using MLX! The speedup comes from improved multi-token prediction (MTP), now on by default for Gemma 4, with more models to come. Ollama automatically tunes how many tokens to draft as it runs, so it never slows
    164K
    user avatar
    ollama
    @ollama
    Jul 1
    Read more:
    Faster Gemma 4 on MLX with multi-token prediction· Ollama Blog
    From ollama.com
    10K
  • user avatar
    ollama
    @ollama
    Jun 27
    Run Ornith with Ollama: ollama run ornith For coding, use it with Claude or Pi: ollama launch claude --model ornith ollama launch pi --model ornith For the more capable 35B model, use: ollama launch claude --model ornith:35b
    user avatar
    Ornith
    @ornith_
    Jun 25
    Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on
    132K
    user avatar
    ollama
    @ollama
    Jun 27
    Model page:
    ollama.com
    ornith
    A self-improving family of open-source models for agentic coding
    7.3K
  • ollama reposted
    user avatar
    jietang
    Z.ai
    @jietang
    Jun 24
    5.2 could be better with more RL ...
    user avatar
    青龍聖者
    @bdsqlsz
    Jun 24
    Deepswe's benchmark results are my own experience. I've used all models, GLM 5.2 ≈ Claude Opus 4.6–4.7. Kimi 2.7 code more like inference optimization. Looking forward to K3. Doubao-seed 2.1 Pro around 37% ≈ Gemini 3.5 Flash. code are quite weak, but visual are strong.
    180K
  • ollama reposted
    user avatar
    LanceDB
    @lancedb
    Jun 23
    At @aiDotEngineer World's Fair next week? Come join us Tuesday night 🏓 We're co-hosting an evening at SPIN with @Theoryvc and @ollama. Talk shop with the people building the next-generation of local and cloud AI infrastructure, grab a drink, and get a few games in. 📅 Tuesday,
    Local Serve · Luma
    From luma.com
    5.3K
  • ollama reposted
    user avatar
    Evan Boyle
    GitHub
    @_Evan_Boyle
    Jun 23
    BYOK is now live in the GitHub Copilot App! Works with @ollama, foundry, and any OAI completions or Anthropic compatible messages endpoint. Give it a try today!
    29K
  • ollama reposted
    user avatar
    Ankit Gupta
    Y Combinator
    @agupta
    Jun 23
    .@opencode and @ollama if you ask me
    user avatar
    Suhail
    @Suhail
    Jun 22
    Which means, somebody will make a lot of money commoditizing it vs charging a subscription for it.
    14K
  • ollama reposted
    user avatar
    Tomasz Tunguz
    Theory Ventures
    @ttunguz
    Jun 22
    The sharpest questions in AI live at the local–cloud boundary : where should inference run, & where should your data live? In town for AIE? Come hash it out with @Theoryvc, @lancedb & @ollama. June 30 · 6–9PM · SPIN SF 🏓
    Local Serve · Luma
    From luma.com
    9.6K
  • ollama reposted
    user avatar
    Ray Fernando
    @RayFernando1337
    Jun 22
    I found the Ollama!!
    10K
  • user avatar
    ollama
    @ollama
    Jun 21
    Let’s go open models! ❤️
    user avatar
    Guillermo Rauch
    Vercel
    @rauchg
    Jun 21
    Genuinely impressed, almost shocked, at how good GLM-5.2 by @Zai_org is at coding. This changes things.
    80K
  • ollama reposted
    user avatar
    Matt Furnari
    @matthewfurnari
    Jun 20
    Replying to @ItakGol
    Yeah I agree. The big winner is going to be Ollama: I've offloaded all my supervisory, code review, and ontology learning agents to my $20 a month Ollama subscription, because I keep getting weight limited on my triple Chat GPT Pro accounts. It's the first model that I can do
    12K

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

Terms·Privacy·Cookies·Accessibility·Ads Info·© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up