Prime Intellect’s cover photo
Prime Intellect

Prime Intellect

Software Development

The Open Superintelligence Stack

About us

Prime Intellect makes frontier AI training accessible to every company. We operate as both a full-stack platform and an open research lab, enabling organizations to train their own models by directly leveraging our state-of-the-art agentic training infrastructure.

Website
https://www.primeintellect.ai/
Industry
Software Development
Company size
11-50 employees
Headquarters
San Francisco
Type
Privately Held

Locations

Employees at Prime Intellect

Updates

  • We're excited to join the NVIDIA Nemotron Coalition 💚 Frontier open models matter for the whole ecosystem. We're bringing the RL infrastructure and environments we've built over the last year to help scale agentic capabilities. Our contribution: the post‑training & RL environments layer. 2,500+ open RL environments, the verifiers framework, Prime Sandbox, and NeMo Gym integration — all natively integrated into NVIDIA's ecosystem, incl. Nemotron.

    • No alternative text description for this image
  • Today, we're sharing how our collaboration with NVIDIA helps power the open superintelligence stack. The next frontier of AI infrastructure is building systems for agentic models that can reason for hours, use tools, execute code, and learn from outcomes at scale. At Prime Intellect, we're building that stack end to end: → agentic RL training and inference on frontier open models → RL sandboxes → open-source libraries like verifiers + prime-rl Giving everyone access to frontier lab infrastructure. Our training and serving workloads run on NVIDIA Blackwell systems at rack scale and soon on NVIDIA Vera Rubin. This is powering the next generation of our training, inference, and sandbox stack. Inference is becoming central to both RL rollouts and production agents. We chose NVIDIA Dynamo because agentic inference at scale means handling global deployments, long-context reasoning, multi-turn trajectories, sparse MoEs, and large fleets of adapters. This collaboration also extends into research: → Nemotron models available through Lab → NVIDIA NeMo Gym environments in verifiers/prime-rl → NVIDIA NeMo RL stack integrated with verifiers Read more on our blog: https://lnkd.in/gsqhJvGa

    • No alternative text description for this image
  • Prime Intellect reposted this

    Excited to join NVIDIA GTC ahead of Jensen's keynote next Monday to talk about the agentic AI inflection point and building the self-improving agent stack "The agentic AI inflection point – Harrison Chase (LangChain), Samuel G. Rodriques (Edison Scientific), Peter Steinberger (OpenClaw), and Vincent Weisser (Prime Intellect) look at the rise of agentic systems that reason step by step, use tools, and complete complex tasks. As these systems generate far more tokens per workflow, inference demand is rising quickly and reshaping how software is built and deployed." https://lnkd.in/gh9QKUxz

    View organization page for NVIDIA

    5,740,439 followers

    The #NVIDIAGTC keynote pregame lineup is officially live. 🎥 Before Jensen Huang’s keynote, join us for NVIDIA GTC Live as we bring together analysts, founders, and industry leaders to discuss the technologies shaping the next phase of AI. Hosts Sarah Guo (Conviction), Gavin Baker (Atriedes), Alfred Lin (Sequoia Capital), and Tiffany Janzen (TiffinTech) will dive deep with guests into: ✔️ The shift from general-purpose to accelerated computing. ✔️ How AI is as an essential infrastructure layer. ✔️ The power of open models. ✔️ The rise of agentic AI. ✔️ Advances in physical AI and robotics. 🗓️ Join us Monday, March 16 at 8:00 a.m. PT. Get all the details 👇

  • We're excited to introduce Arcee AI's Trinity Large model. An open 400B total, 13B active parameter Mixture of Experts model with frontier performance. Trained in collaboration between Arcee AI, DatologyAI and Prime Intellect. Trinity Architecture Key design choices: - Interleaved local + global attention (3:1 pattern) - Grouped-query + gated attention - New load-balancing method (SMEBU) - Depth scaled sandwich norm and QK norm With extreme sparsity, built for long context and fast inference. Infrastructure - Large-scale synthetic data generation on 2k H100s - Training Trinity Large on 2k B300 GPUs Training stack: - Modified torchtitan - Muon optimizer - HSDP with FSDP group size 128 - Expert parallelism - Context parallelism for context extension - Improvements to recover quickly from hardware failures Benchmark Performance Despite lower active parameters, Trinity Large Base is very competitive with other top open models Inference Performance Trinity Large with its extreme sparsity and interleaved local/global attention results in: - Higher throughput - Lower end-to-end latency - Faster time to first token Next up: we’re continuing to scale agentic RL on top of Trinity Large using our frontier open post-training stack over the coming weeks. The full technical report by Arcee AI https://lnkd.in/db89eJdS The blog post by Arcee AI https://lnkd.in/g6NM5J3K

  • Prime Intellect reposted this

    We're releasing Medmarks v0.1, the largest completely open-source automated evaluation suite for assessing the medical capabilities of LLMs! Developed in our MedARC community, with support from Prime Intellect So far we’ve explored 46 models to figure out the best! Why did we build this benchmark suite? Because there isn't yet a completely open and easy-to-run medical LLM benchmark, evaluated on various kinds of realistic tasks and updated regularly with new models. We aggregated a total of 28 tasks in 20 benchmarks in our suite. We divide them into two subsets: 1. Medmarks-Verifiable: 14 verifiable benchmarks, mostly multiple-choice question answering but also other tasks like medical calculations 2. Medmarks-OE: 6 open-ended benchmarks, e.g. answering patient questions On average, we find that GPT-5.1, GPT-5.2, and Qwen3-235B-A22B-Thinking are the best-performing models on the medical tasks evaluated. Lots of interesting observations: • medical-specific LLMs can be quite performant • open weight models are close to proprietary model perf but less token efficient • reasoning post-training improves performance • There's a few standout datasets that aren't saturated yet We built our evaluation suite on top of Prime Intellect's verifiers library. This provides us many advantages: 1. RL environments for free for datasets with training split 2. integration with Inference API enables easy benchmarking of model APIs 3. benchmarks available on Hub (under the MedARC organization) This is only just the beginning! We plan to add more models and benchmarks, and of course develop our own models and benchmarks! If you're interested in collaborating, join the MedARC Discord or contact us (contact@sophontai.com) To learn more about our benchmark suite, leaderboard, and interesting findings, read our blog post: https://lnkd.in/gNeZRVdw Check out the leaderboard: https://medmarks.ai code: https://lnkd.in/gNwmaXPZ environment hub: https://lnkd.in/gPCv3NUJ

    • No alternative text description for this image

Similar pages

Browse jobs

Funding

Prime Intellect 2 total rounds

Last Round

Seed

US$ 15.0M

See more info on crunchbase