Vaibhav (VB) Srivastav
13K posts
@reach_vb
founder mode @OpenAI | ex @huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own
codex -m gpt 5.5
vaibhavs10.github.io
Joined June 2017
292 Following · 52.3K Followers
  • Pinned
    Vaibhav (VB) Srivastav
    @reach_vb
    Jan 21, 2024
    I did this. Fuck what anyone else says, just put the pedal to the metal and BUILD. Push spaghetti code. Nobody cares about OOPs. Doesn’t matter what anyone thinks. Just keep on doing. Document in public. Don’t listen to the haters. Release more than you refactor. Just keep
    Nick Dobos
    @NickADobos
    Jan 21, 2024
    Just do stuff
    1.1M
  • Vaibhav (VB) Srivastav
    @reach_vb
    Jan 21, 2025
    Let’s fucking goo!! DeepSeek R1 1.5B running FULLY LOCALLY in your browser at 60 tok/sec powered by WebGPU 🔥 Intelligence truly is too cheap to meter! ⚡️
    973K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Jan 27, 2025
    WAIT A SECOND, DeepSeek just dropped Janus 7B (MIT Licensed) - multimodal LLM (capable of generating images too) 🔥
    556K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Mar 13, 2025
    HOLY SHITT, Sesame Labs just dropped CSM (Conversational Speech Model) - Apache 2.0 licensed! 💥 > Trained on 1 MILLION hours of data 🤯 > Contextually aware, emotionally intelligent speech > Voice cloning & watermarking > Ultra fast, real-time synthesis > Based on llama
    690K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Jan 28, 2025
    NEW: DeepSeek Janus Pro 1B (Generate Images, Chat with PDF) running in your browser, 100% local, powered by WebGPU 🔥 Zero server costs, brought to you by transformers.js - try it out!
    635K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Jan 20, 2025
    "DeepSeek-R1-Distill-Qwen-1.5B outperforms GPT-4o and Claude-3.5-Sonnet on math benchmarks with 28.9% on AIME and 83.9% on MATH." 1.5B did WHAT?
    991K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Aug 29, 2025
    🚨 Apple just released FastVLM on Hugging Face - 0.5, 1.5 and 7B real-time VLMs with WebGPU support 🤯 > 85x faster and 3.4x smaller than comparable sized VLMs > 7.9x faster TTFT for larger models > designed to output fewer output tokens and reduce encoding time for high
    582K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Oct 27, 2024
    Wow! Meta dropped an open NotebookLM recipe: NotebookLlama 🔥 It uses L3.2 1B/3B for pre-processing the PDF, L3.1 70B for transcript creation, L3.1 8B for re-writes and Parler TTS for text to speech ⚡ Step 1: Pre-process PDF: Use Llama-3.2-1B-Instruct to pre-process the PDF
    857K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Feb 10, 2025
    HOLY FUCK! @ZyphraAI just dropped Zonos - Apache 2.0 licensed, Multilingual, Text to Speech model with INSTANT voice cloning! 🔥 > Zero-shot TTS with Voice Cloning: Input text and a 10-30 second speaker sample to generate high-quality text-to-speech output > Audio Prefix
    299K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Feb 27, 2025
    HOLY SHITT, Microsoft dropped an open-source Multimodal (supports Audio, Vision and Text) Phi 4 - MIT licensed! 🔥 > Beats Gemini 2.0 Flash, GPT-4o, Whisper, SeamlessM4T v2 > Models on Hugging Face hub, integrated with Transformers! Phi-4-Multimodal: > Modalities: Integrates
    210K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Sep 26, 2024
    Fuck yeah! Llama 3.2 3B running on your browser! 100% local, powered by WebGPU & MLC 🦙
    282K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Jan 20, 2025
    holy fuck, these gigachads dropped 6 distilled models right from 1.5B to 70B 🔥
    121K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Jan 24, 2025
    HOLY SHITT! Llasa TTS - Llama 3.2 fine-tune with ultra realistic audio 🔥 > supports voice cloning in English + Chinese > trained on 250K hours of audio > 1B, 3B model (8B soon) > emotional speech (happy, angry, sad, whisper) > open weights & works with transformers/vLLM
    222K
  • Vaibhav (VB) Srivastav
    @reach_vb
    Dec 16, 2024
    Microsoft open sourced MarkItDown - convert files to Markdown - perfect for using with LLMs! 🔥
    197K