Vaibhav (VB) Srivastav (@reach_vb) / X

Vaibhav (VB) Srivastav

13K posts

Vaibhav (VB) Srivastav

@reach_vb

founder mode @OpenAI | ex @huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own

codex -m gpt 5.5

vaibhavs10.github.io

Joined June 2017

Pinned
Vaibhav (VB) Srivastav
@reach_vb
Jan 21, 2024
I did this. Fuck what anyone else says, just put the pedal to the metal and BUILD. Push spaghetti code. Nobody cares about OOPs. Doesn’t matter what anyone thinks. Just keep on doing. Document in public. Don’t listen to the haters. Release more than you refactor. Just keep
Nick Dobos
@NickADobos
Jan 21, 2024
Just do stuff
1.1M
Vaibhav (VB) Srivastav
@reach_vb
Jan 21, 2025
Let’s fucking goo!! DeepSeek R1 1.5B running FULLY LOCALLY in your browser at 60 tok/ sec powered by WebGPU🔥 Intelligence truly is too cheap to meter! ⚡️
00:00
973K
Vaibhav (VB) Srivastav
@reach_vb
Jan 27, 2025
WAIT A SECOND, DeepSeek just dropped Janus 7B (MIT Licensed) - multimodal LLM (capable of generating images too) 🔥
556K
Vaibhav (VB) Srivastav
@reach_vb
Mar 13, 2025
HOLY SHITT, Sesame Labs just dropped CSM (Conversational Speech Model) - Apache 2.0 licensed! 💥 > Trained on 1 MILLION hours of data 🤯 > Contextually aware, emotionally intelligent speech > Voice cloning & watermarking > Ultra fast, real-time synthesis > Based on llama
00:00
690K
Vaibhav (VB) Srivastav
@reach_vb
Jan 28, 2025
NEW: DeepSeek Janus Pro 1B (Generate Images, Chat with PDF) running in your browser, 100% local, powered by WebGPU 🔥 Zero server costs, brought to you by transformers.js - try it out!
00:00
635K
Vaibhav (VB) Srivastav
@reach_vb
Jan 20, 2025
"DeepSeek-R1-Distill-Qwen-1.5B outperforms GPT-4o and Claude-3.5-Sonnet on math benchmarks with 28.9% on AIME and 83.9% on MATH." 1.5B did WHAT?
991K
Vaibhav (VB) Srivastav
@reach_vb
Aug 29, 2025
🚨 Apple just released FastVLM on Hugging Face - 0.5, 1.5 and 7B real-time VLMs with WebGPU support 🤯 > 85x faster and 3.4x smaller than comparable sized VLMs > 7.9x faster TTFT for larger models > designed to output fewer output tokens and reduce encoding time for high
00:00
582K
Vaibhav (VB) Srivastav
@reach_vb
Oct 27, 2024
Wow! Meta dropped an open NotebookLM recipe: NotebookLlama 🔥 It uses L3.2 1B/ 3B for pre-processing the PDF, L3.1 70B for Transcript creation, L3.1 8B for re-writes and Parler TTS for Text to Speech ⚡ Step 1: Pre-process PDF: Use Llama-3.2-1B-Instruct to pre-process the PDF
00:00
857K
Vaibhav (VB) Srivastav
@reach_vb
Feb 10, 2025
HOLY FUCK! @ZyphraAI just dropped Zonos - Apache 2.0 licensed, Multilingual, Text to Speech model with INSTANT voice cloning! 🔥 > Zero-shot TTS with Voice Cloning: Input text and a 10-30 second speaker sample to generate high-quality text-to-speech output > Audio Prefix
00:00
299K
Vaibhav (VB) Srivastav
@reach_vb
Feb 27, 2025
HOLY SHITT, Microsoft dropped an open-source Multimodal (supports Audio, Vision and Text) Phi 4 - MIT licensed! 🔥 > Beats Gemini 2.0 Flash, GPT4o, Whisper, SeamlessM4T v2 > Models on Hugging Face hub, integrated with/ Transformers! Phi-4-Multimodal: > Modalities: Integrates
210K
Vaibhav (VB) Srivastav
@reach_vb
Sep 26, 2024
Fuck yeah! Llama 3.2 3B running on your browser! 100% local, powered by WebGPU & MLC 🦙
00:00
282K
Vaibhav (VB) Srivastav
@reach_vb
Jan 20, 2025
holy fuck, these gigachads dropped 6 distilled models right from 1.5B to 70B 🔥
121K
Vaibhav (VB) Srivastav
@reach_vb
Jan 24, 2025
HOLY SHITT! Llasa TTS - Llama 3.2 fine-tune with ultra realistic audio 🔥 > supports voice cloning in English + Chinese > trained on 250K hours of audio > 1B, 3B model (8B soon) > emotional speech (happy, angry, sad, whisper) > open weights & works with transformers/ vllm
00:00
222K
Vaibhav (VB) Srivastav
@reach_vb
Dec 16, 2024
Microsoft open sourced MarkItDown - convert files to Markdown - perfect for using with LLMs! 🔥
197K