Sign in to view Lev’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Lev’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
San Francisco Bay Area
Sign in to view Lev’s full profile
Lev can introduce you to 10+ people at Galileo
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
8K followers
500+ connections
Sign in to view Lev’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Lev
Lev can introduce you to 10+ people at Galileo
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
View mutual connections with Lev
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Sign in to view Lev’s full profile
or
New to LinkedIn? Join now
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Articles by Lev
-
Making PHP / Wordpress send email using custom sendmail
Making PHP / Wordpress send email using custom sendmail
Recently I have had the pleasure of migrating a WordPress website which resulted in a peculiar problem - sending email…
17
-
My super secure password schemeApr 4, 2014
My super secure password scheme
For the longest time in my life I have only used a single password for all the online services until I realized how…
4
14 Comments
Activity
8K followers
-
Lev N. shared thisLFG!Lev N. shared this🚀 Big News: Galileo is joining forces with Cisco! 🚀 We are thrilled to announce a massive milestone: Cisco has announced its intent to acquire Galileo! Five years ago, we started Galileo with a simple but bold mission: to solve the “trust problem” for software built with language models (aka NLP). We saw early on that these software workloads were fundamentally different—non-deterministic, unpredictable, and requiring a completely new approach to observability. Today, language model powered AI software is increasingly ubiquitous, the "trust gap" is the biggest bottleneck to unleash AI at scale and Galileo’s platform has been rapidly adopted by some of the world’s largest enterprises to ship trustworthy AI products. Splunk and Cisco more broadly have been pioneers in the observability and security space for decades. In becoming part of Cisco, we are excited and prepared to redefine how the world builds, deploys, and trusts AI at scale. The opportunity ahead of us is massive, and we are only getting started. What does this mean for our customers? The most important thing to know is that our commitment to you remains unchanged. You will still be working with the same reliable Galileo team you know and trust. However, we are now turbocharged with the "superpowers" of Cisco and Splunk! ⚡️ We are incredibly grateful to our team, our partners, and—most importantly—our users. We are always here for you, and we couldn’t be more excited about this next chapter. Onward! 🚀✨ Vikram Chatterji, Atindriyo Sanyal, and Yash Sheth Learn more here: https://lnkd.in/gU_uADMa
-
Lev N. shared thisI built OpenClaw plugin for Agent Control and I am excited to demo it live! Tune in, don't drop out! Meanwhile, you can play around with the plugin yourself - https://lnkd.in/g3E2ZDHhLev N. shared this🦞 OpenClaw is one of the most capable agent frameworks available. It's also one of the easiest to lose control of. We're running a hands-on workshop to close that gap because prompt-based safety can't survive at scale. Join us for Taming The Claw, a hands-on workshop where our engineer, Lev N., shows you how to layer Agent Control on top of OpenClaw to close the governance gaps that prompt-based safety can't cover. You'll learn: → How to install the Agent Control OpenClaw plugin → How to set up centralized governance for tool calling → Policy patterns for common failure modes: unconstrained tool access, permission escalation, uncontrolled sub-agents, and memory leakage You'll leave with: → A working Agent Control + OpenClaw integration you can adapt for your stack → A centralized control plane your entire team can update in minutes This is for engineers building with or evaluating OpenClaw who want production-grade governance. 🎟️ Register here: https://lnkd.in/gRvErppy
-
Lev N. shared thisThis is the second time I've built a caching library from scratch. 🔁 The first time was at DoorDash, in Kotlin. It ended up being critical to scaling DashPass and was adopted by many other teams. When I joined Galileo and hit the same performance bottlenecks, I built it again in Python. The core problems are always the same: ❌ No visibility into whether caching is actually helping ❌ Invalidation scattered across the codebase ❌ No way to turn it off without a full redeploy ❌ Shipping cache at 100% on day one with no way to roll back So I built GCache, an open-source Python library that wraps Redis and cachetools with guardrails: opt-in caching, gradual rollout with a runtime kill switch, entity-based invalidation, and built-in Prometheus metrics you can pipe straight into Grafana Labs. 🛡️ At Galileo, GCache cut p50 latency by 50% and CPU usage by 40% on our highest-traffic endpoint. We've since rolled it out across many others. Full writeup on the Galileo engineering blog, along with the GitHub repo. Link in the comments. 👇 #python #opensource #softwareengineering #redis #performance
-
Lev N. shared thisCheck out what I have been working on for the past few months! It's been fun. Working on Agent Control has stirred similar feelings of wonder and potential that I had when I was just starting out. Incredible things are ahead! I have started using it to protect my personal OpenClaw gateway and will share more details soon. Meanwhile checkout https://lnkd.in/grPXDEBp If you find this interesting, I would love your feedback! #agents #security #control #openclawLev N. shared thisEvery agent your team ships has its own hardcoded guardrails, its own bespoke logic, its own failure modes. That's not governance. These brittle controls soon become a liability. Galileo is proud to announce the open-source launch of Agent Control 🚀 Agent Control is the open-source control plane that solves for the open, centralized governance needs for all your AI Agents. 💬 "We've had a front-row seat to agent development at Fortune 500 and digital-native companies. They have been struggling to hard-code safety rules and controls into each agent which makes them brittle. With Agent Control, developers can now create policies in one place and then use those to enforce guardrails everywhere." — Yash Sheth, Co-founder & CTO, Galileo Agent Control integrates seamlessly with all your agents using the @control hook or just by leveraging our native integrations with some of the leading agent frameworks. No redeployment. No code changes. No vendor lock-in. 💬 “Centralized management of policies can help organizations to manage AI agent behaviors. A unified control plane and centralized governance of agents can help organizations efficiently deploy AI agents at scale. Organizations that embrace eval engineering as a core competency will shorten the time to value for their AI investments. By taking a lifecycle approach, organizations can achieve a continuous improvement loop for AI systems.” – Tim Law, IDC Research Director, AI and Automation Agent Control is already backed by partners including Amazon Web Services (AWS), Cisco AI Defense, CrewAI, Glean, ServiceNow, and Rubrik, and it works with the guardrail providers you already use, from our Luna models to NVIDIA NeMo or AWS Bedrock. The repo is live, built in the open with contributions from some of the largest AI infrastructure companies in the world. Try it out today: https://agentcontrol.dev/ Watch Yash walk through how it works in the video below, and check the comments for links to our launch webinar, announcement blog, and full press release. 👇
-
Lev N. reposted thisGreat ✍ feature by Geekwire.Lev N. reposted thisIn today's newsletter: — TheFounderVC, a new Seattle- and San Francisco-based firm, unveiled a $5 million early stage fund focused on “vertical AI” startups. — As tech leaders threaten to leave over taxes, a new ranking named Seattle as the best U.S. city to live in. — MicroVision layoffs: The Redmond, Wash.-based radar technology company is cutting 49 employees, including many senior-level engineering positions.Seattle's newest early stage fund bets on vertical AI startupsSeattle's newest early stage fund bets on vertical AI startupsGeekWire
-
Lev N. posted thisConnecting Codex to Notion and asking to spec, plan and break down tasks is like having your own engineering team! You can even ask it to add logs to each tickets as it does the work. I feel like this simple integration is probably killing a number of startups.
-
Lev N. shared thisFeel lucky to be part of an incredible team solving agentic control problems. Satyam Dhar has good insight into whats going to really matter when taking agents from prototype to production. 2026 is going to be wild, stay tuned!Lev N. shared thisAmazon’s “Buy for Me” feature is getting attention for the right reason. But the core issue isn’t about Amazon or small businesses. It's about agent control. It’s also about what happens when agentic systems cross ownership and consent boundaries without clear guardrails. From a systems perspective, this is a purchasing agent operating across third-party surfaces and making assumptions about permission, pricing, inventory, and fulfillment that normally require explicit agreement. Opt-out mechanics don’t solve that. Defaults do. Small brands aren’t passive endpoints. They manage brand reputation, inventory risk, pricing strategy, and legal obligations that don’t translate cleanly into an automated intermediary acting on behalf of Amazon, thereby strategically inserting Amazon in the path of their brand discovery itself. This is the kind of problem that shows up when autonomy moves from “assistive” to “transactional.” The technical challenge isn’t whether the agent can complete a purchase. It’s figuring out how to enforce manual opt-in and keeping automatic opt-in under tighter checks. How authorization is defined, constrained, audited, and enforced across organizational boundaries. We’re going to see more of this as platforms experiment with agents that act, not just recommend. The teams that get this right will be the ones that treat consent, control, and observability as first-class system requirements, not policy change afterthoughts. https://lnkd.in/gY9ME_ad
-
Lev N. posted thisPeak vibe coding: Having to switch to codex inside my Warp because I ran out of Warp credits.
-
Lev N. shared thisAt Galileo we were hitting the classic GenAI inference problem: poor GPU utilization and high tail latencies. I designed a load-aware client-side router backed by Redis + Lua to solve this. It tracks real-time GPU load in a fleet of GPUs and routes requests to the least busy GPU. The results: - 🚀 Cluster-wide GPU utilization up by ~40% - 📉 lowered tail latencies by as much as 70% Full deep dive on the architecture here: https://lnkd.in/gCVaPZ6H #GPU #GenAI #MLOps #SystemDesign #Redis #LUA
-
Lev N. liked thisLev N. liked thisFinance people are notoriously skeptical of big ad campaigns because you can’t track ROI, but there’s something thrilling about seeing your bus in the wild. Or using ChatGPT to put it on the moon. 🌕 🔥
-
Lev N. liked thisLev N. liked thisEpisode 2 of Successful Prompt Podcast is live! 🎉 Today we're touching on an unpopular topic — failures. I know this feeling personally. When my startup closed, the hardest part wasn't the business dying — it was separating myself from it. Was I the failure, or was the company? Xenia Masl helped me find the answer. She graduated from Minerva University — one of the most selective universities on the planet — built a startup to bring transparency to charity, and when it shut down, she didn't just move on. She studied the failure like a scientist. 🔬 We talked about: 💡 The 3 types of failure — and which one actually moves you forward 🧠 The science of failing well 🏗️ How to close a startup without losing yourself 🤖 Whether AI is helping us fail smarter — or just faster Success is so unique, but failures have so much in common. So let's master them together. 📖 Inspired by "Right Kind of Wrong" by Amy Edmondson — if you are not failing, you are not getting to new territory. ▶️ YouTube: https://lnkd.in/drg9YMhi 🎙️ Spotify: https://lnkd.in/dRcVBBzM Your support means the world to me! 🙏
-
Lev N. liked thisLev N. liked thisI have a new company to share, built on ten years of production lessons from shipping voice AI in industry. In 2012, after leaving NASA/JPL, I moved into pure machine learning, back when deep learning was still fairly obscure. At Gridspace, we built early speech AI when scaling any machine learning application was novel. Over the years we shipped Memo (AI notetaking and video conferencing before Zoom existed), Sift (real-time analysis for financial services, scaled to billions of minutes of audio), and Grace (the first neural conversational voice agent). But every pilot and production rollout ran into the same wall: reliability and robustness problems that had little to do with the AI itself and everything to do with integration hell. This month, we're launching a new company called Guava to build on the production lessons from the last decade of operating voice AI at banks and hospitals. Guava is a pure developer SDK, utterly maxed out on naturalness, latency, and robustness. I'm excited to share more in the coming months. In the meantime, if you're curious, take a look at goguava.ai
-
Lev N. liked thisLev N. liked thisGalileo teams met last week. We talked craft. Whiteboarded, bug-bashed, and shipped. Already, half the ideas from that room are turning into features our customers will feel. This team is deeply tuned into customer needs and moves fast. We're just getting started 💪 What’s exciting: As Cisco announced it's intent to acquire Galileo, observability is about to get a lot smarter with visibility into agentic workflows, helping teams move from an issue to action faster than ever. Excited to be part of the team and looking forward to scale our reach!! cc: Vikram Chatterji, Atindriyo Sanyal, Yash Sheth
-
Lev N. liked thisLev N. liked thisI nearly blamed LangGraph for a cost it wasn't actually causing. At AGI House's Agent Harness Build Day, I benchmarked four agent harnesses on the same 10 tasks with the same model (Claude Sonnet 4.6): a thin SDK loop, Claude Agent SDK, a prebuilt LangGraph ReAct agent, and a custom LangGraph graph with a planner/router/executor/reflector topology. The headline number: my custom thick graph cost 3x the thin loop for the same output quality. First read: "thick frameworks are wasteful." Wrong. Adding the prebuilt LangGraph create_react_agent as a fourth harness reframed everything. Its overhead was only 1.4x thin. My custom planner/router/executor/reflector stacked another 2.1x on top of that. The framework wasn't the expensive part. The topology I layered on top of it was. 𝗠𝗲𝗮𝘀𝘂𝗿𝗲 𝗳𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 𝘁𝗮𝘅 𝗮𝗻𝗱 𝘁𝗼𝗽𝗼𝗹𝗼𝗴𝘆 𝘁𝗮𝘅 𝘀𝗲𝗽𝗮𝗿𝗮𝘁𝗲𝗹𝘆, 𝗼𝗿 𝘆𝗼𝘂'𝗹𝗹 𝗯𝗹𝗮𝗺𝗲 𝘁𝗵𝗲 𝘄𝗿𝗼𝗻𝗴 𝘁𝗵𝗶𝗻𝗴. Along the way, LangSmith made debugging cheap. Two moments that saved me time: • One harness's trace page sat completely empty while the other three streamed spans. A 15-second glance told me the Claude Agent SDK transport was invisible to the generic tracer, something that would have taken much longer to isolate with raw logs. • My model-call turn count disagreed with LangSmith's UI. Reading their integration source revealed why: they dedupe on message_id because the SDK can split one logical turn across multiple events. Once I understood that, I applied the same deduplication in my own counting. The free Developer plan ships 5,000 traces/month with no credit card required, enough to run a benchmark like this without touching your observability budget. Full writeup: https://lnkd.in/dtp5JHab GitHub: https://lnkd.in/dd9KfK9h #AgenticAI #AIEngineering #LangSmith #LangChain
-
Lev N. liked thisLev N. liked thisclaude.ai/design is so expensive to use it's hilarious. A couple of minutes and it's all done. I feel we keep talking about all these AI design tools and how they took a gun to Figma, and queue the apocalpyse, this is the future!! But the actual cost of running these tools will make me broke. I can't see myself using them, no matter how powerful they are.
-
Lev N. liked thisExcited to be a part of this .. looking forward to some great talks and conversations!Lev N. liked this🔜 Coming up next Monday, we’re hosting a meetup for the NYC AI engineer community at Google NYC. You’ll get the chance to network with other AI engineers and hear from industry leaders, including: 🗣️ Lukas Geiger - Cloud Customer Engineer @ Google 🗣️ Christopher Page - Applied AI @ Google 🗣️ Jason Mancuso - FDE Modal 🗣️ Sudhir Tonse - Head of Engineering @ Galileo Join the waitlist for the event here: https://lnkd.in/eGVPD54S Special thanks to Kashaf Mazhar for helping us set up this event 🤝
-
Lev N. reacted on thisLev N. reacted on thisAI-native analytics is evolving fast—and Ridge #AI is a great example of where things are heading 🚀 Seattle-based Ridge AI has raised $2.6M in pre-seed funding led by Madrona, with backing from TheFounderVC and angels from companies like Tableau, Trifacta, and Streamlit. Founded by CEO Ellie Fields and Chief Scientist Jeffrey Heer, Ridge AI is helping B2B software companies ship interactive dashboards and AI data agents in hours—not weeks. What makes it stand out? Instead of relying on heavy cloud infrastructure, Ridge processes data directly in the browser, delivering sub-second performance even on datasets with millions of rows—while reducing ongoing costs for SaaS companies. On top of that, users can interact with dashboards using natural language, asking follow-up questions and getting instant answers powered by AI data agents. This combination of embedded analytics + AI is a big step toward making data more accessible and actionable for end users inside SaaS products. Ridge AI is currently opening limited beta access, onboarding select teams each week. A glimpse into the future of analytics—faster, smarter, and truly user-friendly. #AI #DataAnalytics #SaaS #Startups #ArtificialIntelligence #B2B #TechInnovation #EmbeddedAnalytics #DataScience #FutureOfWork #AITechSupports #Fundraising #AITech #AITechnology #CEO #CTO
Experience & Education
-
Galileo 🔭
******** ********
-
**********
*******
-
*********
*******
-
**** **********
******** ** ******* ******** ******* undefined
-
**** **********
****** ** ******* **** ******** *******
View Lev’s full experience
See their title, tenure and more.
Welcome back
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
New to LinkedIn? Join now
or
By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.
Publications
-
WordSeeker: concurrent bioinformatics software for discovering genome-wide patterns and word-based genomic signatures
BMC Bioinformatics
I implemented Suffix Tree which was used to build Markov models on larger data sets than we were possible with existing Radix Tree, with other trade offs as well. I have also refactored some of the existing code to be more efficient, eventually achieving 4x speed up over older versions, regardless of the data structure used.
Other authorsSee publication -
An efficient Associative Processor solution to an Air Traffic Control problem
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
This paper deals with research into SIMD implementations of air traffic control algorithms. I have implemented an equivalent algorithm in MIMD (regular CPU with multiple cores) using C++ and multi threading. This MIMD implementation was used to compare against SIMD one.
Other authorsSee publication -
SecondWATCH: A workspace awareness tool based on a 3-D virtual world
Software Engineering - Companion Volume, 2009. ICSE-Companion 2009. 31st International Conference on
I took over Hiep Dinh as chief programmer and did extensive refactoring, and extensive feature additions to the project. Here is a youtube video visually explaining our work: http://www.youtube.com/watch?v=o7OLPkHnlw8
Other authorsSee publication
Courses
-
Advanced AI
-
-
Advanced Algortihm Design
-
-
Advanced Networking
-
-
Machine Learning
-
-
Natural Language Processing
-
Projects
-
Master Thesis - Solving Stochastic Differential Equations Using General Purpose Graphics Processing Unit
* Developed a C++ library using CUDA/Boost for solving differential equations with random variables via iterative integration methods (Euler, Runge-Kutta).
* Pipeline-like design for computing properties of ensemble realizations (mean, PSD) with most components running on GPU and some aggregation done on CPU.
* Utilizes all available GPUs on a host.
* Achieved ~600x boost over CPU only implementation.
* Used C++ features to achieve polymorphism on older NVIDIA GPUs without…* Developed a C++ library using CUDA/Boost for solving differential equations with random variables via iterative integration methods (Euler, Runge-Kutta).
* Pipeline-like design for computing properties of ensemble realizations (mean, PSD) with most components running on GPU and some aggregation done on CPU.
* Utilizes all available GPUs on a host.
* Achieved ~600x boost over CPU only implementation.
* Used C++ features to achieve polymorphism on older NVIDIA GPUs without performance loss.
- Ability to plugin new equations and random number sources by using templatesOther creatorsSee project -
SECONDWatch
-
The Second Life Workspace Awareness Toolkit and Collaboration Hub, or SecondWATCH for short, is a 3-dimensional workspace awareness tool based on Second Life, a 3-D online virtual world. SecondWATCH informs developers of real-time, history artifact and co-worker awareness information by monitoring team members' activities on their local workspaces, version control repository, and bug tracking system. It then extracts, analyzes, and visualizes the corresponding information in Second Life as a…
The Second Life Workspace Awareness Toolkit and Collaboration Hub, or SecondWATCH for short, is a 3-dimensional workspace awareness tool based on Second Life, a 3-D online virtual world. SecondWATCH informs developers of real-time, history artifact and co-worker awareness information by monitoring team members' activities on their local workspaces, version control repository, and bug tracking system. It then extracts, analyzes, and visualizes the corresponding information in Second Life as a common view shared by the whole team using a 3-D city metaphor.
Other creatorsSee project
Languages
-
Russian
-
-
English
-
Organizations
-
ACM
Member
Recommendations received
11 people have recommended Lev
Join now to viewView Lev’s full profile
-
See who you know in common
-
Get introduced
-
Contact Lev directly
Other similar profiles
Explore more posts
-
Siddharth Ramakrishnan
Scale Venture Partners • 2K followers
Cursor's new Composer model is (not so quietly) Chinese under the hood. Cursor shipped the new version of Composer as its "first in-house frontier coding model," optimized through RL for real coding tasks. Within days, users noticed Composer's hidden thinking was full of Chinese characters. Industry press connected the dots: both Cursor's Composer and Windsurf's SWE-1.5 appear based on Chinese large models (likely Qwen bases), fine-tuned and wrapped in US products. Community reaction is split. Some call it a major downgrade: slower and struggling with complexity. Others report speed gains and slightly better performance. So why is the reception so mixed? Gavin Leech's AI Tigers post (link in comments) shows that on fresh benchmarks, Chinese models lose around 21% performance compared to roughly 10% for Western models. They look closer on leaderboards than they perform on novel problems. The cost story is similar: 3-6x cheaper per token, but often requiring 2-4x more tokens, which shrinks the cost advantage fast. So why would Cursor make this trade? Because they're not trying to replace GPT or Claude. They're owning a narrow, high-volume behavior and squeezing it with fine-tuning. Cursor has massive amounts of product-specific data (traces, diffs, accept/reject signals) and can invest in the RL loop to extract value from a cheaper base. This is where Chinese and open-source models actually fit in the stack. They're a strong fit for narrow problems with abundant training signal: tab complete, inline edits, code transforms. They're a weaker fit as primary reasoning for high-stakes, long-horizon workflows. Frontier US models still dominate the second category. Chinese open-source is taking over the first. Composer's rollout is a case study in how powerful Chinese models have become, and a live demo of where their limits still show.
39
12 Comments -
Jack Vanlightly
Confluent • 3K followers
I’ve just published the next installment in my "Theory of Durable Execution" series. Across frameworks we hear terms like workflows, activities, virtual objects, handlers, and functions. Underneath the terminology, they’re all describing a small number of execution patterns. In this new post, I break durable functions down into three forms: 🔹 Stateless functions for one-shot deterministic logic 🔹 Sessions for long-running, interactive orchestration 🔹 Actors for persistent, message-driven state To get there, I walk through: 🔹 The behavior–state continuum 🔹 How identity, lifetime, communication, and concurrency shape execution semantics 🔹 How frameworks like Temporal, Restate, DBOS, and Resonate map onto these forms This builds on the previous two posts in the series: 🔹 Demystifying Determinism in Durable Execution 🔹 The Durable Function Tree (part 1 and 2) This was the last post in this series, distilling the various frameworks down to a simpler logical model. Full post: The Three Durable Function Forms https://lnkd.in/eJHpY9qC
77
7 Comments -
Gunnar Morling
Confluent • 9K followers
🤓 Woot, woot, very cool to see: Folks giving #Hardwood a try for parsing their Parquet files. Performance hasn't been a big focus yet, but it seems good enough when processing entire data sets (thus not requiring predicate push down). 👉 https://lnkd.in/eaty-Q6g
23
-
Lee Stott
Microsoft • 10K followers
Interested in learning more about Foundry Local https://foundrylocal.ai and how to build RAG-style systems that run locally? I’ve been exploring patterns for local-first retrieval, generation, and agent workflows, focusing on privacy, performance, and developer control. If you’re experimenting with RAG beyond cloud-only setups, these samples might be useful: • Local RAG – a simple, practical starting point for retrieval‑augmented generation running locally https://lnkd.in/e8DaV8ae • Local CAG – moving beyond classic RAG into contextual and agent‑driven generation patterns https://lnkd.in/eb5u_QXc • Local Hybrid Retrieval (ONNX) – combining lexical + vector search with optimized local inference https://lnkd.in/eh4mMwSi These projects are all about understanding the building blocks: local models, embeddings, hybrid retrieval, and how they fit into modern agent architectures with Foundry Local. If you’re curious about edge AI, local-first agents, or privacy-aware RAG, feel free to explore, fork, or share feedback. #AI #RAG #FoundryLocal #AgenticAI #LocalAI #EdgeAI #Developers
18
-
Gunnar Morling
Confluent • 9K followers
📝 Wrote a thing: "'Streaming vs. Batch' Is a Wrong Dichotomy, and I Think It's Confusing" Oftentimes, Stream vs. Batch is discussed as if it’s one or the other, but to me this does not make that much sense really. They actually go together, and the more important distinction to make is Push vs. Pull. 👉 Read the full post here: https://lnkd.in/eNsq5DYq
136
10 Comments -
Bryan Fordham
Self Financial, Inc. • 504 followers
Kagi has a translator that will turn English into McKinsey-speak. Also Gen Z. The avian asset’s presence facilitated a positive shift in emotional sentiment, driven by its high-gravity aesthetic and disciplined posture. I initiated a stakeholder inquiry: "Despite your suboptimal grooming and lack of plumage, you demonstrate significant risk appetite. As a legacy entity from the nocturnal ecosystem, what is your core brand identity on the Plutonian shore?" The Raven responded with a definitive, non-negotiable strategic pivot: "Nevermore." https://lnkd.in/ezKWuRxQ
1
-
Deepthi Talasila
Microsoft • 1K followers
Anthropic’s new Claude Sonnet 4.6 promises Opus-level coding at Sonnet pricing Anthropic on Tuesday launched Claude Sonnet 4.6, the latest version of its mainstream model. This new version promises to almost match the company’s flagship Opus 4.6 model, which launched barely two weeks ago, in most tasks, but at the significantly lower price of $3/$15 per million input/output tokens (compared to $5/$25 for the Opus model). Just like the previous version and Opus 4.6, Sonnet 4.6 offers a 1-million-token context window in beta. Stay connected for industry’s latest content – Follow Deepthi Talasila #DevSecOps #ApplicationSecurity #AgenticAI #CloudSecurity #CyberSecurity #AIinSecurity #SecureDevOps #AppSec #AIandSecurity #CloudComputing #SecurityEngineering #ZeroTrust #MLSecurity #AICompliance #SecurityAutomation #SecureCoding #linkedin #InfoSec #SecurityByDesign #AIThreatDetection #CloudNativeSecurity #ShiftLeftSecurity #SecureAI #AIinDevSecOps #SecurityOps #CyberResilience #DataSecurity #SecurityInnovation #SecurityArchitecture #TrustworthyAI #AIinCloudSecurity #NextGenSecurity https://lnkd.in/gJ3USuAK
-
Jonathan Desrosiers
Bluehost • 1K followers
In case you missed it when the episode dropped, I was recently on the Crossword podcast. I finally had a chance to write a bit about our conversations on my site. We covered a lot of ground, including the concept of active versus passive contribution, how to find the appropriate balance between those two groups, the importance of being prepared when new contributors show up, and the nuance between a do-ocracy and a meritocracy. I'd love to hear your thoughts after you listen! https://lnkd.in/evmRqabE
-
Yaron Pdut
Varonis • 2K followers
Boris Cherny’s View on Vibe Coding Limitations (from the Podcast) • Best for throwaway/prototype code: “I do this all the time [vibe coding], but it’s definitely not the thing you want to do all the time.” It’s ideal for non-critical path code, quick experiments, or prototypes that might get discarded. • Lacks maintainability and thoughtfulness: “You want maintainable code sometimes. You want to be very thoughtful about every line sometimes.” AI can produce working code fast, but it often results in messy, hard-to-maintain output (e.g., inconsistent style, over-engineering, or subtle issues). • Requires human oversight for quality: On the Claude Code team, they hold AI-generated code to the exact same bar as human-written code. If it “sucks,” they don’t merge it—they ask the model to improve it. • Preferred approach: Pair programming with the model: For important code, align on a plan first (e.g., using Claude Code’s plan mode), iterate incrementally, review/clean up, or even hand-write critical parts. Boris still manually codes core sections where he has strong opinions (e.g., parameter names). • Models aren’t perfect yet: “The models are still overall not great at coding… this is the worst it’s ever going to be.” Massive improvements are coming, but current limitations mean vibe coding alone isn’t reliable for production. Broader Limitations of Vibe Coding (from Industry Discussion & Karpathy’s Own Reflections) Karpathy popularized the term but has since highlighted drawbacks, even abandoning pure vibe coding for complex projects due to persistent bugs and “slop”: • Produces “slop” or low-quality code — Hallucinations, duplicated logic, bloated/over-engineered solutions, inconsistent architecture → Leads to technical debt and hard-to-debug systems. • Security vulnerabilities — AI can introduce exploits (e.g., insecure dependencies, leaked keys) that humans might miss if not reviewing deeply. • Hard to debug and maintain long-term — Lack of deep understanding means subtle/non-obvious problems pile up; fixing requires trial-and-error. • Not suitable for production/critical software — Fine for weekend projects or demos, but risky for anything with real stakes (e.g., scalability, reliability). • Paradox: Best for experienced coders — Experts can spot/fix issues; novices risk building fragile apps without realizing. In summary, vibe coding is a powerful tool for speed and creativity (especially prototypes), but its limitations make it unsuitable as a full replacement for thoughtful, reviewed coding—especially in professional settings. As Boris puts it, treat AI like a pairing partner: collaborate actively, enforce high standards, and intervene where needed. Models will get better rapidly, narrowing these gaps over the next months/years. https://lnkd.in/e446EFR3
4
-
Guy Elsmore-Paddock
Modulate • 672 followers
This is absolutely true. I would also add that just because AI decreases cycle time does not mean you should operate without a plan. To use an analogy, pretend you are going on a trip from New York to Los Angeles and fuel/electricity are free (or only cost pennies). Now imagine that you don't have a map but are just driving based on instinct. Because you are moving and the cost is low or zero, you feel you are getting somewhere even when you are not. You need a map (app or paper), or at least need to stop for directions occasionally. Movement, no matter the speed, is not progress unless it's toward where you want to go.
3
2 Comments -
Mary Poplin
Savannah College of Art and… • 4K followers
OK. Before I get into this, a quick disclaimer: This is my personal opinion as an individual. It does not reflect the views or positions of Adobe in any way. Nothing here constitutes financial or legal advice. These are simply my own thoughts. There is a lot of speculation right now about Adobe and generative AI. The market narrative says AI will disrupt creative software and undermine the subscription model companies like Adobe built over the last decade. Yet Adobe continues to report strong growth. The stock still dropped because of broader fears about AI competition and leadership changes. I think the market has this one wrong. Many analysts still wrongly frame Adobe as a consumer creative software company. In reality, Adobe runs enterprise-scale ecosystems across Creative Cloud, Document Cloud, and Experience Cloud that power creative production, marketing operations, analytics, and document workflows for major organizations worldwide. Enterprise adoption tells a very different story from the market narrative. Nearly every Fortune 100 & 500 company uses AI features in Adobe apps, and most large enterprise customers have already adopted AI-driven tools like Firefly services, GenStudio, and AI assistants. This is not novelty AI. Consumer AI tools often work like slot machines. You type a prompt and hope something good enough appears. Enterprise creative production requires governance, repeatability, brand control, and legal safety. That is where Adobe operates. Generative AI can produce generic content quickly. It does not eliminate the need for creatives. If anything, it increases the demand for people who understand storytelling, design systems, and brand vision. Creative work has never been about pressing a button. It has always been about vision. I believe Adobe’s strategy reflects that. AI is being embedded directly into creative workflows so professionals can move faster while keeping control of the output. From where I sit, this is a case of analysts underestimating the value of creative professionals and the systems they rely on, and ignoring business offerings. Folks who are not creative can't make calls about how creatives work and what their needs are, and I don't think many startups understand enterprise needs at scale or what Adobe's ecosystem brings. I work with these folks, I have a different perspective. I believe the market is undervaluing the platform. I am still investing. Maybe I am wrong, maybe I am right. But if you want to make an omelette, you have to crack a few eggs. But there's one thing I do believe, in a world where AI agents are deleting years' worth of databases, self-driving cars can be captured in what amounts to painted salt circles, computer vision is targeting pictures of planes on the ground, and companies having major security issues in the software AI writes... I don't trust a computer with the most important parts of my business, only the busywork. And busywork has never been the vision.
35
6 Comments -
Rafael Jesus Hernández Vasquez
Boom Entertainment • 2K followers
This week, Anthropic's Claude 4 seems to be again on a roller coaster: after pushing Amazon's new Kiro IDE (which was briefly available for download before an avalanche of demand slammed it shut), his coding performance plummeted noticeably. Tool usage has dropped. The reason? Degraded. And as usual at Anthropic, no one knows why. We just get silence and a fresh dose of usage limits. Meanwhile, Microsoft's Copilot is getting a makeover literally. The new "appearance" feature could soon allow it to simulate traits like age or personality. Whether it's charming or dystopian depends on your tolerance for digital companions with synthetic charm. But one thing is clear: AI is no longer just smart, it's present. Speaking of IAG, Meta appointed Shengjia Zhao to lead its new Superintelligence Lab. If that name doesn't ring a bell, it soon will: Meta is delving into fundamental models with AI ambitions, indicating that the "big three" (Meta, OpenAI, Google) are gearing up for the final showdown. But it's not all polished demos and ambitious projects. A troubling moment at Sketch.dev exposed the dark side of AI programming assistants: a subtle AI-written database optimization caused a crash in production under load. Reminder: just because your AI can write code doesn't mean it should. Test, test, and test again. Mistral's Codestral also launched, quietly stealing the show. With 22 billion parameters and support for over 80 languages, it's open-source, meaning it has no barriers. Elsewhere, Google's NotebookLM received a major update powered by Gemini. It now summarizes YouTube videos and drafts of content like your personal caffeine research companion. Think of it as a fusion of CliffsNotes and ChatGPT: optimized, hyper-efficient, and incredibly good at synthesizing ideas. In the worst-case scenario for cybersecurity, researchers have revealed AI-powered malware that thinks for itself. It interprets its environment, adapts, and executes evasive strategies with chilling precision. It's not HAL yet, but it's time we reconsider the true meaning of "autonomous threat." Finally, some news that caught my attention: A new concept unlike the transformers and tokens we know today, the Hierarchical Reasoning Model (HRM) concept promises to make the large models we know more efficient. In its first milestone, a small model of just 27M was able to beat o3-mini on pre-trained tasks. for more information follow me here: https://lnkd.in/gNVYeZST
4
-
Fatema Saifee
N26 • 4K followers
Advantages of Working Collaboratively: In our increasingly complex world, the secret power often lies not in singular genius—but in collective intelligence. As Colin M. Fisher writes in The Collective Edge: “The secret to getting the most from your groups is knowing how to work with the invisible forces of group dynamics instead of being mindlessly pushed around by them.” Here are a few key advantages of working collaboratively, inspired by Fisher’s insights: 1. Synergy & Creativity When structured well, groups can be more than the sum of their parts. Diverse perspectives spark innovation, as Fisher illustrates with design firms and high-performing teams. 2. Healthy Competition Competition within a collaborative group doesn’t have to be destructive. It can motivate without leading to conflict, if the group norms support it. 3. Constructive Conformity Conformity isn’t always a bad thing — under the right conditions, it can promote alignment, shared goals, and group cohesion. 4. Effective Leadership through Structure Great leadership doesn’t always mean charismatic speeches; sometimes it's about setting the right structure, coaching, and creating psychological safety. 5. Better Decision Making By paying attention to group-level dynamics, teams can make more robust decisions — avoiding the pitfalls of blind conformity or domination by a single voice. — When we lean into the hidden dynamics of groups, we unlock potential that individual effort alone can't reach. Let's celebrate and build collective edge in our teams. #Collaboration #Teamwork #GroupDynamics #Leadership #Innovation #CollectiveIntelligence #WorkTogether #CollectiveEdge
2
-
Stephen White
Google • 352 followers
One of my favourite WebGPU demos is the "deferred rendering" demo (or as I call it, "disco dragon"). It's pretty, and it demonstrates a cool rendering technique for highly complex scenes with lots of dynamic lights: https://lnkd.in/eVE226HN So I rewrote it in Toucan, my graphics-oriented programming language: https://lnkd.in/e2Kwe334 (runs in Chrome and Firefox Nightly). It's 424 lines of Toucan, versus 801 total lines of WGSL and TypeScript. Most of the savings come on the host side, where, having defined our vertex buffers, uniform buffers, bind groups and pipeline variables once for the device side, we don't need to re-declare them for the host side in a new syntax. Plus, a common type system means there's no need to compute byte offsets or vertex strides, match up group numbers or bindings numbers, re-declare attachments, or accommodate std140 or std430 memory layouts. All of that is done automatically by the compiler. Type safety means not only being able to assign to members of a vertex or uniform buffer on the host using the same syntax as the device, but also applies to the usage itself: since the "vertex" and "uniform" qualifiers are understood by the host side, they become part of the type. If a required qualifier is omitted on allocation, attempting to bind the buffer as such will result in a compile-time error, not a runtime one. Here's a diff of the two versions (with comments and whitespace added to help the diffs to match): https://lnkd.in/ew64MGGS
8
7 Comments -
Pamela Fox
Microsoft • 14K followers
Open Responses is an open-source spec based off OpenAI's Responses API, with backing so far from OpenRouter, Vercel, HuggingFace, LM Studio, Ollama, and vLLM. https://openresponses.org Will more providers get on board? It'd be so nice to have a true standard for LLM APIs.
69
5 Comments -
Neha Naidu
IntelliBooks • 7K followers
As COO at www.jaiinfoway.com this is an important signal for where agentic AI is heading. Non technical implementations are catching up fast but the real story is iteration speed and learning loops. Building Cowork entirely with Claude Code shows how agents are becoming first class workers not just assistants. At www.jaiinfoway.com we see this shift clearly as enterprises move from copilots to autonomous task execution with guardrails. UX will evolve tooling will mature but the direction is set. Curious how others here see non technical users adopting agent driven workflows this year. Are we ready for agents as coworkers. #Jaiinfoway #AgenticAI #Claude #AIAgents #EnterpriseAI #FutureOfWork #AIProductivity #AITransformation #AILeadership
2
-
Matt Kauffman
454 followers
In this blog Mohammed Rafiq and I dive into the technical details of how Ragie's audio video pipeline and retrievals work. We share some of the approaches we considered and the reasoning behind the design we landed on. If you get a minute, give it a read and let us know what you think.
6
-
⚙️ Andrew Rosca
AlphaSense • 1K followers
A federal judge has ruled that Anthropic's use of published books to train its AI models falls under fair use, marking a significant legal precedent in AI copyright disputes. This decision emphasizes that AI companies can argue fair use to justify transforming copyrighted materials for training, despite concerns over unauthorized copying and piracy. However, the ruling highlights ongoing legal uncertainties, especially concerning the methods of data acquisition and the extent of damages related to piracy. https://lnkd.in/guwV_tYC
2
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content