
Sudeep Pillai
@spillai • 2,780 subscribers
Founder / CEO @vlmrun | FF23 @southpkcommons | ML Leader @ToyotaResearch | CS PhD @MIT CSAIL #girldad
Videos

🤯 Building computer-vision applications is never going to be the same again. As Andrej Karpathy says, "English is going to be the new programming-language", and we just made that happen for programming visual AI. 👨💻 This past month, the VLM Run team cooked up something magical that I'm especially proud of - a natural-language interface to build and run computer vision workflows (object-detection, OCR, parsing, and so much more) – all via our new VLM Run MCP server. 🔌 It works with any MCP-compatible AI agent (Claude, OpenAI, Cursor, etc.) and abstracts complex visual workflows into simple English commands. No need to write Python or fiddle with bespoke computer-vision code – just say what you want your agent to do, and it does it. Here’s a glimpse into what you can build today ( ☑️ Image AI agents – run detection, tagging, blurring, cropping, and more traditional CV tasks with English ☑️ Document AI agents – extract totals from invoices, search visual content, or act on what’s found ☑️ Video AI agents – understand scene content, trim clips, add subtitles, and create ready-to-share edits ☑️ And best-of-all, it's fully programmable with English. 🧑🍳 MCP Showcase: 📚 MCP Cookbooks: 📚 MCP Docs: #computervision #ai #agents #llm #vlm #mcp #vibecoding #genai
Sudeep Pillai17,762 views • 11 months ago
No more content to load