
Guri Singh
@heygurisingh • 60,048 subscribers
Sharing practical ways to use Al, No code, and Tech Tools • Follow me to learn and master AI, Tech tools & Digital Skills • AI Educator & Writer • DM for Collab
Shorts
Videos

🚨Architects are going to hate this. Someone just open sourced a full 3D building editor that runs entirely in your browser. No AutoCAD. No Revit. No $5,000/year licenses. It's called Pascal Editor. Built with React Three Fiber and WebGPU -- meaning it renders directly on your GPU at near-native speed. Here's what's inside this thing: → A full building/level/wall/zone hierarchy you can edit in real time → An ECS-style architecture where every object updates through GPU-powered systems → Zustand state management with full undo/redo built in → Next.js frontend so it deploys as a web app, not a desktop install → Dirty node tracking -- only re-renders what changed, not the whole scene Here's the wildest part: You can stack, explode, or solo individual building levels. Select a zone, drag a wall, reshape a slab -- all in 3D, all in the browser. Architecture firms pay $50K+ per seat for BIM software that does this workflow. This is free. 100% Open Source.
Guri Singh2,967,409 просмотров • 2 месяцев назад

Holy shit... A guy got laid off, built an AI job search system on Claude Code, evaluated 740+ job offers with it, and landed a Head of Applied AI role. Then he open-sourced the entire thing. It's called career-ops. One slash command. Full pipeline. Paste a job URL → get back a structured A-F evaluation, an ATS-optimized PDF tailored to that exact role, salary research, interview prep, and a tracker entry. All in one shot. No spreadsheets. No copy-pasting. No spray-and-pray. Here's what's inside: → 14 skill modes (evaluate, scan, pdf, batch, apply, deep research, negotiation scripts, LinkedIn outreach) → Portal scanner pre-loaded with 45+ companies — Anthropic, OpenAI, ElevenLabs, Mistral, Cohere, Stripe, Retool, Vercel, Decagon, the works → 19 search queries across Ashby, Greenhouse, Lever, Wellfound, Workable → ATS-optimized PDF generation via Playwright with Space Grotesk + DM Sans → Go terminal dashboard built with Bubble Tea to browse your pipeline → Batch mode that evaluates 10+ offers in parallel using Claude sub-agents → An interview Story Bank that accumulates STAR+Reflection stories across evaluations until you have 5-10 master answers for any behavioral question → Auto-fill for application forms The wildest part isn't the automation. It's the philosophy. Career-ops is explicitly NOT a spray-and-pray tool. It's a filter. The system literally refuses to recommend applying to anything scoring below 4.0/5. The whole point is to find the few offers worth your time out of hundreds, not to flood recruiters with garbage. It evaluates fit by reasoning about your CV vs the JD. Not keyword matching. And because it's all built on Claude Code skills, you can ask Claude to rewrite the system itself. "Change the archetypes to backend roles." "Add these 10 companies." "Translate the modes to English." It reads the same files it uses, so it knows exactly what to edit. 8.2k stars already. 100% Open Source. MIT licensed. (Link in the replies)
Guri Singh973,412 просмотров • 1 месяц назад

Alibaba just open-sourced the world's FIRST AI dubbing model that handles multi-speaker scenes. It's called Fun-CineForge and it nails what every other dubbing model fails at: Lip-sync. Emotion. Voice stability. Timing. Across multiple characters. Not a single open-source model could do this before. Here's how it works: ↓ Tongyi Lab Alibaba Group
Guri Singh620,634 просмотров • 2 месяцев назад

🚨Science nerds are going to lose their minds. Kai Rowan just open sourced a framework that predicts how your brain responds to any text, audio, or video by simulating cortical fMRI activity with 30% more accuracy than Meta's own model. No fMRI scanner. No neuroscience PhD. No million-dollar lab. It's called NForge. Here's what this thing actually does: → Feed it any combination of text, audio, or video and it predicts cortical surface activity across ~20,484 brain vertices → Extracts deep features via LLaMA 3.2, V-JEPA2, and Wav2Vec-BERT simultaneously → Generates ROI attention maps showing exactly which brain regions fire hardest at which moments → Runs real-time streaming predictions from live feature streams -- no pre-loading the full clip → Breaks down exactly how much text vs audio vs video drove each prediction with per-vertex modality attribution scores → Adapts to entirely new subjects with just a few calibration scans -- no full retraining required Here's the wildest part: Built on Meta's TRIBE v2 foundation but adds 6 major capabilities Meta never shipped. Cross-subject generalization. Streaming inference. Modality attribution. torch.compile support. Full test coverage. Professional src/ package layout. You literally point this at a movie clip and it tells you which parts of the human cortex light up -- broken down by what your eyes, ears, and language centers each contributed. That sentence shouldn't be real in 2026. But here we are. 100% Open Source. pip install nforge. (Link in the comments)
Guri Singh244,515 просмотров • 2 месяцев назад

Holy shit... Typeless just solved the most annoying problem in multilingual communication. Speak in English. Get perfectly written text in Spanish, French, or any language phrased like a native speaker wrote it. No Google Translate. No awkward phrasing. No copy-pasting between apps. Just hold the Speak button, swipe up, and talk. It doesn't just translate your words, it translates your intent. Filler words gone. Grammar fixed. Output reads like a local typed it. Available on Mac and iOS. 4,000 words/week free. Try it: iOS:
Guri Singh271,704 просмотров • 2 месяцев назад

Cloud GPU training is a scam. A single M4 MacBook does 2.9 TFLOPS. Seven friends with MacBooks match an NVIDIA A100. Alexander Hayes just open-sourced a tool that makes this work over Wi-Fi. It's called AirTrain. Here's how it works: Traditional distributed training (DDP) syncs gradients after every single step. For a 124M parameter model, that's ~500MB exchanged per step. You need 50 GB/s of sustained bandwidth. Impossible over Wi-Fi. AirTrain uses the DiLoCo algorithm. Each Mac trains independently for 500 steps, then syncs only the difference. One sync per 500 steps instead of one per step. 500x less network communication. Wi-Fi actually works. The entire sync takes ~2 seconds. Here's what makes it wild: → Zero-config discovery. Devices find each other automatically via mDNS/Bonjour. Same protocol as AirDrop. → Fault tolerant. Nodes can join and leave mid-training without killing the run. → Checkpoint relay. Train for a few hours, export a checkpoint, hand it off to someone else to continue. Like a relay race for ML training. → Built on Apple's MLX framework. Native to M1/M2/M3/M4/M5 unified memory. No host-to-device copy overhead. → Local dashboard. Real-time loss curves, peer monitoring, throughput metrics in your browser. Here's the wildest part: An M4 Max with 128GB unified memory can train a 70B parameter model without offloading. An NVIDIA RTX 4090 has 24GB VRAM. Apple Silicon gets ~245-460 GFLOPS per watt. Training on MacBooks costs almost nothing in electricity compared to cloud GPUs. And there are hundreds of millions of Apple Silicon Macs in the world. The math: Traditional DDP: 1 sync per step = 50 GB/s required AirTrain (DiLoCo): 1 sync per 500 steps = 0.1 GB/s required Wi-Fi handles 0.1 GB/s. That's it. That's the breakthrough. They even built a community platform at with live session browsing, checkpoint sharing, and a contributor leaderboard. Training a 124M parameter GPT-2? Instead of renting cloud GPUs at $3/hr, pool three MacBooks in a coffee shop and train for free. MIT licensed. Built in Python. 1 contributor. Early stage but the idea is insane. 100% Open Source. (Link in the comments)
Guri Singh160,201 просмотров • 1 месяц назад

🚨BREAKING: An open-source agentic video production system just dropped. 11 pipelines, 49 tools, and a full product ad produced for $0.69 total. It's called OpenMontage. And it's not a text-to-video tool. It's a full production orchestration system where your AI coding assistant (Claude Code, Cursor, Copilot, Windsurf) becomes the director. Describe what you want in plain language. The agent researches, scripts, generates assets, edits, and renders the final video. Here's what the pipeline actually does: → Live web research first: 15-25+ searches across YouTube, Reddit, news sites before writing a single word of script → 12 video generation providers: Kling, Runway Gen-4, Google Veo 3, MiniMax, plus local GPU options (WAN 2.1, Hunyuan, CogVideo) → 8 image generation providers: FLUX, Google Imagen 4, DALL-E 3, Stable Diffusion locally → 4 TTS providers: ElevenLabs, Google (700+ voices), OpenAI, and Piper offline for free → WhisperX word-level subtitles burned in automatically → Remotion for React-based animated composition with spring physics, transitions, TikTok-style captions → Budget governance: cost estimate before execution, per-action approval above $0.50, hard cap at $10 Here's the wildest part: One product ad. 4 AI-generated images, TTS narration, royalty-free music, word-level subtitles, Remotion data visualizations. Total cost: $0.69. Zero manual asset work. Works with zero API keys too. Piper narrates locally, Pexels/Pixabay provide free stock, Remotion animates everything. No spend required to start. 100% Open Source. AGPL v3 License. (Link in the comments)
Guri Singh112,201 просмотров • 1 месяц назад

🚨 BREAKING: 170K+ traders are opening leveraged positions directly from geopolitical alerts, before the headline even hits your timeline. You're still refreshing CNN. Glint built a Telegram bot with an in-house LLM trained on 400+ news and OSINT sources. It tracks conflicts, sanctions, leadership changes, economic shifts and tells you exactly what's moving the S&P 500 before the rest of the market reacts. Then you open a leveraged position on Hyperliquid or place a bet on Polymarket. Same chat. Same second. No CEX. No KYC. No app switching. Non-custodial wallet. It's called Glint Bot. By the time you hear about it on Twitter, their users have already traded it. Watch it in action ↓
Guri Singh113,181 просмотров • 2 месяцев назад

1. The Meta-Prompt Copy this into ChatGPT: “You are to act as my prompt engineer. I would like to accomplish: [insert your goal]. Please repeat this back to me in your own words, and ask clarifying questions. Once we confirm, generate the final optimized prompt.” That’s it. This turns ChatGPT into your personal prompt engineer.
Guri Singh21,888 просмотров • 9 месяцев назад
Больше нет контента для загрузки