
Harman Singh (in NYC for summer)
@Harman26Singh • 1,867 subscribers
PhD student @berkeley_ai, RS Intern MSL @MetaAI (NYC). Prev: Gemini @GoogleDeepMind, AI Resident @MetaAI. Interested in intelligence.
Videos

Can LLMs Self-Verify? Much better than you'd expect.

LLMs are increasingly used as parallel reasoners, sampling many solutions at once. Choosing the right answer is the real bottleneck. We show that pairwise self-verification is a powerful primitive.

Introducing V1, a framework that unifies generation and self-verification:
💡 Pairwise self-verification beats pointwise scoring, improving test-time scaling
💡 V1-Infer: Efficient tournament-style ranking that improves self-verification
💡 V1-PairRL: RL training where generation and verification co-evolve for developing better self-verifiers
🧵👇
Harman Singh (in NYC for summer) • 102,871 views • 3 months ago
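
For readers who want a concrete picture of what "pairwise" and "tournament-style" could mean here, below is a rough Python sketch. It is not V1-Infer itself: the post does not spell out that algorithm, so this is a plain single-elimination bracket. The names `tournament_select` and `toy_prefer` are hypothetical, and `toy_prefer` is a toy stand-in for what would in practice be an LLM asked to compare two of its own candidate solutions.

```python
import random
from typing import Callable, Sequence


def tournament_select(
    candidates: Sequence[str],
    prefer: Callable[[str, str], bool],
    seed: int = 0,
) -> str:
    """Pick one candidate using only pairwise comparisons.

    `prefer(a, b)` returns True when `a` is judged better than `b`.
    This is a generic knockout bracket (an assumption for illustration),
    using len(candidates) - 1 comparisons in total.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    rng = random.Random(seed)
    pool = list(candidates)
    rng.shuffle(pool)  # random seeding of the bracket
    while len(pool) > 1:
        next_round = []
        # Compare neighbours; the preferred one advances.
        for i in range(0, len(pool) - 1, 2):
            a, b = pool[i], pool[i + 1]
            next_round.append(a if prefer(a, b) else b)
        # With an odd pool, the last candidate gets a bye.
        if len(pool) % 2 == 1:
            next_round.append(pool[-1])
        pool = next_round
    return pool[0]


def toy_prefer(a: str, b: str) -> bool:
    """Hypothetical judge: prefers the answer closer to 42.

    A real pairwise verifier would be a model call, and could be noisy.
    """
    return abs(int(a) - 42) <= abs(int(b) - 42)


winner = tournament_select(["40", "42", "17", "45", "39"], toy_prefer)
print(winner)
```

The point of the sketch is the interface: the selector never needs an absolute score per candidate, only a judgment between two at a time. A pointwise alternative would score each candidate independently and take the maximum.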