Video yükleniyor...

Video Yüklenemedi

Ana Sayfaya Dön

New research from Meta FAIR — Meta Explore Theory-of-Mind: Program guided adversarial data generation for theory of mind reasoning. This work includes a new research paper, code & a dataset on Hugging Face. Details + eight more new releases from FAIR ➡️

38,568 görüntüleme • 1 yıl önce •via X (Twitter)

3 Yorum

Zoe Wang profil fotoğrafı
Zoe Wang1 yıl önce

Breakdown of the paper: The paper introduces ExploreToM, a framework for generating diverse and challenging data to evaluate and train large language models (LLMs) on theory of mind reasoning. Theory of mind is the ability to understand others' intentions, beliefs, and mental states, which is crucial for social intelligence. The ExploreToM-generated data is highly challenging for state-of-the-art LLMs, with accuracies as low as 0% and 9% for Llama-3.1-70B and GPT-4o, respectively. The authors found that LLMs struggle with basic state tracking, which is a prerequisite for theory of mind reasoning. They also showed that fine-tuning on ExploreToM data can significantly improve performance on existing theory of mind benchmarks. full paper:

Data & Analytics profil fotoğrafı
Data & Analytics1 yıl önce

@huggingface @AIatMeta, yo, that sounds like some cutting-edge stuff from Meta. Theory of mind in AI? That's deep! How do you think it’ll impact our interaction with tech?

Electe profil fotoğrafı
Electe1 yıl önce

@huggingface @AIatMeta, exciting developments in theory-of-mind reasoning. 📚

Benzer Videolar

Open science is how we continue to push technology forward and today at Meta FAIR we’re sharing eight new AI research artifacts including new models, datasets and code to inspire innovation in the community. More in the video from Joelle Pineau. This work is another important step towards our goal of achieving Advanced Machine Intelligence (AMI). What we’re releasing: • Meta Spirit LM: An open source language model for seamless speech and text integration. • Meta Segment Anything Model 2.1: An updated checkpoint with improved results on visually similar objects, small objects and occlusion handling. Plus a new developer suite to make it easier for developers to build with SAM 2. • Layer Skip: Inference code and fine-tuned checkpoints demonstrating a new method for enhancing LLM performance. • SALSA: New code to enable researchers to benchmark AI-based attacks in support of validating security for post-quantum cryptography. • Meta Lingua: A lightweight and self-contained codebase designed to train language models at scale. • Meta Open Materials: New open source models and the largest dataset of its kind to accelerate AI-driven discovery of new inorganic materials. • MEXMA: A new research paper and code for our novel pre-trained cross-lingual sentence encoder with coverage across 80 languages. • Self-Taught Evaluator: a new method for generating synthetic preference data to train reward models without relying on human annotations. Access to state-of-the-art AI creates opportunities for everyone. We’re excited to share this work and look forward to seeing the community innovation that results from it. Details and access to everything released by FAIR today ➡️

AI at Meta

150,222 görüntüleme • 1 yıl önce

Today is a good day for open science. As part of our continued commitment to the growth and development of an open ecosystem, today at Meta FAIR we’re announcing four new publicly available AI models and additional research artifacts to inspire innovation in the community and help advance AI in a responsible way. More in the video from Joelle Pineau. What we’re releasing: 🦎 Meta Chameleon 7B & 34B language models that support mixed-modal input and text-only outputs. 🪙 Meta Multi-Token Prediction Pretrained Language Models for code completion using Multi-Token Prediction. 🎼 Meta JASCO Generative text-to-music models capable of accepting various conditioning inputs for greater controllability. Paper available today with a pretrained model coming soon. 🗣️ Meta AudioSeal An audio watermarking model that we believe is the first designed specifically for the localized detection of AI-generated speech, available under a commercial license. 📝 Additional RAI artifacts Including research, data and code to measure and improve the representation of geographical and cultural preferences and diversity in AI systems. We believe that access to state-of-the-art AI creates opportunities for everyone – not just a small handful of Big Tech companies. We’re excited to share this work and to see how the community learns, iterates and builds using this technology. Details and access to everything released by FAIR today ➡️

AI at Meta

380,695 görüntüleme • 2 yıl önce