Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

⚡️🔬📣 Excited to share our new nature article building and evaluating PathChat, a multimodal generative AI copilot and chatbot for human pathology. Article: Open Access Link: We leverage our previous success in building foundation models for computational pathology such as UNI / CONCH and combine it with the advancements...

291,475 Aufrufe • vor 2 Jahren •via X (Twitter)

10 Kommentare

Profilbild von Bo Wang
Bo Wangvor 2 Jahren

@Nature Congratulations @AI4Pathology ! Your team is on fire 🔥! Look forward to hearing about your research at CVPR!

Profilbild von Aaditya Ura
Aaditya Uravor 2 Jahren

@Nature Amazing work! Will it be open source?

Profilbild von Arjun (Raj) Manrai
Arjun (Raj) Manraivor 2 Jahren

@Nature Congrats @AI4Pathology !

Profilbild von Tanishq Mathew Abraham, Ph.D.
Tanishq Mathew Abraham, Ph.D.vor 2 Jahren

@Nature Congrats to the lab, great work!

Profilbild von Mingyao Li
Mingyao Livor 2 Jahren

@Nature Congratulations, Faisal!! 👍

Profilbild von Sebastian
Sebastianvor 2 Jahren

@Nature Congrats 🥳

Profilbild von Mehdi Maanaoui
Mehdi Maanaouivor 2 Jahren

@Nature .@Ijeb #When ??

Profilbild von Bryan Wong
Bryan Wongvor 2 Jahren

@Nature Great work! Are there any plans to open-source PathChat?

Profilbild von Khalid خالد
Khalid خالدvor 2 Jahren

@Nature very cool!

Profilbild von John
Johnvor 2 Jahren

@Nature That's fantastic! Congrats on the publication! PathChat sounds like a fascinating tool for pathology. Can't wait to see how it helps advance the field. 🌟 #Innovation #AIInHealthcare

Ähnliche Videos

⚡️📣👇Tremendously excited to share our new Cell article, where we develop TriPath, a method for analyzing 3D pathology samples using weakly supervised AI. Article: TriPath enables 3D computational pathology via 3D multiple instance learning allowing AI models to capture intricate morphological details from pathology volumes. Code: Blog post: Tested on two different imaging modalities, and patient cohorts from two institutions. Our superstar Andrew H. Song put in a monumental effort of leading the study, in a fantastic collaboration with Jonathan Liu at University of Washington . Interesting aspects: - Utilizing the whole tissue volume and leveraging 3D deep learning enable superior risk prediction performance compared to 2D deep learning baselines based on a few sampled tissue sections that emulate standard clinical practice. This indicates TriPath can harness additional information provided by 3D tissue morphology. - The performance is also superior to clinical baselines from a reader study that involved six expert pathologists. - The morphologically heterogeneous tissue volume could lead to opposing patient-level outcome predictions, dependent on which portion of the tissue volume is used. This concurs with current clinical literature warning that tissue sampling bias can lead to misdiagnosis. Some limitations: - While the 3D pathology cohort size is unprecedented, it is smaller than typical 2D pathology cohorts. Further large-scale studies will be required for validation. Nevertheless, we believe that this study will initiate a positive cycle, encouraging academic institutions and pharmaceutical companies to contribute large banks of human tissue blocks with paired clinical outcomes, thus speeding up advancements in 3D computational pathology. Concluding insights: We believe that 3D pathology is just around the corner - It has the huge potential to not only augment/improve the current clinical practice centered around 2D examination of human tissue, but also help reveal novel biomarkers for prognosis and therapeutic response.. Harvard Medical School Harvard Data Science Initiative Mass General Brigham Broad Institute

Faisal Mahmood

65,520 Aufrufe • vor 2 Jahren

Open science is how we continue to push technology forward and today at Meta FAIR we’re sharing eight new AI research artifacts including new models, datasets and code to inspire innovation in the community. More in the video from Joelle Pineau. This work is another important step towards our goal of achieving Advanced Machine Intelligence (AMI). What we’re releasing: • Meta Spirit LM: An open source language model for seamless speech and text integration. • Meta Segment Anything Model 2.1: An updated checkpoint with improved results on visually similar objects, small objects and occlusion handling. Plus a new developer suite to make it easier for developers to build with SAM 2. • Layer Skip: Inference code and fine-tuned checkpoints demonstrating a new method for enhancing LLM performance. • SALSA: New code to enable researchers to benchmark AI-based attacks in support of validating security for post-quantum cryptography. • Meta Lingua: A lightweight and self-contained codebase designed to train language models at scale. • Meta Open Materials: New open source models and the largest dataset of its kind to accelerate AI-driven discovery of new inorganic materials. • MEXMA: A new research paper and code for our novel pre-trained cross-lingual sentence encoder with coverage across 80 languages. • Self-Taught Evaluator: a new method for generating synthetic preference data to train reward models without relying on human annotations. Access to state-of-the-art AI creates opportunities for everyone. We’re excited to share this work and look forward to seeing the community innovation that results from it. Details and access to everything released by FAIR today ➡️

AI at Meta

150,222 Aufrufe • vor 1 Jahr

VITA Towards Open-Source Interactive Omni Multimodal LLM discuss: The remarkable multimodal capabilities and interactive experience of GPT-4o underscore their necessity in practical applications, yet open-source models rarely excel in both areas. In this paper, we introduce VITA, the first-ever open-source Multimodal Large Language Model (MLLM) adept at simultaneous processing and analysis of Video, Image, Text, and Audio modalities, and meanwhile has an advanced multimodal interactive experience. Starting from Mixtral 8x7B as a language foundation, we expand its Chinese vocabulary followed by bilingual instruction tuning. We further endow the language model with visual and audio capabilities through two-stage multi-task learning of multimodal alignment and instruction tuning. VITA demonstrates robust foundational capabilities of multilingual, vision, and audio understanding, as evidenced by its strong performance across a range of both unimodal and multimodal benchmarks. Beyond foundational capabilities, we have made considerable progress in enhancing the natural multimodal human-computer interaction experience. To the best of our knowledge, we are the first to exploit non-awakening interaction and audio interrupt in MLLM. VITA is the first step for the open-source community to explore the seamless integration of multimodal understanding and interaction. While there is still lots of work to be done on VITA to get close to close-source counterparts, we hope that its role as a pioneer can serve as a cornerstone for subsequent research.

AK

23,958 Aufrufe • vor 1 Jahr