
Alexandre Morgand
@Almorgand • 2,059 subscribers
Computer Vision Research Scientist at @simulon, music lover, fond of scientific/musical/geeky/useless stuff. I post papers on whatever I find amazing :)

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding. TL;DR: DINO-X Pro is a SOTA model with enhanced perception capabilities for various scenarios; DINO-X Edge is optimized for faster inference and better suited to deployment on edge devices.
Alexandre Morgand • 60,720 views • 1 year ago

"Cameras as Relative Positional Encoding" TL;DR: a comparison of ways to condition transformers on cameras: token-level raymaps, attention-level relative pose encodings, and a new relative encoding, Projective Positional Encoding, which uses camera frustums, (in|ex)trinsics, for relative positional encoding.
Alexandre Morgand • 17,795 views • 10 months ago