Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

Self-supervised representation learning looks a bit like RL. What if we literally use RL as a SSL method for visual representations? Turns out that it works quite well. In new work by Dibya Ghosh, we show how this can be done:

48,715 Aufrufe • vor 1 Jahr •via X (Twitter)

5 Kommentare

Profilbild von Sergey Levine
Sergey Levinevor 1 Jahr

Imagine an MDP where the state is the current crop of the image, an action is to pick a new crop, and rewards are matching textual captions or other (weak or strong) labels. Training a value function for this MDP instantiations a representation learning method.

Profilbild von Sergey Levine
Sergey Levinevor 1 Jahr

Reward could come from matching a text label, or provided in a fully unsupervised way via crop consistency. The stronger the reward, the better it works, but even weak rewards like crop consistency lead to improvement. For more, check out the website:

Profilbild von Joanne Mercado
Joanne Mercadovor 1 Jahr

@its_dibya *an SSL, but overall your grammar and punctuation are top-tier 💯

Profilbild von Ethan vs Machines
Ethan vs Machinesvor 1 Jahr

@its_dibya RL for SSL using semantic rewards? Brilliant method. Scaling beyond COCO might be tough here though—Canada’s R&D can’t keep up with compute demands anymore.

Profilbild von ᐸGerardSans/ᐳ🚀🇬🇧
ᐸGerardSans/ᐳ🚀🇬🇧vor 1 Jahr

@its_dibya That’s just flattened patching which is something but not really.

Ähnliche Videos