Video wird geladen...

Video konnte nicht geladen werden

Zur Startseite

asi has been achieved internally

146,363 Aufrufe • vor 2 Jahren •via X (Twitter)

10 Kommentare

Profilbild von danb
danbvor 2 Jahren

q learning optimizations that worked main theme: compress states into smaller, info-dense chunks. maximize (info^2 / state space) - gave it relative distances to objects from the head, rather than absolute positions of objects - gave it log(distance) to things instead of the distance other: - scaled rewards appropriately - made learning rate smaller - having a 1% chance to randomly move has high p(killing the snake) if its running along a border or next to itself

Profilbild von Aynio
Ayniovor 2 Jahren

very nice. now we need another predator

Profilbild von danb
danbvor 2 Jahren

snake with two snakes would be interesting

Profilbild von Joseph Suarez (e/🐡)
Joseph Suarez (e/🐡)vor 2 Jahren

Hey, try it with PufferLib. I bet you get big snake fast

Profilbild von danb
danbvor 2 Jahren

PufferLib seems cool, but my experiments are pretty small and just for learning RL id probably look into it more if I were doing RL for a company but right now its just learning on the side

Profilbild von techniacus
techniacusvor 2 Jahren

That's insane!

Profilbild von pixel
pixelvor 2 Jahren

was thinking about doing this same project, this is dope!

Profilbild von danb
danbvor 2 Jahren

it was fun figuring out optimizations. it barely got above len=2 for the longest time.

Profilbild von Forscience
Forsciencevor 2 Jahren

I want to see q learning on an actual complex game like league of legends or world of Warcraft.

Profilbild von danb
danbvor 2 Jahren

id be interested to see how someone would pull that off successfully youd have to do a TON of compression of the state space, considering how much there would be to take into account

Ähnliche Videos