Загрузка видео...

Не удалось загрузить видео

На главную

asi has been achieved internally

146,363 просмотров • 1 год назад •via X (Twitter)

Комментарии: 10

Фото профиля danb
danb1 год назад

q learning optimizations that worked main theme: compress states into smaller, info-dense chunks. maximize (info^2 / state space) - gave it relative distances to objects from the head, rather than absolute positions of objects - gave it log(distance) to things instead of the distance other: - scaled rewards appropriately - made learning rate smaller - having a 1% chance to randomly move has high p(killing the snake) if its running along a border or next to itself

Фото профиля Aynio
Aynio1 год назад

very nice. now we need another predator

Фото профиля danb
danb1 год назад

snake with two snakes would be interesting

Фото профиля Joseph Suarez (e/🐡)
Joseph Suarez (e/🐡)1 год назад

Hey, try it with PufferLib. I bet you get big snake fast

Фото профиля danb
danb1 год назад

PufferLib seems cool, but my experiments are pretty small and just for learning RL id probably look into it more if I were doing RL for a company but right now its just learning on the side

Фото профиля techniacus
techniacus1 год назад

That's insane!

Фото профиля pixel
pixel1 год назад

was thinking about doing this same project, this is dope!

Фото профиля danb
danb1 год назад

it was fun figuring out optimizations. it barely got above len=2 for the longest time.

Фото профиля Forscience
Forscience1 год назад

I want to see q learning on an actual complex game like league of legends or world of Warcraft.

Фото профиля danb
danb1 год назад

id be interested to see how someone would pull that off successfully youd have to do a TON of compression of the state space, considering how much there would be to take into account

Похожие видео