gpt2 layerwise average activations

35,999 views • 1 year ago • via X (Twitter)

10 comments

losh • 1 year ago

repo:

Quant Data • 1 year ago

🚨 $QQQ is up over 2% again 🚨 Yesterday evening, we tweeted about the $11.9M+ in bullish golden sweeps on $QQQ for 8/08 (today) and 8/09 expiries. $QQQ is up over $8 today. Want to see trades like this? Try our 7 day free trial at:

jack morris (is at iclr) • 1 year ago

you should add the logit lens token at each layer

losh • 1 year ago

last layer logits were added. but not sure of method for mapping to each layer?
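
On the question of mapping to each layer: the usual logit lens recipe is to take the hidden state at every layer, apply the model's final layer norm, and multiply by the same unembedding matrix that produces the last-layer logits. The sketch below illustrates that idea in plain Python with toy random numbers. It is not the code behind this video; `layer_norm`, `logit_lens`, and all sizes are hypothetical names and values chosen for illustration, and the layer norm omits the learned scale and bias a real model would have.

```python
import math
import random


def layer_norm(vec, eps=1e-5):
    """Normalize one vector to zero mean and unit variance (no learned scale/bias)."""
    mean = sum(vec) / len(vec)
    var = sum((v - mean) ** 2 for v in vec) / len(vec)
    return [(v - mean) / math.sqrt(var + eps) for v in vec]


def logit_lens(hidden_states, unembed):
    """Return the top-scoring token id per layer and position.

    hidden_states[layer][position] is a d_model-sized vector.
    unembed[token] is the d_model-sized output embedding of that token.
    """
    top = []
    for layer in hidden_states:
        row = []
        for h in layer:
            normed = layer_norm(h)
            # One logit per vocabulary entry: dot product with its embedding.
            logits = [sum(a * b for a, b in zip(normed, w)) for w in unembed]
            row.append(max(range(len(logits)), key=logits.__getitem__))
        top.append(row)
    return top


# Toy data, purely illustrative (not GPT-2's real weights or sizes).
random.seed(0)
n_layers, seq_len, d_model, vocab = 4, 3, 8, 10
hidden_states = [
    [[random.gauss(0, 1) for _ in range(d_model)] for _ in range(seq_len)]
    for _ in range(n_layers)
]
unembed = [[random.gauss(0, 1) for _ in range(d_model)] for _ in range(vocab)]

top_tokens = logit_lens(hidden_states, unembed)
print(top_tokens)  # one row per layer, one token id per position
```

With a real model, the hidden states would come from the per-layer outputs and the token ids would be decoded back to strings before being drawn next to each layer.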

ueaj • 1 year ago

Is this hidden state or MLP?

losh • 1 year ago

separated em out,

Rosmine • 1 year ago

Nice, another variant that would be cool is to highlight the tokens with the highest scores at each new token

losh • 1 year ago

done

moskstraumen • 1 year ago

You just gained a follower.

losh • 1 year ago

🫡
