Jesse Mu

@jayelmnop · 6,873 subscribers

computational linguistics

Shorts

Prompting is cool and all, but isn't it a waste of compute to encode a prompt over and over again? We learn to compress prompts up to 26x by using "gist tokens", saving memory+storage and speeding up LM inference: (w/ Xiang Lisa Li and noahdgoodman) 🧵

160,673 views