正在加载视频...

视频加载失败

Sheepy-T: A fully open-source instruction-tuned language model based on GPT-J running locally on iPhone 14. Reply for beta access via TestFlight.

102,354 次观看 • 3 年前 •via X (Twitter)

9 条评论

Kevin Kwok 的头像
Kevin Kwok3 年前

Note that this video is running in real time, rather than being sped up 10x like the previous incarnation (

Kevin Kwok 的头像
Kevin Kwok3 年前

This is possible because it's built on top of a slightly smaller model, GPT-J, by @AiEleuther rather than LLaMA. Thus it can fit entirely in memory rather than being constantly paged in with mmap. On top of that, it is a fully open model without questionable provenance.

Kevin Kwok 的头像
Kevin Kwok3 年前

That said- it still only barely fits into memory- you need a device with at least 6GB of RAM. That means it works on all iPhone 14, and the Pro and Pro Max variants of iPhone 12 and 13. All other iPhones, the XS, XR, and mini are unsupported.

JJ 的头像
JJ3 年前

How snappy can you make this? Millisecond latency?

Kevin Kwok 的头像
Kevin Kwok3 年前

It can generate about 3 tokens a second at peak, but sometimes iOS has some background tasks which makes it run slower.

Aroga 的头像
Aroga3 年前

Joining the line for beta lol

Shivam Singhal 的头像
Shivam Singhal3 年前

interesting. would love to try it out

Alex Valente 的头像
Alex Valente3 年前

Roger keen for invite @jkoukides

Guanqun (David) Yang 的头像
Guanqun (David) Yang3 年前

Please let me try it. And is it possible to run it on an iPhone 7?

相关视频