Loading video...

Video Failed to Load

Go Home

Sheepy-T: A fully open-source instruction-tuned language model based on GPT-J running locally on iPhone 14. Reply for beta access via TestFlight.

102,354 views • 3 years ago •via X (Twitter)

9 Comments

Kevin Kwok's profile picture
Kevin Kwok3 years ago

Note that this video is running in real time, rather than being sped up 10x like the previous incarnation (

Kevin Kwok's profile picture
Kevin Kwok3 years ago

This is possible because it's built on top of a slightly smaller model, GPT-J, by @AiEleuther rather than LLaMA. Thus it can fit entirely in memory rather than being constantly paged in with mmap. On top of that, it is a fully open model without questionable provenance.

Kevin Kwok's profile picture
Kevin Kwok3 years ago

That said- it still only barely fits into memory- you need a device with at least 6GB of RAM. That means it works on all iPhone 14, and the Pro and Pro Max variants of iPhone 12 and 13. All other iPhones, the XS, XR, and mini are unsupported.

JJ's profile picture
JJ3 years ago

How snappy can you make this? Millisecond latency?

Kevin Kwok's profile picture
Kevin Kwok3 years ago

It can generate about 3 tokens a second at peak, but sometimes iOS has some background tasks which makes it run slower.

Aroga's profile picture
Aroga3 years ago

Joining the line for beta lol

Shivam Singhal's profile picture
Shivam Singhal3 years ago

interesting. would love to try it out

Alex Valente's profile picture
Alex Valente3 years ago

Roger keen for invite @jkoukides

Guanqun (David) Yang's profile picture
Guanqun (David) Yang3 years ago

Please let me try it. And is it possible to run it on an iPhone 7?

Related Videos