Aaron Ng's banner
Aaron Ng's profile picture

Aaron Ng

@localghost33,711 subscribers

@mecha_corp. Founder of the local AI tool @apolloaiapp (acquired by @liquidai). Created Hyper Online, one of the most popular avatar apps. Former CashApp & Meta

Shorts

llama 3 70b beamed to my phone from my M1 Max ~7.6 tok/s with mlx. your own little gpt-4 at home

llama 3 70b beamed to my phone from my M1 Max ~7.6 tok/s with mlx. your own little gpt-4 at home

1,116,956 Aufrufe

Here’s Deepseek r1 1.5B thinking through a problem — it’s comparable to 4o and Claude 3.5 Sonnet in a number of domains like math. Except… it’s a 1.5B model… and can run on virtually any hardware. Truly a huge efficiency leap.

Here’s Deepseek r1 1.5B thinking through a problem — it’s comparable to 4o and Claude 3.5 Sonnet in a number of domains like math. Except… it’s a 1.5B model… and can run on virtually any hardware. Truly a huge efficiency leap.

564,677 Aufrufe

Llama 3.3 70B streaming at 7.8 tok/s on an M1 Pro to my phone. Not the fastest, but wild that you can serve something this powerful to your whole house without internet.

Llama 3.3 70B streaming at 7.8 tok/s on an M1 Pro to my phone. Not the fastest, but wild that you can serve something this powerful to your whole house without internet.

300,969 Aufrufe

Run Llama 3.1 1B offline at 60+ tok/s. Entirely on your phone. Siri can’t save you offline, but a local model just might.

Run Llama 3.1 1B offline at 60+ tok/s. Entirely on your phone. Siri can’t save you offline, but a local model just might.

154,576 Aufrufe

Ollama can host too: here’s Llama 3.3 70B streaming at ~6.5 tok/s from my M1 Max to my phone. Your laptop is an offline self-sovereign AI server.

Ollama can host too: here’s Llama 3.3 70B streaming at ~6.5 tok/s from my M1 Max to my phone. Your laptop is an offline self-sovereign AI server.

53,827 Aufrufe

SmallThinker 3B is one of the AI models I’d want backed up on my phone. Here’s it is reasoning at 12 tok/s on my phone. If you’re ever stuck offline, you’ll want a model that can think.

SmallThinker 3B is one of the AI models I’d want backed up on my phone. Here’s it is reasoning at 12 tok/s on my phone. If you’re ever stuck offline, you’ll want a model that can think.

35,857 Aufrufe

Offline AI, right on your phone. Small versions of Hermes 3, Qwen 2.5, and Llama 3.2 running privately in your hand. Grab them now in Apollo 1.0.17

Offline AI, right on your phone. Small versions of Hermes 3, Qwen 2.5, and Llama 3.2 running privately in your hand. Grab them now in Apollo 1.0.17

36,884 Aufrufe

Wild that you can run a GPT-4 tier AI on a laptop and serve it to your whole network. Llama 3.3 70B is a beast.

Wild that you can run a GPT-4 tier AI on a laptop and serve it to your whole network. Llama 3.3 70B is a beast.

35,676 Aufrufe

It’s never been easier to have your own private AIs running at home, off-grid, or anywhere else. For AI independence, all you need is: - LM Studio on your computer. - Apollo in your pocket.

It’s never been easier to have your own private AIs running at home, off-grid, or anywhere else. For AI independence, all you need is: - LM Studio on your computer. - Apollo in your pocket.

13,319 Aufrufe

Videos

Keine weiteren Inhalte verfügbar