ollama's banner

ollama

@ollama • 170,971 subscribers

https://t.co/1JpLwJ9Bdv

Shorts

Ollama can now think! 🤔🤔🤔 For thinking models, and especially useful for very thoughtful models like DeepSeek-R1-0528, Ollama can separate the thoughts and the response. Thinking can also be disabled! This is useful for getting a direct response. This works across Ollama's CLI, API, and Python/JavaScript libraries. 🧵 blog post 👇👇👇

Ollama can now think! 🤔🤔🤔 For thinking models, and especially useful for very thoughtful models like DeepSeek-R1-0528, Ollama can separate the thoughts and the response. Thinking can also be disabled! This is useful for getting a direct response. This works across Ollama's CLI, API, and Python/JavaScript libraries. 🧵 blog post 👇👇👇

106,267 views

.ollama is playing with AI at Meta Llama 4 Scout! 🤯 a perfect opportunity to test Ollama's giant super computer ✈️✈️✈️

.ollama is playing with AI at Meta Llama 4 Scout! 🤯 a perfect opportunity to test Ollama's giant super computer ✈️✈️✈️

113,526 views

It's fast!! Ollama now supports AMD graphics cards! All the features of Ollama can now be accelerated by AMD graphics cards in preview on Ollama for Linux and Windows. Try it 👇👇 *video is not edited.

It's fast!! Ollama now supports AMD graphics cards! All the features of Ollama can now be accelerated by AMD graphics cards in preview on Ollama for Linux and Windows. Try it 👇👇 *video is not edited.

108,069 views

🎁 Happy 100th Ollama release! In 0.4.5, we're updating Ollama's Python library! Python functions can now be provided as tools to models. Strong typing for improved reliability and type safety New examples 🚀 👇👇👇 🍻 to the next 100 releases!

🎁 Happy 100th Ollama release! In 0.4.5, we're updating Ollama's Python library! Python functions can now be provided as tools to models. Strong typing for improved reliability and type safety New examples 🚀 👇👇👇 🍻 to the next 100 releases!

43,876 views

Videos

Anya Rossi

sweetdream.ai

SweetDream.ai•Sponsored•Livecam

Watch Anya Live

Anya is streaming live right now! Join her private show and enjoy exclusive content.

Exclusive private shows

1.2k viewers online

Private Show

Join now for exclusive access

Free preview available • Premium content

U.S. open-source models are quickly gaining ground. @Nvidia's newest Nemotron Ultra is fast growing on Ollama and unlocking complex, longer running tasks for developers

U.S. open-source models are quickly gaining ground. @Nvidia's newest Nemotron Ultra is fast growing on Ollama and unlocking complex, longer running tasks for developers

71,002 views • 6 days ago

Open models are already being used in the enterprise. Over 85% of the Fortune 500 companies already use Ollama to fulfill specific tasks. Jeffrey Morgan Why open models and Ollama? Ownership. Open models are yours to keep, customize, and optimize. Affordable. Run it the way you like, in your own environment. Private. Your data belongs to you.

Open models are already being used in the enterprise. Over 85% of the Fortune 500 companies already use Ollama to fulfill specific tasks. Jeffrey Morgan Why open models and Ollama? Ownership. Open models are yours to keep, customize, and optimize. Affordable. Run it the way you like, in your own environment. Private. Your data belongs to you.

35,984 views • 4 days ago

Jeffrey Morgan was on TBPN this week to discuss Ollama's Series B fundraise and why open models are quickly becoming the default choice for developers Full video:

Jeffrey Morgan was on TBPN this week to discuss Ollama's Series B fundraise and why open models are quickly becoming the default choice for developers Full video:

34,537 views • 8 days ago

Ollama 0.17 makes it much simpler to use open models with OpenClaw🦞 Try it with: ollama launch openclaw Tutorial post in 🧵

Ollama 0.17 makes it much simpler to use open models with OpenClaw🦞 Try it with: ollama launch openclaw Tutorial post in 🧵

201,376 views • 4 months ago

Ollama can now launch Pi, a minimal coding agent which you can customize for your workflow ollama launch pi You can even ask pi to write extensions for itself

Ollama can now launch Pi, a minimal coding agent which you can customize for your workflow ollama launch pi You can even ask pi to write extensions for itself

190,618 views • 4 months ago

Ollama can now run subagents in OpenCode Parallelize tasks which require longer context windows like research, refactoring, and code reviews ollama launch opencode

Ollama can now run subagents in OpenCode Parallelize tasks which require longer context windows like research, refactoring, and code reviews ollama launch opencode

126,094 views • 4 months ago

🤯 Wow! In one prompt Qwen3-Coder-Next generated a fully working flappy birds game in HTML. (0:05) Claude Code with Qwen3-Coder-Next (0:26) Shows the game running Run it fully locally: ollama pull qwen3-coder-next Ollama's cloud if you can't run it locally: ollama pull qwen3-coder-next:cloud Try launching it with Claude Code using ollama launch (link to play 🧵) So cool! Qwen Tongyi Lab Junyang Lin

🤯 Wow! In one prompt Qwen3-Coder-Next generated a fully working flappy birds game in HTML. (0:05) Claude Code with Qwen3-Coder-Next (0:26) Shows the game running Run it fully locally: ollama pull qwen3-coder-next Ollama's cloud if you can't run it locally: ollama pull qwen3-coder-next:cloud Try launching it with Claude Code using ollama launch (link to play 🧵) So cool! Qwen Tongyi Lab Junyang Lin

117,339 views • 5 months ago

Ollama now supports subagents and web search in Claude Code! Subagents can run tasks in parallel, such as file search, code exploration, and research, each in their own context. No MCP servers to configure or API keys required. Try it with any model on Ollama's cloud: ollama launch claude --model minimax-m2.5:cloud

Ollama now supports subagents and web search in Claude Code! Subagents can run tasks in parallel, such as file search, code exploration, and research, each in their own context. No MCP servers to configure or API keys required. Try it with any model on Ollama's cloud: ollama launch claude --model minimax-m2.5:cloud

84,101 views • 5 months ago

Ollama v0.8 is here! Now it can stream responses with tool calling! Example of Ollama doing web search:

Ollama v0.8 is here! Now it can stream responses with tool calling! Example of Ollama doing web search:

148,278 views • 1 year ago

Ollama 0.2 is here! Concurrency is now enabled by default. This unlocks 2 major features: Parallel requests Ollama can now serve multiple requests at the same time, using only a little bit of additional memory for each request. This enables use cases such as: - Handling multiple chat sessions at the same time - Hosting code completion LLMs for your team - Processing different parts of a document simultaneously - Running multiple agents at the same time Run multiple models Ollama now supports loading different models at the same time. This improves several use cases: - Retrieval Augmented Generation (RAG): both the embedding and text completion models can be loaded into memory simultaneously. - Agents: multiple versions of an agent can now run simultaneously - Running large and small models side-by-side Models are automatically loaded and unloaded based on requests and how much GPU memory is available.

Ollama 0.2 is here! Concurrency is now enabled by default. This unlocks 2 major features: Parallel requests Ollama can now serve multiple requests at the same time, using only a little bit of additional memory for each request. This enables use cases such as: - Handling multiple chat sessions at the same time - Hosting code completion LLMs for your team - Processing different parts of a document simultaneously - Running multiple agents at the same time Run multiple models Ollama now supports loading different models at the same time. This improves several use cases: - Retrieval Augmented Generation (RAG): both the embedding and text completion models can be loaded into memory simultaneously. - Agents: multiple versions of an agent can now run simultaneously - Running large and small models side-by-side Models are automatically loaded and unloaded based on requests and how much GPU memory is available.

219,409 views • 2 years ago

Ollama can now think! 🤔🤔🤔 For thinking models, and especially useful for very thoughtful models like DeepSeek-R1-0528, Ollama can separate the thoughts and the response. Thinking can also be disabled! This is useful for getting a direct response. This works across Ollama's CLI, API, and Python/JavaScript libraries. 🧵 blog post 👇👇👇

Ollama can now think! 🤔🤔🤔 For thinking models, and especially useful for very thoughtful models like DeepSeek-R1-0528, Ollama can separate the thoughts and the response. Thinking can also be disabled! This is useful for getting a direct response. This works across Ollama's CLI, API, and Python/JavaScript libraries. 🧵 blog post 👇👇👇

106,267 views • 1 year ago

ollama run llama3.1:405b Tested in with AMD MI300X 🤯

ollama run llama3.1:405b Tested in with AMD MI300X 🤯

98,168 views • 2 years ago

Download on or GitHub releases

Download on or GitHub releases

50,357 views • 11 months ago

No more content to load