Loading video...
Video Failed to Load
First steps for a specialized DeepSeek v4 Flash inference engine focused on inference quality / stability at different quantizations, with networked API that is batching capable. This is the 2 bit quants model running on my M3 Max 128GB.
14,176 views • 1 month ago •via X (Twitter)
0 Comments
No comments available
Comments from the original post will appear here
