MLX + TurboQuant = Local Super Power
Take a local (private) document, set of documents, or codebase, pre-fill the 256k KV cache (the context) with the documents and the system prompt, quantise, and run on Apple's MLX, and you have almost instantaneous, lossless document queries with total privacy. For a 75-page PDF (some...
85,532 views • 2 months ago •via X (Twitter)
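To give a rough sense of why quantising a pre-filled 256k-token KV cache matters on a local machine, here is a back-of-the-envelope sizing sketch. It is not the post's method: the layer count, KV head count, and head dimension are hypothetical values picked for illustration, "256k" is assumed to mean 256 × 1024 tokens, and the bit widths are generic examples rather than TurboQuant's actual settings. It also ignores the small overhead of quantisation scales and offsets.

```python
def kv_cache_bytes(tokens, layers, kv_heads, head_dim, bits):
    """Approximate KV cache size in bytes.

    Every token stores one key vector and one value vector
    per layer and per KV head, each of length head_dim.
    """
    elements = 2 * layers * kv_heads * head_dim * tokens  # 2 = keys + values
    return elements * bits / 8


# Hypothetical model shape, for illustration only (not from the post).
TOKENS = 256 * 1024  # assumes "256k" means 256 * 1024 tokens
LAYERS, KV_HEADS, HEAD_DIM = 32, 8, 128

for bits in (16, 8, 4):
    gib = kv_cache_bytes(TOKENS, LAYERS, KV_HEADS, HEAD_DIM, bits) / 2**30
    print(f"{bits:>2}-bit KV cache: {gib:.1f} GiB")
```

Under these assumed dimensions the full-context cache is 32 GiB at 16 bits per element, 16 GiB at 8 bits, and 8 GiB at 4 bits, which is the difference between fitting in unified memory on a typical Apple Silicon machine and not fitting at all.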
