MLX + TurboQuant = Local Super Power

Take a local (private) document or codebase, pre-fill the 256k KV cache (the context) with the documents and system prompt, quantise, and run on Apple's MLX, and you have almost instantaneous, lossless document queries with total privacy. For a 75-page PDF (some...
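To get a feel for why quantising a pre-filled 256k cache matters, here is a rough back-of-the-envelope sketch of KV cache size. The model shape (32 layers, 8 KV heads, head dimension 128) and the 4-bit width are assumptions picked for illustration, not figures from the post or from TurboQuant, and the estimate ignores the small overhead of quantisation scales.

```python
def kv_cache_bytes(tokens, layers, kv_heads, head_dim, bits):
    """Approximate KV cache size: keys and values for every layer and token."""
    elements = 2 * layers * kv_heads * head_dim * tokens  # 2 = keys + values
    return elements * bits // 8


# Hypothetical model shape, chosen only for illustration (not from the post).
LAYERS, KV_HEADS, HEAD_DIM = 32, 8, 128
TOKENS = 256 * 1024  # the 256k context mentioned in the post

for bits in (16, 4):  # 4-bit is an assumed width, not TurboQuant's actual setting
    gib = kv_cache_bytes(TOKENS, LAYERS, KV_HEADS, HEAD_DIM, bits) / 2**30
    print(f"{bits}-bit KV cache: {gib:.1f} GiB")
# prints:
# 16-bit KV cache: 32.0 GiB
# 4-bit KV cache: 8.0 GiB
```

Under these assumed numbers, a full 256k cache shrinks from 32 GiB to 8 GiB, which is the difference between not fitting and fitting in the unified memory of many Apple machines.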
85,589 views • 2 months ago • via X (Twitter)
