TechnologyHacker News• 4h agoRe-quantizing a local LLM 14x faster by skipping the tensors that didn't change4 points, 0 comments on Hacker News