The complete LLM quantization guide! Cut memory by 87.5% with INT4, boost throughput 43% with FP8. GPTQ vs AWQ vs GGUF comparison, Llama 3 quantization benchmarks, under 2% accuracy loss down to Q4! Plus pruning + knowledge distillation compression techniques, per-hardware recommendations, and QLoRA fine-tuning!
#AWQ #FP8 #GGUF #GPTQ #INT4 #INT8 #KnowledgeDistillation #Llama3 #llamacpp
doyouknow.kr/618/llm-quan...
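The 87.5% figure in the post above is simple bit arithmetic (32-bit FP32 weights down to 4-bit INT4 is an 8x reduction). A minimal sketch of that calculation, assuming an 8B-parameter model as the example and ignoring activations, KV cache, and quantization overhead such as scales and zero-points:

```python
# Rough weight-memory estimate for a quantized LLM -- a sketch, not a
# full memory model (activations, KV cache, and scale/zero-point
# overhead are deliberately ignored).
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB: params * bits / 8 bits-per-byte."""
    return n_params * bits_per_weight / 8 / 1e9

params = 8e9  # e.g. a Llama 3 8B-class model (illustrative choice)
fp32 = weight_memory_gb(params, 32)   # 32.0 GB
int4 = weight_memory_gb(params, 4)    #  4.0 GB
saving = 1 - int4 / fp32              # 0.875 -> the 87.5% reduction
print(f"FP32: {fp32:.1f} GB, INT4: {int4:.1f} GB, saved {saving:.1%}")
```

Note that if the baseline is FP16 rather than FP32, the same arithmetic gives a 75% reduction for INT4.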
GPTQ Quantization Linked to Babai’s Nearest Plane Algorithm
Researchers found that GPTQ post-training quantization is equivalent to Babai's nearest-plane algorithm, and that clipping-free variants outperform the original GPTQ on benchmarks. getnews.me/gptq-quantization-linked... #gptq #quantization #babaisalgorithm
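For context on the equivalence claimed above: Babai's nearest-plane algorithm finds a lattice point close to a target by rounding one Gram-Schmidt coordinate at a time, much as GPTQ rounds weights in sequence. A toy sketch (the function name and QR-based Gram-Schmidt step are illustrative choices, not from the cited work):

```python
import numpy as np

def babai_nearest_plane(B, t):
    """Babai's nearest-plane algorithm: find integer coefficients z so
    that the lattice point B @ z is close to target t, rounding one
    Gram-Schmidt coordinate at a time, from last basis vector to first."""
    B = np.asarray(B, float)
    t = np.asarray(t, float).copy()
    Q, _ = np.linalg.qr(B)  # columns of Q span the Gram-Schmidt directions
    n = B.shape[1]
    z = np.zeros(n)
    for i in reversed(range(n)):
        # Coefficient of the residual target along the i-th direction
        c = (Q[:, i] @ t) / (Q[:, i] @ B[:, i])
        z[i] = np.round(c)
        t -= z[i] * B[:, i]  # subtract the chosen lattice component
    return z
```

With the identity basis this reduces to plain per-coordinate rounding, which is the lattice-view analogue of round-to-nearest quantization.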
Most people think quantization just shrinks models. But GPTQ, a Post-Training Quantization technique, shows you can get near full-precision accuracy without retraining. It quantizes weights one at a time and uses second-order (Hessian) information to push each rounding error onto the weights not yet quantized. Makes Edge AI sharper. #EdgeAI #GPTQ #Quantization
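The second-order trick mentioned above can be sketched in a few lines. This is a toy single-row version, not the real blocked GPTQ kernel (which uses a Cholesky factorization of the inverse Hessian and per-group scales); the function names are illustrative:

```python
import numpy as np

def quantize_rtn(w, scale):
    """Plain round-to-nearest quantization onto a uniform grid."""
    return np.round(np.asarray(w, float) / scale) * scale

def quantize_gptq_row(w, H_inv, scale):
    """Toy GPTQ-style pass over one weight row: quantize each weight in
    turn, then spread its rounding error onto the not-yet-quantized
    weights via the inverse Hessian (the second-order update)."""
    w = np.asarray(w, float).copy()
    q = np.empty_like(w)
    for i in range(len(w)):
        q[i] = np.round(w[i] / scale) * scale
        err = (w[i] - q[i]) / H_inv[i, i]
        # Compensate: adjust remaining weights to cancel this error
        w[i + 1:] -= err * H_inv[i, i + 1:]
    return q
```

In practice `H_inv` comes from a Hessian proxy built on calibration data (roughly the inverse of `2 * X.T @ X` for layer inputs `X`); with a diagonal Hessian the update vanishes and the method degenerates to round-to-nearest.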