2 articles found
Meta AI’s new universal compression framework uses format-aware graphs to achieve unprecedented compression ratios while maintaining speed.
REAP pruning outperforms merging in MoE models, enabling near-lossless compression of 480B giants to local hardware