1 article found
Moonshot AI's hybrid architecture delivers 6x decoding speed with 75% less memory, making 1M-token contexts actually practical.