692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU
692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU

Aziz_Lamyae

6 min0 plays0 favorites
Success & Inspiration
Play

Description

<p>Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week&apos;s episode.<br/><br/>Additional materials: <a href='https://www.superdatascience.com/692'>www.superdatascience.com/692</a><br/><br/>Interested in sponsoring a SuperDataScience Podcast episode? Visit <a href='https://www.jonkrohn.com/podcast'>JonKrohn.com/podcast</a> for sponsorship information.</p>

Creators

kira.music

kira.music

Creator