Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

Avieshek@lemmy.world · 21 days ago

Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity

Jeena@piefed.jeena.net · 21 days ago

Exactly, I’m in the same situation now and the 8GB in those cheaper cards don’t even let you run a 13B model. I’m trying to research if I can run a 13B one on a 3060 with 12 GB.

The Hobbyist@lemmy.zip · 21 days ago

You can. I’m running a 14B deepseek model on mine. It achieves 28 t/s.

levzzz@lemmy.world · 21 days ago

You need a pretty large context window to fit all the reasoning, ollama forces 2048 by default and more uses more memory

Viri4thus@feddit.org · 21 days ago

I also have a 3060, can you detail which framework (sglang, ollama, etc) you are using and how you got that speed? i’m having trouble reaching that level of performance. Thx

Jeena@piefed.jeena.net · 21 days ago

Oh nice, that’s faster than I imagined.