ikt@aussie.zone to LocalLLaMA@sh.itjust.works · English · 1 month ago
NVIDIA's GeForce RTX 4090 With 96GB VRAM Reportedly Exists; The GPU May Enter Mass Production Soon, Targeting AI Workloads (wccftech.com)
hendrik@palaver.p3x.de · 1 month ago
Hmmh, the 4090 is kind of the wrong choice for this, due to its memory bus width… For AI workloads, and especially if you want to attach a lot of memory, you want the widest bus possible.

brucethemoose@lemmy.world · 21 days ago
It's 384-bit? That's not bad; 512-bit is super expensive and basically only exists on the 5090 die.

Also, it seems LLMs are drifting toward being less memory-speed bound, judging by the diffusion-model experiments.
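For anyone wondering why bus width matters here: peak memory bandwidth is roughly bus width (in bytes) times the per-pin data rate. A quick sketch, using approximate published specs for the 4090 (384-bit GDDR6X at ~21 Gbps) and 5090 (512-bit GDDR7 at ~28 Gbps) as assumptions, not figures from the thread:

```python
def mem_bandwidth_gbps(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak theoretical memory bandwidth in GB/s:
    bus width in bytes * per-pin data rate in Gbps."""
    return bus_width_bits / 8 * data_rate_gbps

# Approximate specs (assumptions): RTX 4090 vs RTX 5090
print(mem_bandwidth_gbps(384, 21))  # 1008.0 GB/s
print(mem_bandwidth_gbps(512, 28))  # 1792.0 GB/s
```

So the wider 512-bit bus buys the 5090 roughly 1.8x the bandwidth, which is the constraint hendrik is pointing at for token-generation speed on big models.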