Also, I think you need a 40GB "card", not just 40GB of VRAM. I wrote about this upthread: you're probably going to need a single card; I'd be surprised if you could chain several GPUs together.
Oh right, I forgot that some diffusion models can't offload or split layers. I don't use vision generation models much at all; I was just going off LLM work. Apologies for the potential misinformation.
Nah, that won’t gain you much (if anything?) over just swapping layers to RAM. You can put the text encoder on the second card, but you can also just keep it in RAM with little downside; see the sketch below.
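For reference, if you're using something like Hugging Face's diffusers, the RAM-swap approach is a one-liner. A minimal sketch, assuming diffusers with accelerate installed (the SDXL model ID and prompt are just examples, not a specific recommendation):

```python
import torch
from diffusers import StableDiffusionXLPipeline

# Load in half precision to cut VRAM use (example model ID).
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,
)

# Coarse-grained offload: each submodel (text encoders, UNet, VAE) is
# moved to the GPU only while it's actually running; the rest sit in RAM.
# This is why a second card for the text encoder buys you little.
pipe.enable_model_cpu_offload()

# Finer-grained alternative (lowest VRAM, slower): offload per layer.
# pipe.enable_sequential_cpu_offload()

image = pipe("an astronaut riding a horse").images[0]
image.save("out.png")
```

The text encoder only runs once per prompt, so parking it in RAM and paying the one-time transfer cost is cheap compared to the denoising loop, which is where the GPU time actually goes.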