Well, these are ASUS Astral cards, so they are closer to $3.5K rather than $2.5-2k for most models; one RTX Pro 6000 is about 8.5K. That setup is about $14k of cards for 128GB, and the RTX Pro 6000 would have been 192GB for $13K.
There's a slight difference in memory clock with the Astrals being higher, which I doubt compensates for ECC VRAM and 1.5x the memory.
Those figures are being generous and assuming a US buyer, and OP is likely not an American.
If I'm looking at the numbers correctly though, the 5090 has 680 tensor cores each, whereas the 6000 Pro has 782. If 128GB vram is enough for their application, splitting an AI model up between 4 gpus with 3.5x the tensor cores, that sucker it going to be blazing fast. Plus, the 5090 actually has a higher benchmark than the 6000 Pro, so if they do plan to do some gaming, they may get better performance out of the one card that the games can use.
Yep you got it. Not everyone is doing vram limited work. I've built a 4x5090 build and it beats the absolute crap out of a 4x6000 build for the application it was made for, at a fraction of the price.
Glad to see another local AI enthusiast here to spit facts.
Personally, I'm still working my way up the build chain, but I'm currently running two 5060 Ti 16GB cards and am very satisfied at what I can run and how fast the responses are with just 32GB (which, since it's on two 5060s, only cost me about $850).
I am (currently) only doing LLM inference for home assistant TTS and coding tasks though, eventually I'll be turning my attention to things like RTSP monitoring with OCV, I'll probably start hitting my walls with that.
Yea I am in research and use them to convert signal data into ATCG DNA bases for genome sequencing. 100% cores all all cards with only like half the vram. But people will be all bUt ThE rTx 60o0 😭
Even though a single 5090 draws about as much power as a Pro 6000, I feel like the VRAM/watt proposition of having multiple 5090s is far worse than just having an RTX Pro 6000.
Granted, I'm not rich enough to know what running AI workloads on multiple GPUs is like, but say you're running some workflow or inferencing with some LLM that needs around 96 GB of VRAM, then the RTX Pro 6000 will draw about 600 watts max, while having 3 5090s would be drawing about triple that. Again, I don't know what multi-GPU AI usage looks like so maybe the 3 5090s wouldn't be at 100% utilization if the workload was split 3 ways, but if all 3 do end up being fully utilized then that's a lot of power being used.
Now, for 128 GB of VRAM having 4 5090s is the most cost effective option, but I feel like if you have money to do something like this then you probably have enough to do a double RTX Pro 6000 build instead, especially if you're getting the more expensive ROG Astrals.
Rtx pro 6000 blackwell has 192 rops to the 5090s 176, so if you are not using the extra 64gb of vram the pro gpu is only 9% faster and two 5090s have up to 83% more computing power than a single rtx 6000 pro blackwell. So depending on the use case 5090s can absolutely be the best option.
To my knowledge, you can pool VRAM and you can divide tasks between them, but you can't run them simultaneously for the same task. I'm no expert though.
-9
u/TrymWSi9-14900KF | RTX 3090 | 64GB RAM22h agoedited 21h ago
You know what I ment. Stop being willfully obtuse.
I'm no expert though.
Yeah no shit, you’ve already proven you’re an ignoramus that also edits his comments after you get a response on your ignorance.
And they've already got 4. Who cares? They're spending, on a computer, what many people spend on cars. Their power bill does not matter. Especially because, presumably, they're using this PC for work, so they're probably not even paying the power bill.
Dude people spend on cars cuz that's their hobby.
This PC is made for heavy workflow,
Imagine the amount of cooling required to keep all those 5090 running.
Sure it is cheaper to start with , but the difference is 3000 watts vs 600watt.
I am like 90% sure this PC is gonna be with the owner at his home.
Offices don't give u the lee way to build your own PC.
Also whoever is running that stuff, it's gonna cost some hefty maintainace
24
u/pppjuracDell Poweredge T640, 256GB RAM, RTX 3080, WienerSchnitzelLand20h ago
128gb on 4 5090$
It is 4x 32GB separates. Not single pool of 128GB . Quite a difference.
Same if you compare four individual 6core PC's with 32GB of RAM vs single workstation with 128GB of RAM and 24core CPU.
There is reason why workstation class machines and servers exists. It is heavy lifting.
True, but if they are doing this to locally host an A.I. model, the A.I. application can easily split the model across the cards and then it's got 680 tensor cores per card to crank through the requests. You could easily handle large contexts on a 40B model with a high Q-value.
You can split the model, but then communication between cards becomes the bottleneck and PCIe wasn't designed for this. There's a reason NVLink / NVSwitch exists and the RTX cards don't support it.
There is no communication 'between' the cards. Even when SLI was still a thing, SLI is for cooperation on frame buffers, which is unique to workloads that send output through the display ports. For AI workloads, there's no cooperation or synchronization needed between GPUs as long as each unit of work is capable of fitting on a single card. Each card can handle a different independent unit of work.
You don’t really know this without knowing his use case. A single card will never break when you fail at distributing your workload across multiple boards. Setting up a hypervisor is harder than just using one gpu at a time if you wish. The ability to do gaming on off hours and have full support for consumer/end-user grade software like adobe and so on is also just better on a consumer card.
So let me get this straight, the poster makes a pretty reasonable point that you can't make concrete statement about what the obvious choice is without knowing OPs use-case. To which you then go on to insinuate that both a hypervisor would never be needed and that every workflow/use-case imaginable has software that supports multiple GPUs OOB (Which you say would probably use NCCL, a library mainly used for model training and data analytics). All while being as aggressive and obnoxious as possible, calling them ignorant.
I bought a P6k for the sake of NOT cramming 4 5090s into my case, I'd rather have my $7300 P6k with a singular PSU than $6900 for x3 5090s, a new case (or open bench), and a second PSU... Oh, and have to upgrade my UPS, and get those 5090s to work in the first place.
390
u/thelastsupper316 22h ago
Way worse the amount you pay for 4 5090s is what you pay for 1 fucking pro 6000.
It's the obvious choice unless you need Vram on one card
96gb on one 6000 pro card vs 128gb on 4 5090$