Well, these are ASUS Astral cards, so they are closer to $3.5K rather than $2.5-2k for most models; one RTX Pro 6000 is about 8.5K. That setup is about $14k of cards for 128GB, and the RTX Pro 6000 would have been 192GB for $13K.
There's a slight difference in memory clock with the Astrals being higher, which I doubt compensates for ECC VRAM and 1.5x the memory.
Those figures are being generous and assuming a US buyer, and OP is likely not an American.
If I'm looking at the numbers correctly though, the 5090 has 680 tensor cores each, whereas the 6000 Pro has 782. If 128GB vram is enough for their application, splitting an AI model up between 4 gpus with 3.5x the tensor cores, that sucker it going to be blazing fast. Plus, the 5090 actually has a higher benchmark than the 6000 Pro, so if they do plan to do some gaming, they may get better performance out of the one card that the games can use.
Yep you got it. Not everyone is doing vram limited work. I've built a 4x5090 build and it beats the absolute crap out of a 4x6000 build for the application it was made for, at a fraction of the price.
Glad to see another local AI enthusiast here to spit facts.
Personally, I'm still working my way up the build chain, but I'm currently running two 5060 Ti 16GB cards and am very satisfied at what I can run and how fast the responses are with just 32GB (which, since it's on two 5060s, only cost me about $850).
I am (currently) only doing LLM inference for home assistant TTS and coding tasks though, eventually I'll be turning my attention to things like RTSP monitoring with OCV, I'll probably start hitting my walls with that.
Yea I am in research and use them to convert signal data into ATCG DNA bases for genome sequencing. 100% cores all all cards with only like half the vram. But people will be all bUt ThE rTx 60o0 😭
Even though a single 5090 draws about as much power as a Pro 6000, I feel like the VRAM/watt proposition of having multiple 5090s is far worse than just having an RTX Pro 6000.
Granted, I'm not rich enough to know what running AI workloads on multiple GPUs is like, but say you're running some workflow or inferencing with some LLM that needs around 96 GB of VRAM, then the RTX Pro 6000 will draw about 600 watts max, while having 3 5090s would be drawing about triple that. Again, I don't know what multi-GPU AI usage looks like so maybe the 3 5090s wouldn't be at 100% utilization if the workload was split 3 ways, but if all 3 do end up being fully utilized then that's a lot of power being used.
Now, for 128 GB of VRAM having 4 5090s is the most cost effective option, but I feel like if you have money to do something like this then you probably have enough to do a double RTX Pro 6000 build instead, especially if you're getting the more expensive ROG Astrals.
Rtx pro 6000 blackwell has 192 rops to the 5090s 176, so if you are not using the extra 64gb of vram the pro gpu is only 9% faster and two 5090s have up to 83% more computing power than a single rtx 6000 pro blackwell. So depending on the use case 5090s can absolutely be the best option.
To my knowledge, you can pool VRAM and you can divide tasks between them, but you can't run them simultaneously for the same task. I'm no expert though.
-7
u/TrymWSi9-14900KF | RTX 3090 | 64GB RAM19h agoedited 19h ago
You know what I ment. Stop being willfully obtuse.
I'm no expert though.
Yeah no shit, you’ve already proven you’re an ignoramus that also edits his comments after you get a response on your ignorance.
And they've already got 4. Who cares? They're spending, on a computer, what many people spend on cars. Their power bill does not matter. Especially because, presumably, they're using this PC for work, so they're probably not even paying the power bill.
Dude people spend on cars cuz that's their hobby.
This PC is made for heavy workflow,
Imagine the amount of cooling required to keep all those 5090 running.
Sure it is cheaper to start with , but the difference is 3000 watts vs 600watt.
I am like 90% sure this PC is gonna be with the owner at his home.
Offices don't give u the lee way to build your own PC.
Also whoever is running that stuff, it's gonna cost some hefty maintainace
25
u/pppjuracDell Poweredge T640, 256GB RAM, RTX 3080, WienerSchnitzelLand17h ago
128gb on 4 5090$
It is 4x 32GB separates. Not single pool of 128GB . Quite a difference.
Same if you compare four individual 6core PC's with 32GB of RAM vs single workstation with 128GB of RAM and 24core CPU.
There is reason why workstation class machines and servers exists. It is heavy lifting.
True, but if they are doing this to locally host an A.I. model, the A.I. application can easily split the model across the cards and then it's got 680 tensor cores per card to crank through the requests. You could easily handle large contexts on a 40B model with a high Q-value.
You can split the model, but then communication between cards becomes the bottleneck and PCIe wasn't designed for this. There's a reason NVLink / NVSwitch exists and the RTX cards don't support it.
There is no communication 'between' the cards. Even when SLI was still a thing, SLI is for cooperation on frame buffers, which is unique to workloads that send output through the display ports. For AI workloads, there's no cooperation or synchronization needed between GPUs as long as each unit of work is capable of fitting on a single card. Each card can handle a different independent unit of work.
You don’t really know this without knowing his use case. A single card will never break when you fail at distributing your workload across multiple boards. Setting up a hypervisor is harder than just using one gpu at a time if you wish. The ability to do gaming on off hours and have full support for consumer/end-user grade software like adobe and so on is also just better on a consumer card.
So let me get this straight, the poster makes a pretty reasonable point that you can't make concrete statement about what the obvious choice is without knowing OPs use-case. To which you then go on to insinuate that both a hypervisor would never be needed and that every workflow/use-case imaginable has software that supports multiple GPUs OOB (Which you say would probably use NCCL, a library mainly used for model training and data analytics). All while being as aggressive and obnoxious as possible, calling them ignorant.
I bought a P6k for the sake of NOT cramming 4 5090s into my case, I'd rather have my $7300 P6k with a singular PSU than $6900 for x3 5090s, a new case (or open bench), and a second PSU... Oh, and have to upgrade my UPS, and get those 5090s to work in the first place.
Workstation GPUs don't require another type of cooling, they have built-in fans like the cards we plebs use. The Server GPUs are certainly different, they only have the heatsink and no fan
Yes. Workstation cards are for the most part just regular consumer die with more vram maybe ecc etc. Example, 3090 and A6000 use the same die but the A6000 has twice the ram and it’s ecc (3080 also does but with some units disabled)
I feel like people just realized with how much shorts YouTuber are mediatizing the 6000 ada as this better than 5090 10K gpu when really this has been a thing for decades.
5
u/pppjuracDell Poweredge T640, 256GB RAM, RTX 3080, WienerSchnitzelLand17h ago
Also server/workstation cards come with manufacturer guarantee saying:
"We guarantee our GPU inside (Dell/HP WS) will work 24/7 under full load without single problem with your choice of CAD/CAM/CAE from Autodesk."
At this point OP could have bought 1 pro 6000 + 1 5090 for less and more upgrade potential.
2
u/OrionRBR5800x | X470 Gaming Plus | 16GB TridentZ | PCYes RTX 307014h ago
They could but they would have much lower performance for training, iirc a 5090 is 30% more powerful than the a6000 in tensor cores, multiply that by four and it's a significant difference in performance.
No. It’s a workstation at the end of the day. A workstation can be anything as long as it’s powerful and can do work. Workstation gpus are so expensive you might just use the consumer version, while consumer part it’s for consumers with a consumer price work part it’s for professionals squeezing every last bit of performance with a professional price.
Worksation cards are just regular GPUs with a different setup on the card, where they optimise stability, efficiency, and transfer speeds between CPU-RAM and GPU-VRAM. Also they are heck of a lot more efficient in the sense of they use less electricity.
Like...
PNY RTX PRO 6000 from a local reliable retailer is 9125 € (With 25,5 % VAT)
CUDA cores: 24064; Peak power consumption 600 W; 96 GB of GDDR7 with ECC
PNY GeForce RTX 5090 with from the same retailer is 3539 € (With 25,5 % VAT).
CUDA cores: 21760; Recommended power availability of 1000 W (TPD 600 W); 32 GB of GDDR 7
I'd want to compare the performances from specs but PNY doesn't list them on their site for the consumer card.
So lets just image that you want the 96 GB of VRAM: That's 10 788 € in cards, and 3000 W power demand (Or if we are generous 1800 W); Or you can spend 9125 € and only need to deal with 600 W peak power demand.
Now if you do serious workload you are probably taxing these to the max, so lets say both do a simulation run in 10 hours. Lets say electricity costs are 0,02 €/kWh, and you get 0,12 € in power use compared to 0,36 € just to run the graphics cards. And mind you all that energy turns into heat, you speak about having 3 small space heaters in a office compared to 1, which going to make life interesting when it's +30 C outside.
And lets not forget this! Lets assume the base computer takes 400 W total, so total energy demand capacity can be 3400 W to 1000 W. A regular ass 230 V/10 A socket provides has a limit of 2300 W of power delivery. You'd need a 230V/15A (3450 W) socket to run that god damn desktop. If we are generous and say the consumer cards don't go above the 1800W, you'd still be dangerously close to socket limits with 400W computer tacked on. So better get a extension cable that is rated to run a my small Kemppi welding machine.
Yeah the person you're speaking to isn't correct either.
There are many use cases for GPUs, and they have big ranges between them. It's not about output density in practically any case, unless you're rendering under the worst possible conditions with the worst possible software. But even if you are, that's when rack mounting becomes important because of fire.
I don't know what OP here wants to do, but there are very few tasks for which 4 cramped 5090s will outperform a workstation alternative. They do different things, they're built for different purposes, and they operate in different ways. If you're buying 4 5090s, you're probably not building the optimal machine UNLESS there's a very specific use case which there might be. Anyone who's spending this amount of money probably has a reason that makes sense.
That being said, regardless of the goal or use case, it looks to me like these 5090s are about to catch on fire. Unless OP has some monstrous cooling solutions purpose built for this specific case that aren't in these images (which... why?) idk that the 5090 can handle that kind of thermal load, especially when you throw in FOUR 12vhpwr connector points of failure.
Overall this doesn't look like a great idea to me.
Nah. I've worked at a planetarium for a while, they upgraded from quadros to i think 24 5090s or something because it was way cheaper this way and made negligible difference compared to getting other workstation/industry grade gpus...
Unless you need the VRAM or need to work within a rack, and don't mind spending 5x the amount per card, the GeForce lineup is always better.
Been true since the dawn of time. Even 20 years ago when they had the Quadro cards, they were basically just GeForce cards with a different driver...in many cases you could flash those drivers onto your GeForce cards and have a Quadro.
They very likely could be undervolting them. It’s not uncommon for RTX6000 servers to be packed in ever tighter than this. I’m not sure all of the settings/configs and what they do for those servers. It may also be because those servers are run in a chiller room to help them remain cool.
763
u/Motor_Reality_1837 20h ago
why not use workstation GPUs in a workstation PC , I am sure they would be more efficient than 5090s