SPBHM wrote on 2020-03-19, 07:30:
one difference is that in a PS4 the CPU portion can only access something around 20GB/s max, while the GPU has always 100+GB/s
that's not like the behavior of current PC IGPs
Well, not entirely, and that's not the only difference -
- The Liverpool SoC of the original PS4 came out in 2013, and it's something like this:
4 Jaguar cores, a GCN 2 based GPU with 18 Compute Units, which gives you 1152 Unified Shader Processors, 72 Texture Mapping Units, 32 Render Output Units, all accounting for around 1800 Gigaflops, and 8GB of 256 bit data path GDDR5 unified RAM with a potential max bandwidth of 176GBytes/sec.
I have no idea what the CPU interconnect with the GDDR5 RAM will look like, but it should not be something as low as 30GByte/sec. I have no idea what the thermal envelope looks like, but I am guessing at least 65w.
The AMD "stock" APU from 2013 closest to the Liverpool is something like a GX-420CA, which looks something like this:
4 Jaguar cores, a GCN 1 based GPU with 2 Compute Units, which gives you 128 Unified Shader Processors, 8 Texture Mapping Units, 4 Render Output Units, all accounting for around 150 Gigaflops, and it can be connected to a maximum of 32GB DDR3-1600 unified RAM via a single channel, giving you approximately 12.8GBytes/sec
The entire thing runs on a 35w TDP envelope.
Now, for a more fair comparison (since the GPU on that GX420CA is kinda old and the CU count is like 1/10th of the Liverpool), we can step one year ahead and look at an embedded SoC from AMD, which is an RX-427BB (it's also sold as an FX-7600p and made it into a few Asus laptops), which is the closest to Liverpool.
4 Steamroller cores, a GCN 2 based GPU with 8 Compute Units, which gives you 512 Unified Shader Processors, 32 Texture Mapping Units, 8 Render Output Units, all accounting for around 615 Gigaflops, and it can be connected to a maximum of 32GB DDR3-2133 unified RAM via dual channels, giving you approximately 28 GBytes/sec
The entire thing runs on a 35w TDP envelope.
If you look at the RX-427BB, the overall GPU output is about even compared to (or slightly better than) the Intel Iris Pro 5200 on my Haswell machine (that Iris Pro also has a 128MB L4 cache that works rather like the high bandwidth VRAM in most GPUs). To get embedded graphics close to that of the original PS4 just on a pure compute horsepower standpoint, you'll need to look at, say, the Ryzen embedded V1807B with the Polaris 11 embedded GPU. That's 11 Compute Units, which gives you 704 Unified Shader Processors, 44 Texture Mapping Units, 16 Render Output Units, all accounting for around 1850 Gigaflops, and it can be connected to a maximum of 64GB DDR4-2400 unified RAM via dual channels, giving you approximately 40 GBytes/sec. It's still nowhere near the memory bandwidth of the PS4 Liverpool, though.
Why yes, the PS4 is overpowered as a UMA, but it's designed that way - it is after all a gaming oriented platform. You could also say the same about the 4GB of HBM2 Memory embedded on-die in my Kaby Lake-G machine (that one is a Polaris 22 with some Vega features).