Support NeoGAF

LordOfChaos · Apr 6, 2024

https://forums.anandtech.com/thread...dge-ryzen-9000.2607350/page-350#post-41186460

Revealed and supported by tech leaker Kepler_L2 and other tech enthusiasts.

The idea is to have a large iGPU with 16 RDNA 3.5 compute units (fixed and enhanced version of RDNA3) and enable much higher gaming performance with the addition of a large 16MB shared cache.

However, this die area has been replaced with a large NPU due to demands from Microsoft for ever more AI processing power to run its upcoming Windows AI productivity features. It is also claimed that Microsoft is doubling down on AI and will demand even larger NPUs in the future.

March Climber · Apr 6, 2024

If true, I find it odd that this was made into an 'either/or' situation instead of an 'and/both' situation.

LordOfChaos · Apr 6, 2024

March Climber said:
If true, I find it odd that this was made into an 'either/or' situation instead of an 'and/both' situation.

Die size has a cost, in theoretical terms AMD could have done both, in practical terms the added cost of the NPU probably took the cost that would have gone towards more cache

Hugare · Apr 6, 2024

March Climber said:
If true, I find it odd that this was made into an 'either/or' situation instead of an 'and/both' situation.

Chips are getting smaller and smaller, so its hard to have space for everything.

I imagine this is the reason for the "either/or" situation here

Buggy Loop · Apr 6, 2024

Everything will be AI eventually

It's probably the right direction

Even if it came to be that the game mechanics are shown on simple geometry, the AI could fill the rest to near real life graphics. What's the point of learning monte carlo path tracing and the power required? AI knows how it should be lit and shadowed at any time frame and weather.

Honestly AMD kind of needed a kick into AI once and for all. Rasterization is old news.

CLW · Apr 6, 2024

Sooooo what your saying is Phil Spencer messed things up for everyone…..AGAIN

Dr.D00p · Apr 6, 2024

I don't see the problem,really.

This will likely be used in games for much better hardware accelerated image reconstruction.

Ozriel · Apr 6, 2024

AMD’s rivals Intel, Qualcomm, Apple and MediaTek are heavily focused on NPUs in their chips and TOPs performance.

Definitely MS is pushing Copilot for Windows, but I’m skeptical that it’s just MS push that’s forced AMDs hand here. It’s played a part, for sure, but AMD certainly doesn’t want to be left behind.

Zathalus · Apr 6, 2024

Blaming Microsoft for what the entire SoC industry is doing seems a bit silly.

Buggy Loop · Apr 6, 2024

Zathalus said:
Blaming Microsoft for what the entire SoC industry is doing seems a bit silly.

Yea

AMD putting their head in the sand could not have lasted that long before investors flip the table.

Xyphie · Apr 6, 2024

I can see why Microsoft wants NPUs to become a standard in computers quickly and it's clear Windows 12 is moving towards some kind of NPU requirement (or at least X TOPS capable). Having stuff like better speech recognition to transcribe Teams meetings, better AI noise cancellation etc would be extremely useful features for corporate users, so the handful of people wishing for a better ROG Ally 2 is going to have to take a backseat to those use cases.

SantaC · Apr 6, 2024

Zathalus said:
Blaming Microsoft for what the entire SoC industry is doing seems a bit silly.

did you not read the title? Microsoft demanded it

Zathalus · Apr 6, 2024

SantaC said:
did you not read the title? Microsoft demanded it

Microsoft can't demand shit. They can request it and if AMD agrees they will include a NPU. Do you think Microsoft demanded that Apple, Qualcomm, Nvidia, Mediatek, and Intel all do the same? Advertising a NPU is basically a requirement for chip vendors these days.

Loomy · Apr 6, 2024

Zathalus said:
Microsoft can't demand shit.

They actually can. They are the biggest name in AI right now, and can - without much issue - not use AMD.

Zathalus · Apr 6, 2024

Loomy said:
They actually can. They are the biggest name in AI right now, and can - without much issue - not use AMD.

This is for desktop processors. What consumers use is up to them and OEMs. Microsoft cannot force AMD to build a processor a certain way, although they can make AI processing a requirement for some new Windows feature. That is still not forcing though, AMD is free to release chips that don't support it, but AI features is probably more useful then gaming on a iGPU.

Ozriel · Apr 6, 2024

Loomy said:
They actually can. They are the biggest name in AI right now, and can - without much issue - not use AMD.

None of their upcoming Windows hardware uses AMD. Their Surface lineup for business uses Intel, and their upcoming lineup for consumers is on the Qualcomm Snapdragon X Elite platform.

UniformDeliverance · Apr 6, 2024

This is the right call. Demand for ai is growing exponentially, faster and local ai processing is especially desired for commercial use.

Not everything is about gaming folks, AMD would have been foolish not to jump on the AI train.

LordOfChaos · Apr 6, 2024

Buggy Loop said:
Everything will be AI eventually

It's probably the right direction

Even if it came to be that the game mechanics are shown on simple geometry, the AI could fill the rest to near real life graphics. What's the point of learning monte carlo path tracing and the power required? AI knows how it should be lit and shadowed at any time frame and weather.

Honestly AMD kind of needed a kick into AI once and for all. Rasterization is old news.

Yeah, I don't see this as just a loss for gaming performance, I think increasingly the NPU will become a necessary third partner to the CPU and GPU in gaming

winjer · Apr 6, 2024

LordOfChaos said:
https://forums.anandtech.com/thread...dge-ryzen-9000.2607350/page-350#post-41186460

Revealed and supported by tech leaker Kepler_L2 and other tech enthusiasts.

The idea is to have a large iGPU with 16 RDNA 3.5 compute units (fixed and enhanced version of RDNA3) and enable much higher gaming performance with the addition of a large 16MB shared cache.

However, this die area has been replaced with a large NPU due to demands from Microsoft for ever more AI processing power to run its upcoming Windows AI productivity features. It is also claimed that Microsoft is doubling down on AI and will demand even larger NPUs in the future.

One thing does not invalidate the other.
AMD can very well have an L3 cache on top of the SoC, connected with TSVs, like on X3D chips. Or connected on the side, like DRNA3.
And leave the main die to have whatever they want, be it more CUS, NPUs, CPU cores, etc.

nemiroff · Apr 6, 2024

I don't know the full context of the story, but anyway, as already pointed out, traditional rendering is on it's way out. Most of what we see on a screen will be generated in the near future.

Dorfdad · Apr 6, 2024

Buggy Loop said:
Everything will be AI eventually

It's probably the right direction

Even if it came to be that the game mechanics are shown on simple geometry, the AI could fill the rest to near real life graphics. What's the point of learning monte carlo path tracing and the power required? AI knows how it should be lit and shadowed at any time frame and weather.

Honestly AMD kind of needed a kick into AI once and for all. Rasterization is old news.

Cant wait till developers can design games in 320p 30fps and AI will just fix the games to 8k/240

PaintTinJr · Apr 6, 2024

winjer said:
One thing does not invalidate the other.
AMD can very well have an L3 cache on top of the SoC, connected with TSVs, like on X3D chips. Or connected on the side, like DRNA3.
And leave the main die to have whatever they want, be it more CUS, NPUs, CPU cores, etc.

I'm guessing the design difference is that the NPU is designed to make really large variable counts, like +300, for linear inference equations do their inference on a single clock cycle, and the problem is that the data needs to all be in an L1 cache of the NPU to achieve that, meaning maybe 1MB-2MB of L1 cache just for that unit to handle up to 512x512 at FP32 (variables x constants inference).

I would have hoped that the NPU compute could be built from smaller compute units and used as smaller units, with big efficiency losses, if the NPU wasn't needed for AI tasks given the way AMD don't like to commit silicon to ASICS in the way Nvidia do, but I'm not seeing a way that is possible, especially with the NPUs needing to handle general inference equations of variable input-variable-counts and to eliminate time in matching the input variable count to the correct inference equations, and/or any other strategy to quickly traverse the complexity of using the knowns to derive needed unknown variables to feed to higher level equations needed to infer the AI answer.

There's also the need to be able to setup the NPU to rapidly handle commonly needed functionality as those mentioned in this article

What is an NPU: the new AI chips explained

What is an NPU? Possibly the biggest advance in computing in a generation

www.techradar.com

Which makes the NPUs sound very ASIC level and preloaded with vendor's own trained models(inference equations) for those sets of tasks.

Kazekage1981 · Apr 6, 2024

Xbox360 and XboxOne had some some sort of EDRAM/ESRAM to boost bandwidth, cant they use the same thing?

Mahavastu · Apr 6, 2024

Buggy Loop said:
Everything will be AI eventually

It's probably the right direction

On one way of course it is correct, but then the APUs are usually VERY bandwith starved, for example in games they usually really profit from overclocking the RAM in games, way more then CPUs without iGPU do. Adding more Cache would really help to bring the performance of the APU higher and maybe even bring it to some more or less usable level in some games.

OTOH most buyers of these APUs usually won't play anyway (those chips would probably still be too slow for most games) and use it mostly as an office PC, so adding AI is probably more useful for the average user in the future.

Kazekage1981 said:
Xbox360 and XboxOne had some some sort of EDRAM/ESRAM to boost bandwidth, cant they use the same thing?

EDRAM/ESRAM in the XBox was some kind of Cache and intended to reduce the need for bandwith. Anyway, you need to optimize your game for those and this just won't be done on PC games. A larger normal cache would be automagicly be used and reduce the hunger for bandwith

Tams · Apr 7, 2024

Fuck that shit.

Microsoft have almost never shown AMD support, choosing Chipzilla over them. Xbox and very briefly Surface are all they've offered.

FireFly · Apr 7, 2024

Strix Halo apparently features a 256-bit memory bus. So it looks like it's designed for laptops/portables only. On the other hand, that should give it 240 GB/s of bandwidth when combined with LPDDR5X 7500 which is already above Series S' 224.0 GB/s. So I think it will be fine when paired with fast enough memory.

Griffon · Apr 7, 2024

Shouldn't an NPU be a separate specialized chip? Aren't GPU just plain better at handling AI? Why are we taking precious CPU die space for a feature better served elsewhere?

Gudji · Apr 7, 2024

Well fuck here comes the power of the AI.

Support NeoGAF

AMD's next gen mobile chips were supposed to feature a large LLC to solve gaming memory bandwidth issues; Microsoft's demand for AI NPUs changed it

Member

Member

Member

Member

Member

Member

Member

M$FT

Member

Member

Member

Member

Member

Thinks Microaggressions are Real

Member

M$FT

Member

Member

Gold Member

Gold Member

Gold Member

Member

Member

Member

Member

Member

Member

Member

Similar threads