As a CPU buyer, I'd like AMD and Intel to get stuck at 50/50 market share and fight tooth and nail by introducing cheaper and faster models every year. And maybe discover low-power computing, although that's wishful thinking.
The upcoming Phoenix APUs from AMD are a game-changer for portable and handheld devices. Between 15 and 45W TDP, Zen4/RDNA3... They're slowly trickling out to thin and light laptops, but I can't wait to get them in the next Steam Deck killer.
Not as nice as my old Atom router because of the active cooling on the CPU and the ATX power connector. The latter I can fix with a PicoPSU but the former...
I'd rather have a passive radiator on the CPU and add some airflow via a huge case fan - which is what I'm doing now for my router box. That way it's a lot quieter.
I could totally believe that in another year or two there will be sufficient performance uplift. A Steam Deck with 50% more performance at the same battery life would be great.
I hope a dynamic 120Hz screen (like in the iPhone but obviously not as screamingly expensive) is also in the pipeline.
With a 120Hz screen, you can run the UI and overlays at 120FPS, run cinematics at 24FPS or 30FPS and the game at 40FPS or 60FPS. All fits neatly into 120Hz.
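To spell it out, 24, 30, 40 and 60 all divide 120 evenly, so every frame gets held for a whole number of refreshes and nothing judders. A quick sanity check (plain Python, just arithmetic):

    # Each target frame rate divides 120 Hz evenly, so a frame is shown
    # for a whole number of refreshes: no uneven frame pacing.
    for fps in (24, 30, 40, 60):
        print(f"{fps:>3} fps -> held for {120 // fps} refreshes, remainder {120 % fps}")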
They are going to need more performance by then too though. They are tied to the PC gaming universe and the games people want to play will keep getting more demanding.
They have it a lot harder than something like Switch.
The basic system requirements for most PC games have barely budged in years, mostly due to high GPU costs and a lack of killer features that would require higher requirements. Neither factor seems to be immediately changing.
We already had a game changer, Apple Silicon. AMD chips are a bit better than Intel's on mobile but not by much. Mobile Zen4 and RDNA3 SoC would not be a game changer. We already know how they perform on desktop and how much power they use per performance.
Let me know when Apple decides to sell their chips to 3rd party manufacturers, when they end up in a handheld gaming device, and when most games run natively on ARM with competitive performance.
Apple silicon was only a game changer for productivity tasks, and only for people willing to jump into the Apple ecosystem. In all other cases, especially for gaming, an APU with the performance of Zen4/RDNA3 at the announced TDP doesn't exist yet. So, yes, it's a game changer.
> when most games run natively on ARM with competitive performance.
They just need to run natively on ARM to get performance that's more than competitive on the CPU side. The GPU is no slouch but is not top of the line (yet). Seems trivial to pair the ARM CPU with an external GPU.
So why would you phrase your argument as "it's not a game changer because it boosts if there's available power/cooling", rather than "it's not a game changer because Apple's CPUs are better"?
Other benchmarks like C-ray show ZERO slowdown going from 230W down to 125W and even 105W. Intel drops by 21% at 125W and 30% at 105W.
Personally I'd rather get the 7900X (or the non-X variant), where you'd keep an even larger fraction of the performance within a given power limit. It's pretty clear which vendor has the weaker core and has to push further up the high-clock, terrible-power-efficiency end of the curve for not much gain.
I'm a little dubious of benchmarks that compare only one pair of chips when it comes to details like power efficiency and tweaking. The silicon lottery can make a big difference.
Sure. Various forums corroborate that running the Ryzen 7000 series at a 65W TDP is often less than a 10% decrease in performance.
Also keep in mind that "performance mode" on Intel can significantly increase the default 253 watt TDP. AnandTech's review of the i9-13900K hit 380 watts, I believe.
So sure there's chip to chip variation, but generally the AMD chips are more power efficient and have much lower penalties for reducing the TDP.
I'm not sure whether I've won the lottery or Intel's software is unreliable.
I've been playing around with the 13900K's settings and so far I've managed to get it down to a stable 210W TDP (according to XTU under stress testing). That's with a 0.110V undervolt (likely could do more) and the P-core turbo multipliers tapering down to 50 with all cores active, from 57 with 1 core active. I've yet to mess with the E-cores though; could probably shave off some more perf/W still.
I think managing a 100mV undervolt is pretty common for the 13900K. That's what I have as well, with a power limit of 200W. The penalty in cinebench is only ~5% despite the 20% reduction in power.
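If you're on Linux and want to poke at this without XTU, the powercap interface exposes the long-term package limit. A minimal sketch, assuming the intel_rapl driver is loaded and the package shows up as intel-rapl:0 (paths vary by kernel/platform, and writing needs root):

    from pathlib import Path

    # Long-term (PL1) package power limit, in microwatts.
    # Assumption: package 0 lives at this path on your system.
    LIMIT = Path("/sys/class/powercap/intel-rapl:0/constraint_0_power_limit_uw")

    def read_limit_watts() -> float:
        return int(LIMIT.read_text()) / 1_000_000

    def set_limit_watts(watts: float) -> None:
        LIMIT.write_text(str(int(watts * 1_000_000)))  # needs root

    print(f"current long-term limit: {read_limit_watts():.0f} W")
    # set_limit_watts(200)  # e.g. cap the package at ~200 W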
The easiest way is to go into the BIOS or CPU settings and set a negative Curve Optimizer offset under Precision Boost Overdrive (PBO). That'll reduce the stock voltage at a given frequency. It'll also potentially speed your chip up, since it lets the chip boost to higher frequencies at a given power level. I think the lowest you can go is -30, but try -20 or so and see if your system is stable before trying lower values.
I just tested my 7900x yesterday and it gets about 95% of the max performance at 105W TDP.
The system idles at half the wattage, too, so it's saving a ton of energy not running at the stock 170W TDP. Not sure why AMD didn't go for the efficiency crown, especially for non-halo SKUs like the 7950.
Have you tested draw from the outlet? I assumed that modern chips could scale down their power usage significantly when idle. Which is to say I thought consumers would rarely ever hit maximum power draw, so I would expect really modest real-world savings.
Happy to be proven wrong, because it is a brilliant idea. My machine is wildly oversized for my typical usage, and I could easily take a 10%+ performance haircut and likely would not even notice.
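Short of a wall meter, the CPU's own energy counter gives a decent feel for how far the package drops at idle (it won't include PSU/VRM losses or the GPU, so wall draw will be higher). A rough sketch, assuming Linux with the intel_rapl powercap interface exposed:

    import time
    from pathlib import Path

    # Cumulative package energy in microjoules (assumed path; check your /sys tree).
    ENERGY = Path("/sys/class/powercap/intel-rapl:0/energy_uj")

    def package_watts(interval_s: float = 5.0) -> float:
        e0 = int(ENERGY.read_text())
        time.sleep(interval_s)
        e1 = int(ENERGY.read_text())
        return (e1 - e0) / 1_000_000 / interval_s  # uJ -> J, then J/s = W (ignores counter wrap)

    print(f"average package power over 5 s: {package_watts():.1f} W")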
That's part of the reason they're given such a high TDP by default. The cost of squeezing out the last 5-15% may be worth it if it's only for short periods of time.
But if you use these chips for tasks that max out all cores for several hours per day, it may be better to simply run a chip with more cores and run those in eco mode. A 7950X@64W may perform as well as a 7900X@200W for such compute tasks, and is probably cheaper over time.
And if you use it in your house, it generates less noise, can use a cheaper cooler, and doesn't dump as much heat into your room.
Thanks for the thoughts. My untested suspicion is that my CPU hits max load for <5% of total use. Meaning lowering max TDP is unlikely to noticeably impact my electrical bill, but it’s such a cheap optimization to enable, why not?
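Back-of-envelope with made-up numbers, just to show why the bill barely moves if the CPU is only pinned occasionally:

    # Illustrative figures only: 65 W saved under load, 1 h/day at full load,
    # $0.30/kWh. Plug in your own numbers.
    watts_saved   = 170 - 105
    hours_per_day = 1.0
    price_per_kwh = 0.30

    kwh_per_year = watts_saved * hours_per_day * 365 / 1000
    print(f"{kwh_per_year:.0f} kWh/year, roughly ${kwh_per_year * price_per_kwh:.0f}/year")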
I blame Intel for that. They pushed the clock speed high for silly small perf gains. AMD could take the high road, but then the media would brag about how fast Intel is and AMD would lose market share.
As you should. I worked at Intel back in the 00s when they were in the "megahertz wars". All they talked about internally was "we're winning the MHz wars!!! Yay!". When anyone mentioned power consumption, or how fucking loud the cooling fans were, or MIPS/Watt, they didn't care; all that mattered was MHz. Hitching their wagon to RAMBUS memory was part of this. Then they got pissed and indignant when consumers didn't want to spend a fortune on RAMBUS memory and bought AMD CPUs with cheap SDRAM instead.
That fits. The P4 had a long pipeline, high clock speed, and poor perf per MHz. AMD pushed an overall performance rating instead (I forget the name), and it seems like they succeeded. Doubly so since they shipped x86-64 first, which people did care about.
I wrote a few microbenchmarks to explore the performance promises of rambus, and didn't find anything.
Indeed. The game Intel used to play is now the game everyone plays -- Intel, AMD, and NVIDIA, for both consumer CPUs and GPUs. At least on the desktop. They tend to run them to redline by default, because numbers. It's all about getting that big splash at launch, to ride it for higher average sale price as long as possible.
Which is unfortunate, because when it comes to benchmarks, understanding the context of a test setup is extraordinarily important. Are you buying that very high-end memory the test setup used and then manually adjusting timings? No? Then you're not getting those numbers. Heck, you may be losing 20% or more performance compared to the test rig on just the difference in memory. Never mind adjusting other things, ensuring the system's thermals are entirely kept in check to prevent throttling, etc.
You can set various power settings that suggest they are limits, but at least in the Skylake NUC I have, they really don't do much and certainly don't cap the maximum power the system uses at all. The article doesn't talk enough about actual power use versus the settings, although it sounds like, even though both vendors exceed them, there might actually be some limiting on the recent chips.
That Anandtech article did include some power measurements on page three, but given those measurements show the limits are applied inconsistently between AMD and Intel, it would have been nice to have power measurements on the other graphs too.
Indeed, ideally 3 companies to help ensure there's not some agreement to raise prices. Apple's largely not in the same market, but is pushing the edge when it comes to performance per watt, even shipping products, *gasp*, without fans.
Seems like Apple's increasingly happy to hit lower price points and disrupt the price structure for AMD and Intel based product lines.
Similarly, Intel seems to finally be getting its act together and setting up to disrupt the AMD and Nvidia GPU market. Here's hoping. I saw a pretty decent GPU (RTX 3080) for $420 recently!
Until I can slap an Apple chip into my random Linux build, those numbers are meaningless to me. Would require absolutely enormous performance improvements to justify making such a huge leap.
The M1 sits near the bottom of NVIDIA's GPU lineup in terms of ML performance, putting it just slightly ahead of the $200 1660 Ti.
When doing a quick Google search on the topic, it turned out that several of the top results were blatantly misleading in that they limited the competing hardware to match the M1's limitations. For example, the top result, a wandb article [1] claims that the M1 is competitive with the V100, yet their own data shows that they aren't even fully utilizing the V100 and that when properly utilized it obviously totally outperforms the M1.
Similarly, Apple in its marketing for the M1 Ultra was extremely manipulative, bordering on outright lying, when it compared the chip to the 3090. It presented them on a "relative performance vs power" graph making it look like the chip matched the 3090 while consuming less than half the power, when what it was saying was that it's more efficient than the 3090 when you underutilize the 3090 to the point of matching the M1 Ultra's performance.
However, for AI/ML duty the Apple Neural Engine (ANE) looks pretty promising. On DenseNet-121, the ANE on the M2 Max is almost 7 times faster than the M2 Max GPU.
Seems like the M2 Max does pretty well compared to a plugged-in RTX 3070 to 3080 Ti. The big bonuses are that you can use all of the RAM (not limited to 10-16GB of VRAM) and you get the same performance even on battery.
Sorry bro, you're not able to run anything on that iGPU, either because of the limitations of the OS or the limitations of the architecture. It's just marketing.
I found a bunch of ML benchmarks; only one had a comparison to an RTX 3070. Seems quite a bit better than "not able to run anything". In particular you can use up to 96GB of RAM, 4x the 4090. Granted, the M2 Max is only approximately a 3070 in speed for such workloads; at least it doesn't decrease when unplugged.
I agree with you, although I'd prefer to see (at least) a three way duke out between Intel, AMD, and ARM. I really don't think living in a world where there's (basically) only one CPU architecture and instruction set is going to be as beneficial as one in which there is competition on more than simply price and power consumption.
No, it is better for AMD to pull far ahead so that the next challenger is forced to come up with even greater innovation to steal market share. This is better than just having two large vendors locked in a back-and-forth game of one-upmanship and incremental innovation.
Screw that; I want them to dump x86 altogether and move to ARM, or maybe come up with something even better, to compete with Apple's M2. Why are we stuck with this shitty old ISA from the 1970s?
ARM came out the same year as 32 bit x86. Both architectures are very old.
I very much doubt that architecture is all that relevant for their advancements in power usage. Apple's chips even implement a significant x86 feature (TSO memory ordering, for Rosetta) without ruining battery life. Meanwhile, Qualcomm is struggling to compete with Apple in both performance and efficiency despite being in the ARM space for much longer.
I'm sure if Apple could've gotten an x64 license ten years ago, they would've made their own x64 chips instead of switching to ARM. When Apple's plans started coming together, there simply were no competing architectures they could base their chips on. MIPS was practically dead already, x64 was extremely closed off, and RISC-V hadn't even been announced when Apple started selling their own chips, and it still struggles to keep up today.
Maybe they could've licensed POWER6 or an early version of POWER7? The POWER architecture isn't exactly widely used or designed to be power efficient; power management wasn't introduced until 2017 and even then it was optional.
There simply weren't any serious alternatives to licensing ARM and Apple would be stupid to develop an entirely separate CPU architecture for their desktop/laptop/tablet form factors.
The PC platform is standardised and open. Everything else is a fragmentary shitshow. It will clearly be superseded at some point, but I pray it takes its time.
Yes and no. The (very simplified) answer is that yes, some ISA (front-end) instructions are decoded into simpler back-end operations, but the design of the ISA still imposes constraints on the implementation of that decoder [1] and on the design of the back-end.
Then there are concerns like register pressure. x86 has so few general-purpose registers that values need to be spilled to the stack and reloaded when needed. Some of the performance impact can be reduced by complex decoder logic, but making complex logic fast nearly always leads to high power consumption.
[1] E.g., the highly variable length of x86/x86-64 instructions puts a limit on the number of instructions that can be decoded per cycle.
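To make [1] concrete, here's a toy illustration (a made-up mini-ISA, not real x86 encoding) of why variable-length decoding is inherently serial: you only learn where instruction N+1 starts after working out the length of instruction N, whereas fixed-width boundaries are known up front and can be found in parallel.

    # Hypothetical encoding: low 2 bits of the first byte give (length - 1), i.e. 1-4 bytes.
    def boundaries_variable(code: bytes, count: int) -> list[int]:
        offs, pc = [], 0
        for _ in range(count):
            offs.append(pc)
            pc += (code[pc] & 0b11) + 1  # must look at this byte before the next offset is known
        return offs

    # Fixed-width ISA (classic RISC style): every boundary is known without decoding anything.
    def boundaries_fixed(count: int, width: int = 4) -> list[int]:
        return [i * width for i in range(count)]

    print(boundaries_variable(bytes([0b10, 0, 0, 0b01, 0, 0b00]), 3))  # [0, 3, 5]
    print(boundaries_fixed(3))                                         # [0, 4, 8]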
It's got plenty in 64-bit mode (15 + RSP) + you can spill to SSE registers instead of the stack.
It also needs fewer registers than (most) RISCs because it has a more flexible way of specifying memory addresses (base, index, scale, offset + PC-relative) and it also has proper immediates.
> the highly variable length of x86/x86-64 instructions puts a limit on the number of instructions that can be decoded per cycle.
Not that much of a limit, actually. Yes, a parallel decoder that takes arbitrary byte sequences and decodes them is hard to scale up. The instruction lengths can be cached, though. In fact, they used to be cached as extra bits in L1 back when the size of a wide decoder was a significant fraction of the CPU transistor budget. It should be possible to use that idea again to go wider.
The newer x86 CPUs also have a µop cache, so no decoding is even needed for tight loops.