The word you're looking for is incremental. Yes, going from 16bit to 32bit to 64bit instruction cores are a big deal, but I still completely disagree with your premise. You can have the fastest cores in the world, but they're useless without faster memory, memory access paths, and I/O, and those improvements were not incremental.
A Penryn and an i7 of any vintage are not comparable in performance.
Imagine a Win11 OS update on it. Come back in a few days when it's done. Lots of people play games on their computers. Not me, but lots of people. And video editing.
I think you have an error in grammar. The integrated memory controller made a big difference in performance, and it was first offered in Nehalem. Hundreds (thousands?) of Intel engineers argued for an integrated memory controller back then for a long time, but Intel had a captive multi-billion dollar chipset business in clients and servers that was fab'd on the N-1 process. The chipsets were a huge financial win. Then AMD integrated the memory controllers, and finally the Intel CPU designers were allowed to integrate. On Nehalem and then Itanium.
I still think you're incorrect, big time.
Actually, you're completely wrong on 64-bit and 32-bit. The main purpose for 64-bit systems was so they can see more memory, not because x86-64 is faster. In fact, it's not for almost all workloads, and can be slower. It also slightly limits the clock speed. You don't typically need to do arithmetic on numbers bigger than 2 billion, and the code density is slightly less. But, either way, it's well known 64-bit is not significantly faster than 32-bit, it always has been known. Now, updates to processors not related to that did increase performance, and since 32-bit processors went away, you could say 64-bit processors were faster, but not because they were 64-bit. Now, if you have something that does to huge integer mathematics, it could be, but that's very limited. Most times when it was tested, it was about 1% slower, plus or minus.
You're missing the point about an i7 and Penryn. I'm not sure why it's so difficult to understand, but I'll try again. If you're not waiting on the processor in the first place, there's no substantial difference. If my Penryn easily meets the needs of what I'm doing with it, no, there's not a significantly better experience with an i7. But, I'm not sure why you're even bringing that up, since I never said they had comparable performance. Maybe not reading what I'm writing correctly? I mean, it's pretty clear to me I never said they had comparable performance, so why are you struggling with it?
Yes, and most people don't play games that require a high end CPU, percentage wise. But, I have been clear to say some people can benefit, but there's a lot that can do fine with an older processor. Again, please actually read what I'm writing, instead of trying to change it so it fits in your narrative.
I think you have a problem with understanding what I'm writing, the grammar is correct. Many people were saying the Pentium 4 was slow, and had high latency because of the memory controller was still on the chipset. In fact, Conroe had significantly lower latency than Pentium 4, and was on the chipset as well. Oh, and it blew the door off the Athlon 64, which had the memory controller integrated.
You're completely wrong about memory, particularly about latency, which has improved only very slowly, and that's what CPUs care more about. Last I checked, DDR5 still has higher latency than DDR4. iGPUs like the memory bandwidth, and more cores benefit a bit more, but then larger caches make it less important. But, in any case, memory performance relative to CPUs has not improved dramatically. It's been a slow row, which has been the case for CPU performance in general for a while.
Now, you have to keep in mind, I may be older than you, and really saw big changes with generations. That's the only way I can think of your responses as being sensible. I've seen generations increase by 3x (286), and 2x (486), and Pentium also around there. Oh, and by the way, none of those were changes from 16-bit to 32-bit. The 386 was, and it showed the lowest improvement in performance (but added two important new modes, and got rid of the dreaded 16-bit segments), but in terms of performance, you're wrong. 286 was huge, 486 was huge, Pentium was huge.
Then we got stuck in a bit of a malaise, until Conroe came out, and instantly blew the doors off of existing processors. And every processor after it was iterative, whereas Conroe was not an iteration on the Pentium 4, so it was the correct term. (Sandy Bridge pulled a decent amount of tech from the Pentium 4 though, and Conroe could be said to be based on the mobile line, so wasn't purely a new design like Pentium 4 was) . Look up the Conroe reviews if you don't remember, they were shocking. Instantly killed the Pentium 4, and put AMD in far second place until Ryzen. And yes, they got improvements in each generation, which over time added up, but nothing like Conroe over Pentium 4, or even its direct ancestor, Yonah. And no, Nehalem didn't have that level of improvement.