Hey, Taek42 from Reddit dropping in. A lot of misunderstandings in this thread. I am with family this weekend, so I don't have a ton of time. I'm going to try to hit the big ones, and I'll probably skip some. Keep asking questions, and I'll keep walking through the way I see things.
My own information is incomplete. My own experiences are imperfect. If there's something you think I am not understanding, or an important aspect that I am missing, let me know and we'll walk through it until everyone is on the same page.
The biggest idea here that I am in disagreement with is that no block size limit is safe.
A very significant discovery was made: At any fee level, there is a limit to how much data the network can process. Miners will not produce blocks larger than this limit because those blocks will necessarily be orphaned.
This is true! But it ignores a super important idea: different parts of the network have different throughput. For the rest of this example, I'm going to assume an infinite amount of transactions with an unlimited fee, because that's the assumption under which a fundamental block size limit was proven.
If you are a miner, and you know a block of size X can be processed by 85% of the network, but not 100%, do you mine it? If by 'network', we mean hashrate, then definitely! 85% is high enough that you'll be able to build the longest chain. The miners that can't keep up will be pruned, and then the target for '85% fastest' moves - now a smaller set of miners represents 85% and you can move the block size up, pruning another set of miners.
If by 'network', you mean all nodes... today we already have nodes that can't keep up. So by necessity you are picking a subset of nodes that can keep up, and a subset that cannot. So, now you are deciding who is safe to prune. Raspi's? Probably safe. Single merchants that run their own nodes on desktop hardware? Probably safe. All desktop hardware, but none of the exchanges? Maybe not safe today. But if you've been near desktop levels for a while, and slowly driving off the slower desktops, at some point you might only be driving away 10 nodes to jump up to 'small datacenter' levels.
And so it continues anyway. You get perpetual centralization pressure because there will always be that temptation to drive off that slowest subset of the network since by doing so you can claim more transaction fees.