Cache-Oblivious Data Structures show that by optimizing for any one cache with unknown parameters you simultaneously optimize for all caches asymptotically.
As we get more and more caches, this model becomes more and more "true" over time, and the ability for hand-written code to exceed it by using the cache parameters is lessened by each additional cache we add to the system.