Research and analysis on AI systems, infrastructure, and applied intelligence.
Memory Is the Real Bottleneck — And Everyone Is Still Optimizing the Wrong Thing
The assumption: more GPUs equals more AI capability. The reality: the bottleneck has shifted from compute to memory, leaving expensive hardware sitting partially idle while teams track the wrong infrastructure metrics.
Latest Analysis
6 reportsEvery AI Data Center Built Since 2022 May Be a Stranded Asset. The Industry Has No Tool to Measure This.
Data center infrastructure is long-lived. The 20 to 40 year depreciation schedules applied to data center facilities reflect a real historical truth: the buildings last for decades. The assumption underpinning $7.6 trillion in projected AI infrastructure investment through 2031 is that the facilities being built now will be productive through the depreciation period. The assumption is not obviously wrong for the building itself. It is almost certainly wrong for a significant portion of the infrastructure inside.
The $650 Billion AI Buildout Has a 10% Problem That Is Stopping the Other 90%
AI infrastructure investment is constrained by capital and by GPU supply. More capital deployed = more AI capacity built = more compute available. The hyperscalers are spending hundreds of billions — therefore, compute is expanding rapidly.
The Real AI Infrastructure Crisis Is Power, Not Compute
GPU availability dominated the AI narrative, from H100 allocation wars to cloud pricing drops. But the real constraint has shifted: not compute, but power.
Token Prices Are Falling. Your AI Bill Is Rising. Both Are True.
Per-token costs are collapsing, but enterprise AI bills keep rising. This piece explores why token deflation does not translate to cost deflation—and the hidden mechanics driving AI spending upward.
Benchmarks Are Quietly Breaking AI
AI systems are no longer optimizing for capability → they are optimizing for benchmark environments.
The Industry Is Spending on Training and Calling It AI Infrastructure. The Bill for What Actually Runs AI Has Not Arrived Yet.
AI infrastructure investment is dominated by GPU clusters for training frontier models. The assumption embedded in every infrastructure spending analysis from 2022 through 2024: training is the primary compute cost. Build the training clusters; inference will be manageable.