Context Windows Are Lying to You
Context window size gets sold as a capability metric. "We support 1 million tokens." "200,000-token context." The assumption: bigger context = the model can use more information = better performance. The marketing is about window size. The model's actual behavior is not.