I'm less sure if that's always applicable. I think there is utility in evals functioning like a regression test suite, for example to support things like link not tracked or link not tracked.
...
edit: 2026-01-27 - In fact, in link not tracked, they call out explicitly having different set types, with one of them being link not tracked