At 15:50 he talks about how it can get really expensive to do so many forks to your code and edit the whole file in each of the forks because output token are more expensive than input tokens. Because it's auto regressive. Also the output limits are not growing as fast as the context windows, which have become very large.