It's like this: between all examples of, say, a contract in the training corpora, what would you imagine the variance is between them? Consider variance as diction, syntax, and even conceptual structure. Now consider the variance between all examples of, like, a login process. It's so, so much harder to distill not-code text to produce useful output in similar shapes.But of course, the insecure nature of the example code also guarantees a tendency to insecurity in model output as well.