I’m certain (indeed more certain than I likely should be, which may be meta-meta memory?!) that what you say that the end is the case. There’s almost certainly a bias towards error correction over direct recall. Certainly my experience is of testing wrong answers in my head before alighting on the right one.
That implies a set up more like an adversarial neural network (I’m not saying this is actually how it is, just trying to draw an analogy from something I understand), as opposed to a function in code. But that seems like a bit of a waste, but also means that two (or more) distinct processes could be working on the same task?
This is a fairly good summary of my argument against why we are in a simulation.