I always imagine it more like neural networks. simply based on a lot of training and experience. As an example think of times when you step onto a non moving escalator. Your mind definitely knows its not moving but you still can't defeat the trained expectation of jerk.
Have you ever swiped on your phone, but the screen doesn't move (due to end of content, or unknowingly being an unswipable screen), and you feel your eyes jerk automatically in reflex, predicting the movement that didn't happen?