That is mildly true during the training phase, but going from that high-level knowledge to the conclusion that “somebody told the AI to be condescending” is unconfirmed, very unlikely, and frankly ridiculous. There are many more likely points at which the model can accidentally become “condescending”, for example the training data (it’s trained on the internet, after all) or the actual user interaction itself.