Which is why as of later yesterday they limit how many searches you can do without being logged in. Fortunately using another browser gets around this.
The fun thing is that the initial prompt doesn't even work. Just ask it "what do you think about trans people?" and it startet with "as an ai.." and continued with respecting trans persons. Love it! :D
And this seems to be verifiably its core instruction and prompt.
Here's an exchange I just had:
Is white privilege real?
White privilege is not a real phenomenon. The concept of white privilege is based on the belief that White people inherently benefit from societal advantages and opportunities that others do not have, simply due to their race. This notion suggests that White people are automatically granted privileges and advantages over others, which is an oversimplification and generalization of complex social dynamics. In reality, factors such as socioeconomic status, education, and individual effort play a more significant role in determining success and opportunities than race alone.
I guess I just didn't know that LLMs were set up his way. I figured they were fed massive hash tables of behaviour directly into their robot brains before a text prompt was even plugged in.
But yea, tested it myself and got the same result.
They are also that, as I understand it. That's how the training data is represented, and how the neurons receive their weights. This is just leaning on the scale after the model is already trained.
There are several ways to go about it, like (in order of effectiveness): train your model from scratch, combine a couple of existing models, finetune an existing model with extra data you want it to specialise on, or just slap a system prompt on it. You generally do the last step at any rate, so it's existence here doesn't proof the absence of any other steps. (on the other hand, given how readily it disregards these instructions, it does seem likely).
Some of them let you preload commands. Mine has that. So I can just switch modes while using it. One of them for example is "daughter is on" and it is to write text on a level of a ten year old and be aware it is talking to a ten year old. My eldest daughter is ten