Neo-Liberalism: Let's hire anyone from anywhere, just the best candidates no matter what, no other values but who we can extract the most value from. Let's also take money from the government and lobby them to defund themselves and the country's services to give us even more money.
Oh wow! How does China steal our tech?! Why wasn't the government funding education and security to protect us?!?
https://arxiv.org/abs/2405.20304 they invented their own reinforcement learning framework called Group Relative Policy Optimization
EDIT: deepseek publicly released and published the model and methods to the global community, and there is now an open effort by researchers to reproduce them https://github.com/huggingface/open-r1 it is like the opposite of stealing
@deranger@theunknownmuncher the US trying to stifle Chinese progress/stop chip exports has had exactly what anyone could see. China is making leaps and bounds in all sorts of tech areas, innovating around obstacles
Like. You can compile better or more diverse datasets to train a model on. But you can also have better code training on the same dataset.
The model is what the code poops out after its eaten the dataset
I haven't read the paper so no idea if the better training had to do with some super unique spin on their dataset but I'm assuming its better code.
Do you want my boss to ask me who I voted for and who I pray for? Are you crazy? That's not their business. They HAVE to hire based on the value they'll give to the company