Hacker News new | past | comments | ask | show | jobs | submit login

Thanks for cutting through the noise. I did some poking around and a discussion from a couple of days ago reached the same conclusion.

https://news.ycombinator.com/item?id=42825573




Is it correct or incorrect that they open-sourced tbeir code? i.e. can anyone with $6M now take the DeepSeek training code, apply it to their dataset of interest, and train a new model that is not censoeed (i.e. even somehow intrinsically to the kodel itself)? Apologies I am not an AI engineer nor even a software engineer of my terminology usage isn't quite spot on.


They have definitely open sourced the inference code. I haven't any training code. I don't think HAI-LLM is open source.

But certainly you can take the architecture from the paper and train a similar model. Or you can try to remove the alignment and produce and uncensored version then realign it.

But at least part of the advantage they have is training on Chinese internet data from inside the great firewall that (AFAIK) US companies don't have access to for any price.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: