Chongli Qin

Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)

Large language models have made substantial progress in recent years. Today, many of us already use these models in our daily lives – to get suggestions, to brainstorm ideas, to do research. As we suddenly find ourselves in a situation of societal co-evolution with the models we train, it

The Slow Dangers of Human-AI Co-Evolution

In this post I would like to raise awareness of an elusive kind of danger posed by AI systems: one that is not as futuristic as an uncontrollable superintelligence that may cause human extinction. Nonetheless, if it is unaccounted for, it can cost us our sanity