A tech company specialized in AI research seeks skilled AI Researchers in Reinforcement Learning with Human Feedback. As part of a dynamic team, you will develop and optimize algorithms that align generative models with human preferences. The ideal candidate holds a PhD and has deep expertise in RL, strong knowledge of deep learning frameworks, and experience in working with large-scale training. This permanent role offers a hybrid working model in Cambridge or London, along with opportunities to contribute to cutting-edge research.
J-18808-Ljbffr