@DrJimFan Hey Jim! Eureka is really cool :) Just wanted to point out that Preference-based RL (or RLHF to the NLP community) has actually been around long before OpenAI/DM's paper. For ex: https://t.co/7bsoer1XJB and lots of other work by Akrour circa