Sep 16, 2023 Learning RLHF (PPO) with codes (Huggingface TRL) Feb 19, 2023 Reading Notes of How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources