Sep 16, 2023 Learning RLHF (PPO) with codes (Huggingface TRL)
Feb 19, 2023 Reading Notes of How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources
Feb 12, 2023 Huggingface parallel training for solving the CUDA out of memory issue
Jan 24, 2023 Could you give me a hint? Generating inference graphs for defeasible reasoning