Rafael, at ServiceNow Research, integrates his expertise in software engineering with his proficiency in distributed systems. This combination of skills strengthens his Machine Learning research, linking complex system architecture with practical algorithm deployment. His role spans both the fundamental and applied dimensions of this field.
Rafael, holding a Master’s in Computer Science and a Bachelor’s in Physics, is a core contributor to the fields of Reinforcement Learning (RL) and Natural Language Processing (NLP) at ServiceNow. His academic foundation supports his research activities, which have recently evolved. For the last 4 years, he has been engaged in Deep Reinforcement Learning, with a focus on Offline RL, Policy Optimisation, and RL-driven Energy-Based Models. At the end of 2022, Rafael’s research direction shifted to emphasise Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF). Central to this shift is his work on Reward Modelling, an integral part of his efforts in RLHF and RLAIF, aimed at refining learning algorithms through advanced feedback interpretation and response systems.