RLHF Architect: Align AI with Human Feedback (Remote)

Odixcity Consulting • , , spain, Spain

📍 Location

, , spain

⏰ Job Type

Full-time

📅 Posted

June 19, 2026

About the Role

                A global AI consulting firm is seeking an RLHF Specialist to enhance AI models using Reinforcement Learning from Human Feedback methodologies. This remote role involves generating preference data, designing model tests, and collaborating with teams to improve ML outcomes. Candidates must have a minimum of 2 years in relevant fields, strong Python proficiency, and experience with deep learning frameworks. Ideal for those passionate about AI alignment and optimization with a flexible work environment.
#J-18808-Ljbffr
            

Ready to Join Through a Referral?

Apply now and get connected directly with the hiring team

Apply for this Position