About the Role
In this full-time remote role at Rexzone, you will evaluate and improve AI/LLM workflows by reviewing model-generated responses, ranking outputs, performing QA evaluations, and writing clear rationales that support model performance improvement. You will follow annotation guidelines and contribute to training data quality by evaluating large language model outputs consistently across English and German content.



