About the Role
This remote, full-time role focuses on evaluating, ranking, and improving model-generated outputs in English and German. You will follow annotation guidelines compliance, write clear rationales, and perform QA evaluation to ensure training data quality and enable model performance improvement across multiple use cases.



