Germany-Based English & German AI Generalist Trainer 2026 May

Rexzone is hiring Germany-based, bilingual (German/English) AI Generalist Trainers to support large language model evaluation through RLHF-style ranking, prompt evaluation, and training data quality workflows that drive model performance improvement.

About the Role

As a Germany-based English & German AI Generalist Trainer at Rexzone, you will evaluate and improve AI systems by assessing, ranking, and validating model-generated outputs. Your work supports RLHF and large language model evaluation by applying annotation guidelines compliance, completing QA evaluation, and documenting clear rationales that strengthen training data quality and accelerate model performance improvement across multilingual prompts.

Key Responsibilities

Evaluate model responses for accuracy, helpfulness, safety, and policy alignment; rank multiple outputs using RLHF-style preference criteria and prompt evaluation rubrics; write concise, evidence-based rationales explaining ranking and reasoning decisions; perform QA evaluation by validating labels, checking consistency, and correcting errors to protect training data quality; follow annotation guidelines compliance and escalate ambiguous edge cases with clear examples; conduct content safety labeling and identify sensitive, disallowed, or risky content; validate multilingual (German/English) prompt intent and ensure faithful meaning across languages; track recurring failure modes and propose guideline updates to support model performance improvement.

Basic Qualifications

Based in Germany with authorization to work remotely from Germany; fluent in German and English (reading and writing) with strong bilingual comprehension; strong analytical skills and structured reasoning for consistent evaluation and ranking; exceptional attention to detail with comfort following annotation guidelines compliance; ability to learn new rubrics quickly and apply them consistently across high-volume tasks; reliable internet connection and ability to work full-time in a remote setting.

Preferred Qualifications

Prior experience in data labeling, content moderation, QA evaluation, or training data quality programs; familiarity with LLM evaluation, RLHF concepts, or prompt evaluation workflows; experience writing concise rationales and applying nuanced policy or rubric criteria; self-driven, organized, and comfortable working independently with measurable quality targets; interest in AI safety, content safety labeling, and systematic error analysis for model performance improvement.

Compensation

Pay range: $35–$40 USD per hour (hourly). Remote, full-time. Final rate within the band depends on assessment performance, language proficiency, and task alignment.

How to Apply

Apply to Rexzone with a resume highlighting bilingual German/English experience and any background in evaluation, QA, data labeling, or guideline-based review. Qualified candidates may be invited to complete a skills assessment focused on large language model evaluation, ranking, and rationale writing.

Frequently Asked Questions

Q: Is this role remote?
Yes. This is a remote, full-time role, and you must be based in Germany.
Q: What tasks will I do?
You will perform large language model evaluation tasks such as evaluation and ranking of model outputs (RLHF-style), QA evaluation, prompt evaluation, content safety labeling, validation of labels, and writing clear rationales while following annotation guidelines compliance.
Q: Do I need AI experience?
AI experience is helpful but not required. We value strong analytical skills, attention to detail, and the ability to apply guidelines consistently to improve training data quality and support model performance improvement.
Q: What languages are required?
Fluency in both German and English is required, including strong reading and writing skills in both languages.
Q: What domains are covered?
Tasks can span general knowledge, reasoning, multilingual understanding, and content safety, with a focus on RLHF, training data quality, and consistent large language model evaluation across German and English prompts.

230+Domains Covered

120K+PhD, Specialist, Experts Onboarded

50+Countries Represented

Industry-Leading Compensation

We believe exceptional intelligence deserves exceptional pay. Our platform consistently offers rates above the industry average, rewarding experts for their true value and real impact on frontier AI. Here, your expertise isn't just appreciated - it's properly compensated.

Work Remotely, Work Freely

No office. No commute. No constraints. Our fully remote workflow gives experts complete flexibility to work at their own pace, from any country, any time zone. You focus on meaningful tasks - we handle the rest.

Respect at the Core of Everything

AI trainers are the heart of our company. We treat every expert with trust, humanity, and genuine appreciation. From personalized support to transparent communication, we build long-term relationships rooted in respect and care.

Ready to Shape the Future of AI Data Operations?

Apply Now.