AI Training Jobs in India — Remote AI Training Specialist
Title: AI Training Specialist Date: 25-02-2026 Company: Rexzone Country: US Remote Type: Remote Employment Type: FULL_TIME Experience Level: Mid-Senior Industry: Technology Job Function: Engineering Skills: AI training, RLHF, data labeling, LLM evaluation, prompt evaluation, QA evaluation, training data quality, annotation guidelines compliance, named entity recognition, computer vision annotation, content safety labeling Salary Currency: USD Salary Min: 63360 Salary Max: 126720 Pay Period: YEAR You will execute and improve AI training workflows across NLP and computer vision datasets, producing reliable labeled data and evaluation signals that directly impact model behavior. Work includes RLHF preference ranking, rubric-based QA evaluation, prompt/response evaluation, and error analysis to identify systematic failure modes and improve model performance. Key Responsibilities: • Produce labeled datasets for NLP tasks such as named entity recognition, text classification, summarization, and instruction-following. • Perform RLHF tasks: preference comparisons, rationale capture (when required), and consistency checks using calibrated rubrics. • Run LLM evaluation and prompt evaluation for accuracy, helpfulness, harmlessness, and policy compliance. • Conduct QA evaluation, audits, and adjudication to improve training data quality and reduce label noise. • Apply annotation guidelines compliance, document edge cases, and propose clarifications to annotation playbooks. • Support computer vision annotation including bounding boxes, polygons, segmentation, and keypoints when projects require CV labeling. • Perform content safety labeling across categories such as hate, harassment, self-harm, sexual content, and violent content, aligned to policy. • Collaborate with project leads to track inter-annotator agreement, drift, throughput, and quality metrics. What You’ll Work With: • Large-scale LLM training pipelines, evaluation harnesses, and annotation platforms • Taxonomies, rubrics, golden sets, and calibration rounds • Dataset versioning, sampling strategies, and error buckets for model performance improvement Qualifications: • Experience in data labeling, QA evaluation, or LLM evaluation in production or high-throughput environments • Strong written English for prompt evaluation and rubric-based judgment • Familiarity with NLP concepts (NER, classification) and/or computer vision annotation • Ability to follow precise guidelines, maintain consistency, and handle sensitive content for content safety labeling How to Apply: • Apply via Rex.zone and include a short summary of your annotation, RLHF, or evaluation experience and the domains you’ve worked in (NLP, CV, content safety).



