Key Responsibilities
Create high-quality labels across modalities: text (NER, sentiment, summarization scoring), images (bounding boxes, polygons, segmentation), audio (transcription, speaker ID), and video (object tracking). Execute RLHF tasks such as prompt evaluation, pairwise ranking, and preference modeling. Perform content safety labeling aligned to policy taxonomies. Follow annotation guidelines precisely, surface ambiguities, and suggest improvements. Complete QA checks, maintain inter-annotator agreement, and contribute to gold-standard test sets that enable model performance improvement and large language model evaluation.



