Join our customer as a Video Caption Specialist, where you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input. No prior experience in AI is required — your domain knowledge is what matters.
Key Responsibilities
- Review short (5-second) videos of robots performing a variety of physical tasks while closely analyzing the corresponding AI-generated captions.
- Detect visual-text discrepancies, identifying any hallucinations (errors) or omissions in the captions compared to the real video content.
- Rewrite captions with clear, concise, and grammatically correct language, ensuring high accuracy in describing robot actions and motions.
- Emphasize the precision of robot movement and task execution in every caption, as these details are critical for model training.
- Maintain strict consistency with established project guidelines and rubrics, applying strong written judgment and editing standards.
- Meet or exceed daily throughput and quality benchmarks to ensure timely and reliable project delivery.
- Collaborate with the customer’s team, providing actionable feedback and sharing best practices to continually enhance caption accuracy.
Required Skills and Qualifications
- Fluency in English with excellent grammar, spelling, and written clarity.
- Keen attention to detail and demonstrated ability to spot subtle errors or inaccuracies.
- Comfortable analyzing and comparing short video clips and their written descriptions repeatedly.
- Strong judgment for rewriting and editing text for maximum accuracy and clarity.
- Ability to adhere to detailed instructions and maintain consistency across structured, repetitive review work.
- Reliable internet connection and a computer capable of smooth video streaming.
- Excellent communication skills, both written and verbal, with a strong care for clarity and accuracy.
Preferred Qualifications
- Experience in AI training data annotation, RLHF, or LLM evaluation.
- Background in writing, editing, journalism, technical writing, transcription, or copy editing.
- Familiarity with robotics, computer vision, or video annotation workflows.