
LLM Evaluation Specialist | $61/hr Remote

Crossing Hurdles
CONTRACTOR · Remote · US · USD 110240–126880 / year (annualized from $53–$61 per hour at 40 hours per week) Posted: 2026-05-11 Until: 2026-07-10
Job Description
Position: Driving Scene Caption Analyst - US
Type: Short-Term Contract (12 weeks)
Compensation: $53 - $61 per hour
Location: Remote
Commitment: At least 4 hours per day and up to 30-40 hours per week, with 4 hours of overlap with PST

Role Responsibilities
Review 30-second autonomous vehicle sensor feeds and interpret driving scenes from a first-person perspective
Evaluate and edit LLM-generated thinking traces for logical consistency and clarity
Ground entities and events to scene coordinates and object references
Revise driving plans to reflect correct actions, justifications, maneuvers, and speed
Identify and remove hallucinated objects or events from AI-generated text
Distinguish between actual sensor data and fabricated model outputs
Maintain a zero-error standard in safety-critical judgments under time pressure
Validate that outputs align with observed trajectory, maneuver, and speed
Apply U.S. traffic laws and driving logic to evaluate scenarios

Requirements
Experience in roles requiring precise writing and structured analytical thinking
Documented driving experience in the U.S. with strong knowledge of traffic laws
Professional fluency in English with high clarity and precision
Ability to evaluate logical coherence in multi-step reasoning
Strong attention to detail and ability to maintain quality under time constraints
Familiarity with LLM-generated content and common failure modes
Understanding of hallucinations, reasoning inconsistencies, and instruction-following errors in AI systems
Ability to work independently in a remote environment

Application Process
1. Apply/Easy Apply and check your email for the application form
2. Fill out the Google form
3. Complete the assessment (link sent after shortlisting; to be completed within 24 hours)