Job Description
**Work Snapshot Type** : W2 Location : Remote Commitment : 40 hours per week Commission : $70 $100 per hour What You Ll Be Doing Design complex technical tasks across machine learning, data science, data engineering, and software workflows Evaluate model outputs and provide detailed feedback on correctness, efficiency, and reasoning quality Develop evaluation frameworks and rubrics for assessing agentic system behavior Create accurate, well-documented solutions that serve as high-quality ground truth data Collaborate with cross-functional subject matter experts to ensure consistency and technical accuracy What We Re Looking For Strong experience in machine learning, data science, software engineering, or related STEM disciplines Strong experience in programming, data analysis, statistical methods, or computational workflows Ability to commit to full-time weekday availability throughout the engagement Experience with data annotation, evaluation, or human feedback workflows is a plus Familiarity with LLMs, agentic systems, or evaluation frameworks Strong written communication and technical documentation skills How To Apply Upload resume Interview Submit form