About the role

Role Overview

Contract hourly position for experts in Software Engineering, Data Science, and Systems Design with strong Swift skills (5+ years). You will help evaluate AI-generated coding responses for technical correctness, clarity, and quality.

Responsibilities

Evaluate AI-generated responses to coding/software engineering questions for:
- Accuracy, reasoning quality, clarity, and completeness
Perform fact-checking using reliable public sources and references
Execute code to validate correctness and verify test outputs
Annotate responses by identifying:
- Strengths, issues, and inaccuracies
Assess code quality, including readability and algorithmic soundness
Ensure responses follow best practices and expected system behavior
Use structured evaluation guidelines, benchmarks, and taxonomies

Requirements

Strong software engineering background or related technical role
Expertise in Swift
Ability to solve medium-to-hard coding problems independently
Experience contributing to open-source projects
Experience using LLMs for coding and understanding their limitations
Strong attention to detail when evaluating technical reasoning
Clear written communication skills for providing technical feedback
Degree in Computer Science or related field

Preferred / Nice-to-Haves

Not explicitly stated beyond the open-source and LLM experience

Application Process

Upload your resume
Complete a short interview
Submit a short form (about 20 minutes)

About Crossing Hurdles

Crossing Hurdles is a technology and consulting organization working at the intersection of software engineering and data science. The role indicates involvement in AI-assisted engineering workflows, including evaluation and quality assessment of code and AI-generated technical responses.

iOS / Swift Developer (AI Model Evaluation & Code Quality) | Remote

Tags