iOS / Swift Developer (AI Model Evaluation & Code Quality) | Remote
Crossing Hurdles
Full-remote, Senior, Contract, Backend, Data | United States | 5 days ago via LinkedIn
72,000 - 120,000 USD/annual
Tags
Swift, iOS, LLM Evaluation, Code Quality, Open Source, Software Engineering, Systems Design, Fact-Checking, Technical Writing, Pythonic/Algorithmic Soundness
About the role
Role Overview
This is an hourly contract position for experts in Software Engineering, Data Science, and Systems Design with strong Swift skills (5+ years). You will evaluate AI-generated coding responses for technical correctness, clarity, and quality.
Responsibilities
- Evaluate AI-generated responses to coding and software engineering questions for accuracy, reasoning quality, clarity, and completeness
- Perform fact-checking using reliable public sources and references
- Execute code to validate correctness and verify test outputs
- Annotate responses by identifying strengths, issues, and inaccuracies
- Assess code quality, including readability and algorithmic soundness
- Ensure responses follow best practices and expected system behavior
- Use structured evaluation guidelines, benchmarks, and taxonomies
Requirements
- Strong software engineering background or related technical role
- Expertise in Swift
- Ability to solve medium-to-hard coding problems independently
- Experience contributing to open-source projects
- Experience using LLMs for coding and understanding their limitations
- Strong attention to detail when evaluating technical reasoning
- Clear written communication skills for providing technical feedback
- Degree in Computer Science or related field
Preferred / Nice-to-Haves
- None stated beyond the open-source and LLM experience listed under Requirements
Application Process
- Upload your resume
- Complete a short interview
- Submit a short form (about 20 minutes)
About Crossing Hurdles
Crossing Hurdles is a technology and consulting organization working at the intersection of software engineering and data science. This role involves AI-assisted engineering workflows, including evaluating and assessing the quality of code and AI-generated technical responses.
Scraped 4/1/2026