Hiring AI Software Engineer – LLM Evaluation & Code Intelligence_Europe | North America | Oceania – Remote

 


AI Software Engineer – LLM Evaluation & Code Intelligence

Location: Europe | North America | Oceania – Remote
Contract Type: Contractor (1 Month, extendable)
Engagement: 10–40 hrs/week (Partial PST overlap required)
Pay Range: USD 50 – 125 per hour

Apply herehttps://forms.office.com/r/h70262TM34


Project Overview

Join an innovative global AI project focused on AI-assisted software development. The goal is to create high-quality evaluation and training datasets to enhance how Large Language Models (LLMs) interact with real-world software engineering tasks.

You’ll work across software ecosystems, helping models understand complex codebases and supporting the development of intelligent AI agents that boost model performance and reliability.


Role Overview / Typical Day

Lead and deliver end-to-end AI agent use cases — such as coding copilots,
creative design assistants, and automation tools. 
Collaborate to identify edge cases and model ambiguities.
Review and rank 3–4 model-generated code responses per task using structured evaluation methods.
Evaluate code diffs for correctness, maintainability, efficiency, and style. Provide clear and detailed rationale for evaluation decisions.

Required Skills & Experience

Expertise in full-stack application development and deploying scalable, production-grade software.
Deep understanding of software architecture, debugging, design patterns, and code review.
Proven ability to evaluate code diffs for correctness and performance. Excellent communication skills and the ability to write structured evaluation rationales.

Engagement Details

Flexible hours: 10–40 hrs/week
Time zone: Partial PST overlap preferred
Type: Contractor (no medical/paid leave)
Duration: 1 Month, with potential extension based on performance and fit