Fullstack Engineer - AI Code Quality
We are seeking a highly experienced Fullstack Engineer to work on cutting-edge AI projects, particularly in improving Large Language Models (LLMs) for software engineering tasks.
Key responsibilities include assessing code generated by models, building agent-based tools, and analyzing complex real-world software. This role spans multiple LLM-related projects aimed at improving AI model performance on code. It involves leading end-to-end engineering efforts for agent use cases such as home automation, coding copilots, and creative assistants; reviewing and ranking model-generated code snippets using a structured evaluation system; evaluating code diffs for correctness, style, maintainability, and performance; and building scalable fullstack applications to support dataset pipelines and tooling.
Candidates must have 10+ years of experience in software engineering with strong fullstack capabilities, at least 2–3 years as a full-time employee at a top-tier tech company, deep expertise in software architecture, debugging, and code review, a proven ability to assess large, realistic codebases and evaluate code quality, strong written and oral communication skills for clear and logical evaluations, and hands-on experience with Git, code versioning, modern frameworks, and cloud platforms. This is a contract-based, fully remote opportunity. Contractors must be citizens of, or hold a valid work permit in, the US, Canada, Australia, or an approved Western European country.
Must-Have Skills:
* 10+ years of experience in software engineering with strong fullstack capabilities
* At least 2–3 years as a full-time employee at a top-tier tech company
* Deep expertise in software architecture, debugging, and code review
* Proven ability to assess large, realistic codebases and evaluate code quality
* Strong written and oral communication skills
* Hands-on experience with Git, code versioning, modern frameworks, and cloud platforms