Salary: ? - ? € per year
Requirements: - Several years of software engineering experience (3 years or more)
- Strong expertise in systems programming, infrastructure, or backend development using languages like Python, C/C++, Rust, and Go
- Experience building and deploying scalable, production-grade software using modern languages and tools
- Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
- Excellent oral and written communication skills for clear, structured evaluation rationales
- Ideal background: engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale organizations; graduates from strong CS programs (e.g., University of Washington, UIUC, UT Austin, University of Michigan, Purdue) are welcome - exceptional experience and skill take precedence
Responsibilities: - Create datasets for training, benchmarking, and advancing large language models by curating code examples, providing precise solutions, and making corrections in Python, C/C++, Rust, Go, Java, and JavaScript (including ReactJS)
- Evaluate and refine AI-generated code with emphasis on systems-level correctness, performance, scalability, and reliability
- Collaborate closely with researchers and cross-functional teams to enhance enterprise-level AI-driven coding solutions
- Build agents that can verify the quality of systems-level and infrastructure code and identify error patterns
- Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them
- Design verification mechanisms that can automatically verify a solution to a software engineering task
- Typical day: curate code examples, build solutions, correct code, evaluate AI-generated code, collaborate with teams, and develop verification agents
Technologies: - AI
- API
- Backend
- FastAPI
- Java
- JavaScript
- Python
- Rust
More:
- Job title: Remote Senior Backend Engineer (Python/FastAPI)
- Salary: 200 - 300 USD per HOUR
- Tech stack: FastAPI, Python, JavaScript, Java
- Project overview: As a Software Engineering evaluator, you will focus on systems-level code, performance-critical applications, and infrastructure; evaluate and refine AI-generated code and work with cross-functional teams to enhance AI-driven coding solutions
- Category: Python Developer / Engineer
- Location address: 548 Market Street, PMB 18282, San Francisco, United States
- Benefits & perks: Fully home office / remote work; Flexible work time
- Company: Turing
- Engagement details: Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week; Type: Contractor (no medical/paid leave); Duration: 1 month (potential extensions based on performance and fit); Location requirement: Candidates must be based in the United States
- Evaluation process: Application takes 15-30 minutes; completion of an AI video interview is required
- View this job and over 500 other transparent jobs with salaries & tech stacks on DevITJobs
last updated 20 week of 2026