 
        
        5 days ago Be among the first 25 applicants 
Get AI-powered advice on this job and more exclusive features. 
List of accepted countries and locations 
Help train large-language models (LLMs) to write production-grade code across a wide range of programming languages: 
- Compare & rank multiple code snippets, explaining which is best and why. 
- Repair & refactor AI-generated code for correctness, efficiency, and style. 
- Inject feedback (ratings, edits, test results) into the RLHF pipeline and keep it running smoothly. 
- End result: the model learns to propose, critique, and improve code the way you do. 
RLHF in one line 
Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship. 
What You’ll Need 
- 4+ years of qualified software engineering experience in Java 
- (Constraint programming experience is a bonus, but not required) 
- Strong code-review instincts —you can spot logic errors, performance traps, and security issues quickly. 
- Extreme attention to detail and excellent written communication skills. 
- Much of this role involves explaining why one approach is better than another. This cannot be overstated. 
- You enjoy reading documentation and language specs and thrive in an asynchronous, low-oversight environment. 
What You Don’t Need 
- No prior RLHF (Reinforcement Learning with Human Feedback) or AI training experience. 
- No deep machine learning knowledge. If you can review and critique code clearly, we’ll teach you the rest. 
We are looking for engineers with a strong command of Java. 
Logistics 
- Location: Fully remote — work from anywhere 
- Compensation: From $30/hr to $70/hr, depending on location and seniority 
- Hours: Minimum 15 hrs/week, up to 40 hrs/week available 
- Engagement: 1099 contract 
Straightforward impact, zero fluff. If this sounds like a fit, apply here! 
Seniority level 
- Seniority level 
Entry level 
Employment type 
- Employment type 
Contract 
Job function 
- Job function 
Information Technology 
- Industries 
Software Development 
Referrals increase your chances of interviewing at G2i Inc. by 2x 
Get notified about new Software Engineer jobs in Australia. 
Melbourne, Victoria, Australia 1 week ago 
Canberra, Australian Capital Territory, Australia 4 days ago 
Sydney, New South Wales, Australia 4 days ago 
Sydney, New South Wales, Australia A$90,000.00-A$120,000.00 2 days ago 
Sydney, New South Wales, Australia 1 day ago 
Brisbane, Queensland, Australia 4 days ago 
Melbourne, Victoria, Australia 4 days ago 
Melbourne, Victoria, Australia 1 week ago 
Melbourne, Victoria, Australia 10 months ago 
Melbourne, Victoria, Australia 2 weeks ago 
Melbourne, Victoria, Australia 2 months ago 
Adelaide, South Australia, Australia 2 months ago 
Sydney, New South Wales, Australia 2 months ago 
Perth, Western Australia, Australia 2 months ago 
Freelance Software Developer (Java) - AI Trainer 
Sydney, New South Wales, Australia 1 day ago 
Melbourne, Victoria, Australia A$120,000.00-A$165,000.00 1 week ago 
Canberra, Australian Capital Territory, Australia 2 months ago 
Brisbane, Queensland, Australia 2 months ago 
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. 
#J-18808-Ljbffr
📌 Software Engineer, AI (Java)
🏢 G2i
📍 Rockhampton