[Remote] Research Intern (LLM)

Remote, USA Full-time
Note: The job is a remote job and is open to candidates in USA. 2077AI Open Source Foundation is looking for a Research & Evaluation Intern to help build advanced QA datasets and evaluate large language models. This role is ideal for students passionate about LLMs, evaluation science, and the intersection of research and applied data work. Responsibilities Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers Evaluate large language models on reasoning, factuality, and problem-solving benchmarks Develop review pipelines and quality-control criteria for expert-level question generation Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases Skills Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems 1+ years of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass) Excellent written and verbal English skills and analytical reasoning Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes Experience with formal methods, chain-of-thought evaluation, or curriculum generation Relevant publications in top conferences Company Overview The 2077AI Foundation, is at the forefront of AI data standardization and progression. It was founded in undefined, and is headquartered in Singapore, SG, with a workforce of 51-200 employees. Its website is
Apply Now

Similar Jobs

[2026] AI/ML Engineer Intern

Remote, USA Full-time

AI Safety Research Intern-2

Remote, USA Full-time

2026 CareSource Summer Internship - Teaching Kitchen

Remote, USA Full-time

Co-op Software Engineer, Android

Remote, USA Full-time

Growth Business Development Representative - SMB

Remote, USA Full-time

Human-Centered AI Intern, Generative Human Modeling

Remote, USA Full-time

Partner Account Manager

Remote, USA Full-time

[Remote] AI Safety Research Intern (PhD)

Remote, USA Full-time

Applications Engineer I

Remote, USA Full-time

Canada Immigration Law Clerk - Associate - Vancouver

Remote, USA Full-time

Freelance AI/ML Penetration Tester

Remote, USA Full-time

Experienced Small Business Customer Service Representative for Remote Work Opportunities in Select States – Delivering Exceptional Support and Driving Customer Satisfaction

Remote, USA Full-time

Experienced Data Entry Specialist – Entry Level Position for Information Management and Entertainment Innovation at arenaflex

Remote, USA Full-time

Paralegal (Remote)

Remote, USA Full-time

Experienced Full Stack Remote Customer Service Agent – Work at Home Opportunity with Comprehensive Training and Competitive Benefits at Blithequark

Remote, USA Full-time

Experienced Remote Data Entry arenaflex Specialist – Ecommerce Product Listing Management and Data Integrity

Remote, USA Full-time

**Experienced Healthcare Customer Service Representative – Work From Home Opportunity at blithequark**

Remote, USA Full-time

**Experienced Customer Service Support Representative – Remote Opportunity with blithequark**

Remote, USA Full-time

Urgently Hiring: Cleveland Clinic Overnight Emergency Radiologist - Remote Opportunity with Competitive Salary and Comprehensive Benefits

Remote, USA Full-time

Experienced Remote Data Entry Operator – Flexible Work from Home Opportunity with Comprehensive Training at blithequark

Remote, USA Full-time
Back to Home