[Remote] Research Intern (LLM)

Remote Full-time

Note: The job is a remote job and is open to candidates in USA. 2077AI Open Source Foundation is looking for a Research & Evaluation Intern to help build advanced QA datasets and evaluate large language models. This role is ideal for students passionate about LLMs, evaluation science, and the intersection of research and applied data work. Responsibilities Design and construct high-quality, sufficiently challenging QA datasets (graduate/PhD level) inspired by GPQA, HLE, and AI4Sci families, collaborating with a global network of talented researchers Evaluate large language models on reasoning, factuality, and problem-solving benchmarks Develop review pipelines and quality-control criteria for expert-level question generation Analyze model outputs, conduct error taxonomy studies, and summarize insights for internal reports and research papers Collaborate with the 2077AI Foundation’s open-source benchmark teams on public dataset releases Skills Strong background in computer science, data engineering, artificial intelligence, or related fields, with hands-on experience in large-scale data systems 1+ years of experience with LLMs, prompt engineering, and evaluation frameworks (e.g., LM Eval Harness, OpenCompass) Excellent written and verbal English skills and analytical reasoning Strong execution and team management skills—able to translate high-level objectives into actionable plans and drive team outcomes Experience with formal methods, chain-of-thought evaluation, or curriculum generation Relevant publications in top conferences Company Overview The 2077AI Foundation, is at the forefront of AI data standardization and progression. It was founded in undefined, and is headquartered in Singapore, SG, with a workforce of 51-200 employees. Its website is

Apply Now

Experienced Remote Data Entry Specialist (Typist) – Accurate and Efficient Data Management Professional for a Dynamic Team at blithequark

Remote Full-time

Administrative Assistants and Secretaries - AI Trainer (Contract)

Remote Full-time

← Back to Home

[Remote] Research Intern (LLM)

Similar Opportunities

[2026] AI/ML Engineer Intern

AI Safety Research Intern-2

2026 CareSource Summer Internship - Teaching Kitchen

Co-op Software Engineer, Android

Growth Business Development Representative - SMB

Human-Centered AI Intern, Generative Human Modeling

Partner Account Manager

[Remote] AI Safety Research Intern (PhD)

Applications Engineer I

Canada Immigration Law Clerk - Associate - Vancouver

Senior Prompt Engineer-Data Science & Quality Analysis

Experienced Distributed Systems Engineer for Scalable Data Infrastructure at blithequark

Practice Innovation Lawyer – IP, Data & Cyber, Regulatory, Commercial Tech & Transactions and Trade

Director of HR

Experienced Full Stack Application Security Engineer – Web & Cloud Application Development

Remote Flexible Driving & Delivery Partner – Earn Competitive Income with Uber’s Ride‑Share & Delivery Platform

Client Services Administrator (Shareholder Communication) - Contract (Hybrid)

[Remote] Success Education & Leadership Consultant

Experienced Remote Data Entry Specialist (Typist) – Accurate and Efficient Data Management Professional for a Dynamic Team at blithequark

Administrative Assistants and Secretaries - AI Trainer (Contract)

[Remote] Research Intern (LLM)

Similar Opportunities

[2026] AI/ML Engineer Intern

AI Safety Research Intern-2

2026 CareSource Summer Internship - Teaching Kitchen

Co-op Software Engineer, Android

Growth Business Development Representative - SMB

Human-Centered AI Intern, Generative Human Modeling

Partner Account Manager

[Remote] AI Safety Research Intern (PhD)

Applications Engineer I

Canada Immigration Law Clerk - Associate - Vancouver

Senior Prompt Engineer-Data Science & Quality Analysis

Experienced Distributed Systems Engineer for Scalable Data Infrastructure at blithequark

Practice Innovation Lawyer – IP, Data & Cyber, Regulatory, Commercial Tech & Transactions and Trade

Director of HR

**Experienced Full Stack Application Security Engineer – Web & Cloud Application Development**

Remote Flexible Driving & Delivery Partner – Earn Competitive Income with Uber’s Ride‑Share & Delivery Platform

Client Services Administrator (Shareholder Communication) - Contract (Hybrid)

[Remote] Success Education & Leadership Consultant

Experienced Remote Data Entry Specialist (Typist) – Accurate and Efficient Data Management Professional for a Dynamic Team at blithequark

Administrative Assistants and Secretaries - AI Trainer (Contract)

Experienced Full Stack Application Security Engineer – Web & Cloud Application Development