Mercor is seeking a highly skilled Research and STEM Expert to join our AI evaluation and technical quality assurance team. In this role, you will analyze, evaluate, and fact-check AI-generated outputs across scientific, mathematical, and technical domains, ensuring the highest standards of factual accuracy, logical reasoning, and clarity.
You will help improve the reasoning and reliability of cutting-edge Large Language Models (LLMs) by providing structured feedback and expert judgment across diverse STEM fields. This position is ideal for individuals with strong academic training, analytical precision, and a passion for advancing AI alignment in research and science.
Evaluate and critique AI-generated responses in STEM-related subjects (e.g., computer science, mathematics, physics, biology, and engineering).
Conduct fact-checking and research validation using reputable public and academic sources.
Assess scientific explanations, calculations, and reasoning for correctness and clarity.
Provide structured written feedback to improve the model's understanding and communication of technical topics.
Collaborate with the AI quality team to improve annotation guidelines and maintain consistency across evaluations.
BS, MS, or PhD in a STEM domain (e.g., Computer Science, Mathematics, Biology, Physics, or Engineering)
Expert-level English with excellent comprehension and communication skills
Excellent at high school-level math
Expert at fact-checking information across multiple domains (medical, legal, financial, technical, etc.) using trusted public sources
Excellent writing skills and attention to detail
Significant experience using Large Language Models