Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
AI Evaluation manager image - Rise Careers
Job details

AI Evaluation manager

About the Role

Luma is pushing the boundaries of generative AI, building tools that redefine how visual content is created. We’re seeking a candidate to help shape and scale the way we understand, measure, and improve model performance. In this role, you’ll partner with researchers, engineers, and technical artists to evaluate our models against real-world creative use cases, design frameworks that capture qualitative nuance, and identify actionable insights that guide development.

This is not a checkbox metrics role — it's about building evaluative systems that match the complexity of human perception, creativity, and intention.

Responsibilities

  • Evaluate generative model performance across diverse tasks, prompts, and modalities.

  • Identify key failure modes, regression patterns, and edge cases that impact product quality.

  • Develop and maintain qualitative evaluation frameworks that are scalable and reusable.

  • Collaborate closely with technical artists and engineers to align evaluations with model capabilities and target use cases.

  • Translate high-level product goals into concrete evaluative criteria.

  • Lead qualitative studies, side-by-side comparisons, and human-in-the-loop evaluation efforts.

  • Provide detailed feedback that informs model fine-tuning, dataset curation, and product UX.

  • Stay informed about emerging evaluation standards in generative AI and creative tools.

Qualifications

  • Master’s degree or higher in Cognitive Science, Human-Computer Interaction (HCI), Design Research, Psychology, Media Studies, or a related field.

  • 5+ years of experience in product evaluation, UX research, model testing, or similar roles that involve structured qualitative assessment.

  • Deep familiarity with creative workflows and real-world use cases for generative models (e.g., animation, filmmaking, digital art, VFX).

  • Strong systems thinking and the ability to define abstract qualities (like believability, identity retention, or scene coherence) in clear evaluative terms.

  • Experience working cross-functionally with engineers, researchers, and creatives.

  • Excellent written communication skills and the ability to synthesize nuanced judgments into clear, actionable insights.

Nice to Have

  • Background in motion, visual effects, or storytelling pipelines

  • Experience evaluating AI-generated media (video, images, 3D)

  • Prior work on building internal tools for qualitative data collection or scoring

  • Familiarity with prompt engineering and reference-based input methods

Luma AI Glassdoor Company Review
4.4 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
Luma AI DE&I Review
4.3 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon
CEO of Luma AI
Luma AI CEO photo
Unknown name
Approve of CEO

Average salary estimate

$135000 / YEARLY (est.)
min
max
$110000K
$160000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs

Luma is looking for an Enterprise Account Executive to close deals and grow strategic partnerships within the entertainment industry leveraging innovative AI technology.

Photo of the Rise User

Eurofins Scientific is seeking an Associate Chemist for 2nd shift sample prep and analysis to support their mission of making environments safer and healthier.

Photo of the Rise User
Posted 6 days ago

A Scientist II role in Somerville to drive research and development of low-carbon cement technologies at Sublime Systems.

Photo of the Rise User
Lonza Hybrid US - Vacaville, CA
Posted 14 days ago

Lead the technical transfer of biopharmaceutical manufacturing processes and influence innovative solutions as an MSAT Principal Scientist at Lonza.

Photo of the Rise User
Posted 4 days ago

Drive innovation in airspace integration as an Airspace Integration Engineer III at SkyGrid, advancing autonomous flight safety and operations globally.

Photo of the Rise User
Domino's Hybrid 30 Frank Lloyd Wright Dr, Ann Arbor, MI 48105, USA
Posted 5 days ago

Domino’s Pizza is looking for a Product Development Manager to lead new and existing menu item innovations and drive product strategy in a fast-paced environment.

Photo of the Rise User

Lead clinical quality improvement initiatives in adult medicine at Presbyterian Healthcare Services, a leading nonprofit healthcare system in New Mexico.

Environmental Scientist role at KCI Technologies focused on stormwater inspection, compliance, and environmental data analysis within a collaborative employee-owned firm.

Photo of the Rise User
Dental Insurance
Disability Insurance
Vision Insurance
Flexible Spending Account (FSA)
Health Savings Account (HSA)
Performance Bonus
Family Medical Leave
Paid Holidays

Lead early clinical development and translational research for GU/Prostate oncology therapies at a global biopharma leader.

Photo of the Rise User

A Scientist I role at AbbVie to lead analytical chemistry efforts, support pilot plant testing, and mentor junior staff in a hybrid setting at North Chicago, IL.

Lead Takeda’s global clinical development strategy for immunodeficiency in Plasma Derived Therapies as a Medical Director based in Boston, MA.

Lead and mentor clinical research associates at DiaMedica Therapeutics, overseeing clinical trial management to advance innovative biopharmaceutical treatments.

Collaborate as a Manufacturing Expert - AI Trainer at RYZ Labs to help shape next-generation AI solutions for manufacturing processes in a fast-paced startup environment.

Photo of the Rise User
Syngenta Group Hybrid Greensboro, North Carolina
Posted 4 days ago

Lead Syngenta’s North American Crop Protection Exposure Science Platform to drive scientific excellence, innovation, and regulatory compliance.

MATCH
Calculating your matching score...
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, unknown
DATE POSTED
July 22, 2025
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!