Become an AI Evaluation Engineer in 5 weeks

Live (Zoom) • Intermediate • 💎 10000

Become an AI Evaluation Engineer in 5 weeks

The only hands-on, project-based bootcamp that teaches you to test, measure, and improve AI systems — the skill every AI team is desperate to find.

Book Your AI Career Consultation

Duration

5 Weeks

Prerequisites

1+ yrs experience in QA

Background

Good for Noncoders

Format

Live, Hands-On Training

Upcoming Cohorts

Cohort April 2026

Start Date: April 18, 2026

End Date: May 17, 2026

Duration: 5 Weeks

Format: Live online sessions with interactive components

Instructors:

Amanda Curtis,

Tagir Fakhriev,

Igor Dorovskikh,

Jaime Mantilla

Pricing

$2,999

$3,999

or Buy Now, Pay Later with (only for United States)

Secure your spotLimited seats available

Course Schedule (PDT)

April 18

Saturday

10:00 AM - 2:00 PM

April 19

Sunday

10:00 AM - 2:00 PM

April 25

Saturday

10:00 AM - 2:00 PM

April 26

Sunday

10:00 AM - 2:00 PM

May 2

Saturday

10:00 AM - 2:00 PM

May 3

Sunday

10:00 AM - 2:00 PM

May 9

Saturday

10:00 AM - 2:00 PM

May 10

Sunday

10:00 AM - 2:00 PM

May 16

Saturday

10:00 AM - 2:00 PM

May 17

Sunday

10:00 AM - 2:00 PM

Everything is hands-on. You build real evaluation suites from week one.

LLM Behavior Testing

Prompt injection, jailbreaks, hallucination detection, context window limits

Evaluation Frameworks

Build automated evals with Promptfoo, custom metrics, and assertion suites

Bias & Fairness Auditing

Identify and document model bias across demographics and edge cases

Safety & Red-teaming

Adversarial testing, compliance checks, and responsible AI validation

“Every course teaches something different — none connect together.”

“I don’t want hype or theory. I need real skills I can use at work.”

“I know AI matters, but I don’t know where to start.”

“I’m afraid of choosing the wrong course and wasting time.”

“I can’t quit my job to ‘learn AI full time.’”

“I don’t know which AI role actually fits me.”

Is this program for you?

QA Engineers and SDETs who want to stay relevant as AI reshapes software testing
Automation Engineers looking to expand beyond traditional frameworks into AI system testing
Manual Testers without any programming background, ready to learn from scratch, hands-on.
Test Leads and QA Managers responsible for quality, risk, and governance in AI-powered products
Software Engineers exploring AI-adjacent roles such as prompt engineering or AI quality

“I know how to test APIs and UIs… but AI apps feel different.”

→ This path bridges that gap.

Be Fully AI Job-Ready by Graduation

Career readiness isn't an afterthought — it's part of the program. You'll get dedicated coaching, a strategy to grow your LinkedIn presence, and real project experience you can speak to in any interview.

Portfolio

Get mentorship, job opportunities and peer support throughout Discord community, plus a network that stays with you.

Job leads

Community

AI isn't replacing you. It's your next career move.

Why learning AI & LLM Testing is a must for Every QA in 2026?

AI Evaluation Engineering is already a specific, high-demand skill

More than 3,000+ job openings across the US

Top AI Evaluation Engineers earn over $300K/year

AI Testing Skills are in-demand in every company

You After the "Break Into AI Testing: The Next-Gen Quality Engineer Skillset!" Course

AI Evaluation Engineer

Portfolio-ready AI and LLM testing experience built on a real U.S. startup project with live, hands-on training.

$200,000

Expected salary

Skills

LLM evaluation

Hallucination + factual drift detection

Multi-model comparison

LLM-graded assertions

Prompt injection + jailbreak testing (red teaming)

Bug reports for AI failures

Tools

Promptfoo

LM Studio

Agenta

Arato.ai

OpenAI API

Anthropic API

Proof of Work

Break Into AI Testing: The Next-Gen Quality Engineer Skillset!

EnGenious University

AI Application Testing Portfolio

Hands-on artifacts covering LLM evaluation, prompt injection and jailbreak testing, multi-model comparison, and hallucination detection — built using Promptfoo, OpenAI API, Anthropic API, and LM Studio.

Will I get a certificate?

Of course! It'll look great on your resume and LinkedIn

Your Name

Break Into AI Testing: The Next-Gen Quality Engineer Skillset!

Instructors:

Amanda Curtis, Tagir Fakhriev, Igor Dorovskikh, Jaime Mantilla

Finished: May 17, 2026

Number of lectures: 10 / Total hours: 40

university.engenious.io

university@engenious.io

Our alumni work at

Hershal Walton

Gen AI Product Manager

“This course puts you in a leading frontier for new opportunities that should be coming up very soon”

Max Volovich

QA Engineering Manager @ Sirona Medical

“After this course, I not only understand how AI systems work behind the scenes, but I also feel confident leading teams building and testing them.”

Mavis Herring

AI Quality Engineer @ WeOptimize AI

“This course built confidence. As soon as I posted that I finished the course on LinkedIn, many recruiters started approaching me.”

Our alumni work at

We've taught 1,000+ students to ...

Learn From The Best

Igor Dorovskikh

CEO and Founder

Igor is an accomplished CEO and Founder of Engenious.io, with 15+ years of experience in software testing and development and over a decade in management. He has worked at Barnes & Noble, Expedia, Tinder, and consulted at Apple and Grammarly. In the mentorship program, Igor offers expertise in building a testing process from scratch, leadership success, understanding C-level executives' expectations, selecting the right technology stack, providing and collecting feedback, and team growth. Mentees benefit from Igor's insights on creating efficient testing processes, fostering productive teams, aligning with executive priorities, making informed technology choices, establishing feedback channels, and securing resources for team development. With Igor as their mentor, participants gain valuable knowledge, skills, and perspectives to excel as Dev/QA Directors or Managers.

Jaime Mantilla

Quality Engineering Manager

Seasoned IT professional with 14+ years of experience in Software Engineering, Quality Assurance, and Automation. Skilled in leading teams, designing test strategies, and building automation frameworks across diverse industries. Adept at leveraging modern tools, AI-driven testing approaches, and cloud technologies to deliver high-quality, scalable solutions. Holds a Bachelor’s in Management Information Systems and a Master’s in Information Technology with proven success supporting enterprise-level clients and Fortune 500 companies.

Amanda Curtis

Founder of Lemonade Tech & QA Manager

Amanda Curtis is a QA leader and founder of Lemonade Tech, with a passion for responsible AI adoption and helping teams cut through tech overwhelm. With 10+ years experience leading QA teams and modernizing testing practices, Amanda focuses on practical solutions that improve software quality while keeping technology approachable and human-centered. Helping organizations “find the good in tech” by cutting through complexity and focusing on what truly adds value.

Tagir Fakhriev

Software Engineer

10 years of experience in the tech industry; Senior Android Engineer in Platform team. Expert in CI/CD pipelines, test automation, and mobile infrastructure; passionate about developer productivity and workflow optimization.

Gregory Goldshteyn

Instructor AI Accelerator

Visionary QA Leader with substantial experience in the IT industry. Worked across Salesforce, Sony, and now as part of Video Engineering and Quality Assurance, he leads the strategy for high-concurrency streaming environments, where a single second of latency is unacceptable.

Alex Kiperberg

Instructor AI Accelerator

Software development and QA experience for over 20 years. Alex has worked at well-known companies such as Oracle and HCL Software, and has strong expertise in functional, regression, and automated testing, complemented by a background in Java-based application development. Skilled in WebdriverIO, Selenium, JavaScript, and CI/CD pipelines, with hands-on experience in building and supporting enterprise applications.

Vladimir Tanev

Senior iOS Engineer to Co-Founder & CTO·WeOptimize.ai

Vladimir is an experienced engineer with 8+ years in iOS/macOS development, specializing in AI-powered solutions. As the Co-Founder & CTO of WeOptimize.ai, he leverages AI to optimize workflows and enhance productivity. He has a track record of delivering innovative products for both startups and large enterprises.

Max Volovich

Instructor AI Accelerator

Quality Engineering leader driving scalable automation and delivery across enterprise SaaS and AI/LLM systems. Leads global QA teams and embeds quality into revenue-critical release pipelines, strengthening reliability and trust in AI-driven products.

What you'll achieve in 5-weeks

Week

Day 1: AI Fundamentals and Tool Setup

Learn AI app architecture and key components.

Understand why AI testing differs from standard app testing.

Set up your testing environment (Python, Node.js, LMStudio, API keys).

Tools: LMStudio, ChatGPT, Anthropic.

Activities: Lecture on AI basics, hands-on setup, and running simple queries.

Day 2: AI-Assisted Testing with Promptfoo

Introduction to Promptfoo for LLM testing and red teaming.

Tools: Promptfoo.

Activities: Lecture on Promptfoo, setup, and hands-on prompt testing.

Week

Day 3: Testing LLMs using Promptfoo

Apply Promptfoo to compare/ test multiple LLM models.

Debug test results for model improvements.

Tools: Promptfoo,  LMStudio.

Activities: Hands-on LLM testing and debugging.

Day 4: Debugging AI Failures

Assertions and Metrics in Promptfoo

Deterministic assertions

LLM-graded assertions

Weighted assertions and outcome effects

Tools: Promptfoo, LM Studio

Activities: Lecture on advanced assertions and Metrics in Promptfoo, hands-on practice sessions

Week

Day 5: Hands-on Testing with Promptfoo

Introduction to Red Teaming

Architecture of complex LLM systems

LLM model Red Team demo

Tools: Promptfoo, Red Teaming

Activities: LLM model Red Team testing setup/practice

Day 6: Hands-on Testing with Promptfoo

Red Team Review

Introduction to Real AI Application

Introduction to Black Box Testing

AI Bug Documentation Overview

Promptfoo and Real AI Applications

Tools: Promptfoo, Real AI Application

Activities: Lecture on complex AI application and Black Box Testing, hands-on black box testing experience.

Week

Day 7: Open-source Tools for AI Testing

Debugging AI Failures based on test results

Advanced Promptfoo testing against Live AI Application

Activities: Lecture on Internals of Live AI Application, Hands-on Debugging

Day 8: Multi-thread and new tools testing

Multi Thread testing

Arato.ai Overview

Agent.ai Overview

Activities: Hands-on Multi-Thread Prompting (building chat history), and exploring new tools

Week

Day 9: New LLM Testing Tool

Reinforce earlier teaching by applying it to a new LLM new Testing tool

Activities: Hands-on tool integration into workflows

Day 10: Career Prep Session

Review course content and discuss the future of AI testing

How to position yourself in the job market with AI skills

AI testing interview prep & resume optimization

Benefits You Won't Find Anywhere Else

Lifetime Community Access

Join our Discord community with 1000+ QA professionals.

Ongoing support from instructors and alumni.

Regular follow-up sessions and career guidance.

Recorded Sessions

All sessions recorded and can be accessed up to for 1 year.

Review program materials and session recording anytime.

Never miss important concepts.

What's Next? Even More

Of course, after completing the course, you can start working. But you should not stop your development. We are the only ones who offer not one course, but a comprehensive path that will make you a professional.

Two weeks after completing the course, the best students will be able to do an internship with us. During the internship, you will be directly involved in ongoing projects related to Stella Foster, our communication platform based on artificial intelligence.

You can also upgrade your knowledge with the Advanced RAG & Multi-Agent Testing course, which will make you the most sought-after employee in your field.

Internship in AI Startup (Stella Foster)

Live
Intermediate
4 weeks

Take your skills further with a hands-on internship designed to give real-world experience working with production AI systems, testing frameworks, and voice/SMS automation tools.

Advanced RAG & Multi-Agent Testing

Live
Advanced
8 Weeks (32 hours)

Go deep into Retrieval-Augmented Generation, vector databases, grounding, multi-agent workflows, tool usage, and complex evaluation frameworks used in modern AI systems.

Minimum system requirements

macOS:

Processor: Apple Silicon M1, M2, M3 or M4

Memory: 16 GB RAM (or higher)

Storage: 30 GB free SSD space

Note: Mac OS systems without an M chip are not supported

Windows:

Processor: Intel Core i5 / i7 or AMD Ryzen 5 / 7

Memory: 16 GB RAM (or higher)

GPU: Dedicated GPU with ≥ 6 GB VRAM (e.g., NVIDIA RTX 2060 / 3060)

Storage: 30 GB free SSD space

FAQ

The training is a 5-week long training. It includes 10 lectures (40 hours). Classes are held on weekends, Saturdays and Sundays from 10.00 am to 2.00 pm PST

Yes — we provide comprehensive career preparation and mentorship support, though employment is not guaranteed.
During the final week of the cohort, we dedicate 4 hours to focused career development sessions covering:

LinkedIn optimization and personal branding
Job search strategies tailored to AI and QA markets
Resume updates and portfolio positioning for AI Testing roles

For top-performing graduates, Engenious may offer short-term contract roles through partner projects or internal initiatives. However, timelines and availability are not guaranteed.

After graduation, you can continue growing through our Mentorship Program — designed to help you refine your AI QA skills, gain real-world experience, and stay connected with the Engenious professional network.

💻 Windows

✅ Windows 10 (64-bit) or newer

✅ Intel i5 (8th Gen +) / AMD Ryzen 5 +

✅ 8 GB RAM (min), 16 GB recommended

✅ 20 GB free storage

✅ Node.js v18+, Python 3.8+, VS Code, Git (Docker optional)

✅ Chrome or Edge browser

✅ Stable 10 Mbps+ internet + webcam

🍏 macOS: macOS Monterey (12+) or newer

✅ Apple M1/M2 chip or Intel i5 (2018 +)

✅ 8 GB RAM (min), 16 GB recommended

✅ 20 GB free storage

✅ Homebrew, Node.js v18+, Python 3.8+, Docker (optional)

✅ Chrome or Safari browser

✅ Reliable 10 Mbps+ connection + webcam

💡 Tip: Dual-monitor setups improve productivity for labs and evaluations.

You’ll explore multi-agent orchestration concepts by testing a live AI app (WeOptimize).

We emphasize end-to-end testing rather than isolated stages.

You’ll learn to:

✅ Identify failure points in multi-turn interactions

✅ Evaluate guardrail effectiveness and memory behavior

✅ Detect safety leaks and context loss across chained logic

✅ This reflects real QA work in AI product teams — black-box testing of complex reasoning flows.

It’s a 5-week, hands-on training program designed to help QA engineers transition into AI & LLM Testing roles. You’ll work on a real U.S. startup AI project while mastering model evaluation, red-teaming, and test automation with AI tools.

Not in this cohort. The January 2026 program focuses exclusively on text-based LLMs, since the current job market is centered on grounding, factuality, and safety validation for text systems.

Yes — these are included through:

1. Drift indicators and re-evaluation cycles

2. Synthetic variation testing

3. Failure pattern analysis

3. Feedback loop triage

You’ll learn to identify regression behaviors and emergent defects as AI systems evolve — essential for real-world QA teams.

This program is for QA professionals with 3+ years of manual QA experience who want to move into the fast-growing world of AI Quality Assurance. No coding or AI experience is required — just curiosity, analytical thinking, and a testing mindset.

These are addressed through:

- Deterministic & weighted assertions

- LLM-graded accuracy evaluation

- Safety, bias, and hallucination detection patterns

- Multi-model comparison

- Context-based grounding checks

Weeks 2–3 focus on advanced Promptfoo assertions and red-team strategies to identify hallucinations, factual drift, and grounding violations.

You won’t build a RAG pipeline from scratch, but you’ll learn how to evaluate retrieval-augmented systems — a core QA responsibility in AI production environments.

You’ll gain practical skills to:

1. Test and validate AI-powered applications and LLMs

2. Detect hallucinations, bias, and factual drift

3. Evaluate grounding and context reliability

4. Use frameworks like Promptfoo and LLM-graded assertions

5. Build a portfolio-ready capstone project aligned with current job roles

Week 1: AI fundamentals, environment setup, and AI-assisted testing basics

Week 2: LLM testing, debugging model failures, and assertion strategies

Week 3: Advanced red-teaming, grounding validation, and safety testing

Week 4: Open-source tools, workflow automation, and model evaluation frameworks

Week 5: Resume optimization, job prep, and final capstone showcase

1. Project-based learning: You test a real U.S. AI startup product

2. 95% hands-on: Minimal theory, maximum practice

3. Mentor-led live sessions (with recordings for 1-year access)

4. Career coaching and interview prep built into the final module.

Yes, currently available only for U.S. applicants.

During checkout, you can select a payment plan through Stripe’s Klarna interface, allowing you to spread tuition into manageable installments.

Graduates qualify for emerging QA-AI hybrid roles such as:

✅ AI QA Engineer

✅ LLM Quality Engineer

✅ AI Test Engineer

✅ Evaluation Engineer

✅ AI Red-Teaming Analyst

Yes — at least 3 years of QA experience (manual or automation).

No programming background is needed, though familiarity with testing workflows is helpful.

Still have questions?

Not sure if this program is right for you? Need help choosing the best path or want to understand the curriculum better

Our AI assistant is here to help — fast, friendly, and available anytime.

Ask AI CAREER ASSISTANT cool robot hand image

100% money back guarantee

If you're not satisfied by Week 1, claim a full refund, no questions.

Seats are limited to 50 registrants. Secure your spot today.

Ready to begin your AI testing journey?

The future of QA isn't about choosing between Selenium or Playwright - it's about Mastering Prompt Engineering, LLM Testing and AI Debugging.

Book Your AI Career Consultation

Everything is hands-on. You build real evaluation suites from week one.

LLM Behavior Testing

Prompt injection, jailbreaks, hallucination detection, context window limits

Evaluation Frameworks

Build automated evals with Promptfoo, custom metrics, and assertion suites

Bias & Fairness Auditing

Identify and document model bias across demographics and edge cases

Safety & Red-teaming

Adversarial testing, compliance checks, and responsible AI validation

Is this program for you?

QA Engineers and SDETs who want to stay relevant as AI reshapes software testing

Automation Engineers looking to expand beyond traditional frameworks into AI system testing

Manual Testers without any programming background, ready to learn from scratch, hands-on.

Test Leads and QA Managers responsible for quality, risk, and governance in AI-powered products

Software Engineers exploring AI-adjacent roles such as prompt engineering or AI quality

“I know how to test APIs and UIs… but AI apps feel different.”

→ This path bridges that gap.

You After the "Break Into AI Testing: The Next-Gen Quality Engineer Skillset!" Course

AI Evaluation Engineer

Portfolio-ready AI and LLM testing experience built on a real U.S. startup project with live, hands-on training.

$200,000

Expected salary

Skills

LLM evaluation

Hallucination + factual drift detection

Multi-model comparison

LLM-graded assertions

Prompt injection + jailbreak testing (red teaming)

Bug reports for AI failures

Tools

Promptfoo

LM Studio

Agenta

Arato.ai

OpenAI API

Anthropic API

Proof of Work

Break Into AI Testing: The Next-Gen Quality Engineer Skillset!

EnGenious University

AI Application Testing Portfolio

Our alumni work at

Hershal Walton

Gen AI Product Manager

“This course puts you in a leading frontier for new opportunities that should be coming up very soon”

Max Volovich

QA Engineering Manager @ Sirona Medical

“After this course, I not only understand how AI systems work behind the scenes, but I also feel confident leading teams building and testing them.”

Mavis Herring

AI Quality Engineer @ WeOptimize AI

“This course built confidence. As soon as I posted that I finished the course on LinkedIn, many recruiters started approaching me.”

What's Next? Even More

You can also upgrade your knowledge with the Advanced RAG & Multi-Agent Testing course, which will make you the most sought-after employee in your field.

Minimum system requirements

macOS:

Processor: Apple Silicon M1, M2, M3 or M4

Memory: 16 GB RAM (or higher)

Storage: 30 GB free SSD space

Note: Mac OS systems without an M chip are not supported

Windows:

Processor: Intel Core i5 / i7 or AMD Ryzen 5 / 7

Memory: 16 GB RAM (or higher)

GPU: Dedicated GPU with ≥ 6 GB VRAM (e.g., NVIDIA RTX 2060 / 3060)

Storage: 30 GB free SSD space

Become an AI Evaluation Engineer in 5 weeks

Cohort April 2026

Everything is hands-on. You build real evaluation suites from week one.

Is this program for you?

Be Fully AI Job-Ready by Graduation

Why learning AI & LLM Testing is a must for Every QA in 2026?

You After the "Break Into AI Testing: The Next-Gen Quality Engineer Skillset!" Course

Will I get a certificate?

Our alumni work at

Our alumni work at

We've taught 1,000+ students to ...

Learn From The Best

What you'll achieve in 5-weeks

Benefits You Won't Find Anywhere Else

What's Next? Even More

Internship in AI Startup (Stella Foster)

Advanced RAG & Multi-Agent Testing

Minimum system requirements

What is the duration of this course?

What is the duration of this course?

Will Engenious University help with Career Search?

Will Engenious University help with Career Search?

What are the system requirements to join?

What are the system requirements to join?

Do we work with agent frameworks like AutoGen, LangGraph, or CrewAI?

Do we work with agent frameworks like AutoGen, LangGraph, or CrewAI?

What is the AI Career Accelerator program?

What is the AI Career Accelerator program?

Is there coverage beyond text-only LLMs (e.g., CV or audio)?

Is there coverage beyond text-only LLMs (e.g., CV or audio)?

Do we cover data quality validation and production monitoring (data drift, model drift, feedback loops)?

Do we cover data quality validation and production monitoring (data drift, model drift, feedback loops)?

Who is this program for?

Who is this program for?

Which modules include RAG pipelines, grounding validation, and hallucination detection?

Which modules include RAG pipelines, grounding validation, and hallucination detection?

What will I achieve by completion?

What will I achieve by completion?

What’s the weekly breakdown?

What’s the weekly breakdown?

What makes this program different?

What makes this program different?

Is there a payment plan?

Is there a payment plan?

What career paths does this program prepare me for?

What career paths does this program prepare me for?

Are there prerequisites?

Are there prerequisites?

Become an AI Evaluation Engineer in 5 weeks

Cohort April 2026

Everything is hands-on. You build real evaluation suites from week one.

Is this program for you?

Be Fully AI Job-Ready by Graduation

Why learning AI & LLM Testing is a must for Every QA in 2026?

You After the "Break Into AI Testing: The Next-Gen Quality Engineer Skillset!" Course

Will I get a certificate?

Our alumni work at

Our alumni work at

We've taught 1,000+ students to ...

Learn From The Best

What you'll achieve in 5-weeks

Benefits You Won't Find Anywhere Else

What's Next? Even More

Internship in AI Startup (Stella Foster)

Advanced RAG & Multi-Agent Testing

Minimum system requirements

What is the duration of this course?

What is the duration of this course?

Will Engenious University help with Career Search?

Will Engenious University help with Career Search?

What are the system requirements to join?

What are the system requirements to join?

Do we work with agent frameworks like AutoGen, LangGraph, or CrewAI?

Do we work with agent frameworks like AutoGen, LangGraph, or CrewAI?

What is the AI Career Accelerator program?

What is the AI Career Accelerator program?

Is there coverage beyond text-only LLMs (e.g., CV or audio)?

Is there coverage beyond text-only LLMs (e.g., CV or audio)?

Do we cover data quality validation and production monitoring (data drift, model drift, feedback loops)?

Do we cover data quality validation and production monitoring (data drift, model drift, feedback loops)?