Assessment

Technically Implement AI Governance

20 questions · 15 points · Pass at 80% (12/15)

1. What does Demographic Parity Difference (DPD) measure?

2. Which Python library provides MetricFrame for group fairness metrics? 2 pts

3. Why can Demographic Parity and Equalized Odds not be fulfilled simultaneously?

4. What does SHAP compute for a single prediction?

5. When is LIME better than SHAP?

6. A model card contains for a high-risk credit model: Fairness metrics show DPD=0.07. What does this mean? 2 pts

7. What is logged according to EU AI Act Art. 12 — and what is NOT logged?

8. Which tool is used for data drift detection in production operations?

9. What does MLflow track in the context of AI governance?

10. What does EU AI Act Annex IV (Technical Documentation) prescribe for high-risk systems? 2 pts

11. How often must the Technical Documentation be updated according to Art. 11?

12. A credit model shows a TPR of 0.68 for applicants < 25 years compared to 0.91 overall. What is the correct response?

13. What is the difference between SHAP for classical ML models and LLMs?

14. Which RAGAS metric measures whether a RAG response is covered by the retrieved documents?

15. What does the Microsoft Responsible AI Toolbox offer beyond Fairlearn?

16. Which tool is the best choice for Production Drift Detection?

17. An agent has: CRM access (PII), web search (untrusted), email sending. What is the risk? 2 pts

18. What does the Principle of Least Privilege mean for AI agents?

19. An agent waits 5 minutes for HITL approval. No human responds. What happens?

20. You are developing a credit scoring system. Which stack is fully compliant with the EU AI Act high-risk requirements? 2 pts