Course Syllabus

Course Information

Item	Details
Course Title	AI Systems (Machine Learning Systems: Agentic Workflows and Harness Engineering)
Course Code	AI4001
Credits	3 credits (2 lecture + 1 lab)
Target Students	AI Department, 4th year
Contact Hours	3 hours/week × 16 weeks
Instructor	Young Joon Lee (yj.lee@chu.ac.kr)
Classroom	AI Lab (DGX H100 Cluster)
Course Site	https://ai-systems-2026.halla.ai/

Course Overview

The AI industry of 2025–2026 has decisively shifted from conversational models to autonomous agentic systems. This course is organized around two central themes: harness engineering — the discipline of controlling non-deterministic AI agents with deterministic systems — and the Human-on-the-Loop (HOTL) paradigm, in which humans take on strategic oversight rather than step-by-step approval.

Students will directly operate the NVIDIA DGX H100 server in the Cheju Halla University AI Lab (partitioned via MIG technology), gaining hands-on experience deploying and running open-source models such as DeepSeek-Coder-V2 without relying on commercial APIs. The ultimate goal is to design an autonomous software development pipeline (multi-agent SDLC) and implement it in the capstone project Ralphthon.

Learning Objectives

Upon completing this course, students will be able to:

Explain the differences among HITL, HOTL, and HIC governance architectures and implement them as Governance-as-Code
Design and implement harness engineering systems using the Ralph Loop methodology
Optimize long-running agent loops through context window management and instruction tuning
Build a multi-agent SDLC by connecting planner, coder, and QA agents via MCP
Deploy open-source LLMs in a DGX H100 + MIG + vLLM environment and measure their performance
Automatically evaluate the quality of agent systems using LLM-as-Judge and telemetry

Weekly Schedule

Phase 1: Foundations of Agentic Systems (Weeks 1–3)

Week	Theory	Lab
Week 1	Course orientation, AI system paradigm shift, HITL vs HOTL	Dev environment setup, AI coding CLI installation (Lab 01)
Week 2	HOTL governance in depth, EU AI Act compliance, Governance-as-Code	Implementing the first agentic loop (Lab 02)
Week 3	MCP architecture deep dive, TBAC/governance gateway, MIG compute isolation	MCP server implementation and security verification (Lab 03)

Phase 2: Harness Engineering (Weeks 4–6)

Week	Theory	Lab
Week 4	Loop paradigms — Test-time Compute Scaling, Ralph Loop/RLM/autoresearch	Ralph Loop implementation and cumulative learning (Lab 04)
Week 5	Context window management, preventing Context Rot, state tracking file design	Implementing a context management system (Lab 05)
Week 6	Instruction tuning, the “Sign” metaphor, designing permanent context payloads	PROMPT.md tuning practice (Lab 06)

Phase 3: Multi-Agent SDLC (Weeks 7–9)

Week	Theory	Lab
Week 7	Agent role division, traditional SDLC vs agentic SDLC	Multi-agent pipeline design (Lab 07)
Week 8	Project proposal presentation (midterm replacement)	Individual capstone proposal presentation — no exam
Week 9	Planner agent (2-phase separation), QA independence, automated test pipeline	Planner + QA agent implementation (Lab 09)

Phase 4: Open-Source Models & MLOps (Weeks 10–12)

Week	Theory	Lab
Week 10	DeepSeek-Coder-V2 architecture, open-source vs commercial API, tool ecosystem	vLLM deployment practice (Lab 10)
Week 11	vLLM high-throughput inference, CUDA optimization, MIG slice utilization	Building a high-throughput inference server
Week 12	Telemetry design, LLM-as-Judge evaluation framework, cost optimization	Telemetry & LLM-as-Judge (Labs 11 & 12)

Phase 5: Capstone Ralphthon (Weeks 13–16)

Week	Content
Week 13	Individual project kickoff, architecture design document (expanding the Week 8 proposal)
Week 14	Ralphthon execution — harness implementation, agent integration, iterative improvement
Week 15	System integration, automated testing, presentation preparation
Week 16	Final presentation and demo, peer evaluation, course wrap-up

Grading

Item	Weight	Description
Lab Assignments (Lab 01–12)	40%	Individual submission via GitHub PR
Midterm Project	20%	Individual project proposal presentation (Week 8, midterm replacement)
Capstone Ralphthon	30%	Individual project with final presentation
Contribution & Participation	10%	GitHub PR contributions, class discussion

Textbooks and References

Primary References

Huntley, G. (2025). The Ralph Loop: Deterministic Agentic Engineering
Anthropic. (2026). Claude Code Documentation
NVIDIA. (2025). DGX H100 MIG Configuration Guide

Additional References

Karpathy, A. (2025). autoresearch: Automated ML Research
Zhang, A. et al. (2025). Recursive LM: Language Models that Call Themselves
Snell, C. et al. (2024). Scaling LLM Test-Time Compute Optimally
Hong, S. et al. (2024). MetaGPT: Meta Programming for Multi-Agent Collaborative Framework (ICLR 2024)
Qian, C. et al. (2024). ChatDev: Communicative Agents for Software Development (ACL 2024)
Damani, S. et al. (2025). Towards a Science of Scaling Agent Systems (DeepMind + MIT)

Online Resources

Model Context Protocol Official Docs
vLLM Official Docs
DeepSeek-Coder-V2 Paper
Gemini CLI Docs — Google’s AI coding CLI
Codex CLI Docs — OpenAI’s terminal coding agent
A2A Protocol Specification — Google’s Agent-to-Agent communication standard
SWE-Bench — Agent coding benchmark

Prerequisites

Python 3.10+ programming proficiency
Basic Git/GitHub usage
Linux command-line fundamentals
Machine learning basics (prerequisite courses: AI Fundamentals, Deep Learning)

Contact

For course-related inquiries, please reach out via GitHub Issue or email (yj.lee@chu.ac.kr). If you find errors in course materials, please report them via GitHub Issue.