Skip to content

Course Syllabus

ItemDetails
Course TitleAI Systems (Machine Learning Systems: Agentic Workflows and Harness Engineering)
Course CodeAI4001
Credits3 credits (2 lecture + 1 lab)
Target StudentsAI Department, 4th year
Contact Hours3 hours/week × 16 weeks
InstructorYoung Joon Lee (yj.lee@chu.ac.kr)
ClassroomAI Lab (DGX H100 Cluster)
Course Sitehttps://ai-systems-2026.halla.ai/

The AI industry of 2025–2026 has decisively shifted from conversational models to autonomous agentic systems. This course is organized around two central themes: harness engineering — the discipline of controlling non-deterministic AI agents with deterministic systems — and the Human-on-the-Loop (HOTL) paradigm, in which humans take on strategic oversight rather than step-by-step approval.

Students will directly operate the NVIDIA DGX H100 server in the Cheju Halla University AI Lab (partitioned via MIG technology), gaining hands-on experience deploying and running open-source models such as DeepSeek-Coder-V2 without relying on commercial APIs. The ultimate goal is to design an autonomous software development pipeline (multi-agent SDLC) and implement it in the capstone project Ralphthon.

Upon completing this course, students will be able to:

  1. Explain the differences among HITL, HOTL, and HIC governance architectures and implement them as Governance-as-Code
  2. Design and implement harness engineering systems using the Ralph Loop methodology
  3. Optimize long-running agent loops through context window management and instruction tuning
  4. Build a multi-agent SDLC by connecting planner, coder, and QA agents via MCP
  5. Deploy open-source LLMs in a DGX H100 + MIG + vLLM environment and measure their performance
  6. Automatically evaluate the quality of agent systems using LLM-as-Judge and telemetry

Phase 1: Foundations of Agentic Systems (Weeks 1–3)

Section titled “Phase 1: Foundations of Agentic Systems (Weeks 1–3)”
WeekTheoryLab
Week 1Course orientation, AI system paradigm shift, HITL vs HOTLDev environment setup, AI coding CLI installation (Lab 01)
Week 2HOTL governance in depth, EU AI Act compliance, Governance-as-CodeImplementing the first agentic loop (Lab 02)
Week 3MCP architecture deep dive, TBAC/governance gateway, MIG compute isolationMCP server implementation and security verification (Lab 03)

Phase 2: Harness Engineering (Weeks 4–6)

Section titled “Phase 2: Harness Engineering (Weeks 4–6)”
WeekTheoryLab
Week 4Loop paradigms — Test-time Compute Scaling, Ralph Loop/RLM/autoresearchRalph Loop implementation and cumulative learning (Lab 04)
Week 5Context window management, preventing Context Rot, state tracking file designImplementing a context management system (Lab 05)
Week 6Instruction tuning, the “Sign” metaphor, designing permanent context payloadsPROMPT.md tuning practice (Lab 06)
WeekTheoryLab
Week 7Agent role division, traditional SDLC vs agentic SDLCMulti-agent pipeline design (Lab 07)
Week 8Project proposal presentation (midterm replacement)Individual capstone proposal presentation — no exam
Week 9Planner agent (2-phase separation), QA independence, automated test pipelinePlanner + QA agent implementation (Lab 09)

Phase 4: Open-Source Models & MLOps (Weeks 10–12)

Section titled “Phase 4: Open-Source Models & MLOps (Weeks 10–12)”
WeekTheoryLab
Week 10DeepSeek-Coder-V2 architecture, open-source vs commercial API, tool ecosystemvLLM deployment practice (Lab 10)
Week 11vLLM high-throughput inference, CUDA optimization, MIG slice utilizationBuilding a high-throughput inference server
Week 12Telemetry design, LLM-as-Judge evaluation framework, cost optimizationTelemetry & LLM-as-Judge (Labs 11 & 12)

Phase 5: Capstone Ralphthon (Weeks 13–16)

Section titled “Phase 5: Capstone Ralphthon (Weeks 13–16)”
WeekContent
Week 13Individual project kickoff, architecture design document (expanding the Week 8 proposal)
Week 14Ralphthon execution — harness implementation, agent integration, iterative improvement
Week 15System integration, automated testing, presentation preparation
Week 16Final presentation and demo, peer evaluation, course wrap-up
ItemWeightDescription
Lab Assignments (Lab 01–12)40%Individual submission via GitHub PR
Midterm Project20%Individual project proposal presentation (Week 8, midterm replacement)
Capstone Ralphthon30%Individual project with final presentation
Contribution & Participation10%GitHub PR contributions, class discussion
  • Huntley, G. (2025). The Ralph Loop: Deterministic Agentic Engineering
  • Anthropic. (2026). Claude Code Documentation
  • NVIDIA. (2025). DGX H100 MIG Configuration Guide
  • Karpathy, A. (2025). autoresearch: Automated ML Research
  • Zhang, A. et al. (2025). Recursive LM: Language Models that Call Themselves
  • Snell, C. et al. (2024). Scaling LLM Test-Time Compute Optimally
  • Hong, S. et al. (2024). MetaGPT: Meta Programming for Multi-Agent Collaborative Framework (ICLR 2024)
  • Qian, C. et al. (2024). ChatDev: Communicative Agents for Software Development (ACL 2024)
  • Damani, S. et al. (2025). Towards a Science of Scaling Agent Systems (DeepMind + MIT)
  • Python 3.10+ programming proficiency
  • Basic Git/GitHub usage
  • Linux command-line fundamentals
  • Machine learning basics (prerequisite courses: AI Fundamentals, Deep Learning)

For course-related inquiries, please reach out via GitHub Issue or email (yj.lee@chu.ac.kr). If you find errors in course materials, please report them via GitHub Issue.