Agents Optimization

Customization

In this session, our readings cover:

Required Readings: MODEL TRAINING & OPTIMIZATION

Core Component: Improving the Agent Brain - Training, Fine-tuning, and Optimization

Techniques for improving model capabilities and efficiency.

Key Concepts: Evaluation frameworks, guardrails, alignment (RLHF, PPO, DPO), risk assessment, jailbreaking defense, fairness, bias mitigation, toxicity prevention, agent safety protocols Key Concepts: Data preparation, instruction tuning, LoRA/DoRA, parameter-efficient fine-tuning, scaling laws, efficiency optimization

Topic Slide Deck Previous Semester
Platform - Model Customization (Instruction Tuning/LoRA) W8.1-LoRA-Team5 25course
LLM Alignment - PPO W11.2-team6-PPO 25course
LLM Post-training W14.3.DPO 25course
Open Source LLM - Mistral Data Preparation W4-OpenSourceLLM 24course
Scaling Law and Efficiency W11-ScalinglawEfficientLLM 24course
LLM Fine Tuning W14-LLM-FineTuning 24course
Model Editing and Disgorgement W10-T5-ModelEditing 24course

2025 HIGH-IMPACT PAPERS on this topic

More Readings: