Platform - Model Customization - instruction tuning / LoRA

Customization

In this session, our readings cover:

Required Readings:

Low-Rank Adaptation for Foundation Models: A Comprehensive Review

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

DoRA: Weight-Decomposed Low-Rank Adaptation

More Readings: