This project is an educational program focused on the alignment of small language models. It provides a technical curriculum and a series of courses designed to teach how to align models with human preferences and behaviors.
The material covers the implementation of preference optimization algorithms and the adaptation of vision-language models to process both text and image data simultaneously. It also includes instructional guides on synthetic data generation to improve model performance in specialized domains.
The curriculum encompasses supervised fine-tuning workflows, the use of chat templates to teach models to follow instructions, and the application of benchmark-driven evaluations to measure model accuracy and reliability.
The course is delivered via Jupyter Notebooks.