βBackHhuggingface/alignment-handbook0Copy as MarkdownView on GitHubβ0 starsΒ·0 forksΒ·0 viewsAlignment Handbookπ€ Models & Datasets | π Technical Report FeaturesReinforcement Learning - Directly distills alignment preferences into language models.