Recipe: Supervised Fine Tuning on Demonstrations with Unsloth#2273

Merged

virajmehta merged 34 commits intomainfrom

andrew/fine-tune-with-unsloth

Aug 11, 2025

Member

anndvision commented May 28, 2025 •

edited by ellipsis-dev bot

Loading

This recipe demonstrates how to do supervised fine tuning on demonstrations with unsloth.

Important

Adds a recipe for supervised fine-tuning using Unsloth, including setup, data preparation, model training, and deployment.

Setup:
- Adds devcontainer.json for Unsloth development environment setup.
- Specifies dependencies in pyproject.toml and requirements.txt.
Script (unsloth_nb.py):
- Demonstrates supervised fine-tuning using Unsloth with TensorZero data.
- Configures model parameters, data filtering, and training settings.
- Implements data preparation, model instantiation, and tokenizer setup.
- Converts TensorZero inferences to ChatML format for training.
- Splits data into training and validation sets.
- Configures and trains model with optional LoRA adaptation.
- Deploys fine-tuned model using Fireworks for serverless inference.
- Provides deployment and configuration instructions for the fine-tuned model.

^{This description was created by}^{for 0c66447. You can customize this summary. It will automatically update as commits are pushed.}

anndvision added 3 commits

May 27, 2025 10:25


          unsloth recipe initial commit

35f1b3d


          update requirements

96c49f0


          upload and deploy model to fireworks

4ffd340

anndvision requested review from GabrielBianconi, Copilot and virajmehta

May 28, 2025 21:14

Copilot AI reviewed

View reviewed changes

Contributor

Copilot AI left a comment

Pull Request Overview

This PR introduces a new recipe for supervised fine-tuning using Unsloth demonstrations, providing all necessary configuration and documentation for deployment and usage.

Added an auto-generated requirements.txt for the SageMaker TGI deployment.
Introduced a pyproject.toml with project settings and dependency declarations.
Created a README with setup instructions using both uv and pip.

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

File	Description
tensorzero-internal/fixtures/deployment/sagemaker-tgi/requirements.txt	Auto-generated dependency file for deployment.
recipes/supervised_fine_tuning/demonstrations/unsloth/pyproject.toml	Project configuration and dependency definitions.
recipes/supervised_fine_tuning/demonstrations/unsloth/README.md	Step-by-step setup and installation instructions.

recipes/supervised_fine_tuning/demonstrations/unsloth/README.md Outdated Show resolved Hide resolved

anndvision added 2 commits

May 28, 2025 21:17


          add compiled notebook

46afb70


          add devcontainer

5c44126

virajmehta reviewed

View reviewed changes

recipes/supervised_fine_tuning/demonstrations/unsloth/README.md Outdated Show resolved Hide resolved

virajmehta requested changes

View reviewed changes

recipes/supervised_fine_tuning/demonstrations/unsloth/README.md Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/README.md Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/.devcontainer/devcontainer.json Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Outdated Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Outdated Show resolved Hide resolved

recipes/supervised_fine_tuning/demonstrations/unsloth/unsloth_nb.py Show resolved Hide resolved

anndvision added 3 commits

June 1, 2025 22:01


          documentation updates to notebook

0de246e


          fix merge conflicts

3df03b0


          integrate experimental_render_inferences

09070b7

Contributor

ellipsis-dev bot commented Jun 2, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev

Generated with ❤️ by ellipsis.dev

anndvision added 3 commits

June 2, 2025 20:42


          support fireworks deployment

2e7a173


          support fireworks deployment

93e72f2


          add dev container instructions

c7f6b2c

Contributor

ellipsis-dev bot commented Jun 2, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev

Generated with ❤️ by ellipsis.dev

anndvision added 7 commits

June 3, 2025 10:49


          babyai requirements

90d2447


          use value instead of output

a8df558


          merge

8c603fb


          fix warning json serialization

fb74a96


          Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/f…

98b6d2a

…ine-tune-with-unsloth


          Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/f…

f959b69

…ine-tune-with-unsloth


          add experimental_list_inferences

5d87ddf

Contributor

ellipsis-dev bot commented Jun 12, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev

Generated with ❤️ by ellipsis.dev

GabrielBianconi assigned virajmehta

anndvision added 2 commits

June 16, 2025 15:01


          Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/f…

2a63b4e

…ine-tune-with-unsloth


          remove unused code from notebook

9a3bb1f

GabrielBianconi requested a review from virajmehta

August 3, 2025 18:44

anndvision added 4 commits

August 3, 2025 18:47


          Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/f…

347a457

…ine-tune-with-unsloth


          add tools to messages if they exist

276484b


          make generic

98cbc86


          fix requirements

86f66b1

Contributor

ellipsis-dev bot commented Aug 4, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev

Generated with ❤️ by ellipsis.dev

anndvision and others added 5 commits

August 5, 2025 12:39


          use unsloth docker image


          update requirements

4c1385d


          rebuild lock

0c66447


          Merge branch 'main' into andrew/fine-tune-with-unsloth

d33d7e9


          Merge branch 'main' into andrew/fine-tune-with-unsloth

50bb64e

anndvision marked this pull request as draft

August 8, 2025 14:36

anndvision added 5 commits

August 8, 2025 12:35


          draft canonical function for converting rendered samples

24dd0aa


          pin tensorzero >= 2025.8.0

bb3f6f2


          pin unsloth

3dd8370


          update base recipes lock and requirements

d2237f3


          compile notebook

239b2f8

anndvision marked this pull request as ready for review

August 11, 2025 12:05

Contributor

ellipsis-dev bot commented Aug 11, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev

Generated with ❤️ by ellipsis.dev

virajmehta approved these changes

View reviewed changes

Member

virajmehta left a comment

LGTM

virajmehta enabled auto-merge

August 11, 2025 16:22

virajmehta added this pull request to the merge queue

github-merge-queue bot removed this pull request from the merge queue due to failed status checks

virajmehta added this pull request to the merge queue

github-merge-queue bot removed this pull request from the merge queue due to failed status checks

virajmehta added this pull request to the merge queue

Merged via the queue into main with commit 83a74e8

32 checks passed

virajmehta deleted the andrew/fine-tune-with-unsloth branch

August 11, 2025 21:23

virajmehta mentioned this pull request

Implement a canonical function to convert rendered samples to external formats (e.g. ChatML) #2996

Closed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet