Skip to content

Recipe: Supervised Fine Tuning on Demonstrations with Unsloth#2273

Merged
virajmehta merged 34 commits intomainfrom
andrew/fine-tune-with-unsloth
Aug 11, 2025
Merged

Recipe: Supervised Fine Tuning on Demonstrations with Unsloth#2273
virajmehta merged 34 commits intomainfrom
andrew/fine-tune-with-unsloth

Conversation

@anndvision
Copy link
Member

@anndvision anndvision commented May 28, 2025

This recipe demonstrates how to do supervised fine tuning on demonstrations with unsloth.


Important

Adds a recipe for supervised fine-tuning using Unsloth, including setup, data preparation, model training, and deployment.

  • Setup:
    • Adds devcontainer.json for Unsloth development environment setup.
    • Specifies dependencies in pyproject.toml and requirements.txt.
  • Script (unsloth_nb.py):
    • Demonstrates supervised fine-tuning using Unsloth with TensorZero data.
    • Configures model parameters, data filtering, and training settings.
    • Implements data preparation, model instantiation, and tokenizer setup.
    • Converts TensorZero inferences to ChatML format for training.
    • Splits data into training and validation sets.
    • Configures and trains model with optional LoRA adaptation.
    • Deploys fine-tuned model using Fireworks for serverless inference.
    • Provides deployment and configuration instructions for the fine-tuned model.

This description was created by Ellipsis for 0c66447. You can customize this summary. It will automatically update as commits are pushed.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces a new recipe for supervised fine-tuning using Unsloth demonstrations, providing all necessary configuration and documentation for deployment and usage.

  • Added an auto-generated requirements.txt for the SageMaker TGI deployment.
  • Introduced a pyproject.toml with project settings and dependency declarations.
  • Created a README with setup instructions using both uv and pip.

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

File Description
tensorzero-internal/fixtures/deployment/sagemaker-tgi/requirements.txt Auto-generated dependency file for deployment.
recipes/supervised_fine_tuning/demonstrations/unsloth/pyproject.toml Project configuration and dependency definitions.
recipes/supervised_fine_tuning/demonstrations/unsloth/README.md Step-by-step setup and installation instructions.

@ellipsis-dev
Copy link
Contributor

ellipsis-dev bot commented Jun 2, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev


Generated with ❤️ by ellipsis.dev

@ellipsis-dev
Copy link
Contributor

ellipsis-dev bot commented Jun 2, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev


Generated with ❤️ by ellipsis.dev

@ellipsis-dev
Copy link
Contributor

ellipsis-dev bot commented Jun 12, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev


Generated with ❤️ by ellipsis.dev

@ellipsis-dev
Copy link
Contributor

ellipsis-dev bot commented Aug 4, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev


Generated with ❤️ by ellipsis.dev

@anndvision anndvision marked this pull request as draft August 8, 2025 14:36
@anndvision anndvision marked this pull request as ready for review August 11, 2025 12:05
@ellipsis-dev
Copy link
Contributor

ellipsis-dev bot commented Aug 11, 2025

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev


Generated with ❤️ by ellipsis.dev

Copy link
Member

@virajmehta virajmehta left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@virajmehta virajmehta enabled auto-merge August 11, 2025 16:22
@virajmehta virajmehta added this pull request to the merge queue Aug 11, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Aug 11, 2025
@virajmehta virajmehta added this pull request to the merge queue Aug 11, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Aug 11, 2025
@virajmehta virajmehta added this pull request to the merge queue Aug 11, 2025
Merged via the queue into main with commit 83a74e8 Aug 11, 2025
32 checks passed
@virajmehta virajmehta deleted the andrew/fine-tune-with-unsloth branch August 11, 2025 21:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants