Added functionality of storing layers activations output. #145

Merged
copybara-service[bot] merged 3 commits into google:dev from atorero:dev
Apr 12, 2024

Conversation

@atorero (Contributor) commented Apr 12, 2024

Adds a way to save the activation outputs after applying each layer. This can be useful for two purposes:

  • Debugging the model by comparing its output to another, known-working implementation
  • Quantizing weights or activations and getting insight into the accumulated error

There will be similar functionality in the Python code, along with a diffing tool that compares two outputs and calculates metrics for them. See the output of such a tool comparing the (3 tokens) × (27 layers) outputs of the Python and the C++ (8-bit) Recurrent Gemma implementations:

[Image: per-token, per-layer diff metrics between the Python and C++ implementations]
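To illustrate the idea, here is a minimal sketch of recording per-layer activations and computing one comparison metric between two runs. The names (`ActivationRecorder`, `Record`, `MeanAbsDiff`) are hypothetical and do not reflect the PR's actual API; the real storage format and hook points in gemma.cpp may differ.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical sketch: records the activation vector produced after each
// layer, keyed by "<token>:<layer>". Not the PR's actual implementation.
class ActivationRecorder {
 public:
  void Record(int token, int layer, const std::vector<float>& activations) {
    store_[Key(token, layer)] = activations;
  }

  const std::vector<float>& Get(int token, int layer) const {
    return store_.at(Key(token, layer));
  }

  // Mean absolute difference between this run's activations and another
  // run's, for one (token, layer) pair -- the kind of per-cell metric a
  // diffing tool could report for a tokens x layers grid.
  float MeanAbsDiff(const ActivationRecorder& other, int token,
                    int layer) const {
    const std::vector<float>& a = Get(token, layer);
    const std::vector<float>& b = other.Get(token, layer);
    assert(a.size() == b.size());
    float sum = 0.0f;
    for (size_t i = 0; i < a.size(); ++i) {
      sum += std::fabs(a[i] - b[i]);
    }
    return sum / static_cast<float>(a.size());
  }

 private:
  static std::string Key(int token, int layer) {
    return std::to_string(token) + ":" + std::to_string(layer);
  }

  std::unordered_map<std::string, std::vector<float>> store_;
};
```

One recorder would be filled by the reference implementation (e.g. Python) and the other by the quantized C++ run; iterating `MeanAbsDiff` over all (token, layer) pairs yields the grid of error metrics shown in the screenshot above.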

@jan-wassenberg (Member) left a comment


Nice, thanks for updating the PR!

@jan-wassenberg added the copybara-import (Trigger Copybara for merging pull requests) label Apr 12, 2024
@copybara-service bot merged commit 05e7e2b into google:dev Apr 12, 2024

2 participants