Clamp negative orig_m #2308

mooskagh · 2025-10-05T18:34:10Z

No description provided.

Copilot

Pull Request Overview

This PR implements handling for negative orig_m values in training data by clamping them to 0.0f instead of failing validation. The change removes a strict validation assertion and adds a normalization function that warns about and corrects negative values.

- Removes strict validation that rejected negative orig_m values
- Adds NormalizeOriginalMetrics function to clamp negative orig_m values to 0.0f with warnings
- Integrates the normalization step into the file processing pipeline
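
The clamping step summarized above could look roughly like the following. This is a minimal sketch, not the code from the PR: the `Chunk` struct and the `ClampNegativeOrigM` name are hypothetical stand-ins, and the real record type in rescorer.cc carries many more fields.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a training record; only the field that
// matters for this sketch is included.
struct Chunk {
  float orig_m = 0.0f;
};

// Clamps every negative orig_m to 0.0f in place and returns the number
// of records that were changed. NaN values compare false against 0.0f
// and are therefore left untouched.
std::size_t ClampNegativeOrigM(std::vector<Chunk>& chunks) {
  std::size_t clamped = 0;
  for (Chunk& chunk : chunks) {
    if (chunk.orig_m < 0.0f) {
      chunk.orig_m = 0.0f;
      ++clamped;
    }
  }
  return clamped;
}
```

Returning the count rather than printing inside the loop lets the caller decide whether and how to report the correction.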

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

| File | Description |
| --- | --- |
| src/trainingdata/rescorer.cc | Removes validation assertion and adds normalization function for `orig_m` values |
| libs/lczero-common | Updates subproject commit reference |


Copilot · 2025-10-05T18:46:37Z

src/trainingdata/rescorer.cc

+      std::cerr << "Warning: negative orig_m (" << chunk.orig_m
+                << ") encountered; clamping to 0" << std::endl;


Using std::cerr for warnings may not be appropriate for production code. Consider using a proper logging framework or configuration-controlled warning system instead of direct stderr output.
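
One way to keep the warning without printing a line per record is to build a single summary per file and let the caller choose the sink. The sketch below is an assumption for illustration only: `ClampSummary` and its message format are hypothetical and do not appear in the PR.

```cpp
#include <cstddef>
#include <string>

// Builds a one-line summary for a processed file. Returns an empty
// string when nothing was clamped, so the caller can skip output.
// The function name and message wording are hypothetical.
std::string ClampSummary(const std::string& filename, std::size_t clamped) {
  if (clamped == 0) return std::string();
  return "Warning: clamped " + std::to_string(clamped) +
         " negative orig_m value(s) to 0 in " + filename;
}
```

A caller could write the returned string to stderr, to a log file, or drop it entirely depending on configuration.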

Tilps · 2025-10-05T20:53:55Z

I think this is the wrong approach; we should just drop weird data. If we want to fix something, it should be on the generating side.

borg323 · 2025-10-05T22:06:14Z

The backend bug (mish instead of relu as final activation) is fixed, but is the orig_m value this critical?

Tilps · 2025-10-06T06:45:38Z

It's not critical at the current time, but we have copious amounts of data that passes all these checks, so it seems unnecessary to try to allow this small amount of extra data through at rescorer time and increase the chance that we miss such a bug again in the future.
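
The alternative argued for here, rejecting a file instead of repairing it, can be sketched as a validity check that the caller uses to drop the whole file. The `Chunk` struct and `HasValidOrigM` name are hypothetical, and the exact condition used by the actual verifier is not shown in this thread.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical stand-in for a training record.
struct Chunk {
  float orig_m = 0.0f;
};

// Returns true only if every record has a non-negative orig_m.
// Because NaN >= 0.0f is false, NaN values are rejected as well;
// that choice is an assumption of this sketch.
bool HasValidOrigM(const std::vector<Chunk>& chunks) {
  return std::all_of(chunks.begin(), chunks.end(),
                     [](const Chunk& c) { return c.orig_m >= 0.0f; });
}
```

With this approach a file that fails the check is skipped, so bad values stay visible as rejected data rather than being silently corrected.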

mooskagh added 2 commits October 5, 2025 20:32

Clamp negative orig_m

ea13908

Remove orig_m from verifier.

27f4040

mooskagh marked this pull request as ready for review October 5, 2025 18:38

mooskagh requested a review from Copilot October 5, 2025 18:46

Copilot AI reviewed Oct 5, 2025


Was too spammy.

5f828ce

borg323 approved these changes Oct 5, 2025

