Support weighted samples by tushuhei · Pull Request #358 · google/budoux · GitHub

Support weighted samples #358


Merged
merged 2 commits into from
Nov 9, 2023

tushuhei (Member) commented Nov 9, 2023

Update the training script to accept weights for samples.

Previously, the input data had to be formatted with -1 or 1 in the first column, indicating whether each sample is negative or positive.

1	a	b	c
-1	d	e	f
1	g	h	i	j

With this change, the first column can contain any integer: the sign still marks the sample as positive or negative, and the absolute value is the weight for that sample.

2	a	b	c
-3	d	e	f
5	g	h	i	j
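
As a rough illustration of how a row in this format can be read, here is a minimal sketch. It is not the actual code from the training script; `parse_weighted_line` is a hypothetical helper, and it assumes the columns are tab-separated with the signed weight first and the features after it.

```python
from typing import List, Tuple


def parse_weighted_line(line: str) -> Tuple[bool, int, List[str]]:
    """Split one tab-separated row into (is_positive, weight, features).

    Hypothetical helper for illustration only.
    """
    cols = line.rstrip("\n").split("\t")
    signed_weight = int(cols[0])
    is_positive = signed_weight > 0   # the sign carries the label
    weight = abs(signed_weight)       # the magnitude carries the weight
    return is_positive, weight, cols[1:]


rows = ["2\ta\tb\tc", "-3\td\te\tf", "5\tg\th\ti\tj"]
parsed = [parse_weighted_line(row) for row in rows]
for is_positive, weight, features in parsed:
    print(is_positive, weight, features)
```

Under this reading, data in the old format is still valid: a first column of 1 or -1 is simply a sample with weight 1.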

This is useful when you want to train a model prioritizing a specific subset of the training dataset.
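
One way to prepare such data is to rewrite the first column of an existing dataset, multiplying it for the rows you want to emphasize. The sketch below assumes tab-separated rows; `upweight`, `is_priority`, and `factor` are hypothetical names chosen for this example and are not part of BudouX.

```python
from typing import Callable, Iterable, Iterator, List


def upweight(
    rows: Iterable[str],
    is_priority: Callable[[List[str]], bool],
    factor: int,
) -> Iterator[str]:
    """Multiply the signed weight of every row whose features match.

    `factor` is expected to be a positive integer so the sign (label)
    of each row is preserved.
    """
    for row in rows:
        cols = row.rstrip("\n").split("\t")
        signed_weight = int(cols[0])
        if is_priority(cols[1:]):
            signed_weight *= factor
        yield "\t".join([str(signed_weight)] + cols[1:])


original = ["1\ta\tb\tc", "-1\td\te\tf", "1\tg\th\ti\tj"]
# Emphasize rows that contain the feature "d" by a factor of 3.
boosted = list(upweight(original, lambda feats: "d" in feats, 3))
```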

Change-Id: I39d62d9e2ae6f70aa0fd504491f62fadda7c85d0
Change-Id: If229dc048ed948fe2023d5321b79fa7f192d329b
@tushuhei tushuhei requested a review from kojiishi November 9, 2023 12:41
kojiishi (Collaborator) left a comment

LGTM

@tushuhei tushuhei merged commit 0d812b2 into main Nov 9, 2023
@tushuhei tushuhei deleted the weighted-data branch December 12, 2023 08:33