-
Download
all_data.csvfrom https://www.kaggle.com/c/jigsaw-unintended-bias-in-toxicity-classification/data. -
Run
python process_labeled.py --root ROOT, whereROOTis where you downloadedROOT. This will createall_data_with_identities.csvin the same folder, which is the labeled data that we use in WILDS. -
After the above step, run
python process_unlabeled.py --root ROOT, whereROOTis where you downloadedROOT. This will createunlabeled_data_with_identities.csvin the same folder, which is the unlabeled data that we optionally use in WILDS.
civilcomments
Directory actions
More options
Directory actions
More options
civilcomments
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
parent directory.. | ||||