-
Notifications
You must be signed in to change notification settings - Fork 349
Closed
Description
Hi,
I'm currently working on a new non-English ELECTRA model. Training on GPU seems to work and is running fine 🤗
Next steps would be to try model training on a TPU, so I would just like to ask if you can post the final loss of both base and large models (or even share the loss training curve) so that we have a kind of reference point when training own models 🤔
Thanks many in advance,
Stefan
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels