Commit c15d09e

Author: Swetha Mandava

moving trtis section in readme and github issue 131

1 parent 2b3b299, commit c15d09e

2 files changed: +532 -542 lines changed

TensorFlow/LanguageModeling/BERT/README.md

Lines changed: 6 additions & 5 deletions
@@ -99,9 +99,10 @@ These techniques and optimizations improve model performance and reduce training
 ### Feature support matrix
 
 The following features are supported by this model.
-| Feature | BERT|
-|-----------------------|--------------------------|
-|Horovod Multi-GPU |Yes|
+
+| **Feature** | **BERT** |
+|:-----------------------:|:--------------------------:|
+| Horovod Multi-GPU | Yes |
 
 #### Features
 
@@ -639,12 +640,12 @@ Our results were obtained by running the `run_squad.py` training script in the T
 
 The following tables compare `F1` scores across 5 different training runs with different seeds, for both FP16 and FP32 respectively. The runs showcase consistent convergence on all 5 seeds with very little deviation.
 
-| **FP16, 8x GPUs** | **seed #1** | **seed #2** | **seed #3** | **seed #4** | **seed #5** | **mean** | **std** |
+| **FP16, 8x GPUs** | **seed 1** | **seed 2** | **seed 3** | **seed 4** | **seed 5** | **mean** | **std** |
 |:-----------:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|
 |F1 |90.75|90.82|90.89|91.05|90.79|90.86|0.12|
 |Exact match|83.85|83.93|83.95|84.25|83.59|83.91|0.24|
 
-| **FP32, 8x GPUs** | **seed #1** | **seed #2** | **seed #3** | **seed #4** | **seed #5** | **mean** | **std** |
+| **FP32, 8x GPUs** | **seed 1** | **seed 2** | **seed 3** | **seed 4** | **seed 5** | **mean** | **std** |
 |:-----------:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|
 |F1 |90.70|90.80|90.89|91.08|90.73|90.84|0.15 |
 |Exact match|83.82|83.77|84.23|84.19|83.63|83.93|0.27 |
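
Reviewer note: the `mean` and `std` columns in the two tables above can be checked against the per-seed scores. The following is a minimal sketch, not part of the commit; `RUNS` and `summarize` are hypothetical names introduced here. It assumes `std` is the sample standard deviation (n - 1 denominator), which the README does not state but which matches the published values after rounding to two decimals.

```python
from statistics import mean, stdev

# Per-seed scores copied from the FP16 and FP32 tables in the diff above.
# RUNS and summarize are illustrative names, not part of the repository.
RUNS = {
    ("FP16", "F1"): [90.75, 90.82, 90.89, 91.05, 90.79],
    ("FP16", "Exact match"): [83.85, 83.93, 83.95, 84.25, 83.59],
    ("FP32", "F1"): [90.70, 90.80, 90.89, 91.08, 90.73],
    ("FP32", "Exact match"): [83.82, 83.77, 84.23, 84.19, 83.63],
}


def summarize(scores):
    """Return (mean, sample standard deviation), each rounded to 2 decimals."""
    # stdev() uses the n - 1 denominator; this is an assumption about how
    # the README's std column was computed.
    return round(mean(scores), 2), round(stdev(scores), 2)


if __name__ == "__main__":
    for (precision, metric), scores in RUNS.items():
        m, s = summarize(scores)
        print(f"{precision} {metric}: mean={m:.2f} std={s:.2f}")
```

Using the population formula (`statistics.pstdev`) instead gives 0.11 for the FP16 `F1` row rather than the table's 0.12, which is why the sample formula is assumed here.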
