Adding FasterTransformer #103
Merged
moconnor725 merged 1 commit into NVIDIA:master from lxp121:master on Jul 13, 2019
Conversation
PeganovAnton pushed a commit to PeganovAnton/DeepLearningExamples that referenced this pull request on Sep 8, 2020: Adding FasterTransformer
FasterTransformer is a faster transformer layer inference implementation for BERT and other transformer-based models.
FasterTransformer implements an equivalent but highly optimized BERT transformer layer for inference. On Volta and Turing GPUs, FP16 precision is used automatically to exploit the computing power of Tensor Cores.
FasterTransformer is built on top of CUDA and cuBLAS. It supports three sequence lengths: 32, 64, and 128. Two key parameters of the transformer layer, the number of heads and the size of each head, are passed at runtime. Thus not only BERT Base (12 heads * 64 per head) but also customized models such as 4 heads * 32 per head or 8 heads * 96 per head are well supported. Our implementation shows good speedups at both small and large batch sizes.
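To make the runtime-parameter point concrete, here is a minimal C++ sketch of how a caller might describe a layer to the library. The struct and field names are illustrative stand-ins, not the library's actual API; only the numeric relationships (supported sequence lengths, hidden size = heads * size per head) come from the description above.

```cpp
// Illustrative only: a config struct mirroring the parameters the PR says
// are passed at runtime. Names are hypothetical, not FasterTransformer's API.
#include <cstdio>

struct TransformerLayerConfig {
  int batch_size;     // sequences per batch
  int seq_len;        // must be one of the supported lengths: 32, 64, or 128
  int head_num;       // number of attention heads, e.g. 12 for BERT Base
  int size_per_head;  // dimension of each head, e.g. 64 for BERT Base
};

int main() {
  // BERT Base: 12 heads * 64 per head -> hidden size 768
  TransformerLayerConfig bert_base{8, 128, 12, 64};
  // A customized model: 4 heads * 32 per head -> hidden size 128
  TransformerLayerConfig custom{8, 32, 4, 32};

  std::printf("BERT Base hidden size: %d\n",
              bert_base.head_num * bert_base.size_per_head);
  std::printf("Custom hidden size:    %d\n",
              custom.head_num * custom.size_per_head);
  return 0;
}
```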
A C++ API, a TensorRT plugin, and a TensorFlow OP wrapper are available. You can easily integrate this optimized transformer layer into TensorFlow or other inference services built in native C++ or TensorRT. In addition to code that illustrates the API invocations, we also provide a simple end-to-end BERT TensorFlow inference sample.
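As a rough sketch of what integrating the layer into a native C++ inference service could look like, the snippet below builds one layer object up front and reuses it per request. The FtEncoderLayer class and its forward method are hypothetical stubs standing in for the real C++ API, whose actual signatures live in the repository.

```cpp
// Hedged sketch: FtEncoderLayer is a hypothetical stand-in for the real
// optimized layer; its forward() here is a placeholder, not the CUDA path.
#include <cstdio>
#include <vector>

class FtEncoderLayer {
 public:
  FtEncoderLayer(int head_num, int size_per_head)
      : hidden_(head_num * size_per_head) {}
  // In the real library this would launch the optimized attention + FFN
  // kernels on the GPU; here it just copies input to output.
  void forward(const std::vector<float>& in, std::vector<float>& out) const {
    out = in;
  }
  int hidden() const { return hidden_; }

 private:
  int hidden_;
};

int main() {
  const int batch = 1, seq_len = 32;  // 32 is one of the supported lengths
  // Construct once with the runtime parameters, then reuse per request.
  FtEncoderLayer layer(/*head_num=*/12, /*size_per_head=*/64);
  std::vector<float> input(batch * seq_len * layer.hidden(), 0.5f);
  std::vector<float> output;
  layer.forward(input, output);
  std::printf("output elements: %zu\n", output.size());
  return 0;
}
```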