1 file changed: +2 -2 lines changed
@@ -56,11 +56,11 @@ and/or
 The Python API of TensorRT-LLM is architectured to look similar to the
 [PyTorch](https://pytorch.org) API. It provides users with a
 [functional](./tensorrt_llm/functional.py) module containing functions like
-`einsum`, `softmax`, `matmul` or `view`. The [layer](./tensorrt_llm/layer)
+`einsum`, `softmax`, `matmul` or `view`. The [layers](./tensorrt_llm/layers)
 module bundles useful building blocks to assemble LLMs; like an `Attention`
 block, a `MLP` or the entire `Transformer` layer. Model-specific components,
 like `GPTAttention` or `BertAttention`, can be found in the
-[model](./tensorrt_llm/model) module.
+[models](./tensorrt_llm/models) module.
 
 TensorRT-LLM comes with several popular models pre-defined. They can easily be
 modified and extended to fit custom needs. See below for a list of supported