Skip to content

Integrate matmul into FFW: 4.3x prefill speedup#243

Merged
copybara-service[bot] merged 1 commit intodevfrom
test_643259303
Jun 14, 2024
Merged

Integrate matmul into FFW: 4.3x prefill speedup#243
copybara-service[bot] merged 1 commit intodevfrom
test_643259303

Conversation

@copybara-service
Copy link

Integrate matmul into FFW: 4.3x prefill speedup

before, bf16:
27.2929 prefill tokens / sec
17.2114 tokens / sec

after, bf16
116.496 prefill tokens / sec
17.5391 tokens / sec

@copybara-service copybara-service bot force-pushed the test_643259303 branch 4 times, most recently from 8a35411 to 3e28e9e Compare June 14, 2024 13:25
```
before, bf16:
27.2929 prefill tokens / sec
17.2114 tokens / sec

after, bf16
116.496 prefill tokens / sec
17.5391 tokens / sec
```

PiperOrigin-RevId: 643328437
@copybara-service copybara-service bot merged commit 29c0c57 into dev Jun 14, 2024
@copybara-service copybara-service bot deleted the test_643259303 branch June 14, 2024 13:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant