-
Notifications
You must be signed in to change notification settings - Fork 558
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
perf: improve sampling/mask/softmax performance (part 1/2)
#2044
opened Nov 5, 2025 by
yzh119
Loading…
5 tasks
fix: support both pip and uv pip for finding flashinfer-python package
#2043
opened Nov 5, 2025 by
djmmoss
Loading…
feat: Add flashinfer.rope.rope_quantize_fp8_append_paged_kv_cache (fused RoPE + Q + KV cache, supports MLA/GQA/MHA)
#2037
opened Nov 4, 2025 by
kahyunnam
Loading…
5 tasks done
Enable renormalize(naive) routing for fp8 per-tensor
#2030
opened Nov 3, 2025 by
IwakuraRein
•
Draft
5 tasks
feat: suitable_auto_backends to prune auto backends, bmm_fp8 refactor, heuristic_func intake
#2029
opened Nov 3, 2025 by
jimmyzho
Loading…
3 of 5 tasks
Refactor flashinfer/__init__.py so that applications could selectively pack submodules without modifying __init__.py
#2027
opened Nov 3, 2025 by
bangshengtang
Loading…
5 tasks done
[feat] Refactor trtllmgen MOE and add Bf16 trtllmgen moe
#2014
opened Oct 30, 2025 by
jiahanc
Loading…
5 tasks done
refactor: backend_requirement + supported_compute_capability decorator for gemm
#2000
opened Oct 29, 2025 by
jimmyzho
Loading…
5 tasks
Update trtllm-gen fused moe routing kernel and add more kernels
#1955
opened Oct 20, 2025 by
jiahanc
Loading…
3 of 5 tasks
chore: agentic workflow for automatic version bump
#1947
opened Oct 19, 2025 by
yzh119
Loading…
5 tasks
chore: upgrade cutlass moe kernel launcher to match trtllm
#1925
opened Oct 13, 2025 by
aleozlx
Loading…
4 of 5 tasks
Fix "cannot find -lcuda & -lcudart" problem in WSL2
#1909
opened Oct 10, 2025 by
HelloCard
Loading…
3 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.