Pull requests: vllm-project/tpu-inference

Pull requests list

Remove SKIP_JAX_PRECOMPILE
#1018 opened Nov 5, 2025 by kyuyeunk
Remove JAX_RANDOM_WEIGHTS
#1017 opened Nov 5, 2025 by kyuyeunk
Support Embedding Model/Task
#1015 opened Nov 5, 2025 by carlesoctav
Fix default value for USE_MOE_EP_KERNEL
#1014 opened Nov 5, 2025 by kyuyeunk
Add async scheduler test to CI
#1010 opened Nov 4, 2025 by jcyang43
[Misc] Add CODEOWNERS to the project.
#988 opened Oct 31, 2025 by py4
[Feature] Add automated PyPI publishing workflow
#985 opened Oct 31, 2025 by ylangtsou
Update tpu_worker_jax.py
#982 opened Oct 30, 2025 by fenyuan-gg
Add lora layer tests
#981 opened Oct 30, 2025 by vanbasten23
[Spec Decoding] Reduce TPU <-> CPU data transfer
#961 opened Oct 28, 2025 by Lumosis
Update README.md
#956 opened Oct 27, 2025 by bvrockwell
[GPT-OSS] enable attention-sink
#943 opened Oct 27, 2025 by bzgoogle Draft
[multi-host] add quick start guide
#928 opened Oct 23, 2025 by Lumosis