Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

[Model] Support Llama4 in vLLM ci/build documentation Improvements or additions to documentation frontend multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed v1
#16104 opened Apr 5, 2025 by houseroad Loading…
[Misc] refactor example eagle documentation Improvements or additions to documentation
#16100 opened Apr 5, 2025 by reidliu41 Loading…
[Bugfix] add hf_token to EngineArgs frontend
#16093 opened Apr 5, 2025 by paolovic Loading…
[Bugfix]fix asyncLLM test_abort v1
#16090 opened Apr 5, 2025 by KubeKyrie Loading…
[V1][Spec Decode] Do not generate draft tokens beyond max_model_len needs-tests Tests needed for this PR v1
#16087 opened Apr 5, 2025 by WoosukKwon Loading…
Add runtime precondition check for paged attention kernel. tpu Related to Google TPUs v1
#16085 opened Apr 5, 2025 by vanbasten23 Loading…
[Bugfix] fix gettid method is not define
#16084 opened Apr 5, 2025 by lengrongfu Loading…
[BugFix][Frontend] Fix LLM.chat() tokenization bug Something isn't working frontend needs-tests Tests needed for this PR
#16081 opened Apr 5, 2025 by njhill Loading…
[Fix the torch pip install] ci/build
#16080 opened Apr 4, 2025 by yangw-dev Loading…
[CI/Build] Check for dynamic inputs before running PyTorch code tpu Related to Google TPUs v1
#16079 opened Apr 4, 2025 by yarongmu-google Loading…
[WIP] Add Flex to V1 documentation Improvements or additions to documentation v1
#16078 opened Apr 4, 2025 by drisspg Draft
[V1] Scatter and gather placeholders in the model runner documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#16076 opened Apr 4, 2025 by ywang96 Loading…
[TPU][V1][DEBUG] Provide Env Variable To Disable Sampler ready ONLY add when PR is ready to merge/full CI is needed tpu Related to Google TPUs v1
#16063 opened Apr 4, 2025 by NickLucche Loading…
fix neuron config override
#16045 opened Apr 4, 2025 by ajayvohra2005 Loading…
[Misc] improve chat_with_tools example documentation Improvements or additions to documentation
#16044 opened Apr 4, 2025 by reidliu41 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.