@christian-pinto christian-pinto commented Sep 1, 2025

This is a quick follow-up PR to #22820.

Fixes:

  • Broken links in the documentation
  • Reorder the model outputs in post_process_async before invoking post_process in the default interface implementation
  • Allow setting softmax=False in the pooling params for encode with plugins, so that the full raw model output is returned
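
To illustrate the ordering fix, here is a minimal sketch of the idea, not the actual vLLM code. The `Output` class, `stream_outputs`, and the shape of the stream (pairs of index and output) are assumptions made up for this example; only the names post_process and post_process_async come from the PR. Results that complete out of order are collected, sorted back into prompt order, and only then handed to the synchronous post_process.

```python
import asyncio
from dataclasses import dataclass
from typing import AsyncIterator


@dataclass
class Output:
    # Hypothetical stand-in for a per-prompt model output.
    index: int
    data: str


async def stream_outputs() -> AsyncIterator[tuple[int, Output]]:
    # Simulate results that finish in a different order than submitted.
    for i in (2, 0, 1):
        await asyncio.sleep(0)
        yield i, Output(index=i, data=f"result-{i}")


def post_process(outputs: list[Output]) -> list[str]:
    # Synchronous post-processing that assumes prompt order.
    return [o.data for o in outputs]


async def post_process_async(stream: AsyncIterator[tuple[int, Output]]) -> list[str]:
    # Collect every (index, output) pair, restore prompt order,
    # then delegate to the synchronous implementation.
    collected = [pair async for pair in stream]
    collected.sort(key=lambda pair: pair[0])
    return post_process([out for _, out in collected])


result = asyncio.run(post_process_async(stream_outputs()))
print(result)  # ['result-0', 'result-1', 'result-2']
```

Without the sort, the synchronous step would see the outputs in completion order (2, 0, 1 here), which is the kind of mismatch this PR guards against.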

@DarkLight1337 @maxdebayser

Signed-off-by: Christian Pinto <christian.pinto@ibm.com>
@mergify mergify bot added the documentation and frontend labels Sep 1, 2025

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces several valuable improvements. It fixes broken links in the documentation, which enhances usability. A critical correctness issue is addressed in post_process_async by sorting model outputs, ensuring that they are processed in the correct order. The PR also adds a new feature to disable softmax in pooling parameters for encode tasks, providing more control over the raw model output. This new feature is accompanied by corresponding updates in an example and tests. Additionally, the prithvi_io_processor plugin is simplified by removing redundant async method overrides, relying on the improved base class implementation. Overall, the changes are well-implemented and improve the robustness and functionality of IOProcessor plugins.
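
As a rough illustration of why a caller might want softmax disabled, the sketch below uses plain Python rather than the vLLM API. The `pool` helper and its `apply_softmax` flag are hypothetical names invented for this example. Softmax maps the values to probabilities that sum to 1 and is insensitive to a constant offset, so the original raw values cannot be recovered once it has been applied.

```python
import math


def softmax(values: list[float]) -> list[float]:
    # Numerically stable softmax over a flat list of scores.
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def pool(raw: list[float], apply_softmax: bool = True) -> list[float]:
    # Hypothetical pooling step: normalise only when asked to.
    return softmax(raw) if apply_softmax else list(raw)


raw_output = [2.0, -1.0, 0.5]
normalised = pool(raw_output, apply_softmax=True)
untouched = pool(raw_output, apply_softmax=False)

# A shifted input gives the same probabilities, so the offset is lost.
shifted = pool([v + 10.0 for v in raw_output], apply_softmax=True)
```

With the flag off, `untouched` equals the raw values exactly, which is what a plugin needs when it interprets the model output itself.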

@DarkLight1337 DarkLight1337 left a comment

Thanks for fixing!

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) September 1, 2025 13:55
@github-actions github-actions bot added the ready label Sep 1, 2025

@maxdebayser maxdebayser left a comment

Thanks, I didn't think about the ordering issue.

@DarkLight1337 DarkLight1337 merged commit cf91a89 into vllm-project:main Sep 1, 2025
45 of 47 checks passed
eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request Sep 9, 2025
Signed-off-by: Christian Pinto <christian.pinto@ibm.com>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Christian Pinto <christian.pinto@ibm.com>