[Bugfix][V1] Fix molmo text-only inputs #11676
base: main
Conversation
Signed-off-by: Jee Jee Li <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
Hmm, maybe we should expand our tests to cover this case...
Thanks for the fix! Can you verify that this yields the same result as the main branch on V0? I was mainly curious whether the padded dummy image input ids matter at all for text-only input.
```python
else:
    base_image_input_size = image_processor.base_image_input_size
    image_patch_size = image_processor.image_patch_size
    image_num_patch = (
        base_image_input_size[0] // image_patch_size,
        base_image_input_size[1] // image_patch_size,
    )
    n_pixels = image_patch_size * image_patch_size * 3
    n_patches = image_num_patch[0] * image_num_patch[1]

    image_length_w = image_processor.image_token_length_w
    image_length_h = image_processor.image_token_length_h
    tokens_per_image = image_length_w * image_length_h
    images = torch.full(
        (max_total_crops, n_patches, n_pixels),
        -1,
        dtype=torch.float32,
    )
    image_input_idx = torch.full(
        (max_total_crops, tokens_per_image),
        -1,
        dtype=torch.int32,
    )
```
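To make the shape arithmetic in the diff concrete, here is a minimal sketch of how the dummy tensor dimensions are derived. The numeric values (crop size 336, patch size 14, 12x12 image tokens, 13 crops) are illustrative assumptions for this sketch, not values taken from the real Molmo processor config.

```python
# Hypothetical Molmo-like processor values (illustrative only).
base_image_input_size = (336, 336)  # height, width of one crop
image_patch_size = 14               # side length of one ViT patch
image_token_length_w = 12           # image tokens along width
image_token_length_h = 12           # image tokens along height
max_total_crops = 13                # assumed crop budget

# Same arithmetic as the diff above.
image_num_patch = (
    base_image_input_size[0] // image_patch_size,
    base_image_input_size[1] // image_patch_size,
)
n_pixels = image_patch_size * image_patch_size * 3  # RGB pixels per patch
n_patches = image_num_patch[0] * image_num_patch[1]
tokens_per_image = image_token_length_w * image_token_length_h

# The dummy tensors would then be filled with -1 at these shapes.
images_shape = (max_total_crops, n_patches, n_pixels)
image_input_idx_shape = (max_total_crops, tokens_per_image)
print(images_shape)           # (13, 576, 588)
print(image_input_idx_shape)  # (13, 144)
```

The `-1` fill marks every crop and token slot as padding, which is why, for a text-only prompt, none of these dummy entries should map to real placeholder positions.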
I wasn't sure why this was in the code originally when the AI2 team made the PR to support Molmo in vLLM, but I guess it wasn't an issue back then: on V0 we didn't use the placeholder ranges for these "dummy" image input indices padded onto the prompt token ids.
I have verified using the reproduce code, and the generated results completely align with V0 on the main branch.
Where should tests be added for this PR?
Hmm, actually it seems that an empty-image case is already included there...
Alright, then I won't add tests.
It seems that we don't have any tests for Molmo at all.
Yeah, I think we decided that if the model support came from the vendor, then tests are not required.
I think that to avoid breaking the code in future PRs (especially with the ongoing V1 refactoring), we should add tests for it.
Okay, I will handle this ASAP.
Sorry, I didn't realize you added tests.
Please fix the lint errors.
tests/models/decoder_only/vision_language/vlm_utils/model_utils.py
Reproduce Code