[Core][MultiModalHasher] Hash images without converting image mode #24969

lgeiger · 2025-09-16T15:38:34Z

Purpose

This PR hashes mode, palette and data of images separately which prevents the need for converting all images to RGBA. See #24925 (comment)

Test Plan

Correctness should be covered by the existing hasher tests on CI.

The performance can be measured using:

import numpy as np
from PIL import Image
from vllm.multimodal.hasher import MultiModalHasher

np.random.seed(42)
data = np.random.randint(0, 255, size=(3840, 2160, 3), dtype=np.uint8)
data = Image.fromarray(data)

%timeit MultiModalHasher.hash_kwargs(data=data)

Test Result

For a 4k PIL image this speeds up hashing by ~35%. This is not massive, but might add up in cases with lots of multimodal input.

# main
25.1 ms ± 124 μs per loop (mean ± std. dev. of 7 runs, 10 loops each)
# This PR
16.3 ms ± 74.6 μs per loop (mean ± std. dev. of 7 runs, 100 loops each)

Signed-off-by: Lukas Geiger <[email protected]>

gemini-code-assist

Code Review

This pull request improves the performance of hashing PIL.Image objects by avoiding conversion to RGBA. Instead, it hashes the image's mode, data, and palette separately. This is a good optimization that, according to your tests, speeds up hashing by ~35% for 4k images. I've found one potential issue with the implementation regarding hash uniqueness for palettized images and provided a suggestion to address it.

vllm/multimodal/hasher.py

Signed-off-by: Lukas Geiger <[email protected]>

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]>

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]> Signed-off-by: charlifu <[email protected]>

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]>

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

lgeiger requested review from DarkLight1337, NickLucche and ywang96 as code owners September 16, 2025 15:38

[Core][MultiModalHasher] Hash images without converting image mode

089f1e3

Signed-off-by: Lukas Geiger <[email protected]>

lgeiger force-pushed the mm-hash-image branch from 9c2612a to 089f1e3 Compare September 16, 2025 15:39

mergify bot added the multi-modality Related to multi-modality (#4194) label Sep 16, 2025

gemini-code-assist bot reviewed Sep 16, 2025

View reviewed changes

vllm/multimodal/hasher.py Show resolved Hide resolved

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 16, 2025

Add rawmode to hash

b8fe820

Signed-off-by: Lukas Geiger <[email protected]>

lgeiger force-pushed the mm-hash-image branch from 5ea7b6e to b8fe820 Compare September 16, 2025 15:53

DarkLight1337 approved these changes Sep 17, 2025

View reviewed changes

vllm-bot merged commit 03191cd into vllm-project:main Sep 17, 2025
37 of 40 checks passed

lgeiger deleted the mm-hash-image branch September 17, 2025 08:31

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Core][MultiModalHasher] Hash images without converting image mode (v…

5dafba5

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

[Core][MultiModalHasher] Hash images without converting image mode (v…

281e11e

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]> Signed-off-by: charlifu <[email protected]>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

[Core][MultiModalHasher] Hash images without converting image mode (v…

85dba39

…llm-project#24969) Signed-off-by: Lukas Geiger <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Core][MultiModalHasher] Hash images without converting image mode #24969

[Core][MultiModalHasher] Hash images without converting image mode #24969

Uh oh!

lgeiger commented Sep 16, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

[Core][MultiModalHasher] Hash images without converting image mode #24969

[Core][MultiModalHasher] Hash images without converting image mode #24969

Uh oh!

Conversation

lgeiger commented Sep 16, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lgeiger commented Sep 16, 2025 •

edited by github-actions bot

Loading