
Conversation

mitchellxh (Contributor) commented Oct 10, 2025

Summary

  • avoid retaining per-image probability tensors on the GPU during prediction
  • detach probabilities to CPU and drop intermediate GPU tensors after each batch
  • prevents large runs (~7k images) from exceeding the available 32 GB of VRAM

Fixes #145.
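
For reference, a minimal sketch of the per-batch pattern described above follows. The loop structure, encoder call, and dataloader are illustrative assumptions about the surrounding code rather than the project's actual API; only the detach-to-CPU step corresponds to this change.

import torch

@torch.no_grad()
def predict_probabilities(model, dataloader, txt_features, device="cuda"):
    # Keep only CPU copies of the per-batch probabilities so GPU memory
    # stays bounded by the batch size instead of growing with the run.
    all_probs = []
    for images in dataloader:
        img_features = model.encode_image(images.to(device))  # assumed encoder call
        probs = model.create_probabilities(img_features, txt_features)  # name taken from the snippet discussed below
        all_probs.append(probs.detach().cpu())  # move each result off the GPU
        # img_features and probs are reassigned on the next iteration, so the
        # previous GPU tensors become unreferenced and their memory is reused
    return torch.cat(all_probs)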

hlapp (Member) commented Oct 10, 2025

@mitchellxh thanks so much for your contribution!! Would you mind creating an issue that documents the problem you're seeing without this change? We can then link this PR to it.

Depending on the nature of the problem, we might also want to add a test case.

hlapp (Member) commented Oct 11, 2025

@mitchellxh many thanks for posting the issue! Is the del step required, or would the following also work:

probs = self.create_probabilities(img_features, txt_features)
probs = probs.detach().cpu()

hlapp (Member) commented Oct 11, 2025

And for img_features, did you see that the variable going out of scope does not release the GPU memory, necessitating explicit deletion of the object?

mitchellxh (Contributor, Author) commented

@hlapp the deletion step is not needed!

img_features can be kept in check by setting a batch size appropriate for the available VRAM.
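
To illustrate both points with a small, self-contained example (the sizes below are made up): once the name is reassigned for the next batch, the previous tensor is unreferenced and its GPU memory returns to PyTorch's caching allocator, so no explicit del is needed, and peak usage is governed by the batch size.

import torch

batch_size = 64      # chosen to fit the available VRAM
feature_dim = 512    # illustrative size only

for step in range(3):
    # Reassigning the name drops the reference to the previous batch's
    # tensor, so its GPU memory is released back to the allocator.
    img_features = torch.randn(batch_size, feature_dim, device="cuda")
    print(f"step {step}: {torch.cuda.memory_allocated() / 1e6:.1f} MB allocated")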

hlapp (Member) commented Oct 11, 2025

I've simplified the fix to what I understand is strictly necessary. @mitchellxh if you have time, could you check that the result is still equivalent in outcome to your initial patch?

hlapp requested a review from Copilot on October 11, 2025 at 17:30
Copilot AI (Contributor) left a comment


Pull Request Overview

This PR fixes a GPU memory management issue by releasing GPU tensors after prediction to prevent VRAM accumulation during large batch processing.

  • Adds explicit tensor detachment from GPU to CPU after probability computation
  • Prevents GPU memory exhaustion when processing large image sets (~7k images)
  • Includes explanatory comment for the memory management fix

hlapp (Member) left a comment


Again, thanks @mitchellxh for your contribution. I'll wait a bit for your 👍🏻 to my simplification before merging to ensure I haven't missed or misunderstood anything.

mitchellxh (Contributor, Author) commented

Confirmed! VRAM usage remains stable with your simplified fix.
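
One way to spot-check this on a large run (not part of the patch; the predict call below is a hypothetical placeholder) is to read PyTorch's peak-memory counter around the prediction:

import torch

torch.cuda.reset_peak_memory_stats()
# ... run the full prediction over the image set here, e.g.
# results = predictor.predict(image_paths)   # hypothetical call
peak_gb = torch.cuda.max_memory_allocated() / 1e9
print(f"peak GPU memory during prediction: {peak_gb:.2f} GB")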

hlapp merged commit 412ed1f into Imageomics:main on Oct 13, 2025


Development

Successfully merging this pull request may close these issues.

GPU OOM during prediction
