Support `hidden_states` Input Besides `input_ids` for Multi-Modality Models

### Feature request

Extend the current API to accept `hidden_states` as an optional input parameter in addition to `input_ids`. This would support integration with multi-modality models such as LLAVA, Video Lamma, BLIP2, etc.

### Motivation

Many modern models are designed to process more than one type of data (e.g., text and images) and require inputting `hidden_states` directly. Allowing `hidden_states` as input would enable a seamless integration of these models, enhancing the overall utility of the TGI. This change aligns with the growing need for versatile, multi-modal solutions in the AI community.

### Your contribution

I believe that this is a valuable addition to the TGI. The proposed change is relatively minor but has the potential to greatly improve functionality. I'm eager to contribute and would like to make a PR to implement this feature.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support `hidden_states` Input Besides `input_ids` for Multi-Modality Models #847

Feature request

Motivation

Your contribution

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support hidden_states Input Besides input_ids for Multi-Modality Models #847

Description

Feature request

Motivation

Your contribution

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Support `hidden_states` Input Besides `input_ids` for Multi-Modality Models #847