[Inference] Dynamic Batching Infer #4949

CjhHa1 · 2023-10-19T14:29:23Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

📝 What does this PR do?

This PR adds dynamic batching to inference. It contains two managers: an offline manager and an online async manager. The online version uses async_engine to build a Ray driver and launch the Colossal-AI distributed environment.
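
For readers new to the idea, here is a minimal sketch of the scheduling loop that dynamic batching implies: waiting requests are admitted between decode steps as soon as finished requests free up budget, instead of waiting for the whole batch to drain. This is an illustration only, not the code in this PR. `Request`, `ToyDynamicBatcher`, and `max_total_tokens` are hypothetical names, the "forward pass" is a counter increment, and `is_running` borrows only its name from the commit list; its semantics here are an assumption.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Request:
    """Hypothetical request record; not a Colossal-AI type."""
    req_id: int
    prompt_len: int        # tokens in the prompt
    max_new_tokens: int    # generation stops after this many tokens
    generated: int = 0

    @property
    def finished(self) -> bool:
        return self.generated >= self.max_new_tokens


class ToyDynamicBatcher:
    """Admits waiting requests between decode steps, within a token budget."""

    def __init__(self, max_total_tokens: int):
        self.max_total_tokens = max_total_tokens
        self.waiting = deque()
        self.running = []

    def add_request(self, req: Request) -> None:
        # Reject requests that could never fit, so the loop always terminates.
        if req.prompt_len + req.max_new_tokens > self.max_total_tokens:
            raise ValueError("request exceeds the token budget")
        self.waiting.append(req)

    def is_running(self) -> bool:
        # Assumed meaning: there is still work queued or in flight.
        return bool(self.waiting or self.running)

    def _admit(self) -> None:
        # Reserve the worst-case length of every running request.
        used = sum(r.prompt_len + r.max_new_tokens for r in self.running)
        while self.waiting:
            nxt = self.waiting[0]
            need = nxt.prompt_len + nxt.max_new_tokens
            if used + need > self.max_total_tokens:
                break
            self.running.append(self.waiting.popleft())
            used += need

    def step(self) -> list:
        """Run one decode step and return the ids of requests that finished."""
        self._admit()
        for r in self.running:
            r.generated += 1  # stand-in for one model forward pass
        done = [r.req_id for r in self.running if r.finished]
        self.running = [r for r in self.running if not r.finished]
        return done


batcher = ToyDynamicBatcher(max_total_tokens=32)
batcher.add_request(Request(0, prompt_len=8, max_new_tokens=2))
batcher.add_request(Request(1, prompt_len=8, max_new_tokens=4))
# Request 2 does not fit at first; it joins once request 0 frees its budget.
batcher.add_request(Request(2, prompt_len=10, max_new_tokens=3))

finish_order = []
steps = 0
while batcher.is_running():
    finish_order.extend(batcher.step())
    steps += 1
```

The sketch reserves each request's worst-case length on admission, which is a conservative choice made for simplicity; a real manager may account for memory differently.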

💥 Checklist before requesting a review

I have linked my PR to an issue (instruction)
My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
I have performed a self-review of my code
I have added thorough tests.
I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

🌝 Yes, I do.
🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

* finish batch manager
* 1
* first
* fix
* fix dynamic batching
* llama infer
* finish test
* support different lengths generating
* del prints
* del prints
* fix
* fix bug

---------

Co-authored-by: CjhHa1 <cjh18671720497outlook.com>

* Revert "[inference] Async dynamic batching (#4894)" This reverts commit fced140.
* Add Ray Distributed Environment Init Scripts
* support DynamicBatchManager base function
* revert _set_tokenizer version
* add driver async generate
* add async test
* fix bugs in test_ray_dist.py
* add get_tokenizer.py
* fix code style
* fix bugs about No module named 'pydantic' in ci test
* fix bugs in ci test
* fix bugs in ci test
* fix bugs in ci test

* infer engine
* infer engine
* test engine
* test engine
* new manager
* change step
* add
* test
* fix
* fix
* finish test
* finish test
* finish test
* finish test
* add license

---------

Co-authored-by: yuehuayingxueluo <[email protected]>

CjhHa1 and others added 26 commits October 11, 2023 17:52

[inference] Async dynamic batching (#4894)

fced140

* finish input and output logic
* add generate
* test forward
* 1

[inference]Re push async dynamic batching (#4901)

fbf3c09

* adapt to ray server
* finish async
* finish test
* del test

---------

Co-authored-by: yuehuayingxueluo <[email protected]>

Revert "[inference]Re push async dynamic batching (#4901)" (#4905)

d509e79

This reverts commit fbf3c09.

Revert "[inference] Async dynamic batching (#4894)"

ec004fe

This reverts commit fced140.

Revert "[inference] Async dynamic batching (#4894)" (#4909)

78cd937

This reverts commit fced140.

Add Ray Distributed Environment Init Scripts

d97290a

fix conflict

8483393

support DynamicBatchManager base function

f589e97

revert _set_tokenizer version

c070050

add driver async generate

5deb95c

add async test

306ef77

fix bugs in test_ray_dist.py

632f0e1

add get_tokenizer.py

0b2fe51

fix code style

cd843ac

fix bugs about No module named 'pydantic' in ci test

8c9ad51

fix bugs in ci test

8d0cc6b

fix bugs in ci test

acdd751

fix bugs in ci test

8a761bd

support dynamic batch for bloom model and is_running function

c76fd68

fix conflict

f41ccdd

Merge pull request #4933 from yuehuayingxueluo/ray_dist_init_branch

fca12b8

[Inference]Support dynamic batch for bloom model and add is_running function

add assertion for config (#4947)

3f6af12

[Inference] Finish dynamic batching offline test (#4948)

4867561

* test
* fix test

CjhHa1 closed this Oct 20, 2023

ver217 deleted the feature/dynamic_batching branch November 9, 2023 06:11