Skip to content

Conversation

@MasterJH5574
Copy link
Contributor

This PR fixes a sequence reference counter bug in the KV cache: when forking a child sequnece from an existing parent sequence, the reference counter of hte parent sequence was not increased. This leads to error when the child sequence is removed, where we will check the parent's reference counter and find it is 0 and is never changed unexpectedly.

Meanwhile, this PR updates the PagedKVCache tests with some latest changes, including target-aware tile size selection.

This PR fixes a sequence reference counter bug in the KV cache:
when forking a child sequnece from an existing parent sequence,
the reference counter of hte parent sequence was not increased.
This leads to error when the child sequence is removed, where we
will check the parent's reference counter and find it is 0 and is
never changed unexpectedly.

Meanwhile, this PR updates the PagedKVCache tests with some latest
changes, including target-aware tile size selection.
@MasterJH5574
Copy link
Contributor Author

@tvm-bot rerun

5 similar comments
@MasterJH5574
Copy link
Contributor Author

@tvm-bot rerun

@yongwww
Copy link
Member

yongwww commented Mar 2, 2024

@tvm-bot rerun

@MasterJH5574
Copy link
Contributor Author

@tvm-bot rerun

@MasterJH5574
Copy link
Contributor Author

@tvm-bot rerun

@MasterJH5574
Copy link
Contributor Author

@tvm-bot rerun

@tqchen tqchen merged commit ae2ab58 into apache:main Mar 4, 2024
Lunderberg pushed a commit to Lunderberg/tvm that referenced this pull request Mar 12, 2024
This PR fixes a sequence reference counter bug in the KV cache:
when forking a child sequnece from an existing parent sequence,
the reference counter of hte parent sequence was not increased.
This leads to error when the child sequence is removed, where we
will check the parent's reference counter and find it is 0 and is
never changed unexpectedly.

Meanwhile, this PR updates the PagedKVCache tests with some latest
changes, including target-aware tile size selection.
thaisacs pushed a commit to thaisacs/tvm that referenced this pull request Apr 3, 2024
This PR fixes a sequence reference counter bug in the KV cache:
when forking a child sequnece from an existing parent sequence,
the reference counter of hte parent sequence was not increased.
This leads to error when the child sequence is removed, where we
will check the parent's reference counter and find it is 0 and is
never changed unexpectedly.

Meanwhile, this PR updates the PagedKVCache tests with some latest
changes, including target-aware tile size selection.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants