Add Dead Letter Queue class, integrate into heartbeat #918

grahamalama · 2024-03-20T18:32:01Z

Built on the groundwork of #903 (cheers @leplatrem, when I squash this I'll make sure you're a coauthor), this PR adds the classes that allow us to manage our Dead Letter Queue.

We included a basic heartbeat check in this PR to assert that we are able to read / write from the queue. I removed the integration with the app (in the /bugzilla_webhook route) for now. The idea is that we'll deploy this change first to validate that we can connect to the directory we're mounting on the infra side. Then, we'll follow on with how we integrate it into the /bugzilla_webhook route.

Leftover copypasta from sentry tests

alexcottner

looking good

- warn when we have a bug in the queue with no items - various debug messages

leplatrem

Thank you Graham for bringing this to another level 🙏 💯

jbi/queue.py

Co-authored-by: Mathieu Leplatre <[email protected]>

grahamalama · 2024-03-22T16:47:42Z

One more big though I'm having about the architecture of this PR:

We've been calling this a (singular) queue. But almost feels like we're building a queue manager. Like to me, what we return from get_all() is not a queue, but a collection of queues. And each bug has its own queue of failing messages. Thoughts?

alexcottner · 2024-03-22T17:25:20Z

We've been calling this a (singular) queue. But almost feels like we're building a queue manager. Like to me, what we return from get_all() is not a queue, but a collection of queues. And each bug has its own queue of failing messages. Thoughts?

In my head, this is a very simple partitioned queue. We have a single queue where things are going, but blocking only occurs within the partition (per bug). These terms would be more well defined if we were using an actual queue service with partitions and compute nodes.

- turn back into a method - allow callers to pass a bug_id to filter size by bug

Return iterator of items from get() Return dict of bug id, items from get_all() Return backend.get_all() from retrieve (instead of flat list)

jbi/queue.py

leplatrem

I think this is great for a first iteration.

The retries field seems unused. We can drop it if we don't have any API to manipulate it.

Ideally I would like us to tackle these two in follow-ups:

tests/unit/jira/test_queue.py

alexcottner

This is looking really good. Left comment about an additional possible test scenario.

…n't match our schema

This also means we're marking a webhook event's time property as not optional

This allows us to fetch the item identifiers in the queue without loading the items into memory

jbi/queue.py

Also, document QueueItemRetrievalError

jbi/queue.py

grahamalama added 4 commits March 20, 2024 10:15

Add pytest-asyncio

91cfb29

Add dead letter queue

d330898

Update secrets.baseline

b0c8d48

Add heartbeat check for queue availability

a7dd23d

grahamalama requested a review from a team as a code owner March 20, 2024 18:32

grahamalama added the enhancement New feature or request label Mar 20, 2024

grahamalama mentioned this pull request Mar 20, 2024

[draft] Dead letter queue backend #903

Closed

6 tasks

grahamalama added 3 commits March 21, 2024 12:17

Fix invalid_dl_queue_dsn_raises

02f5d03

Leftover copypasta from sentry tests

Ensure module and methods are properly documented

df1bd04

Log size of queue after insertion at debug level

fa05508

alexcottner approved these changes Mar 21, 2024

View reviewed changes

grahamalama added 2 commits March 21, 2024 13:00

Add test for failing file backend ping

4350974

Add some logging to the queue and backends

40c2040

- warn when we have a bug in the queue with no items - various debug messages

leplatrem reviewed Mar 22, 2024

View reviewed changes

alexcottner reviewed Mar 22, 2024

View reviewed changes

jbi/queue.py Show resolved Hide resolved

alexcottner reviewed Mar 22, 2024

View reviewed changes

jbi/queue.py Outdated Show resolved Hide resolved

grahamalama and others added 2 commits March 22, 2024 11:38

Fix logging for bugs with no entries in memory get_all

9bd9c96

Fix typo in get_all docstring

6a01d5a

Co-authored-by: Mathieu Leplatre <[email protected]>

grahamalama added 7 commits March 22, 2024 16:18

Add debug messages for writing bug to file queue

4b89323

Remove memory backend

f1eb44c

Preserve queue directory in clear()

75048e5

Refactor size

f6b8bd9

- turn back into a method - allow callers to pass a bug_id to filter size by bug

Use size for is_blocked

4467db8

Refactor get(), get_all(), retrieve()

635f42f

Return iterator of items from get() Return dict of bug id, items from get_all() Return backend.get_all() from retrieve (instead of flat list)

Add some missing typing

6d14b9c

grahamalama requested review from alexcottner and leplatrem March 26, 2024 14:05

Merge remote-tracking branch 'origin/main' into dlq-class

b2bba8e

grahamalama force-pushed the dlq-class branch from 158b2b6 to b2bba8e Compare March 26, 2024 14:12

leplatrem reviewed Mar 26, 2024

View reviewed changes

jbi/queue.py Show resolved Hide resolved

leplatrem approved these changes Mar 26, 2024

View reviewed changes

grahamalama added 2 commits March 26, 2024 13:23

payload.event.time isn't a callable

d39c7b6

Remote retries property from QueueItemFactory

6236719

alexcottner mentioned this pull request Mar 26, 2024

Adding retry process for dead letter queue #924

Merged

alexcottner reviewed Mar 26, 2024

View reviewed changes

tests/unit/jira/test_queue.py Show resolved Hide resolved

alexcottner approved these changes Mar 26, 2024

View reviewed changes

grahamalama added 4 commits March 27, 2024 12:08

Add tests for errors for invalid json and a webhook payload that does…

d6610f3

…n't match our schema

Make a queue item timestamp an alias of the event timestamp

c62e86c

This also means we're marking a webhook event's time property as not optional

Add methods for listing items in the queue

63057fc

This allows us to fetch the item identifiers in the queue without loading the items into memory

Catch and reraise custom exception for failing to read item into memory

43a7a15

leplatrem approved these changes Mar 27, 2024

View reviewed changes

jbi/queue.py Outdated Show resolved Hide resolved

Colocate custom exceptions

96222d1

Also, document QueueItemRetrievalError

alexcottner reviewed Mar 27, 2024

View reviewed changes

jbi/queue.py Show resolved Hide resolved

grahamalama and others added 2 commits March 28, 2024 10:30

Add methods to access list and list_all from queue class

1ee3ab7

merging main

8c93d37

grahamalama merged commit 323be99 into main Apr 10, 2024

grahamalama deleted the dlq-class branch April 10, 2024 19:01

leplatrem mentioned this pull request Apr 24, 2024

HTTPError entries in queue do not provide insighful details #969

Closed

Add Dead Letter Queue class, integrate into heartbeat #918

Add Dead Letter Queue class, integrate into heartbeat #918

Uh oh!

Conversation

grahamalama commented Mar 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexcottner left a comment

Choose a reason for hiding this comment

Uh oh!

leplatrem left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

grahamalama commented Mar 22, 2024

Uh oh!

alexcottner commented Mar 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

leplatrem left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alexcottner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

grahamalama commented Mar 20, 2024 •

edited

Loading

alexcottner commented Mar 22, 2024 •

edited

Loading