Propagate fatal worker errors #188

cretz · 2022-11-04T14:36:11Z

What was changed

Altered async context manager-ness of Worker to cancel the current task akin to what https://docs.python.org/3/library/asyncio-task.html#asyncio.timeout does. However, that is only in 3.11 and relies on https://docs.python.org/3/library/asyncio-task.html#asyncio.Task.uncancel to prevent inadvertent nesting/re-raise. We don't get that benefit, but I have observed no issues with the current implementation yet.
Added Worker.is_running and Worker.is_shutdown. We could have a "status"/"state" enum like TS, but this was simpler for now and doesn't keep us from doing so in the future if someone needs to know, for example, it's in the process of shutting down.
Altered Worker.run to be cancel-safe and to re-raise any polling errors returned by core
Altered Worker.shutdown to be idempotent and even be able to be called before run starts to alleviate issues like panic if Run and Stop race sdk-go#868, Worker fatal error can cause double worker stop sdk-go#903, etc
Added on_fatal_error callback that users can register. This doesn't have much value since we throw out of run anyways, but the nice thing is this one is called before we start the shutdown process if they want to do anything (I can't imagine what though).

Core does not have a way to cause an immediate fatal error (everything is retried for a minute) so I mock core in my tests, but I have confirmed that this does actually propagate the error after a little while if, say, you delete the namespace with the operator service.

Checklist

Closes Propagate fatal worker errors #25

Sushisource · 2022-11-04T16:56:47Z

temporalio/worker/worker.py

-        """Same as :py:meth:`shutdown` for use by ``async with``."""
-        await self.shutdown()
+    @property
+    def is_shutdown(self) -> bool:


Nit: A touch more precise

Suggested change

def is_shutdown(self) -> bool:

def has_finished_shutdown(self) -> bool:

I'm ok w/ lack of precision in the name so long as doc is clear

Sushisource · 2022-11-04T16:58:42Z

temporalio/worker/worker.py

+        # Cancel the shutdown task (safe if already done)
+        tasks[0].cancel()


Hmm... is it possible to use a dict with names or a named struct field or something for the tasks rather than a list where the indices matter?

No need in a local function like this IMO. I need this as a sequence for how it's used. If this variable escaped this function, for sure.

…error # Conflicts: # tests/worker/test_worker.py

Propagate fatal worker errors

9eda46f

cretz requested a review from a team November 4, 2022 14:36

Minor improvements

ff2df82

Sushisource approved these changes Nov 4, 2022

View reviewed changes

Merge remote-tracking branch 'remotes/origin/main' into worker-fatal-…

0f7be14

…error # Conflicts: # tests/worker/test_worker.py

cretz merged commit 6b9f554 into temporalio:main Nov 7, 2022

cretz deleted the worker-fatal-error branch November 7, 2022 15:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Propagate fatal worker errors #188

Propagate fatal worker errors #188

Uh oh!

cretz commented Nov 4, 2022 •

edited

Loading

Uh oh!

Sushisource Nov 4, 2022

Uh oh!

cretz Nov 4, 2022

Uh oh!

Sushisource Nov 4, 2022

Uh oh!

cretz Nov 4, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	def is_shutdown(self) -> bool:
	def has_finished_shutdown(self) -> bool:

		# Cancel the shutdown task (safe if already done)
		tasks[0].cancel()

Propagate fatal worker errors #188

Propagate fatal worker errors #188

Uh oh!

Conversation

cretz commented Nov 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What was changed

Checklist

Uh oh!

Sushisource Nov 4, 2022

Choose a reason for hiding this comment

Uh oh!

cretz Nov 4, 2022

Choose a reason for hiding this comment

Uh oh!

Sushisource Nov 4, 2022

Choose a reason for hiding this comment

Uh oh!

cretz Nov 4, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cretz commented Nov 4, 2022 •

edited

Loading