Add the ability to run parallel tasks #200
base: master
Conversation
…sks and processing them in batches
…al of multiple locked jobs at once.
…se and DatabaseTaskResultTestCase tests
…atabaseBackendWorkerTestCase
…ple task results.
…concurrent task execution. Adds a --max-workers argument to define the maximum number of worker threads.
…o reflect changes in the worker's execution logic.
There are some things in this PR that need to be taken into account. With my proposed changes, signal.SIGINT cannot terminate a running task, because signals cannot terminate threads other than the main thread; as I have left it, the running task finishes and then the worker closes.
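A minimal sketch of that shutdown behaviour (illustrative names, not the PR's actual code): in CPython, signal handlers run only in the main thread, so the handler can only set a flag that worker threads check *between* tasks — the current task finishes, then the worker exits.

```python
import signal
import threading

# Sketch of cooperative shutdown, assuming a flag checked between tasks.
shutdown = threading.Event()


def handle_sigint(signum, frame):
    # Signal handlers run in the main thread only; we can merely
    # request a stop, not interrupt a task mid-execution.
    shutdown.set()


def worker_loop(run_one_task):
    # Finish the current task, then exit once shutdown is requested.
    while not shutdown.is_set():
        run_one_task()


def start_worker(run_one_task, num_threads=2):
    # Install the handler in the main thread, then spawn worker threads.
    signal.signal(signal.SIGINT, handle_sigint)
    threads = [
        threading.Thread(target=worker_loop, args=(run_one_task,))
        for _ in range(num_threads)
    ]
    for thread in threads:
        thread.start()
    return threads
```

This matches the behaviour described above: SIGINT never kills a task outright; it only stops the loop from claiming the next one.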
```python
for thread in threads:
    thread.join()
```
Issue: I don't think this approach is ideal. If a worker process is set to run 5 threads, and receives 4 fast tasks and 1 long task, the worker will sit processing the long task and never pick up the extra 4 tasks it has capacity for.
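One way to avoid that starvation, sketched here with `concurrent.futures` (hypothetical, not the PR's code): let each slot refill as soon as its task completes, instead of joining a whole batch of threads at once.

```python
import concurrent.futures


def process_all(tasks, max_workers=5):
    """Run tasks on a pool where a finished slot is immediately reusable.

    Hypothetical sketch: `tasks` stands in for claimed jobs. Unlike
    joining a fixed batch of threads, one long task only occupies one
    slot while the remaining slots keep taking new work.
    """
    results = []
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(task) for task in tasks]
        # as_completed yields each future as soon as it finishes,
        # so fast tasks are not held back by the slow one.
        for future in concurrent.futures.as_completed(futures):
            results.append(future.result())
    return results
```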
I'm not convinced this is necessarily a good idea. There are other systems which can be used above the worker process to handle running multiple workers, rather than adding that complexity to the worker process itself. Tools like supervisord, Kubernetes etc. have done the work on how to manage multiple processes properly; that complexity probably shouldn't live in the worker.
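As a concrete instance of the above, running multiple worker processes can be delegated to supervisord; a minimal sketch (the program name and command line are assumptions, not part of this repo):

```ini
[program:db_worker]
; Assumed invocation of the db_worker management command.
command=python manage.py db_worker
process_name=%(program_name)s_%(process_num)02d
; Run 5 separate worker processes instead of 5 threads in one worker.
numprocs=5
autorestart=true
```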
…cutor and update related configurations
…to use the valid_max_tasks validation function.
Hello @RealOrangeOne. According to the tests I've run, each worker command consumes approximately 190MB of RAM. If I have to launch 5 workers to execute simple tasks (database queries, some HTTP requests, etc.), that could consume close to 1GB. However, if threads are integrated into the command, the consumption of the 5 workers would remain around the original 190MB. I've modified the code so that no thread blocks another: if there's a large task, it doesn't block any smaller tasks. However, we've encountered two problems:
…ncy in the Worker thread configuration.
…ecutor for task execution
After using the changes I proposed in a work project, I realized that over time threads can become disabled, preventing code execution. Changing the threads to …
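A sketch of what an executor-based loop with defensive error handling can look like (hypothetical names; `claim_tasks` is an assumed callable, not this repo's API). One common cause of a pool that appears "disabled" is that an exception raised in a submitted callable is stored on its `Future` and silently lost if nobody calls `.result()`:

```python
import concurrent.futures
import logging

logger = logging.getLogger("db_worker")


def safe_run(task):
    # Catch errors inside the submitted callable: an uncaught exception
    # is captured by the Future and discarded if never inspected, which
    # can look like the pool silently stopped doing work.
    try:
        return task()
    except Exception:
        logger.exception("Task failed")
        return None


def worker_loop(claim_tasks, max_workers=5, max_iterations=None):
    # Hypothetical sketch: claim_tasks(n) is assumed to return up to n
    # locked tasks. The executor replaces manually created threads.
    iterations = 0
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as pool:
        while max_iterations is None or iterations < max_iterations:
            for task in claim_tasks(max_workers):
                pool.submit(safe_run, task)
            iterations += 1
    # Exiting the with-block waits for all submitted tasks to finish.
```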
This pull request adds support for concurrent task processing in the database-backed worker by introducing multi-threading. The worker can now claim and process multiple tasks in parallel, controlled by a new `--max-workers` option. The changes also update the core query and locking logic to support batch task retrieval, and adjust related tests to reflect the new behavior.

Concurrency and worker configuration:

- Added a `--max-workers` command-line option to the worker, allowing configuration of the maximum number of concurrent worker threads (default is 1, set by `MAX_WORKERS`) (`django_tasks/backends/database/management/commands/db_worker.py`, `django_tasks/base.py`). [1] [2] [3]
- … `max_workers` parameter (`django_tasks/backends/database/management/commands/db_worker.py`). [1] [2] [3] [4]

Task claiming and processing logic:

- The worker now claims and processes up to `max_workers` tasks concurrently using threads, instead of a single task at a time (`django_tasks/backends/database/management/commands/db_worker.py`). [1] [2]
- Updated the `get_locked` method in the queryset to return a batch of locked tasks (as a queryset slice) instead of a single result, supporting batch locking (`django_tasks/backends/database/models.py`).

Testing updates:

- … (`tests/tests/test_database_backend.py`). [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12]

These changes collectively enable the worker to process multiple tasks in parallel, improving throughput and efficiency for database-backed task queues.
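To illustrate the batch-locking idea without a database, here is a pure-Python simulation of `FOR UPDATE SKIP LOCKED`-style claiming (the real change uses a Django queryset slice; this class and its names are hypothetical):

```python
import threading


class TaskStore:
    """Simulates claiming a batch of tasks while skipping locked ones."""

    def __init__(self, task_ids):
        self._lock = threading.Lock()
        self._pending = list(task_ids)
        self._locked = set()

    def get_locked(self, batch_size):
        # Atomically claim up to batch_size unlocked tasks, skipping
        # tasks already held by another worker — analogous to slicing a
        # queryset locked with SELECT ... FOR UPDATE SKIP LOCKED.
        with self._lock:
            claimed = [
                t for t in self._pending if t not in self._locked
            ][:batch_size]
            self._locked.update(claimed)
            return claimed
```

Two workers calling `get_locked` concurrently each receive disjoint batches, which is the property the batched `get_locked` change relies on.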