
Conversation

NHDaly (Collaborator) commented Feb 14, 2022

Implements a MultiThreadedCache{K,V} and adds a stress test.

This cache has no locks on access, and only has contention on a cache
miss. It only ever holds a shared lock for a constant-time duration,
never while executing user code. A Task requesting a key that is
already being computed on another Task will block while that computation
is being performed. By taking advantage of the append-only properties of
a cache, the cache can be duplicated per-Thread, to avoid locking in the
common case.
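The fast-path/slow-path split described above could be sketched roughly like this (a minimal, hypothetical sketch with illustrative names, not the package's actual API; it also ignores task migration across threads, which a real implementation must handle):

```julia
# Hypothetical sketch: each thread keeps its own copy of the append-only
# cache, so the common-case lookup takes no lock at all. The shared lock
# guards only constant-time Dict operations, never user code.
mutable struct SketchCache{K,V}
    base_cache::Dict{K,V}             # shared source of truth; guarded by `lock`
    thread_caches::Vector{Dict{K,V}}  # one per thread; read/written lock-free
    lock::ReentrantLock
end

SketchCache{K,V}() where {K,V} = SketchCache{K,V}(
    Dict{K,V}(),
    [Dict{K,V}() for _ in 1:Threads.nthreads()],
    ReentrantLock(),
)

function Base.get!(f, cache::SketchCache{K,V}, key::K) where {K,V}
    tcache = cache.thread_caches[Threads.threadid()]
    # Fast path: no lock; this per-thread Dict is only touched by this thread.
    haskey(tcache, key) && return tcache[key]
    # Slow path: consult the shared cache under the lock (held only briefly).
    v = lock(cache.lock) do
        get(cache.base_cache, key, nothing)
    end
    if v === nothing
        v = f()  # compute outside the lock
        v = lock(cache.lock) do
            # Another task may have raced us to this key; keep the first value.
            get!(cache.base_cache, key, v)
        end
    end
    tcache[key] = v  # backfill the per-thread copy for future lock-free hits
    return v
end
```

Because the cache is append-only, a stale per-thread copy can only *miss*, never return a wrong value, which is what makes the lock-free fast path safe.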

This PR adds:

  • implementation
  • tests
  • CI configuration

NHDaly (Collaborator, Author) commented Feb 14, 2022

PR Review Request: @tveldhui, @vilterp, @Sacha0, @comnik ❤️

NHDaly and others added 4 commits February 14, 2022 12:53
Add constructor that provides pre-computed values to the base_cache
Rethrow the exception onto the Future for all blocked tasks, delete the
future, and rethrow on the current task.

The cache remains usable afterwards and nothing is recorded for the key
with the error.
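The error path described in that commit might be sketched like this (hypothetical code, not the package's actual implementation; the Future is modeled as a buffered Channel, and all names are illustrative): the computing task hands the exception to every blocked waiter and then rethrows on its own task, so nothing is recorded for the failing key and the cache stays usable.

```julia
# Hypothetical sketch of the exception-forwarding protocol.
# The computing task publishes either the value or the exception;
# waiters re-publish what they took so every blocked task sees it.
function compute_for_waiters!(f, chan::Channel)
    try
        v = f()
        put!(chan, v)   # success: publish the value for blocked waiters
        return v
    catch e
        put!(chan, e)   # failure: hand waiters the exception instead
        rethrow()       # ...and rethrow on the current task
    end
end

function wait_for_value(chan::Channel)
    x = take!(chan)
    put!(chan, x)                     # re-publish for any other waiters
    x isa Exception && throw(x)       # waiters rethrow the same exception
    return x
end
```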
The results are a bit lukewarm (glad we measured it), but look alright.

Here are a few runs, each with a different JULIA_NUM_THREADS:
```julia
┌ Info: benchmark results
│   Threads.nthreads() = 1
│   time_serial = 0.013336288
│   time_parallel = 0.09071632
└   time_baseline = 0.115363526
```
```julia
┌ Info: benchmark results
│   Threads.nthreads() = 2
│   time_serial = 0.011262138
│   time_parallel = 0.097021203
└   time_baseline = 0.139031655
```
```julia
┌ Info: benchmark results
│   Threads.nthreads() = 20
│   time_serial = 0.011997677
│   time_parallel = 0.658225544
└   time_baseline = 1.032283809
```
```julia
┌ Info: benchmark results
│   Threads.nthreads() = 100
│   time_serial = 0.013902211
│   time_parallel = 1.999772731
└   time_baseline = 9.314424419
```

So it definitely does not scale as well as a single-threaded codebase
would, but it also definitely scales better than the baseline, which is
a Mutex around a Dict.
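For context, the baseline in these numbers is roughly a single lock held around a Dict for the entire get! call, including while the value is computed. A sketch (illustrative names, not the actual benchmark code):

```julia
# Hypothetical sketch of the baseline: one lock around one Dict,
# held even while the user's compute function runs.
const baseline_lock = ReentrantLock()
const baseline_dict = Dict{Int,Int}()

function baseline_get!(f, key)
    lock(baseline_lock) do
        get!(f, baseline_dict, key)  # lock held while f() runs, too
    end
end

# Rough shape of the parallel timing above: n tasks hammering the cache.
function time_parallel_baseline(n)
    @elapsed begin
        @sync for i in 1:n
            Threads.@spawn baseline_get!(() -> i^2, i % 100)
        end
    end
end
```

Holding the lock across the compute function is what serializes the baseline at high thread counts, which matches the 100-thread numbers above.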

Maybe there will be some places to reduce contention in the future :)
Gracefully handle exceptions thrown during `get!()` functions
NHDaly (Collaborator, Author) commented Feb 15, 2022

Also, I'm open to other package naming suggestions if anyone has them! :)

vilterp (Collaborator) left a comment

LGTM with a couple questions

NHDaly and others added 7 commits February 15, 2022 12:11
Add benchmark test measuring parallel scaling.
Fix lazy construction of Dicts, per guidance from Julia Base
...... In retrospect, this does make this whole structure start to look
a lot like a Dict + a Read/Write lock, and I wonder how their
performance would compare......
NHDaly (Collaborator, Author) commented Feb 17, 2022

After all the latest changes, especially #6 for the concurrency fixes, I think this package is good to go.

It still scales a good bit better than a Dict + Mutex, so I think there's still value in it!

If we want to give another approach a shot in the future, like a concurrent hash table, I'm super supportive. But hopefully this is useful in the interim.
