Issue: Parallel executor performance is mostly linear

We use capistrano to deploy on several hundred servers at once, and we noticed capistrano's performance was heavily tied to the number of servers.

To prove it I ran the following benchmark:

``` ruby
require 'benchmark'
require 'csv'
task :bench do
  servers = roles(:borg).to_a
  timings = {}

  if ENV['PREWARM']
    on servers do
      execute :echo, 'pong > /dev/null'
    end
  end

  [1, 10, 50, 100].each do |count|
    timings[count] = Benchmark.measure do
      on servers.pop(count) do
        execute :echo, 'pong > /dev/null'
      end
    end
  end

  res = [%w(count user system real)]
  timings.each do |count, timing|
    res << [count, timing.utime, timing.stime, timing.real]
  end
  puts res.map(&:to_csv).join
end
```

Here are the results: [Spreadsheet](https://docs.google.com/spreadsheets/d/1iyJC3a2AAeqnWjH2uGKp2ZClEsVSxN2QB0D5v85LUBc/edit?usp=sharing)

<img width="903" alt="capture d ecran 2016-02-02 a 18 41 29" src="https://cloud.githubusercontent.com/assets/44640/12768297/c30d65b4-c9dc-11e5-9f25-65514aa0c001.png">

The first graph is with "cold" connections, meaning it's connection establishment plus the command execution. In the second graph, all the connections were pre established before the benchmark.

I'm still investigating to figure out where exactly the bottleneck (or bottlenecks) exactly is. I know the GIL is not for nothing, but capistrano / SSHKit being IO heavy, I think there is other reasons.

cc @kirs as well as @csfrancis


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Issue: Parallel executor performance is mostly linear #326

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue: Parallel executor performance is mostly linear #326

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions