Skip to content

[BUG REPORT]: Route unmapped but still routes to a zombie container. #479

@pusherofbrooms

Description

@pusherofbrooms

Summary

On occasion (it has happened thrice so far in the past year), a customer will unmap a route from an application and delete the application, and yet one of the containers will still be running, and route emitter will still be advertising the route to the gorouters. I didn't see any related errors in the route-emitter logs, not in the rep logs, though rep logs may have rotated out before I came on the scene.

The short term fix up to now is to "monit stop all", ensure all container processes are dead, and then start the processes. Alternatively, shutting down the cell and letting "scan and fix" sweep it up also works.

Expected Result

Route is unmapped
App is stopped
Containers stop
route no longer routes to any containers.

Actual Result

Route is unmapped
App is stopped
One container is not stopped
the unmapped route still routes to the undead container.

Diego repo

I don't have a lot of evidence, but since the container didn't stop I primarily suspect garden-runc https://github.com/cloudfoundry/garden-runc-release

Secondarily, route-emitter https://github.com/cloudfoundry/route-emitter since unmapping of the route didn't complete.

Context

Diego-release 2.42.0
as part of cf-deployment v12.28.0 running on AWS.

Steps to Reproduce

I don't have a consistent way to reproduce the problem as it has only happened 3 times in the past year and isn't predictable.

Possible Causes or Fixes (optional)

Additional Text Output or Screenshots (optional)

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions