[docker-29.x backport] daemon: clean up dead containers on start #51693

vvoland merged 1 commit into moby:docker-29.x from
Conversation
Stopping the Engine while a container with autoremove set is running may leave behind dead containers on disk. These containers aren't reclaimed on next start, appear as "dead" in `docker ps -a`, and can't be inspected or removed by the user.

This bug has existed for a long time but became user-visible with 9f5f4f5. Prior to that commit, containers with no rwlayer weren't added to the in-memory viewdb, so they weren't visible in `docker ps -a`. However, some dangling files would still live on disk (e.g. the folder in /var/lib/docker/containers, mount points, etc.).

The underlying issue is that when the daemon stops, it tries to stop all running containers and then closes the containerd client. This leaves a small window of time during which the Engine might receive 'task stop' events from containerd and trigger autoremove. If the containerd client is closed in parallel, the Engine is unable to complete the removal, leaving the container in the 'dead' state. In that case, the Engine logs the following error:

```
cannot remove container "bcbc98b4f5c2b072eb3c4ca673fa1c222d2a8af00bf58eae0f37085b9724ea46": Canceled: grpc: the client connection is closing: context canceled
```

Solving the underlying issue would require complex changes to the shutdown sequence. Moreover, the same issue could also happen if the daemon crashes while it is deleting a container. Thus, add a cleanup step on daemon startup to remove these dead containers.

Signed-off-by: Albin Kerouanton <albin.kerouanton@docker.com>
(cherry picked from commit ec9315c)
Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>
Hey @thaJeztah @vvoland, sorry to bother on a closed PR. At Portainer we are still getting reports of users with Docker 29.1.3 having dead containers that are not automatically cleaned up on restart. The ghosts/dead containers are not listed by

Reports can be read at:

- portainer/portainer#12959
- portainer/portainer#12987
- portainer/portainer#12948

I wasn't able to recreate a dead container even by manipulating the files (the containers always get a new random name on restart), so I don't know what you can do to reproduce the issue. This is just a heads-up so you are aware of this situation, I'm not asking for a specific fix 😄
I recall there was an additional fix that was not merged yet; possibly it could be related to that.
- What I did
Stopping the Engine while a container with autoremove set is running may leave behind dead containers on disk. These containers aren't reclaimed on next start, appear as "dead" in `docker ps -a`, and can't be inspected or removed by the user.

This bug has existed for a long time but became user-visible with 9f5f4f5. Prior to that commit, containers with no rwlayer weren't added to the in-memory viewdb, so they weren't visible in `docker ps -a`. However, some dangling files would still live on disk (e.g. the folder in /var/lib/docker/containers, mount points, etc.).

The underlying issue is that when the daemon stops, it tries to stop all running containers and then closes the containerd client. This leaves a small window of time during which the Engine might receive 'task stop' events from containerd and trigger autoremove. If the containerd client is closed in parallel, the Engine is unable to complete the removal, leaving the container in the 'dead' state. In that case, the Engine logs the following error:

```
cannot remove container "bcbc98b4f5c2b072eb3c4ca673fa1c222d2a8af00bf58eae0f37085b9724ea46": Canceled: grpc: the client connection is closing: context canceled
```

Solving the underlying issue would require complex changes to the shutdown sequence. Moreover, the same issue could also happen if the daemon crashes while it is deleting a container. Thus, add a cleanup step on daemon startup to remove these dead containers.
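The cleanup step added on startup can be sketched roughly as follows. This is a hypothetical, simplified illustration (the `container` type and `cleanupDeadContainers` function are invented for this sketch; the real moby types and removal path are far richer):

```go
package main

import "fmt"

// container is a simplified, hypothetical stand-in for the daemon's
// container record.
type container struct {
	ID   string
	Dead bool
}

// cleanupDeadContainers sketches the idea of the fix: on daemon startup,
// walk the containers restored from disk and reclaim any left in the
// "dead" state. It returns the surviving containers and the removed IDs.
func cleanupDeadContainers(all []container) (kept []container, removed []string) {
	for _, c := range all {
		if c.Dead {
			// The real daemon would also delete on-disk state here
			// (the container's directory, mount points, etc.).
			removed = append(removed, c.ID)
			continue
		}
		kept = append(kept, c)
	}
	return kept, removed
}

func main() {
	kept, removed := cleanupDeadContainers([]container{
		{ID: "bcbc98b4f5c2", Dead: true},
		{ID: "a1b2c3d4e5f6"},
	})
	fmt.Println(len(kept), removed)
}
```

Running the cleanup at startup rather than fixing the shutdown ordering also covers the crash case mentioned above, since any removal interrupted mid-flight is retried on the next boot.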
- How to verify it
A new integration test has been added.
- Human readable description for the release notes