[BACKPORT][v1.3.3][BUG] Duplicate channel close error in the backing image manage related components #4866

Closed
github-actions bot opened this issue Nov 14, 2022 · 4 comments
Assignees
Labels
area/backing-image Backing image related component/longhorn-manager Longhorn manager (control plane) kind/backport Backport request kind/bug priority/0 Must be implement or fixed in this release (managed by PO) reproduce/always 100% reproducible severity/1 Function broken (a critical incident with very high impact (ex: data corruption, failed upgrade)
Milestone

Comments

@github-actions

backport #4865

@github-actions github-actions bot added component/longhorn-manager Longhorn manager (control plane) area/backing-image Backing image related kind/backport Backport request kind/bug priority/0 Must be implement or fixed in this release (managed by PO) reproduce/always 100% reproducible severity/1 Function broken (a critical incident with very high impact (ex: data corruption, failed upgrade) labels Nov 14, 2022
@github-actions github-actions bot added this to the v1.3.3 milestone Nov 14, 2022
@innobead innobead assigned weizhe0422 and unassigned shuo-wu Dec 7, 2022
@longhorn-io-github-bot

longhorn-io-github-bot commented Dec 8, 2022

Pre Ready-For-Testing Checklist

  • Where are the reproduce steps/test steps documented?
    The reproduce steps/test steps are:
  1. Launch a large backing image and check that the BackingImageDataSource pod is launched.
    • Create a large volume, then create a backing image from it via Export from a Longhorn volume.
  2. Block all ingress and egress traffic to the pod by applying the following network policy, then check that the progress is paused after a few seconds.
    kind: NetworkPolicy
    apiVersion: networking.k8s.io/v1
    metadata:
      name: deny-pod-all-packages
      namespace: longhorn-system
    spec:
      podSelector:
        matchLabels:
          longhorn.io/backing-image-data-source: <your-backing-image-name>
          longhorn.io/component: backing-image-data-source
      policyTypes:
      - Ingress
      - Egress
  3. Check the logs of the longhorn-manager pod. They should not show the error panic: close of closed channel. After monitoring stops because the manager cannot connect to the data source pod, the next round of monitoring should start.
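For context on step 3: `panic: close of closed channel` is the Go runtime error raised when `close` is called twice on the same channel. The following is a minimal sketch of one common guard against it, using `sync.Once`. The `monitor` type and its methods are hypothetical and are not taken from the longhorn-manager source; the actual fix in the linked issue may differ.

```go
package main

import (
	"fmt"
	"sync"
)

// monitor is a hypothetical stand-in for a data source monitor.
// It is NOT the longhorn-manager implementation.
type monitor struct {
	stopCh   chan struct{}
	stopOnce sync.Once
}

func newMonitor() *monitor {
	return &monitor{stopCh: make(chan struct{})}
}

// Stop closes stopCh at most once, so repeated or concurrent
// callers cannot trigger "close of closed channel".
func (m *monitor) Stop() {
	m.stopOnce.Do(func() { close(m.stopCh) })
}

// Stopped reports whether Stop has been called.
func (m *monitor) Stopped() bool {
	select {
	case <-m.stopCh:
		return true
	default:
		return false
	}
}

func main() {
	m := newMonitor()

	// Several goroutines race to stop the same monitor.
	var wg sync.WaitGroup
	for i := 0; i < 3; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			m.Stop()
		}()
	}
	wg.Wait()
	fmt.Println("stopped:", m.Stopped())

	// A later round of monitoring gets a fresh monitor and channel.
	m = newMonitor()
	fmt.Println("new round stopped:", m.Stopped())
}
```

Creating a fresh monitor for each round, rather than reusing a closed channel, is what lets the next round of monitoring start cleanly in this sketch.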

@innobead
Member

innobead commented Dec 8, 2022

@weizhe0422 Is the verification step wrong? It should be as below:

Check the log of the longhorn-manager pod. It should "not" show the error panic: close of closed channel. After monitoring stops due to failure to connect to the data source pod, the next round of monitoring should continue.

@weizhe0422
Contributor

@innobead It was my mistake to miss the word "not". I will pay more attention next time. Thank you.

@chriscchien
Contributor

Verified in Longhorn v1.3.x b2c607 with the test steps.
Result: Pass

  1. After the backing image creation progress was paused by the new network policy, a new round of monitoring was observed to start in the longhorn-manager pod, and close of closed channel was not found in the manager log.
  2. After deleting the network policy, a volume was created from the backing image. The volume was created successfully and could be attached to a node.
time="2022-12-09T05:43:37Z" level=info msg="Stopped monitoring" backingImageDataSource=vol2 controller=longhorn-backing-image-data-source node=ip-172-31-31-137
time="2022-12-09T05:43:44Z" level=info msg="Start monitoring" backingImageDataSource=vol2 controller=longhorn-backing-image-data-source diskUUID=6fd1c78c-6b4c-4fed-8fee-21cd772f87d3 node=ip-172-31-31-137 nodeID=ip-172-31-31-137 parameters="map[export-type:raw volume-name:vol1]" sourceType=export-from-volume
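
The log check described above can also be scripted. The following is a rough sketch, assuming the manager log has already been collected into a slice of lines; the `checkManagerLog` helper is hypothetical, and the sample lines are abridged from the log excerpt above.

```go
package main

import (
	"fmt"
	"strings"
)

// checkManagerLog is a hypothetical helper. It reports whether the
// duplicate-close panic appears in the log, and whether a
// "Start monitoring" line follows a "Stopped monitoring" line.
func checkManagerLog(lines []string) (panicFound, restarted bool) {
	stopped := false
	for _, line := range lines {
		switch {
		case strings.Contains(line, "close of closed channel"):
			panicFound = true
		case strings.Contains(line, `msg="Stopped monitoring"`):
			stopped = true
		case stopped && strings.Contains(line, `msg="Start monitoring"`):
			restarted = true
		}
	}
	return panicFound, restarted
}

func main() {
	// Abridged versions of the two log lines quoted in this comment.
	lines := []string{
		`time="2022-12-09T05:43:37Z" level=info msg="Stopped monitoring" backingImageDataSource=vol2`,
		`time="2022-12-09T05:43:44Z" level=info msg="Start monitoring" backingImageDataSource=vol2`,
	}
	panicFound, restarted := checkManagerLog(lines)
	fmt.Printf("panicFound=%v restarted=%v\n", panicFound, restarted)
}
```

A passing verification corresponds to `panicFound` being false and `restarted` being true.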
