Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Develop branch hung when using replica > 2 #976

Open
zye1996 opened this issue Sep 13, 2024 · 3 comments · Fixed by #982
Open

[BUG] Develop branch hung when using replica > 2 #976

zye1996 opened this issue Sep 13, 2024 · 3 comments · Fixed by #982
Assignees
Labels
bug Something isn't working
Milestone

Comments

@zye1996
Copy link

zye1996 commented Sep 13, 2024

Describe the bug
I installed latest develop branch and run basic generation with StepResources(replica=8) and the generation hung. It does not happen to tags 88615c7

To Reproduce
Code to reproduce

Expected behaviour
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Desktop (please complete the following information):

  • Package version:
  • Python version:

Additional context
Add any other context about the problem here.

@gabrielmbmb
Copy link
Member

Hi @zye1996, thanks for reporting the issue. I'll check!

@gabrielmbmb gabrielmbmb self-assigned this Sep 16, 2024
@gabrielmbmb gabrielmbmb added the bug Something isn't working label Sep 16, 2024
@gabrielmbmb gabrielmbmb added this to the 1.4.0 milestone Sep 16, 2024
@gabrielmbmb
Copy link
Member

Hi again @zye1996! I confirmed that it gets hung. Just to confirm, at which moment of the execution of the pipeline do you see it gets hung? do you see that some replicas of the step finish?

@gabrielmbmb
Copy link
Member

we just merged a PR to develop that fixed one issue that made the pipeline hung just at the end of the execution because not all steps replicas were being unloaded. If you can try and confirm that your issue has been fixed would be awesome!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants