Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate Batch to use local vnet, service principal, Entra ID auth etc. #1514

Open
tamuri opened this issue Nov 13, 2024 · 2 comments · May be fixed by #1517
Open

Migrate Batch to use local vnet, service principal, Entra ID auth etc. #1514

tamuri opened this issue Nov 13, 2024 · 2 comments · May be fixed by #1517
Assignees
Labels
azure-batch bug Something isn't working

Comments

@tamuri
Copy link
Collaborator

tamuri commented Nov 13, 2024

Azure Batch isn't able to size pools we need because errors when assigning IP addresses and limits of number of ports on the existing subnet. We're currently using Batch's classic networking and sharedkey auth but (I think) it's not suitable when we have many large pools. Mostly from reading the network troubleshooting.

My plan is:

  • Switch pool communication mode to use simplified communication mode
  • Create a new virtual network in tlo-b1-rg resource group, get the subnet id and put in pool specification (and disable public IP address configuration?? won't be able to connect to machines, but simpler config)
    • Check that it can still access container registry and storage account
  • For Batch account tlob1ba to use virtual network subnet, need to switch from SharedKey auth to using Entra ID
    • Register new application (for unmanaged Batch control) with Entra
    • Setup a service principal
    • Set up the RBAC for the service principal for (at least) Batch account and VN (not sure about storage account?)
      • Didn't find a role to attach to network only - found contributor (too broad) and manage only (without connect).

Check that all works, then

  • Put secrets in the key vault, check access again
  • Merge into master
  • Done!

cc @BinglingICL - sorry, a little bit of work to do for your runs

@tamuri tamuri added azure-batch bug Something isn't working labels Nov 13, 2024
@tamuri tamuri self-assigned this Nov 13, 2024
@tamuri
Copy link
Collaborator Author

tamuri commented Nov 14, 2024

Azure Batch isn't able to size pools we need because errors when assigning IP addresses

The error:

Image

@BinglingICL
Copy link
Collaborator

Thanks Asif. It is great that issue and solution are found.

@tamuri tamuri linked a pull request Nov 18, 2024 that will close this issue
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
azure-batch bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants