Deployment of Azure Container Instance Failed, where did I go wrong?

Rahul Vijan 0 Reputation points Student Ambassador
2023-03-23T20:24:20.3866667+00:00
{
    "status": "Failed",
    "error": {
        "message": "Pulling image \"waleedka/modern-deep-learning\";Successfully pulled image \"waleedka/modern-deep-learning\" in 2m20.224069124s;Created container test;Started container test;Subscription deployment didn't reach a successful provisioning state after '00:30:00'."
    }
}

Container Specs were 2cores (CPU), 1 K80 GPU, 8GB RAM, EAST US, QUOTA AVAILABILTIY MET
Docker Image: waleedka/modern-deep-learning

I wish to train a model using a container with a GPU, as my student subscription doesn't allow provisioning any GPU (N family) Vms.

Azure Container Instances
Azure Container Instances
An Azure service that provides customers with a serverless container experience.
670 questions
{count} votes

3 answers

Sort by: Most helpful
  1. Eddie Neto 1,226 Reputation points Microsoft Employee
    2023-03-24T11:57:03.1233333+00:00

    Hi @Rahul Vijan

    Thanks for reaching M&A

    I´ve performed the same test as you and mine got succussed the only different that I did was to add the the command override "/bin/bash", "-c", "echo hello; sleep 100000"

    Hope this helps. Please "Accept as Answer" if it helped, so that it can help others in the community looking for help on similar topics.

    User's image

    User's image


  2. deherman-MSFT 35,011 Reputation points Microsoft Employee
    2023-04-05T16:43:58.5433333+00:00

    @Rahul Vijan

    This can occur if the container name is being reused. Please try deploying with a different container name to see if it resolves your issue. If you are still facing the same issue please email the following to AzCommunity@microsoft.com and we'll get back to you promptly:

    • Subject: "Attn: deherman - "

    • Email body: Your Subscription ID

    • Email body: A link to this thread so we can validate and expedite the request

    If you don't receive a response within 24 hours, please reply to the thread so we can investigate.


    Please don’t forget to "Accept the answer" and “up-vote” wherever the information provided helps you, this can be beneficial to other community members.

    0 comments No comments

  3. Gurav, Shankar 0 Reputation points
    2024-02-13T12:24:18.0866667+00:00

    We faced same problem for deploying containers in private network, after recreating container with new name did not resolved issue. Microsoft support investigated and found that all the remaining IPs in our subnet are currently held, possibly due to known problem with the ACI platform where there was an issue with the leaking Ips. This issue has been fixed for all the newer clusters and new subnets. However, there are some edge cases for some older subnet where already leaked Ips will cause similar behavior.

    So redeploying containers on new subnet resolved issue.

    0 comments No comments