
AKS - dial tcp [::1]:80: connect: connection refused on all plans modifying the azurerm_kubernetes_cluster resource #1307

Closed
jamurtag opened this issue Jun 11, 2021 · 17 comments


@jamurtag

jamurtag commented Jun 11, 2021

Please see this comment for the explanation of the root cause.

Terraform Version, Provider Version and Kubernetes Version

Terraform version: 0.15, 1.0
Kubernetes provider version: 2.3.0
Kubernetes version: 1.19

Affected Resource(s)

  • all resources

Terraform Configuration Files

Our configuration is almost identical to your aks example code, so I tried using that and replicated the behaviour.

Note: I simulated this by modifying a workers_count variable in the aks-cluster directory; however, this variable isn't actually implemented in your code. Modify line 23 to be node_count = var.workers_count, then pass a new value in via the aks-cluster module in main.tf, as sketched below.
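A rough sketch of that change (the variable name, default, and module wiring are illustrative, not part of the published example):

# aks-cluster/variables.tf
variable "workers_count" {
  type    = number
  default = 1
}

# aks-cluster/main.tf, inside the azurerm_kubernetes_cluster resource:
#   default_node_pool {
#     name       = "default"
#     node_count = var.workers_count
#     ...
#   }

# main.tf, passing a value in through the module:
# module "aks-cluster" {
#   ...
#   workers_count = 3
# }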

Steps to Reproduce

  1. Update the aks-cluster module as directed above to support workers_count
  2. terraform apply
  3. Change the workers_count variable to any other value
  4. terraform plan

Expected Behavior

Terraform should display a plan showing the updated node pool count.

Actual Behavior

The following error is reported:

$ terraform plan
random_id.cluster_name: Refreshing state... [id=aE_9C3A]
module.aks-cluster.azurerm_resource_group.default: Refreshing state... [id=/subscriptions/ff83a9d2-8d6e-4c4a-8b34-641163f8c99f/resourceGroups/tf-k8s-684ffd0b70]
module.aks-cluster.azurerm_kubernetes_cluster.default: Refreshing state... [id=/subscriptions/ff83a9d2-8d6e-4c4a-8b34-641163f8c99f/resourcegroups/tf-k8s-684ffd0b70/providers/Microsoft.ContainerService/managedClusters/tf-k8s-684ffd0b70]
module.kubernetes-config.local_file.kubeconfig: Refreshing state... [id=1ca8ad3c1c7f4aff65e5eda0038b619788b0956a]
module.kubernetes-config.helm_release.nginx_ingress: Refreshing state... [id=nginx-ingress-controller]
module.kubernetes-config.kubernetes_namespace.test: Refreshing state... [id=test]
╷
│ Error: Get "http://localhost/api/v1/namespaces/test": dial tcp [::1]:80: connect: connection refused
│
│   with module.kubernetes-config.kubernetes_namespace.test,
│   on kubernetes-config/main.tf line 14, in resource "kubernetes_namespace" "test":
│   14: resource "kubernetes_namespace" "test" {
│
╵
╷
│ Error: Kubernetes cluster unreachable: invalid configuration: no configuration has been provided, try setting KUBERNETES_MASTER environment variable
│
│   with module.kubernetes-config.helm_release.nginx_ingress,
│   on kubernetes-config/main.tf line 59, in resource "helm_release" "nginx_ingress":
│   59: resource helm_release nginx_ingress {

The data source is clearly not passing back valid data, even though there is a dependency on the aks-cluster module.

@jamurtag jamurtag added the bug label Jun 11, 2021
@chakri-nelluri

I am hitting the same issue.

2021-06-11T16:51:39.576-0400 [DEBUG] plugin: using plugin: version=5
2021-06-11T16:51:39.640-0400 [INFO]  plugin.terraform-provider-kubernetes_v2.3.2_x5: 2021/06/11 16:51:39 [WARN] Invalid provider configuration was supplied. Provider operations likely to fail: invalid configuration: no configuration has been provided, try setting KUBERNETES_MASTER environment variable: timestamp=2021-06-11T16:51:39.640-0400
2021-06-11T16:51:39.640-0400 [INFO]  plugin.terraform-provider-kubernetes_v2.3.2_x5: 2021/06/11 16:51:39 [DEBUG] Enabling HTTP requests/responses tracing: timestamp=2021-06-11T16:51:39.640-0400

@thy143

thy143 commented Jun 28, 2021

We are also hitting this, and it's causing a bit of a headache.

@dak1n1
Contributor

dak1n1 commented Jul 2, 2021

Hi, I'm sorry to hear you all are struggling with this dependency issue. I've done extensive research in this area and come across similar scenarios. The cause has to do with passing an unknown value to a provider configuration block, which is not supported in Terraform core. To quote their docs:

You can use expressions in the values of these configuration arguments, but can only reference values that are known before the configuration is applied.

When you make a change to the underlying infrastructure, such as node count, you're passing an unknown value into the Kubernetes provider configuration block, since the full scope of the cluster infrastructure is not known until after the change has been applied to the AKS cluster. That's why Terraform is behaving as if it's not reading the cluster's data source properly.
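To make that concrete, here is a minimal sketch of the pattern this thread is hitting (attribute paths follow the azurerm provider; the exact wiring in the example repository may differ slightly):

# The cluster is created and modified in this same configuration...
data "azurerm_kubernetes_cluster" "default" {
  depends_on          = [module.aks-cluster]
  name                = local.cluster_name
  resource_group_name = local.cluster_name
}

# ...so while the cluster is being changed, everything read from the data
# source is unknown at plan time, and the provider block silently falls back
# to its defaults, which is why the errors point at localhost.
provider "kubernetes" {
  host                   = data.azurerm_kubernetes_cluster.default.kube_config.0.host
  client_certificate     = base64decode(data.azurerm_kubernetes_cluster.default.kube_config.0.client_certificate)
  client_key             = base64decode(data.azurerm_kubernetes_cluster.default.kube_config.0.client_key)
  cluster_ca_certificate = base64decode(data.azurerm_kubernetes_cluster.default.kube_config.0.cluster_ca_certificate)
}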

Although I did write the initial guide to show that it can be possible to work around some of these issues, as you've found from experience, there are many edge cases that make getting the Kubernetes provider to work alongside the underlying infrastructure an unreliable and unintuitive process. This is due to a long-standing limitation in Terraform that can't be fixed in any provider, but we do have plans to smooth out the bumps a little by adding better error messages up front, so that users don't run into this on subsequent applies.

I thought at first that I could list out every work-around to help users keep their preferred workflow of having the cluster in the same Terraform state as the Kubernetes resources. Most cases can be worked around using terraform state rm module.kubernetes-config or terraform apply -target=module.aks-cluster, but I think encouraging this kind of work-around will cause more headaches in the long run, as it puts the user in charge of figuring out when to use special one-off apply commands, rather than setting up Terraform to behave reliably and predictably from the start. Plus it can have unintended side-effects, like orphaning cloud resources.

That's why I have a new guide in progress here, which shows the most reliable method that we have so far: the cluster infrastructure needs to be kept in a state separate from the Kubernetes and Helm provider resources.

https://github.com/hashicorp/terraform-provider-kubernetes/tree/e058e225e621f06e393bcb6407e7737fd43817bd/_examples/aks
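Roughly, the layout in that approach looks like the sketch below (a paraphrase of the idea, not the exact contents of the linked guide; directory and variable names may differ): the cluster lives in one state, the Kubernetes and Helm resources in another that is applied afterwards, so the provider block is only ever fed values that are already known.

# aks-cluster/        state #1: the azurerm_kubernetes_cluster resource only
# kubernetes-config/  state #2: kubernetes_* and helm_* resources, applied after #1

# kubernetes-config/main.tf
data "azurerm_kubernetes_cluster" "default" {
  name                = var.cluster_name
  resource_group_name = var.cluster_name
}

# The cluster already exists when this configuration is planned, so these
# values are always known and the provider never falls back to localhost.
provider "kubernetes" {
  host                   = data.azurerm_kubernetes_cluster.default.kube_config.0.host
  client_certificate     = base64decode(data.azurerm_kubernetes_cluster.default.kube_config.0.client_certificate)
  client_key             = base64decode(data.azurerm_kubernetes_cluster.default.kube_config.0.client_key)
  cluster_ca_certificate = base64decode(data.azurerm_kubernetes_cluster.default.kube_config.0.cluster_ca_certificate)
}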

I know this is inconvenient, which is why we continue to try and accommodate users in single-apply scenarios, and scenarios which contain the Kubernetes and cluster resources in the same Terraform state. However, until upstream Terraform can add support for this, the single-apply workflow will remain buggy and less reliable than separating cluster infrastructure from Kubernetes resources.

@pmnathan

pmnathan commented Sep 21, 2021

I have been testing using an aliased azurerm provider for the data source query, and this seems to be a viable workaround for the issue. More testing is obviously required. Starting from Steph's code example (https://github.com/hashicorp/terraform-provider-kubernetes/blob/main/_examples/aks/main.tf), the change will look like:

provider "azurerm" {
  alias = "azurerm_k8s"
  features {}
}

data "azurerm_kubernetes_cluster" "default" {
  provider            = azurerm.azurerm_k8s
  depends_on          = [module.aks-cluster] # refresh cluster state before reading
  name                = local.cluster_name
  resource_group_name = local.cluster_name
}

Looking forward to the community's feedback. Thank you.

@stevehipwell
Contributor

@dak1n1 this error message is unintuitive, as it doesn't explain why the error is occurring, and it leads to significant time lost tracking down the cause. Furthermore, if the error here is accurate, it means that Terraform has attempted to connect to a cluster on localhost, which could have unintended consequences if such a cluster actually exists.

@AlexGoris-KasparSolutions

I tried the aliased azurerm provider workaround from @pmnathan's comment above today, and sadly in my situation it did not help.

@BoHuang2018

I encountered the same problem on GKE. @dak1n1, running terraform state rm module.my-gke-module, then terraform plan -target=module.my-gke-module and terraform apply -target=module.my-gke-module, helped to fix my problem. Thanks. I will try your new guide with GKE.

@sgutwein

I also adapted the node count; the first apply went through without problems. Now I'm facing the same error message on every plan / apply:

╷
│ Error: Get "http://localhost/api/v1/namespaces/external-secrets": dial tcp 127.0.0.1:80: connect: connection refused
│
│   with kubernetes_namespace.namespaces["external-secrets"],
│   on main.tf line 109, in resource "kubernetes_namespace" "namespaces":
│   109: resource "kubernetes_namespace" "namespaces" {

@BoHuang2018

Hi @sgutwein, I used @dak1n1's methods to solve my error on GKE successfully. I guess they would solve your error too.

@sgutwein

@BoHuang2018 It was a lot of pain, but that fixed it. Thanks.

@favoretti
Contributor

Running terraform apply -refresh=false will just silently do the right thing, BTW, as it won't try to refresh the current state. (And since this generally only happens when the cluster gets recreated, skipping the refresh isn't as bad as it sounds.)

@akepjio

akepjio commented Dec 21, 2021

Thanks @favoretti for your answer. It works for me.

@josephcaxton

@dak1n1 thanks for that explanation. It really helped.

@xelhark

xelhark commented Jan 20, 2022

A better fix for GKE that worked for me was to let gcloud handle the connection config in ~/.kube/config and feed the kubeconfig file to the provider manually:

gcloud container clusters get-credentials <my_cluster> --region=<my_region>

provider "kubernetes" {
  config_path = "~/.kube/config"
}

@jrhouston
Collaborator

As @dak1n1 explained above, this is due to progressive apply. Please open a new issue with your complete configuration if you feel you are encountering a different issue than the one originally reported here.

@stevehipwell
Contributor

@jrhouston progressive apply isn't a solution here; it isn't truly IaC if you need to use manual steps. This issue shouldn't be closed without a documented solution that isn't "buy more workspaces and/or go back to manual ops".

AFAIK, from experience and from discussions with the engineers working on Terraform, this problem should be resolvable by using the exec plugin to authenticate with Kubernetes. I can also add that there are some other considerations when configuring the provider, which basically boil down to not using a cluster data source for lookups in a workspace where the cluster resource is being created. But fundamentally, Kubernetes only works in Terraform with the exec plugin or with a very convoluted set of workspaces and/or manual steps.
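For reference, a sketch of what exec-based authentication can look like for an Azure AD-enabled AKS cluster, assuming the kubelogin CLI is installed (this is an illustration of the approach, not a configuration taken from this repository); the connection details come straight from the cluster resource rather than a data source, and credentials are only fetched at apply time:

provider "kubernetes" {
  host                   = azurerm_kubernetes_cluster.default.kube_config.0.host
  cluster_ca_certificate = base64decode(azurerm_kubernetes_cluster.default.kube_config.0.cluster_ca_certificate)

  # Credentials come from an external command at apply time instead of being
  # baked into the provider configuration.
  exec {
    api_version = "client.authentication.k8s.io/v1beta1"
    command     = "kubelogin"
    args = [
      "get-token",
      "--login", "azurecli",
      "--server-id", "6dae42f8-4368-4678-94ff-3960e28e3630", # well-known AKS AAD server application ID
    ]
  }
}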

@github-actions

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.
If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Mar 31, 2022