A 502 Bad Gateway error means the server acting as a gateway or proxy received an invalid response — or no response at all — from the upstream server it was trying to reach. The proxy is up and running; the problem is what lies behind it.
For developers, a 502 is a mid-tier headache: it is not a client mistake (that would be a 4xx), and it is not a catch-all server crash (that would be a 500). It tells you something specific: the gateway could not talk to the backend. That narrows down where to look.
What Is a 502 Bad Gateway Error?
The HTTP 502 status code is defined in RFC 9110 (Section 15.6.3) as follows: "The 502 (Bad Gateway) status code indicates that the server, while acting as a gateway or proxy, received an invalid response from an inbound server it accessed while attempting to fulfill the request."
In plain terms: your Nginx, Apache, Cloudflare, or load balancer tried to forward the request to your application server, got back garbage (or nothing), and returned 502 to the client.
The "gateway" in the error is always the middleman — not your backend application. The backend is what actually failed.
How HTTP Proxies and Gateways Work
To understand 502, you need a clear picture of the request chain: client → gateway/proxy (Nginx, a CDN, a load balancer) → upstream application server.
The proxy's job is to forward the request and relay the response. If the upstream server refuses the connection, closes it mid-response, returns a malformed or incomplete response, or never responds at all, the proxy has nothing valid to relay. It returns 502.
Common "gateways" in production setups: Nginx, Apache with mod_proxy, AWS ALB/NLB, Cloudflare, Fastly, HAProxy, Traefik.
Common Causes of a 502 Bad Gateway
1. Upstream server is down
The application server (Node.js, Django, Rails, PHP-FPM, etc.) has crashed or was never started. The proxy cannot connect to the configured port.
2. Upstream server timeout
The backend is running but taking too long to respond — longer than the proxy's proxy_read_timeout or equivalent. The proxy gives up and returns 502.
3. Misconfigured proxy
The proxy is pointing at the wrong host, port, or socket path. A typo in proxy_pass or ProxyPass will reliably produce 502s.
4. Load balancer: all backends unhealthy
When all nodes behind a load balancer fail their health checks simultaneously, the load balancer has no healthy target to route to. Result: 502.
5. DNS resolution failure
If the proxy resolves the upstream hostname at startup (Nginx does this by default) and the hostname changes or becomes unreachable, the proxy may cache the stale address and fail to connect.
6. TLS handshake failure between proxy and upstream
When the proxy connects to the upstream over HTTPS (common in microservices), a certificate mismatch or expired cert on the upstream side causes the handshake to fail, which the proxy reports as 502. Use the Authgear SSL Checker to verify your upstream certificates.
7. Resource exhaustion on the upstream
The upstream server is alive but has no available workers, threads, or file descriptors left. Connections are accepted but hang, triggering proxy timeouts.
How to Diagnose a 502 Bad Gateway
Work through these steps in order. Each one narrows the problem.
Step 1: Check the error in your browser
Open DevTools (F12) → Network tab. Reload the failing request. Look at the status code (confirm it really is 502), the response headers (the Server header often reveals which layer returned the error), and the response body (proxy error pages tend to have a recognisable format).
Step 2: Reproduce with curl
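A minimal reproduction from the command line, assuming the failing URL is https://example.com/api/users (substitute your own):

```shell
# Reproduce the 502 outside the browser, with full protocol detail
curl -v https://example.com/api/users
```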
-v shows the full request/response exchange. A 502 here confirms it is not browser-specific. If you get "connection refused" instead, the proxy itself may be down — a different problem.
Test the upstream directly (bypassing the proxy):
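Assuming the application listens on 127.0.0.1:3000 (adjust host and port to your configuration):

```shell
# Hit the application server directly, skipping the proxy entirely
curl -v http://127.0.0.1:3000/api/users
```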
If this succeeds but the proxied request fails, the problem is in the proxy layer.
Step 3: Check upstream process status
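These commands assume a systemd-managed service named myapp listening on port 3000 — both are placeholders for your actual service name and port:

```shell
# Is the service running, or has it crashed?
sudo systemctl status myapp

# Is anything actually listening on the expected port?
sudo ss -ltnp | grep :3000
```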
Step 4: Check proxy error logs
Nginx:
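Assuming the default log location:

```shell
# Follow the Nginx error log while reproducing the failing request
sudo tail -f /var/log/nginx/error.log
```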
Look for lines like:
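Typical Nginx 502 entries look roughly like this (timestamps, PIDs, and addresses will differ):

```
connect() failed (111: Connection refused) while connecting to upstream
upstream prematurely closed connection while reading response header from upstream
upstream timed out (110: Connection timed out) while reading response header from upstream
```

Each of these points at a different cause: refused means nothing is listening, prematurely closed means the backend crashed mid-request, and timed out means the backend is too slow.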
Apache:
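Assuming a Debian/Ubuntu layout (on RHEL-family systems the path is /var/log/httpd/error_log):

```shell
# Follow the Apache error log while reproducing the failing request
sudo tail -f /var/log/apache2/error.log
```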
Step 5: Test DNS resolution
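Assuming the upstream hostname is backend.internal (a placeholder), run this from the proxy host:

```shell
# Resolve the upstream hostname as the proxy would
dig +short backend.internal

# Fallback if dig is not installed
getent hosts backend.internal
```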
Compare the resolved IP against what you expect. If the proxy caches a stale IP, you need to either restart the proxy or configure it to re-resolve periodically (see fixes below).
Step 6: Check load balancer health
In AWS Console: EC2 → Load Balancers → your ALB → Target Groups → view target health status. Unhealthy targets with a 502 or "connection error" reason confirm the issue is at the backend.
How to Fix a 502 Bad Gateway
Fix: Nginx upstream not running or wrong port
Confirm the upstream is running on the configured port:
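Assuming port 3000 is the configured upstream port (a placeholder):

```shell
# List listening TCP sockets and filter for the expected port
sudo ss -ltnp | grep :3000
```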
If nothing is listening, start your application. If it is on a different port, fix your Nginx config:
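A minimal sketch of the relevant server block — the proxy_pass address must match where the application actually binds:

```nginx
server {
    listen 80;
    server_name example.com;

    location / {
        # Must match the host:port your application listens on
        proxy_pass http://127.0.0.1:3000;
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
    }
}
```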
Reload after changes:
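Always validate before reloading so a syntax error does not take the proxy down:

```shell
# Test the config; reload only if the test passes
sudo nginx -t && sudo systemctl reload nginx
```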
Fix: Nginx upstream timeout
If your backend is slow (long database queries, heavy computation), increase the timeout:
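The values below are illustrative — tune them to the slowest legitimate request your backend serves:

```nginx
location / {
    proxy_pass http://127.0.0.1:3000;

    # How long to wait for the upstream to accept the connection
    proxy_connect_timeout 10s;
    # How long to wait between successive reads of the upstream response
    proxy_read_timeout 120s;
    # How long to wait between successive writes of the request to the upstream
    proxy_send_timeout 120s;
}
```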
Do not raise this indefinitely — it hides slow backend bugs. The better fix is to profile and optimise the slow endpoint, or move it to an async job.
Fix: Apache ProxyPass misconfiguration
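A minimal VirtualHost sketch, assuming the backend listens on 127.0.0.1:3000 (both directives must point at the real backend address):

```apache
<VirtualHost *:80>
    ServerName example.com

    # Forward requests to the backend and rewrite response headers back
    ProxyPass        / http://127.0.0.1:3000/
    ProxyPassReverse / http://127.0.0.1:3000/
</VirtualHost>
```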
Enable the required modules if not already active:
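On Debian/Ubuntu:

```shell
# Enable mod_proxy plus HTTP backend support, then restart
sudo a2enmod proxy proxy_http
sudo systemctl restart apache2
```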
Fix: Node.js / Express behind a proxy
If your Express app is behind Nginx, make sure it is binding on the correct interface and port:
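A minimal sketch, assuming Express and port 3000 (adjust to your setup):

```javascript
const express = require('express');
const app = express();

app.get('/', (req, res) => res.send('ok'));

// Bind to loopback only: the reverse proxy connects locally,
// so the app port is never exposed to the outside world.
app.listen(3000, '127.0.0.1', () => {
  console.log('listening on 127.0.0.1:3000');
});
```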
Binding to 127.0.0.1 (loopback only) is correct for a proxied setup. Binding to 0.0.0.0 exposes the app port publicly, which is a security risk.
A common cause of 502 with PM2 is the app crashing on startup silently. Check:
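```shell
# Recent stdout/stderr from PM2-managed apps — startup crashes show up here
pm2 logs --lines 100

# Process list; a high restart count means the app is crash-looping
pm2 status
```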
Fix: Cloudflare 502
Cloudflare returns a 502 when it cannot connect to your origin or receives an invalid response from it. Check that the origin server is up and reachable directly, that your firewall allows Cloudflare's published IP ranges, and that your SSL/TLS encryption mode (Flexible, Full, or Full (strict)) matches how the origin is actually configured.
Fix: Load balancer health checks (AWS ALB)
If targets are showing as unhealthy in your ALB target groups:
1. Check the health check path returns 200. A health endpoint that itself errors causes all targets to be marked unhealthy.
2. Verify the health check port and protocol match what your app actually serves.
3. Check security group rules — the ALB must be allowed to reach the target on the health check port.
Example Terraform snippet to configure correct health check settings:
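This is a sketch; the resource name, port, and thresholds are placeholders to adapt:

```hcl
resource "aws_lb_target_group" "app" {
  name     = "app-tg"
  port     = 3000
  protocol = "HTTP"
  vpc_id   = var.vpc_id

  health_check {
    path                = "/health"      # must return 200 from the app
    port                = "traffic-port" # check the same port traffic uses
    protocol            = "HTTP"
    matcher             = "200"
    interval            = 15
    timeout             = 5
    healthy_threshold   = 2
    unhealthy_threshold = 3
  }
}
```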
Fix: Nginx DNS caching stale upstream
Nginx resolves upstream hostnames once at startup. If the upstream IP changes (common in container environments), Nginx will keep connecting to the old IP.
Fix: use a resolver and set the upstream as a variable so Nginx re-resolves it dynamically:
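A sketch of the pattern — the resolver address (10.0.0.2 here) and hostname are placeholders for your VPC or container DNS:

```nginx
location / {
    # Re-resolve the upstream hostname every 30 seconds instead of
    # caching the IP for the lifetime of the worker process
    resolver 10.0.0.2 valid=30s;

    # Using a variable forces Nginx to resolve at request time
    set $upstream http://backend.internal:3000;
    proxy_pass $upstream;
}
```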
502 vs 503 vs 504: Quick Comparison
The key distinction between 502 and 504: both involve a proxy and an upstream failure. 502 means the upstream sent back something invalid or refused the connection; 504 means the upstream was reachable but took too long to respond. A 503, by contrast, is a deliberate signal that the server is temporarily unable to handle requests (typically overload or maintenance), not a failed proxy-to-upstream exchange.
Prevention Best Practices
1. Implement a health endpoint. Every service should expose a GET /health that returns 200 when the service is ready. Use this for load balancer checks and readiness probes in Kubernetes.
2. Set explicit proxy timeouts. The default values in Nginx and Apache can be too long or too short for your use case. Set proxy_connect_timeout, proxy_read_timeout, and proxy_send_timeout explicitly.
3. Monitor upstream availability. Do not wait for users to report 502s. Use uptime monitoring (Datadog, Better Uptime, etc.) to alert you when upstream health checks fail.
4. Configure automatic restarts. Use systemd's Restart=always or PM2's watch mode so your application restarts automatically after a crash.
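A minimal systemd unit sketch — service name, user, and paths are placeholders:

```ini
[Unit]
Description=My application
After=network.target

[Service]
ExecStart=/usr/bin/node /srv/myapp/server.js
User=myapp
# Restart automatically on crash, with a short back-off
Restart=always
RestartSec=5

[Install]
WantedBy=multi-user.target
```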
5. Use circuit breakers. In microservice architectures, a circuit breaker (e.g., via Resilience4j or similar) stops requests from piling up against a failing upstream and returns a controlled fallback rather than cascading 502s.
6. Keep TLS certificates valid. Certificate expiry on the upstream side causes TLS handshake failures that manifest as 502s from the proxy. Set calendar reminders or use automated renewal (Let's Encrypt / cert-manager). Check certificate status at any time with the Authgear SSL Checker.
502 and Authentication Flows
If your authentication service sits behind a reverse proxy — as is common when using a platform like Authgear — a 502 from the proxy will completely block the login flow. Users will see a generic error page instead of the sign-in screen, and OAuth redirect flows will break silently.
When troubleshooting 502s in a production app, first identify which layer is returning the error (your own proxy, the CDN, or the identity provider's gateway), then test the authentication endpoints directly with curl, following the same diagnostic steps outlined above.
FAQ
Can a 502 error be caused by the client?
No. A 502 is a server-side error. The client sent a valid request; the problem is in the server infrastructure. That said, clients can trigger slow backend behaviour (e.g., a large file upload that times out the proxy), so the client's action may be the trigger — but the fix is always on the server side.
Why do I see 502 errors only under heavy load?
This usually means the upstream is running out of capacity: no available workers, connection pool exhausted, or CPU/memory bottleneck. The upstream server is alive under normal load but overwhelmed at peak. Profile your application, scale horizontally, or implement request queuing.
Does Cloudflare cache 502 errors?
By default, Cloudflare does not cache 5xx responses. But if you have a custom page rule or Cache Rule that overrides this, it is possible. Check your Cloudflare caching configuration. Also note: if your origin returns a 502 briefly, users hitting cached pages (CDN edge cache) may not see it — but uncached endpoints will.
How do I tell if the 502 is from Nginx or from my application?
Check the Server response header. Nginx returns Server: nginx. The format of the HTML error page also differs — Nginx 502 pages have a distinctive plain style. If your app is behind multiple proxies, use curl -v and look at all the response headers to identify which layer is returning the error. You can also add a custom X-Proxy-ID header in your Nginx config to make this unambiguous.
Summary
A 502 Bad Gateway almost always means one of three things: the upstream server is not running, the proxy cannot reach it (wrong port, stale DNS, firewall), or the upstream is too slow. Check your upstream process status first, then your proxy error logs, then work outward from there. The curl commands and log snippets above will get you to the root cause quickly in most cases.