Questions tagged [high-availability]
High availability is an architectural consideration often involving degrees of redundancy to insure availability in case of system or component failure.
986 questions
Score of 0
0 answers
11 views
Step-CA (Smallstep) with 6 intermediate CAs, Badger/CNPG and passthrough HAProxy — is this architecture correct for production?
Context
I'm operating a PKI based on step-ca (Smallstep), version 0.30.2, to issue short-lived certificates:
SSH (user and host)
TLS/X.509 (internal services)
The CA sits behind its own ...
Score of 0
0 answers
59 views
App Engine Flexible automatic weekly VM rotation causes downtime: instance terminated before replacement is healthy
We run an application on Google App Engine Flexible Environment with two services:
Service default: machine e2-custom-4-8704, min_num_instances: 1, disk: pd-standard
Service aux: machine e2-custom-2-...
Score of 0
0 answers
44 views
Grafana on Kubernetes - Notification duplicate in a HA setup
I've set up Grafana by deploying the official helm chart with ArgoCD. I have 3 grafana pods running. In order to achieve HA and to avoid having duplicate notifications, I set up the unified_alerting ...
Score of 2
2 answers
279 views
Kubernetes for VMs
I want to setup a Kubernetes Cluster, for testing and later for prod purpose. but I am stuck with the concept of VMs running in Kubernetes.
First I want to mention that I don't have a shared storage, ...
Score of 1
0 answers
219 views
Nginx: Need help figuring out slowdown or possible bot attack
I've been noticing a slowdown over the last week. There doesn't seem to be a ton of traffic on Google Analytics, but there seems to be a ton of unique IP addresses hitting the site. I have about 20 ...
Score of 2
0 answers
133 views
High Availability Linux SMB server in AWS
Background:
After migrating from a VMware/local to AWS I've seen a increase in expected storage capability. Our Windows Fileserver with EBS disks, was basically a lift & shift move, and has since ...
Score of 0
1 answer
160 views
Same DNS name for multiple application servers with replication or load balancers or making a service highly available?
tw5@renegade:~$ dig @192.168.57.1 tw5.helpdesk.com
; <<>> DiG 9.18.30-0ubuntu0.24.04.2-Ubuntu <<>> @192.168.57.1 tw5.helpdesk.com
; (1 server found)
;; global options: +cmd
;; ...
Score of 0
1 answer
158 views
RRP(Redundant Ring Protocol) Not Working Properly in Pacemaker
I am currently planning to configure an Active-Standby HA Cluster using two nodes. Each node has two NICs, which are added as 10.0.2.0 (ring0) and 192.168.0.0 (ring1). For testing purposes, I have ...
Score of 0
1 answer
1541 views
Keepalived NAT with firewalld: STATE_INVALID_DROP blocking return packets from real servers
I've set up Keepalived as a load balancer but am not using a Master/Standby configuration—just a single server.
The issue arises when firewalld is enabled. Without firewalld, everything works fine. ...
Score of 0
2 answers
234 views
Will deploying by changing symlink affect currently running requests?
It's a popular strategy for the web server to reference the current build via symlink, and to switch the symlink to deploy a new build
/var/www/current -> /var/www/build-1
Is this a problem for ...
Score of 5
4 answers
1851 views
Two node High Availability clusters
It's a common knowledge that HA involves redundant nodes, but for the quorum it needs odd number of nodes, to have a tie breaker in case of cluster partition. That means, quorum-based cluster has to ...
Score of 1
0 answers
162 views
Transitioning Standalone Ubuntu Desktops to Centralized Management in a College Lab: Best Practices and Pitfalls [closed]
Our college lab consists of 10 Ubuntu desktops, each currently set up as standalone machines on the LAN. I want to transition them to a centrally managed environment while avoiding common pitfalls. I ...
Score of 0
1 answer
971 views
Linux keepalived vrrp_track_process does not work as expected
keepalived conf on node01:
vrrp_track_process track_haproxy {
process haproxy
weight 10
}
vrrp_instance VI_1 {
state MASTER
interface ens33
virtual_router_id 51
...
Score of 1
1 answer
1674 views
When to use Thanos/Cortex over Vanilla Prometheus?
Generally when I look around or try to understand the HA setup for Prometheus the most common search results are Cortex & Thanos. I've also seen M3 and Victoriametrics on the list.
What I ...
Score of 0
1 answer
611 views
Hashicorp Vault transit auto unseal cluster is not receiving requests from transit seal configuration in main cluster (or requests are not being sent)
I am trying to set up auto-unseal for hashicorp vault using the transit secrets engine and two HA clusters. Cluster A is responsible for unsealing cluster B. My issue is that Cluster B (the main ...