Proxmox cluster Setup

This gist is part of this series.

Network Design

Put simply, I am not sure what the design should be. I have the thunderbolt mesh network and the 2.5GbE NIC on each node. The ideal design guidelines cause my brain to have a race condition because:

  1. ceph should have a dedicated network
  2. proxmox migration traffic and cluster communications should not share a network
  3. one wants the cluster communications network to be redundant

I have 3 networks:

  1. Onboard 2.5gb NIC connected to one switch for subnet IPv4 192.168.1.0/24 and IPv6 /64 address (my LAN)

  2. Thunderbolt mesh connected in a ring for subnet fc00::80/124

    • this has 3 single-address subnets (fc00::81/128, fc00::82/128 and fc00::83/128) which are used for FRR OpenFabric routing between the nodes
  3. Additional 2.5GbE NIC using the (NUCIOALUWS) add-on, for a subnet TBD

My current thinking is (see the ceph.conf sketch after this list):

  • cluster (aka corosync) network uses network 1 (2.5GbE)
  • ceph migration traffic uses network 2 (thunderbolt)
  • ceph public network uses network 2 (thunderbolt)
  • CT and VM migration traffic uses network 2 (thunderbolt)
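
For reference, if ceph public and cluster traffic both end up on the thunderbolt mesh as planned above, the relevant part of /etc/pve/ceph.conf should look roughly like the sketch below. This is a hand-written sketch, not output from the Proxmox ceph wizard, and the IPv6 bind options are an assumption for an IPv6-only ceph network.

```
# /etc/pve/ceph.conf (excerpt) - sketch assuming both ceph networks
# live on the thunderbolt mesh subnet fc00::80/124
[global]
    public_network  = fc00::80/124
    cluster_network = fc00::80/124
    # assumption: bind ceph daemons to IPv6 only, since the mesh is IPv6-only
    ms_bind_ipv6 = true
    ms_bind_ipv4 = false
```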

I have not yet decided what network 3 will be used for, options are:

  • cluster public network that other devices use to access the cluster or its resources
  • backup corosync (though I don't see a reason not to have corosync on all 3 networks - see the corosync sketch below)
  • ceph public network - but I assume this is what the VMs use, so I want that on the 26Gbps thunderbolt mesh too
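
If corosync does end up on more than one network, each node just gets an extra ringX_addr in /etc/pve/corosync.conf. A sketch of what a two-link nodelist could look like is below; the node names pve1/pve2/pve3 and the 10.10.10.x addresses for link 1 are placeholders, not my real config, and any corosync.conf edit should follow the usual Proxmox procedure (edit a copy, bump config_version, then move it into place).

```
# /etc/pve/corosync.conf (excerpt) - hypothetical two-link nodelist
nodelist {
  node {
    name: pve1
    nodeid: 1
    quorum_votes: 1
    ring0_addr: 192.168.1.81   # link 0: onboard 2.5GbE (network 1)
    ring1_addr: 10.10.10.81    # link 1: placeholder for network 3
  }
  node {
    name: pve2
    nodeid: 2
    quorum_votes: 1
    ring0_addr: 192.168.1.82
    ring1_addr: 10.10.10.82
  }
  node {
    name: pve3
    nodeid: 3
    quorum_votes: 1
    ring0_addr: 192.168.1.83
    ring1_addr: 10.10.10.83
  }
}
```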

Create Cluster

You should have 3 browser tabs open for this, one for each node's management IP.

setup on node 1

  1. navigate to Datacenter > Cluster and click Create Cluster
  2. name the cluster e.g. pve-cluster1
  3. set link 0 to the IPv4 address (in my case 192.168.1.81 on interface vmbr0)
  4. click create
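
If you prefer the shell, the same thing should be achievable with pvecm on node 1 (cluster name and address as in the steps above):

```
# run on node 1 - CLI equivalent of the Create Cluster steps
pvecm create pve-cluster1 --link0 192.168.1.81

# confirm the cluster exists and node 1 is a member
pvecm status
```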

Join node 2

  1. on node 1 in Datacenter > Cluster click join information
  2. the IP address should be node 1's IPv4 address
  3. click copy information
  4. open tab 2 in your browser to node 2 management page
  5. navigate to Datacenter > Cluster and click join cluster
  6. paste the information you collected in step 3 into the dialog box
  7. Fill the root password in of node 1
  8. Select Link 0 as 192.168.1.82
  9. click button join 'pve-cluster1'
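
The CLI equivalent, run from a shell on node 2 and pointed at node 1's management IP, should be roughly this (it prompts for node 1's root password):

```
# run on node 2 - join the cluster created on node 1,
# using node 2's own 192.168.1.82 address for link 0
pvecm add 192.168.1.81 --link0 192.168.1.82
```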

Join node 3

  1. on node 1 in Datacenter > Cluster click join information
  2. the IP address should be node 1's IPv4 address
  3. click copy information
  4. open tab 3 in your browser to the node 3 management page
  5. navigate to Datacenter > Cluster and click join cluster
  6. paste the information you collected in step 3 into the dialog box
  7. Fill the root password in of node 1
  8. Select Link 0 as 192.168.1.83
  9. click button join 'pve-cluster1'

at this point close your pve2 and pve3 tabs - you can now manage all 3 cluster nodes from node 1 (or any node)
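
A quick sanity check from any node's shell should report 3 nodes and quorum:

```
# expect "Nodes: 3" and "Quorate: Yes" in the output
pvecm status

# list the cluster members
pvecm nodes
```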

Define Migration Network

  1. navigate in webui to Datacenter > Options
  2. double click Migration Settings
  3. select any network and click OK - this is just to create an entry in the config file
  4. edit /etc/pve/datacenter.cfg with nano and change migration: network=10.0.0.81/32,type=secure to migration: network=fc00::80/124,type=insecure (see the snippet below). This is because a) this subnet contains fc00::80 thru fc00::8f, and b) because it is a 100% isolated network it can be set to insecure, which gives a small speed boost
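
After the edit the migration line in /etc/pve/datacenter.cfg should look like this (other lines in the file omitted):

```
# /etc/pve/datacenter.cfg (excerpt) - migration pinned to the isolated
# thunderbolt mesh subnet, so insecure mode is acceptable
migration: network=fc00::80/124,type=insecure
```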

Configuring for High Availability

  1. navigate in webui to Datacenter > HA > Groups
  2. click create
  3. Name the group (ID) ClusterGroup1
  4. add all 3 nodes and then click create
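
The same group can, as far as I know, also be created from the shell with ha-manager; the node names pve1,pve2,pve3 below are assumptions, substitute your actual node names:

```
# create an HA group containing all 3 nodes (node names assumed)
ha-manager groupadd ClusterGroup1 --nodes "pve1,pve2,pve3"

# verify the group exists
ha-manager groupconfig
```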

ssavkar commented Aug 18, 2025

So regarding corosync and ceph network being on physically separate NICs, I am in a bit of a quandary as I have sort of run out of NICs. I have a 2.5GB NIC for management, then using the second one for redundant OPNSense WAN ports, and then using my thunderbolt setup for Ceph.

I noticed in some other videos that I watched way back when that folks would use two links, which is what I am doing, using both the management LAN and the Ceph network. I know this is a bad idea and that it would be better if I had one more link (Link 0) that was a dedicated network just for corosync, but unless I throw a PCI card into all my MS-01s, I really can't do this right now since of my two SFP ports, one is used for VLANs and I would prefer not to tie up the other SFP port for just corosync.

Has anyone really noticed any real world issue here with what I have done? Where would I see it manifest itself if at all? I have been running this way for more than 6 months without anything noticeable on my side.

** Oh and if folks say I really still should set up a separate corosync network, perhaps I should look into a 2 port 2.5GB card or even a 1GB 2 port card. Maybe can even create a separate "mesh" for corosync?
