A recent paper, "Safety and Fairness in Agentic AI Depend on Interaction Topology, Not on Model Scale or Alignment" (arxiv:2605.01147), provides empirical confirmation of something the multi-agent safety field has been circling: aligning individual agents does not produce aligned systems. The paper shows ordering instability (outcomes shifting by 59 percentage points from reordering agents alone), information cascades (99.9% agreement with zero error correction in larger models), and functional collapse (systems satisfying fairness metrics while abandoning their actual function).
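To make the cascade and ordering findings concrete, here is a toy sketch of my own (not the paper's experiment; every name, rule, and number below is invented for illustration): identical agents with the same individual accuracy, wired into two different topologies. In the sequential topology each agent sees earlier public answers and defers to a clear majority, so early mistakes can lock in; in the independent topology the same agents answer alone and a final vote corrects them.

```python
# Toy illustration (not the paper's experiment): identical agents, identical
# per-agent accuracy, two interaction topologies with very different
# system-level error rates.
import random

def private_guess(truth: int, accuracy: float) -> int:
    """One agent's noisy private read of a binary ground truth."""
    return truth if random.random() < accuracy else 1 - truth

def run_sequential(n_agents: int, truth: int, accuracy: float) -> list[int]:
    """Agents answer in order, each seeing all earlier public answers."""
    answers: list[int] = []
    for _ in range(n_agents):
        signal = private_guess(truth, accuracy)
        ones = sum(answers)
        zeros = len(answers) - ones
        # Defer to the visible majority once it clearly outweighs one private
        # signal; after that point every agent just repeats the consensus.
        if abs(ones - zeros) >= 2:
            answers.append(1 if ones > zeros else 0)
        else:
            answers.append(signal)
    return answers

def run_independent(n_agents: int, truth: int, accuracy: float) -> list[int]:
    """Same agents, same accuracy, but no one sees anyone else's answer."""
    return [private_guess(truth, accuracy) for _ in range(n_agents)]

def system_error_rate(runner, trials: int = 2000, n_agents: int = 15,
                      accuracy: float = 0.7) -> float:
    """Fraction of trials in which the agents' majority answer is wrong."""
    wrong = 0
    for _ in range(trials):
        truth = random.randint(0, 1)
        answers = runner(n_agents, truth, accuracy)
        majority = 1 if sum(answers) * 2 > len(answers) else 0
        wrong += (majority != truth)
    return wrong / trials

if __name__ == "__main__":
    random.seed(0)
    print("sequential (cascade-prone):", system_error_rate(run_sequential))
    print("independent (vote corrects):", system_error_rate(run_independent))
```

With these toy settings the sequential pipeline converges on a wrong consensus noticeably more often than the independent vote (roughly three times as often), and once a cascade forms every later agent agrees regardless of its private signal: high agreement, zero error correction, a system-level failure that no amount of per-agent accuracy fixes.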
Their conclusion: "Current safety frameworks targeting component-level alignment are targeting the wrong object."
They're right. But the paper's own proposals (topological sweeps, architecture disclosure, stress-testing) still attack the problem from the outside. They ask: which topologies are safe? The more productive question is: why does topology determine safety at all?