Preliminary results of risk of OSPEAD deployment without hard fork

TL;DR

PRELIMINARY: When an adversary uses possibly non-optimal classification methods, the privacy risk to users of OSPEAD deployment without a hard fork is lower than the status quo of all users using the current decoy selection algorithm.

Introduction

Below are some very preliminary results on the privacy risk of deploying a new OSPEAD-derived decoy selection algorithm (DSA) in the wallet2 reference software without a hard fork.

Having two DSAs used in the wild can potentially split the Monero anonymity pool into two anonymity puddles. Roughly, the tasks of an adversary are:

1. Classify transactions by the DSA used to construct them. A given DSA leaves probabilistic evidence of its true form: the empirical temporal distribution of 15 of the 16 ring members. For now, we assume that the adversary will use a neural net (NN) classifier for this task.
2. Given the classifications (allocations) made in step 1, try to guess the real spend. They can do this with the nonfungibility (NF) classifier described in Rucknium (2023), which guesses that the real spend is the one whose antecedent transaction shares the same DSA as the transaction of interest.
3. Alternatively, implement the MAP Decoder attack against the actual ring member distribution, which guesses that the real spend is the one with the highest relative probability of belonging to the real spend age distribution, compared to the decoy distribution.
4. The adversary can dynamically choose between the NF classifier and the MAP Decoder, depending on which one has the greater accuracy in a given circumstance.
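
The MAP Decoder rule described above can be sketched in a few lines. This is a minimal illustration, not the code used for the results here. It assumes that a ring member's age is `exp(Y)` with `Y` gamma-distributed (a log-gamma age), that both the real spend and the decoy distribution have this parametric form, and that every age is greater than 1 so its logarithm is positive. The function names and the `(shape, rate)` tuples are hypothetical.

```python
import math

def gamma_logpdf(y, shape, rate):
    """Log-density of Gamma(shape, rate) at y, where y > 0 is a log-age."""
    return (shape * math.log(rate) - math.lgamma(shape)
            + (shape - 1.0) * math.log(y) - rate * y)

def map_decoder(ring_ages, spend_params, decoy_params):
    """Return (index of the guessed real spend, posterior per ring member).

    spend_params and decoy_params are (shape, rate) tuples (assumed form).
    """
    log_ratios = []
    for age in ring_ages:
        y = math.log(age)
        # The 1/age Jacobian of the log transform cancels in the ratio,
        # so the ratio of gamma densities at the log-age is sufficient.
        log_ratios.append(gamma_logpdf(y, *spend_params)
                          - gamma_logpdf(y, *decoy_params))
    top = max(log_ratios)
    weights = [math.exp(lr - top) for lr in log_ratios]  # numerically stable
    total = sum(weights)
    posterior = [w / total for w in weights]
    guess = max(range(len(ring_ages)), key=posterior.__getitem__)
    return guess, posterior
```

The guess is the ring member with the largest ratio of real spend density to decoy density. When the two parameter tuples are equal, every ratio is 1 and the posterior is uniform across the ring.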

Assumptions/initial parameters

Some initial parameters are needed to set up the simulation:

The share of users using the new DSA, $\beta$. I ran simulations for several values of $\beta$.

I assume that every transaction has one input and therefore a single ring. Transactions with more than one input would be easier to classify into a specific DSA.

$C$ is the share of transactions that spend change from the user's own wallet. This has been empirically measured for a few wallet implementations that have a nonstandard fee defect. The measured values were $C$ = 22%, 38%, 39%, 41%, and 61%. Three of the five cluster near 40%, so I chose $C$ = 40% in the simulations. A higher $C$ would make the NF classifier guess the real spend with higher probability. See Table 1 of Rucknium (2023).

The simulated real spend distribution, new DSA, and old DSA need to be specified. The old DSA in the simulations is the actual wallet2 DSA. The real spend distribution and new DSA were chosen to be compatible with the contributions of @spackle-xmr in his neural net simulations. Both are the OSPEAD-fitted log-gamma distribution with shape parameter 4.315 and rate parameter 0.3751. Because the real spend distribution and the new DSA are exactly the same, the MAP Decoder does no better against new-DSA users than uniform-random guessing (1 in 16, or 6.25%). In this simulation, the average MAP Decoder attack success probability against users using the old DSA is 27%. A more realistic scenario would set the real spend distribution to its actual nonparametric distribution estimate, which would allow the adversary to attack new-DSA users with the MAP Decoder with greater success than random guessing.
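
To see why identical distributions reduce the MAP Decoder to random guessing, write $f_S$ for the real spend age density and $f_D$ for the decoy age density. This notation is introduced here for illustration. Assuming a uniform prior over the 16 ring positions and independent decoy draws, the posterior probability that ring member $i$ with age $x_i$ is the real spend is:

$$
P(i \text{ is real} \mid x_1, \dots, x_{16}) = \frac{f_S(x_i) / f_D(x_i)}{\sum_{j=1}^{16} f_S(x_j) / f_D(x_j)}
$$

If $f_S = f_D$, every ratio equals 1 and the posterior is $1/16$ for each ring member, whatever the observed ages. Any gap between $f_S$ and $f_D$, as with the old DSA, makes the ratios unequal and lets the adversary do better than $1/16$.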

Creating the datasets

A dataset for training the NN classifier was created. New-DSA rings and old-DSA rings were generated by drawing 15 ring members from the DSA distribution and one from the real spend distribution. The split between new-DSA and old-DSA rings followed the $\beta$ chosen for each simulation.
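
The ring generation described above might look roughly like the following sketch. The shape and rate for the new DSA and real spend come from the text. The `OLD_DSA` parameters are placeholders for illustration only; the simulations use the actual wallet2 DSA, which this sketch does not reproduce. Representing each ring member by its log-age and sorting the ring to hide the real spend position are also my assumptions.

```python
import random

NEW_DSA = (4.315, 0.3751)  # (shape, rate) from the text; also the real spend
OLD_DSA = (19.28, 1.61)    # placeholder stand-in, NOT the full wallet2 DSA

def draw_log_age(params, rng):
    """Draw one log-age from Gamma(shape, rate)."""
    shape, rate = params
    # random.gammavariate takes (shape, scale), so scale = 1 / rate.
    return rng.gammavariate(shape, 1.0 / rate)

def make_ring(decoy_params, spend_params, rng, ring_size=16):
    """15 decoy draws plus 1 real spend draw, sorted to hide the position."""
    ring = [draw_log_age(decoy_params, rng) for _ in range(ring_size - 1)]
    ring.append(draw_log_age(spend_params, rng))
    ring.sort()
    return ring

def make_training_set(n_rings, beta, rng):
    """Return shuffled (ring, label) pairs; label 1 = new DSA, 0 = old DSA."""
    n_new = round(n_rings * beta)
    data = [(make_ring(NEW_DSA, NEW_DSA, rng), 1) for _ in range(n_new)]
    data += [(make_ring(OLD_DSA, NEW_DSA, rng), 0)
             for _ in range(n_rings - n_new)]
    rng.shuffle(data)
    return data

training = make_training_set(1000, beta=0.3, rng=random.Random(1))
```

Each pair holds the 16 sorted log-ages and the DSA label that a classifier would be trained to predict.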

A separate dataset for guessing the real spend was created. Each observation consists of a "transaction of interest". The transaction of interest has a ring distribution that is either new-DSA or old-DSA. Each ring member (one of the "antecedent transactions" of the "transaction of interest") has a probability of being constructed with the new DSA or old DSA, depending on $\beta$ and $C$. In total, each transaction of interest has 17 sets of 16 ring members: one for the transaction itself and one for each of its 16 ring members.
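
One way to read the dependence on $\beta$ and $C$ is sketched below. It is an assumption of mine, not the exact procedure used here: every antecedent transaction uses the new DSA with probability $\beta$, except that with probability $C$ the real spend is change from the same wallet and so shares the DSA of the transaction of interest. The sketch also applies the NF classifier to the true DSA labels, whereas the adversary in this analysis only has the NN's imperfect classifications. All function names are hypothetical.

```python
import random

def antecedent_labels(toi_is_new, beta, change_share, rng, ring_size=16):
    """Return (labels, real_index); labels[i] is True for a new-DSA antecedent."""
    real_index = rng.randrange(ring_size)
    labels = [rng.random() < beta for _ in range(ring_size)]
    # Assumption: change comes from the same wallet, hence the same DSA.
    if rng.random() < change_share:
        labels[real_index] = toi_is_new
    return labels, real_index

def nf_guess(toi_is_new, labels, rng):
    """NF classifier: pick uniformly among antecedents with a matching DSA."""
    matches = [i for i, lab in enumerate(labels) if lab == toi_is_new]
    candidates = matches or list(range(len(labels)))  # no match: guess any
    return rng.choice(candidates)

rng = random.Random(7)
trials = 5000
hits = 0
for _ in range(trials):
    labels, real_index = antecedent_labels(True, beta=0.2, change_share=0.4,
                                           rng=rng)
    hits += nf_guess(True, labels, rng) == real_index
nf_success_rate = hits / trials
```

Under these assumptions the NF classifier beats the 1/16 baseline whenever matching antecedents are scarce, which is the case for new-DSA transactions when $\beta$ is small.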

Real spend classification steps

The adversary follows this procedure:

1. Train the NN classifier with the training dataset.
2. Apply the NN classifier to the transactions of interest and the antecedent transactions. Store the classifications.
3. Given these classifications, which will not be 100% correct, apply the MAP Decoder and NF classifier.
4. Choose the classifier (MAP Decoder or NF) that has the highest probability of guessing the real spend in a given situation. The "situation" depends on whether the transaction of interest was classified as new- or old-DSA. Also, from step (2), the adversary will know how many antecedent transactions have been classified as new/old-DSA in a given transaction of interest and can adapt its rules according to the DSA classification count.
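
The adaptive choice in the last step can be sketched as a lookup table built from simulated outcomes. This is an illustrative sketch under my own assumptions: a "situation" is encoded as a tuple of the classified DSA of the transaction of interest and the count of antecedents classified as new-DSA, and the adversary calibrates the table on simulated data where the real spend is known. The function name and record layout are hypothetical.

```python
from collections import defaultdict

def choose_rules(records):
    """Pick the more accurate classifier per situation.

    records: iterable of (situation, map_correct, nf_correct) tuples.
    Returns {situation: (classifier_name, estimated_accuracy)}.
    """
    tallies = defaultdict(lambda: [0, 0, 0])  # count, MAP hits, NF hits
    for situation, map_correct, nf_correct in records:
        tally = tallies[situation]
        tally[0] += 1
        tally[1] += bool(map_correct)
        tally[2] += bool(nf_correct)
    rules = {}
    for situation, (count, map_hits, nf_hits) in tallies.items():
        map_acc, nf_acc = map_hits / count, nf_hits / count
        # Ties go to the MAP Decoder (an arbitrary choice in this sketch).
        rules[situation] = (("MAP", map_acc) if map_acc >= nf_acc
                            else ("NF", nf_acc))
    return rules

# Toy records: (classified DSA, antecedents classified new-DSA), outcomes.
rules = choose_rules([
    (("old", 3), True, False),
    (("old", 3), True, True),
    (("new", 2), False, True),
    (("new", 2), False, False),
])
```

With the toy records above, the table selects the MAP Decoder for the `("old", 3)` situation and the NF classifier for `("new", 2)`.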