#monero-research-lab

00:37

Isthmus

I always assume that one of us is secretly a 3 letter agency, and somebody else is probably a Fortune 500 company that is still in stealth mode about cryptocurrency adoption. :- P
00:38

Isthmus

Anyways, what do y'all think about extra decoy outputs for the sake of artificially increasing the anonymity set?
00:38

Isthmus

github.com/MyHush/sietch
00:38

Isthmus

I don't agree with 100% of the statements in that GitHub issue, but I do find the idea thought provoking
00:39

Isthmus

Kind of depends on ratios between transaction volume, output volume, and ring size
00:39

Isthmus

s/GitHub issue/Sietch writeup on GitHub
00:39

monerobux

Isthmus meant to say: I don't agree with 100% of the statements in that Sietch writeup on GitHub, but I do find the idea thought provoking
00:42

sarang

This has been brought up several times over the years
00:50

sarang

The previous context was mainly about the idea of reducing the effects of entities colluding with known outputs to reduce the effective anonymity set
01:38

derpy_bridge

<[keybase] unseddd>: sarang: agree that increasing the anonymity set is a good idea, along with removing the ability of malicious actors to collude in a way that reduces the anonymity set for all users. am advocating for not using an algorithm that is too greedy, rejecting legitimate auxilliary input(s), claiming they are malicious
01:41

sarang

You can't prevent entities from colluding
01:42

sarang

Nor can you reliably detect what outputs belong to such entities without significant external information
01:43

derpy_bridge

<[keybase] unseddd>: what i mean, is removing user control over vulnerable aspects of the protocol as much as possible, e.g. deterministic tx-building to defeat Janus attacks
01:43

sarang

FWIW the percentage of outputs that need to be controlled by colluding entities (or otherwise suspected/known to be non-signers) to destroy a ring signature is nontrivial
01:44

sarang

and keep in mind that this is very time-based
01:44

sarang

since decoy selection by default is not uniform over time
01:44

sarang

unseddd: decoy selection is not protocol enfoced
01:44

sarang

*enforced
01:45

sarang

There have been proposals to do so, but they do not play nicely with accurate spend distribution estimates
01:46

derpy_bridge

<[keybase] unseddd>: great! that should make implementing/integrating hardening changes there won't require any consensus changes / hard-forks, right?
01:46

sarang

What kind of change
01:47

sarang

Changes to non-enforced decoy selection are, by definition, not enforced =p
01:47

derpy_bridge

<[keybase] unseddd>: still don't fully understand the idea of non-uniform distribution, if spend distributions can't be analyzed on Monero
01:48

sarang

They can't directly
01:48

sarang

But some early transactions can be
01:48

sarang

and this distribution can be compared to transparent chains
01:49

sarang

They turn out to be similar
01:49

derpy_bridge

<[keybase] unseddd>: the distribution algo from stdlib also causes clang builds to fail for Monero, so replacing it with a uniform random distribution would fix two things
01:49

sarang

This is assumed to be a reasonable approximation
01:49

sarang

A uniform random selection from the chain is not suitable
01:49

derpy_bridge

<[keybase] unseddd>: maybe Monero used uniform random distribution before, and it caused issues?
01:50

sarang

because outputs are not equally likely to be spent as a function of their age on chain
01:50

sarang

Newer outputs are _much_ more likely to be spent
01:51

sarang

If the selection algorithm differs significantly from the expected spend age distribution, you can build a heuristic for the most likely signer based on this
01:51

sarang

So while moving to a simple distribution is useful for protocol-enforced decoy selection, it is terrible for adversarial heuristics
01:52

sarang

If there are problems with the current distribution, it might be possible to rewrite a custom version that builds properly with other tools
01:52

sarang

(I have not run into this problem personally)
01:57

derpy_bridge

<[keybase] unseddd>: try building Monero with clang, you'll run into the issue
02:00

Isthmus

I want R = fxn[seed, height]
02:01

Isthmus

seed is any integer, height is a block height
02:01

derpy_bridge

<[keybase] unseddd>: also advocating for enforcing decoy selection, at least in some of the ways mentioned for Janus mitigation, i.e. making tx-building verifiable, even if optional
02:01

sarang

Isthmus: ?
02:01

Isthmus

And the output R is a set of ring members [technically, a list of output indices] that satisfy our decoy selection algorithm
02:01

sarang

Oh you mean deterministic ring selection
02:02

sarang

Yes, this method exists
02:02

sarang

but in general requires inverse transform sampling
02:02

Isthmus

Oooh, tell me more
02:02

derpy_bridge

<[keybase] unseddd>: Isthmus: you want fxn? ;p
02:02

Isthmus

Oh, having to generate sets of R's to find one that includes your output?
02:02

Isthmus

fxn, yes plz
02:03

sarang

spar.isi.jhu.edu/~mgreen/mixing.pdf
02:03

sarang

For simple distributions it's efficient
02:03

sarang

(I have simple code that shows examples of this)
02:03

sarang

For more complex distributions, not so much
02:03

derpy_bridge

<[keybase] unseddd>: _gib Isthmus fxn_
02:03

sarang

Because the verifier needs to use the seed data to reconstruct the output set
02:04

sarang

The paper shows it for uniform sampling and the old triangular sampling
02:05

Isthmus

O rly?
02:05

sarang

yes
02:05

» Isthmus scopes it out
02:05

sarang

but our distribution is not nearly so straightforward as lines
02:05

sarang

It relies on keyed hash functions
02:06

Isthmus

Here's a weird, maybe heretical question
02:06

Isthmus

How closely does our decoy selection algorithm need to match the real spend time distribution
02:07

Isthmus

If we ask "what is the ideal decoy selection algorithm" the trivial answer is "our best approximation of the spend time"
02:07

derpy_bridge

<[keybase] unseddd>: not familiar, will check it out. perhaps uniform-random is naive, as am still familiarizing myself with Monero attach surface / design choices. however, continuing a heuristic based only on information exposed by the earliest txes seems a large technical debt
02:07

Isthmus

But is this the *only* possible approach that would provide adequate cover?
02:07

derpy_bridge

<[keybase] unseddd>: *attack
02:08

sarang

Miller et al. look at this difference
02:09

sarang

Well, it's important to note that there is (ideally) no way to check the validity of age-based guesses
02:10

sarang

So on their own, they provide no particular provable data
02:10

sarang

and it's not at all clear what a quantifiable definition of "plausible deniability" for a ring signature looks like in practice
02:11

sarang

unseddd: how does the selection algorithm imply technical debt?
02:12

sarang

We have no particular reason to think that Monero spend age distributions differ from the combination of known Monero spend data and Bitcoin spend data, which show the same trends
02:12

sarang

(and we can't check this anyway)
02:12

derpy_bridge

<[keybase] unseddd>: unfortunately don't think practical knowledge of plausible deniability will be know until it's used in a court case or similar
02:12

sarang

and FWIW the distribution is not that complicated
02:12

sarang

it just doesn't play as nicely with sampling that would make deterministic selection reasonable
02:15

derpy_bridge

<[keybase] unseddd>: since the effect of uniform random selection on deniable plausibility is unknown, and the effects of non-deterministic selection is known (Janus), would favor mitigating the latter given more research on the former
02:17

Isthmus

I don't really think much about "plausible deniability," since it's an artificial construct that will vary by every jurisdiction and decade. My only interest is in statistical obfuscation.
02:18

sarang

Janus is about subaddress faking, not decoy selection
02:18

sarang

And uniform selection is a terrible option
02:18

derpy_bridge

<[keybase] unseddd>: basically, if we can measure the effects of uniform random distribution on selection, and it is found to not break anything, lets use uniform random
02:18

sarang

Under all but the simplest risk assumptions
02:19

sarang

Uniform selection doesn't break anything except statistical expectations of spend ages
02:19

derpy_bridge

<[keybase] unseddd>: why terrible?
02:19

sarang

Because it basically gives away the likely signer
02:19

sarang

And an adversary can use that to weight the likely true tx graph
02:19

derpy_bridge

<[keybase] unseddd>: statistics that cannot be verified, or even measured, on the current chain?
02:20

sarang

No but they can be used as part of broader graph techniques
02:20

sarang

And they're a really good heuristic
02:20

derpy_bridge

<[keybase] unseddd>: giving away the likely signer is disasterous, if that really is the effect, definitely no uniform random
02:22

derpy_bridge

<[keybase] unseddd>: good heuristic based only on unspent early txes right? am misunderstanding something? create a turnstyle-like protocol, and remove the ability to perform those heuristics. problem solved, no?
02:25

sarang

That's not really the point
02:25

sarang

The point is that we know how users tend to spend outputs based on other chains and early Monero data
02:26

derpy_bridge

<[keybase] unseddd>: thought removing heuristics was the point, where have gone wrong?
02:27

sarang

Selecting decoys according to expected spend patterns is the mitigation to this heuristic
02:27

sarang

That's exactly why we use the algorithm we do
02:27

sarang

And continue to iterate on it
02:28

derpy_bridge

<[keybase] unseddd>: right, just trying to think of a more permanent solution, so you do not have the technical debt going forward
02:28

sarang

How is it technical debt
02:29

sarang

I don't really follow
02:29

derpy_bridge

<[keybase] unseddd>: having to continually update a selection algo based on best-guesses and heuristics sounds like technical debt to me
02:29

gingeropolous

yeah, the output selection doesn't create technical debt afaict.
02:30

gingeropolous

its more like a technical burden
02:30

hyc

a lot of aspects of security are continually moving targets
02:30

derpy_bridge

<[keybase] unseddd>: burden == debt ...
02:31

sarang

It's not ideal, but it is a consequence of ring signatures
02:31

gingeropolous

well yeah, semantics
02:31

hyc

you can't really pick a single algorithm and carve it into stone. it'd be like the maginot line, someone will walk around it.
02:31

gingeropolous

simple answer is ringsize a bajillion
02:32

sarang

The goal is transaction uniformity, which depends in part on usage patterns
02:32

sarang

We need to adapt to them as best we can
02:32

sarang

This is part of it
02:33

derpy_bridge

<[keybase] unseddd>: hyc: get that security, and the fight for privacy is ever-moving, just tring to think of things that will make Monero have to move less in this particular direction. totally understand that i do not comprehend the full picture yet
02:33

gingeropolous

those are good thoughts to have. less human hands the better
02:33

sarang

There are techniques like binning that can apply better at larger ring size
02:34

sarang

These have the added advantage of reducing communication complexity
02:35

derpy_bridge

<[keybase] unseddd>: sarang: are those ring sizes practical?
02:35

sarang

That's the current goal
02:35

sarang

We're getting there
02:36

derpy_bridge

<[keybase] unseddd>: ok awesome! then will direct my attention there :)
02:36

sarang

I assure you it is not an easy problem to solve
02:36

gingeropolous

nonsense! just put it on a blockchain!
02:36

sarang

Brilliant
02:37

derpy_bridge

<[keybase] unseddd>: no, wouldn't think it is, but at willing to throw some braincells at it
02:37

sarang

Please do
02:37

sarang

Nobody has totally solved it yet
02:38

derpy_bridge

<[keybase] unseddd>: is there a tracking issue for ongoing efforts?
02:39

hyc

the current effort is in CLSAG isn't it?
02:40

sarang

Clsag is more of a stopgap
02:40

derpy_bridge

<[keybase] unseddd>: oh, so CLSAG and it's successors would enable large enough ring sizes for binning?
02:41

sarang

Other efforts include Ommiring, Triptych, Lelantus, RCT 3, Arcturus
02:41

sarang

Clsag would not
02:41

sarang

Some are externally developed, others are in house
02:45

derpy_bridge

<[keybase] unseddd>: sarang: is there a paper(s) or description of the binning technique you mentioned?
02:48

gingeropolous

unsedd, eprint.iacr.org/2019/186.pdf
02:48

gingeropolous

i think thats the reference
02:49

sarang

petsymposium.org/2018/files/papers/issue3/popets-2018-0025.pdf
02:51

gingeropolous

woops
03:00

derpy_bridge

<[keybase] unseddd>: gingeropolous, sarang: many thanks :)
03:29

derpy_bridge

<[keybase] unseddd>: yeah, that paper makes it obvious that uniform distribution is terrible for input selection. sorry for wasting time suggesting it
04:25

derpy_bridge

<[keybase] unseddd>: would be interesting to re-run the POPETS18 analysis using most current chain data, see if their results still hold. still need to read the IACR paper
07:48

derpy_bridge

<[keybase] unseddd>: by re-running the analysis, only mean using the current 11-mixin with Gaussian distribution, and compare with say 11-mixin, (3,4)-bin binning strategy
07:51

derpy_bridge

<[keybase] unseddd>: the 7-mixin, 4-bin gives 4.0 min-untraceability in the POPETS18 paper, so curious to see if the 11-mixin, 4-bin strategy gets close to ideal 1/11 chance of 100% traceability

5 years ago

« a day earlier

a day later »

today »