00:17:54 <knaccc> this is my attempt at a diagram https://imgur.com/a/FX8KN1j cc: sgp_
00:26:30 <knaccc> ;/
00:26:59 <knaccc> (that was accidental typing, not a smilie)
00:52:51 <sgp_> knaccc: how do you know which set of subaddresses to select from? Ideally the exchange would use one subaddress per user
00:54:40 <knaccc> sgp_ the hash is just a simple hash of the subaddress. so alongside every output in every transaction (involving exchanges or otherwise), the hash that appears corresponds to whatever subaddress the recipient asked for funds to be sent to
00:55:12 <knaccc> and all tx senders, including exchanges, must use the correct hash, or wallets will not notice the transaction
00:55:39 <knaccc> because wallets can now directly look for hashes matching wallet subaddresses, instead of doing heavy-ECDH scanning
00:56:20 <knaccc> cc moneromooo - i think what I just typed addresses the question you just posted to github? please let me know if it doesn't answer your question and i've misunderstood
00:59:00 <gingeropolous> <knaccc> (that was accidental typing, not a smilie) <---- its a funny emoticon to throw after posting a link for this :) im getting a good chuckle.
00:59:12 <knaccc> :)
01:01:06 <moneromooo> It does, but it presupposes honest adversaries.
01:01:39 <moneromooo> If an exchange wants to do this, you'll get lots of people asking for patches to recognize those outputs, since they were sent to the wallet.
01:02:01 <moneromooo> Not sure it'd get done since an exhcange would not gain that much from it though.
01:02:36 <moneromooo> The first paragraph (the question) was not addressed though, just the second one.
01:03:43 <gingeropolous> sorry knaccc , that figure didn't do it for me :( but EABE makes my head hurt. i need to go prime myself on it again
01:04:14 <moneromooo> Assuming I understand your technique, I think that it's mostly pointless if the number of exchanges is high enough compared to the ring size.
01:05:10 <moneromooo> If low enough, I see it helping, but whether it overrides the flaws I dunno.
01:05:47 <knaccc> moneromooo ah sorry, i updated the github comment to address the first part
01:06:07 <knaccc> the scheme does not rely on hoping to get outputs from any particular exchange or from any exchange at all
01:06:31 <knaccc> all that matters is that it's not only A's outputs that appear in all 3 branches, but other users' outputs too
01:07:03 <moneromooo> That will only be the case if those come from the exchange, no ?
01:07:59 <knaccc> i don't think i understand your question. i hope its clear that these hashes appear in all txs, not just txs sent by exchanges
01:08:22 <knaccc> and this scheme also solves the equivalent of EABE traceability issues in ABC scenarios etc
01:08:35 <knaccc> so exchanges aren't even necessary to think specially about
01:09:03 <moneromooo> Well, F is another exchange, and Carol and Dave are F users.
01:09:28 <moneromooo> Alice sends to Bob, and looks for grouped outputs. She finds lots of stuff F sent Carol and Dave. She uses those in her ring.
01:10:03 <moneromooo> When E receives Bob's monero, it finds lots of A smelling outputs, but also some of C and D.
01:10:13 <moneromooo> And... I can see why it doesn't matter actually.
01:10:52 <gingeropolous> its like trying to bin based on the emergent characteristics of a particular output type
01:11:09 <gingeropolous> mebbe
01:11:26 <knaccc> yes all that matters is that when A's outputs pop up in all 3 branches, outputs owned by other users pop up per-user in all 3 branches
01:11:28 <moneromooo> So... interestingly, if A withdraws from E to a different subaddress every time, it makes Bob stand out more, does it not.
01:11:47 <knaccc> that's correct
01:11:49 <moneromooo> Since Carol and Dave will not use A's outputs.
01:12:00 <knaccc> yes, you've totally got it
01:12:33 <moneromooo> That's counter intuitive since people will generate addresses to be more private.
01:13:17 <knaccc> it's good that people use a subaddress per-sender
01:13:34 <knaccc> it's a problem if no one receives more than one output to any particular subaddress
01:13:51 <moneromooo> But if they keep the same subaddress for multiple withdraws, then if someone learns about one, they can track all others. That's kinda catch 22.
01:14:16 <sgp_> We are currently pushing for one-time use subaddresses for manual payments though, which means this is a breaking UX recommendation
01:14:16 <knaccc> that's correct, there is no forward secrecy
01:14:37 <sgp_> Also I'm worried about change outputs to the same subaddress being used to clearly identify timing
01:15:32 <knaccc> can you elaborate on the that threat sgp_? what could be inferred
01:15:53 <moneromooo> I really don't like the fact that if a group puts a donation address up, everyone knows whenever they receive somehting.
01:15:54 <sgp_> Someone uses their subaddress to buy goods from a malicious seller, say for something illegal. Then the same person buys coffee from a cooperating store
01:16:08 <knaccc> moneromooo neither do i. that is a problem.
01:16:10 <sgp_> Would those transactions not be linked based on the change output to the same subaddress?
01:17:06 <moneromooo> Oh, that is a good point. Change goes to 0 now.
01:17:33 <sgp_> Afaict, change would connect all transactions to a single subaddress identity
01:17:40 <knaccc> sgp_ you make an interesting point. i think hashes on change outputs will have to be set randomly
01:18:14 <sgp_> At that point the benefit is decreased though from more blocks of subaddresses to choose the decoys from though, no?
01:18:15 <knaccc> yeah that's a very good point
01:18:42 <moneromooo> It makes churn pointless too. Which might be good :D
01:19:04 <moneromooo> Well, unless churning through a new subaddress every time...
01:19:34 <knaccc> well churn has never been popular because of bloat, and i also did work with either sarang or surae, i can't remember whom now, where it was becoming really obvious that multiple churns could not work because they stood out so clearly on the blockchain and were therefore not of any benefit
01:20:10 <sgp_> E send funds to Alice address. Alice sends funds to entity X, n transactions away. Entity X knows all transactions made by Alice. If given to exchange, they tie real identity to all these transactions. X can be any entity on the receiving end of those n transactions
01:20:59 <knaccc> sgp_ is this only if hashes are put on change addresses?
01:21:43 <sgp_> I don't understand fully, but if there is some linked grouping known to a counterparty
01:22:09 <knaccc> when you say "n transactions away", you mean it's EABCX or something like that?
01:22:25 <knaccc> and how does X know about all transactions made by Alice?
01:22:40 <knaccc> Alice is never transacting with X
01:22:41 <sgp_> Here's an example:
01:22:52 <sgp_> E -> A
01:23:09 <sgp_> A -> B,C,D,E,F,...X
01:23:58 <knaccc> so you're not saying A->B->C->D.... etc, you're saying A->B, A->C, etc?
01:23:58 <sgp_> E already knows all transactions sent by A without cooperation, if sent to the same subaddress
01:24:12 <sgp_> Yeah A->B, A->C, ...
01:24:24 <knaccc> ok so A->X happens too?
01:24:27 <knaccc> dircetly
01:24:30 <sgp_> Yes
01:24:34 <knaccc> ok
01:25:17 <knaccc> so yes you are talking about the change problem. this is easily fixed by A never putting that hash on change outputs A sends to herself
01:25:17 <sgp_> Let's say A->X is for a suspicious purchase where X is LE
01:25:36 <sgp_> X asks for the real identity of A from E
01:26:09 <knaccc> right, but how does X know A transacted with E?
01:26:35 <knaccc> and if that is known, how does X know that A transacted with both E and X?
01:26:59 <sgp_> If the change back to A and the transaction from E to A went to the same grouped, identifiable address
01:27:52 <knaccc> right, so the hash on change going back during A->B and A->C and A->X etc cannot be the same hash each time
01:27:59 <knaccc> or those txs will be linked
01:28:20 <sgp_> Yes it must always be different and unlinkable or else you learn the full list of someone's transactions
01:28:47 <knaccc> this is an easily solvable problem, all that needs to be done is to use hash(a || change one-time pubkey) instead for the hash on change
01:29:28 <knaccc> although that does break the bloom filter stuff a little
01:30:05 <sgp_> Suppose I'm B now and want to hide my association with A, who is not an exchange but is a malicious party
01:30:24 <knaccc> so this is a totally different scenario, right?
01:30:32 <sgp_> Yeah
01:30:55 <sgp_> How does B select the decoy outputs if they can't identify which outputs are related to A
01:32:08 <knaccc> so this is with A sending funds to B?
01:32:31 <sgp_> Suppose for simplicity that A sends funds to B 3 times
01:33:14 <knaccc> right, so the first time A->B happens, A chooses 10 decoys, each from a different per-subaddress bucket
01:33:44 <knaccc> and the subsequent times A->B happens, A chooses further decoys from each of those buckets used in the prior transaction
01:34:00 <sgp_> Okay, I think I'm with you so far
01:34:37 <knaccc> there is a problem, which is if the wallet is sending to B using a different subaddress owned by B each time, then the wallet won't realize
01:34:42 <knaccc> and so won't bucket properly
01:34:48 <sgp_> When B wants to send, what bucket(s) do they choose from?
01:35:20 <knaccc> it works exactly  the same way again
01:35:22 <sgp_> Suppose A send 3 times to B, each to the same subaddress
01:35:56 <sgp_> Then what buckets does B select?
01:36:43 <knaccc> as a quick side-note, it doesn't acutally matter if B does anything clever at all or not when sending funds on to cash out at E
01:36:56 <knaccc> but B will just do the same thing A did
01:37:15 <sgp_> How can B identify A's buckets?
01:37:21 <knaccc> which is that on every tx from B to any particular destination, it will choose decoys from buckets
01:37:29 <knaccc> B doesn't care about A's buckets
01:37:34 <knaccc> that's not B's concern
01:37:40 <knaccc> just like A didn't care about E's buckets
01:37:56 <knaccc> assuming A got outputs in the first place by purchasing them from E
01:37:57 <sgp_> Okay, maybe I had the selection backwards in my head then one sec
01:45:56 <sgp_> knaccc I'm still not getting it. Can you try explaining the bucket selection from scratch again? Do I select the groupings arbitrarily or from someone I'm related to?
01:47:55 <knaccc> sure, so here are the steps:
01:48:07 <knaccc> 1. you want to send funds to X
01:48:19 <knaccc> it doesn't matter who sent anything to you in the past. that's not relevant
01:48:44 <knaccc> 2. you search the blockchain for a random bucket that contains at least a few outputs
01:49:05 <knaccc> 3. you pick 10 such buckets at random, and pick an output from each of those buckets as a decoy
01:49:27 <knaccc> 4. the next time you send to X, you check which buckets you had previously randomly picked from
01:49:59 <knaccc> and you pick from each of those buckets a decoy which you have not previously used as a decoy
01:50:01 <knaccc> that's it.
01:50:49 <knaccc> i should clarify:
01:51:07 <knaccc> s/you check which buckets you had previously randomly picked from/you check which buckets you had previously randomly picked from when sending to X
01:51:07 <monerobux> knaccc meant to say: 4. the next time you send to X, you check which buckets you had previously randomly picked from when sending to X
01:54:33 <sgp_> Okay, I think I get it now
01:54:55 <sgp_> You're trying to find another possible identity
01:55:46 <sgp_> Would this setup not encourage users to act selfishly and not make buckets whenever possible?
01:57:16 <sgp_> Buckets would only be created if someone resent to the same subaddress correct?
02:00:03 <sgp_> Also as a separate issue, when sending one transaction each to two addresses, how do you know both addresses are controlled by different people? You wouldn't know to select from buckets
02:00:26 <sgp_> *from the same buckets
02:01:06 <sgp_> And if you are always selecting from the same buckets for all transactions regardless, then all your transactions are trivially grouped together by any outside observers
02:01:32 <knaccc> > act selfishly and not make buckets < it's possible, but recoding wallets isn't easy
02:01:49 <knaccc> > how do you know both addresses are controlled by different people < you don't. this is a problem
02:02:34 <knaccc> and that last point is very good, i'll think about it
02:03:05 <sgp_> Last point is only relevant if you expect to always select from the same buckets for all transactions you send
02:04:05 <sgp_> Multiple transactions to the same subaddress will already be grouped due to the nature of the design
02:04:42 <sgp_> If someone transacts with one entity frequently, the change outputs are quite obvious too then
02:04:48 <knaccc> yes, but that would be what you'd want to do if you wanted to be sure you were never missing situations when the wallet thinks it is sending to two different entities, but they're actually the same entitiy asking for payment via different subaddressses each time
02:05:53 <knaccc> and i'm pondering the implications of always picking from the same buckets over and over
02:06:19 <sgp_> Conditionally dropping linkability will have a bunch of weird complications like these I imagine
02:07:43 <sgp_> Suppose this other extreme example:
02:07:54 <sgp_> E has only one customer, A
02:08:15 <sgp_> E -> A x3
02:09:33 <sgp_> Hmm, I need to flesh out that example a bit more before I write it down actually
02:10:09 <sgp_> But my thought process is: can E reliably eliminate buckets?
02:11:00 <knaccc> i don't understand what you mean by eliminate them
02:11:32 <sgp_> Heuristically eliminate all decoys related to specific buckets
02:11:33 <knaccc> you mean consider them not likely decoys simply beacuse they are not customers of E?
02:11:50 <sgp_> Yeah that was my first line of thinking, but I need to think about it more
02:11:51 <knaccc> i don't think so. none of this requires exchanges to be large or small
02:12:19 <knaccc> and works in non-exchange scenarios to prevent ABC issues if C is hacked (allowing A to see if B is transacting with C)
02:13:48 <sgp_> Different thought: could attackers increase the likelihood of their outputs being selected as decoys by making a ton of buckets, thus increasing their selection chances easier than spamming outputs?
02:14:08 <sgp_> Back to MRL 1 and 4 😎
02:14:26 <knaccc> yes, and that's the same as the problem of choosing decoys in general
02:14:59 <sgp_> True but it's exacerbated no? Since exchange deposits, mining pool payouts, etc would be 1 selection per user, not 1 selection per output
02:16:37 <sgp_> Maybe you could weight the bucket selection chance by the number outputs in it instead of treating all buckets as equal
02:16:49 <knaccc> i think in both cases it's still a case of proportion spammed that determines if they can get you that way, i may be wrong
02:17:03 <knaccc> oh i see, yes perhaps
02:19:11 <sgp_> I think with current ringsizes, there definitely is a way to conditionally sacrifice linkability to improve traceability for a net benefit, but damn it's hard to weigh those things
02:22:06 <sgp_> I dare guess the greatest chance of a net positive sweet spot is by losing a *tiny* bit of linkability
02:22:28 <sgp_> But wow would that increase the scope of Breaking Monero lmao
02:22:49 <sgp_> Thanks for the conversation knaccc!
02:29:28 <knaccc> sgp_ thank you for brainstorming it. my hope is that this could spark thoughts and it helps find the eventual solution
02:32:45 <sgp_> knaccc: I support all crazy ideas :) Did you see this slightly less sweeping idea for public wallets like mining pools? https://github.com/monero-project/monero/issues/5222
02:45:54 <knaccc> oh i missed that, thanks, will read
02:56:02 <gingeropolous> i wonder if fake inputs would fix this
02:59:31 <gingeropolous> probably not. because E would see A making a transaction to B, and so E could see E's output being used as an input, and now there's just another random (false) input. B then uses A's output as an input + some false input and sends back to E.
03:01:20 <gingeropolous> still a loop, and E would know that A is actually spending that output... yeah, not much different than if A combined E's output with another of its inputs to make a transaction
03:02:51 <gingeropolous> ugh this nomenclature is so cumbersome
03:11:13 <moneromooo> Since the scheme is voluntary and people will normally use a subaddres per sender, the wallet could hash the subaddress *and* the sending wallet secret key. This makes the "locate donations to the EFF" impossible, while still allowing matching sends to the same subaddress by a blockchain observer.
03:11:20 * moneromooo goes back to sleep
10:19:40 <moneromooo> Also, if change gets random tags due to the issue sgp mentioned, then if you see grouped outputs, you now know they aren't change. So change would have to *sometimes* reuse a tag.
11:56:10 <sgp_> Change would hide among outputs sent to single-use stealth addresses though, so I wonder how much that matters in practice. It definitely could matter
12:43:04 <knaccc> moneromooo > the wallet could hash the subaddress *and* the sending wallet secret key < whoa, i think that might be a really good idea
12:44:05 <knaccc> because as you point out, it's supposed to be a subaddress-per-sender anyway
12:44:20 <knaccc> so there is no loss when limiting the bucket in that manner
12:44:38 <moneromooo> It definitely breaks fast scan and janus though.
12:44:56 <knaccc> true
12:46:36 <knaccc> i think the biggest showstopper at the moment could be what sgp_ came up with yesterday, which is that the wallet won't know if you're sending to the same merchant via different subaddresses, or to different merchants via different subaddresses
12:47:36 <knaccc> and it pains me to type this, but maybe the answer to that is to recommend subaddresses for normal people, and integrated addressses/integrated subaddresses for merchants
12:48:19 <knaccc> i can't believe i just typed that. this is messy.
12:49:04 <knaccc> and it feels a bit clunky to ask the user if an outgoing payment is to a merchant they've paid before
12:49:31 <knaccc> so the cleanest way to handle this is to consider the implications of having buckets that are used for all outgoing txs
12:49:45 <knaccc> and then have those buckets rotated in and out over time
12:50:08 <knaccc> and now the wallet doesn't need to know whether it's sending more outputs to the same entity or not
12:55:01 <knaccc> i think what i just wrote might work. it was already the case that merchants could have kept a note of all purchaser-change-addresses they observed during incoming transactions to the merchant, and then compared notes with other merchants to see if any of those change outputs were spent with other merchants
12:55:25 <knaccc> s/purchaser-change-addresses/purchaser-change-outputs
12:55:25 <monerobux> knaccc meant to say: i think what i just wrote might work. it was already the case that merchants could have kept a note of all purchaser-change-outputs they observed during incoming transactions to the merchant, and then compared notes with other merchants to see if any of those change outputs were spent with other merchants
13:30:13 <sgp_> just make a consensus rule so that exchanges need to pinky promise to share their data cleanly, but in a way no one else can :p
16:19:16 <sarang> What's the most clear way to share CLSAG timing data?
16:19:22 <sarang> Signatures only? Overall transaction estimates?
16:19:31 <sarang> Range proof batching? No batching?
16:23:26 <moneromooo> All of it :D But there was some verging on dishonest hyping of crazy speed improvements based on batching 64 txes or so recently, so should be careful how people use the data.
16:24:09 <sarang> I've presented batch results before
16:24:37 <moneromooo> Sure. But IIRC someone took the best number possible and threw it out to the masses.
16:24:52 <moneromooo> Having numbers for all interesting cases is good.
16:25:14 <sarang> Oh, was this for the BP improvement?
16:25:15 <moneromooo> I might be wrong. My memory's hazy here.
16:25:21 <moneromooo> Could be.
16:25:27 <sarang> I had put a few batch examples into the PR, but they were clearly marked as such
16:25:28 <moneromooo> Felt like shitcoin time.
16:26:06 <sarang> For the blog post, I can list timing for CLSAG+BP with no batching, and then note that during sync, the improvement is better because of batching
16:26:11 <sarang> That seems fair and accurate
16:26:21 <moneromooo> Sounds good to me.
16:26:26 <sarang> For a single 2-2 transaction, CLSAG+BP verification improvement is 10%
16:26:29 <sarang> just signature is 20%
16:26:42 <sarang> so you asymptotically approach something closer to 20% with higher batching
16:27:06 <moneromooo> Most people will be interested in the whole tx verification really. Which is... annoying to get in monero.
16:27:17 <sarang> Yeah
16:27:24 <sarang> I count signature verification, balance check, and range proof
16:27:27 <sarang> Which are the heavy hitters
16:27:50 <sarang> and I note that YMMV
16:28:32 <sarang> Oh even better, I can note that signature speedup is 20%, with at least 10% speedup for transactions as a whole
16:28:45 <sarang> That doesn't downplay CLSAG's improvements, but also gives some better real-world estimates
16:30:07 <sarang> Blog post draft: https://gist.github.com/SarangNoether/e7bc586d0efd89f74d29b62cc372eeb4
16:34:05 <dEBRUYNE> fwiw, I have typically used the 20% figure if someone asked about the relative improvements
16:34:33 <sarang> 20% is accurate for signatures along
16:34:34 <sarang> *alone
16:34:35 <moneromooo> Looks good. But uses ! for effect, which amused me.
16:34:51 <sarang> I'm pretty sparing with exclamation marks :)
16:34:56 <sarang> Never get to use them in formal papers
16:35:18 <sarang> dEBRUYNE: for signatures alone, space is ~50%, time is ~20%
16:35:28 <sarang> For overall 2-2 transactions, space is ~25%, time is ~10%
16:35:36 <sarang> time improves with range proof batching
16:36:12 <sarang> FWIW, I consider the size improvements to be the bigger deal
16:36:20 <sarang> timing improvements are a nice benefit
16:36:28 <sarang> (and depend on some optimizations that I did)
16:36:34 <dEBRUYNE> Ok, thanks
16:39:05 <sarang> moneromooo: now you have me questioning the excitement of my punctuation...
16:39:08 <sarang> bah
16:39:50 <moneromooo> Sorry. I guess it's fine for blog post.
16:40:03 <sarang> heh
16:41:10 <moneromooo> I've got an overflow of hate for hype and dishonest arguments of late and I may be overmatching, you can ignore me ^_^