PCA does not work?

Previous topic - Next topic

Bitcoinfan


http://www.augur.net/blog/building-a-better-lie-detector

Must say from quick read through it was a entertain analysis.  Easy to read as well.  Nice stuff augur.

psztorc

I just noticed it and responded a few times.

As usual, it is my personal feeling that Augur really doesn't "get" what all of these pieces are or how they are used.
Nullius In Verba

zack

more on hierarchical clustering:

The worry is that an attacker will put tiny votes in just the right places to cascade and give himself control of the outcome even though he has a minority of the reputation.
The reason the attacker can do this is because he knows the deterministic rule for how the votes are clustered.
So I think we should salt the clustering rule. Using a random number that the voters didn't know when they were voting.

It would still possible to take control with a minority, but it is probabilistically improbable.

psztorc

I don't think we should do that, because miners can cycle the blocks (even though they almost certainly won't) until they get a random results that over-tilts the algorithm in their favor. It just swaps one resource for another, and attackers can pick whichever resource is cheaper.
Nullius In Verba

joeykrug

#4
Quote from: zack on June 30, 2015, 02:19:43 PM
more on hierarchical clustering:

The worry is that an attacker will put tiny votes in just the right places to cascade and give himself control of the outcome even though he has a minority of the reputation.
The reason the attacker can do this is because he knows the deterministic rule for how the votes are clustered.
So I think we should salt the clustering rule. Using a random number that the voters didn't know when they were voting.

It would still possible to take control with a minority, but it is probabilistically improbable.

Another easy way to solve this is to just do something like:

``` # detect how far the "truths" are away from actual outcomes
    # then choose closer mode as final truth cluster
    if(L2dist(mode.meanVec, outcomes)<bestDist):
        bestDist = L2dist(mode.meanVec, outcomes)
        best = mode
        bestClusters = clusters
    if(L2dist(mode.meanVec,outcomes)>1.07 and times==1):
        possAltCluster = cluster(features,rep,2,threshold*3)
        return(possAltCluster)```

It's basically a second pass.

This seems to work well in all the test cases I've been able to come up with to throw at it as well as the one Vitalik posted.  If anyone has any ideas for a matrix that could get past this redundancy check, let me know and I'll test it. 

Edit: nvm on last edit, just had a typo in sims!