
Topic: Sockpuppet-Detection Algorithms - page 2. (Read 2029 times)

legendary
Activity: 1795
Merit: 1208
This is not OK.
October 29, 2012, 07:53:55 PM
#4
Language is pretty easy.

Form histograms of:
Syllables per word
Words per sentence
Sentences per paragraph
Vocabulary
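
A rough sketch of those histograms in Python, in case it helps. The function names are made up for illustration, and the syllable counter is only a vowel-run heuristic (it will overcount words like "like"), so treat the numbers as approximate:

```python
import re
from collections import Counter

def count_syllables(word):
    # Rough heuristic (an assumption, not a dictionary lookup): one syllable
    # per run of vowels, and at least one syllable per word.
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def style_histograms(text):
    """Build the four histograms for one user's combined post text."""
    hist = {
        "syllables_per_word": Counter(),
        "words_per_sentence": Counter(),
        "sentences_per_paragraph": Counter(),
        "vocabulary": Counter(),
    }
    # Paragraphs are assumed to be separated by blank lines.
    for paragraph in re.split(r"\n\s*\n", text):
        sentence_count = 0
        for sentence in re.split(r"[.!?]+", paragraph):
            words = re.findall(r"[A-Za-z']+", sentence)
            if not words:
                continue  # skip empty fragments left over from the split
            sentence_count += 1
            hist["words_per_sentence"][len(words)] += 1
            for word in words:
                hist["syllables_per_word"][count_syllables(word)] += 1
                hist["vocabulary"][word.lower()] += 1
        if sentence_count:
            hist["sentences_per_paragraph"][sentence_count] += 1
    return hist
```

Run it over everything each account has posted, then compare the resulting histograms between accounts. How you compare them (overlap, distance, and so on) is a separate choice.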
sr. member
Activity: 266
Merit: 250
October 29, 2012, 07:44:54 PM
#3
I guess many accounts will be flagged as sockpuppets of total strangers. With more registered and active users, there will be more false positives.

Agreed, but if it were score-based, an automated system could at least rank users from 1 (very unlikely to be an alt) to 100 (highly likely to be an alt), and then a human could keep an eye on the higher-scoring accounts.

I'm sure that absolute certainty would be impossible, but I'm still interested in what metrics could be used to come up with such a score. The pursuit of accuracy through refining the algorithms is rather intriguing to me.
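
To make the 1-to-100 idea concrete, here is a minimal sketch. It assumes each metric has already been reduced to a similarity between 0.0 and 1.0 for a pair of accounts. The metric names, the weights, and the review threshold of 70 are all placeholders I picked for illustration, not tuned values:

```python
def alt_score(similarities, weights):
    """Combine per-metric similarities (each 0.0 to 1.0) into a 1-100 score."""
    total_weight = sum(weights[name] for name in similarities)
    if total_weight == 0:
        return 1  # no usable evidence, so report the lowest score
    weighted = sum(similarities[name] * weights[name]
                   for name in similarities) / total_weight
    # Map the 0.0-1.0 weighted average onto the 1-100 scale.
    return max(1, min(100, round(1 + 99 * weighted)))

def rank_for_review(pair_similarities, weights, threshold=70):
    """Return (score, pair) tuples at or above the threshold, highest first."""
    scored = [(alt_score(sims, weights), pair)
              for pair, sims in pair_similarities.items()]
    return sorted((item for item in scored if item[0] >= threshold),
                  reverse=True)

# Hypothetical weights: a shared IP counts three times as much as vocabulary.
weights = {"ip": 3.0, "vocabulary": 1.0}
pairs = {
    ("alice", "bob"): {"ip": 1.0, "vocabulary": 0.0},
    ("carol", "dave"): {"ip": 0.0, "vocabulary": 1.0},
}
review_queue = rank_for_review(pairs, weights)
```

The point of the threshold is just to keep the human review list short. The score itself is only a ranking aid, not proof of anything.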
legendary
Activity: 1512
Merit: 1049
Death to enemies!
October 29, 2012, 07:08:16 PM
#2
I guess many accounts will be flagged as sockpuppets of total strangers. With more registered and active users, there will be more false positives.
sr. member
Activity: 266
Merit: 250
October 29, 2012, 06:49:38 PM
#1
I'm going to be helping someone set up a forum, but we haven't decided on what software yet.

Anyway, it got me curious about how sockpuppets and alt accounts can be detected, with varying degrees of accuracy.

What ingeniously-complicated algorithms already exist to do this? And what could be coded better?

Some obvious metrics to consider:

  • IP address
  • Language
  • Grammar
  • Vocabulary
  • Profile preferences, e.g. timezone
  • Regular login and post days/times
  • Use of smileys or images
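
As a rough sketch of how two of these could be turned into numbers, here is one possible approach: Jaccard overlap for the sets of IP addresses two accounts have used, and cosine similarity between hour-of-week activity profiles. The function names are made up, and timestamps are assumed to already be in one common timezone:

```python
import math
from collections import Counter
from datetime import datetime

def jaccard(a, b):
    """Share of items two accounts have in common, e.g. IP addresses."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def hour_of_week_profile(timestamps):
    """Count posts per hour of the week (0 = Monday 00:00, 167 = Sunday 23:00)."""
    return Counter(ts.weekday() * 24 + ts.hour for ts in timestamps)

def cosine(p, q):
    """Cosine similarity between two activity profiles (0.0 if either is empty)."""
    dot = sum(p[k] * q[k] for k in p.keys() & q.keys())
    norm = (math.sqrt(sum(v * v for v in p.values()))
            * math.sqrt(sum(v * v for v in q.values())))
    return dot / norm if norm else 0.0

# Example addresses come from the reserved documentation ranges.
ip_overlap = jaccard({"192.0.2.1", "198.51.100.7"}, {"192.0.2.1"})
evening = hour_of_week_profile([datetime(2012, 10, 29, 19, 0),
                                datetime(2012, 10, 29, 19, 30)])
morning = hour_of_week_profile([datetime(2012, 10, 30, 8, 0)])
```

Both functions return a value between 0.0 and 1.0, so they can be fed into whatever scoring scheme gets chosen later.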

What else?

Is software to do this built into the major forum scripts, or is this kind of thing handled separately by mods?