In search of the least viewed article on Wikipedia

@[email protected] · 1 year ago

In search of the least viewed article on Wikipedia

@over_clox · 1 year ago

Oddly enough, by posting this data publicly, those least viewed articles will end up getting a lot more views now.

QuinceDaPence · 1 year ago

@[email protected] · 1 year ago

Really enjoyed the read. Thanks for sharing. I’m surprised by the random page implementation.

Usually in a database each record has an integer primary key. The keys would be assigned sequentially as pages are created. Then the “random page” function could select a random integer between zero and the largest page index. If that index isn’t used (because the page was deleted), you could either try again with a new random number or then march up to the next non empty index.

@AbouBenAdhem · 1 year ago

Marching up to the next non-empty key would skew the distribution—pages preceded by more empty keys would show up more often under “random”.

@SheeEttin · edit-2 1 year ago

Fun fact, that concept is used in computer security exploits: https://en.wikipedia.org/wiki/NOP_slide

For choosing an article, it would be better to just pick a new random number.

Although there are probably more efficient ways to pick a random record out of a database. For example, by periodically reindexing, or by sorting extant records by random (if supported by the database).

@[email protected] · 1 year ago

Did you know one of the most translated articles on Wikipedia is none other than American actor Corbin Bleu?

https://www.insider.com/why-corbin-bleu-wikipedia-pages-2019-1

@DirkMcCallahan · 1 year ago

Very cool! I love stuff like this.