Skip to content

Ricardo

My feedback

1 result found

  1. 6 votes
    Vote

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
    You have left! (?) (thinking…)
    How important is this to you?

    We're glad you're here

    Please sign in to leave feedback

    Signed in as (Sign out)
    An error occurred while saving the comment
    Ricardo commented  · 

    Laplace was once confronted with the question:
    "How to estimate the probability that the sun will rise tomorrow?"
    Even though the sun has rised for as long as we can remember, we can't infer that the probability of the sun rising tomorrow is 100%.
    This problem is know as "Sunrise Problem", and the solution is exactly the formula: (k+1)/(k+2), which is a specific case of the formula I recommended xhamster starts using.
    The most general formula form is known as "Additive Smoothing", or Laplace Smoothing, and is supposed to remove sampling error when the amount of data is small.
    Estimating video ratings wrongly is very damaging to the xhamster community, since videos with 2-3 likes and no dislikes overshadow videos with a good like/dislike ratio.
    Since xhamster has a huge collection of videos, which implies that a lot of them will have less like/dislike data input, it is important to give the best estimator given the limited data.

    An error occurred while saving the comment
    Ricardo commented  · 

    I did a python code to experiment the formula.
    I selected 1000 random videos (videos from number 7169000 to 7170000)
    The results of the scores are in the pastebin:

    http://pastebin.com/YTj5qs2c

    ps: If the video seems to have an inconsistent rating it is because it has inconsistently high number of likes, the formula just gives the best rating estimate given likes/dislikes data.

    Ricardo shared this idea  · 

Feedback and Knowledge Base