Airbnb NYC the big apple analysis

  • How to create a new metric to correlate the overall score and the number of reviews?
  • This metric will be better to predict the overall score?
  • Is the features of the house enough to determinate the overall score?

The Dataset

Mean and variance of House discrete variables
Mean and variance of reviews

The Review Scores Rating

Review Scores Rating Histogram
Formula to weight the score with number of reviews
Normalized Score Rating
  1. The new metric score with the Number of Reviews on the dataset. As talked before the new score were created with a new formula where used the number of reviews, in this way its normally biased. The result were a r2 equal than 0.806478 and the mean square error were 20.8821823.
  2. The new metric score without the Number of Reviews. Here the result of r2 decreased to 0.312825 and the mean square error were 71.982027.
  3. The metric The “Review Scores Rating” with the Number of Reviews, the result were a r2 equal than 0.145195 and the mean square error were 59.795813.
  4. At last, we used the “Review Scores Rating” without the Number of Reviews, the result were a r2 equal than 0.112494 and the mean square error were 61.477236.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store