Difference between revisions of "Video statistics database"

From Tournesol
Jump to navigation Jump to search
imported>Le science4all
(Created page with "This database is a derived database, which aims to respond efficiently to queries on Tournesol. A video's statistics is tuple '''< video_id, reliability_score, important_scor...")
 
imported>Le science4all
(Added percentiles)
 
Line 1: Line 1:
 
This database is a derived database, which aims to respond efficiently to queries on Tournesol.
 
This database is a derived database, which aims to respond efficiently to queries on Tournesol.
   
A video's statistics is tuple '''< video_id, reliability_score, important_score, engaging_score, habit_score, pedagogical_score, layman_score, diversity_score, backfire_score, entertaining_score, n_contributors, n_ratings, pareto_optimal, public_contributors >'''.
+
A video's statistics is tuple '''< video_id, reliability_score, reliability_percentile, important_score, important_percentile, engaging_score, engaging_percentile, habit_score, habit_percentile, pedagogical_score, pedagogical_percentile, layman_score, layman_percentile, diversity_score, diversity_percentile, backfire_score, backfire_percentile, entertaining_score, entertaining_percentile, n_contributors, n_ratings, pareto_optimal, public_contributors >'''.
   
 
Currently, the list of '''public_contributors''' contains a random set of 20 of the public contributors to the video.
 
Currently, the list of '''public_contributors''' contains a random set of 20 of the public contributors to the video.
Line 7: Line 7:
 
== Daily update ==
 
== Daily update ==
 
Every day, the [[Tournesol learning algorithm]] computes the [[Tournesol scores]] of all videos, and update the database with the results.
 
Every day, the [[Tournesol learning algorithm]] computes the [[Tournesol scores]] of all videos, and update the database with the results.
  +
  +
Once all the scores are registered in the database, for each [[Quality criteria|quality criterion]], we rank the videos according to the criterion, and we convert the rank of a video into its percentile ranking. This percentile is then saved in the video statistics database.
   
 
== Hourly update ==
 
== Hourly update ==

Latest revision as of 17:51, 1 May 2021

This database is a derived database, which aims to respond efficiently to queries on Tournesol.

A video's statistics is tuple < video_id, reliability_score, reliability_percentile, important_score, important_percentile, engaging_score, engaging_percentile, habit_score, habit_percentile, pedagogical_score, pedagogical_percentile, layman_score, layman_percentile, diversity_score, diversity_percentile, backfire_score, backfire_percentile, entertaining_score, entertaining_percentile, n_contributors, n_ratings, pareto_optimal, public_contributors >.

Currently, the list of public_contributors contains a random set of 20 of the public contributors to the video.

Daily update

Every day, the Tournesol learning algorithm computes the Tournesol scores of all videos, and update the database with the results.

Once all the scores are registered in the database, for each quality criterion, we rank the videos according to the criterion, and we convert the rank of a video into its percentile ranking. This percentile is then saved in the video statistics database.

Hourly update

Every hour, the entries n_contributors, n_ratings, pareto_optimal and public_contributors are updated.

To do so, we go through all the video ratings

Hook

Upon new rating.

Upon contributor