Public data dump download numbers seem inconsistent with actual

chester.burbidge's Avatar


07 Jan, 2018 11:38 AM

I'm trying to get the top 1000 downloaded gems for a personal project.

I've downloaded and restored the data from into a postgres database. When I join the versions to get the download counts with the command:

COPY (select, v.full_name, v.authors, d.count from versions v join rubygems r on v.rubygem_id = join gem_downloads d on = TO '/tmp/gem_stats.csv' DELIMITER ',' CSV HEADER;
and analyse and sort the results by most downloaded I get wildly different results to the page

Anyone know why this might be?

  1. Support Staff 1 Posted by sonalkr132 on 22 Apr, 2018 01:13 PM

    sonalkr132's Avatar

    Hi chester,

    Sorry about delay in our response. I am not sure how can we help you if we don't know your complete process. Page you mentioned used this method to get top gems.

Reply to this discussion

Internal reply

Formatting help / Preview (switch to plain text) No formatting (switch to Markdown)

Attaching KB article:


Attached Files

You can attach files up to 10MB

If you don't have an account yet, we need to confirm you're human and not a machine trying to post spam.

Keyboard shortcuts


? Show this help
ESC Blurs the current field

Comment Form

r Focus the comment reply box
^ + ↩ Submit the comment

You can use Command ⌘ instead of Control ^ on Mac

Recent Discussions

20 Oct, 2020 07:13 PM
20 Oct, 2020 11:11 AM
13 Oct, 2020 06:14 PM
13 Oct, 2020 04:27 AM
06 Oct, 2020 08:22 AM