var request = require('request');
var unzip = require('unzip');
var csv2 = require('csv2');

// Download the zipped list and stream it through the unzip parser.
request.get('http://s3.amazonaws.com/alexa-static/top-1m.csv.zip')
  .pipe(unzip.Parse())
  .on('entry', function (entry) {
    // Each zip entry is a CSV file; parse it and log every row.
    entry.pipe(csv2()).on('data', console.log);
  })
;
Today it is at 763k entries. Last summer it started falling short of "one million". I am here again trying to figure out why.
We used Alexa in the past and still can't find anything on why it is so short of 1 million. Good paper on T1M rankings (PDF).
Alexa no longer provides that list for free.
You can download the list using their API.
The price is:
Alexa Top Sites API Requests (1 unit = 10 URLs returned) | $0.025 / unit
So at $0.0025 per URL, for 1 million domains you'd pay 0.0025 * 1000000 = $2500 😃😃😃
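For anyone who wants to check that arithmetic, here is a minimal sketch. The function name `topSitesCost` is made up for illustration, the price and unit size are simply the figures quoted in this comment (they may have changed), and billing partial units as whole units is an assumption.

```javascript
// Hypothetical helper: estimated cost of fetching `urlCount` URLs,
// using the pricing quoted above (1 unit = 10 URLs, $0.025 per unit).
function topSitesCost(urlCount, pricePerUnit = 0.025, urlsPerUnit = 10) {
  // Assumption: a partially used unit is billed as a whole unit.
  const units = Math.ceil(urlCount / urlsPerUnit);
  return units * pricePerUnit;
}

// 1,000,000 URLs -> 100,000 units
console.log('$' + topSitesCost(1000000).toFixed(2));
```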
💵 💲 Thanks for that info @vladimarius ... good to know it is still available. I imagine people will find other sources though with that price! Thanks again.
The Alexa list still works in 2021.
Just spoke with Amazon about this. There's no guarantee that the free list contains all 1 million, but it is still updated daily.
Hi,
How can I get, for example, only the top 10 into a text file instead of the whole list?
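One possible way to do this, as a rough sketch: once the CSV has been unzipped, keep only the first N lines and write the domains out. It assumes the file uses the `rank,domain` layout, one entry per line; `topDomains`, `writeTopDomains` and the file names are hypothetical, not part of the gist.

```javascript
const fs = require('fs');

// Return the first `n` domains from CSV text with one "rank,domain" pair per line.
function topDomains(csvText, n) {
  return csvText
    .split(/\r?\n/)                          // split into lines
    .filter((line) => line.trim() !== '')    // skip blank lines
    .slice(0, n)                             // keep only the first n rows
    .map((line) => line.slice(line.indexOf(',') + 1).trim()); // drop the rank column
}

// Read an already unzipped CSV and write the top `n` domains, one per line.
function writeTopDomains(csvPath, outPath, n) {
  const csvText = fs.readFileSync(csvPath, 'utf8');
  fs.writeFileSync(outPath, topDomains(csvText, n).join('\n') + '\n');
}

// Example call (file names are placeholders):
// writeTopDomains('top-1m.csv', 'top10.txt', 10);
```

This reads the whole file into memory, which is fine for a list of this size; a streaming version could stop reading after the first N rows.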
the file now ends at 427k
CSV file is working again! Nice!
The data is not exactly up to date; I would say it lags by about 2 months. I have a site that currently ranks around 67,000, and in the list it is in the 78,000s.
Also, how can I get, for example, only the top 10 into a text file instead of the whole list?
425k for 2021.10.11
630779 Dec 8th 2021
Also, see https://support.alexa.com/hc/en-us/articles/4410503838999
We will be retiring Alexa.com on May 1, 2022
https://support.alexa.com/hc/en-us/articles/4410503838999
Note: this is your last chance to back things up.
With the Alexa top 1 million CSV/ZIP going away shortly, you can use https://statvoo.com/dl/top-1million-sites.csv.zip instead, which is linked from https://statvoo.com/top/ranked and provides a list of the top 1 million websites (updated daily).
Thanks @ao, that's good to know! :)
great
Can confirm http://s3.amazonaws.com/alexa-static/top-1m.csv.zip still works for me, with 1M sites (as of May 11th 2022). I think the actual resources will be gone by December 2022 though.
Hello,
does anyone know how to get the top 1000 for a specific country too?
I am looking for the Austrian and German top 1000 lists. Can anybody help me out with a download link?
@ciscospirit I don't know any off the top of my head, but perhaps do a search and see what you can find.
Hi everyone, I just noticed this site on a fork of this gist, and it also seems to be kept up to date:
- Fork : https://gist.github.com/aowongster/a69c84b66c74ca037e7094bed61e48b0
- Majestic Million : https://majestic.com/reports/majestic-million
- Download : https://downloads.majesticseo.com/majestic_million.csv
I don't know if it's useful to anyone, but there we go. :)
Hey everyone, when I download http://s3-us-west-1.amazonaws.com/umbrella-static/top-1m.csv.zip , the CSV has ".deprecated" as its file extension. Is this it? Is it done?
@kostasmaneadis Yes, it's no more.
-----------------------------------------------------------------
Notice: This file is deprecated and is not being updated anymore.
This file was last updated on February 1, 2023.
This file will not be available from
http://s3.amazonaws.com/alexa-static/top-1m.csv.zip after
July 31, 2023.
-----------------------------------------------------------------
https://radar.cloudflare.com/domains
top 1000000 unordered 🤢
I am in trouble with my new domain securelines.net to install WordPress,
I get access denied when accessing http://s3.amazonaws.com/alexa-static/top-1m.csv.zip
Try this one: http://s3-us-west-1.amazonaws.com/umbrella-static/top-1m.csv.zip
That is a link to the Cisco Umbrella popularity list. Luckily, archive.org has archived the Alexa zip: https://web.archive.org/web/20230401000000*/https://s3.amazonaws.com/alexa-static/top-1m.csv.zip
You can check our website for this notebookdepo
You can download it from here: https://www.kaggle.com/datasets/cheedcheed/top1m
As of today, the Alexa "one million" contains 547855 entries. Very strange.