Indexing new “whois” data

Published on January 9, 2020 by

We are constantly indexing the latest whois data for our whois category. “Whois” data contains information about domain ownership.

We recently indexed the whois data for October, November, and December 2019 which resulted in 21,168,724 selectors. We analyzed a small sample set (1 day, 2019-11-29) and these are the results:

75% of the data is whois protected, which means that the information about the owner (name, email address, postal address) is not available. Often, that data is simply replaced with “Redacted For Privacy” or other dummy data.

However, the other 25% is not whois protected. Here is a breakdown by registrant country:

Image

When looking at TLDs, there were interestingly no domains from some TLDs like DE and AT. This could be an indicator that the actual number of privacy protected domains might be higher than 75%.

Related articles

Newsletter 2021-02-20

Published on February 20, 2021 by

February 2021: Launch of the European Internet Archive The European Internet Archive just launched! 🎉🥳 ➡ https://archive.eu/ 225 TLDs added to the list of web crawling We have added 225 top-level domains (TLDs) to the list of web crawling. Find the full list and how we are categorizing them in this blog post. Our dataset


New Category “Bot Logs”

Published on February 10, 2021 by

We have added the new category “Bot Logs”. It contains data collected by and leaked from viruses such as Azorult. Such data is often sold on marketplaces such as the Genesis Market. We decided to index such data into this new category to help filtering out relevant results. You can find this new category in


Adding 225 TLDs to the list of web crawling

Published on January 28, 2021 by

We are excited to announce that we just added 225 top-level domains (TLDs) to the list of web crawling! Below is the full list. The domain count per TLD represents the domains registered according to DomainTools. We group multiple TLDs into “buckets” to make it manageable – you can select these buckets in the Advanced


Search the blog: