Newsletter 2021-01-23

Published on January 23, 2021 by

January 2021: Capitol Hill Riots Archive, New Features

Our dataset contains now 30 billion records and grows by +3 billion every month. Last year has been incredible for us. We are growing by any metric – all organically. Searches are up 113% Q-on-Q and our user base is growing sustainably at 18% month-on-month.

Capitol Hill Riots

We have sprung into action on January 6, 2021 when Twitter and Facebook started to delete videos of the Capitol Hill riots. It is important evidence for law enforcement & the public. Use the “Tree View” tab to see all pictures and videos! Link: https://intelx.io/?did=814b39fe-ad98-45a1-9f44-0346bc9f9b94

You can find the public FTP details (containing 200 GB of media) in this tweet. Our efforts on this has been mentioned in the news:

Excel like Viewer

We launched an embedded Excel-like viewer! It will automatically appear for all CSV and XLSX files. This new feature was tricky because of different delimiters, formats, and many edge cases to consider. The text-only tab is still available to show the data as plaintext.

A good example is an archive of former President Trump’s tweets: https://intelx.io/?did=5bcdc306-7d01-4b3a-8899-ea09cad8c12f

Native Video and Audio Players

Another new feature we released is native video and audio players. You can try out video streaming by the link on the top regarding the Capitol Hill Riots.

Here is an audio example: North Korea music

📝 Trivia

  • We ran into an exotic Windows issue: one of our key-value databases maxed out the NTFS file fragment count. While the database is stored as 1 file on disk, NTFS stores it internally as millions of little fragments, eventually hitting an internal limit and preventing the file from growing.
  • We are upgrading our peer uplink to 10 Gbit!

📰 Darknet marketplaces and hacks

  • ViceLeaker which allegedly has ties to the Iranian government got internal documents exposed on a Telegram channel. It is indexed here. Use the Tree View to switch to the Telegram messages.
  • Marketplace “DarkMarket” has been seized by Europol and friends. Our first archived copy (link 🔒) dates to July 2019. There is not a lot of information crawled, but at least a few pointers including the onion domains used over time. Read Europol’s press release.
  • There is a follow-up on the WeLeakInfo takedown. Police apparently arrested 21 customers according to this news report.

Kleissner Investments s.r.o., Na Strzi 1702/65, 14000 Prague, Czech Republic

If you don’t wish to receive this newsletter anymore, please click here to unsubscribe.

Related articles

Newsletter 2021-02-20

Published on February 20, 2021 by

February 2021: Launch of the European Internet Archive The European Internet Archive just launched! 🎉🥳 ➡ https://archive.eu/ 225 TLDs added to the list of web crawling We have added 225 top-level domains (TLDs) to the list of web crawling. Find the full list and how we are categorizing them in this blog post. Our dataset


New Category “Bot Logs”

Published on February 10, 2021 by

We have added the new category “Bot Logs”. It contains data collected by and leaked from viruses such as Azorult. Such data is often sold on marketplaces such as the Genesis Market. We decided to index such data into this new category to help filtering out relevant results. You can find this new category in


Adding 225 TLDs to the list of web crawling

Published on January 28, 2021 by

We are excited to announce that we just added 225 top-level domains (TLDs) to the list of web crawling! Below is the full list. The domain count per TLD represents the domains registered according to DomainTools. We group multiple TLDs into “buckets” to make it manageable – you can select these buckets in the Advanced


Search the blog: