We have ordered and deployed 180 TB worth of enterprise storage to be prepared for upcoming price increases and shortages of hard disks. We are reading reports that warn of upcoming delivery delays due to disruptions in the supply chain. We tweeted a picture here of how 100 TB storage looks like in our backend.
This is how 100 TB look like at the backend of Intelligence X.
— Intelligence X (@_IntelligenceX) March 14, 2020
Czech Republic is restricting travel from and to the country, which means that our employees are not allowed to leave the country. Fortunately, our datacenter is in Prague and we continue to operate as usual.
In order to help local hospitals, we are buying locally produced face masks as well as gloves and cleaning equipment in order to donate to local hospitals. There is a great local website which lists suppliers of handmade face masks (local producers) and those who need them (hospitals, doctors offices): https://www.damerousky.cz/.
We have added a Google AdSense ID reverse lookup tab to our free OSINT tools: https://intelx.io/tools?tab=adsense
Google AdSense IDs start with “ca-pub-” and can be found in the HTML code of websites. Since website operators sometimes use the same AdSense code snippet (containing the same ID) across multiple websites, it makes it possible to find those related websites with a reverse lookup. Our newly added tab will redirect to 3rd party sites that perform the reverse lookups.
3 weeks ago, we started to index publicly available documents from Sci-Hub, which hosts 81 million documents equaling 70 TB of size.
Since then, we have indexed 17 million documents equaling 9 TB – or about 20%.
Our search indexer extracted so far 20 million selectors, with most of them being email addresses (48%), followed by URLs (26%) and domains (14%).
The United States Death Master File “contains information about persons who had Social Security numbers and whose deaths were reported to the Social Security Administration from 1962 to the present” (quote Wikipedia). The file itself costs $2,930.00 anually, but was published multiple times on the internet for free. For details read our blog post. We have published open source code that converts the file from its proprietary text format to regular CSV: https://github.com/IntelligenceX/DeathMasterFile2CSV
We are regularly (always unsuccessfully) under attack. We had 3 medium-sized DDoS attacks, many smaller ones, login bruteforce attacks, and regularly observe port scanners, vulnerability scanners and SQL injection attempts.
We took a closer look into one user who thought it was a good idea to first sign up, and then spam us with 31,866 HTTP requests in a short period of time. The full investigation reveals the attackers nickname on hacking forums and is published in this blog post.
Follow-us on Twitter for the latest updates and insight into our operations: https://twitter.com/_IntelligenceX
Kleissner Investments s.r.o., Na Strzi 1702/65, 14000 Prague, Czech Republic
If you don’t wish to receive this newsletter anymore, please click here to unsubscribe.
June 2021: New Usenet data category We added the new data category Usenet. It contains historical and current data from Usenet, which is “a worldwide distributed discussion system”. Today, Usenet is mostly used for piracy. This new category stores currently 209,469,453 selectors and is expected to grow substantially. Improved inline statistics We have improved the
Intelligence X supports Peernet – Founder’s Statement I am excited to announce Peernet, a decentralized network that allows sharing of data freely without censorship and restrictions. Here is the pitch deck: https://peernet.org/dl/Peernet%20Deck.pdf Peernet is making quick progress from its inception as I am finalizing the whitepaper and developing the core library. I would like to
February 2021: Launch of the European Internet Archive The European Internet Archive just launched! 🎉🥳 ➡ https://archive.eu/ 225 TLDs added to the list of web crawling We have added 225 top-level domains (TLDs) to the list of web crawling. Find the full list and how we are categorizing them in this blog post. Our dataset