We added support for US Social Security Numbers (SSNs)! You can immediately search for them here: https://intelx.io/?s=086-38-5955
Searching at Intelligence X works based on selectors (strong search terms). When you search for something, the system automatically detects suitable selectors and performs a search. Below is the list of supported selectors:
Detecting SSNs in text can be challenging: they are essentially just 9-digit numbers and thus prone to false positives. The basic format of an SSN is “AAA-GG-SSSS” (sometimes written without dashes), comprising an Area, Group and Serial Number. These parts formerly had meaning, but since the Social Security Administration introduced “randomization” in 2011, newly issued numbers are essentially random.
To prevent a high number of false-positive SSNs, Intelligence X uses various smart algorithms to determine whether a 9-digit number is actually an SSN or indicates something else.
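To give a feel for the problem, here is a minimal Python sketch of a first-pass filter. It is not the algorithm Intelligence X uses; the names `SSN_PATTERN`, `is_plausible_ssn` and `find_ssn_candidates` are made up for illustration. It relies only on the publicly documented rules that an SSN never has Area 000, 666 or 900-999, Group 00, or Serial 0000, and it assumes dashes are either all present or all absent.

```python
import re
from typing import List

# Hypothetical pattern: 3-2-4 digits, with dashes either all present or all
# absent (the backreference \2 repeats whatever the first separator was).
# The lookarounds reject candidates embedded in longer digit runs.
SSN_PATTERN = re.compile(r"(?<!\d)(\d{3})(-?)(\d{2})\2(\d{4})(?!\d)")


def is_plausible_ssn(area: str, group: str, serial: str) -> bool:
    """Apply the structural rules that hold for every issued SSN."""
    # Area 000, 666 and 900-999 are never assigned.
    if area in ("000", "666") or area >= "900":
        return False
    # Group 00 and Serial 0000 are never assigned.
    if group == "00" or serial == "0000":
        return False
    return True


def find_ssn_candidates(text: str) -> List[str]:
    """Return plausible SSNs found in text, normalized to AAA-GG-SSSS."""
    results = []
    for match in SSN_PATTERN.finditer(text):
        area, _separator, group, serial = match.groups()
        if is_plausible_ssn(area, group, serial):
            results.append(f"{area}-{group}-{serial}")
    return results


if __name__ == "__main__":
    print(find_ssn_candidates("Example: 086-38-5955 and 000-12-3456"))
```

A filter like this only removes numbers that cannot be SSNs. Plenty of unrelated 9-digit numbers still pass, so a real detector would presumably also need to look at the surrounding context.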
Additional historical and current information on SSNs is available in the blog post “Validating Social Security Numbers through Regular Expressions” and on the “Social Security Number Randomization” page of the Social Security Administration.