Native Support for Excel Sheets and PowerPoint Presentations 🗃️

Published on May 1, 2019 by

We just added native support to Intelligence X for the following data formats:

  1. Excel Sheets (.XLS, .XLSX, .ODT)
  2. PowerPoint Presentations (.PPTX)

Native support means end-to-end support. This ranges from indexing and crawling files of various data sources, to processing them internally and presenting them to the end-user on the frontend intelx.io. Indexing is the process of taking a file, reading it, and extracting any text thus making it searchable.

intelx.io shows text preview in the results and supports inline view. This means that it immediately shows the text of a document in a detailed view (when you click on a result) without forcing the user to leave the website or download the file locally.

Also a security feature

Inline view of office documents is a convenient feature, but also has an important security aspect: if the end-user downloads and opens unknown office documents (especially from the darknet), there is a risk of malicious embedded VBA macros and other exploits.

Intelligence X now natively supports all major office formats: Word, Excel, PowerPoint, and PDF.

Before vs After

Before, a PowerPoint file was displayed in the detailed view as “data salad” 🥗:

Now, you can see the text of the presentation (both in the preview and detailed view):

Related articles

List of buckets

Published on May 5, 2022 by

At Intelligence X we categorize data sources into buckets. Buckets can be used as filters and to broadly identify the source of individual search results. For example, the bucket “Darknet Tor” indicates the result origins from some a Tor hidden service (.onion domain) and was collected by our Tor crawler. Buckets have human readable names


Adding support for new top-level domains

Published on April 6, 2022 by

We just added support for an additional 152 top-level domains (TLDs), increasing the support to 511 TLDs in total. Support means that you can search for those domains across intelx.io and APIs, and internally that our backend supports processing them. While you can start searching for them immediately, it will take some time until our


A word of #OPSEC @theguardian

Published on March 7, 2022 by

Earlier today at 11:24 The Guardian Journalist Shaun Walker posted the security procedure and the security token used to pass makeshift checkpoints in Ukraine related to the Russian Ukrainian war: This is a reminder to journalists – and the public – to take OPSEC (operations security) seriously and not endanger people on the ground. Posting


Search the blog: