Search More Than 300 Different File Formats Easily and Efficiently

 

Finding information made easy: The company’s internal search engine searchit allows you to search through almost any file format in millions of files on local file servers, mail servers and archives.

Search, find, save with searchit

Efficient search in all Microsoft Office file formats, Outlook archives in PST format, PDF files, TXT files, TIFF/TIF files, PNG files, AutoCAD and DWG formats, ZIP, RAR and 7z archives, XML formats and many more!

How are files searched in searchit ?

What are MIME types?

Supported categories of file formats

Full list of searchable MIME types

Logo searchit

The search of almost any file format is one of the greatest strengths of the enterprise search solution searchit. In contrast to the search in File Explorer, the file content including metadata of all indexed files can be searched, even in formats for scans, images or CAD files. Find out exactly how searchit makes the unsearchable searchable and scroll through the full list of supported file formats.

How are files searched in searchit?

As an enterprise search solution, searchit enables comprehensive file searches through intelligent indexing and categorization. Users can quickly and efficiently search for content in various formats such as documents, emails, presentations, and more to find relevant information and increase productivity.

What are MIME types?

 

MIME types (Multipurpose Internet Mail Extensions) are labels that define the media type of files on the Internet. They enable the correct interpretation and processing of content by telling the servers and browsers the file type.

Supported categories of file formats

Every day, the lawyer rummages through e-mail archives, the forewoman through CAD files – the most frequently used file format depends on both the industry and the job. searchit’sever-growing number of parsers makes it possible to search in almost all file categories.

HTML (Hypertext Markup Language)

The lingua franca of the web – Almost every HTML format found on the web is supported with the searchit search function:

  • Valid XHTML code and XML
  • Microsoft Office document formats
  • OpenDocument
  • iWorks
  • Portable Document Formats
  • EPUB
  • RTF
  • Compression and packaging formats
  • Audio, image and video formats
  • And other scientific, language-processing, object-recognizing, and database-based formats

XML and Derived Formats

The Extensible Markup Language (XML) format is used both for hierarchically structured data and for a platform-independent exchange of data between computer systems. XML languages supported by searchit include:

  • XHTML (Extensible Hypertext Markup Language)
  • OOXML (Office Open XML)
  • ODF (Open Document Format)

Microsoft Office document formats

Text and metadata extraction from Microsoft Office and some related applications can be searched in the following formats:

  • OLE 2 Compound Document Format
  • OOXML (Office Open XML)
  • Temporary Office Lock Files (Owner Files)

OpenDocument Format

searchit searches the OpenDocument format (ODF) for:

  • All files in the OpenOffice.org office suite
  • Older files in OpenOffice 1.0 format, the predecessor of ODF

iWorks Document Formats

Both text and metadata are supported in iWorks, including:

  • Numbers
  • Pages
  • Keynotes

WordPerfect document formats

searchit searches all formats related to:

  • Corel WordPerfect Office Suite
  • WordPerfect WP6+ Files
  • QuattroPro QPW v9+ Files

Portable Document Format

Digitally created and non-searchable scans are made searchable in searchit using the ORC functionality. More about PDF search with searchit.

Electronic Publication Format

searchit searches eBooks, digital books, and papers in the following formats:

  • Electronic Publication Format (EPUB)
  • Fiction Book Publishing Format

Rich Text Format

Full search functionality for documents in Rich Text Format (RTF).

Compression and packaging formats

Enterprise search software searchit enables you to search even in compressed data. Various compression and packaging formats are supported:

  • Tar
  • ARE
  • ARJ
  • CPIO
  • Dump
  • Zip
  • 7Zip
  • Gzip
  • BZip2
  • XZ
  • LZMA
  • Z
  • Pack200
  • RARE
  • AppleSingle and
  • AppleDouble Files

Text Formats

Extracting text content from plain text files seems like an easy task until you start thinking about all the possible character encodings. searchit is able to automatically recognize the character encoding of a text document .

Feed and syndication formats

Updates of websites, podcasts or news articles – searchit supports syndication formats that keep users up to date:

  • RSS Feed
  • Atom Feed
  • IPTC ANPA News Wire Feed Format

Help Formats

searchit searches the Microsoft Help files:

  • CHM Help Format ( called Compiled HTML Help, also Compressed HTML Help or Compiled Help Module(s))

Video Formats

Video recordings in the most common formats are searched with serachit with a focus on metadata:

  • Flash Video Format
  • MP4 family of video formats including MP4, Quicktime, 3GPP and many more
  • Ogg family of video formats

Java Class Files and Archives

Class names and method signatures are searched in searchit in the following formats:

  • Java Class Files
  • jar Archives

Source Code

searchit searches source code for content and metadata itself:

  • Java
  • C
  • C++ Groovy
  • and more!

Email formats

Searchit makes it possible to search e-mails and even e-mail archives in the following formats:

  • PST email format, used in Microsoft Outlook archives
  • MSG e-mail format, used for individually downloaded Outlook e-mails
  • Microsoft TNEF (Transport Neutral Encoding Format, also known as Winmail.dat), used by most Microsoft email clients for email attachments
  • mbox format, widely used in email archives and Unix-like mailboxes
  • RFC 822 format: Used by many email clients in archives and exports

CAD formats

searchit searches data from files in DWG CAD format.

Font Formats

Search for metadata even in font files – searchit supports:

  • TrueType font format
  • Adobe Font Metrics Files

Scientific formats

Many of the programs that are specifically used in science can be searched for metadata and content with searchit :

  • GCMD Directory Interchange Format (DIF)
  • GDAL
  • ISO 19139 file format for geographic information
  • Grib
  • HDF
  • Family of file formats ISA-Tab (ISA Tools)
  • Netcdf
  • Matlab

Executable programs and libraries

Searchit extracts and searches metadata information about platforms, architectures, and types from a range of executable formats and libraries:

  • Windows Executables
  • Linux/BSD programs and libraries
  • and many more!

Crypto Formats

Using secure access controls and special parsers, searchit even searches encrypted messages:

  • PKCS7-signed messages, without information from the outer PKCS7 wrapper
  • Metadata from Time Stamped Data Envelope (TSD) Files
  • Saved Content from the TSD Wrapper

Database formats

Several types of databases can be searched quickly and easily in searchit :

  • SQLite3 files
  • Microsoft Access database files
  • dBase files (dbf) including dBase, FoxBASE, FoxPRO, and shapefile format from ESRI

Natural Language Processing

Artificial intelligence is used in searchit , for example, by means of natural language processing and named entity recognition frameworks. This enables:

  • Classification of the mood and emotional tone of a document
  • Extract metadata from full-text journal publications.

Image and video object recognition

Several object detection frameworks are supported to analyze the content of images and videos. searchit instances are trained with large training datasets for specific areas of application of customers.

Know what's in it - regardless of the file format

Thanks to search<b>it,</b> you can search in hundreds of file formats at the same time on a central platform

Full list of searchable MIME types

Over three hundred formats for text files, images and scans, PDFs and much more are supported in searchit :

AppleSingleFileParse

  • application/applefile

PListParser

  • application/x-plist
  • application/x-bplist-itunes
  • application/x-bplist
  • application/x-bplist-memgraph
  • application/x-bplist-webarchive

ClassParser

  • application/java-vm

AudioParser

  • audio/vnd.wave
  • Audio/X-WAV
  • audio/basic
  • Audio/X-AIFF

MidiParser

  • application/x-midi
  • audio/midi

SourceCodeParser

  • text/x-c++src
  • text/x-groovy
  • text/x-java-source

Pkcs7Parser

  • application/pkcs7-signature
  • application/pkcs7-mime

TSDParser

  • application/timestamped-data

TextAndCSVParser

  • text/csv
  • Text/TSV
  • text/plain

DBFParser

  • application/x-dbf

DGN8Parser

  • image/vnd.dgn; version=8

DIFParser

  • application/dif+xml

DWGParser

EpubParser

  • application/x-ibooks+zip
  • application/epub+zip

ExecutableParser

  • application/x-msdownload
  • application/x-sharedlib
  • application/x-elf
  • application/x-object
  • application/x-executable
  • application/x-coredump

ExternalParser

  • Video/AVI
  • Video/MPEG
  • Video/X-MSvideo
  • Video/MP4

FeedParser

  • application/atom+xml
  • application/rss+xml

AdobeFontMetricParser

  • application/x-font-adobe-metric

TrueTypeParser

  • application/x-font-ttf

HtmlParser

  • text/html
  • application/vnd.wap.xhtml+xml
  • application/x-asp
  • application/xhtml+xml

HttpParser

  • application/x-httpresponse

HwpV5Parser

  • application/x-hwp-v5

BPGParser

  • image/bpg
  • image/x-bpg

HeifParser

  • image/heic-sequence
  • image/heif
  • image/heic
  • image/heif-sequence

ICNSParser

  • image/icns

ImageParser

  • image/png
  • image/vnd.wap.wbmp
  • image/x-jbig2
  • image/bmp
  • image/x-xcf
  • image/gif
  • image/x-icon
  • image/x-ms-bmp

JXLParser

  • image/jxl

JpegParser

  • image/jpeg

PSDParser

  • image/vnd.adobe.photoshop

TiffParser

WebPParser

  • image/webp

IDMLParser

  • application/vnd.adobe.indesign-idml-package

IptcAnpaParser

  • text/vnd.iptc.anpa

IWorkPackageParser

  • application/vnd.apple.keynote
  • application/vnd.apple.iwork
  • application/vnd.apple.numbers
  • application/vnd.apple.pages

IWork13PackageParser

  • application/vnd.apple.numbers.13
  • application/vnd.apple.unknown.13
  • application/vnd.apple.pages.13
  • application/vnd.apple.keynote.13

IWork18PackageParser

  • application/vnd.apple.pages.18
  • application/vnd.apple.keynote.18
  • application/vnd.apple.numbers.18

RFC822Parser

  • message/rfc822

MatParser

  • application/x-matlab-data

MboxParser

  • application/mbox

EMFParser

  • image/emf

JackcessParser

  • application/x-msaccess

MSOwnerFileParser

OfficeParser

OldExcelParser

TNEFParser

  • application/vnd.ms-tnef
  • application/x-tnef
  • application/ms-tnef

WMFParser

  • image/wmf

ActiveMimeParser

  • application/x-activemime

ChmParser

  • application/vnd.ms-htmlhelp
  • application/x-chm
  • application/chm

OneNoteParser

  • application/onenote; format=one

OOXMLParser

  • application/vnd.ms-powerpoint.template.macroenabled.12
  • application/vnd.ms-excel.addin.macroenabled.12
  • application/vnd.openxmlformats-officedocument.wordprocessingml.template
  • application/vnd.ms-excel.sheet.binary.macroenabled.12
  • application/vnd.openxmlformats-officedocument.wordprocessingml.document
  • application/vnd.ms-powerpoint.slide.macroenabled.12
  • application/vnd.ms-visio.drawing
  • application/vnd.ms-powerpoint.slideshow.macroenabled.12
  • application/vnd.ms-powerpoint.presentation.macroenabled.12
  • application/vnd.openxmlformats-officedocument.presentationml.slide
  • application/vnd.ms-excel.sheet.macroenabled.12
  • application/vnd.ms-word.template.macroenabled.12
  • application/vnd.ms-word.document.macroenabled.12
  • application/vnd.ms-powerpoint.addin.macroenabled.12
  • application/vnd.openxmlformats-officedocument.spreadsheetml.template
  • application/vnd.ms-xpsdocument
  • application/vnd.ms-visio.drawing.macroenabled.12
  • application/vnd.ms-visio.template.macroenabled.12
  • model/vnd.dwfx+xps
  • application/vnd.openxmlformats-officedocument.presentationml.template
  • application/vnd.openxmlformats-officedocument.presentationml.presentation
  • application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
  • application/vnd.ms-visio.stencil
  • application/vnd.ms-visio.template
  • application/vnd.openxmlformats-officedocument.presentationml.slideshow
  • application/vnd.ms-visio.stencil.macroenabled.12
  • application/vnd.ms-excel.template.macroenabled.12

Word2006MLParser

shh. OutlookPSTParser

Rtf. RTFParser

  • application/rtf

xml.SpreadsheetMLParser

  • application/vnd.ms-spreadsheetml

xml.WordMLParser

  • application/vnd.ms-wordml

MIFParser

  • application/x-mif
  • application/vnd.mif
  • application/x-maker

Mp3Parser

  • Audio/MPEG

MP4Parser

  • Video/X-M4V
  • application/mp4
  • Video/3GPP
  • Video/3GPP2
  • video/quicktime
  • Audio/MP4
  • Video/MP4

TesseractOCRParser

  • image/ocr-x-portable-pixmap
  • image/ocr-jpx
  • image/x-portable-pixmap
  • image/OCR-JPEG
  • image/OCR-JP2
  • image/jpx
  • image/ocr-png
  • image/OCR-TIFF
  • image/ocr-gif
  • image/ocr-bmp
  • image/jp2

FlatOpenDocumentParser

  • application/vnd.oasis.opendocument.tika.flat.document
  • application/vnd.oasis.opendocument.flat.presentation
  • application/vnd.oasis.opendocument.flat.spreadsheet
  • application/vnd.oasis.opendocument.flat.text

OpenDocumentParser

  • application/x-vnd.oasis.opendocument.presentation
  • application/vnd.oasis.opendocument.chart
  • application/x-vnd.oasis.opendocument.text-web
  • application/x-vnd.oasis.opendocument.image
  • application/vnd.oasis.opendocument.graphics-template
  • application/vnd.oasis.opendocument.text-web
  • application/x-vnd.oasis.opendocument.spreadsheet-template
  • application/vnd.oasis.opendocument.spreadsheet-template
  • application/vnd.sun.xml.writer
  • application/x-vnd.oasis.opendocument.graphics-template
  • application/vnd.oasis.opendocument.graphics
  • application/vnd.oasis.opendocument.spreadsheet
  • application/x-vnd.oasis.opendocument.chart
  • application/x-vnd.oasis.opendocument.spreadsheet
  • application/vnd.oasis.opendocument.image
  • application/x-vnd.oasis.opendocument.text
  • application/x-vnd.oasis.opendocument.text-template
  • application/vnd.oasis.opendocument.formula-template
  • application/x-vnd.oasis.opendocument.formula
  • application/vnd.oasis.opendocument.image-template
  • application/x-vnd.oasis.opendocument.image-template
  • application/x-vnd.oasis.opendocument.presentation-template
  • application/vnd.oasis.opendocument.presentation-template
  • application/vnd.oasis.opendocument.text
  • application/vnd.oasis.opendocument.text-template
  • application/vnd.oasis.opendocument.chart-template
  • application/x-vnd.oasis.opendocument.chart-template
  • application/x-vnd.oasis.opendocument.formula-template
  • application/x-vnd.oasis.opendocument.text-master
  • application/vnd.oasis.opendocument.presentation
  • application/x-vnd.oasis.opendocument.graphics
  • application/vnd.oasis.opendocument.formula
  • application/vnd.oasis.opendocument.text-master

PDFParser

CompressorParser

PackageParser

  • application/x-tar
  • application/java-archive
  • application/x-arj
  • application/x-archive
  • application/zip
  • application/x-cpio
  • application/x-tika-unix-dump
  • application/x-7z-compressed

RarParser

  • application/x-rar-compressed

PRTParser

  • application/x-prt

SAS7BDATParser

  • application/x-sas-data

TMXParser

  • application/x-tmx

FLVParser

  • Video/X-FLV

WACZParser

  • application/x-wacz

WARCParser

  • application/warc
  • application/warc+gz

QuattroProParser

  • application/x-quattro-pro; version=9

WordPerfectParser

  • application/vnd.wordperfect; version=5.1
  • application/vnd.wordperfect; version=5.0
  • application/vnd.wordperfect; version=6.x

XLIFF12Parser

  • application/x-xliff+xml

XLZParser

  • application/x-xliff+zip

DcXMLParser

  • application/xml
  • image/svg+xml

FictionBookParser

  • application/x-fictionbook+xml

FlacParser

  • Audio/X-Oggflac
  • Audio/X-FLAC

OggParser

  • Audio/OGG
  • application/kate
  • application/ogg
  • Video/Daala
  • video/x-ogguvs
  • Video/X-OGM
  • audio/x-oggpcm
  • video/ogg
  • video/x-dirac
  • video/x-oggrgb
  • Video/X-Oggyuv

OpusParser

  • Audio/Opus
  • Audio/OGG; codecs=opus

SpeexParser

  • Audio/OGG; codecs=speex
  • audio/speex

TheoraParser

  • video/theora

VorbisParser

  • Audio/Vorbis

 

Contact us

We focus on holistic service & a high-end enterprise search engine. Please contact us.