Search more than 300 different file formats easily and efficiently

 

Finding information made easy: the company's internal search engineit allows searching almost any file format across millions of files on local file servers, mail servers and archives.

Search, find, save with searchit

Efficient search in all Microsoft Office file formats, Portable Document Format – PDFs –, XML formats and many more!

How do files appear in searchit searched?

What are MIME types?

Supported categories of file formats

Complete list of searchable MIME types

Logo searchit

Searching almost any file format is one of the greatest strengths of the enterprise search solution searchit. In contrast to searching in File Explorer, the file content including metadata of all indexed files can be searched, even in scan, image or CAD file formats. Find out exactly how searchit Makes unsearchable searchable and scroll through the full list of all supported file formats.

How do files get into searchit searched?

As an enterprise search solution, searchit the comprehensive search of files through intelligent indexing and categorization. Users can quickly and efficiently search for content in various formats such as documents, emails, presentations and more to find relevant information and increase productivity.

What are MIME types?

 

MIME types (Multipurpose Internet Mail Extensions) are labels that define the media type of files on the Internet. They enable the correct interpretation and processing of content by telling servers and browsers the file type.

Supported categories of file formats

The lawyer rummages through email archives every day, the foreman rummages through CAD files - the most commonly used file format depends on both the industry and the job. searchits ever-growing number of parsers allows searching in almost all file categories.

HTML (Hypertext Markup Language)

The lingua franca of the web – Almost every HTML format found on the web is searchedit Search function supported:

  • Valid XHTML code and XML
  • Microsoft Office document formats
  • Opendocument
  • iWorks
  • Portable document formats
  • EPUB
  • RTF
  • Compression and packaging formats
  • Audio, image and video formats
  • And other scientific, language processing, object recognition and database-based formats

XML and derived formats

The Extensible Markup Language (XML) format is used both for hierarchically structured data and for platform-independent exchange of data between computer systems. To those from searchit Supported XML languages ​​include:

  • XHTML (Extensible Hypertext Markup Language)
  • OOXML (Office Open XML)
  • ODF (Open Document Format)

Microsoft Office document formats

Text and metadata extraction Microsoft Office and some related applications can be searched in the following formats:

  • OLE 2 Compound Document Format
  • OOXML (Office Open XML)
  • Temporary Office lock files (owner files)

OpenDocument format

searchit searches the OpenDocument Format (ODF) for:

  • All files of the OpenOffice.org office suite
  • Older files in OpenOffice 1.0 format, the predecessor of ODF

iWorks document formats

Both text and metadata are supported in iWorks, including:

  • Numbers
  • Pages
  • Keynotes

WordPerfect document formats

searchit searches all formats associated with:

  • Corel WordPerfect Office Suite
  • WordPerfect WP6+ files
  • QuattroPro QPW v9+ files

Portable Document Format

Digitally created and non-searchable scans are displayed in searchit made searchable using the ORC functionality. More about PDF search with searchit.

Electronic publication format

searchit searches eBooks, digital books and papers in the following formats:

  • Electronic Publication Format (EPUB)
  • Fiction Book Publishing Format

Rich text format

Full search functionality for documents in Rich Text Format (RTF).

Compression and packaging formats

Enterprise Search Software searchit allows you to search yourself compressed data. Various compression and packaging formats are supported:

  • Tar
  • AR
  • ARJ
  • CPIO
  • dump
  • Zip
  • 7Zip
  • Gzip
  • Bzip2
  • XZ
  • Lzma
  • Z
  • pack200
  • RAR
  • AppleSingle and
  • AppleDouble files

Text formats

Extracting text content from simple text files seems like a simple task until you start thinking about all the possible character encodings. searchit is capable of character encoding text document to be recognized automatically.

Feed and syndication formats

Updates from websites, podcasts or news articles – searchit supports syndication formats that allow users to always stay up to date:

  • RSS Feed
  • Atom feed
  • IPTC ANPA News Wire Feed Format

Help formats

searchit searches the Microsoft Help files:

  • CHM help format (called Compiled HTML Help, also Compressed HTML Help or Compiled Help Module(s))

Video formats

Video recordings in the most common formats are recorded with serachit searched with a focus on metadata:

  • Flash video format
  • MP4 family of video formats including MP4, Quicktime, 3GPP and many more.
  • Ogg family of video formats

Java class files and archives

Class names and method signatures are displayed in searchit searched in the following formats:

  • Java class files
  • jar archives

Source Code

searchit Searches source code for content and metadata:

  • Java
  • C
  • C++ Groovy
  • and more!

Email formats

Searching through emails and even email archives is searchit possible in the following formats:

  • PST email format, used with Microsoft Outlook archives
  • MSG email format, used for individually downloaded Outlook emails
  • Microsoft TNEF (Transport Neutral Encoding Format, also known as Winmail.dat), used by most Microsoft email clients for email attachments
  • mbox format, widely used in email archives and Unix-like mailboxes
  • RFC 822 format: Used by many email clients in archives and exports

CAD formats

searchit searches metadata of files in DWG CAD format.

Font formats

Search for metadata even in font files – searchit supports:

  • TrueType font format
  • Adobe Font Metrics files

Scientific formats

Many of the programs that are specifically used in science can be used with searchit Search for metadata and content:

  • GCMD Directory Interchange Format (DIF)
  • GDAL
  • ISO 19139 file format for geographic information
  • Grib
  • HDF
  • Family of file formats ISA-Tab (ISA Tools)
  • NetCDF
  • Matlab

Executable programs and libraries

With searchit Extracts and searches metadata information about platforms, architectures and types from a range of executable formats and libraries:

  • Windows executables
  • Linux/BSD programs and libraries
  • and many more!

Crypto formats

Search is searched through secure access regulations and using special parsersit even encrypted messages:

  • PKCS7 signed messages, without information from the outer PKCS7 wrapper
  • Metadata from Time Stamped Data Envelope (TSD) files
  • Stored content from the TSD wrapper

Database formats

Several types of databases can be included in searchit can be searched quickly and easily:

  • SQLite3 files
  • Microsoft Access database files
  • dBase files (dbf) including dBase, FoxBASE, FoxPRO and shapefile format from ESRI

Natural Language Processing

Artificial intelligence comes to searchit e.g. B. used using Natural Language Processing and Named Entity Recognition frameworks. This makes possible:

  • Classifying the mood and emotional tone of a document
  • Extracting metadata from full text of journal publications.

Image and video object detection

Multiple object detection frameworks are supported to analyze the content of images and videos. searchit Instances are trained using large training data sets for specific customer application areas.

Know what's inside - regardless of the file format

Thanks searchit You can search in hundreds of file formats at the same time on a central platform

Complete list of searchable MIME types

Over three hundred formats for text files, images and scans, PDFs and much more are available in searchit supports:

AppleSingleFileParse

  • application/applefile

PListParser

  • application/x-plist
  • application/x-bplist-itunes
  • application/x-bplist
  • application/x-bplist-memgraph
  • application/x-bplist-webarchive

ClassParser

  • application/java-vm

AudioParser

  • audio/vnd.wave
  • audio/x-wav
  • audio/basic
  • audio/x-aiff

MidiParser

  • application/x-midi
  • audio/midi

SourceCodeParser

  • text/x-c++src
  • text/x-groovy
  • text/x-java-source

Pkcs7Parser

  • application/pkcs7-signature
  • application/pkcs7-mime

TSD Parser

  • application/timestamped-data

TextAndCSVParser

  • text/csv
  • text/tsv
  • text / plain

DBFParser

  • application/x-dbf

DGN8Parser

  • image/vnd.dgn; version=8

DIFParser

  • application/dif+xml

DWG Parser

EpubParser

  • application/x-ibooks+zip
  • application/epub+zip

ExecutableParser

  • application/x-msdownload
  • application/x-sharedlib
  • application/x-eleven
  • application/x-object
  • application / x-executable
  • application/x-coredump

ExternalParser

  • video/avi
  • video/mpeg
  • video/x-msvideo
  • video / mp4

FeedParser

  • application/atom+xml
  • application/rss+xml

AdobeFontMetricParser

  • application/x-font-adobe-metric

TrueTypeParser

  • application/x-font-ttf

HtmlParser

  • text / html
  • application/vnd.wap.xhtml+xml
  • application/x-asp
  • application/xhtml+xml

HttpParser

  • application/x-httpresponse

HwpV5Parser

  • application/x-hwp-v5

BPGParser

  • image/bpg
  • image/x-bpg

HeifParser

  • image/heic-sequence
  • image/hot
  • image/heic
  • image/heif-sequence

ICNSParser

  • image/icns

ImageParser

  • image / png
  • image/vnd.wap.wbmp
  • image/x-jbig2
  • image/bmp
  • image/x-xcf
  • image / gif
  • image/x-icon
  • image/x-ms-bmp

JXLParser

  • image/jxl

JpegParser

  • image / jpeg

PSDParser

  • image/vnd.adobe.photoshop

TiffParser

WebPParser

  • image/webp

IDML Parser

  • application/vnd.adobe.indesign-idml-package

IptcAnpaParser

  • text/vnd.iptc.anpa

IWorkPackageParser

  • application/vnd.apple.keynote
  • application/vnd.apple.iwork
  • application/vnd.apple.numbers
  • application/vnd.apple.pages

IWork13PackageParser

  • application/vnd.apple.numbers.13
  • application/vnd.apple.unknown.13
  • application/vnd.apple.pages.13
  • application/vnd.apple.keynote.13

IWork18PackageParser

  • application/vnd.apple.pages.18
  • application/vnd.apple.keynote.18
  • application/vnd.apple.numbers.18

RFC822Parser

  • message/rfc822

MatParser

  • application/x-matlab-data

MboxParser

  • application/mbox

EMFParser

  • image/emf

JackcessParser

  • application/x-msaccess

MSOwnerFileParser

OfficeParser

OldExcelParser

TNEF Parser

  • application/vnd.ms-tnef
  • application/x-tnef
  • application/ms-tnef

WMFParser

  • image/wmf

ActiveMimeParser

  • application/x-activemime

ChmParser

  • application/vnd.ms-htmlhelp
  • application/x-chm
  • application/chm

OneNoteParser

  • application/onenote; format=one

OOXML Parser

  • application/vnd.ms-powerpoint.template.macroenabled.12
  • application/vnd.ms-excel.addin.macroenabled.12
  • application/vnd.openxmlformats-officedocument.wordprocessingml.template
  • application/vnd.ms-excel.sheet.binary.macroenabled.12
  • application / vnd.openxmlformats-officedocument.wordprocessingml.document
  • application/vnd.ms-powerpoint.slide.macroenabled.12
  • application/vnd.ms-visio.drawing
  • application/vnd.ms-powerpoint.slideshow.macroenabled.12
  • application/vnd.ms-powerpoint.presentation.macroenabled.12
  • application/vnd.openxmlformats-officedocument.presentationml.slide
  • application/vnd.ms-excel.sheet.macroenabled.12
  • application/vnd.ms-word.template.macroenabled.12
  • application/vnd.ms-word.document.macroenabled.12
  • application/vnd.ms-powerpoint.addin.macroenabled.12
  • application/vnd.openxmlformats-officedocument.spreadsheetml.template
  • application/vnd.ms-xpsdocument
  • application/vnd.ms-visio.drawing.macroenabled.12
  • application/vnd.ms-visio.template.macroenabled.12
  • model/vnd.dwfx+xps
  • application/vnd.openxmlformats-officedocument.presentationml.template
  • application / vnd.openxmlformats-officedocument.presentationml.presentation
  • application / vnd.openxmlformats-officedocument.spreadsheetml.sheet
  • application/vnd.ms-visio.stencil
  • application/vnd.ms-visio.template
  • application/vnd.openxmlformats-officedocument.presentationml.slideshow
  • application/vnd.ms-visio.stencil.macroenabled.12
  • application/vnd.ms-excel.template.macroenabled.12

Word2006ML Parser

pst.OutlookPSTParser

rtf.RTFParser

  • application/rtf

xml.SpreadsheetMLParser

  • application/vnd.ms-spreadsheetml

xml.WordMLParser

  • application/vnd.ms-wordml

MIFParser

  • application/x-mif
  • application/vnd.mif
  • application/x-maker

Mp3Parser

  • audio / mpeg

MP4Parser

  • video/x-m4v
  • application/mp4
  • video/3gpp
  • video/3gpp2
  • video/quicktime
  • audio/mp4
  • video / mp4

TesseractOCRParser

  • image/ocr-x-portable-pixmap
  • image/ocr-jpx
  • image/x-portable-pixmap
  • image/ocr-jpeg
  • image/ocr-jp2
  • image/jpx
  • image/ocr-png
  • image/ocr-tiff
  • image/ocr-gif
  • image/ocr-bmp
  • image/jp2

FlatOpenDocumentParser

  • application/vnd.oasis.opendocument.tika.flat.document
  • application/vnd.oasis.opendocument.flat.presentation
  • application/vnd.oasis.opendocument.flat.spreadsheet
  • application/vnd.oasis.opendocument.flat.text

OpenDocumentParser

  • application/x-vnd.oasis.opendocument.presentation
  • application/vnd.oasis.opendocument.chart
  • application/x-vnd.oasis.opendocument.text-web
  • application/x-vnd.oasis.opendocument.image
  • application/vnd.oasis.opendocument.graphics-template
  • application/vnd.oasis.opendocument.text-web
  • application/x-vnd.oasis.opendocument.spreadsheet-template
  • application/vnd.oasis.opendocument.spreadsheet-template
  • application/vnd.sun.xml.writer
  • application/x-vnd.oasis.opendocument.graphics-template
  • application/vnd.oasis.opendocument.graphics
  • application/vnd.oasis.opendocument.spreadsheet
  • application/x-vnd.oasis.opendocument.chart
  • application/x-vnd.oasis.opendocument.spreadsheet
  • application/vnd.oasis.opendocument.image
  • application/x-vnd.oasis.opendocument.text
  • application/x-vnd.oasis.opendocument.text-template
  • application/vnd.oasis.opendocument.formula-template
  • application/x-vnd.oasis.opendocument.formula
  • application/vnd.oasis.opendocument.image-template
  • application/x-vnd.oasis.opendocument.image-template
  • application/x-vnd.oasis.opendocument.presentation-template
  • application/vnd.oasis.opendocument.presentation-template
  • application/vnd.oasis.opendocument.text
  • application/vnd.oasis.opendocument.text-template
  • application/vnd.oasis.opendocument.chart-template
  • application/x-vnd.oasis.opendocument.chart-template
  • application/x-vnd.oasis.opendocument.formula-template
  • application/x-vnd.oasis.opendocument.text-master
  • application/vnd.oasis.opendocument.presentation
  • application/x-vnd.oasis.opendocument.graphics
  • application/vnd.oasis.opendocument.formula
  • application/vnd.oasis.opendocument.text-master

PDFParser

CompressorParser

PackageParser

  • application/x-tar
  • application/java-archive
  • application/x-arj
  • application/x-archive
  • application / zip
  • application/x-cpio
  • application/x-tika-unix-dump
  • application/x-7z-compressed

RarParser

  • application/x-rar-compressed

PRTParser

  • application/x-prt

SAS7BDAT Parser

  • application/x-sas-data

TMXParser

  • application/x-tmx

FLVParser

  • video/x-flv

WACZParser

  • application/x-wacz

WARCParser

  • application/warc
  • application/warc+gz

QuattroProParser

  • application/x-quattro-pro; version=9

WordPerfectParser

  • application/vnd.wordperfect; version=5.1
  • application/vnd.wordperfect; version=5.0
  • application/vnd.wordperfect; version=6.x

XLIFF12Parser

  • application/x-xliff+xml

XLZParser

  • application/x-xliff+zip

DcXMLParser

  • application/xml
  • image/svg+xml

FictionBookParser

  • application/x-fictionbook+xml

FlacParser

  • audio/x-oggflac
  • audio/x-flac

OggParser

  • audio / ogg
  • application/kate
  • application/ogg
  • video/daala
  • video/x-ogguvs
  • video/x-ogm
  • audio/x-oggpcm
  • video/ogg
  • video/x-dirac
  • video/x-oggrgb
  • video/x-oggyuv

OpusParser

  • audio/opus
  • audio/ogg; codecs=opus

SpeexParser

  • audio/ogg; codecs=speex
  • audio/speech

TheoraParser

  • video/theora

VorbisParser

  • audio/vorbis

 

Contact

We rely on holistic service and a high-end enterprise search engine. Contact us.