Search, find, save with searchit
Efficient search in all Microsoft Office file formats, Portable Document Format (PDF) files, XML formats and many more!
Searching almost any file format is one of the greatest strengths of the enterprise search solution searchit. Unlike a search in File Explorer, searchit searches the content and metadata of all indexed files, even in scan, image or CAD file formats. Find out exactly how searchit makes the unsearchable searchable and scroll through the full list of all supported file formats.
How are files searched in searchit?
As an enterprise search solution, searchit enables the comprehensive search of files through intelligent indexing and categorization. Users can quickly and efficiently search for content in various formats such as documents, emails, presentations and more to find relevant information and increase productivity.
What are MIME types?
MIME types (Multipurpose Internet Mail Extensions) are labels that define the media type of files on the Internet. They enable the correct interpretation and processing of content by telling servers and browsers the file type.
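To illustrate how a MIME type labels a file, here is a minimal sketch using Python's standard `mimetypes` module, which guesses the type from the file extension. It is only an illustration and not a description of how searchit detects formats; the file names are made up for the example.

```python
import mimetypes


def guess_mime(filename: str) -> str:
    """Return the MIME type guessed from the file extension, or a generic fallback."""
    mime, _encoding = mimetypes.guess_type(filename)
    # application/octet-stream is the conventional label for unknown binary data.
    return mime or "application/octet-stream"


# Hypothetical file names, used only for demonstration.
for name in ("report.pdf", "index.html", "logo.png"):
    print(name, "->", guess_mime(name))
```

Guessing by extension is cheap but can be fooled by renamed files, which is why content-based detection is usually preferred for indexing.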
Supported categories of file formats
A lawyer searches email archives every day, a foreman searches CAD files: the most commonly used file format depends on both the industry and the job. searchit's ever-growing number of parsers allows searching in almost all file categories.
HTML (Hypertext Markup Language)
The lingua franca of the web: almost every kind of HTML found on the web is supported by searchit's search function:
- Valid XHTML code and XML
- Microsoft Office document formats
- OpenDocument
- iWork
- Portable document formats
- EPUB
- RTF
- Compression and packaging formats
- Audio, image and video formats
- And other scientific, language processing, object recognition and database-based formats
XML and derived formats
The Extensible Markup Language (XML) format is used both for hierarchically structured data and for platform-independent exchange of data between computer systems. The XML languages supported by searchit include:
- XHTML (Extensible Hypertext Markup Language)
- OOXML (Office Open XML)
- ODF (Open Document Format)
Microsoft Office document formats
Text and metadata from Microsoft Office and some related applications can be searched in the following formats:
- OLE 2 Compound Document Format
- OOXML (Office Open XML)
- Temporary Office lock files (owner files)
OpenDocument format
searchit searches the OpenDocument Format (ODF) for:
- All files of the OpenOffice.org office suite
- Older files in OpenOffice 1.0 format, the predecessor of ODF
iWork document formats
Both text and metadata are supported for iWork documents, including:
- Numbers
- Pages
- Keynote
WordPerfect document formats
searchit searches all formats associated with:
- Corel WordPerfect Office Suite
- WordPerfect WP6+ files
- QuattroPro QPW v9+ files
Portable Document Format
Digitally created PDFs and non-searchable scans are made searchable in searchit using the OCR functionality. More about PDF search with searchit.
Electronic publication format
searchit searches eBooks, digital books and papers in the following formats:
- Electronic Publication Format (EPUB)
- Fiction Book Publishing Format
Rich text format
Full search functionality for documents in Rich Text Format (RTF).
Compression and packaging formats
The enterprise search software searchit also lets you search compressed data. Various compression and packaging formats are supported:
- Tar
- AR
- ARJ
- CPIO
- dump
- Zip
- 7Zip
- Gzip
- Bzip2
- XZ
- Lzma
- Z
- pack200
- RAR
- AppleSingle files
- AppleDouble files
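Searching inside an archive means unpacking each member and examining its content. The following is a minimal sketch of that idea for Zip files using Python's standard `zipfile` module. It is an illustration under simplifying assumptions (every member is treated as UTF-8 text), not searchit's implementation; the function name `search_zip` and the demo file names are hypothetical.

```python
import io
import zipfile


def search_zip(archive_bytes: bytes, term: str) -> list[str]:
    """Return the names of archive members whose text contains `term` (case-insensitive)."""
    hits = []
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            # Simplification: decode every member leniently as UTF-8 text.
            # A real indexer would choose a parser per member type instead.
            text = archive.read(info).decode("utf-8", errors="replace")
            if term.lower() in text.lower():
                hits.append(info.filename)
    return hits


# Build a small demo archive in memory (hypothetical names and contents).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as demo:
    demo.writestr("notes/meeting.txt", "Budget review for the Vienna office")
    demo.writestr("readme.txt", "Nothing relevant here")

print(search_zip(buffer.getvalue(), "budget"))
```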
Text formats
Extracting text content from simple text files seems like a simple task until you start thinking about all the possible character encodings. searchit can detect the character encoding of a text document automatically.
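To give a feel for what encoding detection involves, here is a deliberately simple sketch: it checks for a byte order mark, then tries UTF-8, and finally falls back to Latin-1. This heuristic is an assumption chosen for illustration and is not the detection method searchit uses; production detectors rely on statistical analysis of the byte stream.

```python
import codecs

# Byte order marks and the codec each one indicates. UTF-32 is checked before
# UTF-16 because the UTF-32 LE mark begins with the UTF-16 LE mark.
_BOMS = [
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF32_LE, "utf-32"),
    (codecs.BOM_UTF32_BE, "utf-32"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
]


def detect_encoding(data: bytes) -> str:
    """Guess a codec name for `data` using a BOM check and a UTF-8 trial decode."""
    for bom, codec_name in _BOMS:
        if data.startswith(bom):
            return codec_name
    try:
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Assumed fallback: Latin-1 maps every byte to a character, so it never fails.
        return "latin-1"


def decode_text(data: bytes) -> str:
    """Decode `data` with the guessed codec."""
    return data.decode(detect_encoding(data))


print(detect_encoding("Grüße".encode("utf-8")))
print(detect_encoding("Grüße".encode("latin-1")))
```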
Feed and syndication formats
Updates from websites, podcasts or news articles – searchit supports syndication formats that allow users to always stay up to date:
- RSS Feed
- Atom feed
- IPTC ANPA News Wire Feed Format
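As an illustration of what a feed contains, the sketch below reads the entry titles from a small Atom document with Python's standard `xml.etree.ElementTree`. The sample feed and the function name `feed_titles` are made up for this example; it does not show how searchit processes feeds.

```python
import xml.etree.ElementTree as ET

# The namespace defined by the Atom syndication format.
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}

# Hypothetical sample feed, used only for demonstration.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Example news</title>
  <entry><title>Release notes</title></entry>
  <entry><title>New parser added</title></entry>
</feed>"""


def feed_titles(xml_text: str) -> list[str]:
    """Return the title of every entry in an Atom feed."""
    root = ET.fromstring(xml_text)
    return [
        entry.findtext("atom:title", default="", namespaces=ATOM_NS)
        for entry in root.findall("atom:entry", ATOM_NS)
    ]


print(feed_titles(SAMPLE_FEED))
```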
Help formats
searchit searches the Microsoft Help files:
- CHM help format (called Compiled HTML Help, also Compressed HTML Help or Compiled Help Module(s))
Video formats
Video recordings in the most common formats are searched by searchit with a focus on metadata:
- Flash video format
- MP4 family of video formats including MP4, Quicktime, 3GPP and many more.
- Ogg family of video formats
Java class files and archives
Class names and method signatures can be searched in searchit in the following formats:
- Java class files
- jar archives
Source Code
searchit searches source code for content and metadata:
- Java
- C
- C++
- Groovy
- and more!
Email formats
Searching through emails and even email archives is possible with searchit in the following formats:
- PST email format, used with Microsoft Outlook archives
- MSG email format, used for individually downloaded Outlook emails
- Microsoft TNEF (Transport Neutral Encoding Format, also known as Winmail.dat), used by most Microsoft email clients for email attachments
- mbox format, widely used in email archives and Unix-like mailboxes
- RFC 822 format, used by many email clients in archives and exports
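To show what searchable content an RFC 822 message carries, the following sketch uses Python's standard `email` package to pull header fields and the plain-text body out of a message. The sample message and the function name `extract_fields` are hypothetical, and this is an illustration rather than searchit's own email parser.

```python
from email import message_from_string

# Hypothetical sample message, used only for demonstration.
RAW_MESSAGE = """\
From: anna@example.com
To: legal@example.com
Subject: Contract draft
Content-Type: text/plain; charset="utf-8"

Please review the attached contract draft by Friday.
"""


def extract_fields(raw: str) -> dict[str, str]:
    """Return selected headers and the concatenated text/plain body of a message."""
    msg = message_from_string(raw)
    body_parts = []
    # walk() visits the message itself and, for multipart mail, every nested part.
    for part in msg.walk():
        if part.get_content_type() == "text/plain":
            payload = part.get_payload(decode=True)  # bytes after transfer decoding
            charset = part.get_content_charset() or "utf-8"
            body_parts.append(payload.decode(charset, errors="replace"))
    return {
        "from": msg["From"],
        "to": msg["To"],
        "subject": msg["Subject"],
        "body": "\n".join(body_parts).strip(),
    }


print(extract_fields(RAW_MESSAGE))
```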
CAD formats
searchit searches metadata of files in DWG CAD format.
Font formats
Search for metadata even in font files – searchit supports:
- TrueType font format
- Adobe Font Metrics files
Scientific formats
searchit can search the metadata and content of many formats produced by programs used specifically in science:
- GCMD Directory Interchange Format (DIF)
- GDAL
- ISO 19139 file format for geographic information
- GRIB
- HDF
- ISA-Tab family of file formats (ISA Tools)
- NetCDF
- Matlab
Executable programs and libraries
searchit extracts and searches metadata about platforms, architectures and types from a range of executable formats and libraries:
- Windows executables
- Linux/BSD programs and libraries
- and many more!
Crypto formats
Thanks to secure access rules and special parsers, searchit even searches encrypted messages:
- PKCS7 signed messages, without information from the outer PKCS7 wrapper
- Metadata from Time Stamped Data Envelope (TSD) files
- Stored content from the TSD wrapper
Database formats
Several types of databases can be searched quickly and easily in searchit:
- SQLite3 files
- Microsoft Access database files
- dBase files (dbf) including dBase, FoxBASE, FoxPRO and shapefile format from ESRI
Natural Language Processing
Artificial intelligence is used in searchit, for example through Natural Language Processing and Named Entity Recognition frameworks. This makes the following possible:
- Classifying the mood and emotional tone of a document
- Extracting metadata from full text of journal publications.
Image and video object detection
Multiple object detection frameworks are supported to analyze the content of images and videos. searchit instances are trained using large training data sets for specific customer application areas.
Know what's inside - regardless of the file format
Thanks to searchit, you can search hundreds of file formats at the same time on one central platform.
Complete list of searchable MIME types
searchit supports over three hundred formats for text files, images and scans, PDFs and much more:
AppleSingleFileParser
- application/applefile
PListParser
- application/x-plist
- application/x-bplist-itunes
- application/x-bplist
- application/x-bplist-memgraph
- application/x-bplist-webarchive
ClassParser
- application/java-vm
AudioParser
- audio/vnd.wave
- audio/x-wav
- audio/basic
- audio/x-aiff
MidiParser
- application/x-midi
- audio/midi
SourceCodeParser
- text/x-c++src
- text/x-groovy
- text/x-java-source
Pkcs7Parser
- application/pkcs7-signature
- application/pkcs7-mime
TSDParser
- application/timestamped-data
TextAndCSVParser
- text/csv
- text/tsv
- text/plain
DBFParser
- application/x-dbf
DGN8Parser
- image/vnd.dgn; version=8
DIFParser
- application/dif+xml
DWGParser
EpubParser
- application/x-ibooks+zip
- application/epub+zip
ExecutableParser
- application/x-msdownload
- application/x-sharedlib
- application/x-elf
- application/x-object
- application/x-executable
- application/x-coredump
ExternalParser
- video/avi
- video/mpeg
- video/x-msvideo
- video/mp4
FeedParser
- application/atom+xml
- application/rss+xml
AdobeFontMetricParser
- application/x-font-adobe-metric
TrueTypeParser
- application/x-font-ttf
HtmlParser
- text/html
- application/vnd.wap.xhtml+xml
- application/x-asp
- application/xhtml+xml
HttpParser
- application/x-httpresponse
HwpV5Parser
- application/x-hwp-v5
BPGParser
- image/bpg
- image/x-bpg
HeifParser
- image/heic-sequence
- image/heif
- image/heic
- image/heif-sequence
ICNSParser
- image/icns
ImageParser
- image/png
- image/vnd.wap.wbmp
- image/x-jbig2
- image/bmp
- image/x-xcf
- image/gif
- image/x-icon
- image/x-ms-bmp
JXLParser
- image/jxl
JpegParser
- image/jpeg
PSDParser
- image/vnd.adobe.photoshop
TiffParser
WebPParser
- image/webp
IDMLParser
- application/vnd.adobe.indesign-idml-package
IptcAnpaParser
- text/vnd.iptc.anpa
IWorkPackageParser
- application/vnd.apple.keynote
- application/vnd.apple.iwork
- application/vnd.apple.numbers
- application/vnd.apple.pages
IWork13PackageParser
- application/vnd.apple.numbers.13
- application/vnd.apple.unknown.13
- application/vnd.apple.pages.13
- application/vnd.apple.keynote.13
IWork18PackageParser
- application/vnd.apple.pages.18
- application/vnd.apple.keynote.18
- application/vnd.apple.numbers.18
RFC822Parser
- message/rfc822
MatParser
- application/x-matlab-data
MboxParser
- application/mbox
EMFParser
- image/emf
JackcessParser
- application/x-msaccess
MSOwnerFileParser
OfficeParser
- application/x-tika-msoffice-embedded; format=ole10_native
- application/msword
- application/vnd.visio
- application/x-tika-ole-drm-encrypted
- application/vnd.ms-project
- application/x-tika-msworks-spreadsheet
- application/x-mspublisher
- application/vnd.ms-powerpoint
- application/x-tika-msoffice
- application/sldworks
- application/x-tika-ooxml-protected
- application/vnd.ms-excel
- application/vnd.ms-outlook
OldExcelParser
- application/vnd.ms-excel.workspace.3
- application/vnd.ms-excel.workspace.4
- application/vnd.ms-excel.sheet.2
- application/vnd.ms-excel.sheet.3
- application/vnd.ms-excel.sheet.4
TNEFParser
- application/vnd.ms-tnef
- application/x-tnef
- application/ms-tnef
WMFParser
- image/wmf
ActiveMimeParser
- application/x-activemime
ChmParser
- application/vnd.ms-htmlhelp
- application/x-chm
- application/chm
OneNoteParser
- application/onenote; format=one
OOXMLParser
- application/vnd.ms-powerpoint.template.macroenabled.12
- application/vnd.ms-excel.addin.macroenabled.12
- application/vnd.openxmlformats-officedocument.wordprocessingml.template
- application/vnd.ms-excel.sheet.binary.macroenabled.12
- application/vnd.openxmlformats-officedocument.wordprocessingml.document
- application/vnd.ms-powerpoint.slide.macroenabled.12
- application/vnd.ms-visio.drawing
- application/vnd.ms-powerpoint.slideshow.macroenabled.12
- application/vnd.ms-powerpoint.presentation.macroenabled.12
- application/vnd.openxmlformats-officedocument.presentationml.slide
- application/vnd.ms-excel.sheet.macroenabled.12
- application/vnd.ms-word.template.macroenabled.12
- application/vnd.ms-word.document.macroenabled.12
- application/vnd.ms-powerpoint.addin.macroenabled.12
- application/vnd.openxmlformats-officedocument.spreadsheetml.template
- application/vnd.ms-xpsdocument
- application/vnd.ms-visio.drawing.macroenabled.12
- application/vnd.ms-visio.template.macroenabled.12
- model/vnd.dwfx+xps
- application/vnd.openxmlformats-officedocument.presentationml.template
- application/vnd.openxmlformats-officedocument.presentationml.presentation
- application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
- application/vnd.ms-visio.stencil
- application/vnd.ms-visio.template
- application/vnd.openxmlformats-officedocument.presentationml.slideshow
- application/vnd.ms-visio.stencil.macroenabled.12
- application/vnd.ms-excel.template.macroenabled.12
Word2006MLParser
pst.OutlookPSTParser
rtf.RTFParser
- application/rtf
xml.SpreadsheetMLParser
- application/vnd.ms-spreadsheetml
xml.WordMLParser
- application/vnd.ms-wordml
MIFParser
- application/x-mif
- application/vnd.mif
- application/x-maker
Mp3Parser
- audio/mpeg
MP4Parser
- video/x-m4v
- application/mp4
- video/3gpp
- video/3gpp2
- video/quicktime
- audio/mp4
- video/mp4
TesseractOCRParser
- image/ocr-x-portable-pixmap
- image/ocr-jpx
- image/x-portable-pixmap
- image/ocr-jpeg
- image/ocr-jp2
- image/jpx
- image/ocr-png
- image/ocr-tiff
- image/ocr-gif
- image/ocr-bmp
- image/jp2
FlatOpenDocumentParser
- application/vnd.oasis.opendocument.tika.flat.document
- application/vnd.oasis.opendocument.flat.presentation
- application/vnd.oasis.opendocument.flat.spreadsheet
- application/vnd.oasis.opendocument.flat.text
OpenDocumentParser
- application/x-vnd.oasis.opendocument.presentation
- application/vnd.oasis.opendocument.chart
- application/x-vnd.oasis.opendocument.text-web
- application/x-vnd.oasis.opendocument.image
- application/vnd.oasis.opendocument.graphics-template
- application/vnd.oasis.opendocument.text-web
- application/x-vnd.oasis.opendocument.spreadsheet-template
- application/vnd.oasis.opendocument.spreadsheet-template
- application/vnd.sun.xml.writer
- application/x-vnd.oasis.opendocument.graphics-template
- application/vnd.oasis.opendocument.graphics
- application/vnd.oasis.opendocument.spreadsheet
- application/x-vnd.oasis.opendocument.chart
- application/x-vnd.oasis.opendocument.spreadsheet
- application/vnd.oasis.opendocument.image
- application/x-vnd.oasis.opendocument.text
- application/x-vnd.oasis.opendocument.text-template
- application/vnd.oasis.opendocument.formula-template
- application/x-vnd.oasis.opendocument.formula
- application/vnd.oasis.opendocument.image-template
- application/x-vnd.oasis.opendocument.image-template
- application/x-vnd.oasis.opendocument.presentation-template
- application/vnd.oasis.opendocument.presentation-template
- application/vnd.oasis.opendocument.text
- application/vnd.oasis.opendocument.text-template
- application/vnd.oasis.opendocument.chart-template
- application/x-vnd.oasis.opendocument.chart-template
- application/x-vnd.oasis.opendocument.formula-template
- application/x-vnd.oasis.opendocument.text-master
- application/vnd.oasis.opendocument.presentation
- application/x-vnd.oasis.opendocument.graphics
- application/vnd.oasis.opendocument.formula
- application/vnd.oasis.opendocument.text-master
PDFParser
CompressorParser
- application/zlib
- application/x-gzip
- application/x-bzip2
- application/x-compress
- application/x-java-pack200
- application/x-lzma
- application/deflate64
- application/x-lz4
- application/x-snappy
- application/x-brotli
- application/gzip
- application/x-bzip
- application/x-xz
PackageParser
- application/x-tar
- application/java-archive
- application/x-arj
- application/x-archive
- application/zip
- application/x-cpio
- application/x-tika-unix-dump
- application/x-7z-compressed
RarParser
- application/x-rar-compressed
PRTParser
- application/x-prt
SAS7BDATParser
- application/x-sas-data
TMXParser
- application/x-tmx
FLVParser
- video/x-flv
WACZParser
- application/x-wacz
WARCParser
- application/warc
- application/warc+gz
QuattroProParser
- application/x-quattro-pro; version=9
WordPerfectParser
- application/vnd.wordperfect; version=5.1
- application/vnd.wordperfect; version=5.0
- application/vnd.wordperfect; version=6.x
XLIFF12Parser
- application/x-xliff+xml
XLZParser
- application/x-xliff+zip
DcXMLParser
- application/xml
- image/svg+xml
FictionBookParser
- application/x-fictionbook+xml
FlacParser
- audio/x-oggflac
- audio/x-flac
OggParser
- audio/ogg
- application/kate
- application/ogg
- video/daala
- video/x-ogguvs
- video/x-ogm
- audio/x-oggpcm
- video/ogg
- video/x-dirac
- video/x-oggrgb
- video/x-oggyuv
OpusParser
- audio/opus
- audio/ogg; codecs=opus
SpeexParser
- audio/ogg; codecs=speex
- audio/speex
TheoraParser
- video/theora
VorbisParser
- audio/vorbis
Contact
We rely on holistic service and a high-end enterprise search engine. Contact us.