Requirements: Microsoft Windows XP / Vista / Windows 7 / Windows 8 / Windows 10
Date added: Aug 01, 2018
DocFetcher is an open source desktop search application: it lets you search the contents of files on your own computer. You can think of it as Google for your local files. The application runs on Windows, Linux and OS X, and is made available under the Eclipse Public License.
- A portable version: There is a portable version of DocFetcher that runs on Windows, Linux and OS X. How this is useful is described in more detail further down this page.
- 64-bit support: Both 32-bit and 64-bit operating systems are supported.
- Unicode support: DocFetcher includes rock-solid Unicode support for all major formats, including Microsoft Office, OpenOffice.org, PDF, HTML, RTF and plain text files.
- Archive support: DocFetcher supports the following archive formats: zip, 7z, rar, and the whole tar.* family. The file extensions for zip archives can be customized, allowing you to add more zip-based archive formats as needed. Additionally, DocFetcher can handle unlimited nesting of archives (e.g. a zip archive containing a 7z archive containing a rar archive, etc.).
- Search in source code files: The file extensions by which DocFetcher recognizes plain text documents can be customized, so you can use DocFetcher for searching in any type of source code and other text-based file formats. (This works quite well in combination with the customizable zip extensions, e.g. for searching in Java source code inside Jar files.)
- Outlook PST files: DocFetcher allows searching for Outlook emails, which Microsoft Outlook typically stores in PST files.
- Detection of HTML pairs: By default, DocFetcher detects pairs of HTML files (e.g. a file named "foo.html" and a folder named "foo_files"), and treats the pair as a single document. This feature might appear rather useless at first, but it dramatically improves the quality of the search results when you're dealing with HTML documents, since all the "clutter" inside the HTML folders disappears from the results.
- Regex-based exclusion of files from indexing: You can use regular expressions to exclude certain files from indexing. For instance, to exclude Microsoft Excel documents, you can use a regular expression such as this: .*\.xls
- Mime-type detection: You can use regular expressions to turn on "mime-type detection" for specific files, meaning that DocFetcher will try to detect their actual file types not just by looking at the filename, but also by peeking into the file contents. This is useful for files that have the wrong file extension.
- Powerful query syntax: In addition to basic constructs like OR, AND and NOT, DocFetcher also supports, among other things: wildcards, phrase search, fuzzy search ("find words that are similar to..."), proximity search ("these two words should be at most 10 words away from each other"), and boosting ("increase the score of documents containing...").
- DocFetcher could not be started on OS X with Java 9 or newer
- Crash on Windows due to hotkey issues
- Crash on some PDF files
- MS Office files containing very large amounts of text could not be read
- DocFetcher could not read the metadata of certain JPEG files
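
To make the regex-based exclusion and mime-type detection features from the list above more concrete, here is a minimal Python sketch of both ideas. It is an illustration only, not DocFetcher's actual implementation: the function names, the signature table, and the choice to match the pattern against the whole filename are assumptions made for this example.

```python
import re

# Hypothetical exclusion list; ".*\.xls" is the example pattern from the feature list.
EXCLUDE_PATTERNS = [re.compile(r".*\.xls")]

# A few well-known file signatures ("magic bytes") and the types they indicate.
SIGNATURES = [
    (b"%PDF-", "application/pdf"),
    (b"PK\x03\x04", "application/zip"),
    (b"{\\rtf", "application/rtf"),
]


def is_excluded(filename, patterns=EXCLUDE_PATTERNS):
    """Return True if the whole filename matches any exclusion pattern.

    Assumption: the pattern must match the entire filename, not a substring.
    """
    return any(p.fullmatch(filename) for p in patterns)


def sniff_type(head):
    """Guess a file type from the first bytes of a file, or None if unknown."""
    for magic, mime in SIGNATURES:
        if head.startswith(magic):
            return mime
    return None


if __name__ == "__main__":
    for name in ["report.xls", "report.xlsx", "notes.txt"]:
        print(name, "excluded" if is_excluded(name) else "indexed")
    # A PDF saved with the wrong extension is still recognized by its contents.
    print(sniff_type(b"%PDF-1.4\n"))
```

Under the whole-filename assumption, the pattern .*\.xls excludes "report.xls" but not "report.xlsx"; a pattern such as .*\.xlsx? would cover both. The match in this sketch is also case-sensitive, so "REPORT.XLS" would need a case-insensitive pattern.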