Speed ​​up file content search

Have a suggestion for "Everything"? Please post it here.
Post Reply
tuska
Posts: 1159
Joined: Thu Jul 13, 2017 9:14 am

Speed ​​up file content search

Post by tuska »

Hi,

Everything Options - 1.5.0.1391a (x64) - Content
- ✅ Index file content
- Include only folders: D:\;C:\Everything\;C:\totalcmd\
- Include only files:   *.csv;*.doc;*.docx;*.eml;*.ini;*.ion;*.md;*.mdb;*.msg;*.odt;*.pdf;*.pps;*.ppt;*.pptx;*.rtf;*.txt;*.xls;*.xlsm;*.xlsx;*.xlt;*.xltm;*.xltx

Search
content:searchword

... will IMMEDIATELY deliver the search result from approx. 600,000 files (in 100,000 folders)!
⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺
 
Bookmark - Search: Custom
content:mytext: | ansicontent:mytext: | ifilter-content:mytext: | plain-text-content:mytext: | unicodecontent:mytext: | utf8content:mytext: | utf16content:mytext: | utf16becontent:mytext:

Macro:
mc<mytext>


Search
mc:searchword

- ... also provides IMMEDIATE search results from approx. 600,000 files (in 100,000 folders) - e.g. 91 files -> Searching content 3% ...
- after about 45 minutes, 155 files were found -> Searching content 14% ...
- for such searches, it would probably also help to disable the energy-saving mode,
  because otherwise it may be that after returning to the PC after 2 hours, Searching content 16%, for example, is displayed.

⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺
Questions:
Is it possible to achieve a speed increase with regard to the different "content: variants" here
by possibly selecting the individual variants separately in the “Content” options? (Without the links!).
Would it use too much RAM?

⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺⸺

Code: Select all

Example:
✅ Index file content 	   (default: content:)
   ✅ ansicontent:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#ansicontent
   ✅ binarycontent:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#binarycontent
   ✅ eml-content:	   https://www.voidtools.com/forum/viewtopic.php?p=37761#eml-content   *)
   			   https://www.voidtools.com/forum/viewtopic.php?p=65318#p65318
   ✅ ifilter-content:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#ifiltercontent,
   			   https://www.voidtools.com/forum/viewtopic.php?p=64043#p64043 (& combine 'content types')
   ✅ plain-text-content:  https://www.voidtools.com/forum/viewtopic.php?t=10176#textplaincontent
   ✅ unicodecontent:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#unicodecontent
   ✅ utf8content:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#utf8content
   ✅ utf16content:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#utf16content
   ✅ utf16becontent:	   https://www.voidtools.com/forum/viewtopic.php?t=10176#utf16becontent
*)
ext:eml mc:"-0500"
... My bookmark can find the text!
  
*.eml eml-content:"-0500"
-OR-
ext:eml eml-content:"-0500"
... does NOT find the text!
    (That is also the reason WHY I used several parameters for the bookmark -> to find "ALL" file contents).

Related topics:
A Question for content searching,... | Q: Creating a shortcut for "content:"?
_______________________________________________________________________________
Windows 11 Pro (x64) Version 24H2 (OS Build 26100.2605) - 2024-12-10,
Processor: Intel(R) Core(TM) i5-12600K 12th Gen, 3.70 GHz, 10 Cores, 16 Logical Processors
Installed RAM: 32.0 GB (19.3 GB usable), Windows Feature Experience Pack 1000.26100.36.0
'Everything' 1.5.0.1391a (x64) - "Portable on system drive C:"
void
Developer
Posts: 17310
Joined: Fri Oct 16, 2009 11:31 pm

Re: Speed ​​up file content search

Post by void »

Is it possible to achieve a speed increase with regard to the different "content: variants" here
No, these search functions will goto disk.

Content indexing will index the content as text.
Not as raw binary data needed for these search functions. (ansicontent:, ifilter-content:, plain-text-content: etc..)


*.eml eml-content:"-0500" -OR- ext:eml eml-content:"-0500" ... does NOT find the text!
Please upload the eml file containing -0500 in a bug report.
ChrisGreaves
Posts: 729
Joined: Wed Jan 05, 2022 9:29 pm

Re: Speed ​​up file content search

Post by ChrisGreaves »

void wrote: Mon Jan 06, 2025 10:57 amContent indexing will index the content as text.
Hi Void; Please and Thank you: Have I understood this?

"Content indexing" will concern itself ONLY with files which Everything identifies as text documents (DOC, DOT, XLS, TXT etc)
"Content searching" (if I can call it that) will search for strings expressed in binary/hex and can be used against all types of files, those types which Everything classes as TEXT and also files such as executable, graphic - in fact every one of the 582,194 files on my computer?

Thanks again, Chris
void
Developer
Posts: 17310
Joined: Fri Oct 16, 2009 11:31 pm

Re: Speed ​​up file content search

Post by void »

Correct.

Content indexing
Converts the content to text at indexing time.
Everything must settle on one code page.
This is done from a BOM or iFilter.

Content searching (not indexed)
Converts the content to text at search time.
Different code pages can be tried on binary files at search time.
tuska
Posts: 1159
Joined: Thu Jul 13, 2017 9:14 am

Re: Speed ​​up file content search

Post by tuska »

void wrote: Mon Jan 06, 2025 10:57 am
*.eml eml-content:"-0500" -OR- ext:eml eml-content:"-0500" ... does NOT find the text!
Please upload the eml file containing -0500 in a bug report.
I sent you an email on 8.1. in a bug report and attachements (zip file).

PS:
Thanks for all the details above!
void
Developer
Posts: 17310
Joined: Fri Oct 16, 2009 11:31 pm

Re: Speed ​​up file content search

Post by void »

Thank you for the eml file.

-0500 is in the eml header. (not the message)

I will consider changing the built-in eml content handler to include the header in the content.
Post Reply