content command can not search row word-break in pd

Discussion related to "Everything" 1.5 Alpha.
Post Reply
odinhuang
Posts: 2
Joined: Sun Dec 12, 2021 5:32 am

content command can not search row word-break in pd

Post by odinhuang »

content command can not search row chinese word-break in pdf

such as:
in "Test.pdf"
其他人也問
了以下問題

When type content:"也問" content:"了以下", found "Test.pdf".
But when type content:"也問了以下", search nothing...
void
Developer
Posts: 16754
Joined: Fri Oct 16, 2009 11:31 pm

Re: content command can not search row word-break in pd

Post by void »

Thank you for your feedback odinhuang,

Normally, Everything doesn't ignore newlines in file content.

Consider enabling Ignore White-space form the Search menu and search for:

content:"也問了以下"



or, search for:

nows:content:"也問了以下"
(nows: = no white-space search modifier)



or, search for:

content:<也問 了以下>
(which expands to content:"也問" content:"了以下" )



or, match the newline (with possible carriage return):

content:"也問&#x0d:&#x0a:了以下"
content:"也問&#x0a:了以下"
content:"也問&newline:了以下"
content:"也問&\r:&\n:了以下"
content:"也問&\n:了以下"


Character entities
odinhuang
Posts: 2
Joined: Sun Dec 12, 2021 5:32 am

Re: content command can not search row word-break in pd

Post by odinhuang »

Thank you very much.
It works!
raccoon
Posts: 1017
Joined: Thu Oct 18, 2018 1:24 am

Re: content command can not search row word-break in pd

Post by raccoon »

void wrote: Sun Dec 12, 2021 5:44 am content:"也問&#x0d:&#x0a:了以下"
content:"也問&#x0a:了以下"
content:"也問&newline:了以下"
Character entities
Is &newline: documented anywhere? I didn't even realize &newline; was an html entity. This would be cool to document in the reference.

Can you share your HTML entity table, or where you sourced it from? The standards seem messy and uncoordinated.
void
Developer
Posts: 16754
Joined: Fri Oct 16, 2009 11:31 pm

Re: content command can not search row word-break in pd

Post by void »

´ &acute:
' &apos:
& &amp:
* &ast:
\ &bsol:
¦ &brvbar:
¸ &cedil:
¸ &cedilla:
¢ &cent:
· &centerdot:
® &circledr:
: &colon:
, &comma:
@ &commat:
© &copy:
¤ &curren:
° &deg:
´ &diacriticalacute:
¨ &die:
$ &dollar:
¨ &dot:
¨ &doubledot:
= &equals:
ʬ &euro:
! &excl:
½ &frac12:
¼ &frac14:
¾ &frac34:
` &grave:
> &gt:
^ &hat:
¡ &iexcl:
¿ &iquest:
« &laquo:
{ &lbrace:
[ &lbrack:
{ &lcub:
_ &lowbar:
( &lpar:
[ &lsqb:
< &lt:
¯ &macr:
µ &micro:
* &midast:
· &middot:
U+000A &newline:
  &nbsp:
¬ &not:
# &num:
ª &ordf:
º &ordm:
¯ &overbar:
¶ &para:
% &percent:
. &period:
+ &plus:
± &plusminus:
± &plusmn:
± &pm:
£ &pound:
? &quest:
" &quot:
» &raquo:
} &rbrace:
] &rbrack:
} &rcub:
® &reg:
) &rpar:
] &rsqb:
§ &sect:
; &semi:
U+00AD &shy:
/ &sol:
¯ &strns:
¹ &sup1:
² &sup2:
³ &sup3:
U+0009 &tab:
¨ &uml:
| &verbar:
| &vert:
| &verticalline:
¥ &yen:

https://dev.w3.org/html5/html-author/charref
https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
https://www.w3.org/2003/entities/iso8879doc/isolat1.html
https://www.w3.org/TR/MathML2/isonum.html

I will add full support for ISOlat1 and ISOnum



added the following examples:
nows:content:"也問了以下"
(nows: = no white-space search modifier)

content:"也問&\r:&\n:了以下"
(C \ escape sequences are supported inside & and : )
Post Reply