content command can not search row chinese word-break in pdf
such as:
in "Test.pdf"
其他人也問
了以下問題
When type content:"也問" content:"了以下", found "Test.pdf".
But when type content:"也問了以下", search nothing...
content command can not search row word-break in pd
Re: content command can not search row word-break in pd
Thank you for your feedback odinhuang,
Normally, Everything doesn't ignore newlines in file content.
Consider enabling Ignore White-space form the Search menu and search for:
content:"也問了以下"
or, search for:
nows:content:"也問了以下"
(nows: = no white-space search modifier)
or, search for:
content:<也問 了以下>
(which expands to content:"也問" content:"了以下" )
or, match the newline (with possible carriage return):
content:"也問
:
:了以下"
content:"也問
:了以下"
content:"也問&newline:了以下"
content:"也問&\r:&\n:了以下"
content:"也問&\n:了以下"
Character entities
Normally, Everything doesn't ignore newlines in file content.
Consider enabling Ignore White-space form the Search menu and search for:
content:"也問了以下"
or, search for:
nows:content:"也問了以下"
(nows: = no white-space search modifier)
or, search for:
content:<也問 了以下>
(which expands to content:"也問" content:"了以下" )
or, match the newline (with possible carriage return):
content:"也問
:
:了以下"
content:"也問
:了以下"
content:"也問&newline:了以下"
content:"也問&\r:&\n:了以下"
content:"也問&\n:了以下"
Character entities
Re: content command can not search row word-break in pd
Thank you very much.
It works!
It works!
Re: content command can not search row word-break in pd
Is &newline: documented anywhere? I didn't even realize &newline; was an html entity. This would be cool to document in the reference.void wrote: ↑Sun Dec 12, 2021 5:44 am content:"也問
:
:了以下"
content:"也問
:了以下"
content:"也問&newline:了以下"
Character entities
Can you share your HTML entity table, or where you sourced it from? The standards seem messy and uncoordinated.
Re: content command can not search row word-break in pd
´ ´:
' &apos:
& &:
* &ast:
\ &bsol:
¦ ¦:
¸ ¸:
¸ ¸la:
¢ ¢:
· ¢erdot:
® &circledr:
: &colon:
, &comma:
@ &commat:
© ©:
¤ ¤:
° °:
´ &diacriticalacute:
¨ &die:
$ &dollar:
¨ &dot:
¨ &doubledot:
= &equals:
ʬ &euro:
! &excl:
½ ½:
¼ ¼:
¾ ¾:
` &grave:
> >:
^ &hat:
¡ ¡:
¿ ¿:
« «:
{ &lbrace:
[ &lbrack:
{ &lcub:
_ &lowbar:
( &lpar:
[ &lsqb:
< <:
¯ ¯:
µ µ:
* &midast:
· ·:
U+000A &newline:
 :
¬ ¬:
# &num:
ª ª:
º º:
¯ &overbar:
¶ ¶:
% &percent:
. &period:
+ &plus:
± &plusminus:
± ±:
± &pm:
£ £:
? &quest:
" ":
» »:
} &rbrace:
] &rbrack:
} &rcub:
® ®:
) &rpar:
] &rsqb:
§ §:
; &semi:
U+00AD ­:
/ &sol:
¯ &strns:
¹ ¹:
² ²:
³ ³:
U+0009 &tab:
¨ ¨:
| &verbar:
| &vert:
| &verticalline:
¥ ¥:
https://dev.w3.org/html5/html-author/charref
https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
https://www.w3.org/2003/entities/iso8879doc/isolat1.html
https://www.w3.org/TR/MathML2/isonum.html
I will add full support for ISOlat1 and ISOnum
added the following examples:
nows:content:"也問了以下"
(nows: = no white-space search modifier)
content:"也問&\r:&\n:了以下"
(C \ escape sequences are supported inside & and : )
' &apos:
& &:
* &ast:
\ &bsol:
¦ ¦:
¸ ¸:
¸ ¸la:
¢ ¢:
· ¢erdot:
® &circledr:
: &colon:
, &comma:
@ &commat:
© ©:
¤ ¤:
° °:
´ &diacriticalacute:
¨ &die:
$ &dollar:
¨ &dot:
¨ &doubledot:
= &equals:
ʬ &euro:
! &excl:
½ ½:
¼ ¼:
¾ ¾:
` &grave:
> >:
^ &hat:
¡ ¡:
¿ ¿:
« «:
{ &lbrace:
[ &lbrack:
{ &lcub:
_ &lowbar:
( &lpar:
[ &lsqb:
< <:
¯ ¯:
µ µ:
* &midast:
· ·:
U+000A &newline:
 :
¬ ¬:
# &num:
ª ª:
º º:
¯ &overbar:
¶ ¶:
% &percent:
. &period:
+ &plus:
± &plusminus:
± ±:
± &pm:
£ £:
? &quest:
" ":
» »:
} &rbrace:
] &rbrack:
} &rcub:
® ®:
) &rpar:
] &rsqb:
§ §:
; &semi:
U+00AD ­:
/ &sol:
¯ &strns:
¹ ¹:
² ²:
³ ³:
U+0009 &tab:
¨ ¨:
| &verbar:
| &vert:
| &verticalline:
¥ ¥:
https://dev.w3.org/html5/html-author/charref
https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
https://www.w3.org/2003/entities/iso8879doc/isolat1.html
https://www.w3.org/TR/MathML2/isonum.html
I will add full support for ISOlat1 and ISOnum
added the following examples:
nows:content:"也問了以下"
(nows: = no white-space search modifier)
content:"也問&\r:&\n:了以下"
(C \ escape sequences are supported inside & and : )