this is xhtml
? expect ?
�x3f; should fail
&#}; should fail
? expect ?
? should fail
&#-10; should fail
� should fail
 should fail
 should fail
” error, but worked around by converting to "
𿿾 should fail
􃀀 should fail
� should fail
� should fail
Ж cyrillic Zh
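
The numeric cases above can be approximated with a small check. This is only a sketch, not the checker these cases were written for: `decode_numeric_ref` is a hypothetical helper, and the range test assumes the XML 1.0 `Char` production. It therefore rejects malformed references such as `&#};` and `&#-10;`, but it does not reproduce every expectation in the list (for example, code point 148 passes this range test even though the list treats it as an error).

```python
import re

# Hypothetical helper, not part of any tool under test.
# Matches only well-formed references: &#DDD; or &#xHHH;
NUMERIC_REF = re.compile(r"&#(?:([0-9]+)|[xX]([0-9A-Fa-f]+));")


def decode_numeric_ref(ref):
    """Return the referenced character, or None if the reference should fail."""
    m = NUMERIC_REF.fullmatch(ref)
    if m is None:
        return None  # malformed syntax, e.g. "&#};" or "&#-10;"
    if m.group(1) is not None:
        cp = int(m.group(1), 10)
    else:
        cp = int(m.group(2), 16)
    # Assumption: accept exactly the XML 1.0 Char production.
    allowed = (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )
    return chr(cp) if allowed else None


for case in ["&#x3f;", "&#};", "&#-10;", "&#0;", "&#1046;"]:
    print(case, "->", decode_numeric_ref(case))
```

A stricter checker would also reject noncharacters and C1 controls; that policy is left out here because the list does not state it.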

& ampersand
link
≫⃒ from html5. unrecognized.
⟨ ⟩ are different in html4 and html5
& should fail
&---; unrecognized
& & & complain about bare ampersands
' expect '
&asdfghjkl; unrecognized
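
The named cases above can be cross-checked against the entity tables in Python's standard library. This is a sketch under assumptions: `is_recognized` is a hypothetical helper, `html.entities.name2codepoint` stands in for the HTML 4 table, and `html.entities.html5` stands in for the HTML5 table. A checker limited to the HTML 4 table would not recognize names that exist only in HTML5.

```python
from html.entities import html5, name2codepoint


def is_recognized(name, table="html5"):
    """Report whether `name` (without '&' and ';') is a known entity name."""
    if table == "html4":
        return name in name2codepoint
    # Keys in the html5 table include the trailing semicolon.
    return name + ";" in html5


# Names that are not defined anywhere stay unrecognized.
for name in ["amp", "apos", "asdfghjkl", "---"]:
    print(name, is_recognized(name, "html4"), is_recognized(name, "html5"))

# lang/rang map to different code points in the two tables.
print(hex(name2codepoint["lang"]), hex(ord(html5["lang;"])))
print(hex(name2codepoint["rang"]), hex(ord(html5["rang;"])))
```

The last two lines show why the angle-bracket case is listed separately: the same name resolves to a different character depending on which table is consulted.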