Hash symbol in numeric character references

My question is very similar to this one, but I didn't find my answer there.

From the link, I gathered that HTML supports displaying ISO 8859/1 8-bit single-byte graphic characters through numeric representations such as:

&32;

for Space.

&33;

for an exclamation mark.

The above will not be resolved unless the numbers are prefixed with #:

&#32;

for Space will be resolved.

&#33;

for exclamation mark will be resolved.
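The difference can be checked outside a browser. The snippet below is a minimal sketch that assumes Python's standard `html.unescape` is an acceptable stand-in for an HTML parser here; the `samples` list and `decoded` dictionary are names made up for the illustration.

```python
from html import unescape

# Spellings with and without the "#" prefix, plus one named reference.
samples = ["&#32;", "&#33;", "&#x21;", "&32;", "&33;", "&amp;"]

# Map each spelling to what the decoder turns it into.
decoded = {source: unescape(source) for source in samples}

for source, result in decoded.items():
    # repr() makes the decoded space character visible.
    print(f"{source!r} -> {result!r}")
```

The forms with `#` come back as the space and exclamation mark characters, while `&32;` and `&33;` are returned unchanged because nothing marks them as numeric.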

What is the reason for prefixing these numbers with a # character, when the ISO Latin 1 character entities do not follow the same convention? I assume the HTML parser is written to handle this, but it would be great to know why this convention was introduced in the first place.


1 answer


The details of how they are processed are given in the parsing section of the HTML 5 specification. You will especially want to read the references to "consume a character reference".

HTML 3 was never a current standard, and even HTML 3.2 was superseded long ago. The ISO documents are also irrelevant in this context.



Reading a parsing algorithm can be painful (it at least takes some getting used to), but it is guaranteed to be correct.
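To give a feel for what that part of the algorithm does, here is a heavily simplified sketch, not the specification's actual state machine. It assumes every reference ends with a semicolon, uses a tiny made-up `NAMED` table in place of the full list of named references, and skips the specification's handling of missing semicolons and of the Windows-1252 range. The names `decode` and `decode_reference` are hypothetical.

```python
import re

# Hypothetical miniature table; the real specification lists far more names.
NAMED = {"amp": "&", "lt": "<", "gt": ">", "quot": '"'}

# After "&": "#x" + hex digits, "#" + decimal digits, or a name, then ";".
_REFERENCE = re.compile(r"&(#[xX][0-9a-fA-F]+|#[0-9]+|[A-Za-z][A-Za-z0-9]*);")


def decode_reference(match):
    body = match.group(1)
    if body.startswith("#"):
        # The "#" selects the numeric branch: the rest is a code point.
        digits = body[1:]
        if digits[0] in "xX":
            code_point = int(digits[1:], 16)
        else:
            code_point = int(digits, 10)
        # Null, surrogates and out-of-range values become U+FFFD.
        if code_point == 0 or code_point > 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
            return "\ufffd"
        return chr(code_point)
    # No "#": the text is looked up as a name; unknown names stay as written.
    return NAMED.get(body, match.group(0))


def decode(text):
    return _REFERENCE.sub(decode_reference, text)
```

In this sketch `decode("&#33;")` takes the numeric branch and yields `!`, whereas `decode("&32;")` matches neither branch and is left alone: the `#` is the only thing that distinguishes a number from a name.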
