Question 1

What is the difference between named and numeric HTML entities?

Accepted Answer

Named entities use a human-readable alias such as &amp; for an ampersand or &lt; for a less-than sign, which makes source code easier to read and maintain. Numeric entities (formally, numeric character references) use the Unicode code point in either decimal (&#38;) or hexadecimal (&#x26;) form. Every named entity has an equivalent numeric form, but not every Unicode character has a named entity. Numeric entities are the universal fallback and work in any HTML or XML context, while named entities require the browser or parser to recognize the specific alias; XML, for example, predefines only &amp;, &lt;, &gt;, &quot;, and &apos;.
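As a rough illustration, the sketch below uses Python's standard html module to show that the named, decimal, and hexadecimal forms all decode to the same character. The helper names (to_decimal_entity, to_hex_entity, to_named_entity) are made up for this example.

```python
import html
from html.entities import codepoint2name  # HTML 4 names only, keyed by code point


def to_decimal_entity(ch: str) -> str:
    """Return the decimal numeric reference for a single character."""
    return f"&#{ord(ch)};"


def to_hex_entity(ch: str) -> str:
    """Return the hexadecimal numeric reference for a single character."""
    return f"&#x{ord(ch):X};"


def to_named_entity(ch: str):
    """Return the named entity if this table has one, otherwise None."""
    name = codepoint2name.get(ord(ch))
    return f"&{name};" if name else None


# Three spellings of the same ampersand.
forms = ["&amp;", to_decimal_entity("&"), to_hex_entity("&")]
decoded = [html.unescape(form) for form in forms]
print(forms)    # ['&amp;', '&#38;', '&#x26;']
print(decoded)  # ['&', '&', '&']

# An emoji has a numeric form but no named entity.
print(to_named_entity("\U0001F600"), to_hex_entity("\U0001F600"))
```

Note that codepoint2name covers only the older HTML 4 entity set, so a None result here means "not in that table" rather than "no name exists anywhere".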

Question 2

Which characters must be encoded in HTML content?

Accepted Answer

At minimum, encode the five characters that have special meaning in HTML: the ampersand (&), less-than sign (<), greater-than sign (>), double quote ("), and single quote/apostrophe ('). The ampersand starts an entity reference, angle brackets delimit tags, and quotes delimit attribute values. Strictly speaking, text content only requires encoding the ampersand and the less-than sign, and quotes only matter inside attribute values, but encoding all five everywhere is the common, safe default. Failing to encode these characters can break your markup or create security vulnerabilities. Other characters like non-breaking spaces, em dashes, and emoji are optional to encode, but encoding them can help when the document's character encoding is not reliably declared or preserved.
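For a concrete picture of the five characters being encoded, here is a minimal sketch using Python's standard html.escape. The sample string is invented for illustration.

```python
import html

# A made-up string containing all five special characters.
raw = '<a href="x">Tom & Jerry\'s</a>'

# quote=True (the default) also encodes double and single quotes.
escaped = html.escape(raw, quote=True)
print(escaped)
# &lt;a href=&quot;x&quot;&gt;Tom &amp; Jerry&#x27;s&lt;/a&gt;

# Decoding restores the original text.
assert html.unescape(escaped) == raw
```

Python emits the apostrophe as the numeric form &#x27; rather than a named entity; other libraries may choose a different but equivalent spelling.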

Question 3

Does HTML entity encoding prevent all XSS attacks?

Accepted Answer

HTML entity encoding is highly effective at preventing XSS in HTML body content because it neutralizes the angle brackets that injected tags and scripts depend on. However, it is not sufficient in all contexts. Encoding requirements differ depending on where user data appears: within HTML attributes you also need to encode quotes, inside JavaScript blocks you need JavaScript-specific escaping, within URLs you need URL encoding, and inside CSS you need CSS escaping. The OWASP recommendation is to apply context-specific output encoding for each insertion point rather than relying on a single encoding strategy.
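The idea of one encoder per insertion point can be sketched in Python as below. The function names are hypothetical, the JavaScript helper is only one possible approach (JSON serialization plus escaping of <, >, and &), and CSS escaping is left out. Treat this as an illustration, not a vetted security library.

```python
import html
import json
from urllib.parse import quote


def for_html_body(value: str) -> str:
    """Encode for text between tags: &, <, and > are neutralized."""
    return html.escape(value, quote=False)


def for_html_attr(value: str) -> str:
    """Encode for a quoted attribute value: quotes are encoded too."""
    return html.escape(value, quote=True)


def for_url_component(value: str) -> str:
    """Percent-encode for a single query-string value or path segment."""
    return quote(value, safe="")


def for_js_string(value: str) -> str:
    """Produce a JavaScript string literal that is safe inside a <script> block.

    json.dumps handles quotes, backslashes, and non-ASCII characters; the extra
    replacements keep sequences like </script> from ending the block early.
    """
    literal = json.dumps(value)
    return (literal.replace("<", "\\u003c")
                   .replace(">", "\\u003e")
                   .replace("&", "\\u0026"))


payload = '"><script>alert(1)</script>'
print(for_html_body(payload))
print(for_html_attr(payload))
print(for_url_component(payload))
print(for_js_string(payload))
```

The same input yields four different outputs, which is why a single encoding strategy cannot cover every context.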

Question 4

Are HTML entities the same as URL encoding or Base64?

Accepted Answer

No, these are three distinct encoding schemes for different purposes. HTML entities encode characters for safe display within HTML documents. URL encoding (percent-encoding) encodes characters for safe transmission in URLs and query strings, using the %XX format. Base64 encoding converts binary data into an ASCII string representation for transport in text-based protocols. Each encoding scheme has its own reserved characters, syntax, and use cases, and applying the wrong encoding to a given context will produce incorrect results.
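To make the distinction concrete, the following sketch runs one made-up string through all three schemes using only Python's standard library.

```python
import base64
import html
from urllib.parse import quote, unquote

text = "a<b&c"

# HTML entities: safe display inside an HTML document.
as_html = html.escape(text)
print(as_html)  # a&lt;b&amp;c

# Percent-encoding: safe transmission inside a URL component.
as_url = quote(text, safe="")
print(as_url)   # a%3Cb%26c

# Base64: bytes represented as ASCII text; it operates on bytes, not characters.
as_b64 = base64.b64encode(text.encode("utf-8")).decode("ascii")
print(as_b64)

# Each scheme only round-trips through its own decoder.
assert html.unescape(as_html) == text
assert unquote(as_url) == text
assert base64.b64decode(as_b64).decode("utf-8") == text
```

Running the HTML decoder over the percent-encoded string, for instance, leaves it unchanged, which is one way mixing up the schemes produces incorrect output.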

Question 5

Why do some web pages show raw entities like &amp; instead of the intended character?

Accepted Answer

This usually indicates double encoding, where content that was already entity-encoded gets encoded a second time. The ampersand in &amp; itself gets encoded to &amp;amp;, so the browser displays the literal text &amp; instead of the intended character. Double encoding commonly occurs when data passes through multiple processing layers that each apply encoding independently. The fix is to ensure encoding happens exactly once, typically at the final output stage, and that earlier stages pass raw text rather than pre-encoded strings.
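Double encoding is easy to reproduce. In this small Python sketch (the sample text is invented), two layers each call html.escape on the same value:

```python
import html

raw = "Tom & Jerry"

once = html.escape(raw)    # what the page source should contain
twice = html.escape(once)  # what a second, redundant layer produces

print(once)   # Tom &amp; Jerry
print(twice)  # Tom &amp;amp; Jerry

# A browser decodes exactly once, so the doubly encoded value
# renders as the literal text "Tom &amp; Jerry".
rendered = html.unescape(twice)
print(rendered)  # Tom &amp; Jerry
```

Keeping raw text in storage and in intermediate layers, and escaping only when writing the final HTML, avoids the second call.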

Question 6

Should I encode all characters or just the special HTML characters?

Accepted Answer

For most use cases, encoding only the five special HTML characters (<, >, &, ", ') is sufficient and produces the most readable output. Encoding all characters converts every letter, digit, and symbol into its numeric entity form, which dramatically increases the size of the output and makes it nearly impossible to read in source view. Full encoding can be useful in niche scenarios such as obfuscating email addresses from basic scrapers or ensuring absolute compatibility with systems that have unreliable character set handling, but it is not recommended for general use.
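The size difference between the two approaches can be seen with a short Python sketch. encode_all is a hypothetical helper written for this example, and the email address is a placeholder.

```python
import html


def encode_all(text: str) -> str:
    """Convert every character to its decimal numeric entity."""
    return "".join(f"&#{ord(ch)};" for ch in text)


address = "a@b.co"

minimal = html.escape(address)  # nothing special here, so it is unchanged
full = encode_all(address)

print(minimal)  # a@b.co
print(full)     # &#97;&#64;&#98;&#46;&#99;&#111;
print(len(minimal), len(full))

# Both forms decode to the same text in a browser or parser.
assert html.unescape(full) == address
```

Any scraper that decodes entities recovers the address immediately, so full encoding only deters the most basic tools.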
