PHP get_html_translation_table() Function

❮ PHP String Reference

Example

Print the translation table used by the htmlspecialchars function:

<?php
print_r (get_html_translation_table()); // HTML_SPECIALCHARS is default.
?>
Try it Yourself »

Definition and Usage

The get_html_translation_table() function returns the translation table used by the htmlentities() and htmlspecialchars() functions.

Tip: Some characters can be encoded several ways. The get_html_translation_table() function returns the most common encoding.


Syntax

get_html_translation_table(function,flags,character-set)

Parameter Values

Parameter Description
function Optional. Specifies which translation table to return.

Possible values:

  • HTML_SPECIALCHARS - Default. Translates some characters that need URL-encoding to be shown properly on a HTML page
  • HTML_ENTITIES - Translates all characters that need URL-encoding to be shown properly on a HTML page
flags Optional. Specifies which quotes the table will contain and which document type the table is for.

The available quote styles are:

  • ENT_COMPAT - Default. Table contains entities for double quotes, not single quotes
  • ENT_QUOTES - Table contains entities for double and single quotes
  • ENT_NOQUOTES - Table will not contain entities for double and single quotes

Additional flags for specifying which doctype the table is for:

  • ENT_HTML401 - Default. Table for HTML 4.01
  • ENT_HTML5 - Table for HTML 5
  • ENT_XML1 - Table for XML 1
  • ENT_XHTML - Table for XHTML
character-set Optional. A string that specifies which character-set to use.

Allowed values are:

  • UTF-8 - Default. ASCII compatible multi-byte 8-bit Unicode
  • ISO-8859-1 - Western European
  • ISO-8859-15 - Western European (adds the Euro sign + French and Finnish letters missing in ISO-8859-1)
  • cp866 - DOS-specific Cyrillic charset
  • cp1251 - Windows-specific Cyrillic charset
  • cp1252 - Windows specific charset for Western European
  • KOI8-R - Russian
  • BIG5 - Traditional Chinese, mainly used in Taiwan
  • GB2312 - Simplified Chinese, national standard character set
  • BIG5-HKSCS - Big5 with Hong Kong extensions
  • Shift_JIS - Japanese
  • EUC-JP - Japanese
  • MacRoman - Character-set that was used by Mac OS

Note: Unrecognized character-sets will be ignored and replaced by ISO-8859-1 in versions prior to PHP 5.4. As of PHP 5.4, it will be ignored an replaced by UTF-8.



Technical Details

Return Value: Returns the translation table as an array, with the original characters as keys and entities as values
PHP Version: 4+
Changelog: The default value for the character-set parameter was changed to UTF-8 in PHP 5

The additional flags for specifying which doctype the table is for; ENT_HTML401, ENT_HTML5, ENT_XML1 and ENT_XHTML were added in PHP 5.4

The character-set parameter was added in PHP 5.3.4

More Examples

Example

Table for HTML_SPECIALCHARS:

<?php
print_r (get_html_translation_table(HTML_SPECIALCHARS));
?>

Displaying character and entity name:

Array
(
  ["] => &quot;
  [&] => &amp;
  [<] => &lt;
  [>] => &gt;
)

Example

Table for HTML_ENTITIES:

<?php
print_r (get_html_translation_table(HTML_ENTITIES));
?>

Displaying character and entity name:

Array
(
  ["] => &quot;
  [&] => &amp;
  [<] => &lt;
  [>] => &gt;
  [ ] => &nbsp;
  [¡] => &iexcl;
  [¢] => &cent;
  [£] => &pound;
  [¤] => &curren;
  [¥] => &yen;
  [¦] => &brvbar;
  [§] => &sect;
  [¨] => &uml;
  [©] => &copy;
  [ª] => &ordf;
  [«] => &laquo;
  [¬] => &not;
  [�­] => &shy;
  [®] => &reg;
  [¯] => &macr;
  [°] => &deg;
  [±] => &plusmn;
  [²] => &sup2;
  [³] => &sup3;
  [´] => &acute;
  [µ] => &micro;
  [¶] => &para;
  [·] => &middot;
  [¸] => &cedil;
  [¹] => &sup1;
  [º] => &ordm;
  [»] => &raquo;
  [¼] => &frac14;
  [½] => &frac12;
  [¾] => &frac34;
  [¿] => &iquest;
  [À] => &Agrave;
  [Á] => &Aacute;
  [Â] => &Acirc;
  [Ã] => &Atilde;
  [Ä] => &Auml;
  [Å] => &Aring;
  [Æ] => &AElig;
  [Ç] => &Ccedil;
  [È] => &Egrave;
  [É] => &Eacute;
  [Ê] => &Ecirc;
  [Ë] => &Euml;
  [Ì] => &Igrave;
  [Í] => &Iacute;
  [Î] => &Icirc;
  [Ï] => &Iuml;
  [Ð] => &ETH;
  [Ñ] => &Ntilde;
  [Ò] => &Ograve;
  [Ó] => &Oacute;
  [Ô] => &Ocirc;
  [Õ] => &Otilde;
  [Ö] => &Ouml;
  [×] => &times;
  [Ø] => &Oslash;
  [Ù] => &Ugrave;
  [Ú] => &Uacute;
  [Û] => &Ucirc;
  [Ü] => &Uuml;
  [Ý] => &Yacute;
  [Þ] => &THORN;
  [ß] => &szlig;
  [à] => &agrave;
  [á] => &aacute;
  [â] => &acirc;
  [ã] => &atilde;
  [ä] => &auml;
  [å] => &aring;
  [æ] => &aelig;
  [ç] => &ccedil;
  [è] => &egrave;
  [é] => &eacute;
  [ê] => &ecirc;
  [ë] => &euml;
  [ì] => &igrave;
  [í] => &iacute;
  [î] => &icirc;
  [ï] => &iuml;
  [ð] => &eth;
  [ñ] => &ntilde;
  [ò] => &ograve;
  [ó] => &oacute;
  [ô] => &ocirc;
  [õ] => &otilde;
  [ö] => &ouml;
  [÷] => &divide;
  [ø] => &oslash;
  [ù] => &ugrave;
  [ú] => &uacute;
  [û] => &ucirc;
  [ü] => &uuml;
  [ý] => &yacute;
  [þ] => &thorn;
  [ÿ] => &yuml;
  [Œ] => &OElig;
  [œ] => &oelig;
  [Š] => &Scaron;
  [š] => &scaron;
  [Ÿ] => &Yuml;
  [ƒ] => &fnof;
  [ˆ] => &circ;
  [˜] => &tilde;
  [Α] => &Alpha;
  [Β] => &Beta;
  [Γ] => &Gamma;
  [Δ] => &Delta;
  [Ε] => &Epsilon;
  [Ζ] => &Zeta;
  [Η] => &Eta;
  [Θ] => &Theta;
  [Ι] => &Iota;
  [Κ] => &Kappa;
  [Λ] => &Lambda;
  [Μ] => &Mu;
  [Ν] => &Nu;
  [Ξ] => &Xi;
  [Ο] => &Omicron;
  [Π] => &Pi;
  [Ρ] => &Rho;
  [Σ] => &Sigma;
  [Τ] => &Tau;
  [Υ] => &Upsilon;
  [Φ] => &Phi;
  [Χ] => &Chi;
  [Ψ] => &Psi;
  [Ω] => &Omega;
  [α] => &alpha;
  [β] => &beta;
  [γ] => &gamma;
  [δ] => &delta;
  [ε] => &epsilon;
  [ζ] => &zeta;
  [η] => &eta;
  [θ] => &theta;
  [ι] => &iota;
  [κ] => &kappa;
  [λ] => &lambda;
  [μ] => &mu;
  [ν] => &nu;
  [ξ] => &xi;
  [ο] => &omicron;
  [π] => &pi;
  [ρ] => &rho;
  [ς] => &sigmaf;
  [σ] => &sigma;
  [τ] => &tau;
  [υ] => &upsilon;
  [φ] => &phi;
  [χ] => &chi;
  [ψ] => &psi;
  [ω] => &omega;
  [ϑ] => &thetasym;
  [ϒ] => &upsih;
  [ϖ] => &piv;
  [ ] => &ensp;
  [ ] => &emsp;
  [ ] => &thinsp;
  [‌] => &zwnj;
  [‍] => &zwj;
  [‎] => &lrm;
  [‏] => &rlm;
  [–] => &ndash;
  [—] => &mdash;
  [‘] => &lsquo;
  [’] => &rsquo;
  [‚] => &sbquo;
  [“] => &ldquo;
  [”] => &rdquo;
  [„] => &bdquo;
  [†] => &dagger;
  [‡] => &Dagger;
  [•] => &bull;
  […] => &hellip;
  [‰] => &permil;
  [′] => &prime;
  [″] => &Prime;
  [‹] => &lsaquo;
  [›] => &rsaquo;
  [‾] => &oline;
  [⁄] => &frasl;
  [€] => &euro;
  [ℑ] => &image;
  [℘] => &weierp;
  [ℜ] => &real;
  [™] => &trade;
  [ℵ] => &alefsym;
  [←] => &larr;
  [↑] => &uarr;
  [→] => &rarr;
  [↓] => &darr;
  [↔] => &harr;
  [↵] => &crarr;
  [⇐] => &lArr;
  [⇑] => &uArr;
  [⇒] => &rArr;
  [⇓] => &dArr;
  [⇔] => &hArr;
  [∀] => &forall;
  [∂] => &part;
  [∃] => &exist;
  [∅] => &empty;
  [∇] => &nabla;
  [∈] => &isin;
  [∉] => &notin;
  [∋] => &ni;
  [∏] => &prod;
  [∑] => &sum;
  [−] => &minus;
  [∗] => &lowast;
  [√] => &radic;
  [∝] => &prop;
  [∞] => &infin;
  [∠] => &ang;
  [∧] => &and;
  [∨] => &or;
  [∩] => &cap;
  [∪] => &cup;
  [∫] => &int;
  [∴] => &there4;
  [∼] => &sim;
  [≅] => &cong;
  [≈] => &asymp;
  [≠] => &ne;
  [≡] => &equiv;
  [≤] => &le;
  [≥] => &ge;
  [⊂] => &sub;
  [⊃] => &sup;
  [⊄] => &nsub;
  [⊆] => &sube;
  [⊇] => &supe;
  [⊕] => &oplus;
  [⊗] => &otimes;
  [⊥] => &perp;
  [⋅] => &sdot;
  [⌈] => &lceil;
  [⌉] => &rceil;
  [⌊] => &lfloor;
  [⌋] => &rfloor;
  [〈] => &lang;
  [〉] => &rang;
  [◊] => &loz;
  [♠] => &spades;
  [♣] => &clubs;
  [♥] => &hearts;
  [♦] => &diams;
)


❮ PHP String Reference
Copyright 1999-2023 by Refsnes Data. All Rights Reserved.