PHP's htmlspecialchars_decode in JavaScript

Here’s what our current JavaScript equivalent to PHP's htmlspecialchars_decode looks like.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
module.exports = function htmlspecialchars_decode (string, quoteStyle) { // eslint-disable-line camelcase
// discuss at: http://locutus.io/php/htmlspecialchars_decode/
// original by: Mirek Slugen
// improved by: Kevin van Zonneveld (http://kvz.io)
// bugfixed by: Mateusz "loonquawl" Zalega
// bugfixed by: Onno Marsman (https://twitter.com/onnomarsman)
// bugfixed by: Brett Zamir (http://brett-zamir.me)
// bugfixed by: Brett Zamir (http://brett-zamir.me)
// input by: ReverseSyntax
// input by: Slawomir Kaniecki
// input by: Scott Cariss
// input by: Francois
// input by: Ratheous
// input by: Mailfaker (http://www.weedem.fr/)
// revised by: Kevin van Zonneveld (http://kvz.io)
// reimplemented by: Brett Zamir (http://brett-zamir.me)
// example 1: htmlspecialchars_decode("<p>this -&gt; &quot;</p>", 'ENT_NOQUOTES')
// returns 1: '<p>this -> &quot;</p>'
// example 2: htmlspecialchars_decode("&amp;quot;")
// returns 2: '&quot;'
var optTemp = 0
var i = 0
var noquotes = false
if (typeof quoteStyle === 'undefined') {
quoteStyle = 2
}
string = string.toString()
.replace(/&lt;/g, '<')
.replace(/&gt;/g, '>')
var OPTS = {
'ENT_NOQUOTES': 0,
'ENT_HTML_QUOTE_SINGLE': 1,
'ENT_HTML_QUOTE_DOUBLE': 2,
'ENT_COMPAT': 2,
'ENT_QUOTES': 3,
'ENT_IGNORE': 4
}
if (quoteStyle === 0) {
noquotes = true
}
if (typeof quoteStyle !== 'number') {
// Allow for a single string or an array of string flags
quoteStyle = [].concat(quoteStyle)
for (i = 0; i < quoteStyle.length; i++) {
// Resolve string input to bitwise e.g. 'PATHINFO_EXTENSION' becomes 4
if (OPTS[quoteStyle[i]] === 0) {
noquotes = true
} else if (OPTS[quoteStyle[i]]) {
optTemp = optTemp | OPTS[quoteStyle[i]]
}
}
quoteStyle = optTemp
}
if (quoteStyle & OPTS.ENT_HTML_QUOTE_SINGLE) {
// PHP doesn't currently escape if more than one 0, but it should:
string = string.replace(/&#0*39;/g, "'")
// This would also be useful here, but not a part of PHP:
// string = string.replace(/&apos;|&#x0*27;/g, "'");
}
if (!noquotes) {
string = string.replace(/&quot;/g, '"')
}
// Put this in last place to avoid escape being double-decoded
string = string.replace(/&amp;/g, '&')
return string
}
[ View on GitHub | Edit on GitHub | Source on GitHub ]

How to use

You you can install via npm install locutus and require it via require('locutus/php/strings/htmlspecialchars_decode'). You could also require the strings module in full so that you could access strings.htmlspecialchars_decode instead.

If you intend to target the browser, you can then use a module bundler such as Browserify, webpack or rollup.js.

ES5/ES6

This function targets ES5, but as of Locutus 2.0.2 we also support ES6 functions. Locutus transpiles to ES5 before publishing to npm.

A community effort

Not unlike Wikipedia, Locutus is an ongoing community effort. Our philosophy follows The McDonald’s Theory. This means that we don't consider it to be a bad thing that many of our functions are first iterations, which may still have their fair share of issues. We hope that these flaws will inspire others to come up with better ideas.

This way of working also means that we don't offer any production guarantees, and recommend to use Locutus inspiration and learning purposes only.

Examples

Please note that these examples are distilled from test cases that automatically verify our functions still work correctly. This could explain some quirky ones.

#codeexpected result
1htmlspecialchars_decode("<p>this -&gt; &quot;</p>", 'ENT_NOQUOTES')'<p>this -> &quot;</p>'
2htmlspecialchars_decode("&amp;quot;")'&quot;'

« More PHP strings functions