Some notes on JavaScript Web App Reverse Engineering, specifically focussing on module / dependency identification.
For the more general case of 'module detection' (and related concepts), see these issues:
- pionxzh/wakaru#41
-
Module detection
-
- pionxzh/wakaru#73
-
add a 'module graph'
-
- pionxzh/wakaru#74
-
explore 'AST fingerprinting' for module/function identification (eg. to assist smart / stable renames, etc)
-
And other similar deep dives / reverse engineering / module identification attempts:
- https://github.com/pionxzh/wakaru/issues?q=%22%5Bmodule-detection%5D%22
- pionxzh/wakaru#40
-
[module-detection / smart-rename]
styled-components
/Tailwind-Styled-Component
libs - Reverse Engineered Webpack Tailwind-Styled-Component (0xdevalias' gist)
-
- pionxzh/wakaru#50
-
Support detect and replace
swc
'sruntime-helpers
-
- pionxzh/wakaru#55
-
Support detect and replace
microsoft/tslib
'sruntime-helpers
-
- pionxzh/wakaru#79
-
[module-detection]
zustand
- React state management
-
- pionxzh/wakaru#80
-
[module-detection]
js-xss
-
- pionxzh/wakaru#86
-
[module-detection]
statsig-io/js-client
-
- pionxzh/wakaru#87
-
[module-detection]
DataDog/browser-sdk
-
- pionxzh/wakaru#88
-
[module-detection]
TanStack/query
-
- pionxzh/wakaru#89
-
[module-detection]
radix-ui/primitives
-
- ?etc?
- pionxzh/wakaru#40
- https://github.com/j4k0xb/webcrack/issues?q=%22%5Bplugin%5D%22
- j4k0xb/webcrack#143
-
[plugin]
Add support fordata-sentry-component
/data-sentry-element
/data-sentry-source-file
(from@sentry/babel-plugin-component-annotate
)
-
- j4k0xb/webcrack#151
-
[plugin]
plugin to support WordPress Gutenberg specific blocks features (including how it injectswindow.React
,window.wp.element
, etc) within JSX decompilation
-
- j4k0xb/webcrack#152
-
[plugin]
plugin to support unminifyinggoober
CSS-in-JS library patterns + related JSX decompilation
-
- ?etc?
- j4k0xb/webcrack#143
- https://github.com/postcss/autoprefixer
-
Autoprefixer PostCSS plugin to parse CSS and add vendor prefixes to CSS rules using values from Can I Use.
-
- https://github.com/lumeland/autoprefixer-deno
-
Autoprefixer for Deno Scripts to transform the source code of autoprefixer for Deno compatibility.
-
I had a file that contained content like this:
// ..snip..
class r extends o {
check(e) {
return !e.value.split(/\s+/).some(e => {
let t = e.toLowerCase();
return t === "reverse" || t === "alternate-reverse";
});
}
}
r.names = ["animation", "animation-direction"];
module.exports = r;
Which I searched for on GitHub Code Search:
Which seemed to point me towards autoprefixer
/ autoprefixer-deno
's lib/hacks/animation.js
:
- https://github.com/postcss/autoprefixer/blob/541295c0e6dd348db2d3f52772b59cd403c59d29/lib/hacks/animation.js#L15
- https://github.com/lumeland/autoprefixer-deno/blob/f44e39599235d48715e1608aa689107f46cb2525/deno/lib/hacks/animation.js#L15
- https://github.com/browserslist/caniuse-lite
-
A smaller version of
caniuse-db
, with only the essentials!
-
- https://github.com/Fyrd/caniuse
-
Raw browser/feature support data from caniuse.com
-
I found the following confusing looking code in a bundle I was exploring (prettified here):
module.exports = {
A: {
A: {
1: "A B",
2: "J E F G PC"
},
B: {
1: "0 1 2 3 4 C K L H M N O P Q R S T U V W X Y Z a b c d e f g h i j k l m r s t u v w x y z D"
},
C: {
1: "6 7 8 9 H M N O n o p AB BB CB DB EB FB GB HB IB JB KB LB MB NB OB PB QB RB SB TB UB VB WB XB YB ZB aB bB cB",
2: "5 QC 3B I J E F G A B C K L RC SC",
129: "dB eB fB",
769: "gB 4B",
1281: "0 1 2 3 4 hB 5B iB jB kB lB mB nB oB pB qB rB sB q tB uB vB wB xB P Q R 6B S T U V W X Y Z a b c d e f g h i j k l m r s t u v w x y z D 7B 8B 9B"
},
D: {
1: "0 1 2 3 4 8 9 AB BB CB DB EB FB GB HB IB JB KB LB MB NB OB PB QB RB SB TB UB VB WB XB YB ZB aB bB cB dB eB fB gB 4B hB 5B iB jB kB lB mB nB oB pB qB rB sB q tB uB vB wB xB P Q R S T U V W X Y Z a b c d e f g h i j k l m r s t u v w x y z D 7B 8B 9B",
2: "5 6 I J E F G A B C K L H M N O",
33: "7 n o p"
},
E: {
1: "F G A B C K L H XC BC yB zB CC YC ZC DC EC 0B aC 1B FC GC HC IC JC KC 2B LC MC bC",
2: "5 I J E TC AC UC VC WC"
},
F: {
1: "6 7 8 9 H M N O n o p AB BB CB DB EB FB GB HB IB JB KB LB MB NB OB PB QB RB SB TB UB VB WB XB YB ZB aB bB cB dB eB fB gB hB iB jB kB lB mB nB oB pB qB rB sB q tB uB vB wB xB P Q R 6B S T U V W X Y Z a b c d e f g h i j k l m",
2: "G B C cC dC eC fC yB NC gC zB"
},
G: {
1: "F mC nC oC pC qC rC sC tC uC vC wC xC yC zC 0C DC EC 0B 1C 1B FC GC HC IC JC KC 2B LC MC",
2: "AC hC OC iC jC kC lC"
},
H: {
2: "2C"
},
I: {
1: "D 7C 8C",
2: "3B I 3C 4C 5C 6C OC"
},
J: {
1: "A",
2: "E"
},
K: {
1: "q",
2: "A B C yB NC zB"
},
L: {
1: "D"
},
M: {
1: "D"
},
N: {
1: "A B"
},
O: {
1: "0B"
},
P: {
1: "I n o p 9C AD BD CD DD BC ED FD GD HD ID 1B 2B JD KD"
},
Q: {
1: "CC"
},
R: {
1: "LD"
},
S: {
1: "MD ND"
}
},
B: 2,
C: "High Resolution Time API",
D: true
};
Searching GitHub for the only identifiable string didn't turn up much, except that it seemed like it might relate to a web feature, and some references linked to caniuse.com:
- https://github.com/search?type=code&q=%22High+Resolution+Time+API%22
- https://caniuse.com/high-resolution-time
Trying to search for parts of the other 'minified' strings on GitHub mostly seemed to turn up pnpm cache entries and similar, but didn't help to identify what the original source was.
Looking at the repo containing the data for caniuse.com, and searching for the identifiable string seemed to confirm that it looks related in data shape, but wasn't quite the right match, as it still didn't explain the weird 'minified' looking strings:
- https://github.com/search?q=repo%3AFyrd%2Fcaniuse+%22High+Resolution+Time+API%22&type=code
- https://github.com/Fyrd/caniuse/blob/4ef5449979e70a3188a721c299247d436aaab71f/features-json/high-resolution-time.json#L2
Exploring a bit further with ChatGPT pointed me towards browserslist/caniuse-lite
, and while searching for the main identifiable string didn't turn up anything directly, manually looking at the data did find what seemed to be a close match for the code we found in the bundle:
- https://github.com/search?q=repo%3Abrowserslist%2Fcaniuse-lite%20%22High%20Resolution%20Time%20API%22&type=code
- https://github.com/browserslist/caniuse-lite/blob/main/data/features/high-resolution-time.js
While this is one example, there are many other examples based on all of the other feature data:
- https://github.com/brix/crypto-js
-
JavaScript library of crypto standards.
-
I had a file that contained content like this:
// ..snip..
i = (r = (o = d).lib).Base;
a = r.WordArray;
s = (l = o.algo).SHA256;
c = l.HMAC;
u = l.PBKDF2 = i.extend({
cfg: i.extend({
keySize: 4,
hasher: s,
iterations: 250000
}),
// ..snip..
o.PBKDF2 = function (e, t, n) {
return u.create(n).compute(e, t);
};
module.exports = d.PBKDF2;
Which I searched for on GitHub Code Search:
Which seemed to point me towards crypto-js
's src/pbkdf2.js
:
- https://github.com/cssnano/cssnano
-
A modular minifier, built on top of the PostCSS ecosystem.
-
- https://github.com/cssnano/cssnano/tree/master/packages/stylehacks
-
stylehacks Detect/remove browser hacks from CSS files.
-
I had a file that just contained this content:
module.exports = {
MEDIA_QUERY: "media query",
PROPERTY: "property",
SELECTOR: "selector",
VALUE: "value"
};
Which I searched for on GitHub Code Search:
Which seemed to point me towards stylehacks
, which seems to be a part of cssnano
- https://github.com/remarkablemark/html-dom-parser
-
HTML to DOM parser that works on both the server (Node.js) and the client (browser)
-
The parser converts an HTML string to a JavaScript object that describes the DOM tree.
-
I had a file that contained content like this:
// ..snip..
function a(e) {
return function (e) {
return r.CASE_SENSITIVE_TAG_NAMES_MAP[e];
}(e = e.toLowerCase()) || e;
}
exports.formatAttributes = i;
exports.formatDOM = function e(t, n = null, r) {
// ..snip..
(l = new o.ProcessingInstruction(r.substring(0, r.indexOf(" ")).toLowerCase(), r)).next = s
// ..snip..
Which I searched for on GitHub Code Search:
Which seemed to point me towards html-dom-parser
's utilities.ts
file:
- https://github.com/postcss/postcss
-
PostCSS is a tool for transforming styles with JS plugins. These plugins can lint your CSS, support variables and mixins, transpile future CSS syntax, inline images, and more.
-
- https://github.com/postcss/postcss-deno
-
PostCSS for Deno Scripts to transform the source code of PostCSS for Deno compatibility.
-
I had a file that started with this content:
const t = {
after: "\n",
beforeClose: "\n",
beforeComment: "\n",
beforeDecl: "\n",
beforeOpen: " ",
beforeRule: "\n",
colon: ": ",
commentLeft: " ",
commentRight: " ",
emptyBody: "",
indent: " ",
semicolon: false
};
// ..snip..
Which I searched for on GitHub Code Search:
Which after a little looking through things, lead me to the postcss
/ postcss-deno
stringifier.js
file:
- https://github.com/postcss/postcss/blob/7ac43db0c30c08dc799a55e5f2f1f269a61b6856/lib/stringifier.js#L3-L16
- https://github.com/postcss/postcss-deno/blob/01dfbb279fa8811ee649ecfdfa210112aaab8d7a/deno/lib/stringifier.js#L3-L16
If I kept looking through the minified code I also would have seen this part later on, which sort of helped confirm that it relates to PostCSS:
// ..snip..
stringify(e, t) {
if (!this[e.type]) {
throw new Error("Unknown AST node type " + e.type + ". Maybe you need to change PostCSS stringifier.");
}
this[e.type](e, t);
}
// ..snip..
- https://github.com/ProseMirror/prosemirror-state
-
ProseMirror editor state
-
This is a core module of ProseMirror. ProseMirror is a well-behaved rich semantic content editor based on
contentEditable
, with support for collaborative editing and custom document schemas. -
This module implements the editor state, which tracks the current document and selection, and managed plugins.
-
I had a file that contained content like this:
// ..snip..
static near(e, t = 1) {
return this.findFrom(e, t) || this.findFrom(e, -t) || new m(e.node(0));
}
static atStart(e) {
return f(e, e, 0, 0, 1) || new m(e);
}
static atEnd(e) {
return f(e, e, e.content.size, e.childCount, -1) || new m(e);
}
static fromJSON(e, t) {
if (!t || !t.type) {
throw new RangeError("Invalid input for Selection.fromJSON");
}
let n = i[t.type];
if (!n) {
throw new RangeError(`No selection type ${t.type} defined`);
}
return n.fromJSON(e, t);
}
// ..snip..
Which I searched for on GitHub Code Search:
Which seemed to point me towards prosemirror-state
's src/selection.ts
:
- https://github.com/react-grid-layout/react-resizable
-
A simple React component that is resizable with a handle.
-
I had a file that contained content like this:
module.exports = function () {
throw new Error("Don't instantiate Resizable directly! Use require('react-resizable').Resizable");
};
// ..snip..
Which I searched for on GitHub Code Search:
Which seemed to point me towards react-resizable
's index.js
:
- https://github.com/tailwindlabs/tailwindcss-typography
-
tailwindcss-typography Beautiful typographic defaults for HTML you don't control.
-
The official Tailwind CSS Typography plugin provides a set of
prose
classes you can use to add beautiful typographic defaults to any vanilla HTML you don’t control, like HTML rendered from Markdown, or pulled from a CMS.
-
I had a file that contained content like this:
// ..snip..
const r = e => e.toFixed(7).replace(/(\.[0-9]+?)0+$/, "$1").replace(/\.0$/, "");
// ..snip..
Which I searched for on GitHub Code Search:
Which seemed to point me towards tailwindcss-typography
's src/styles.js
:
- https://github.com/0xdevalias
- https://gist.github.com/0xdevalias
- https://github.com/0xdevalias/chatgpt-source-watch : Analyzing the evolution of ChatGPT's codebase through time with curated archives and scripts.
- Deobfuscating / Unminifying Obfuscated Web App Code (0xdevalias' gist)
- Reverse Engineering Webpack Apps (0xdevalias' gist)
- Fingerprinting Minified JavaScript Libraries / AST Fingerprinting / Source Code Similarity / Etc (0xdevalias' gist)
- React Server Components, Next.js v13+, and Webpack: Notes on Streaming Wire Format (
__next_f
, etc) (0xdevalias' gist)) - Bypassing Cloudflare, Akamai, etc (0xdevalias' gist)
- Debugging Electron Apps (and related memory issues) (0xdevalias' gist)
- devalias' Beeper CSS Hacks (0xdevalias' gist)
- Reverse Engineering Golang (0xdevalias' gist)
- Reverse Engineering on macOS (0xdevalias' gist)