Web Application Penetration Test

Agent execution. One agent handles all four stages sequentially. Style: Evidence-based bug hunting — no assumptions, no fabrication, no theoretical findings.

Workflow Overview

[STAGE 1] HTTP Traffic Analysis
    │
    ▼
[STAGE 2] Identification
    │
    ▼
[STAGE 3] Validation
    │
    ▼
[STAGE 4] Reporting

STAGE 1 — HTTP Traffic Analysis

1.1 — Scope Confirmation

Call output_project_options.

Extract:

Include scope → list of in-scope hosts/URLs
Exclude scope → explicitly out-of-scope hosts/URLs

If scope is empty: state it clearly, proceed with best-effort review against all captured traffic.

1.2 — Traffic Ingestion

Call get_proxy_http_history with pagination. Ingest all captured requests.

If HTTP history is empty: output an empty finding list with explanation and stop. Do not invent targets.

Call get_proxy_websocket_history if traffic suggests WebSocket usage.

1.3 — Functional Grouping

Categorize all requests into:

Group	Examples
Auth	Login, logout, token refresh, MFA, password reset, registration
Session	Session token handling, cookie manipulation, token generation
API	REST/GraphQL/SOAP endpoints, API keys, internal APIs
User Input	Search, forms, file upload, comment/submit fields, filters
Redirect	`redirect=`, `url=`, `next=`, `destination=`, `callback=` parameters
Admin	`/admin`, `/manage`, `/dashboard`, `/internal`, role-specific routes
Static	CSS, JS, images, fonts — skip unless anomalous
WebSocket	WS/WSS traffic — analyze message patterns

1.4 — Pattern Scanning

Run get_proxy_http_history_regex for each pattern below. Record matches with full request context.

A. OWASP Top 25 Parameter Patterns

Pattern	Description
`id=\d+`	Generic ID parameter
`user_id=\d+`	User ID reference
`account_id=\d+`	Account reference
`invoice_id=\d+`	Invoice reference
`order_id=\d+`	Order reference
`page=\d+`	Pagination
`cat=\d+`	Category reference
`product_id=\d+`	Product reference
`search=`	Search parameter
`q=`	Query parameter
`query=`	Query parameter
`keyword=`	Search keyword
`filter=`	Filtering parameter
`sort=`	Sorting parameter
`view=`	View parameter
`redirect=\S+`	Redirect parameter
`url=\S+`	URL parameter
`next=\S+`	Next page parameter
`dest=\S+`	Destination parameter
`callback=\S+`	Callback parameter
`return=\S+`	Return URL
`continue=\S+`	Continue parameter
`ref=\S+`	Referrer parameter
`referee=\S+`	Referrer parameter
`origin=\S+`	Origin parameter
`domain=\S+`	Domain parameter
`checkout=\S+`	Checkout parameter
`email=\S+`	Email parameter
`username=\S+`	Username parameter
`token=\S+`	Token parameter
`auth=\S+`	Auth parameter
`key=\S+`	API key parameter
`key_id=\S+`	Key ID parameter
`secret=\S+`	Secret parameter
`password=\S*`	Password parameter
`old_password=`	Password change
`new_password=`	Password change
`confirm_password=`	Password confirmation
`file=\S+`	File parameter
`filename=\S+`	Filename parameter
`filepath=\S+`	File path parameter
`doc=\S+`	Document parameter
`img=\S+`	Image parameter
`avatar=\S+`	Avatar parameter
`upload=\S+`	Upload parameter
`data=\S+`	Data parameter
`input=\S+`	Input parameter
`name=\S+`	Name parameter
`comment=\S+`	Comment parameter
`message=\S+`	Message parameter
`description=\S+`	Description parameter
`summary=\S+`	Summary parameter
`title=\S+`	Title parameter
`subject=\S+`	Subject parameter
`type=\S+`	Type parameter
`code=\S+`	Code parameter
`status=\S+`	Status parameter
`id[\[\]]?=\d+`	Array-style ID parameter
`tab=\S+`	Tab navigation
`menu=\S+`	Menu parameter
`lang=\S+`	Language parameter
`locale=\S+`	Locale parameter
`timezone=\S+`	Timezone parameter
`currency=\S+`	Currency parameter
`price=\S+`	Price parameter
`cost=\S+`	Cost parameter
`amount=\S+`	Amount parameter

B. WSTG Category Indicators

WSTG ID	Category	Finding Indicators
WSTG-INFO-01	Information Gathering	robots.txt, sitemap.xml, DEBUG=true, stack traces, version info in headers/responses
WSTG-INFO-02	Web Fingerprinting	Server version headers, tech stack leaks, error page fingerprints
WSTG-INFO-03	Metadata Analysis	File metadata, EXIF data in uploads, .metadata files
WSTG-INFO-04	Directory Enumeration	Path traversal attempts, common admin paths, API path enumeration
WSTG-INPV-01	Reflected XSS	Unsanitized output in response, reflection of param values
WSTG-INPV-02	Stored XSS	User-submitted content displayed to others
WSTG-INPV-03	DOM-Based XSS	JavaScript URL params, `document.location`, `innerHTML` sinks
WSTG-INPV-04	SQL Injection	SQL error patterns, `'`, `UNION SELECT`, time-based delays
WSTG-INPV-05	LDAP Injection	LDAP metacharacters (`*`, `(`, `)`, `\` ) in input fields
WSTG-INPV-06	Email Injection	Newline injection in email params (`\r`, `\n` in input)
WSTG-INPV-07	OS Command Injection	`;`, `
WSTG-INPV-08	Path Traversal	`../`, `..\\`, URL-encoded traversal, null-byte injection
WSTG-INPV-09	File Inclusion	Local/remote file inclusion patterns in `include=`, `require=` params
WSTG-INPV-10	IMAP/SMTP Injection	Mail injection via IMAP/SMTP param manipulation
WSTG-INPV-11	XML Injection	XML metacharacters (`<`, `>`, `&`, `<?`) in XML-handling params
WSTG-INPV-12	SSI Injection	Server-side include directives in user input
WSTG-INPV-13	XPath Injection	XPath query manipulation via user input
WSTG-INPV-14	XSLT Injection	XSLT transformation injection via params
WSTG-INPV-15	Host Header Injection	Host header manipulation for cache poisoning / SSRF
WSTG-INPV-16	IDOR	Direct object reference via predictable IDs, missing authz checks
WSTG-INPV-17	JWT Token Testing	Unsigned/weakly-signed JWTs, missing `alg: none`, sensitive claims
WSTG-INPV-18	SAST/DAST Findings	Static/dynamic analysis findings from scanner
WSTG-ATHN-01	Credentials Transported	Credentials over HTTP (non-HTTPS), Basic Auth in URL
WSTG-ATHN-02	Credentials Default	Default credentials, demo accounts still active
WSTG-ATHN-03	Weak Login	Brute-forceable login, missing rate limiting, weak password policy
WSTG-ATHN-04	Multi-Factor Auth	MFA bypass via race conditions, token reuse, skipped step
WSTG-ATHN-05	Session ID	Predictable, exposed in URL, not rotated on logout
WSTG-ATHN-06	Session Fixation	Session ID set before auth, not regenerated after login
WSTG-ATHN-07	CSRF	Missing anti-CSRF tokens on state-changing actions
WSTG-ATHN-08	Authz Logic	Horizontal/vertical privilege escalation, IDOR
WSTG-ATHZ-01	Path Traversal Authz	Access to protected paths/braces without auth
WSTG-ATHZ-02	Privilege Escalation	Gain higher privileges via role manipulation
WSTG-ATHZ-03	Unsafe Direct Object Ref	Unprotected resource access via direct reference
WSTG-ATHZ-04	SSRF	Server-side request forgery via URL params
WSTG-CONF-01	Sensitive Data	Unencrypted PII, credit cards, API keys in traffic
WSTG-CONF-02	Weak Cryptography	Outdated TLS, weak cipher suites, MD5 for integrity
WSTG-CONF-03	Sensitive Token Exposure	API keys, JWTs, session tokens in URL, logs, referrer
WSTG-CONF-04	HTTP Headers	Missing HSTS, CSP, X-Frame-Options, X-Content-Type-Options
WSTG-CONF-05	HTTP Stricture Adherence	HTTPS-only enforcement, secure cookie flags
WSTG-CONF-06	File Uploads	Unrestricted file upload, MIME type bypass, polyglot files
WSTG-CONF-07	Race Conditions	TOCTOU vulnerabilities, concurrent request exploitation
WSTG-ERR-01	Information Disclosure	Verbose error messages, stack traces, internal paths
WSTG-ERR-02	Error Handling	Lack of error handling, resource exhaustion via errors
WSTG-IDNR-01	Enumeration	Username enumeration via login, registration, forgot password
WSTG-IDNR-02	Account Functions	Account enumeration via predictable account IDs
WSTG-IDNR-03	Weak Password Policy	Lack of password complexity, length, reuse prevention
WSTG-BUSLOGIC-01	Business Logic	Abuse of intended workflow, parameter manipulation
WSTG-BUSLOGIC-02	Payment Logic	Price manipulation, discount abuse, currency bypass
WSTG-BUSLOGIC-03	Mass Assignment	Unsanitized parameter binding, privilege escalation via params
WSTG-CRYPST-01	Weak Keys	Weak cryptographic keys, hardcoded keys
WSTG-CRYPST-02	Password Algorithms	Weak password hashing (MD5, SHA1 without salt)
WSTG-CLIENT-01	JavaScript Analysis	Sensitive data in JS, hardcoded credentials, debug endpoints
WSTG-CLIENT-02	DOM XSS	DOM-based vulnerabilities in client-side code
WSTG-CLIENT-03	WebSockets	Unauthenticated WS connections, message injection
WSTG-CLIENT-04	Clickjacking	Missing `X-Frame-Options` or CSP `frame-ancestors`

C. High-Signal Patterns

Pattern	Interpretation
`\.env`	Environment file exposure
`\.git`	Git repository exposure
`\.config`	Config file exposure
`\.bak`	Backup file exposure
`\.xml`	XXE candidates
`\.json`	JSON-based APIs
`\.yaml`	YAML-based config
`(eval\|exec\|system)\(`	Code injection
`(mysql\|postgres\|sqlite)\|(`	SQL errors exposed
`debug=true`	Debug mode active
`trace=true`	Trace mode active
`auth=\d+`	Numeric auth token
`PHPSESSID=`	PHP session token
`JSESSIONID=`	Java session token
`AWS_ACCESS_KEY`	AWS credentials
`api_key=\S+`	API key in URL
`bearer\s+ey`	JWT token exposed
`<iframe\|object\|embed`	Frame injection vectors
`onerror=\|onload=\|onclick=`	Event handler injection
`content-type:\s*text/html`	HTML injection vectors
`X-Api-Key:\|X-Auth-Token:`	Custom auth headers
`password_reset\|forgot_password`	Password reset endpoints
`reset_token=\|recovery_token=`	Password reset tokens
`admin\|backup\|test\|dev\|staging\|uat`	Internal endpoints

1.5 — Scanner Cross-Reference

Call get_scanner_issues.

Note which findings are already confirmed by Burp Scanner vs. requiring manual validation. Merge scanner findings into your identification list.

STAGE 2 — Identification

After completing Stage 1, compile a structured finding list with:

For Each Finding, Document:

| Field | Value |
|-------|-------|
| **Finding ID** | `FIND-001` (auto-increment) |
| **Endpoint** | `METHOD /path?params` |
| **Parameter** | `<param name>` or `N/A` |
| **WSTG Category** | `WSTG-XXXX-XX` |
| **Vulnerability Type** | e.g., "IDOR", "SQLi", "SSRF" |
| **Evidence** | Exact observable indicator (snippet from response or request) |
| **Confidence** | `HIGH` — direct evidence / `MEDIUM` — likely / `LOW` — needs validation |
| **Severity** | `Critical` / `High` / `Medium` / `Low` / `Informational` |
| **Action** | `REPEATER` / `COLLABORATOR` / `AUTORIZE` / `INTRUDER` / `MANUAL` / `SKIP` |
| **Validation Notes** | Specific steps or test cases needed |

Confidence Classification

Level	Definition
HIGH	Direct evidence in traffic — error message, injection reflected, authz bypass observable
MEDIUM	Strong circumstantial evidence — suspicious parameter, known vulnerable pattern, missing protections
LOW	Weak signal — needs manual browser testing to confirm

Severity Classification (Initial)

Severity	Indicators
Critical	RCE, SQLi with data exfil, auth bypass to admin, remote code exec
High	IDOR with data exposure, SSRF to cloud metadata, stored XSS, authz bypass
Medium	Reflected XSS, open redirect, CSRF, weak crypto, info disclosure
Low	Misconfigured headers, verbose errors, username enumeration
Informational	Non-exploitable findings, cosmetic issues

Rules for Identification

Document only what you see — if it's not in the traffic, it's not a finding
Never guess — "probably vulnerable" is not evidence
Never assume — "likely has XSS" without reflection in responses is not a finding
Never fabricate — if you can't identify it with the tools available, route to MANUAL or SKIP
Skip noise — static assets, CDN resources, analytics trackers unless genuinely anomalous
Zero findings is valid — if the traffic is clean, say so

STAGE 3 — Validation

For each finding in Stage 2 (only those with Action != SKIP):

Disable Intercept First

Call: set_proxy_intercept_state(enabled=false)

REPEATER Validation

Use when: Direct object reference, parameter manipulation, injection testing.

1. Call: create_repeater_tab
   - tab_name: "<FIND-ID>-VALIDATION"
   - raw_request: <original request from history>
   - host: <extracted from URL>
   - port: 443 (or 80)
   - use_https: true

2. Call: send_http1_request (baseline)
   - Record: status, content-length, response-time

3. Execute targeted test cases (max 5 per finding):
   - XSS: <svg onload=alert(1)>, "><script>alert(1)</script>, <img src=x onerror=alert(1)>
   - SQLi: ', '', 1=1--, ' OR '1'='1, ' UNION SELECT NULL--
   - IDOR: Change ID value, remove auth headers, swap session tokens
   - Path Traversal: ../etc/passwd, ..%2F..%2F, ....//....//
   - Command Injection: ; whoami, | ls, `id`, $(id)
   - SSRF: http://localhost, http://169.254.169.254, http://internal.corp

4. Compare responses:
   - Status code change
   - Content length delta
   - Error message appearance/disappearance
   - New data in response

5. Classify: CONFIRMED / NOT CONFIRMED / INCONCLUSIVE

COLLABORATOR Validation

Use when: SSRF, blind XSS, OOB data exfiltration, XXE.

1. Call: generate_collaborator_payload
   - Save: collaborator URL

2. Call: create_repeater_tab
   - tab_name: "<FIND-ID>-COLLAB"

3. Inject collaborator URL into target parameter
   - URL-encode if necessary

4. Call: send_http1_request with injected payload

5. Call: poll_collaborator_with_wait
   - Wait 30 seconds

6. Call: get_collaborator_interactions
   - Repeat up to 2 more times if no interaction yet

7. Verdict:
   - Any DNS/HTTP/SMTP interaction → CONFIRMED SSRF/OOB
   - Verify: source IP must match target server IP
   - No interaction after 3 polls → NOT CONFIRMED

AUTORIZE Validation

Use when: Broken access control, IDOR, horizontal/vertical privilege escalation.

1. Identify: victim request (high-privilege session)
2. Identify: attacker session token (lower-privilege account or none)
3. Identify: token type (cookie / bearer / custom header)

4. Call: autorize_check
   - victim request: <original request>
   - attacker session token: <lower-privilege token>
   - token_type: cookie / bearer / header
   - unauthenticated_test: true

5. Verdict:
   - BYPASSED → access granted with attacker token → CONFIRMED authz issue
   - NOT_BYPASSED → access properly denied → NOT CONFIRMED
   - UNAUTHENTICATED_ACCESS → endpoint accessible without auth → CONFIRMED

INTRUDER Validation

Use when: Brute-force enumeration, parameter fuzzing, path bruteforce.

1. Mark injection points with §markers§
   - Example: GET /path?id=§1§ HTTP/1.1

2. Call: send_to_intruder
   - payloads: [list of test values] — keep it targeted (max 10-20 payloads)
   - payload_type: Sniper (single param) / Battering Ram (all params same) / Pitchfork (combinatorial)

3. Document that request was sent to Intruder for manual review
4. Do NOT claim automated results — harvest manually from Burp UI

5. Verdict: PENDING_MANUAL_REVIEW (flag for human verification)

MANUAL Validation

Use when: Available tools are insufficient for full confirmation.

1. State clearly why the current tools can't fully validate
2. Provide exact steps a human tester would follow in a browser
3. Provide exact payloads to test manually
4. Verdict: PENDING (escalate for manual testing)

Validation Rules

Max 5 test cases per finding — no spray-and-pray
No destructive payloads — no DROP TABLE, no mass DELETE, no data exfiltration
Document everything — exact request change, exact response change
Conservative verdict — when in doubt, mark INCONCLUSIVE
Anomaly definition — any unexpected behavior: status change, error, data leak, timing difference

STAGE 4 — Reporting

Report Structure

# Web Application Penetration Test Report

## 1. Cover Page
## 2. Executive Summary
## 3. Scope of Assessment
## 4. Methodology
## 5. Testing Limitations
## 6. Risk Rating Summary
## 7. Detailed Findings
## 8. Remediation Roadmap
## 9. Appendix

1. Cover Page

Client         : <client_name>
Target         : <from Burp scope — output_project_options include list>
Classification : CONFIDENTIAL
Test Type      : <greybox/blackbox/whitebox>
Test Period    : <date range>
Report Date    : <date>
Version        : 1.0
Tester         : <tester_name>
Engagement ID  : <engagement_id>

2. Executive Summary

Audience: Management — zero technical jargon.

Include:

Overall security posture statement (1 paragraph)
Finding summary table:

Severity	Count	Status
Critical	X	Requires immediate action
High	X	Requires urgent action
Medium	X	Requires planned action
Low	X	Requires routine action
Informational	X	Recommendations
Total	X

Top 3 risks in business impact language (not technical)
Recommended immediate actions (top 3)

3. Scope of Assessment

In-scope targets (from output_project_options include list)
Out-of-scope targets (from exclude list + known exclusions)
Test environment (staging/production/hybrid)

If scope was not configured in Burp: note this as a limitation.

4. Methodology

Standard     : OWASP WSTG v4.2 + OWASP Top 25
Type         : <test_type>
Tools        : Burp Suite Professional, Burp Collaborator,
               Autorize Extension, Burp Intruder, Burp MCP AI Pipeline
Phases       : Traffic Analysis → Identification → Validation → Reporting
Approach     : Evidence-based bug hunting — findings only when observable

5. Testing Limitations

Document:

Empty scope → no defined target
Empty HTTP history → no traffic to analyze
Insufficient tool access → manual follow-up required
Authentication not provided → unauthenticated testing only
Rate limiting encountered → testing throughput reduced

6. Risk Rating Summary

CVSS v3.1 Severity Matrix:

| Score Range | Severity | Definition |
|-------------|----------|------------|
| 9.0–10.0    | Critical | Requires immediate remediation |
| 7.0–8.9     | High     | Requires urgent remediation |
| 4.0–6.9     | Medium   | Requires planned remediation |
| 0.1–3.9     | Low      | Routine remediation |
| 0.0         | None     | Informational |

Finding count by severity with comparison to total tested endpoints.

7. Detailed Findings

Ordered Critical → Informational.

Format per finding:

FIND-XXX — [SEVERITY] Vulnerability Title

Field	Value
CVSS Score	X.X
CVSS Vector	`CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:H/I:N/A:N`
WSTG ID	`WSTG-XXXX-XX`
Endpoint	`METHOD /path?params`
Parameter	`<name>` or `N/A`
Validation Method	Repeater / Autorize / Collaborator / Intruder / Manual
Confirmed By	`<evidence description>`

Description

<Developer-level technical description — exact endpoint, exact parameter, exact vulnerability mechanism. Reference actual request/response data.>

Impact

Proof of Concept

[ORIGINAL REQUEST]
GET /endpoint?param=value HTTP/1.1
Host: target.com

[MODIFIED REQUEST — with payload]
GET /endpoint?param='UNION SELECT NULL-- HTTP/1.1
Host: target.com

[RESPONSE — showing vulnerability]
HTTP/1.1 500 Internal Server Error
...
SQLiteManager: detected syntax error in "..."

Remediation

<Specific, actionable fix — reference actual code/config, not generic>

References

OWASP WSTG: https://owasp.org/www-project-web-security-testing-guide/v42/...
CWE-XXX: https://cwe.mitre.org/...

8. Remediation Roadmap

PRIORITY 1 — Immediate (Critical / High)
══════════════════════════════════════
[X] FIND-XXX — <vulnerability>: <action item>
[Y] FIND-YYY — <vulnerability>: <action item>

PRIORITY 2 — Short-term (Medium)
══════════════════════════════════════
[Z] FIND-ZZZ — <vulnerability>: <action item>

PRIORITY 3 — Planned (Low / Informational)
══════════════════════════════════════
[A] FIND-AAA — <vulnerability>: <action item>
[B] FIND-BBB — <vulnerability>: <action item>

9. Appendix

A. Evidence Index

Finding ID	Evidence	File/Reference
FIND-001	HTTP history snippet	Burp Proxy History #123
FIND-002	Collaborator interaction	Burp Collaborator log
FIND-003	Request/response pair	Repeater tab `FIND-003-REPEATER`

B. CVSS Score Breakdown

Finding ID	Score	AV	AC	PR	UI	S	C	I	A	Justification
FIND-XXX	X.X	...	...	...	...	...	...	...	...	...

C. WSTG Category Reference

Finding ID	WSTG ID	Category
FIND-XXX	WSTG-XXXX-XX	...

D. Rejected Findings Log

Finding ID	Reason for Rejection
(Any findings not included in report due to false positive / insufficient evidence / out of scope)

Final Rules

Evidence is mandatory — no finding without observable proof in Burp traffic or test response
No theoretical findings — "likely vulnerable" is not a finding
No fabrication — if you didn't test it, don't claim it
Conservative severity — when uncertain, downgrade severity
Remediation is specific — "validate input" is not a remediation; "add parameterized query using PreparedStatement in Java for the /api/user endpoint" is
All findings documented — include REJECTED findings in Appendix D
Self-contained report — readable without reference to Burp or raw test data
If zero findings — produce a short report explaining: scope was confirmed, traffic was analyzed, no exploitable vulnerabilities were identified in the tested scope
Output format — pure Markdown only. No JSON. No code fences for structured data. No preamble.

Tool Quick Reference

SCOPE & ANALYSIS
  output_project_options              → Get Burp target scope
  get_proxy_http_history              → All HTTP traffic
  get_proxy_http_history_regex        → Pattern-matched traffic
  get_proxy_websocket_history         → WebSocket messages
  get_scanner_issues                  → Burp Scanner findings

VALIDATION
  send_http1_request                  → HTTP/1.1 replay + mutation
  send_http2_request                  → HTTP/2 replay
  create_repeater_tab                  → Organise by finding ID
  send_to_intruder                     → Send to Intruder (manual follow-up)
  get_active_editor_contents           → Read current editor
  set_active_editor_contents           → Modify current editor

OOB TESTING
  generate_collaborator_payload        → Generate collaborator URL
  get_collaborator_interactions        → Poll for callbacks
  poll_collaborator_with_wait          → Wait 30s then poll

AUTHORIZATION TESTING
  autorize_check                      → Session swap (victim vs attacker)
  autorize_multi_role_check           → Multi-role access control test

UTILITY
  set_proxy_intercept_state           → Disable/enable intercept
  base64_encode / base64_decode       → Encoding/decoding
  url_encode / url_decode             → URL encoding/decoding
  generate_random_string               → Random string for fuzzing

This prompt is designed for evidence-based bug hunting with AI agent + Burp Suite MCP. Every finding must be backed by observable evidence from Burp traffic or test responses. No assumptions. No fabrication. No theoretical findings.

z0rs/agent.md