TLSNotary Integration — Complete Technical State Analysis
1. Architecture Overview
The TLSNotary integration adds zkTLS (MPC-TLS-based proofs of web data) to Newton Protocol. It enables operators to
cryptographically verify that data was fetched from a specific HTTPS API without ever seeing the user's credentials.
┌─────────────┐  WS   ┌──────────────┐  TCP  ┌──────────┐       ┌──────────────┐
│   Browser   │──────▶│    Newton    │──────▶│  Target  │       │     IPFS     │
│  Extension  │       │   Sidecar    │       │   API    │       │   Gateway    │
│  (Prover)   │       │  (Verifier)  │       │  Server  │       │              │
└──────┬──────┘       └──────────────┘       └──────────┘       └──────┬───────┘
       │                                                               │
       │ 1. MPC-TLS + Attestation                                      │
       │ 2. Create Presentation (selective disclosure)                 │
       │ 3. Upload to IPFS ───────────────────────────────────────────▶│
       │                                                               │
       │ 4. Submit task with proof_cid                                 │
       ▼                                                               │
┌──────────────┐      ┌──────────────┐      ┌──────────────┐           │
│   Gateway    │─────▶│  Operators   │─────▶│  Aggregator  │           │
│  (proof_cid  │      │ (fetch CID,  │      │  (BLS agg)   │           │
│   passthru)  │      │  verify,     │◀─────┘              │           │
└──────────────┘      │  Rego eval)  │                     │           │
                      │              │◀──────────────────────────────────┘
                      └──────────────┘  5. Fetch presentation by CID
★ Insight ─────────────────────────────────────
Two verification paths exist — and this is important for your review:
1. Operator-level (crates/operator/src/tls_proof.rs): Strict trusted-key verification. Reads TLSN_TRUSTED_NOTARY_KEY from env.
Rejects presentations signed by unknown notaries.
2. WASM data-provider level (crates/data-provider/src/wasm/tlsn.rs): Trusts the presentation's embedded key. The security
boundary here is the WASM sandbox itself, not the notary key. This means a malicious WASM plugin could accept any notary.
These two paths serve different use cases, but the inconsistency is worth flagging.
─────────────────────────────────────────────────
2. Component-by-Component Breakdown
2.1 Core Library: crates/tls-notary/ (Phase 1 — MERGED to main)
Files: lib.rs, prove.rs, present.rs, verify.rs
Current State: On main, working.
┌───────────────────────┬───────────────────────────────────┬────────────────────────────────────────────────────────────────┐
│ Function              │ Purpose                           │ Key Details                                                    │
├───────────────────────┼───────────────────────────────────┼────────────────────────────────────────────────────────────────┤
│ notarize()            │ MPC-TLS prover + verifier over    │ Production flow requires WebSocket (sidecar), not duplex       │
│                       │ in-memory duplex                  │                                                                │
├───────────────────────┼───────────────────────────────────┼────────────────────────────────────────────────────────────────┤
│ create_presentation() │ Selective disclosure from         │ Auto-redacts Authorization, Cookie, User-Agent, X-API-Key,     │
│                       │ attestation                       │ Proxy-Authorization, Set-Cookie                                │
├───────────────────────┼───────────────────────────────────┼────────────────────────────────────────────────────────────────┤
│ verify_presentation() │ Verify against trusted notary key │ Requires explicit trusted_key param (good security)            │
├───────────────────────┼───────────────────────────────────┼────────────────────────────────────────────────────────────────┤
│ get_crypto_provider() │ Root CA selection                 │ Compile-time gated via #[cfg(feature = "fixtures")]            │
└───────────────────────┴───────────────────────────────────┴────────────────────────────────────────────────────────────────┘
Technical Details I Found:
- Serialization: Uses the bincode 2.x serde API (bincode::serde::encode_to_vec / decode_from_slice) — but on the phase3 branch,
the README shows bincode::serialize() (the bincode 1.x API). This is a version inconsistency between what's on main (bincode
2.x) and what's documented on later branches.
- OOM Protection: MAX_ATTESTATION_SIZE = 1 MiB, uses futures::AsyncReadExt::take() on all unbounded reads. This was a review
feedback item you requested — implemented correctly.
- Signing Key Safety: run_verifier() uses a [1u8; 32] test key, gated behind #[cfg(any(test, feature = "fixtures"))]. Production
builds error without a key. Good.
- In-memory duplex: prove.rs uses tokio::io::duplex(MAX_RECV_BUFFER) for prover ↔ verifier. This only works for testing. The
sidecar uses WebSocket for the real flow. The notarize() function is therefore test-only in its current form — it cannot be used
in production as-is because the verifier must be a remote sidecar.
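The OOM protection above uses futures::AsyncReadExt::take() in the real async code; the same bounded-read pattern can be sketched with the synchronous std::io::Read::take (a minimal illustrative analog, not the actual crate code — read_bounded is a name invented here):

```rust
use std::io::Read;

const MAX_ATTESTATION_SIZE: u64 = 1024 * 1024; // 1 MiB cap, mirroring the constant described above

// Read at most `limit` bytes from an untrusted stream; error out if the
// payload is larger, instead of buffering unbounded input into memory.
fn read_bounded<R: Read>(reader: R, limit: u64) -> std::io::Result<Vec<u8>> {
    let mut buf = Vec::new();
    // take() caps the reader; reading one extra byte lets us detect oversize input.
    reader.take(limit + 1).read_to_end(&mut buf)?;
    if buf.len() as u64 > limit {
        return Err(std::io::Error::new(
            std::io::ErrorKind::InvalidData,
            "attestation exceeds size limit",
        ));
    }
    Ok(buf)
}

fn main() {
    let small = vec![0u8; 16];
    assert!(read_bounded(&small[..], MAX_ATTESTATION_SIZE).is_ok());
    let big = vec![0u8; 32];
    assert!(read_bounded(&big[..], 16).is_err()); // oversize input is rejected
}
```

The key point is that the limit is enforced by the reader itself, so a malicious peer cannot force an allocation larger than the cap.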
2.2 Sidecar Binary: bin/tlsn-sidecar/ (Phase 2 — PR #379, NOT merged)
Files: main.rs, session.rs, verifier_core.rs, config.rs, egress.rs, auth.rs, proof.rs, rate_limit.rs, ws_adapter.rs
This is the production notary/verifier server. 9 source files, ~1800 lines.
Endpoints:
┌─────────────────┬────────┬──────┬─────────────────────────────────────────────────────┐
│ Route           │ Method │ Auth │ Purpose                                             │
├─────────────────┼────────┼──────┼─────────────────────────────────────────────────────┤
│ /health         │ GET    │ No   │ Health + Prometheus metrics                         │
├─────────────────┼────────┼──────┼─────────────────────────────────────────────────────┤
│ /session        │ WS     │ No   │ Create MPC-TLS session, returns sessionId           │
├─────────────────┼────────┼──────┼─────────────────────────────────────────────────────┤
│ /verifier       │ WS     │ No   │ Prover connects here for MPC-TLS verification       │
├─────────────────┼────────┼──────┼─────────────────────────────────────────────────────┤
│ /proxy          │ WS     │ No   │ WebSocket→TCP bridge (session-bound, SSRF-filtered) │
├─────────────────┼────────┼──────┼─────────────────────────────────────────────────────┤
│ /v1/proof/store │ POST   │ Yes  │ Upload presentation to IPFS                         │
├─────────────────┼────────┼──────┼─────────────────────────────────────────────────────┤
│ /v1/proof/:cid  │ GET    │ No   │ Retrieve presentation from IPFS                     │
└─────────────────┴────────┴──────┴─────────────────────────────────────────────────────┘
Issues and Concerns I Found:
Security
1. /session and /verifier have no authentication. Anyone can create MPC-TLS sessions — only /v1/proof/store requires API-key
auth. An attacker can therefore exhaust the session pool (bounded at max_sessions=256) without any credentials.
2. Auth middleware is not applied as a layer. auth.rs defines extract_bearer_token() and AuthProvider, but they're only called
manually in proof.rs::store_proof_handler. There's no Tower middleware extracting/validating auth. The auth.rs file has unused
middleware imports and a manual check pattern.
3. Proxy endpoint session binding is weak. The /proxy endpoint requires a session parameter that must exist in the session map,
but there's no cryptographic binding between the session creator and the proxy user. Anyone who knows (or guesses) a session ID
can use the proxy.
4. CORS is fully open: CorsLayer::new().allow_origin(Any).allow_methods(Any).allow_headers(Any). For a browser extension talking
to localhost this makes sense, but if deployed as a shared service it is too permissive.
5. verify_reveal_config uses sent_unsafe()/received_unsafe() after validating ranges. The comment explains why, but this is
fragile — the validation must always run before extraction.
Architecture
6. ws_adapter.rs is a full reimplementation of axum's WebSocket support: ~300 lines of custom WebSocket upgrade, frame handling,
and Sink/Stream impls. It exists because TLSNotary needs futures::AsyncRead/AsyncWrite (which async-tungstenite provides) while
axum uses tokio-tungstenite. This is a significant maintenance burden.
7. No signing key management for the sidecar verifier. The sidecar's verifier_core.rs uses RootCertStore::mozilla() for TLS
verification but doesn't configure a signing key. Attestation signing happens in prove.rs::run_verifier(), which isn't used by
the sidecar — the sidecar delegates to Session::new() + handle.new_verifier(). The sidecar doesn't produce attestations — it
runs the MPC-TLS protocol and produces VerifierOutput with transcript data. Does the extension/prover then build the attestation
using its own key? This flow needs clarification.
8. Session state is an in-memory HashMap behind a Mutex. No persistence, no clustering. The sidecar must be single-instance per
client. A session reaper runs every 60 seconds.
9. IPFS integration has no pinning. upload_to_ipfs calls /api/v0/add but doesn't pin. Garbage collection could remove proofs
before operators verify them.
Error Handling
10. eyre is used throughout the sidecar instead of thiserror. Only verifier_core.rs and egress.rs use typed thiserror errors;
main.rs, proof.rs, and config.rs all use eyre::Result. Per project conventions (rust.md), binaries can use anyhow/eyre at entry
points, but library-like modules should use typed errors.
11. data_provider/src/wasm/tlsn.rs uses bcs for deserialization while the rest of the codebase uses bincode.
verify_presentation_bytes calls bcs::from_bytes() — a serialization format mismatch. If the extension serializes with bincode
and the data provider deserializes with BCS, it will fail.
Performance
12. reqwest::Client::new() in AppState. Good — a shared client with connection pooling.
13. The rate limiter uses governor with DashMap. Per-IP limiting is implemented. Global limiting is marked "reserved, not yet
enforced" in the config comments.
14. No connection timeout on IPFS requests. upload_to_ipfs and get_proof_handler use the shared reqwest::Client without explicit
timeouts on individual requests.
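The session-store pattern from issue 8 — an in-memory HashMap behind a Mutex swept by a periodic reaper — can be sketched with std only (SessionStore and its fields are illustrative names, not the sidecar's actual types):

```rust
use std::collections::HashMap;
use std::sync::Mutex;
use std::time::{Duration, Instant};

// Hypothetical session record; the real sidecar's SessionData holds MPC-TLS state.
struct SessionData {
    created_at: Instant,
}

struct SessionStore {
    sessions: Mutex<HashMap<String, SessionData>>,
    ttl: Duration,
}

impl SessionStore {
    fn new(ttl: Duration) -> Self {
        Self { sessions: Mutex::new(HashMap::new()), ttl }
    }

    fn insert(&self, id: String) {
        self.sessions.lock().unwrap().insert(id, SessionData { created_at: Instant::now() });
    }

    // What the 60-second reaper tick amounts to: take the single exclusive
    // lock and sweep every entry — the contention point flagged in issue 8.
    fn reap(&self) -> usize {
        let mut map = self.sessions.lock().unwrap();
        let before = map.len();
        map.retain(|_, s| s.created_at.elapsed() < self.ttl);
        before - map.len()
    }
}

fn main() {
    let store = SessionStore::new(Duration::from_millis(10));
    store.insert("a".into());
    std::thread::sleep(Duration::from_millis(20));
    store.insert("b".into());
    assert_eq!(store.reap(), 1); // "a" expired, "b" survives
}
```

Every insert, lookup, and sweep serializes on the one Mutex, which is why the later discussion recommends a sharded or concurrent map instead.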
2.3 Operator Integration (Phase 2-3 — NOT merged)
Files: crates/operator/src/tls_proof.rs, changes to core.rs and builder.rs
Flow:
1. Operator receives task with proof_cid
2. Fetches presentation from IPFS via fetch_and_verify_tls_proof()
3. Deserializes with bincode
4. Verifies against TLSN_TRUSTED_NOTARY_KEY
5. Extracts VerifiedTlsProof { server_name, verified, response_body, request_target }
6. Injects into Rego as the data.tls_proof namespace
7. Policy evaluation proceeds with TLS data available
Issues:
15. Feature-gated behind tls-notary. All TLSN code is #[cfg(feature = "tls-notary")] with a fallback to None. This is clean.
16. Trusted key from env var: TLSN_TRUSTED_NOTARY_KEY. No rotation, no multi-key support, no on-chain registry. If the notary
key changes, all operators must update their env vars.
17. Soft failure on verification: when proof verification fails, the operator logs a warning and proceeds without the TLS proof
data. The policy may still evaluate to true if it doesn't require TLS data. This could be a security concern — a task with a
proof_cid that fails verification should arguably be rejected, not evaluated without the proof.
18. HTTP response body parsing is naive: tls_proof.rs:131-137 parses the HTTP body by finding \r\n\r\n. This doesn't handle
chunked transfer encoding, Content-Encoding (gzip), or multi-part responses.
19. get_trusted_notary_key() tests use env::set_var/env::remove_var. Per testing.md, this is a test anti-pattern — env vars are
process-wide and tests run in parallel. These tests are racy.
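Issue 18's naive header/body split can be demonstrated in a few lines; this is an illustrative sketch of the failure mode, not the operator's actual parsing code (naive_body is a name invented here):

```rust
// Naive split on the first header/body boundary, as issue 18 describes.
fn naive_body(raw: &str) -> Option<&str> {
    raw.split_once("\r\n\r\n").map(|(_, body)| body)
}

fn main() {
    // Works for a plain, identity-encoded response:
    let plain = "HTTP/1.1 200 OK\r\nContent-Length: 2\r\n\r\nok";
    assert_eq!(naive_body(plain), Some("ok"));

    // With Transfer-Encoding: chunked, the "body" still contains the chunk
    // framing ("2\r\nok\r\n0\r\n\r\n"), not the decoded payload.
    let chunked = "HTTP/1.1 200 OK\r\nTransfer-Encoding: chunked\r\n\r\n2\r\nok\r\n0\r\n\r\n";
    let body = naive_body(chunked).unwrap();
    assert_ne!(body, "ok"); // framing leaks through; gzip bodies fail similarly
}
```

Any Rego policy doing exact-match or JSON parsing on that body will silently see chunk-size framing or compressed bytes instead of the payload.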
2.4 Data Provider WASM Integration (Phase 3 — NOT merged)
Files: crates/data-provider/src/wasm/tlsn.rs, crates/data-provider/wit-v2/newton-provider.wit
WIT Interface (@0.2.0):
interface tlsn {
  verify-from-cid: func(proof-cid: string) -> result<verified-data, string>;
  verify: func(presentation-bytes: list<u8>) -> result<verified-data, string>;
}
Issues:
20. CID multihash verification only supports SHA2-256 (code 0x12). CIDv1 with Blake2b or other hash functions will be rejected.
This is acceptable for now but limits future IPFS compatibility.
21. verify_presentation_bytes trusts the embedded notary key. This is documented but weaker than the operator-level check. A
malicious WASM plugin could construct a self-signed presentation and it would pass verification here.
22. BCS vs bincode inconsistency (critical). tlsn.rs:119 uses bcs::from_bytes() but the core library serializes with
bincode::serde::encode_to_vec(). These are incompatible formats. This will cause deserialization failures at runtime.
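The SHA2-256-only multihash check from issue 20 amounts to a prefix match on the multihash header — multicodec 0x12 followed by a 0x20 (32-byte) digest length. A std-only sketch of that shape (check_multihash is an invented name; the real code also recomputes the digest over the fetched bytes):

```rust
// Minimal multihash header check: SHA2-256 is multicodec 0x12 with a
// 32-byte (0x20) digest. Anything else is rejected, as issue 20 notes.
const SHA2_256: u8 = 0x12;
const SHA2_256_LEN: u8 = 0x20;

fn check_multihash(bytes: &[u8]) -> Result<&[u8], &'static str> {
    match bytes {
        [SHA2_256, SHA2_256_LEN, digest @ ..] if digest.len() == 32 => Ok(digest),
        [SHA2_256, ..] => Err("bad digest length"),
        _ => Err("unsupported hash function (only sha2-256 / 0x12)"),
    }
}

fn main() {
    let mut good = vec![0x12, 0x20];
    good.extend([0u8; 32]);
    assert!(check_multihash(&good).is_ok());

    // A Blake2b-256 multihash (multicodec 0xb220) would be rejected here.
    let blake = [0xb2, 0x20, 0x00];
    assert!(check_multihash(&blake).is_err());
}
```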
2.5 Gateway Integration (Phase 3-5 — NOT merged)
Changes: proof_cid field added to CreateTaskRequest and SendTaskRequest, threaded through ConsensusCommitRequest, passed to
operators.
Issues:
23. proof_cid on SendTaskRequest says "Not supported in async mode; returns an error if provided" — but I don't see the actual
rejection logic, just the doc comment.
24. No validation of the proof_cid format. The gateway passes through whatever string the client sends. No CID format
validation, no length check.
25. The gateway doesn't verify the proof itself. It's a pure passthrough. This means the gateway can't reject obviously invalid
CIDs before broadcasting to operators, wasting operator compute.
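For issue 24, even a shallow syntactic check would filter garbage before broadcast. This is a hypothetical sketch (looks_like_cid is invented; a real fix should parse with a proper CID library rather than pattern-match):

```rust
// Hypothetical gateway-side sanity check for proof_cid (issue 24). It only
// filters obviously malformed input; real validation belongs in a CID parser.
fn looks_like_cid(s: &str) -> bool {
    let base58 = |c: char| c.is_ascii_alphanumeric() && !"0OIl".contains(c);
    match s.as_bytes() {
        // CIDv0: base58btc-encoded sha2-256 multihash, always 46 chars, "Qm" prefix
        [b'Q', b'm', ..] => s.len() == 46 && s.chars().all(base58),
        // CIDv1 in lowercase base32: "b" prefix; conservative charset and length bounds
        [b'b', ..] => s.len() >= 8
            && s.len() <= 128
            && s[1..].chars().all(|c| c.is_ascii_lowercase() || c.is_ascii_digit()),
        _ => false,
    }
}

fn main() {
    assert!(looks_like_cid("QmYwAPJzv5CZsnA625s3Xf2nemtYgPpHdWEz79ojWnPbdG"));
    assert!(!looks_like_cid(""));
    assert!(!looks_like_cid("not a cid"));
}
```

Rejecting these at the gateway is cheap and spares every operator an IPFS round trip for junk input.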
2.6 Rego Policy Integration (Phase 5 — NOT merged)
Three policy templates:
- tlsn_twitter_followers.rego — Twitter/X follower count check
- tlsn_bank_balance.rego — bank balance + currency verification
- tlsn_generic.rego — configurable field checks with comparison operators (gt, gte, lt, lte, eq)
Data namespace: data.tlsn_verified, data.tlsn_server, data.tlsn_connection_time, data.tlsn_data
Issues:
26. Proof freshness is checked in Rego via max_proof_age_secs. This is good but relies on the connection timestamp from the
presentation, which is set by the notary. A malicious notary could backdate or future-date this.
27. E2E tests are all mock-based. No real MPC-TLS, no real IPFS. The tests verify Rego policy logic against hardcoded JSON, not
the actual verification pipeline.
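The freshness logic behind issue 26 is simple enough to state exactly; a Rust sketch of the rule (the real check lives in Rego via max_proof_age_secs — proof_is_fresh is an invented name) also shows why future-dated timestamps need an explicit bound:

```rust
use std::time::{SystemTime, UNIX_EPOCH};

// Sketch of the freshness rule from the Rego policies, plus a guard against
// future-dated timestamps (issue 26: the notary controls connection_time).
fn proof_is_fresh(connection_time_secs: u64, now_secs: u64, max_age_secs: u64) -> bool {
    // Reject future-dated proofs outright; a malicious notary could
    // otherwise extend a proof's lifetime by post-dating it.
    if connection_time_secs > now_secs {
        return false;
    }
    now_secs - connection_time_secs <= max_age_secs
}

fn main() {
    let now = SystemTime::now().duration_since(UNIX_EPOCH).unwrap().as_secs();
    assert!(proof_is_fresh(now - 10, now, 300));   // 10s old, 5-minute budget
    assert!(!proof_is_fresh(now - 600, now, 300)); // stale
    assert!(!proof_is_fresh(now + 60, now, 300));  // future-dated
}
```

Note that neither bound helps against a notary that simply lies within the window; freshness checks limit replay, not notary compromise.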
3. Cross-Cutting Concerns
3.1 Serialization Format Mismatch (Critical)
┌─────────────────────────┬────────────────────┬────────────────────┐
│ Component               │ Serialize          │ Deserialize        │
├─────────────────────────┼────────────────────┼────────────────────┤
│ prove.rs (core)         │ bincode 2.x serde  │ bincode 2.x serde  │
├─────────────────────────┼────────────────────┼────────────────────┤
│ proof.rs (sidecar)      │ base64 passthrough │ base64 passthrough │
├─────────────────────────┼────────────────────┼────────────────────┤
│ tls_proof.rs (operator) │ —                  │ bincode 2.x serde  │
├─────────────────────────┼────────────────────┼────────────────────┤
│ tlsn.rs (data-provider) │ —                  │ bcs                │
└─────────────────────────┴────────────────────┴────────────────────┘
The data-provider path uses BCS while everything else uses bincode. This is a runtime failure waiting to happen.
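To make the incompatibility concrete: the formats disagree from the very first bytes of a length prefix. BCS encodes sequence lengths as ULEB128, while bincode 1.x defaulted to fixed-width little-endian u64 lengths (and bincode 2.x's default varint scheme differs again). These hand-rolled encoders are illustrative of the framing difference, not the crates' actual output:

```rust
// ULEB128, the variable-length integer BCS uses for sequence lengths.
fn uleb128(mut n: u64) -> Vec<u8> {
    let mut out = Vec::new();
    loop {
        let byte = (n & 0x7f) as u8;
        n >>= 7;
        if n == 0 {
            out.push(byte);
            break;
        }
        out.push(byte | 0x80); // continuation bit set
    }
    out
}

// Fixed-width little-endian u64, the length prefix bincode 1.x used by default.
fn u64_le(n: u64) -> Vec<u8> {
    n.to_le_bytes().to_vec()
}

fn main() {
    // Even the length prefix of a 300-element Vec<u8> disagrees between schemes:
    assert_eq!(uleb128(300), vec![0xac, 0x02]);
    assert_eq!(u64_le(300), vec![0x2c, 0x01, 0, 0, 0, 0, 0, 0]);
    // A decoder expecting one framing misreads the other from byte one.
}
```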
3.2 Trust Model Inconsistencies
┌───────────────────────┬──────────────────────────────┬─────────────────────────────────┐
│ Path                  │ Trust Level                  │ Key Source                      │
├───────────────────────┼──────────────────────────────┼─────────────────────────────────┤
│ Operator tls_proof.rs │ Strict: explicit trusted key │ TLSN_TRUSTED_NOTARY_KEY env var │
├───────────────────────┼──────────────────────────────┼─────────────────────────────────┤
│ WASM tlsn.rs          │ Permissive: embedded key     │ Presentation itself             │
├───────────────────────┼──────────────────────────────┼─────────────────────────────────┤
│ verify.rs (core)      │ Strict: caller provides key  │ Function parameter              │
└───────────────────────┴──────────────────────────────┴─────────────────────────────────┘
3.3 Feature Gate Matrix
┌────────────────────────┬────────────┬──────────────────────────────────────┐
│ Crate                  │ Feature    │ What it gates                        │
├────────────────────────┼────────────┼──────────────────────────────────────┤
│ newton-tls-notary      │ fixtures   │ Test CA certificates                 │
├────────────────────────┼────────────┼──────────────────────────────────────┤
│ newton-prover-operator │ tls-notary │ All TLSN verification code           │
├────────────────────────┼────────────┼──────────────────────────────────────┤
│ newton-prover-core     │ (none)     │ Rego integration is always available │
└────────────────────────┴────────────┴──────────────────────────────────────┘
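The gating pattern in the matrix — all TLSN code behind a cargo feature with a clean fallback — looks like this in miniature (illustrative shape only; the return type is simplified from the real operator crate, and the body is a stub):

```rust
// Sketch of the #[cfg(feature = "tls-notary")] pattern: the real path exists
// only when the feature is enabled, with a fallback that always compiles.
#[cfg(feature = "tls-notary")]
fn fetch_and_verify_tls_proof(_proof_cid: &str) -> Option<String> {
    // Real path: IPFS fetch, bincode decode, notary-key verification.
    unimplemented!()
}

#[cfg(not(feature = "tls-notary"))]
fn fetch_and_verify_tls_proof(_proof_cid: &str) -> Option<String> {
    // Feature disabled: policies simply see no TLS proof data.
    None
}

fn main() {
    // Compiled without the feature, the fallback is what runs:
    assert_eq!(fetch_and_verify_tls_proof("bafy..."), None);
}
```

Because both variants share one signature, callers never need their own cfg guards — which is why issue 15 calls this clean.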
3.4 Dependency Chain
bin/tlsn-sidecar
└── tlsn (via libs/tlsn submodule, alpha.14)
└── newton-prover-core (for hex! macro)
└── newton-prover-metrics
crates/tls-notary
└── tlsn (prove, verify, present)
└── tlsn-formats (HTTP transcript parsing)
crates/operator (feature = "tls-notary")
└── crates/tls-notary
└── newton-prover-core (fetch_from_ipfs)
crates/data-provider
└── crates/tls-notary (verify only)
└── bcs (!!!)
4. Summary of Issues for PR #379 Review
Critical (Must Fix)
┌─────┬──────────────────────────────────────────────────────────────────────────────────┬────────────────────────────────────┐
│ #   │ Issue                                                                            │ Location                           │
├─────┼──────────────────────────────────────────────────────────────────────────────────┼────────────────────────────────────┤
│ 22  │ BCS vs bincode serialization mismatch in data-provider                           │ data-provider/src/wasm/tlsn.rs:119 │
├─────┼──────────────────────────────────────────────────────────────────────────────────┼────────────────────────────────────┤
│ 17  │ Soft failure on proof verification — operator proceeds without TLS data when     │ operator/src/core.rs               │
│     │ verification fails, potentially allowing bypass                                  │                                    │
└─────┴──────────────────────────────────────────────────────────────────────────────────┴────────────────────────────────────┘
High (Should Fix)
┌─────┬────────────────────────────────────────────────────────────────────────────────────────┬──────────────────────────────┐
│ #   │ Issue                                                                                  │ Location                     │
├─────┼────────────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────┤
│ 1   │ /session and /verifier endpoints unauthenticated — session pool exhaustion DoS        │ sidecar/src/main.rs          │
├─────┼────────────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────┤
│ 3   │ Proxy session binding has no crypto proof — session ID guessable (UUID v4, but no MAC) │ sidecar/src/session.rs       │
├─────┼────────────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────┤
│ 7   │ Sidecar verifier doesn't produce attestations — flow unclear for production use       │ sidecar/src/verifier_core.rs │
├─────┼────────────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────┤
│ 14  │ No request-level timeouts on IPFS fetches                                              │ sidecar/src/proof.rs         │
├─────┼────────────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────┤
│ 19  │ env::set_var in tests (racy)                                                           │ operator/src/tls_proof.rs    │
├─────┼────────────────────────────────────────────────────────────────────────────────────────┼──────────────────────────────┤
│ 24  │ No CID format validation in gateway                                                    │ gateway/src/rpc/types/mod.rs │
└─────┴────────────────────────────────────────────────────────────────────────────────────────┴──────────────────────────────┘
Medium (Consider)
┌─────┬────────────────────────────────────────────────────────────┬────────────────────────────────┐
│ #   │ Issue                                                      │ Location                       │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 6   │ 300-line custom WebSocket adapter — maintenance burden     │ sidecar/src/ws_adapter.rs      │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 9   │ IPFS upload without pinning — GC could remove proofs       │ sidecar/src/proof.rs           │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 16  │ Single trusted key via env var — no rotation/multi-key     │ operator/src/tls_proof.rs      │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 18  │ Naive HTTP body parsing (no chunked/gzip support)          │ operator/src/tls_proof.rs:131  │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 21  │ WASM path trusts embedded notary key — weaker security     │ data-provider/src/wasm/tlsn.rs │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 23  │ proof_cid "not supported in async mode" but no enforcement │ gateway/src/rpc/types/mod.rs   │
├─────┼────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 26  │ Proof freshness relies on notary-set timestamp             │ Rego policies                  │
└─────┴────────────────────────────────────────────────────────────┴────────────────────────────────┘
Low / Observations
┌─────┬─────────────────────────────────────────────────────────────────┬────────────────────────────────┐
│ #   │ Issue                                                           │ Location                       │
├─────┼─────────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 4   │ CORS fully open (fine for localhost, not for shared deployment) │ sidecar/src/main.rs            │
├─────┼─────────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 8   │ Session state in-memory only — no HA/clustering                │ sidecar/src/session.rs         │
├─────┼─────────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 10  │ eyre in non-entry-point modules                                 │ Multiple sidecar files         │
├─────┼─────────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 13  │ Global rate limiting not yet enforced                           │ sidecar/src/config.rs          │
├─────┼─────────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 20  │ CID multihash only supports SHA2-256                            │ data-provider/src/wasm/tlsn.rs │
├─────┼─────────────────────────────────────────────────────────────────┼────────────────────────────────┤
│ 27  │ E2E tests are all mock-based — no real MPC-TLS coverage        │ integration-tests/             │
└─────┴─────────────────────────────────────────────────────────────────┴────────────────────────────────┘
5. What's Working Well
- Compile-time trust boundary for test vs production crypto providers (#[cfg(feature = "fixtures")])
- OOM protection everywhere — take() on unbounded reads, MAX_PROOF_SIZE constants
- SSRF defense in the sidecar — DNS rebinding guard, blocked CIDRs, session-bound proxy, byte quotas, concurrent proxy limits
with RAII guards
- Selective disclosure in present.rs — auto-redacts sensitive headers, supports JSON path-based field selection
- Feature gating — all TLSN code is behind the tls-notary feature flag, clean #[cfg] fallbacks
- Typed errors in verifier_core.rs and egress.rs — proper thiserror enums
- CID multihash re-verification in data-provider — defends against malicious IPFS gateways
- Rate limiting with the governor crate — per-IP DashMap-backed limiter with configurable burst
- Session capacity limits — bounded at max_sessions with rejection when full
Addressing Your Points
- Two Verification Paths — Design Intent vs. Implementation
You're absolutely right. The desired design is:
A single verification path through a Newton Rego extension (in libs/regorus), analogous to newton.crypto.ecdsa_recover_signer.
Something like:
Desired: single path — verify MPC-TLS proof inside Rego policy logic
tls_data := newton.crypto.tlsn_verify(input.proof_cid)
tls_data.server_name == "api.x.com"
tls_data.response.followers_count >= 1000
What @sigrid-magiclabs implemented instead is three separate paths:
- crates/operator/src/tls_proof.rs — Operator fetches + verifies before Rego eval, injects as data.tls_proof
- crates/data-provider/src/wasm/tlsn.rs — WASM host function fetches + verifies, returns to plugin
- Rego policies expect pre-verified data in data.tlsn_verified, data.tlsn_data
None of these implement the intended design of a Rego extension function. The Rego policies just check pre-injected JSON — they
don't call a verification function. This is a fundamental architectural deviation. The correct approach is:
- Add a newton.crypto.tlsn_verify(proof_cid) extension to libs/regorus (like ecdsa_recover_signer)
- This extension handles IPFS fetch, deserialization, cryptographic verification, and transcript extraction — all inside the Rego engine
- Policy authors use it directly in Rego rules — single code path, no pre-injection
- Remove the operator-level and WASM-level verification paths entirely
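The single-path design above can be sketched as one function the Rego engine would expose as newton.crypto.tlsn_verify. Everything below is a stub with invented types and hardcoded return values — the real extension would fetch from IPFS, bincode-decode, and verify against the trusted notary key:

```rust
// Hypothetical shape of the single verification path: one host function
// that the Rego engine exposes to policies as newton.crypto.tlsn_verify.
struct TlsData {
    server_name: String,
    verified: bool,
}

fn tlsn_verify(proof_cid: &str) -> Result<TlsData, String> {
    if proof_cid.is_empty() {
        return Err("empty proof_cid".into());
    }
    // Stubbed: fetch → deserialize → verify_presentation(trusted_key) → extract
    Ok(TlsData { server_name: "api.x.com".into(), verified: true })
}

fn main() {
    // A policy rule like `tls_data.server_name == "api.x.com"` would consume
    // this return value directly — no data.tls_proof pre-injection needed.
    let data = tlsn_verify("bafyexample").unwrap();
    assert!(data.verified);
    assert_eq!(data.server_name, "api.x.com");
}
```

With this shape, the operator-level and WASM-level paths collapse into callers of one audited function, which is exactly what removing the pre-injection paths buys.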
2.1 Detailed Answers
- In-memory duplex
tokio::io::duplex(buffer_size) creates a pair of in-memory streams that are connected — bytes written to one end can be read from
the other. Think of it as an in-process pipe. In prove.rs, it connects the prover and verifier within the same process:
[Prover task] ←── duplex channel ──→ [Verifier task]
↓ ↓
(same tokio runtime, same process)
Limitation: In production, the prover is a browser extension (or client app) and the verifier is the Newton Sidecar server
running on a different machine. They can't share an in-memory pipe — they need a network transport (WebSocket). So
prove.rs::notarize() in its current form is testing-only infrastructure. The production flow must go: browser extension →
WebSocket → sidecar.
- Trusted notary key config
Agreed. The trusted notary key should be in a tls_notary.toml config file following the same pattern as gateway.toml and
operator.toml. Currently it's read from TLSN_TRUSTED_NOTARY_KEY env var, which is:
- Not discoverable (no config file documents it)
- Not consistent with the project's config pattern (TOML + env overlay)
- Hard to manage across deployments
The sidecar already has sidecar.toml — but the operator and gateway need to know the trusted key too (for the Rego extension
path). This should be a shared config section, or part of operator.toml / gateway.toml.
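A hypothetical shape for that shared section, following the project's TOML + env-overlay pattern (the filename tls_notary.toml comes from the discussion above, but every key name here is illustrative, not an actual schema):

```toml
# Hypothetical tls_notary.toml — key names are illustrative, not the real schema.
[tls_notary]
# A list rather than a single key, so rotation doesn't require a flag-day:
# operators accept both old and new keys during the rollover window.
trusted_notary_keys = [
    "02a1b2c3...",  # current notary key (compressed secp256k1, hex)
]
max_proof_age_secs = 300
```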
- Bincode version inconsistency
Agreed — must use bincode 2.x (bincode::serde::encode_to_vec / decode_from_slice) consistently everywhere. The README on phase3
referencing bincode::serialize() (1.x) is incorrect.
- In-memory duplex in production — combined gateway question
This is an important architectural question. Let me break down the tradeoffs:
Option A: Sidecar as separate service (current design)
┌──────────────────────────────────────────────────┬─────────────────────────────────────────────┐
│ Pros │ Cons │
├──────────────────────────────────────────────────┼─────────────────────────────────────────────┤
│ Isolation — sidecar crash doesn't affect gateway │ Extra deployment, networking, configuration │
├──────────────────────────────────────────────────┼─────────────────────────────────────────────┤
│ Can scale independently (MPC-TLS is CPU-heavy) │ WebSocket transport adds latency │
├──────────────────────────────────────────────────┼─────────────────────────────────────────────┤
│ Clear separation of concerns │ Client must know two endpoints │
├──────────────────────────────────────────────────┼─────────────────────────────────────────────┤
│ Browser extension connects directly to sidecar │ Extra auth surface area │
└──────────────────────────────────────────────────┴─────────────────────────────────────────────┘
Option B: TLS-notary integrated into gateway
┌──────────────────────────────────────────────────────┬──────────────────────────────────────────────────┐
│ Pros │ Cons │
├──────────────────────────────────────────────────────┼──────────────────────────────────────────────────┤
│ Single deployment, single endpoint │ MPC-TLS CPU load competes with gateway hot paths │
├──────────────────────────────────────────────────────┼──────────────────────────────────────────────────┤
│ Could use in-memory channels (no WebSocket overhead) │ Sidecar failure modes could cascade to gateway │
├──────────────────────────────────────────────────────┼──────────────────────────────────────────────────┤
│ Shared auth, rate limiting, monitoring │ Increases gateway binary size and complexity │
├──────────────────────────────────────────────────────┼──────────────────────────────────────────────────┤
│ Client only needs gateway URL │ Harder to scale the MPC-TLS piece independently │
└──────────────────────────────────────────────────────┴──────────────────────────────────────────────────┘
My assessment: The MPC-TLS protocol is CPU-intensive and long-running (seconds, not milliseconds). The gateway targets sub-100ms
RPC responses. Combining them risks violating the gateway's latency targets. Keep them separate, but:
- Share the same config patterns (TOML files)
- Share the same auth infrastructure (already done via shared api_keys table)
- The gateway should be able to proxy/redirect to the sidecar if needed
Even with separation, in-memory duplex cannot be used in production because the prover (browser) is always on a different
machine. The duplex is fundamentally test infrastructure.
2.2 Sidecar Issues
- Security concerns — agreed, all valid for PR review.
- ws_adapter.rs — blindly AI-generated code
I'll be direct in the review. This is a 300-line reimplementation of functionality that already exists in the ecosystem. The
file:
- Reimplements WebSocket upgrade handshake (SHA-1 signing of Sec-WebSocket-Key)
- Reimplements Stream/Sink for WebSocket frames
- Reimplements FromRequestParts for extracting upgrade requests
- Reimplements Message type with conversion to/from tungstenite
The stated reason (needing futures::AsyncRead/AsyncWrite for TLSNotary's Session::new()) is legitimate, but the solution should
be a thin adapter around an existing crate, not a ground-up reimplementation. The ws_stream_tungstenite crate (already a
dependency!) provides exactly this bridge. The comment at the top says "Adapted from tlsn-extension's axum_websocket.rs" — which
means this was copy-pasted from the TLSNotary example code, likely via AI, without evaluating whether a simpler approach exists.
This is unacceptable for production code in this project. It introduces a large, hard-to-audit surface area for a WebSocket
upgrade — something that security-critical services should not be reimplementing.
- Signing key / attestation flow — needs clarification
The flow as implemented is unclear and likely incorrect. Let me trace through what the sidecar actually does:
- session.rs: Creates a Session, calls handle.new_verifier() → commit() → accept() → run() → verify() → accept()
- verifier_core.rs: Returns (DnsName, PartialTranscript) from VerifierOutput
- Session handler extracts transcript data, maps reveal ranges, sends results
What's missing: The sidecar never builds an Attestation. It never signs anything. The VerifierOutput contains a verified
transcript, but there's no attestation document that the prover can use to create a Presentation.
Compare with prove.rs::run_verifier() which:
- Creates a Secp256k1Signer from a signing key
- Builds an AttestationConfig
- Calls Attestation::builder().accept_request(request).build(&provider)
- Sends the signed attestation back to the prover
The sidecar's verifier_core.rs does none of this. It runs the MPC-TLS protocol but doesn't produce the attestation that makes the
proof useful for operators. This is a critical gap that makes the entire sidecar flow incomplete.
This needs to be called out explicitly — the author needs to clarify the intended production attestation flow.
4-5) Session state — HashMap + Mutex
HashMap<String, SessionData> behind tokio::sync::Mutex is problematic for a concurrent WebSocket server:
- Every session creation, lookup, and cleanup acquires a full exclusive lock
- WebSocket handlers hold the lock while doing HashMap operations
- The reaper task acquires the lock every 60 seconds and iterates all entries
Better alternatives from the gateway codebase:
┌───────────────────────┬─────────────────────────────────────────┬──────────────────────────────────┐
│ Data Structure │ When to Use │ Example in Gateway │
├───────────────────────┼─────────────────────────────────────────┼──────────────────────────────────┤
│ DashMap<K, V> │ High-contention concurrent reads/writes │ Operator registry, cache │
├───────────────────────┼─────────────────────────────────────────┼──────────────────────────────────┤
│ RwLock<HashMap<K, V>> │ Read-heavy, write-light │ Config state │
├───────────────────────┼─────────────────────────────────────────┼──────────────────────────────────┤
│ Mutex<HashMap<K, V>> │ Low concurrency only │ ← current sidecar (wrong choice) │
└───────────────────────┴─────────────────────────────────────────┴──────────────────────────────────┘
For the sidecar, DashMap<String, SessionData> is the right choice — session creation and lookup are independent operations on
different keys that don't need to serialize.
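To see why a single Mutex serializes independent sessions while DashMap does not, here is a std-only sketch of the sharding idea DashMap is built on (its real implementation differs, but the contention argument is the same): keys hash to shards, and operations on different shards take different locks.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::hash::{Hash, Hasher};
use std::sync::Mutex;

/// Minimal sharded map. Two sessions whose keys land in different shards
/// lock different Mutexes and never contend — unlike Mutex<HashMap>, where
/// every create/lookup/reap takes the one global lock.
struct ShardedMap<V> {
    shards: Vec<Mutex<HashMap<String, V>>>,
}

impl<V> ShardedMap<V> {
    fn new(n: usize) -> Self {
        Self { shards: (0..n).map(|_| Mutex::new(HashMap::new())).collect() }
    }

    fn shard(&self, key: &str) -> &Mutex<HashMap<String, V>> {
        let mut h = DefaultHasher::new();
        key.hash(&mut h);
        &self.shards[(h.finish() as usize) % self.shards.len()]
    }

    fn insert(&self, key: String, val: V) {
        self.shard(&key).lock().unwrap().insert(key, val);
    }

    fn get_clone(&self, key: &str) -> Option<V>
    where
        V: Clone,
    {
        self.shard(key).lock().unwrap().get(key).cloned()
    }
}

fn main() {
    let sessions: ShardedMap<&str> = ShardedMap::new(16);
    sessions.insert("session-a".into(), "prover-1");
    sessions.insert("session-b".into(), "prover-2");
    println!("{:?}", sessions.get_clone("session-a")); // Some("prover-1")
}
```

Swapping the sidecar's Mutex<HashMap<String, SessionData>> for DashMap<String, SessionData> gets this behavior without hand-rolling shards, and the reaper can iterate shard-by-shard instead of freezing the whole map.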
- IPFS for proof storage — why?
Good question. The IPFS-based flow exists because:
- MPC-TLS proofs are large (~100KB-1MB serialized presentations). Putting them directly in RPC request payloads or on-chain is impractical.
- Content-addressable storage: the CID is a hash of the proof. Operators can independently fetch the same proof by CID and verify it matches. No trust in the delivery channel is needed.
- TLSNotary ecosystem convention: the upstream TLSNotary project uses IPFS for proof distribution.
However, your intuition is valid. An alternative flow could be:
- Client uploads proof directly to gateway via RPC (e.g., newt_uploadTlsProof)
- Gateway stores it (Redis, S3, or even PostgreSQL for bounded sizes)
- Gateway assigns a reference ID and passes it to operators
- Operators fetch from gateway, not IPFS
This would eliminate the IPFS dependency and the pinning problem. The tradeoff is that the gateway becomes the storage layer
(centralization), but Newton already centralizes through the gateway for privacy data (newt_uploadEncryptedData). The same
pattern could work here.
Recommendation: Support both — direct upload to gateway (primary, for simplicity) and IPFS CID reference (optional, for
decentralized use cases). The IPFS path needs pinning regardless.
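The reason the delivery channel is interchangeable in both variants is content addressing: the operator re-hashes whatever bytes it fetched and compares against the reference it was given. A std-only sketch of that check (DefaultHasher stands in for the multihash/SHA-256 digest a real IPFS CID carries):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::Hasher;

/// Stand-in for a CID digest. Real IPFS CIDs wrap a multihash, typically
/// SHA-256; std has no SHA-256, so DefaultHasher illustrates the shape only.
fn content_id(bytes: &[u8]) -> u64 {
    let mut h = DefaultHasher::new();
    h.write(bytes);
    h.finish()
}

/// Operator-side check: whichever channel delivered `fetched` (IPFS gateway,
/// the Newton gateway, a peer), accept it only if it hashes to the reference
/// the task carried.
fn verify_fetched(fetched: &[u8], expected_id: u64) -> bool {
    content_id(fetched) == expected_id
}

fn main() {
    let proof = b"serialized tlsn presentation";
    let cid = content_id(proof);
    assert!(verify_fetched(proof, cid));
    assert!(!verify_fetched(b"tampered bytes", cid));
    println!("content check ok");
}
```

This is why a gateway-hosted reference ID loses nothing security-wise as long as the reference is (or includes) a hash of the proof bytes: the check above works identically.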
7-9) Error handling, rate limiting, timeouts — agreed, all valid.
On combining gateway + sidecar rate limiting: If we keep them separate, they should at least share the same rate limiting
infrastructure (the gateway already has governor-based limiting). If we integrate the sidecar into the gateway, this becomes
moot.
2.3 Operator Integration — agreed, all issues valid.
The operator should not be doing its own TLS proof verification. Per your design, this belongs in the Rego extension. The
operator just runs the policy engine which internally calls newton.crypto.tlsn_verify().
2.4 WASM Data Provider — same as above
The WASM data-provider path (TlsnProvider, WIT @0.2.0 tlsn interface) should also be removed in favor of the Rego extension path.
WASM plugins don't need to verify TLS proofs — the Rego engine does.
The BCS vs bincode issue is moot if this path is removed, but it still indicates the author wasn't testing their code end-to-end.
2.5 Async flow (SendTask) support
Agreed. The comment "Not supported in async mode; returns an error if provided" is incorrect. Both sync (CreateTask) and async
(SendTask) flows should support proof_cid. The proof is fetched by the Rego extension during policy evaluation regardless of
which task submission mode is used.
Consolidated Design vs. Implementation Gap
┌────────────────┬────────────────────────────────────────────┬───────────────────────────────────┬────────────────────────┐
│ Aspect │ Desired Design │ Current Implementation │ Gap │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ Verification │ Single: Rego extension │ Three paths: operator, WASM, │ Major architectural │
│ path │ newton.crypto.tlsn_verify() │ pre-injected Rego data │ deviation │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ Sidecar role │ Production MPC-TLS verifier/notary with │ Runs MPC-TLS but doesn't produce │ Critical missing │
│ │ attestation signing │ attestations │ functionality │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ Config │ TOML file per project pattern │ Env vars for trusted key │ Inconsistent │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ Serialization │ bincode 2.x everywhere │ Mixed bincode 2.x, bincode 1.x │ Multiple format │
│ │ │ docs, BCS in WASM │ mismatches │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ Async flow │ Both sync + async support proof_cid │ Async explicitly rejects it │ Design mismatch │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ IPFS │ With pinning + optional direct upload │ No pinning, IPFS-only │ Missing features │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ WebSocket │ Thin adapter or existing crate │ 300-line ground-up │ Over-engineering / AI │
│ │ │ reimplementation │ artifact │
├────────────────┼────────────────────────────────────────────┼───────────────────────────────────┼────────────────────────┤
│ Code quality │ Typed errors, project conventions │ eyre in libraries, env vars in │ Multiple convention │
│ │ │ tests, HashMap+Mutex │ violations │
└────────────────┴────────────────────────────────────────────┴───────────────────────────────────┴────────────────────────┘
❯ For TLSN_TRUSTED_NOTARY_KEY, how can this be at least on-chain verifiable?
⏺ Great question. Right now the trusted notary key is a static config value with zero on-chain binding — you're trusting whoever
configured it. Let me walk through the options from simplest to most robust.
★ Insight ─────────────────────────────────────
The core trust question: who decides which notary keys are trustworthy, and how do verifiers (operators, challengers, contracts)
agree on that set? This is analogous to how OperatorRegistry solves "which operator BLS keys are trustworthy" — by putting the
source of truth on-chain.
─────────────────────────────────────────────────
Option 1: NotaryRegistry Contract (Most Natural Fit)
Follow the same pattern as OperatorRegistry and PolicyClientRegistry:
// contracts/src/core/NotaryRegistry.sol
contract NotaryRegistry is OwnableUpgradeable {
    /// notary address => notary public key (ed25519, 32 bytes)
    mapping(address => bytes32) public notaryKeys;

    /// notary address => active status
    mapping(address => bool) public isActiveNotary;

    event NotaryRegistered(address indexed notary, bytes32 publicKey);
    event NotaryDeactivated(address indexed notary);

    function registerNotary(address notary, bytes32 publicKey) external onlyOwner {
        notaryKeys[notary] = publicKey;
        isActiveNotary[notary] = true;
        emit NotaryRegistered(notary, publicKey);
    }

    function deactivateNotary(address notary) external onlyOwner {
        isActiveNotary[notary] = false;
        emit NotaryDeactivated(notary);
    }
}
Then the verification path becomes:
flowchart LR
P[Presentation] -->|contains notary pubkey| V[Verifier]
V -->|check| R[NotaryRegistry.isActiveNotary]
R -->|key matches + active| OK[Accept]
R -->|mismatch or inactive| REJECT[Reject]
Operators read from the contract at startup (or cache with periodic refresh), replacing the env var entirely. The Rego extension
could also read it:
In newton.crypto.tlsn_verify():
1. Verify presentation signature
2. Extract notary pubkey from attestation
3. Check notary pubkey is registered on-chain via NotaryRegistry
Pros: Simple, follows existing Newton patterns, governance-controlled via contract owner (multisig).
Cons: Centralized trust (owner decides who's trusted), no economic incentive for notaries.
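On the operator side, the registry read would replace the env var with a cached key set refreshed from the contract. A std-only sketch of that check (names like NotaryKeyCache are hypothetical; the actual on-chain read would go through the project's contract bindings):

```rust
use std::collections::HashSet;

/// Hypothetical cache of NotaryRegistry state, populated at startup and
/// refreshed periodically from the contract — replacing TLSN_TRUSTED_NOTARY_KEY.
struct NotaryKeyCache {
    /// ed25519 public keys of currently active notaries (32 bytes each).
    active_keys: HashSet<[u8; 32]>,
}

impl NotaryKeyCache {
    /// Accept a presentation only if the notary key it embeds is registered
    /// and active on-chain. All operators converge on the same set.
    fn is_trusted(&self, presentation_notary_key: &[u8; 32]) -> bool {
        self.active_keys.contains(presentation_notary_key)
    }
}

fn main() {
    let registered = [7u8; 32];
    let cache = NotaryKeyCache { active_keys: HashSet::from([registered]) };
    assert!(cache.is_trusted(&registered));
    assert!(!cache.is_trusted(&[9u8; 32])); // unknown notary -> reject
    println!("registry check ok");
}
```

The same membership test is what newton.crypto.tlsn_verify() would perform in step 3 above, after verifying the presentation signature.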
Option 2: Stake-Backed Notary Registration
Extend Option 1 with economic security:
contract StakedNotaryRegistry {
    uint256 public constant MIN_NOTARY_STAKE = 1 ether;
    uint256 public constant SLASH_AMOUNT = 0.1 ether; // 10%

    struct Notary {
        bytes32 publicKey;
        uint256 stake;
        bool active;
    }

    mapping(address => Notary) public notaries;

    function registerNotary(bytes32 publicKey) external payable {
        require(msg.value >= MIN_NOTARY_STAKE, "Insufficient stake");
        notaries[msg.sender] = Notary(publicKey, msg.value, true);
    }

    /// Called when a notary signed a false attestation
    function slashNotary(
        address notary,
        bytes calldata fraudProof
    ) external {
        // Verify the fraud proof (e.g., the attested data contradicts
        // what the TLS server actually served)
        require(verifyFraudProof(notary, fraudProof), "Invalid fraud proof");
        uint256 slashAmount = notaries[notary].stake * SLASH_AMOUNT / 1 ether;
        notaries[notary].stake -= slashAmount;
        // ... transfer slash to challenger
    }
}
Pros: Economic security, permissionless notary registration, aligned incentives.
Cons: Complex fraud proof mechanism (how do you prove a notary lied about TLS data?), higher implementation cost.
Option 3: EigenLayer-Native (Notary as Operator Role)
Since Newton is already an EigenLayer AVS, the most elegant approach might be making the notary role an extension of the existing
operator set:
// Extend OperatorRegistry or add a role flag
function registerAsNotary(bytes32 notaryPublicKey) external {
    require(isRegisteredOperator(msg.sender), "Must be registered operator");
    operatorNotaryKeys[msg.sender] = notaryPublicKey;
    emit NotaryRoleAdded(msg.sender, notaryPublicKey);
}
This means:
- Notaries are operators who already have restaked ETH via EigenLayer
- Slashing for false attestations uses the existing EigenLayer slashing mechanism
- No new staking infrastructure needed
- The quorumThresholdPercentage model already handles "how many notaries must agree"
Pros: Reuses EigenLayer security, no new contracts for staking/slashing, operators already have skin in the game.
Cons: Couples notary trust to operator trust (may want independent trust sets), requires operators to run notary infrastructure.
Option 4: On-Chain Key Commitment (Minimal)
The lightest-weight option — just commit the key hash on-chain without a full registry:
// In NewtonProverTaskManager or a dedicated contract
bytes32 public trustedNotaryKeyHash;

function setTrustedNotaryKeyHash(bytes32 keyHash) external onlyOwner {
    trustedNotaryKeyHash = keyHash;
    emit TrustedNotaryKeyUpdated(keyHash);
}
Operators verify locally that keccak256(notary_pubkey) == trustedNotaryKeyHash. This is the minimum viable on-chain binding — it
doesn't prevent a bad notary, but it ensures all operators agree on which key to trust, and key rotation is visible on-chain.
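The operator-side check is one hash comparison. A std-only sketch (DefaultHasher is a loud placeholder for keccak256, which std does not provide — a real operator would use a crate such as tiny-keccak; only the shape of the check matters here):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::Hasher;

/// Placeholder for keccak256(pubkey). std has no Keccak; this illustrates
/// the commitment check, not the real digest.
fn hash_key(pubkey: &[u8; 32]) -> u64 {
    let mut h = DefaultHasher::new();
    h.write(pubkey);
    h.finish()
}

fn main() {
    // Value the owner committed on-chain via setTrustedNotaryKeyHash.
    let notary_key = [3u8; 32];
    let trusted_notary_key_hash = hash_key(&notary_key);

    // Operator-side check before accepting a presentation:
    // accept only if the embedded notary key hashes to the commitment.
    assert_eq!(hash_key(&notary_key), trusted_notary_key_hash);
    assert_ne!(hash_key(&[4u8; 32]), trusted_notary_key_hash);
    println!("commitment check ok");
}
```

Rotation is a single setTrustedNotaryKeyHash transaction, visible to every operator in the same block — the property the env var can never give.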
Recommendation
For Newton's current stage, I'd suggest Option 1 (NotaryRegistry) with a path to Option 3 (EigenLayer-native):
┌────────┬───────────────────────────────────────────────────────┬────────────┐
│ Phase │ Approach │ Complexity │
├────────┼───────────────────────────────────────────────────────┼────────────┤
│ Now │ NotaryRegistry contract, governance-controlled │ Low │
├────────┼───────────────────────────────────────────────────────┼────────────┤
│ Next │ Operators can opt-in as notaries (dual role) │ Medium │
├────────┼───────────────────────────────────────────────────────┼────────────┤
│ Future │ Full stake-backed notary set with EigenLayer slashing │ High │
└────────┴───────────────────────────────────────────────────────┴────────────┘
The immediate win is replacing the env var with an on-chain registry read. The Rego extension (newton.crypto.tlsn_verify()) would
then:
- Extract notary public key from the TLSNotary attestation
- Call NotaryRegistry.isActiveNotary(notaryAddress) + verify key match
- Only accept presentations signed by on-chain registered notaries
This makes the trust anchor transparent, auditable, and governable — instead of hidden in a TOML config that each operator sets
independently.
❯ 1. I don't understand this: [Pasted text #1 +7 lines] — the desired design is that the MPC-TLS proof is fetched and verified from the newton_crypto Rego extension (lib/regorus), similar to how the newton_crypto extension newton.crypto.ecdsa_recover_signer checks an ECDSA signature in Rego, so that the redacted MPC-TLS data can be verified/used in the policy logic. Why are there two different paths? What different use cases?
2.1 1) What is in-memory duplex? It says "Production flow requires WebSocket (sidecar)". What is the limitation of in-memory duplex?
2) Verify against trusted notary key --> this notary key should be in the config TOML file (tls_notary.toml) in the @crates/tls-notary crate, following the pattern of other binary crates like @crates/gateway/gateway.toml or @crates/operator/operator.toml, etc.
3) Serialization: uses the bincode 2.x serde API (bincode::serde::encode_to_vec / decode_from_slice) — but on the phase3 branch, the README shows bincode::serialize() (the bincode 1.x API). This is a version inconsistency between what's on main (bincode 2.x) and what's documented on later branches. --> This should be fixed. Use version 2.x.
4) In-memory duplex: prove.rs uses tokio::io::duplex(MAX_RECV_BUFFER) for prover ↔ verifier. This only works for testing. The sidecar uses WebSocket for the real flow. --> This should definitely be fixed. (One question: if the tls-notary RPC server is "combined" with the existing gateway service, what are the pros vs. cons? In that combined case, can we use in-memory duplex in production as well, or should we keep the separation between the gateway and the tls-notary server?)
2.2 1) All the security concerns are valid and should be addressed in PR #379.
2) "ws_adapter.rs is a full reimplementation of axum's WebSocket." --> This makes it look like the author blindly relied on AI for coding without understanding what is happening. This is absolutely not acceptable. Clearly convey this message.
3) "[Pasted text #2 +4 lines]" — this definitely needs clarification/correction. Another example of the author blindly relying on AI without knowing what is going on.
4-5) "8. Session state is in-memory HashMap behind a Mutex. No persistence, no clustering. Sidecar must be single-instance per client. Session reaper runs every 60 seconds." --> HashMap behind a Mutex raises performance/throughput/latency concerns. Refer to @crates/gateway/, which uses DashMap, RwLock, etc. depending on the optimal data structure for concurrent, non-blocking, yet consistent-enough storage, and evaluate whether HashMap with Mutex is the right choice.
6) "9. IPFS integration has no pinning. upload_to_ipfs calls /api/v0/add but doesn't pin. Garbage collection could remove proofs before operators verify them." --> Valid, this should be addressed. One question: why do the proofs need to be uploaded to IPFS and later retrieved by CID? Why can't the client/attestor just send the proof directly to the gateway? Is this just how MPC-TLS works?
7) All error-handling concerns are valid and should be addressed. AGAIN, another example of blindly trusting AI (ex: data_provider/src/wasm/tlsn.rs uses BCS for deserialization while the rest of the codebase uses bincode. verify_presentation_bytes calls bcs::from_bytes() — this is a serialization format mismatch. If the extension serializes with bincode and the data provider deserializes with BCS, it will fail.)
8) Ditto for "3. Rate limiter uses governor with DashMap. Per-IP limiting is implemented. Global limiting is marked 'reserved, not yet enforced' in the config comments." --> The gateway already implements a rate limiter; should we consider combining gateway and tls-notary (integrating tls-notary directly into the gateway RPC server)?
9) "14. No connection timeout on IPFS requests. upload_to_ipfs and get_proof_handler use the shared reqwest::Client without explicit timeouts on individual requests." --> Valid, should be addressed.
2.3 All issues in section 2.3 are valid and should be addressed for sure.
2.4 Just as the operator verifying the TLS proof is not per design, the same applies to WASM as well. We want the custom Newton Rego extension as the single path for TLS-proof verification inside the extension logic, giving a yes or no for the Rego statement, which adds up to the policy evaluation as a whole. Lastly, yes: "BCS vs bincode inconsistency (critical). tlsn.rs:119 uses bcs::from_bytes() but the core library serializes with bincode::serde::encode_to_vec(). These are incompatible formats. This will cause deserialization failures at runtime." --> This issue must be addressed.
2.5 "SendTaskRequest says 'Not supported in async mode; returns an error if provided'" --> This seems incorrect. The SendTask flow (async flow) should also support tls-notary proofs, just like the CreateTask flow.
======== Above is my high-level review of the current state of the TLS-notary project per Newton Protocol.