@johnlindquist
Created February 24, 2026 15:50
Turborepo Performance Optimization Plans (17 research iterations with Oracle/GPT-5.2 sessions inlined)

Current State

Turborepo has already landed the big algorithmic win — topological-wave parallel task hashing (commit b3c0f46da8) — but several implementation details are now the bottleneck:

  • Parallelism is re-serialized by shared locks + deep clones: TaskHashTrackerState uses one RwLock guarding 6 independent HashMaps (crates/turborepo-task-hash/src/lib.rs:226-242), so any insert_hash() write blocks all concurrent reads. The visitor's wave-hashing funnels results through Arc<Mutex<HashMap>> with per-wave serialization (crates/turborepo-lib/src/task_graph/visitor/mod.rs:233-303).

  • Allocation-heavy hot loops dominate hashing & dispatch: DetailedMap (3 nested HashMaps) is deep-cloned on every env_vars() read (line 622). EnvironmentVariableMap is cloned per task at line 388. Env-pattern regexes are recompiled repeatedly (4 per task in hashable_task_env) and glob patterns are recompiled O(packages) times. SCM hashing allocates 10,000+ hex Strings inside parallel loops.

  • I/O paths leave easy wins on the table: Cache archives use zstd level 0, single-thread encode, sequential file reads, and an 8KB restore buffer.

  • Already optimized: Lockfile transitive closure (3.3x faster in recent commits). Workspace discovery is async/parallel. globwalk internal regexes cached with OnceLock. CompiledWildcards exists for builtins pass-through path.

  • Existing tracing infrastructure: Spans like precompute_task_hashes, task_cache_new, exec_context_new, visit_recv_wait exist. TURBO_LOG_VERBOSITY=debug enables filtering.

Required Changes

P0 — Largest wins, lowest ambiguity

P0.1: Split TaskHashTrackerState lock + remove deep clones

Files: crates/turborepo-task-hash/src/lib.rs:226-242, 585-661

| Before | After |
| --- | --- |
| state: Arc<RwLock<TaskHashTrackerState>> (single lock, 6 maps) | 6 independent DashMap fields or per-field RwLock |
| env_vars() returns state.get(id).cloned() — deep clone of 3 HashMaps | Store Arc<DetailedMap>, return cheap Arc::clone() |
| insert_hash() write-locks all 6 maps | Each map locked independently |
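The shape of the split can be sketched with standard-library types (the plan's preferred container is DashMap; per-field RwLock is shown here, and all field and method names are hypothetical):

```rust
use std::{
    collections::HashMap,
    sync::{Arc, RwLock},
};

// Hypothetical sketch: each map gets its own lock, and env maps are shared
// via Arc so env_vars() returns a pointer copy instead of a deep clone.
#[derive(Default)]
struct TrackerState {
    hashes: RwLock<HashMap<String, String>>,
    env_vars: RwLock<HashMap<String, Arc<HashMap<String, String>>>>,
    // ...the remaining maps become independent fields the same way
}

impl TrackerState {
    fn insert_hash(&self, task: String, hash: String) {
        // Only `hashes` is write-locked; concurrent env_vars readers proceed.
        self.hashes.write().unwrap().insert(task, hash);
    }

    fn env_vars(&self, task: &str) -> Option<Arc<HashMap<String, String>>> {
        // Arc::clone is a refcount bump, not a clone of the map contents.
        self.env_vars.read().unwrap().get(task).map(Arc::clone)
    }
}
```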

Verify perf: TURBO_LOG_VERBOSITY=debug → compare precompute_task_hashes span duration before/after on a large monorepo. Run cargo run -p turborepo -- run build --dry=json twice and diff hashes to confirm stability.

Verify correctness: cargo test -p turborepo-task-hash && cargo test -p turborepo-lib

Estimated speedup: 15-30% on parallel hash computation


P0.2: Replace wave Mutex<HashMap> with concurrent map

Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs:233-303

| Before | After |
| --- | --- |
| results: Arc<Mutex<HashMap<TaskId, (String, EnvironmentVariableMap)>>> | DashMap<TaskId, (String, Arc<EnvironmentVariableMap>)> |
| Per-wave: collect results → lock mutex → insert all → unlock | Direct results.insert() inside par_iter() closure |
| .clone() at line 388 deep-copies env map | Arc::clone() — pointer copy |
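DashMap removes the single-mutex bottleneck by sharding the map internally. A minimal standard-library illustration of that idea (names hypothetical; the real change would just use the dashmap crate):

```rust
use std::{
    collections::{hash_map::DefaultHasher, HashMap},
    hash::{Hash, Hasher},
    sync::Mutex,
};

// Sharded map sketch: concurrent writers hash to different shards, so they
// rarely contend — the property DashMap provides for per-wave results.
struct ShardedResults {
    shards: Vec<Mutex<HashMap<String, String>>>,
}

impl ShardedResults {
    fn new(n_shards: usize) -> Self {
        Self {
            shards: (0..n_shards).map(|_| Mutex::new(HashMap::new())).collect(),
        }
    }

    fn shard(&self, key: &str) -> &Mutex<HashMap<String, String>> {
        let mut h = DefaultHasher::new();
        key.hash(&mut h);
        &self.shards[h.finish() as usize % self.shards.len()]
    }

    fn insert(&self, task: String, hash: String) {
        // Locks one shard, not the whole map.
        self.shard(&task).lock().unwrap().insert(task, hash);
    }

    fn get(&self, task: &str) -> Option<String> {
        self.shard(task).lock().unwrap().get(task).cloned()
    }
}
```

This only illustrates why per-shard locking beats the per-wave mutex convoy; the actual patch should use DashMap directly rather than hand-rolling shards.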

Verify perf: Watch precompute_task_hashes and visit_recv_wait spans. Expect lower tail latency on large waves. Add debug assertion: precomputed.len() == engine.tasks().len().

Verify correctness: cargo test -p turborepo-lib && cargo test -p turborepo-engine

Estimated speedup: 10-25% on task dispatch


P0.3: Precompile env wildcard regexes for hashable env path

Files: crates/turborepo-env/src/lib.rs:273-319, 354-383, 493-496

| Before | After |
| --- | --- |
| hashable_task_env() calls wildcard_map_from_wildcards() → 4 RegexBuilder::new().build() per task | Pre-compile in TaskHasher::new() → compiled_task_env: HashMap<TaskName, CompiledWildcards> |
| get_global_hashable_env_vars() recompiles at lines 493, 496 | Pre-compile DEFAULT_ENV_VARS once, store on TaskHasher |
| CompiledWildcards (lines 173-225) exists but only used for pass-through path | Extend to all env wildcard matching paths |
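The compile-once idea looks roughly like this, assuming a process-wide cache keyed by pattern string (compile_wildcard is a hypothetical stand-in for the real RegexBuilder call):

```rust
use std::{
    collections::HashMap,
    sync::{Mutex, OnceLock},
};

// Stand-in for regex compilation (the real code builds a RegexBuilder);
// here we just produce the anchored pattern text.
fn compile_wildcard(pattern: &str) -> String {
    format!("^{}$", pattern.replace('*', ".*"))
}

// Process-wide cache: each distinct wildcard is compiled exactly once,
// instead of 4 compilations per task per run.
fn cached_compile(pattern: &str) -> String {
    static CACHE: OnceLock<Mutex<HashMap<String, String>>> = OnceLock::new();
    let cache = CACHE.get_or_init(|| Mutex::new(HashMap::new()));
    cache
        .lock()
        .unwrap()
        .entry(pattern.to_string())
        .or_insert_with(|| compile_wildcard(pattern))
        .clone()
}
```

The plan's preferred variant stores CompiledWildcards on TaskHasher at construction time rather than in a global, which avoids the lock entirely; the cache above is the more general fallback.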

Verify perf: Add tracing span around env hashing to confirm it drops from dominating CPU. Count regex compilations per run with a debug counter.

Verify correctness: cargo test -p turborepo-env && cargo test -p turborepo-task-hash. Add unit test comparing old-path vs compiled-path output for same env + patterns.

Estimated speedup: Eliminates O(tasks × 4) regex compilations per build


P0.4: Cache archive fast-path improvements

Files: crates/turborepo-cache/src/cache_archive/create.rs:115,151,175-177 and restore.rs:49

| Before | After |
| --- | --- |
| zstd::Encoder::new(writer, 0) | zstd::Encoder::new(writer, 3) (configurable) |
| Restore buffer: [0; 8192] (8KB) | [0; 65536] (64KB) |
| Single-threaded zstd encoder | Explore zstd multi-threaded feature flag |
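The restore-buffer change is the simplest of the three: a larger stack buffer cuts read/write iterations 8x. A generic sketch of the copy loop (function name hypothetical):

```rust
use std::io::{self, Read, Write};

// Copy loop with a 64 KiB buffer (the current restore path uses 8 KiB),
// reducing the number of read/write round-trips for large cached outputs.
fn copy_restore<R: Read, W: Write>(mut src: R, mut dst: W) -> io::Result<u64> {
    let mut buf = [0u8; 64 * 1024];
    let mut total = 0u64;
    loop {
        let n = src.read(&mut buf)?;
        if n == 0 {
            break; // EOF
        }
        dst.write_all(&buf[..n])?;
        total += n as u64;
    }
    Ok(total)
}
```

The compression-level change is a one-argument edit to the encoder constructor; it should be plumbed through config rather than hard-coded, per the Ripple Effects section.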

Verify perf: Measure cache artifact sizes and restore times. For remote cache: compare upload/download times.

Verify correctness: cargo test -p turborepo-cache. Add integration test: create archive → restore → assert file contents match.

Estimated speedup: 30-50% cache size reduction; faster restores with larger buffers


P1 — Big wins, slightly more surface area

P1.1: Cache compiled glob patterns across packages

Files: crates/turborepo-scm/src/package_deps.rs:221,226,295,318,364-366

Change: Add OnceLock<DashMap<String, CompiledGlob>> cache keyed by pattern string. Plumb pre-compiled patterns to globwalk() call sites.

Verify: Add glob_compilations_total tracing counter. In 100+ package repo, confirm count collapses to ~unique(patterns).

Test: cargo test -p turborepo-scm && cargo test --workspace


P1.2: SCM hash allocation removal + double-walk merge

Files: hash_object.rs:69, repo_index.rs:162,242,255,246-247,265-266, ls_tree.rs:74, manual.rs:119-202

Change: Represent hashes as [u8; 20] or [u8; 40] internally; format to hex String only at UI/serialization boundaries. Merge the two manual.rs walks into one traversal when include_default_files=true. Replace format! range boundaries with reusable buffers.
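Keeping hashes as raw bytes internally and hex-encoding once at the serialization boundary can be sketched as (helper name hypothetical):

```rust
use std::fmt::Write;

// Internally hashes stay [u8; 20]; this formatting runs only when a hash is
// printed or serialized, not inside the parallel hashing loops.
fn hex_at_boundary(hash: &[u8; 20]) -> String {
    let mut s = String::with_capacity(40);
    for byte in hash {
        write!(s, "{byte:02x}").expect("writing to a String cannot fail");
    }
    s
}
```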

Verify: Track "hash string allocations" with debug counter. Validate hashes match previous implementation for fixed repo state.

Test: cargo test -p turborepo-scm && cargo test --workspace


P1.3: Engine builder parallel config loading

Files: crates/turborepo-engine/src/builder.rs:162-166, 173-193, 991-1021

Change: Convert serial workspace loops to rayon::par_iter(). Sort results after parallel load for deterministic ordering.
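The shape of this change, sketched with std::thread::scope (the plan proposes rayon's par_iter, which is the same pattern; parse_config is a hypothetical stand-in for loading one workspace's turbo.json):

```rust
use std::thread;

// Hypothetical stand-in for reading + parsing a workspace's turbo.json.
fn parse_config(workspace: &str) -> (String, usize) {
    (workspace.to_string(), workspace.len())
}

fn load_configs(workspaces: &[&str]) -> Vec<(String, usize)> {
    // Load all workspace configs in parallel...
    let mut results: Vec<_> = thread::scope(|s| {
        let handles: Vec<_> = workspaces
            .iter()
            .map(|w| s.spawn(move || parse_config(w)))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    });
    // ...then sort so downstream ordering (and error reporting) stays
    // deterministic, as the Ripple Effects section requires.
    results.sort_by(|a, b| a.0.cmp(&b.0));
    results
}
```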

Verify: Add spans around config discovery/parse/validation. In large monorepos, expect near-linear speedup.

Test: cargo test -p turborepo-engine && cargo test --workspace


P1.4: Change mapper prefix index

Files: crates/turborepo-repository/src/change_mapper/package.rs:60-84

Change: Replace O(n×m) linear scan with sorted prefix vec + binary search or radix trie (note: radix_trie is in workspace deps). Build index from package paths once in DefaultPackageChangeMapper::new().
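A sorted-prefix-vector sketch of the index (all names hypothetical; real code must also handle path normalization, separators, and the symlink edge cases noted below):

```rust
// Sorted prefix index: built once in the mapper constructor, then each file
// lookup binary-searches instead of scanning every package.
struct PrefixIndex {
    // (package path ending in '/', package name), sorted by path.
    entries: Vec<(String, String)>,
}

impl PrefixIndex {
    fn new(mut packages: Vec<(String, String)>) -> Self {
        for (path, _) in &mut packages {
            if !path.ends_with('/') {
                path.push('/');
            }
        }
        packages.sort();
        Self { entries: packages }
    }

    fn detect(&self, file: &str) -> Option<&str> {
        // Every candidate prefix sorts <= the file path; walk back from the
        // insertion point so nested (longer) packages win. A radix trie
        // bounds this backward scan fully, per the plan's alternative.
        let idx = self.entries.partition_point(|(p, _)| p.as_str() <= file);
        self.entries[..idx]
            .iter()
            .rev()
            .find(|(p, _)| file.starts_with(p.as_str()))
            .map(|(_, name)| name.as_str())
    }
}
```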

Verify: Add detect_package_comparisons_total counter. Run --affected with TURBO_LOG_VERBOSITY=trace to confirm changed-file classification remains correct.

Test: cargo test -p turborepo-repository && cargo test --workspace


P2 — Smaller wins, do after P0/P1

P2.1: Visitor micro-allocations (TaskId cloning)

  • Carry TaskId by reference in wave closures; store once per task and reuse
  • cargo test -p turborepo-lib

P2.2: Lockfile DashMap pre-sizing

  • crates/turborepo-lockfiles/src/lib.rs:135: DashMap::new() → DashMap::with_capacity(workspaces.len() * 50)
  • cargo test -p turborepo-lockfiles

Ripple Effects

  • Hash/Env sharing via Arc reduces clones but can increase retained memory if references outlive expected scope. Audit end-of-run cleanup paths.
  • DashMap introduces nondeterministic iteration order. Any code serializing map contents (dry-run JSON, run summaries) must sort explicitly before output.
  • Parallel engine config loading can produce nondeterministic error messages. Sort validation results by package path.
  • SCM hash representation change ([u8; 40] instead of String) ripples through call sites. Convert at UI/JSON boundaries only. Add golden tests.
  • Cache compression level 3 trades slightly more CPU for smaller artifacts. Make configurable. Add heuristic for tiny caches (level 1).

What This Unlocks

  • Near-linear scaling of task-hash computation: Once RwLock contention + deep clones are removed, the topological-wave parallelism can finally scale with cores.
  • Faster incremental/watch loops: The prefix index for detect_package() directly improves --affected and watch-mode responsiveness — the paths users feel most.
  • More efficient remote caching: Smaller artifacts + faster restore makes cache-hit runs meaningfully faster, not just "no work executed".
  • Performance observability: With existing spans (precompute_task_hashes, task_cache_new, exec_context_new) and TURBO_LOG_VERBOSITY controls, you can build a stable before/after perf dashboard.

Effort & Risk

| Change | Effort | Risk | Notes |
| --- | --- | --- | --- |
| P0.1 Lock split + Arc(DetailedMap) | Medium | Medium | Biggest parallelism blocker. Risk: lifetime/ownership bugs, nondeterministic iteration |
| P0.2 Wave Mutex → DashMap | Low-Medium | Low | Straightforward swap. Invariant: every task has exactly one entry after precompute |
| P0.3 Precompile env regexes | Medium | Low | Mostly plumbing. Compare old/new outputs in tests |
| P0.4 Cache archive tweaks | Low | Low | Buffers + compression defaults easy; failures obvious (restore mismatch) |
| P1.1 Glob cache | Medium | Medium | Must respect platform-specific glob behavior |
| P1.2 SCM allocations + walk merge | Medium-High | Medium | Hash-string refactor ripples through many call sites; needs golden tests |
| P1.3 Engine builder parallelism | Medium | Medium | Nondeterministic validation ordering; requires explicit sorting |
| P1.4 Change mapper prefix index | Medium | Low-Medium | Edge cases: nested packages, symlinks, path normalization |
| P2.x | Low | Low | Only after P0/P1 land |

Global verification command for any change: cargo test --workspace (full test suite). For perf regression detection: TURBO_LOG_VERBOSITY=debug cargo run -p turborepo -- run build --dry=json on a representative monorepo, comparing span timings and hash stability across runs.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

  • 09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
  • db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
  • d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
  • e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
  • 4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
  • 75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
  • b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
  • 69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading. Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs. Maturity: stable. Finding: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression. Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs. Maturity: stable. Finding: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching. Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs. Maturity: stable. Finding: Glob patterns are recompiled from scratch on every globwalk() call with zero caching — in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction. Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs. Maturity: stable. Finding: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size.
  • lockfile-transitive-closure. Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs. Maturity: stable. Finding: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation. Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs. Maturity: stable. Finding: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention. Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs. Maturity: stable. Finding: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.
  • task-graph-visitor-allocations. Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs, crates/turborepo-lib/src/task_graph/visitor/exec.rs. Maturity: stable. Finding: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run.
  • change-mapper-linear-package-detection. Files: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs. Maturity: stable. Finding: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages.

Exploration

AREA: change-mapper-linear-package-detection FILES: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs MATURITY: stable RELEVANCE: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages NOTES: (unspecified)


Selected Code Context

===== FILE: crates/turborepo-repository/src/change_mapper/package.rs =====

use thiserror::Error;
use turbopath::{AnchoredSystemPath, AnchoredSystemPathBuf};
use wax::{BuildError, Program};

use crate::{
    change_mapper::{AllPackageChangeReason, PackageInclusionReason},
    package_graph::{PackageGraph, PackageName, WorkspacePackage},
    package_manager::PackageManager,
};

pub enum PackageMapping {
    /// We've hit a global file, so all packages have changed
    All(AllPackageChangeReason),
    /// This change is meaningless, no packages have changed
    None,
    /// This change has affected one package
    Package((WorkspacePackage, PackageInclusionReason)),
}

/// Maps a single file change to affected packages. This can be a single
/// package (Package), none of the packages (None), or all of the packages
/// (All).
pub trait PackageChangeMapper {
    fn detect_package(&self, file: &AnchoredSystemPath) -> PackageMapping;
}

impl<L, R> PackageChangeMapper for either::Either<L, R>
where
    L: PackageChangeMapper,
    R: PackageChangeMapper,
{
    fn detect_package(&self, file: &AnchoredSystemPath) -> PackageMapping {
        match self {
            either::Either::Left(l) => l.detect_package(file),
            either::Either::Right(r) => r.detect_package(file),
        }
    }
}

/// Detects package by checking if the file is inside the package.
///
/// Does not use the globalDependencies in turbo.json.
/// Since we don't have these dependencies, any file that is
/// not in any package will automatically invalidate all
/// packages. This is fine for builds, but less fine
/// for situations like watch mode.
pub struct DefaultPackageChangeMapper<'a> {
    pkg_dep_graph: &'a PackageGraph,
}

impl<'a> DefaultPackageChangeMapper<'a> {
    pub fn new(pkg_dep_graph: &'a PackageGraph) -> Self {
        Self { pkg_dep_graph }
    }

    fn is_file_in_package(file: &AnchoredSystemPath, package_path: &AnchoredSystemPath) -> bool {
        file.components()
            .zip(package_path.components())
            .all(|(a, b)| a == b)
    }
}

impl PackageChangeMapper for DefaultPackageChangeMapper<'_> {
    fn detect_package(&self, file: &AnchoredSystemPath) -> PackageMapping {
        for (name, entry) in self.pkg_dep_graph.packages() {
            if name == &PackageName::Root {
                continue;
            }
            if let Some(package_path) = entry.package_json_path.parent()
                && Self::is_file_in_package(file, package_path)
            {
                return PackageMapping::Package((
                    WorkspacePackage {
                        name: name.clone(),
                        path: package_path.to_owned(),
                    },
                    PackageInclusionReason::FileChanged {
                        file: file.to_owned(),
                    },
                ));
            }
        }

        PackageMapping::All(AllPackageChangeReason::GlobalDepsChanged {
            file: file.to_owned(),
        })
    }
}

pub struct DefaultPackageChangeMapperWithLockfile<'a> {
    base: DefaultPackageChangeMapper<'a>,
}

impl<'a> DefaultPackageChangeMapperWithLockfile<'a> {
    pub fn new(pkg_dep_graph: &'a PackageGraph) -> Self {
        Self {
            base: DefaultPackageChangeMapper::new(pkg_dep_graph),
        }
    }
}

impl PackageChangeMapper for DefaultPackageChangeMapperWithLockfile<'_> {
    fn detect_package(&self, path: &AnchoredSystemPath) -> PackageMapping {
        // If we have a lockfile change, we consider this as a root package change,
        // since there's a chance that the root package uses a workspace package
        // dependency (this is cursed behavior but sadly possible). There's a chance
        // that we can make this more accurate by checking which package
        // manager, since not all package managers may permit root pulling from
        // workspace package dependencies
        if PackageManager::supported_managers()
            .iter()
            .any(|pm| pm.lockfile_name() == path.as_str())
        {
            PackageMapping::Package((
                WorkspacePackage {
                    name: PackageName::Root,
                    path: AnchoredSystemPathBuf::from_raw("").unwrap(),
                },
                PackageInclusionReason::ConservativeRootLockfileChanged,
            ))
        } else {
            self.base.detect_package(path)
        }
    }
}

#[derive(Error, Debug)]
pub enum Error {
    #[error(transparent)]
    InvalidFilter(#[from] BuildError),
}

/// A package detector.
///
/// It uses a global deps list to determine
/// if a file should cause all packages to be marked as changed.
/// This is less conservative than the DefaultPackageChangeMapper,
/// which assumes that any changed file that is not in a package
/// changes all packages. Since we have a list of global deps,
/// we can check against that and avoid invalidating in unnecessary cases.
pub struct GlobalDepsPackageChangeMapper<'a> {
    base: DefaultPackageChangeMapperWithLockfile<'a>,
    global_deps_matcher: wax::Any<'a>,
}

impl<'a> GlobalDepsPackageChangeMapper<'a> {
    pub fn new<S: wax::Pattern<'a>, I: Iterator<Item = S>>(
        pkg_dep_graph: &'a PackageGraph,
        global_deps: I,
    ) -> Result<Self, Error> {
        let base = DefaultPackageChangeMapperWithLockfile::new(pkg_dep_graph);
        let global_deps_matcher = wax::any(global_deps)?;

        Ok(Self {
            base,
            global_deps_matcher,
        })
    }
}

impl PackageChangeMapper for GlobalDepsPackageChangeMapper<'_> {
    fn detect_package(&self, path: &AnchoredSystemPath) -> PackageMapping {
        match self.base.detect_package(path) {
            // Since DefaultPackageChangeMapper is overly conservative, we can check here if
            // the path is actually in globalDeps and if not, return it as
            // PackageDetection::Package(WorkspacePackage::root()).
            PackageMapping::All(_) => {
                let cleaned_path = path.clean();
                let in_global_deps = self.global_deps_matcher.is_match(cleaned_path.as_str());

                if in_global_deps {
                    PackageMapping::All(AllPackageChangeReason::GlobalDepsChanged {
                        file: path.to_owned(),
                    })
                } else {
                    PackageMapping::Package((
                        WorkspacePackage::root(),
                        PackageInclusionReason::FileChanged {
                            file: path.to_owned(),
                        },
                    ))
                }
            }
            result => result,
        }
    }
}

#[cfg(test)]
mod tests {
    use tempfile::tempdir;
    use turbopath::{AbsoluteSystemPath, AnchoredSystemPathBuf};

    use super::{DefaultPackageChangeMapper, GlobalDepsPackageChangeMapper};
    use crate::{
        change_mapper::{
            AllPackageChangeReason, ChangeMapper, LockfileContents, PackageChanges,
            PackageInclusionReason,
        },
        discovery::{self, PackageDiscovery},
        package_graph::{PackageGraphBuilder, WorkspacePackage},
        package_json::PackageJson,
        package_manager::PackageManager,
    };

#[allow(dead_code)]
pub struct MockDiscovery;

impl PackageDiscovery for MockDiscovery {
    async fn discover_packages(
        &self,
    ) -> Result<discovery::DiscoveryResponse, discovery::Error> {
        Ok(discovery::DiscoveryResponse {
            package_manager: PackageManager::Npm,
            workspaces: vec![],
        })
    }

    async fn discover_packages_blocking(
        &self,
    ) -> Result<discovery::DiscoveryResponse, discovery::Error> {
        self.discover_packages().await
    }
}

#[tokio::test]
async fn test_different_package_detectors() -> Result<(), anyhow::Error> {
    let repo_root = tempdir()?;
    let root_package_json = PackageJson::default();

    let pkg_graph = PackageGraphBuilder::new(
        AbsoluteSystemPath::from_std_path(repo_root.path())?,
        root_package_json,
    )
    .with_package_discovery(MockDiscovery)
    .build()
    .await?;

    let default_package_detector = DefaultPackageChangeMapper::new(&pkg_graph);
    let change_mapper = ChangeMapper::new(&pkg_graph, vec![], default_package_detector);

    let package_changes = change_mapper.changed_packages(
        [AnchoredSystemPathBuf::from_raw("README.md")?]
            .into_iter()
            .collect(),
        LockfileContents::Unchanged,
    )?;

    // We should return All because we don't have global deps and
    // therefore must be conservative about changes
    assert_eq!(
        package_changes,
        PackageChanges::All(AllPackageChangeReason::GlobalDepsChanged {
            file: AnchoredSystemPathBuf::from_raw("README.md")?,
        })
    );

    let turbo_package_detector =
        GlobalDepsPackageChangeMapper::new(&pkg_graph, std::iter::empty::<&str>())?;
    let change_mapper = ChangeMapper::new(&pkg_graph, vec![], turbo_package_detector);

    let package_changes = change_mapper.changed_packages(
        [AnchoredSystemPathBuf::from_raw("README.md")?]
            .into_iter()
            .collect(),
        LockfileContents::Unchanged,
    )?;

    // We only get a root workspace change since we have global deps specified and
    // README.md is not one of them
    assert_eq!(
        package_changes,
        PackageChanges::Some(
            [(
                WorkspacePackage::root(),
                PackageInclusionReason::FileChanged {
                    file: AnchoredSystemPathBuf::from_raw("README.md")?,
                }
            )]
            .into_iter()
            .collect()
        )
    );

    Ok(())
}

}
===== END FILE: crates/turborepo-repository/src/change_mapper/package.rs =====

===== FILE: crates/turborepo-repository/src/change_mapper/mod.rs =====

//! Maps changed files to changed packages in a repository.
//! Used for both --filter and for isolated builds.

use std::{
    collections::{HashMap, HashSet},
    hash::Hash,
};

pub use package::{
    DefaultPackageChangeMapper, DefaultPackageChangeMapperWithLockfile, Error,
    GlobalDepsPackageChangeMapper, PackageChangeMapper, PackageMapping,
};
use tracing::debug;
use turbopath::{AbsoluteSystemPath, AnchoredSystemPathBuf};
use wax::Program;

use crate::package_graph::{
    ChangedPackagesError, ExternalDependencyChange, PackageGraph, PackageName, WorkspacePackage,
};

mod package;

const DEFAULT_GLOBAL_DEPS: &[&str] = ["package.json", "turbo.json", "turbo.jsonc"].as_slice();

// We may not be able to load the lockfile contents, but we
// still want to be able to express a generic change.
pub enum LockfileChange {
    Empty,
    ChangedPackages(HashSet<turborepo_lockfiles::Package>),
}

/// This describes the state of a change to a lockfile.
pub enum LockfileContents {
    /// We know the lockfile did not change
    Unchanged,
    /// We know the lockfile changed but don't have the file contents of the
    /// previous lockfile (i.e. git status, or perhaps a lockfile that was
    /// deleted or otherwise inaccessible with the information we have)
    UnknownChange,
    /// We know the lockfile changed and have the contents of the previous
    /// lockfile
    Changed(Vec<u8>),
}

#[derive(Debug, PartialEq, Eq, Hash, Clone)]
pub enum PackageInclusionReason {
    /// All the packages are invalidated
    All(AllPackageChangeReason),
    /// Root task was run
    RootTask { task: String },
    /// We conservatively assume that the root package is changed because
    /// the lockfile changed.
    ConservativeRootLockfileChanged,
    /// The lockfile changed and caused this package to be invalidated
    LockfileChanged {
        removed: Vec<turborepo_lockfiles::Package>,
        added: Vec<turborepo_lockfiles::Package>,
    },
    /// A transitive dependency of this package changed
    DependencyChanged { dependency: PackageName },
    /// A transitive dependent of this package changed
    DependentChanged { dependent: PackageName },
    /// A file contained in this package changed
    FileChanged { file: AnchoredSystemPathBuf },
    /// The filter selected a directory which contains this package
    InFilteredDirectory { directory: AnchoredSystemPathBuf },
    /// Package is automatically included because of the filter (or lack
    /// thereof)
    IncludedByFilter { filters: Vec<String> },
}

#[derive(Debug, PartialEq, Eq, Hash, Clone)]
pub enum AllPackageChangeReason {
    GlobalDepsChanged {
        file: AnchoredSystemPathBuf,
    },
    /// A file like package.json or turbo.json changed
    DefaultGlobalFileChanged {
        file: AnchoredSystemPathBuf,
    },
    LockfileChangeDetectionFailed,
    LockfileChangedWithoutDetails,
    RootInternalDepChanged {
        root_internal_dep: PackageName,
    },
    GitRefNotFound {
        from_ref: Option<String>,
        to_ref: Option<String>,
    },
}

pub fn merge_changed_packages<T: Hash + Eq>(
    changed_packages: &mut HashMap<T, PackageInclusionReason>,
    new_changes: impl IntoIterator<Item = (T, PackageInclusionReason)>,
) {
    for (package, reason) in new_changes {
        changed_packages.entry(package).or_insert(reason);
    }
}

#[derive(Debug, PartialEq, Eq)]
pub enum PackageChanges {
    All(AllPackageChangeReason),
    Some(HashMap<WorkspacePackage, PackageInclusionReason>),
}

pub struct ChangeMapper<'a, PD> {
    pkg_graph: &'a PackageGraph,

    ignore_patterns: Vec<String>,
    package_detector: PD,
}

impl<'a, PD: PackageChangeMapper> ChangeMapper<'a, PD> {
    pub fn new(
        pkg_graph: &'a PackageGraph,
        ignore_patterns: Vec<String>,
        package_detector: PD,
    ) -> Self {
        Self {
            pkg_graph,
            ignore_patterns,
            package_detector,
        }
    }

fn default_global_file_changed(
    changed_files: &HashSet<AnchoredSystemPathBuf>,
) -> Option<&AnchoredSystemPathBuf> {
    changed_files
        .iter()
        .find(|f| DEFAULT_GLOBAL_DEPS.iter().any(|dep| *dep == f.as_str()))
}

pub fn changed_packages(
    &self,
    changed_files: HashSet<AnchoredSystemPathBuf>,
    lockfile_contents: LockfileContents,
) -> Result<PackageChanges, ChangeMapError> {
    if let Some(file) = Self::default_global_file_changed(&changed_files) {
        debug!("global file changed");
        return Ok(PackageChanges::All(
            AllPackageChangeReason::DefaultGlobalFileChanged {
                file: file.to_owned(),
            },
        ));
    }

    // get filtered files and add the packages that contain them
    let filtered_changed_files = self.filter_ignored_files(changed_files.iter())?;

    // calculate lockfile_change here based on changed_files
    match self.get_changed_packages(filtered_changed_files.into_iter()) {
        PackageChanges::All(reason) => Ok(PackageChanges::All(reason)),

        PackageChanges::Some(mut changed_pkgs) => {
            match lockfile_contents {
                LockfileContents::Changed(previous_lockfile_contents) => {
                    // if we run into issues, don't error, just assume all packages have changed
                    let Ok(lockfile_changes) =
                        self.get_changed_packages_from_lockfile(&previous_lockfile_contents)
                    else {
                        debug!(
                            "unable to determine lockfile changes, assuming all packages \
                             changed"
                        );
                        return Ok(PackageChanges::All(
                            AllPackageChangeReason::LockfileChangeDetectionFailed,
                        ));
                    };
                    debug!(
                        "found {} packages changed by lockfile",
                        lockfile_changes.len()
                    );
                    merge_changed_packages(
                        &mut changed_pkgs,
                        lockfile_changes.into_iter().map(|change| {
                            let ExternalDependencyChange {
                                package,
                                added,
                                removed,
                            } = change;
                            (
                                package,
                                PackageInclusionReason::LockfileChanged { added, removed },
                            )
                        }),
                    );

                    Ok(PackageChanges::Some(changed_pkgs))
                }

                // We don't have the actual contents, so just invalidate everything
                LockfileContents::UnknownChange => {
                    // this can happen in a blobless checkout
                    debug!(
                        "we know the lockfile changed but we don't have the contents so we \
                         have to assume all packages changed and rebuild everything"
                    );
                    Ok(PackageChanges::All(
                        AllPackageChangeReason::LockfileChangedWithoutDetails,
                    ))
                }

                // We don't know if the lockfile changed or not, so we can't assume anything
                LockfileContents::Unchanged => {
                    debug!("the lockfile did not change");
                    Ok(PackageChanges::Some(changed_pkgs))
                }
            }
        }
    }
}

fn filter_ignored_files<'b>(
    &self,
    changed_files: impl Iterator<Item = &'b AnchoredSystemPathBuf> + 'b,
) -> Result<HashSet<&'b AnchoredSystemPathBuf>, ChangeMapError> {
    let matcher = wax::any(self.ignore_patterns.iter().map(|s| s.as_str()))?;
    Ok(changed_files
        .filter(move |f| !matcher.is_match(f.as_path()))
        .collect())
}

// note: this could probably be optimized by using a hashmap of package paths
fn get_changed_packages<'b>(
    &self,
    files: impl Iterator<Item = &'b AnchoredSystemPathBuf>,
) -> PackageChanges {
    let root_internal_deps = self.pkg_graph.root_internal_package_dependencies();
    let mut changed_packages = HashMap::new();
    for file in files {
        match self.package_detector.detect_package(file) {
            // Internal root dependency changed so global hash has changed
            PackageMapping::Package((pkg, _)) if root_internal_deps.contains(&pkg) => {
                debug!(
                    "{} changes root internal dependency: \"{}\"\nshortest path from root: \
                     {:?}",
                    file.to_string(),
                    pkg.name,
                    self.pkg_graph.root_internal_dependency_explanation(&pkg),
                );
                return PackageChanges::All(AllPackageChangeReason::RootInternalDepChanged {
                    root_internal_dep: pkg.name.clone(),
                });
            }
            PackageMapping::Package((pkg, reason)) => {
                debug!("{} changes \"{}\"", file.to_string(), pkg.name);
                changed_packages.insert(pkg, reason);
            }
            PackageMapping::All(reason) => {
                debug!("all packages changed due to {file:?}");
                return PackageChanges::All(reason);
            }
            PackageMapping::None => {}
        }
    }

    PackageChanges::Some(changed_packages)
}

fn get_changed_packages_from_lockfile(
    &self,
    lockfile_content: &[u8],
) -> Result<Vec<ExternalDependencyChange>, ChangeMapError> {
    // We pass None for yarnrc since we're only comparing lockfiles for changes,
    // not resolving package dependencies. Catalog resolution isn't needed here.
    let previous_lockfile = self.pkg_graph.package_manager().parse_lockfile(
        self.pkg_graph.root_package_json(),
        lockfile_content,
        None,
    )?;

    let additional_packages = self
        .pkg_graph
        .changed_packages_from_lockfile(previous_lockfile.as_ref())?;

    Ok(additional_packages)
}

pub fn lockfile_changed(
    turbo_root: &AbsoluteSystemPath,
    changed_files: &HashSet<AnchoredSystemPathBuf>,
    lockfile_path: &AbsoluteSystemPath,
) -> bool {
    let lockfile_path_relative = turbo_root
        .anchor(lockfile_path)
        .expect("lockfile should be in repo");

    changed_files.iter().any(|f| f == &lockfile_path_relative)
}

}

#[derive(thiserror::Error, Debug)]
pub enum ChangeMapError {
    #[error(transparent)]
    Wax(#[from] wax::BuildError),
    #[error("Package manager error: {0}")]
    PackageManager(#[from] crate::package_manager::Error),
    #[error("No lockfile")]
    NoLockfile,
    #[error("Lockfile error: {0}")]
    Lockfile(turborepo_lockfiles::Error),
}

impl From<ChangedPackagesError> for ChangeMapError {
    fn from(value: ChangedPackagesError) -> Self {
        match value {
            ChangedPackagesError::NoLockfile => Self::NoLockfile,
            ChangedPackagesError::Lockfile(e) => Self::Lockfile(e),
        }
    }
}

#[cfg(test)]
mod test {
use test_case::test_case;

use super::ChangeMapper;
use crate::change_mapper::package::DefaultPackageChangeMapper;

#[cfg(unix)]
#[test_case("/a/b/c", &["package.lock"], "/a/b/c/package.lock", true ; "simple")]
#[test_case("/a/b/c", &["a", "b", "c"], "/a/b/c/package.lock", false ; "lockfile unchanged")]
fn test_lockfile_changed(
    turbo_root: &str,
    changed_files: &[&str],
    lockfile_path: &str,
    expected: bool,
) {
    let turbo_root = turbopath::AbsoluteSystemPathBuf::new(turbo_root).unwrap();
    let lockfile_path = turbopath::AbsoluteSystemPathBuf::new(lockfile_path).unwrap();
    let changed_files = changed_files
        .iter()
        .map(|s| turbopath::AnchoredSystemPathBuf::from_raw(s).unwrap())
        .collect();
    let changes = ChangeMapper::<DefaultPackageChangeMapper>::lockfile_changed(
        &turbo_root,
        &changed_files,
        &lockfile_path,
    );

    assert_eq!(changes, expected);
}

#[cfg(windows)]
#[test_case("C:\\\\a\\b\\c", &["package.lock"], "C:\\\\a\\b\\c\\package.lock", true ; "simple")]
#[test_case("C:\\\\a\\b\\c", &["a", "b", "c"],  "C:\\\\a\\b\\c\\package.lock", false ; "lockfile unchanged")]
fn test_lockfile_changed(
    turbo_root: &str,
    changed_files: &[&str],
    lockfile_path: &str,
    expected: bool,
) {
    let turbo_root = turbopath::AbsoluteSystemPathBuf::new(turbo_root).unwrap();
    let lockfile_path = turbopath::AbsoluteSystemPathBuf::new(lockfile_path).unwrap();
    let changed_files = changed_files
        .iter()
        .map(|s| turbopath::AnchoredSystemPathBuf::from_raw(s).unwrap())
        .collect();
    let changes = ChangeMapper::<DefaultPackageChangeMapper>::lockfile_changed(
        &turbo_root,
        &changed_files,
        &lockfile_path,
    );

    assert_eq!(changes, expected);
}

} ===== END FILE: crates/turborepo-repository/src/change_mapper/mod.rs =====

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Now I have all the research needed. Let me compile the comprehensive performance analysis.

Current State

Turborepo's Rust core has already made strong strides in parallelism — recent commits (e.g., b3c0f46da8 parallelizing task hash computation via rayon topological waves, db01cb4490 fast-pathing shallow wildcard globs) show the team actively pursuing perf. However, significant wins remain across 10 distinct areas spanning the full turbo run lifecycle: startup, graph construction, hashing, task execution, and caching. The codebase uses rayon extensively but has pockets of serial I/O, lock contention, redundant regex compilation, and avoidable allocations in hot paths.

Profiling infrastructure exists: turbo run build --profile output.json generates Chromium Trace Event Format JSON viewable in chrome://tracing. Tracing spans (#[tracing::instrument]) are wired throughout the critical path. No criterion/divan benchmarks exist yet — all perf validation is done via integration tests in turborepo-tests/integration/ and real-monorepo profiling.


Required Changes

Tier 1 — Quick Wins (low risk, high leverage, <1 day each)

1. Parallelize turbo.json stat calls in package discovery

File: crates/turborepo-repository/src/discovery.rs:157-171

The LocalPackageDiscovery::discover_packages() method uses iter().then() which sequentially awaits each tokio::fs::try_exists() call. For N packages, this is O(N) serial filesystem I/O.

// BEFORE (sequential):
iter(package_paths)
    .then(|path| async move { /* await try_exists */ })
    .collect()

// AFTER (concurrent):
iter(package_paths)
    .map(|path| async move { /* try_exists */ })
    .buffer_unordered(128)
    .collect()

Expected gain: 5-10x for 50+ packages (serial stat latency → concurrent I/O). Verify: turbo run build --profile → check discover_packages span shrinks. Run existing tests in crates/turborepo-repository/src/discovery.rs (fallback_tests, caching_tests).

2. Pre-compile task env regex patterns

File: crates/turborepo-env/src/lib.rs:273-333

wildcard_map_from_wildcards() recompiles regex via RegexBuilder::new().build() on every call — once per task. The CompiledWildcards pattern already exists for builtin pass-through env and tests already validate CompiledWildcards matches runtime compilation (test_compiled_wildcards_matches_from_wildcards, line 663).

// Extend CompiledWildcards to also cover task-level env patterns
// Compile once during task hasher setup, reuse for every task

Expected gain: Eliminates O(tasks) regex compilations. In a 100-task monorepo, ~100-1000ms saved. Verify: Existing test test_pass_through_env_compiled_matches_original (line 633) validates correctness. Profile span calculate_task_hash should shrink.
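A minimal sketch of the memoization shape, using std only. The real code compiles a regex per wildcard; here a hypothetical prefix matcher stands in for the compiled artifact, since the point is compiling each distinct pattern once and reusing it across all tasks:

```rust
use std::collections::HashMap;

// Hypothetical stand-in for a compiled matcher; the real code builds a
// regex via RegexBuilder, but the caching pattern is identical.
#[derive(Clone)]
struct CompiledWildcard {
    prefix: String, // "TURBO_*" -> prefix "TURBO_"
}

impl CompiledWildcard {
    fn compile(pattern: &str) -> Self {
        Self { prefix: pattern.trim_end_matches('*').to_string() }
    }
    fn matches(&self, name: &str) -> bool {
        name.starts_with(&self.prefix)
    }
}

// Compile each distinct pattern once during hasher setup, reuse per task.
struct WildcardCache {
    compiled: HashMap<String, CompiledWildcard>,
}

impl WildcardCache {
    fn new() -> Self {
        Self { compiled: HashMap::new() }
    }
    fn get(&mut self, pattern: &str) -> &CompiledWildcard {
        self.compiled
            .entry(pattern.to_string())
            .or_insert_with(|| CompiledWildcard::compile(pattern))
    }
}

fn main() {
    let mut cache = WildcardCache::new();
    // 100 tasks sharing one pattern trigger exactly one compilation.
    for _ in 0..100 {
        assert!(cache.get("TURBO_*").matches("TURBO_TOKEN"));
    }
    assert_eq!(cache.compiled.len(), 1);
}
```

The `WildcardCache` and `CompiledWildcard` names are illustrative; in-tree the cache would live alongside the existing CompiledWildcards machinery.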

3. Multi-threaded zstd compression for cache archives

File: crates/turborepo-cache/src/cache_archive/create.rs:151

Cache compression uses zstd::Encoder::new(writer, 0) — single-threaded, level 0 (minimal compression). zstd supports multi-threaded encoding natively.

// BEFORE:
let zw = zstd::Encoder::new(writer, 0)?.auto_finish();

// AFTER:
let mut encoder = zstd::Encoder::new(writer, 3)?;
encoder.multithread(num_cpus::get() as u32)?;
let zw = encoder.auto_finish();

Expected gain: 30-40% smaller caches (better remote cache efficiency), parallel compression on cache-miss tasks. Verify: Existing tests test_compression (line 524), test_cachewriter_finish_renames_file (line 625). Measure archive sizes before/after.


Tier 2 — Medium Effort (moderate risk, significant impact, 1-3 days each)

4. Split TaskHashTracker RwLock into per-field DashMaps

File: crates/turborepo-task-hash/src/lib.rs:225-226

A single RwLock<TaskHashTrackerState> guards five independent HashMaps. Write-lock acquisition for any field (e.g., inserting a hash) blocks all readers of all fields across parallel rayon task-hash computation.

// BEFORE:
struct TaskHashTracker { state: Arc<RwLock<TaskHashTrackerState>> }

// AFTER:
struct TaskHashTracker {
    hashes: DashMap<TaskId<'static>, Arc<str>>,
    env_vars: DashMap<TaskId<'static>, DetailedMap>,
    frameworks: DashMap<TaskId<'static>, Option<String>>,
    outputs: DashMap<TaskId<'static>, Vec<...>>,
    cache_status: DashMap<TaskId<'static>, CacheStatus>,
}

Expected gain: Removes write-lock serialization across topological waves. Tests at line 762 (test_hash_tracker_concurrent_read_write) already stress-test concurrent access. Verify: All existing tracker tests pass. Profile precompute_task_hashes span for reduced contention.
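A std-only sketch of the decoupling, assuming hypothetical field names. DashMap additionally shards within each map, but the core contention win comes from giving each independent field its own lock so a hash insert no longer blocks readers of the other maps:

```rust
use std::collections::HashMap;
use std::sync::RwLock;

// Per-field locks instead of one RwLock over a struct of maps.
struct SplitTracker {
    hashes: RwLock<HashMap<String, String>>,
    frameworks: RwLock<HashMap<String, String>>,
}

impl SplitTracker {
    fn new() -> Self {
        Self {
            hashes: RwLock::new(HashMap::new()),
            frameworks: RwLock::new(HashMap::new()),
        }
    }

    fn insert_hash(&self, task: &str, hash: &str) {
        // Holds only the `hashes` write lock.
        self.hashes.write().unwrap().insert(task.into(), hash.into());
    }

    fn framework(&self, task: &str) -> Option<String> {
        // Proceeds even while another thread is writing `hashes`.
        self.frameworks.read().unwrap().get(task).cloned()
    }
}

fn main() {
    let tracker = SplitTracker::new();
    tracker.insert_hash("web#build", "abc123");
    assert_eq!(tracker.framework("web#build"), None);
    assert_eq!(
        tracker.hashes.read().unwrap().get("web#build").map(String::as_str),
        Some("abc123")
    );
}
```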

5. Arc-wrap EnvironmentVariableMap in precomputed hash results

File: crates/turborepo-lib/src/task_graph/visitor/mod.rs:388

Every task clones a full EnvironmentVariableMap (HashMap<String, String>) from the precomputed map. With 100+ env vars per task × 100 tasks, this is substantial allocation.

// BEFORE (line 388):
HashMap<TaskId<'static>, (String, EnvironmentVariableMap)>
// .clone() copies entire HashMap per task

// AFTER:
HashMap<TaskId<'static>, (String, Arc<EnvironmentVariableMap>)>
// Arc::clone is 8 bytes (atomic increment)
// populate_env() uses Arc::make_mut() for copy-on-write

Expected gain: ~100MB allocation reduction per run in large monorepos. Verify: Integration tests in turborepo-tests/integration/tests/run/. Profile memory with heaptrack or DHAT.
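The copy-on-write behavior can be sketched with std alone; `EnvMap` stands in for EnvironmentVariableMap and `populate_env` is a hypothetical simplification of the visitor's mutation path. Arc::make_mut clones the underlying map only when the Arc is actually shared, so tasks that never mutate their env pay nothing:

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Stand-in for EnvironmentVariableMap; the real type wraps a HashMap too.
type EnvMap = HashMap<String, String>;

// Clone-on-write mutation: deep copy happens only if the map is shared.
fn populate_env(env: &mut Arc<EnvMap>, key: &str, value: &str) {
    Arc::make_mut(env).insert(key.to_string(), value.to_string());
}

fn main() {
    let precomputed: Arc<EnvMap> = Arc::new(
        [("PATH".to_string(), "/usr/bin".to_string())]
            .into_iter()
            .collect(),
    );
    // Handing the precomputed map to a task is a pointer copy...
    let mut task_env = Arc::clone(&precomputed);
    // ...and the deep clone happens only when the task mutates its copy.
    populate_env(&mut task_env, "TURBO_HASH", "abc123");
    assert_eq!(task_env.get("TURBO_HASH").map(String::as_str), Some("abc123"));
    // The shared precomputed map is untouched.
    assert!(precomputed.get("TURBO_HASH").is_none());
}
```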

6. Parallelize turbo.json config loading in EngineBuilder

File: crates/turborepo-engine/src/builder.rs:156-166

The BFS task graph construction loads turbo.json configs serially per workspace via TaskInheritanceResolver::new(turbo_json_loader).resolve(workspace).

// BEFORE:
for workspace in self.workspaces.iter() {
    let tasks = TaskInheritanceResolver::new(turbo_json_loader).resolve(workspace)?;
}

// AFTER:
use rayon::prelude::*;
let all_tasks: Vec<_> = self.workspaces.par_iter()
    .map(|ws| TaskInheritanceResolver::new(turbo_json_loader).resolve(ws))
    .collect::<Result<Vec<_>, _>>()?;

Assumption: turbo_json_loader is Send + Sync or can be made so. If it holds interior mutability (cache), may need DashMap or per-thread caching. Expected gain: 2-5x for 50+ workspaces during engine construction. Verify: Profile EngineBuilder::build span. Run turborepo-tests/integration/tests/run/ suite.
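The fan-out shape can be illustrated with scoped threads from std (in-tree this would be rayon's par_iter, as above). The `resolve` function is a hypothetical stand-in for TaskInheritanceResolver::resolve; the sketch assumes per-workspace resolution is independent and the shared state is Sync:

```rust
use std::thread;

// Stand-in for per-workspace turbo.json resolution.
fn resolve(workspace: &str) -> Result<String, String> {
    Ok(format!("{workspace}:build"))
}

// Resolve every workspace concurrently, propagating the first error.
fn resolve_all(workspaces: &[&str]) -> Result<Vec<String>, String> {
    thread::scope(|s| {
        let handles: Vec<_> = workspaces
            .iter()
            .map(|ws| s.spawn(move || resolve(ws)))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}

fn main() {
    let tasks = resolve_all(&["web", "docs", "ui"]).unwrap();
    assert_eq!(tasks, vec!["web:build", "docs:build", "ui:build"]);
}
```

Note that results are collected in spawn order, so output ordering stays deterministic even though the work runs concurrently.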

7. Size DashMap and deduplicate DFS in lockfile transitive closures

File: crates/turborepo-lockfiles/src/lib.rs:130-149

all_transitive_closures() creates DashMap::new() with no capacity hint (causes rehashing), and redundant DFS walks across workspaces re-traverse shared dependency subgraphs.

// BEFORE:
let resolve_cache: ResolveCache = DashMap::new();

// AFTER:
let total_deps: usize = workspaces.values().map(|d| d.len()).sum();
let resolve_cache: ResolveCache = DashMap::with_capacity(total_deps * 2);
// Plus: memoize transitive closure results per (package, version) tuple

Expected gain: Fewer rehashes, fewer redundant DFS walks. Impact scales with dependency graph size. Verify: Run lockfile-specific integration tests. Compare all_transitive_closures profile span before/after.
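A std-only sketch of the memoized-closure idea on a toy dependency graph (the real code keys on (package, version) and uses DashMap for concurrent access; the names here are illustrative). Shared subgraphs are traversed once and every later workspace reuses the cached result:

```rust
use std::collections::{HashMap, HashSet};

// Memoized DFS: each package's transitive closure is computed once and
// shared by every workspace that depends on it.
fn closure<'a>(
    pkg: &'a str,
    deps: &HashMap<&'a str, Vec<&'a str>>,
    cache: &mut HashMap<&'a str, HashSet<&'a str>>,
) -> HashSet<&'a str> {
    if let Some(hit) = cache.get(pkg) {
        return hit.clone();
    }
    let mut out: HashSet<&'a str> = HashSet::new();
    for dep in deps.get(pkg).into_iter().flatten() {
        out.insert(*dep);
        out.extend(closure(dep, deps, cache));
    }
    cache.insert(pkg, out.clone());
    out
}

fn main() {
    let deps: HashMap<&str, Vec<&str>> =
        [("a", vec!["b"]), ("b", vec!["c"]), ("c", vec![])]
            .into_iter()
            .collect();
    // Capacity hint avoids rehashing, mirroring DashMap::with_capacity.
    let mut cache = HashMap::with_capacity(deps.len() * 2);
    let expected: HashSet<&str> = ["b", "c"].into_iter().collect();
    assert_eq!(closure("a", &deps, &mut cache), expected);
    // "b"'s closure was filled in as a side effect and is reused for free.
    assert!(cache.contains_key("b"));
}
```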


Tier 3 — Larger Efforts (higher risk, targeted impact, 3-5 days each)

8. Compiled glob pattern caching

File: crates/turborepo-globwalk/src/lib.rs:468-479

Every globwalk() call recompiles patterns from scratch. In a 100-package monorepo, the same dist/** pattern gets regex-compiled 100+ times across save_outputs and file-hashing paths.

// Add an LRU cache keyed on (base_path_hash, include_patterns, exclude_patterns)
// Return Arc<CompiledGlobs> for zero-copy reuse
static GLOB_CACHE: Lazy<Mutex<LruCache<u64, Arc<CompiledGlobs>>>> = ...;

Expected gain: Eliminates O(packages × tasks) regex compilations. Verify: Existing glob walk tests (~50 tests in file). Profile globwalk spans.
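A minimal std-only sketch of the cache shape. `CompiledGlobs` is a hypothetical placeholder for compiled pattern state, and FIFO eviction stands in for a proper LRU crate; the key property shown is Arc-based zero-copy reuse keyed on a hash of (base path, includes, excludes):

```rust
use std::collections::{HashMap, VecDeque};
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};
use std::sync::Arc;

// Hypothetical stand-in for compiled glob state.
struct CompiledGlobs {
    pattern: String,
}

// Capacity-bounded cache; an LRU crate would be used in-tree.
struct GlobCache {
    map: HashMap<u64, Arc<CompiledGlobs>>,
    order: VecDeque<u64>,
    capacity: usize,
}

impl GlobCache {
    fn new(capacity: usize) -> Self {
        Self { map: HashMap::new(), order: VecDeque::new(), capacity }
    }

    fn key(base: &str, includes: &[&str], excludes: &[&str]) -> u64 {
        let mut h = DefaultHasher::new();
        (base, includes, excludes).hash(&mut h);
        h.finish()
    }

    fn get_or_compile(&mut self, base: &str, inc: &[&str], exc: &[&str]) -> Arc<CompiledGlobs> {
        let k = Self::key(base, inc, exc);
        if let Some(hit) = self.map.get(&k) {
            return Arc::clone(hit); // zero-copy reuse of compiled patterns
        }
        if self.map.len() >= self.capacity {
            if let Some(old) = self.order.pop_front() {
                self.map.remove(&old);
            }
        }
        let compiled = Arc::new(CompiledGlobs { pattern: inc.join(",") });
        self.map.insert(k, Arc::clone(&compiled));
        self.order.push_back(k);
        compiled
    }
}

fn main() {
    let mut cache = GlobCache::new(16);
    let a = cache.get_or_compile("packages/web", &["dist/**"], &[]);
    let b = cache.get_or_compile("packages/web", &["dist/**"], &[]);
    assert!(Arc::ptr_eq(&a, &b)); // second lookup skips recompilation
    assert_eq!(a.pattern, "dist/**");
    assert_eq!(cache.map.len(), 1);
}
```

Including the base path in the key is what guards against the stale-hit risk called out in the ripple effects: the same relative pattern rooted at a different package must compile to a different entry.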

9. O(1) file-to-package detection via path prefix index

File: crates/turborepo-repository/src/change_mapper/package.rs:62-87

detect_package() does O(n) linear scan through all packages for each changed file. For --affected and watch mode with 500 packages × 1000 changed files, this is O(500K) comparisons.

// BEFORE: linear scan
for (name, entry) in self.pkg_dep_graph.packages() {
    if Self::is_file_in_package(file, package_path) { ... }
}

// AFTER: sorted prefix index with binary search
// Build sorted Vec<(AnchoredSystemPath, PackageName)> at construction
// Binary search for longest matching prefix

Expected gain: O(log n) per file instead of O(n). Critical for --affected and watch mode in large monorepos. Verify: Existing test test_different_package_detectors (line 225). Integration tests for --affected.
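A sketch of the sorted-index lookup with std only (names hypothetical; paths are plain strings here rather than AnchoredSystemPath). Binary search finds the insertion point, then a backward scan picks the longest package path that is a directory prefix of the file; a production version might use a trie to bound the backward scan:

```rust
// Sorted prefix index built once at construction.
struct PackageIndex {
    // (package path, package name), sorted by path.
    entries: Vec<(String, String)>,
}

impl PackageIndex {
    fn new(mut entries: Vec<(String, String)>) -> Self {
        entries.sort();
        Self { entries }
    }

    fn detect_package(&self, file: &str) -> Option<&str> {
        // partition_point = index of first entry ordered after `file`;
        // only entries before it can be prefixes.
        let i = self.entries.partition_point(|(path, _)| path.as_str() <= file);
        self.entries[..i]
            .iter()
            .rev()
            .find(|(path, _)| {
                // Require a '/' boundary so "packages/web" does not
                // claim files in "packages/web-ui".
                file.strip_prefix(path.as_str())
                    .is_some_and(|rest| rest.starts_with('/'))
            })
            .map(|(_, name)| name.as_str())
    }
}

fn main() {
    let idx = PackageIndex::new(vec![
        ("packages/web".into(), "web".into()),
        ("packages/web-ui".into(), "web-ui".into()),
        ("packages/ui".into(), "ui".into()),
    ]);
    assert_eq!(idx.detect_package("packages/web/src/index.ts"), Some("web"));
    assert_eq!(idx.detect_package("packages/web-ui/button.tsx"), Some("web-ui"));
    assert_eq!(idx.detect_package("README.md"), None);
}
```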

10. Eliminate double directory walk in manual SCM hashing

File: crates/turborepo-scm/src/manual.rs:80-202

When include_default_files=true, get_package_file_hashes_without_git() walks the entire directory tree twice — once for explicit includes, once for defaults with git_ignore.

// Merge into single walk: process gitignore rules inline
// Track which files matched explicit includes during the walk
// Apply default-file logic for remaining files in the same pass

Expected gain: 2x speedup for manual hashing path (non-git repos, fallback). Verify: Tests at lines 469-556 validate deduplication and exclusion behavior.
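The single-pass classification can be sketched abstractly with std only. The predicates stand in for glob and gitignore checks, and the sketch assumes (as the two-walk behavior implies) that explicitly included files bypass gitignore while default files respect it:

```rust
use std::collections::HashMap;

// One pass over the walk: each path is examined once and routed by the
// first rule set it matches, instead of two full directory traversals.
fn classify_single_walk<'a>(
    walk: impl Iterator<Item = &'a str>,
    matches_include: impl Fn(&str) -> bool, // stand-in for explicit glob match
    is_ignored: impl Fn(&str) -> bool,      // stand-in for gitignore match
) -> HashMap<&'a str, &'static str> {
    let mut files = HashMap::new();
    for path in walk {
        if matches_include(path) {
            // Explicit includes bypass gitignore, as in the two-walk version.
            files.insert(path, "explicit");
        } else if !is_ignored(path) {
            files.insert(path, "default");
        }
    }
    files
}

fn main() {
    let walk = ["src/a.ts", "dist/out.js", "node_modules/x.js"];
    let files = classify_single_walk(
        walk.into_iter(),
        |p| p.starts_with("dist/"),
        |p| p.starts_with("node_modules/"),
    );
    assert_eq!(files.get("src/a.ts"), Some(&"default"));
    assert_eq!(files.get("dist/out.js"), Some(&"explicit"));
    assert_eq!(files.get("node_modules/x.js"), None);
}
```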


Ripple Effects

  1. DashMap in TaskHashTracker (#4) — Changes the public API of TaskHashTracker. All callers (visitor/mod.rs, run_summary/tracker.rs) need updated access patterns. DashMap's Ref<> return type differs from RwLock's guard.

  2. Arc-wrapped EnvironmentVariableMap (#5)populate_env() in exec.rs:150-155 currently mutates the map directly. With Arc wrapping, it needs Arc::make_mut() or a local clone. The command_factory.command() call also receives &execution_env.

  3. Parallelized turbo.json loading (#6) — The turbo_json_loader must be Send + Sync. If it uses internal caching via HashMap, it needs conversion to DashMap or wrapped in Mutex. The turbo_json_chain_cache in builder.rs:259 may need similar treatment.

  4. Zstd compression level change (#3) — Higher compression level increases CPU time per cache write. Level 3 is a ~10% CPU increase for ~30% size reduction. Multi-threading offsets this. Remote cache transfer times should decrease.

  5. Glob pattern caching (#8) — Cached patterns must be invalidated if base_path changes or if patterns are workspace-relative. Need careful key design to avoid stale cache hits.


What This Unlocks

  • Combined Tier 1 wins (items 1-3): Estimated 15-25% reduction in cold-start turbo run time for 50+ package monorepos, plus smaller cache artifacts improving remote cache throughput.

  • Combined Tier 1+2 wins (items 1-7): Estimated 30-50% reduction in turbo run overhead (non-task-execution time) for large monorepos. The engine construction, hashing, and caching phases all get faster.

  • Full implementation (all 10): Turborepo's overhead becomes nearly constant regardless of monorepo size — startup, graph construction, hashing, and caching all scale sub-linearly with package count.

  • Profiling workflow: All changes can be measured with turbo run build --profile output.json and Chrome DevTools tracing. Each optimization targets a named tracing span, making A/B comparison straightforward.


Effort & Risk

| # | Change | Effort | Risk | Validation Strategy |
|---|--------|--------|------|---------------------|
| 1 | Parallel turbo.json stat | 2-4 hrs | Low | Existing unit tests + --profile span comparison |
| 2 | Pre-compile task env regex | 4-8 hrs | Low | Existing correctness tests validate compiled=runtime |
| 3 | Multi-threaded zstd | 2-4 hrs | Low | Existing archive round-trip tests + size comparison |
| 4 | Split TaskHashTracker RwLock | 1-2 days | Medium | Concurrent stress tests already exist (lines 762-799) |
| 5 | Arc-wrap EnvironmentVariableMap | 1 day | Medium | Integration tests + memory profiling |
| 6 | Parallel turbo.json loading | 2-3 days | Medium-High | Depends on loader thread-safety; integration tests |
| 7 | Size DashMap + dedup lockfile DFS | 1-2 days | Medium | Lockfile integration tests |
| 8 | Glob pattern caching | 2-3 days | Medium | 50+ existing glob tests; cache invalidation needs care |
| 9 | Path prefix index for change mapper | 1-2 days | Low-Medium | Existing detector tests + --affected integration tests |
| 10 | Single-walk manual SCM hashing | 1-2 days | Medium | Existing dedup/exclusion tests (lines 469-556) |

Assumptions:

  • Performance estimates assume 50-200 package monorepos (the sweet spot for these optimizations)
  • turbo_json_loader in builder.rs may need thread-safety work before #6 is viable
  • zstd multi-threading (#3) assumes the zstd crate's zstdmt feature is enabled in the dependency
  • All estimates are for the Rust-side overhead only — actual task execution time (npm scripts, webpack, etc.) is orthogonal

Recommended order: Start with Tier 1 (items 1-3) for immediate, low-risk wins. Then tackle #4 and #5 which directly improve the parallel task hash computation recently landed in b3c0f46da8. Items #6-10 can be prioritized based on profiling data from real monorepos.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975) db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972) d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974) e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973) 4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970) 75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971) b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969) 69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/.rs, crates/turborepo-lockfiles/src/pnpm/.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. Key findings: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs Maturity: stable Relevance: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. Key findings: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.
  • task-graph-visitor-allocations Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs, crates/turborepo-lib/src/task_graph/visitor/exec.rs Maturity: stable Relevance: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run. Key findings: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run.
  • change-mapper-linear-package-detection Files: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs Maturity: stable Relevance: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages Key findings: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages
  • discovery-sequential-turbo-json-stat Files: crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run. Key findings: I confirmed the key finding by reading both files. The strongest candidate is in discovery.rs: At line 157, LocalPackageDiscovery::discover_packages uses tokio_stream::iter(...).then(...) which processes each tokio::fs::try_exists() call sequentially; .then() awaits each future before starting the next. For independent filesystem stat calls across hundreds of packages, this serializes I/O that should be concurrent. The file even has the tokio async machinery in place — it just needs .map() + .buffer_unordered(N) instead of .then().

Exploration

AREA: discovery-sequential-turbo-json-stat FILES: crates/turborepo-repository/src/discovery.rs MATURITY: stable RELEVANCE: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run. NOTES: (unspecified)

Raw Exploration Notes: I confirmed the key finding by reading both files. The strongest candidate is in discovery.rs:

At line 157, LocalPackageDiscovery::discover_packages uses tokio_stream::iter(...).then(...) which processes each tokio::fs::try_exists() call sequentially; .then() awaits each future before starting the next. For independent filesystem stat calls across hundreds of packages, this serializes I/O that should be concurrent.

The file even has the tokio async machinery in place — it just needs .map() + .buffer_unordered(N) instead of .then().

AREA: discovery-sequential-turbo-json-stat FILES: crates/turborepo-repository/src/discovery.rs MATURITY: stable RELEVANCE: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run.

Selected Code Context

===== FILE: crates/turborepo-repository/src/discovery.rs =====
//! turborepo-discovery
//!
//! This package contains a number of strategies for discovering various things
//! about a workspace. These traits come with a basic implementation and some
//! adaptors that can be used to compose them together.
//!
//! This powers various intents such as 'query the daemon for this data, or
//! fallback to local discovery if the daemon is not available'. Eventually,
//! these strategies will implement some sort of monad-style composition so that
//! we can track areas of run that are performing sub-optimally.

use tokio::time::error::Elapsed;
use tokio_stream::{StreamExt, iter};
use turbopath::AbsoluteSystemPathBuf;

use crate::{
    package_json::PackageJson,
    package_manager::{self, PackageManager},
};

#[derive(Clone, PartialEq, Eq, Debug)]
pub struct WorkspaceData {
    pub package_json: AbsoluteSystemPathBuf,
    pub turbo_json: Option<AbsoluteSystemPathBuf>,
}

#[derive(Debug, Clone)]
pub struct DiscoveryResponse {
    pub workspaces: Vec<WorkspaceData>,
    pub package_manager: PackageManager,
}

#[derive(thiserror::Error, Debug)]
pub enum Error {
    #[error("Discovery unavailable")]
    Unavailable,
    #[error("Discovery failed: {0}")]
    Failed(Box<dyn std::error::Error + Send + Sync>),
}

/// Defines a strategy for discovering packages on the filesystem.
pub trait PackageDiscovery {
// desugar to assert that the future is Send
/// Discover packages on the filesystem. In the event that this would block,
/// some strategies may return Err(Error::Unavailable). If you want to
/// wait, use discover_packages_blocking which will wait for the result.
fn discover_packages(
    &self,
) -> impl std::future::Future<Output = Result<DiscoveryResponse, Error>> + Send;

/// Discover packages on the filesystem, blocking until the result is ready.
fn discover_packages_blocking(
    &self,
) -> impl std::future::Future<Output = Result<DiscoveryResponse, Error>> + Send;

}

/// We want to allow for lazily generating the PackageDiscovery implementation
/// to prevent unnecessary work. This trait allows us to do that.
///
/// Note: there is a blanket implementation for everything that implements
/// PackageDiscovery
pub trait PackageDiscoveryBuilder {
type Output: PackageDiscovery;
type Error: std::error::Error;

fn build(self) -> Result<Self::Output, Self::Error>;

}

pub struct LocalPackageDiscovery {
    repo_root: AbsoluteSystemPathBuf,
    package_manager: PackageManager,
}

impl LocalPackageDiscovery {
pub fn new(repo_root: AbsoluteSystemPathBuf, package_manager: PackageManager) -> Self {
    Self {
        repo_root,
        package_manager,
    }
}
}

pub struct LocalPackageDiscoveryBuilder {
    repo_root: AbsoluteSystemPathBuf,
    package_manager: Option<PackageManager>,
    package_json: Option<PackageJson>,
    allow_missing_package_manager: bool,
}

impl LocalPackageDiscoveryBuilder {
pub fn new(
    repo_root: AbsoluteSystemPathBuf,
    package_manager: Option<PackageManager>,
    package_json: Option<PackageJson>,
) -> Self {
    Self {
        repo_root,
        package_manager,
        package_json,
        allow_missing_package_manager: false,
    }
}

pub fn with_allow_no_package_manager(&mut self, allow_missing_package_manager: bool) {
    self.allow_missing_package_manager = allow_missing_package_manager;
}

pub fn with_package_manager(&mut self, package_manager: Option<PackageManager>) -> &mut Self {
    self.package_manager = package_manager;
    self
}

}

impl PackageDiscoveryBuilder for LocalPackageDiscoveryBuilder {
type Output = LocalPackageDiscovery;
type Error = package_manager::Error;

fn build(self) -> Result<Self::Output, Self::Error> {
    let package_manager = match self.package_manager {
        Some(pm) => pm,
        None => {
            let package_json = self.package_json.map(Ok).unwrap_or_else(|| {
                PackageJson::load(&self.repo_root.join_component("package.json"))
            })?;
            if self.allow_missing_package_manager {
                PackageManager::read_or_detect_package_manager(&package_json, &self.repo_root)?
            } else {
                PackageManager::get_package_manager(&self.repo_root, &package_json)?
            }
        }
    };

    Ok(LocalPackageDiscovery {
        repo_root: self.repo_root,
        package_manager,
    })
}

}

impl PackageDiscovery for LocalPackageDiscovery {
async fn discover_packages(&self) -> Result<DiscoveryResponse, Error> {
    tracing::debug!("discovering packages using local strategy");

    let package_paths = match self.package_manager.get_package_jsons(&self.repo_root) {
        Ok(packages) => packages,
        // if there is not a list of workspaces, it is not necessarily an error. just report no
        // workspaces
        Err(package_manager::Error::Workspace(_)) => {
            return Ok(DiscoveryResponse {
                workspaces: vec![],
                package_manager: self.package_manager.clone(),
            });
        }
        Err(e) => return Err(Error::Failed(Box::new(e))),
    };

    iter(package_paths)
        .then(|path| async move {
            let potential_turbo = path
                .parent()
                .expect("non-root")
                .join_component("turbo.json");
            let potential_turbo_exists = tokio::fs::try_exists(potential_turbo.as_path()).await;

            Ok(WorkspaceData {
                package_json: path,
                turbo_json: potential_turbo_exists
                    .unwrap_or_default()
                    .then_some(potential_turbo),
            })
        })
        .collect::<Result<Vec<_>, _>>()
        .await
        .map(|workspaces| DiscoveryResponse {
            workspaces,
            package_manager: self.package_manager.clone(),
        })
}

// there is no notion of waiting for upstream deps here, so this is the same as
// the non-blocking
async fn discover_packages_blocking(&self) -> Result<DiscoveryResponse, Error> {
    self.discover_packages().await
}

}

/// Attempts to run the primary strategy for an amount of time
/// specified by timeout before falling back to fallback
pub struct FallbackPackageDiscovery<P: PackageDiscovery + Send + Sync, F> {
    primary: P,
    fallback: F,
    timeout: std::time::Duration,
}

impl<P: PackageDiscovery + Send + Sync, F: PackageDiscovery + Send + Sync>
    FallbackPackageDiscovery<P, F>
{
    pub fn new(primary: P, fallback: F, timeout: std::time::Duration) -> Self {
        Self {
            primary,
            fallback,
            timeout,
        }
    }
}

impl<T: PackageDiscovery> PackageDiscoveryBuilder for T {
type Output = T;
type Error = std::convert::Infallible;

fn build(self) -> Result<Self::Output, Self::Error> {
    Ok(self)
}

}

impl<A: PackageDiscovery + Send + Sync, B: PackageDiscovery + Send + Sync> PackageDiscovery
    for FallbackPackageDiscovery<A, B>
{
async fn discover_packages(&self) -> Result<DiscoveryResponse, Error> {
    tracing::debug!("discovering packages using fallback strategy");

    tracing::debug!("attempting primary strategy");
    match tokio::time::timeout(self.timeout, self.primary.discover_packages()).await {
        Ok(Ok(packages)) => Ok(packages),
        Ok(Err(err1)) => {
            tracing::debug!("primary strategy failed, attempting fallback strategy");
            match self.fallback.discover_packages().await {
                Ok(packages) => Ok(packages),
                // if the backup is unavailable, return the original error
                Err(Error::Unavailable) => Err(err1),
                Err(err2) => Err(err2),
            }
        }
        Err(_) => {
            tracing::debug!("primary strategy timed out, attempting fallback strategy");
            self.fallback.discover_packages().await
        }
    }
}

async fn discover_packages_blocking(&self) -> Result<DiscoveryResponse, Error> {
    tracing::debug!("discovering packages using fallback strategy");

    tracing::debug!("attempting primary strategy");
    match tokio::time::timeout(self.timeout, self.primary.discover_packages_blocking()).await {
        Ok(Ok(packages)) => Ok(packages),
        Ok(Err(err1)) => {
            tracing::debug!("primary strategy failed, attempting fallback strategy");
            match self.fallback.discover_packages_blocking().await {
                Ok(packages) => Ok(packages),
                // if the backup is unavailable, return the original error
                Err(Error::Unavailable) => Err(err1),
                Err(err2) => Err(err2),
            }
        }
        Err(Elapsed { .. }) => {
            tracing::debug!("primary strategy timed out, attempting fallback strategy");
            self.fallback.discover_packages_blocking().await
        }
    }
}

}

pub struct CachingPackageDiscovery<P: PackageDiscovery> {
    primary: P,
    data: async_once_cell::OnceCell<DiscoveryResponse>,
}

impl<P: PackageDiscovery> CachingPackageDiscovery<P> {
    pub fn new(primary: P) -> Self {
        Self {
            primary,
            data: Default::default(),
        }
    }
}

impl<P: PackageDiscovery + Send + Sync> PackageDiscovery for CachingPackageDiscovery<P> {
async fn discover_packages(&self) -> Result<DiscoveryResponse, Error> {
    tracing::debug!("discovering packages using caching strategy");
    self.data
        .get_or_try_init(async {
            tracing::debug!("discovering packages using primary strategy");
            self.primary.discover_packages().await
        })
        .await
        .map(ToOwned::to_owned)
}

async fn discover_packages_blocking(&self) -> Result<DiscoveryResponse, Error> {
    tracing::debug!("discovering packages using caching strategy");
    self.data
        .get_or_try_init(async {
            tracing::debug!("discovering packages using primary strategy");
            self.primary.discover_packages_blocking().await
        })
        .await
        .map(ToOwned::to_owned)
}

}

#[cfg(test)]
mod fallback_tests {
use std::{
    sync::atomic::{AtomicUsize, Ordering},
    time::Duration,
};

use tokio::runtime::Runtime;

use super::*;

struct MockDiscovery {
    should_fail: bool,
    calls: AtomicUsize,
}

impl MockDiscovery {
    fn new(should_fail: bool) -> Self {
        Self {
            should_fail,
            calls: Default::default(),
        }
    }
}

impl PackageDiscovery for MockDiscovery {
    async fn discover_packages(&self) -> Result<DiscoveryResponse, Error> {
        if self.should_fail {
            Err(Error::Failed(Box::new(std::io::Error::other("mock error"))))
        } else {
            tokio::time::sleep(Duration::from_millis(100)).await;
            self.calls.fetch_add(1, Ordering::SeqCst);
            // Simulate successful package discovery
            Ok(DiscoveryResponse {
                package_manager: PackageManager::Npm,
                workspaces: vec![],
            })
        }
    }

    async fn discover_packages_blocking(
        &self,
    ) -> Result<crate::discovery::DiscoveryResponse, crate::discovery::Error> {
        self.discover_packages().await
    }
}

#[test]
fn test_fallback_on_primary_failure() {
    let rt = Runtime::new().unwrap();
    rt.block_on(async {
        let primary = MockDiscovery::new(true);
        let fallback = MockDiscovery::new(false);

        let mut discovery =
            FallbackPackageDiscovery::new(primary, fallback, Duration::from_secs(5));

        // Invoke the method under test
        let result = discovery.discover_packages().await;

        // Assert that the fallback was used and successful
        assert!(result.is_ok());

        // Assert that the fallback was used
        assert_eq!(*discovery.primary.calls.get_mut(), 0);
        assert_eq!(*discovery.fallback.calls.get_mut(), 1);
    });
}

#[test]
fn test_fallback_on_primary_timeout() {
    let rt = Runtime::new().unwrap();
    rt.block_on(async {
        let primary = MockDiscovery::new(false);
        let fallback = MockDiscovery::new(false);

        let mut discovery =
            FallbackPackageDiscovery::new(primary, fallback, Duration::from_millis(1));

        // Invoke the method under test
        let result = discovery.discover_packages().await;

        // Assert that the fallback was used and successful
        assert!(result.is_ok());

        // Assert that the fallback was used
        assert_eq!(*discovery.primary.calls.get_mut(), 0);
        assert_eq!(*discovery.fallback.calls.get_mut(), 1);
    });
}

}

#[cfg(test)]
mod caching_tests {
use std::sync::atomic::{AtomicUsize, Ordering};

use tokio::runtime::Runtime;

use super::*;

struct MockPackageDiscovery {
    call_count: AtomicUsize,
}

impl PackageDiscovery for MockPackageDiscovery {
    async fn discover_packages(&self) -> Result<DiscoveryResponse, Error> {
        self.call_count.fetch_add(1, Ordering::SeqCst);
        // Simulate successful package discovery
        Ok(DiscoveryResponse {
            package_manager: PackageManager::Npm,
            workspaces: vec![],
        })
    }

    async fn discover_packages_blocking(
        &self,
    ) -> Result<crate::discovery::DiscoveryResponse, crate::discovery::Error> {
        self.discover_packages().await
    }
}

#[test]
fn test_caching_package_discovery() {
    let rt = Runtime::new().unwrap();
    rt.block_on(async {
        let primary = MockPackageDiscovery {
            call_count: Default::default(),
        };
        let mut discovery = CachingPackageDiscovery::new(primary);

        // First call should use primary discovery
        let _first_result = discovery.discover_packages().await.unwrap();
        assert_eq!(*discovery.primary.call_count.get_mut(), 1);

        // Second call should use cached data and not increase call count
        let _second_result = discovery.discover_packages().await.unwrap();
        assert_eq!(*discovery.primary.call_count.get_mut(), 1);
    });
}

}
===== END FILE: crates/turborepo-repository/src/discovery.rs =====

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

I have the full response. Now let me write the final synthesized report.


Current State

GPT-5.2 Pro confirmed all 13 performance bottlenecks against the actual source code (glob pattern recompilation and SCM hash allocations were only partially confirmed because the packed snapshot truncated the relevant code). The turborepo codebase already has some wins in place (parallel file-hash dedup in PackageInputsHashes::calculate_file_hashes, stack-allocated hex buffers in SCM, compiled wildcard path for pass-through env), but significant hot-path costs remain that scale with monorepo size: serial config loading, single-threaded compression, per-task regex recompilation, RwLock contention across hash tracking, and redundant I/O on remote cache hits.


Required Changes

Prioritized by highest impact + lowest risk first:

Tier 1: High Impact

1. Cache Multiplexer Double I/O (Area #13): crates/turborepo-cache/src/multiplexer.rs

  • Confirmed: Remote hits call http.fetch() → extract to disk → fs.put() re-reads and re-compresses files into local cache
  • Fix: Add HTTPCache::fetch_raw() returning raw archive bytes + FSCache::put_archive_bytes() that atomically writes bytes to disk without re-archiving
  • Verify: Tracing spans around fetch_raw/put_archive_bytes; benchmark remote-hit scenarios comparing wall time + compress operations (should drop to 0 on remote hits)
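The put_archive_bytes half of this fix is mostly an atomic-write primitive. A minimal std-only sketch, assuming the function name, the temp-file scheme, and a flat byte-blob payload (none of these are the existing FSCache API):

```rust
use std::{fs, io::Write, path::Path};

// Hypothetical FSCache::put_archive_bytes: persist the already-compressed
// archive bytes from the remote fetch directly, skipping the re-read +
// re-compress that fs.put() performs today.
fn put_archive_bytes(dest: &Path, bytes: &[u8]) -> std::io::Result<()> {
    // Write to a sibling temp file first so a crash never leaves a
    // truncated archive at the final path.
    let tmp = dest.with_extension("tmp");
    {
        let mut file = fs::File::create(&tmp)?;
        file.write_all(bytes)?;
        file.sync_all()?;
    }
    // Rename is atomic when tmp and dest live on the same filesystem.
    fs::rename(&tmp, dest)
}
```

The corresponding fetch_raw change on the HTTP side would hand these bytes over before (or instead of) extracting, so the local fill costs one write instead of a full re-archive.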

2. Cache Archive Single-Threaded Compression (Area #2): crates/turborepo-cache/src/cache_archive/create.rs

  • Confirmed: CacheWriter::create() uses zstd::Encoder::new(file_buffer, 0) level 0 single-threaded; add_file() reads sequentially
  • Fix: Enable zstd multi-threading (enc.multithread(min(num_cpus, 8))); pre-scan files in parallel with bounded buffers, write tar entries from prepared buffers
  • Verify: Microbench MiB/s throughput; tracing counters for stat/read/tar_append/zstd_finish phases
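The encoder half is a small change on the zstd builder (multithreading is gated behind the crate's zstdmt feature). The read-staging half can be sketched std-only; this version spawns one thread per file for brevity, where the real change would use a bounded pool:

```rust
use std::{fs, io::Write, path::PathBuf, thread};

// Sketch of parallel read staging: read source files concurrently, then
// append them to the (single-stream) archive writer in a deterministic
// order. Stands in for CacheWriter::add_file's serial read-then-append.
fn stage_and_append(paths: &[PathBuf], out: &mut impl Write) -> std::io::Result<()> {
    let staged: Vec<std::io::Result<Vec<u8>>> = thread::scope(|s| {
        let handles: Vec<_> = paths
            .iter()
            .map(|p| s.spawn(move || fs::read(p)))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    });
    // Append in the original path order so the archive layout stays
    // deterministic even though reads completed out of order.
    for bytes in staged {
        out.write_all(&bytes?)?;
    }
    Ok(())
}
```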

3. Task Env Regex Recompilation (Area #6): crates/turborepo-env/src/lib.rs

  • Confirmed: hashable_task_env() calls from_wildcards() + wildcard_map_from_wildcards_unresolved() which compile regex via RegexBuilder::new().build() on every task
  • Fix: Extend CompiledWildcards to cover inclusion patterns; precompile by FrameworkSlug and task env pattern hash; add hashable_task_env_compiled() method
  • Verify: Instrument regex compile count; benchmark calculate_task_hash over N tasks with identical patterns
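The caching shape can be sketched with a process-wide map keyed by the raw pattern; a trailing-star prefix matcher stands in for the real regex compilation so the example stays dependency-free (names and structure are illustrative, not the CompiledWildcards API):

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex, OnceLock};

// Hypothetical compiled form of an env wildcard like "NEXT_PUBLIC_*".
struct CompiledPattern {
    prefix: String,
}

impl CompiledPattern {
    fn matches(&self, key: &str) -> bool {
        key.starts_with(&self.prefix)
    }
}

// Process-wide cache keyed by the raw pattern, so N tasks sharing the
// same env wildcard compile it once instead of N times.
fn compiled(pattern: &str) -> Arc<CompiledPattern> {
    static CACHE: OnceLock<Mutex<HashMap<String, Arc<CompiledPattern>>>> = OnceLock::new();
    let cache = CACHE.get_or_init(Default::default);
    let mut guard = cache.lock().unwrap();
    guard
        .entry(pattern.to_string())
        .or_insert_with(|| {
            // Only a cache miss pays the "compile" cost.
            Arc::new(CompiledPattern {
                prefix: pattern.trim_end_matches('*').to_string(),
            })
        })
        .clone()
}
```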

4. Engine Builder Serial Config Loading (Area #1): crates/turborepo-engine/src/builder.rs

  • Confirmed: turbo_json_chain() DFS calls turbo_json_loader.load() synchronously per node; has_task_definition_in_run_inner() also loads recursively
  • Fix: (A) Memoize load() per PackageName with DashMap cache; (B) Parallelize independent ReadReq futures via bounded tokio task pool
  • Verify: Tracing load() call counts + total load time; benchmark N workspaces with M extends edges
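Fix (A), memoization, can be sketched with a Mutex-backed map standing in for DashMap and a stub TurboJson type (both stand-ins, not the real loader):

```rust
use std::collections::HashMap;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::{Arc, Mutex};

// Stub for the parsed config; the real loader parses turbo.json files.
#[derive(Debug, PartialEq)]
struct TurboJson(String);

struct MemoizedLoader {
    loads: AtomicUsize,
    cache: Mutex<HashMap<String, Arc<TurboJson>>>,
}

impl MemoizedLoader {
    fn new() -> Self {
        Self {
            loads: AtomicUsize::new(0),
            cache: Mutex::new(HashMap::new()),
        }
    }

    // load() is called repeatedly during the extends-chain DFS; with the
    // cache, each package pays the parse cost exactly once.
    fn load(&self, package: &str) -> Arc<TurboJson> {
        let mut cache = self.cache.lock().unwrap();
        cache
            .entry(package.to_string())
            .or_insert_with(|| {
                self.loads.fetch_add(1, Ordering::Relaxed);
                Arc::new(TurboJson(format!("config for {package}")))
            })
            .clone()
    }
}
```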

Tier 2: Medium-High Impact

5. Loose Env Full Clone (Area #12): crates/turborepo-task-hash/src/lib.rs

  • Confirmed: EnvMode::Loose => Ok(self.env_at_execution_start.clone()) deep-clones entire env map per task
  • Fix: Return Cow::Borrowed(self.env_at_execution_start) for loose mode, or wrap in Arc<EnvironmentVariableMap>
  • Verify: Heap profiling (jemalloc/dhat); tracing env map size and clone counts
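The Cow shape can be sketched like this (names are illustrative; the real type is EnvironmentVariableMap):

```rust
use std::borrow::Cow;
use std::collections::HashMap;

type EnvMap = HashMap<String, String>;

enum EnvMode {
    Loose,
    Strict,
}

// Sketch of the Cow-based fix: loose mode borrows the process env
// captured at startup instead of deep-cloning it per task; strict mode
// still builds the filtered per-task map.
fn task_env<'a>(mode: EnvMode, full: &'a EnvMap, allowlist: &[&str]) -> Cow<'a, EnvMap> {
    match mode {
        // No allocation: every task shares the same borrowed map.
        EnvMode::Loose => Cow::Borrowed(full),
        EnvMode::Strict => Cow::Owned(
            full.iter()
                .filter(|(k, _)| allowlist.contains(&k.as_str()))
                .map(|(k, v)| (k.clone(), v.clone()))
                .collect(),
        ),
    }
}
```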

6. TaskHashTracker RwLock Contention (Area #7): crates/turborepo-task-hash/src/lib.rs

  • Confirmed: Single Arc<RwLock<TaskHashTrackerState>> guards 5 independent HashMaps
  • Fix: Split into per-field DashMap or individual RwLocks; add snapshot method for run-summary serialization
  • Verify: Contention benchmark (concurrent hash() + insert); parking_lot lock metrics
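A sketch of the split, reduced to two of the maps, with the snapshot method that run-summary serialization would use (field names are illustrative):

```rust
use std::collections::HashMap;
use std::sync::RwLock;

// Each map gets its own lock, so writers to one field no longer block
// readers of another during parallel wave hashing.
#[derive(Default)]
struct TaskHashTracker {
    hashes: RwLock<HashMap<String, String>>,
    expanded_outputs: RwLock<HashMap<String, Vec<String>>>,
}

impl TaskHashTracker {
    fn insert_hash(&self, task: &str, hash: &str) {
        self.hashes.write().unwrap().insert(task.into(), hash.into());
    }

    fn hash(&self, task: &str) -> Option<String> {
        self.hashes.read().unwrap().get(task).cloned()
    }

    // Run-summary serialization still needs a coherent view: take each
    // lock briefly and copy, instead of holding one big lock throughout.
    fn snapshot(&self) -> (HashMap<String, String>, HashMap<String, Vec<String>>) {
        (
            self.hashes.read().unwrap().clone(),
            self.expanded_outputs.read().unwrap().clone(),
        )
    }
}
```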

Tier 3: Medium Impact, Low Risk (Quick Wins)

7. GlobWatcher Allocation Overhead (Area #11): crates/turborepo-filewatch/src/globwatcher.rs

  • Confirmed: GlobSet::hash() builds fresh BTreeSet from include.keys() every call; PartialEq builds two fresh HashSets
  • Fix: Precompute include_keys_sorted: Arc<[String]> and include_keys_hash: u64 at construction; Hash/Eq use precomputed values; update on mutation
  • Verify: Microbench hash()/eq() 1M iterations; watch-mode allocation counts
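The precomputed-identity shape can be sketched like this (field names follow the fix description; the actual matcher fields are omitted):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Sort the include keys once and hash them once at construction, so
// Hash/PartialEq compare cached values instead of rebuilding
// BTreeSet/HashSet on every file event.
struct GlobSet {
    include_keys_sorted: Vec<String>,
    include_keys_hash: u64,
}

impl GlobSet {
    fn new(mut keys: Vec<String>) -> Self {
        keys.sort();
        let mut h = DefaultHasher::new();
        keys.hash(&mut h);
        Self {
            include_keys_sorted: keys,
            include_keys_hash: h.finish(),
        }
    }
}

impl PartialEq for GlobSet {
    fn eq(&self, other: &Self) -> bool {
        // Cheap reject on the cached hash before comparing keys.
        self.include_keys_hash == other.include_keys_hash
            && self.include_keys_sorted == other.include_keys_sorted
    }
}

impl Hash for GlobSet {
    fn hash<H: Hasher>(&self, state: &mut H) {
        state.write_u64(self.include_keys_hash);
    }
}
```

Any mutation of the include map must recompute both cached fields, which is why the fix notes "update on mutation".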

8. Discovery Sequential turbo.json Stats (Area #10): crates/turborepo-repository/src/discovery.rs

  • Confirmed: .then() serializes per-package tokio::fs::try_exists() calls
  • Fix: Replace .then() with .map() + .buffer_unordered(64); sort results if ordering matters downstream
  • Verify: Discovery span timing with many workspaces on cold disk

9. Change Mapper Linear Package Detection (Area #9): crates/turborepo-repository/src/change_mapper/package.rs

  • Confirmed: detect_package() loops through all packages checking prefix match via component zipping
  • Fix: Build HashMap<AnchoredSystemPathBuf, WorkspacePackage> index; walk ancestors of each file to find longest prefix match
  • Verify: Benchmark with P packages and F changed files
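The indexed lookup can be sketched with std paths (the real code uses AnchoredSystemPath types; the ancestor walk naturally finds the deepest, i.e. longest, matching package directory):

```rust
use std::collections::HashMap;
use std::path::{Path, PathBuf};

// Build the path -> package index once, then resolve each changed file
// by walking its ancestors: O(path depth) per file instead of
// O(packages) per file.
fn detect_package<'a>(
    index: &'a HashMap<PathBuf, String>,
    file: &Path,
) -> Option<&'a String> {
    file.ancestors().find_map(|dir| index.get(dir))
}
```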

Tier 4: Medium Impact, Medium Effort

10. Lockfile Transitive Closure (Area #5): crates/turborepo-lockfiles/src/lib.rs

  • Confirmed: Unsized DashMap::new(); shared memoization for resolve_package but no memoization for dependency subgraph walks
  • Fix: Pre-size DashMap; add dep_closure_cache: DashMap<String, Arc<HashSet<Package>>> to memoize shared subgraphs
  • Verify: Instrument all_dependencies call count + recursion depth; benchmark with shared deps
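The memoized closure walk can be sketched as follows. This assumes an acyclic graph for brevity; the real lockfile walk must handle cycles, and would use DashMap to share the cache across threads:

```rust
use std::collections::{HashMap, HashSet};
use std::sync::Arc;

// Once a package's closure is computed, every workspace that reaches it
// shares the Arc instead of re-walking the subgraph. `graph` maps a
// package to its direct dependencies.
fn closure(
    graph: &HashMap<String, Vec<String>>,
    pkg: &str,
    cache: &mut HashMap<String, Arc<HashSet<String>>>,
) -> Arc<HashSet<String>> {
    if let Some(hit) = cache.get(pkg) {
        return hit.clone();
    }
    let mut all = HashSet::new();
    for dep in graph.get(pkg).into_iter().flatten() {
        all.insert(dep.clone());
        all.extend(closure(graph, dep, cache).iter().cloned());
    }
    let all = Arc::new(all);
    cache.insert(pkg.to_string(), all.clone());
    all
}
```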

11. Glob Pattern Recompilation (Area #3): crates/turborepo-globwalk/src/lib.rs

  • Partially confirmed (packed snapshot truncated compilation site): wax::{Glob, Program} compiled from strings repeatedly
  • Fix: DashMap + Arc cache keyed by (base_path, include_globs, exclude_globs, walk_type, settings)
  • Verify: Instrument compile counts; benchmark repeated identical globwalk calls

12. Task Visitor Allocations (Area #8): crates/turborepo-lib/src/task_graph/visitor/mod.rs

  • Confirmed: precomputed.get(&info)...clone() clones EnvironmentVariableMap per task; mutex-based wave result collection
  • Fix: Store execution env as Arc<EnvironmentVariableMap> in precomputed; replace global mutex map with preallocated Vec or DashMap
  • Verify: Allocation profiling; measure precompute_task_hashes lock contention

13. SCM Package Hash Allocations (Area #4): crates/turborepo-scm/src/hash_object.rs

  • Partially confirmed: hex encoding uses stack buffer but still calls to_string() per file hash
  • Fix: Store OIDs as [u8; 20] or [u8; 40] hex bytes; only stringify at final hash step
  • Verify: Heap profiling during hashing; bench hash_objects over 100K files
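The fixed-size-buffer approach can be sketched like this (20-byte SHA-1 OIDs assumed; a SHA-256 repo would use 32/64):

```rust
// Keep git object IDs as fixed-size byte arrays in the hot loop and
// hex-encode into a stack buffer, so no String is allocated until the
// final hash step.
fn hex_oid(oid: &[u8; 20]) -> [u8; 40] {
    const HEX: &[u8; 16] = b"0123456789abcdef";
    let mut out = [0u8; 40];
    for (i, byte) in oid.iter().enumerate() {
        let b = *byte;
        out[i * 2] = HEX[(b >> 4) as usize];
        out[i * 2 + 1] = HEX[(b & 0x0f) as usize];
    }
    out
}

// Hex is always ASCII, so a &str view needs no allocation or copying.
fn oid_str(hex: &[u8; 40]) -> &str {
    std::str::from_utf8(hex).expect("hex is ASCII")
}
```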

Bonus Wins (Spotted by GPT-5.2 Pro)

14. Visitor precompute mutex bottleneck: precompute_task_hashes collects wave results then takes a global mutex to insert per wave. Replace with indexed vec or DashMap.

15. GlobWatcher include-map mutation + cached fields: Once hash/keys are cached, unlock faster get_changed_globs with compact bitset/ID mapping instead of HashSet<String> candidate filtering.


Ripple Effects

| Change Group | Crates Touched | Dependencies |
| --- | --- | --- |
| Cache (2, 13) | turborepo-cache (multiplexer, fs, http, cache_archive), turborepo-run-cache | Restored file list behavior; archive format semantics |
| Env (6, 12, 8) | turborepo-env, turborepo-task-hash, executor plumbing | Cow/Arc lifetime propagation through visitor/exec |
| Engine (1) | turborepo-engine builder, callers of error span behavior | Cycle detection + turbo.json extends resolution |
| Watch-mode (11, 9, 10) | turborepo-repository (discovery, change_mapper), turborepo-filewatch | Output ordering; invalidation correctness |
| Hash tracker (7) | turborepo-task-hash, run-summary serialization | Snapshot method needed for backward compat |

What This Unlocks

  • Cache pipeline (13 + 2): Eliminates the redundant "remote hit -> local recompress" pipeline entirely. Cache population scales with network + disk write, not CPU compression. This is the single highest-leverage compound win.

  • Per-task overhead (6 + 12 + 8 + 7): Reduces overhead enough that higher --concurrency values actually help. Today, increasing concurrency can move the bottleneck to hashing/env bookkeeping. After these changes, turbo can saturate more CPU cores productively.

  • Startup/planning phase (1 + 10 + 9): As caching improves, the "discovery + engine build" startup phase becomes the dominant cost. These wins make cold starts and --affected mode scale sublinearly with workspace count.


Effort & Risk

| # | Change | Impact | Effort | Risk | Verification |
| --- | --- | --- | --- | --- | --- |
| 13 | Remote hit -> store raw bytes to local | High | M | M | Integration bench + remote-hit/local-fill regression test |
| 2 | Multi-threaded zstd + parallel read staging | High | M | M | Throughput bench + stress tests for memory/ordering |
| 6 | Compile+cache env wildcards/patterns | High | M | M | Compile-count instrumentation + env semantic tests |
| 1 | Cached+parallel turbo.json loading | High | L-M | M | Load-count + builder bench + error-span tests |
| 12 | Avoid loose env deep clone (Cow/Arc) | Med-High | M | M | Heap profiling + strict/loose env integration |
| 7 | Split TaskHashTracker lock | Medium | M | M | Contention bench + run-summary regression |
| 11 | Precompute GlobSet hash/eq keys | Medium | S | L | Microbench hash/eq + watch-mode alloc counts |
| 10 | buffer_unordered turbo.json try_exists | Medium | S | L | Discovery span timing + ordering test/sort |
| 9 | Indexed package path lookup | Medium | S | L | detect_package bench + correctness tests |
| 4 | Reduce SCM hash string allocations | Low-Med | M | M | Heap profiling + hashing correctness tests |
| 5 | Memoize dependency closure subgraphs | Medium | M | M | Dependency-walk counters + lockfile correctness tests |
| 3 | Glob compilation cache | Medium | M | M | Compile-count instrumentation + windows path tests |
| 8 | Visitor env/task-id clone reductions | Medium | M | L-M | Alloc profiling + high-task-count run |

Recommended implementation order: Start with the three quick wins (11, 10, 9) to build confidence and tooling, then tackle the high-impact cache pipeline (13, 2), then the per-task overhead cluster (6, 12, 7, 8), and finally the engine builder + lockfile changes (1, 5).

Global verification strategy: Use TURBO_LOG_VERBOSITY=2 + tracing spans on every changed path, run cargo test across all touched crates, and compare turbo run build --dry=json output hashes before/after to ensure determinism is preserved.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. Key findings: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs Maturity: stable Relevance: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. Key findings: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.
  • task-graph-visitor-allocations Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs, crates/turborepo-lib/src/task_graph/visitor/exec.rs Maturity: stable Relevance: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run. Key findings: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run.
  • change-mapper-linear-package-detection Files: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs Maturity: stable Relevance: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages Key findings: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages
  • discovery-sequential-turbo-json-stat Files: crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run. Key findings: I confirmed the key finding by reading both files. The strongest candidate is in discovery.rs: At line 157, LocalPackageDiscovery::discover_packages uses tokio_stream::iter(...).then(...) which processes each tokio::fs::try_exists() call sequentially.then() awaits each future before starting the next. For independent filesystem stat calls across hundreds of packages, this serializes I/O that should be concurrent. The file even has the tokio async machinery in place — it just needs .map() + .buffer_unordered(N) instead of .then().
  • globwatcher-globset-hash-eq-allocs Files: crates/turborepo-filewatch/src/globwatcher.rs, crates/turborepo-filewatch/src/hash_watcher.rs Maturity: stable Relevance: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos. Key findings: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos.
  • task-hash-loose-env-clone Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/run/watch.rs, crates/turborepo-lib/src/run/mod.rs Maturity: stable Relevance: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate. Key findings: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate.
  • cache-multiplexer-redundant-restore-io Files: crates/turborepo-cache/src/multiplexer.rs, crates/turborepo-cache/src/http.rs, crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/restore.rs, crates/turborepo-cache/src/cache_archive/create.rs Maturity: stable Relevance: Remote cache hits trigger double I/O — files are extracted to disk via http.fetch(), then re-read and re-compressed by fs.put() to populate local cache, making every remote hit ~2-3x more expensive than necessary. Key findings: Remote cache hits trigger double I/O — files are extracted to disk via http.fetch(), then re-read and re-compressed by fs.put() to populate local cache, making every remote hit ~2-3x more expensive than necessary.

Exploration

AREA: cache-multiplexer-redundant-restore-io FILES: crates/turborepo-cache/src/multiplexer.rs, crates/turborepo-cache/src/http.rs, crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/restore.rs, crates/turborepo-cache/src/cache_archive/create.rs MATURITY: stable RELEVANCE: Remote cache hits trigger double I/O — files are extracted to disk via http.fetch(), then re-read and re-compressed by fs.put() to populate local cache, making every remote hit ~2-3x more expensive than necessary. NOTES: (unspecified)


Selected Code Context

===== FILE: crates/turborepo-cache/src/multiplexer.rs ===== use std::sync::{ Arc, Mutex, atomic::{AtomicBool, Ordering}, };

use tracing::{debug, warn}; use turbopath::{AbsoluteSystemPath, AnchoredSystemPathBuf}; use turborepo_analytics::AnalyticsSender; use turborepo_api_client::{APIAuth, APIClient};

use crate::{
    CacheConfig, CacheError, CacheHitMetadata, CacheOpts,
    fs::FSCache,
    http::{HTTPCache, UploadMap},
};

pub struct CacheMultiplexer {
    // We use an AtomicBool instead of removing the cache because that would require
    // wrapping the cache in a Mutex which would cause a lot of contention.
    // This does create a mild race condition where we might use the cache
    // even though another thread might be removing it, but that's fine.
    should_use_http_cache: AtomicBool,
    // Just for keeping track of whether we've already printed a warning about the remote cache
    // being read-only
    should_print_skipping_remote_put: AtomicBool,
    cache_config: CacheConfig,
    fs: Option<FSCache>,
    http: Option<HTTPCache>,
}

impl CacheMultiplexer {

#[tracing::instrument(skip_all)]
pub fn new(
    opts: &CacheOpts,
    repo_root: &AbsoluteSystemPath,
    api_client: APIClient,
    api_auth: Option<APIAuth>,
    analytics_recorder: Option<AnalyticsSender>,
) -> Result<Self, CacheError> {
    let use_fs_cache = opts.cache.local.should_use();
    let use_http_cache = opts.cache.remote.should_use();

    // Since the above two flags are not mutually exclusive it is possible to
    // configure yourself out of having a cache. We should tell you about it
    // but we shouldn't fail your build for that reason.
    if !use_fs_cache && !use_http_cache {
        warn!("no caches are enabled");
    }

    debug!(
        "CacheMultiplexer::new creating FSCache with cache_dir={}, repo_root={}",
        opts.cache_dir, repo_root
    );
    let fs_cache = use_fs_cache
        .then(|| FSCache::new(&opts.cache_dir, repo_root, analytics_recorder.clone()))
        .transpose()?;

    let http_cache = use_http_cache
        .then_some(api_auth)
        .flatten()
        .map(|api_auth| {
            HTTPCache::new(
                api_client,
                opts,
                repo_root.to_owned(),
                api_auth,
                analytics_recorder.clone(),
            )
        });

    Ok(CacheMultiplexer {
        should_print_skipping_remote_put: AtomicBool::new(true),
        should_use_http_cache: AtomicBool::new(http_cache.is_some()),
        cache_config: opts.cache,
        fs: fs_cache,
        http: http_cache,
    })
}

// This is technically a TOCTOU bug, but at worst it'll cause
// a few extra cache requests.
fn get_http_cache(&self) -> Option<&HTTPCache> {
    if self.should_use_http_cache.load(Ordering::Relaxed) {
        self.http.as_ref()
    } else {
        None
    }
}

pub fn requests(&self) -> Option<Arc<Mutex<UploadMap>>> {
    self.http.as_ref().map(|http| http.requests())
}

#[tracing::instrument(skip_all)]
pub async fn put(
    &self,
    anchor: &AbsoluteSystemPath,
    key: &str,
    files: &[AnchoredSystemPathBuf],
    duration: u64,
) -> Result<(), CacheError> {
    if self.cache_config.local.write {
        self.fs
            .as_ref()
            .map(|fs| fs.put(anchor, key, files, duration))
            .transpose()?;
    }

    let http_result = match self.get_http_cache() {
        Some(http) => {
            if self.cache_config.remote.write {
                let http_result = http.put(anchor, key, files, duration).await;

                Some(http_result)
            } else {
                if self
                    .should_print_skipping_remote_put
                    .load(Ordering::Relaxed)
                {
                    // Warn once per build, not per task
                    warn!("Remote cache is read-only, skipping upload");
                    self.should_print_skipping_remote_put
                        .store(false, Ordering::Relaxed);
                }
                // Cache is functional but running in read-only mode, so we don't want to try to
                // write to it
                None
            }
        }
        _ => None,
    };

    match http_result {
        Some(Err(CacheError::ApiClientError(
            box turborepo_api_client::Error::CacheDisabled { .. },
            ..,
        ))) => {
            warn!("failed to put to http cache: cache disabled");
            self.should_use_http_cache.store(false, Ordering::Relaxed);
            Ok(())
        }
        Some(Err(e)) => Err(e),
        None | Some(Ok(())) => Ok(()),
    }
}

#[tracing::instrument(skip_all)]
pub async fn fetch(
    &self,
    anchor: &AbsoluteSystemPath,
    key: &str,
) -> Result<Option<(CacheHitMetadata, Vec<AnchoredSystemPathBuf>)>, CacheError> {
    if self.cache_config.local.read
        && let Some(fs) = &self.fs
        && let response @ Ok(Some(_)) = fs.fetch(anchor, key)
    {
        return response;
    }

    if self.cache_config.remote.read
        && let Some(http) = self.get_http_cache()
        && let Ok(Some((CacheHitMetadata { source, time_saved }, files))) =
            http.fetch(key).await
    {
        // Store this into fs cache. We can ignore errors here because we know
        // we have previously successfully stored in HTTP cache, and so the overall
        // result is a success at fetching. Storing in lower-priority caches is an
        // optimization.
        if self.cache_config.local.write
            && let Some(fs) = &self.fs
        {
            let _ = fs.put(anchor, key, &files, time_saved);
        }

        return Ok(Some((CacheHitMetadata { source, time_saved }, files)));
    }

    Ok(None)
}

#[tracing::instrument(skip_all)]
pub async fn exists(&self, key: &str) -> Result<Option<CacheHitMetadata>, CacheError> {
    if self.cache_config.local.read
        && let Some(fs) = &self.fs
    {
        match fs.exists(key) {
            cache_hit @ Ok(Some(_)) => {
                return cache_hit;
            }
            Ok(None) => {}
            Err(err) => debug!("failed to check fs cache: {:?}", err),
        }
    }

    if self.cache_config.remote.read
        && let Some(http) = self.get_http_cache()
    {
        match http.exists(key).await {
            cache_hit @ Ok(Some(_)) => {
                return cache_hit;
            }
            Ok(None) => {}
            Err(err) => debug!("failed to check http cache: {:?}", err),
        }
    }

    Ok(None)
}

}
===== END FILE: crates/turborepo-cache/src/multiplexer.rs =====
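A hedged sketch of the multiplexer-side half of the fix: have the remote fetch hand back the raw compressed body alongside the restored file list, so populating the lower-priority local cache becomes a single byte copy rather than a second round of file I/O and compression. All names here (`LocalCache`, `store_raw`, `http_fetch`) are illustrative stand-ins, not the real turborepo-cache types:

```rust
use std::collections::HashMap;

// Stand-in for FSCache: keyed store of raw tar.zst bytes.
struct LocalCache {
    store: HashMap<String, Vec<u8>>,
}

impl LocalCache {
    // Hypothetical raw-body write path — no re-read, no re-compress.
    fn store_raw(&mut self, key: &str, body: &[u8]) {
        self.store.insert(key.to_string(), body.to_vec());
    }
}

// Stand-in for HTTPCache::fetch, modified to also return the body exactly
// as it came over the wire (already zstd-compressed).
fn http_fetch(key: &str) -> Option<(Vec<String>, Vec<u8>)> {
    Some((vec![format!("dist/{key}.js")], b"tar-zst-bytes".to_vec()))
}

// Stand-in for CacheMultiplexer::fetch on the remote-hit path.
fn fetch(local: &mut LocalCache, key: &str) -> Option<Vec<String>> {
    let (files, body) = http_fetch(key)?;
    // Populate the local cache from bytes we already hold in memory,
    // instead of calling the file-based put() on the restored outputs.
    local.store_raw(key, &body);
    Some(files)
}
```

The trade-off to weigh: the body must stay in memory (or be teed to disk while streaming) until the local write completes, but that is bytes the process already buffered for signature verification on the signed-artifact path.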

===== FILE: crates/turborepo-cache/src/http.rs =====
use std::{
    backtrace::Backtrace,
    collections::HashMap,
    io::{Cursor, Write},
    sync::{Arc, Mutex},
};

use tokio_stream::StreamExt;
use tracing::{debug, warn};
use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf, AnchoredSystemPathBuf};
use turborepo_analytics::AnalyticsSender;
use turborepo_api_client::{
    APIAuth, APIClient, CacheClient, Response,
    analytics::{self, AnalyticsEvent},
};

use crate::{
    CacheError, CacheHitMetadata, CacheOpts, CacheSource,
    cache_archive::{CacheReader, CacheWriter},
    signature_authentication::ArtifactSignatureAuthenticator,
    upload_progress::{UploadProgress, UploadProgressQuery},
};

pub type UploadMap = HashMap<String, UploadProgressQuery<10, 100>>;

pub struct HTTPCache {
    client: APIClient,
    signer_verifier: Option<ArtifactSignatureAuthenticator>,
    repo_root: AbsoluteSystemPathBuf,
    api_auth: Arc<Mutex<APIAuth>>,
    analytics_recorder: Option<AnalyticsSender>,
    uploads: Arc<Mutex<UploadMap>>,
}

impl HTTPCache {

#[tracing::instrument(skip_all)]
pub fn new(
    client: APIClient,
    opts: &CacheOpts,
    repo_root: AbsoluteSystemPathBuf,
    api_auth: APIAuth,
    analytics_recorder: Option<AnalyticsSender>,
) -> HTTPCache {
    let signer_verifier = if opts
        .remote_cache_opts
        .as_ref()
        .is_some_and(|remote_cache_opts| remote_cache_opts.signature)
    {
        Some(ArtifactSignatureAuthenticator {
            team_id: api_auth
                .team_id
                .as_deref()
                .unwrap_or_default()
                .as_bytes()
                .to_vec(),
            secret_key_override: None,
        })
    } else {
        None
    };

    HTTPCache {
        client,
        signer_verifier,
        repo_root,
        uploads: Arc::new(Mutex::new(HashMap::new())),
        api_auth: Arc::new(Mutex::new(api_auth)),
        analytics_recorder,
    }
}

/// Attempts to refresh the auth token when a cache operation encounters a
/// 403 forbidden error. Returns true if the token was successfully
/// refreshed, false otherwise.
async fn try_refresh_token(&self) -> bool {
    match turborepo_auth::get_token_with_refresh().await {
        Ok(Some(new_token)) => {
            // Update the API auth with the new token
            if let Ok(mut auth) = self.api_auth.lock() {
                auth.token = new_token;
                debug!("Successfully refreshed auth token for cache operations");
                true
            } else {
                warn!("Failed to acquire lock for updating auth token");
                false
            }
        }
        Ok(None) => {
            debug!("No refresh token available or token doesn't support refresh");
            false
        }
        Err(e) => {
            warn!("Failed to refresh token: {:?}", e);
            false
        }
    }
}

/// Helper method to execute a cache operation with automatic token refresh
/// on 403 errors.
async fn execute_with_token_refresh<T, F, Fut>(
    &self,
    hash: &str,
    operation: F,
) -> Result<T, CacheError>
where
    F: Fn(APIAuth) -> Fut,
    Fut: std::future::Future<Output = Result<T, turborepo_api_client::Error>>,
{
    // Try the operation with the current token
    let api_auth = self.api_auth.lock().unwrap().clone();
    match operation(api_auth.clone()).await {
        Ok(result) => Ok(result),
        Err(turborepo_api_client::Error::UnknownStatus { code, .. }) if code == "forbidden" => {
            // Try to refresh the token
            if self.try_refresh_token().await {
                // Retry the operation with the refreshed token
                let refreshed_auth = self.api_auth.lock().unwrap().clone();
                operation(refreshed_auth)
                    .await
                    .map_err(|err| Self::convert_api_error(hash, err))
            } else {
                // Token refresh failed, return the original error
                Err(CacheError::ForbiddenRemoteCacheWrite)
            }
        }
        Err(e) => Err(Self::convert_api_error(hash, e)),
    }
}

#[tracing::instrument(skip_all)]
pub async fn put(
    &self,
    anchor: &AbsoluteSystemPath,
    hash: &str,
    files: &[AnchoredSystemPathBuf],
    duration: u64,
) -> Result<(), CacheError> {
    let mut artifact_body = Vec::new();
    self.write(&mut artifact_body, anchor, files).await?;
    let bytes = artifact_body.len();

    let tag = self
        .signer_verifier
        .as_ref()
        .map(|signer| signer.generate_tag(hash.as_bytes(), &artifact_body))
        .transpose()?;

    tracing::debug!("uploading {}", hash);

    // Use the helper method to handle token refresh on 403 errors
    let artifact_body_clone = artifact_body.clone(); // Store the artifact body for retry
    let tag_clone = tag.clone();
    let uploads_clone = self.uploads.clone();

    self.execute_with_token_refresh(hash, |api_auth| {
        let client = &self.client;
        let tag_ref = tag_clone.as_deref();
        let artifact_body_ref = artifact_body_clone.clone();
        let uploads_ref = uploads_clone.clone();

        async move {
            // Create the stream inside the closure so it can be used for retry
            let stream = tokio_util::codec::FramedRead::new(
                Cursor::new(artifact_body_ref),
                tokio_util::codec::BytesCodec::new(),
            )
            .map(|res| {
                res.map(|bytes| bytes.freeze())
                    .map_err(turborepo_api_client::Error::from)
            });

            let (progress, query) = UploadProgress::<10, 100, _>::new(stream, Some(bytes));

            {
                let mut uploads = uploads_ref.lock().unwrap();
                uploads.insert(hash.to_string(), query);
            }

            client
                .put_artifact(
                    hash,
                    progress,
                    bytes,
                    duration,
                    tag_ref,
                    &api_auth.token,
                    api_auth.team_id.as_deref(),
                    api_auth.team_slug.as_deref(),
                )
                .await
        }
    })
    .await?;

    tracing::debug!("uploaded {}", hash);
    Ok(())
}

#[tracing::instrument(skip_all)]
async fn write(
    &self,
    writer: impl Write,
    anchor: &AbsoluteSystemPath,
    files: &[AnchoredSystemPathBuf],
) -> Result<(), CacheError> {
    let mut cache_archive = CacheWriter::from_writer(writer, true)?;
    for file in files {
        cache_archive.add_file(anchor, file)?;
    }

    Ok(())
}

#[tracing::instrument(skip_all)]
pub async fn exists(&self, hash: &str) -> Result<Option<CacheHitMetadata>, CacheError> {
    let response = self
        .execute_with_token_refresh(hash, |api_auth| {
            let client = &self.client;
            async move {
                client
                    .artifact_exists(
                        hash,
                        &api_auth.token,
                        api_auth.team_id.as_deref(),
                        api_auth.team_slug.as_deref(),
                    )
                    .await
            }
        })
        .await?;

    let Some(response) = response else {
        return Ok(None);
    };

    let duration = Self::get_duration_from_response(&response)?;

    Ok(Some(CacheHitMetadata {
        source: CacheSource::Remote,
        time_saved: duration,
    }))
}

fn get_duration_from_response(response: &Response) -> Result<u64, CacheError> {
    if let Some(duration_value) = response.headers().get("x-artifact-duration") {
        let duration = duration_value
            .to_str()
            .map_err(|_| CacheError::InvalidDuration(Backtrace::capture()))?;

        duration
            .parse::<u64>()
            .map_err(|_| CacheError::InvalidDuration(Backtrace::capture()))
    } else {
        Ok(0)
    }
}

fn log_fetch(&self, event: analytics::CacheEvent, hash: &str, duration: u64) {
    // If analytics fails to record, it's not worth failing the cache
    if let Some(analytics_recorder) = &self.analytics_recorder {
        let analytics_event = AnalyticsEvent {
            session_id: None,
            source: analytics::CacheSource::Remote,
            event,
            hash: hash.to_string(),
            duration,
        };
        debug!("logging fetch: {analytics_event:?}");
        let _ = analytics_recorder.send(analytics_event);
    }
}

#[tracing::instrument(skip_all)]
pub async fn fetch(
    &self,
    hash: &str,
) -> Result<Option<(CacheHitMetadata, Vec<AnchoredSystemPathBuf>)>, CacheError> {
    let response = self
        .execute_with_token_refresh(hash, |api_auth| {
            let client = &self.client;
            async move {
                client
                    .fetch_artifact(
                        hash,
                        &api_auth.token,
                        api_auth.team_id.as_deref(),
                        api_auth.team_slug.as_deref(),
                    )
                    .await
            }
        })
        .await?;

    let Some(response) = response else {
        self.log_fetch(analytics::CacheEvent::Miss, hash, 0);
        return Ok(None);
    };

    let duration = Self::get_duration_from_response(&response)?;

    let body = if let Some(signer_verifier) = &self.signer_verifier {
        let expected_tag = response
            .headers()
            .get("x-artifact-tag")
            .ok_or(CacheError::ArtifactTagMissing(Backtrace::capture()))?;

        let expected_tag = expected_tag
            .to_str()
            .map_err(|_| CacheError::InvalidTag(Backtrace::capture()))?
            .to_string();

        let body = response.bytes().await.map_err(|e| {
            CacheError::ApiClientError(
                Box::new(turborepo_api_client::Error::ReqwestError(e)),
                Backtrace::capture(),
            )
        })?;
        let is_valid = signer_verifier.validate(hash.as_bytes(), &body, &expected_tag)?;

        if !is_valid {
            return Err(CacheError::InvalidTag(Backtrace::capture()));
        }

        body
    } else {
        response.bytes().await.map_err(|e| {
            CacheError::ApiClientError(
                Box::new(turborepo_api_client::Error::ReqwestError(e)),
                Backtrace::capture(),
            )
        })?
    };

    let files = Self::restore_tar(&self.repo_root, &body)?;

    self.log_fetch(analytics::CacheEvent::Hit, hash, duration);
    Ok(Some((
        CacheHitMetadata {
            source: CacheSource::Remote,
            time_saved: duration,
        },
        files,
    )))
}

pub fn requests(&self) -> Arc<Mutex<UploadMap>> {
    self.uploads.clone()
}

#[tracing::instrument(skip_all)]
pub(crate) fn restore_tar(
    root: &AbsoluteSystemPath,
    body: &[u8],
) -> Result<Vec<AnchoredSystemPathBuf>, CacheError> {
    let mut cache_reader = CacheReader::from_reader(body, true)?;
    cache_reader.restore(root)
}

fn convert_api_error(hash: &str, err: turborepo_api_client::Error) -> CacheError {
    match err {
        turborepo_api_client::Error::ReqwestError(e) if e.is_timeout() => {
            CacheError::TimeoutError(hash.to_string())
        }
        turborepo_api_client::Error::ReqwestError(e) if e.is_connect() => {
            CacheError::ConnectError
        }
        turborepo_api_client::Error::UnknownStatus { code, .. } if code == "forbidden" => {
            CacheError::ForbiddenRemoteCacheWrite
        }
        e => e.into(),
    }
}

}

#[cfg(test)]
mod test {

use std::{backtrace::Backtrace, time::Duration};

use anyhow::Result;
use futures::future::try_join_all;
use insta::assert_snapshot;
use tempfile::tempdir;
use turbopath::AbsoluteSystemPathBuf;
use turborepo_analytics::start_analytics;
use turborepo_api_client::{APIClient, SecretString, analytics};
use turborepo_vercel_api_mock::start_test_server;

use crate::{
    CacheOpts, CacheSource,
    http::{APIAuth, HTTPCache},
    test_cases::{TestCase, get_test_cases, validate_analytics},
};

#[tokio::test]
async fn test_http_cache() -> Result<()> {
    let port = port_scanner::request_open_port().unwrap();
    let (ready_tx, ready_rx) = tokio::sync::oneshot::channel();
    let handle = tokio::spawn(start_test_server(port, Some(ready_tx)));

    // Wait for the server to be ready (with timeout)
    tokio::time::timeout(Duration::from_secs(5), ready_rx)
        .await
        .map_err(|_| anyhow::anyhow!("Test server failed to start within timeout"))??;

    let test_cases = get_test_cases();

    try_join_all(
        test_cases
            .iter()
            .map(|test_case| round_trip_test(test_case, port)),
    )
    .await?;

    validate_analytics(&test_cases, analytics::CacheSource::Remote, port).await?;
    handle.abort();
    Ok(())
}

async fn round_trip_test(test_case: &TestCase, port: u16) -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path())?;
    test_case.initialize(&repo_root_path)?;

    let hash = test_case.hash;
    let files = &test_case.files;
    let duration = test_case.duration;

    let api_client = APIClient::new(
        format!("http://localhost:{port}"),
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        true,
    )?;
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };
    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("my-token".to_string()),
        team_slug: None,
    };
    let (analytics_recorder, analytics_handle) =
        start_analytics(api_auth.clone(), api_client.clone());

    let cache = HTTPCache::new(
        api_client,
        &opts,
        repo_root_path.to_owned(),
        api_auth,
        Some(analytics_recorder),
    );

    // Should be a cache miss at first
    let miss = cache.fetch(hash).await?;
    assert!(miss.is_none());

    let anchored_files: Vec<_> = files.iter().map(|f| f.path().to_owned()).collect();
    cache
        .put(&repo_root_path, hash, &anchored_files, duration)
        .await?;

    let cache_response = cache.exists(hash).await?.unwrap();

    assert_eq!(cache_response.time_saved, duration);
    assert_eq!(cache_response.source, CacheSource::Remote);

    let (cache_response, received_files) = cache.fetch(hash).await?.unwrap();

    assert_eq!(cache_response.time_saved, duration);

    for (test_file, received_file) in files.iter().zip(received_files) {
        assert_eq!(&*received_file, test_file.path());
        let file_path = repo_root_path.resolve(&received_file);
        if let Some(contents) = test_file.contents() {
            assert_eq!(std::fs::read_to_string(file_path)?, contents);
        } else {
            assert!(file_path.exists());
        }
    }

    analytics_handle.close_with_timeout().await;

    Ok(())
}

#[test]
fn test_forbidden_error() {
    let err = HTTPCache::convert_api_error(
        "hash",
        turborepo_api_client::Error::UnknownStatus {
            code: "forbidden".into(),
            message: "Not authorized".into(),
            backtrace: Backtrace::capture(),
        },
    );
    assert_snapshot!(err.to_string(), @"Insufficient permissions to write to remote cache. Please verify that your role has write access for Remote Cache Artifact at https://vercel.com/docs/accounts/team-members-and-roles/access-roles/team-level-roles?resource=Remote+Cache+Artifact");
}

#[test]
fn test_unknown_status() {
    let err = HTTPCache::convert_api_error(
        "hash",
        turborepo_api_client::Error::UnknownStatus {
            code: "unknown".into(),
            message: "Special message".into(),
            backtrace: Backtrace::capture(),
        },
    );
    assert_snapshot!(err.to_string(), @"failed to contact remote cache: Unknown status unknown: Special message");
}

#[test]
fn test_cache_disabled() {
    let err = HTTPCache::convert_api_error(
        "hash",
        turborepo_api_client::Error::CacheDisabled {
            status: turborepo_vercel_api::CachingStatus::Disabled,
            message: "Cache disabled".into(),
        },
    );
    assert_snapshot!(err.to_string(), @"failed to contact remote cache: Cache disabled");
}

#[tokio::test]
async fn test_token_refresh_on_403() {
    // This test verifies that the HTTPCache can handle token refresh when
    // encountering 403 errors. Note: This is an integration test that would
    // need a mock server setup to fully verify the token refresh flow, but
    // the logic structure is tested through the build validation.
    let repo_root = tempfile::tempdir().unwrap();
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path()).unwrap();

    let api_client = APIClient::new(
        "http://localhost:8000",
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        false,
    )
    .unwrap();
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };

    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("expired-token".to_string()),
        team_slug: None,
    };

    let cache = HTTPCache::new(api_client, &opts, repo_root_path, api_auth, None);

    // Verify that the cache has the token refresh capability
    // The actual token refresh would be tested in integration tests with a proper
    // mock server. The vca_ prefix check is now handled in the auth layer.
    // The result depends on whether there are any tokens available in the system
    //
    // The result can be true or false depending on system state, but the method
    // should not panic. The test will fail if it does.
    cache.try_refresh_token().await;
}

#[tokio::test]
async fn test_cache_token_update_after_refresh() {
    // Test that the cache properly updates its internal token after a successful
    // refresh
    let repo_root = tempfile::tempdir().unwrap();
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path()).unwrap();

    let api_client = APIClient::new(
        "http://localhost:8000",
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        false,
    )
    .unwrap();
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };

    let initial_api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("initial-token".to_string()),
        team_slug: None,
    };

    let cache = HTTPCache::new(api_client, &opts, repo_root_path, initial_api_auth, None);

    // Verify initial token
    let initial_auth = cache.api_auth.lock().unwrap().clone();
    assert_eq!(initial_auth.token.expose(), "initial-token");

    // Test the token refresh mechanism (without actual HTTP call)
    // In a real scenario, try_refresh_token would call
    // turborepo_auth::get_token_with_refresh and update the internal token
    // if successful
    let refresh_result = cache.try_refresh_token().await;

    // The result depends on system state - could be true or false
    let final_auth = cache.api_auth.lock().unwrap().clone();

    if refresh_result {
        // If refresh succeeded, token should have been updated
        assert_ne!(final_auth.token.expose(), "initial-token");
    } else {
        // If refresh failed, token should remain unchanged
        assert_eq!(final_auth.token.expose(), "initial-token");
    }
}

#[test]
fn test_cache_auth_mutex_thread_safety() {
    // Test that the Arc<Mutex<APIAuth>> is properly thread-safe
    use std::{sync::Arc, thread};

    let repo_root = tempfile::tempdir().unwrap();
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path()).unwrap();

    let api_client = APIClient::new(
        "http://localhost:8000",
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        false,
    )
    .unwrap();
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };

    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("thread-test-token".to_string()),
        team_slug: None,
    };

    let cache = Arc::new(HTTPCache::new(
        api_client,
        &opts,
        repo_root_path,
        api_auth,
        None,
    ));

    // Test concurrent access to the auth mutex
    let handles: Vec<_> = (0..5)
        .map(|i| {
            let cache_clone = Arc::clone(&cache);
            thread::spawn(move || {
                let auth = cache_clone.api_auth.lock().unwrap();
                assert_eq!(auth.token.expose(), "thread-test-token");
                assert_eq!(auth.team_id, Some("my-team".to_string()));
                // Simulate some work
                thread::sleep(std::time::Duration::from_millis(10));
                format!("thread-{i}")
            })
        })
        .collect();

    // Wait for all threads to complete
    for handle in handles {
        let result = handle.join().unwrap();
        assert!(result.starts_with("thread-"));
    }
}

}
===== END FILE: crates/turborepo-cache/src/http.rs =====

===== FILE: crates/turborepo-cache/src/fs.rs =====
use std::backtrace::Backtrace;

use camino::Utf8Path;
use serde::{Deserialize, Serialize};
use tracing::debug;
use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf, AnchoredSystemPathBuf};
use turborepo_analytics::AnalyticsSender;
use turborepo_api_client::{analytics, analytics::AnalyticsEvent};

use crate::{
    CacheError, CacheHitMetadata, CacheSource,
    cache_archive::{CacheReader, CacheWriter},
};

pub struct FSCache {
    cache_directory: AbsoluteSystemPathBuf,
    analytics_recorder: Option<AnalyticsSender>,
}

#[derive(Debug, Deserialize, Serialize)]
struct CacheMetadata {
    hash: String,
    duration: u64,
}

impl CacheMetadata {
    fn read(path: &AbsoluteSystemPath) -> Result<CacheMetadata, CacheError> {
        serde_json::from_str(&path.read_to_string()?)
            .map_err(|e| CacheError::InvalidMetadata(e, Backtrace::capture()))
    }
}

impl FSCache {

fn resolve_cache_dir(
    repo_root: &AbsoluteSystemPath,
    cache_dir: &Utf8Path,
) -> AbsoluteSystemPathBuf {
    AbsoluteSystemPathBuf::from_unknown(repo_root, cache_dir)
}

#[tracing::instrument(skip_all)]
pub fn new(
    cache_dir: &Utf8Path,
    repo_root: &AbsoluteSystemPath,
    analytics_recorder: Option<AnalyticsSender>,
) -> Result<Self, CacheError> {
    debug!(
        "FSCache::new called with cache_dir={}, repo_root={}",
        cache_dir, repo_root
    );
    let cache_directory = Self::resolve_cache_dir(repo_root, cache_dir);
    debug!("FSCache resolved cache_directory={}", cache_directory);
    cache_directory.create_dir_all()?;

    Ok(FSCache {
        cache_directory,
        analytics_recorder,
    })
}

fn log_fetch(&self, event: analytics::CacheEvent, hash: &str, duration: u64) {
    // If analytics fails to record, it's not worth failing the cache
    if let Some(analytics_recorder) = &self.analytics_recorder {
        let analytics_event = AnalyticsEvent {
            session_id: None,
            source: analytics::CacheSource::Local,
            event,
            hash: hash.to_string(),
            duration,
        };

        let _ = analytics_recorder.send(analytics_event);
    }
}

#[tracing::instrument(skip_all)]
pub fn fetch(
    &self,
    anchor: &AbsoluteSystemPath,
    hash: &str,
) -> Result<Option<(CacheHitMetadata, Vec<AnchoredSystemPathBuf>)>, CacheError> {
    let uncompressed_cache_path = self.cache_directory.join_component(&format!("{hash}.tar"));
    let compressed_cache_path = self
        .cache_directory
        .join_component(&format!("{hash}.tar.zst"));

    debug!(
        "FSCache::fetch looking for cache artifacts at {} or {}",
        uncompressed_cache_path, compressed_cache_path
    );

    let cache_path = if uncompressed_cache_path.exists() {
        uncompressed_cache_path
    } else if compressed_cache_path.exists() {
        compressed_cache_path
    } else {
        debug!(
            "FSCache::fetch cache miss for hash {} in {}",
            hash, self.cache_directory
        );
        self.log_fetch(analytics::CacheEvent::Miss, hash, 0);
        return Ok(None);
    };

    let mut cache_reader = CacheReader::open(&cache_path)?;

    let restored_files = cache_reader.restore(anchor)?;

    let meta = CacheMetadata::read(
        &self
            .cache_directory
            .join_component(&format!("{hash}-meta.json")),
    )?;

    self.log_fetch(analytics::CacheEvent::Hit, hash, meta.duration);

    Ok(Some((
        CacheHitMetadata {
            time_saved: meta.duration,
            source: CacheSource::Local,
        },
        restored_files,
    )))
}

#[tracing::instrument(skip_all)]
pub(crate) fn exists(&self, hash: &str) -> Result<Option<CacheHitMetadata>, CacheError> {
    let cache_dir = self.cache_directory.as_str();
    let mut buf = String::with_capacity(cache_dir.len() + 1 + hash.len() + "-meta.json".len());
    buf.push_str(cache_dir);
    buf.push(std::path::MAIN_SEPARATOR);
    buf.push_str(hash);
    let prefix_len = buf.len();

    buf.push_str(".tar");
    let uncompressed_exists = std::path::Path::new(&buf).exists();

    buf.push_str(".zst");
    let compressed_exists = std::path::Path::new(&buf).exists();

    if !uncompressed_exists && !compressed_exists {
        return Ok(None);
    }

    buf.truncate(prefix_len);
    buf.push_str("-meta.json");

    let duration = CacheMetadata::read(
        &AbsoluteSystemPathBuf::try_from(buf.as_str())
            .map_err(|_| CacheError::ConfigCacheInvalidBase)?,
    )
    .map(|meta| meta.duration)
    .unwrap_or(0);

    Ok(Some(CacheHitMetadata {
        time_saved: duration,
        source: CacheSource::Local,
    }))
}

#[tracing::instrument(skip_all)]
pub fn put(
    &self,
    anchor: &AbsoluteSystemPath,
    hash: &str,
    files: &[AnchoredSystemPathBuf],
    duration: u64,
) -> Result<(), CacheError> {
    let cache_path = self
        .cache_directory
        .join_component(&format!("{hash}.tar.zst"));

    let mut cache_item = CacheWriter::create(&cache_path)?;

    for file in files {
        cache_item.add_file(anchor, file)?;
    }

    // Finish the archive (performs atomic rename from temp to final path)
    cache_item.finish()?;

    // Write metadata file atomically using write-to-temp-then-rename pattern
    let metadata_path = self
        .cache_directory
        .join_component(&format!("{hash}-meta.json"));

    let meta = CacheMetadata {
        hash: hash.to_string(),
        duration,
    };

    let meta_json = serde_json::to_string(&meta)
        .map_err(|e| CacheError::InvalidMetadata(e, Backtrace::capture()))?;

    // Write to temporary file then atomically rename
    let temp_metadata_path = self
        .cache_directory
        .join_component(&format!(".{hash}-meta.json.{}.tmp", std::process::id()));

    temp_metadata_path.create_with_contents(&meta_json)?;
    temp_metadata_path.rename(&metadata_path)?;

    Ok(())
}

}

#[cfg(test)]
mod test {
use std::time::Duration;

use anyhow::Result;
use futures::future::try_join_all;
use tempfile::tempdir;
use turbopath::AnchoredSystemPath;
use turborepo_analytics::start_analytics;
use turborepo_api_client::{APIAuth, APIClient, SecretString};
use turborepo_vercel_api_mock::start_test_server;

use super::*;
use crate::test_cases::{TestCase, get_test_cases, validate_analytics};

#[tokio::test]
async fn test_fs_cache() -> Result<()> {
    let port = port_scanner::request_open_port().unwrap();
    let (ready_tx, ready_rx) = tokio::sync::oneshot::channel();
    tokio::spawn(start_test_server(port, Some(ready_tx)));

    // Wait for server to be ready
    tokio::time::timeout(Duration::from_secs(5), ready_rx)
        .await
        .map_err(|_| anyhow::anyhow!("Test server failed to start"))??;

    let test_cases = get_test_cases();

    try_join_all(
        test_cases
            .iter()
            .map(|test_case| round_trip_test(test_case, port)),
    )
    .await?;

    validate_analytics(&test_cases, analytics::CacheSource::Local, port).await?;
    Ok(())
}

async fn round_trip_test(test_case: &TestCase, port: u16) -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;
    test_case.initialize(repo_root_path)?;

    let api_client = APIClient::new(
        format!("http://localhost:{port}"),
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        true,
    )?;
    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("my-token".to_string()),
        team_slug: None,
    };
    let (analytics_sender, analytics_handle) =
        start_analytics(api_auth.clone(), api_client.clone());

    let cache = FSCache::new(
        Utf8Path::new(""),
        repo_root_path,
        Some(analytics_sender.clone()),
    )?;

    let expected_miss = cache.fetch(repo_root_path, test_case.hash)?;
    assert!(expected_miss.is_none());

    let files: Vec<_> = test_case
        .files
        .iter()
        .map(|f| f.path().to_owned())
        .collect();
    cache.put(repo_root_path, test_case.hash, &files, test_case.duration)?;

    let (status, files) = cache.fetch(repo_root_path, test_case.hash)?.unwrap();

    assert_eq!(
        status,
        CacheHitMetadata {
            time_saved: test_case.duration,
            source: CacheSource::Local
        }
    );

    assert_eq!(files.len(), test_case.files.len());
    for (expected, actual) in test_case.files.iter().zip(files.iter()) {
        let actual: &AnchoredSystemPath = actual;
        assert_eq!(expected.path(), actual);
        let actual_file = repo_root_path.resolve(actual);
        if let Some(contents) = expected.contents() {
            assert_eq!(contents, actual_file.read_to_string()?);
        } else {
            assert!(actual_file.exists());
        }
    }

    analytics_handle.close_with_timeout().await;
    Ok(())
}

/// Test that multiple concurrent writes to the same hash don't corrupt the
/// cache. This tests the atomic write pattern
/// (write-to-temp-then-rename).
#[tokio::test]
async fn test_concurrent_writes_same_hash() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "concurrent_write_test";
    let duration = 100;

    // Create multiple caches pointing to the same directory
    let cache1 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache2 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache3 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;

    // Perform concurrent writes
    let handle1 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache1.put(&repo_root, hash, &files, duration) })
    };
    let handle2 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache2.put(&repo_root, hash, &files, duration) })
    };
    let handle3 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache3.put(&repo_root, hash, &files, duration) })
    };

    // All writes should succeed (or at least not corrupt the cache)
    let _ = handle1.await?;
    let _ = handle2.await?;
    let _ = handle3.await?;

    // The cache should be readable
    let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let result = cache.fetch(repo_root_path, hash)?;
    assert!(
        result.is_some(),
        "Cache should be readable after concurrent writes"
    );

    Ok(())
}

/// Test that reads during writes don't fail.
/// A read should either return the old content, new content, or a miss -
/// never corrupted data.
#[tokio::test]
async fn test_read_during_write() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("original content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "read_during_write_test";
    let duration = 100;

    // First write to establish the cache
    let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    cache.put(repo_root_path, hash, &files, duration)?;

    // Update the source file
    test_file.create_with_contents("updated content")?;

    // Perform concurrent read and write
    let cache_write = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache_read = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;

    let write_handle = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache_write.put(&repo_root, hash, &files, duration + 1) })
    };

    // Perform multiple reads while write is happening
    for _ in 0..10 {
        let result = cache_read.fetch(repo_root_path, hash);
        // Should either succeed with valid data or fail cleanly - no corruption
        if let Ok(Some((metadata, _))) = result {
            // Duration should be either old or new value
            assert!(
                metadata.time_saved == duration || metadata.time_saved == duration + 1,
                "Unexpected duration: {}",
                metadata.time_saved
            );
        }
    }

    write_handle.await??;

    Ok(())
}

/// Test that multiple concurrent reads don't interfere with each other.
#[tokio::test]
async fn test_concurrent_reads() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "concurrent_read_test";
    let duration = 100;

    // Write to cache first
    let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    cache.put(repo_root_path, hash, &files, duration)?;

    // Perform concurrent reads
    let mut handles = Vec::new();
    for _ in 0..10 {
        let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
        let repo_root = repo_root_path.to_owned();
        handles.push(tokio::spawn(async move { cache.fetch(&repo_root, hash) }));
    }

    // All reads should succeed
    for handle in handles {
        let result = handle.await??;
        assert!(result.is_some(), "Concurrent read should succeed");
        let (metadata, _) = result.unwrap();
        assert_eq!(metadata.time_saved, duration);
    }

    Ok(())
}

/// Test that temp files are cleaned up after concurrent writes.
#[tokio::test]
async fn test_concurrent_writes_cleanup_temp_files() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "temp_cleanup_test";
    let duration = 100;

    // Perform concurrent writes
    let cache1 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache2 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache3 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;

    let handle1 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache1.put(&repo_root, hash, &files, duration) })
    };
    let handle2 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache2.put(&repo_root, hash, &files, duration) })
    };
    let handle3 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache3.put(&repo_root, hash, &files, duration) })
    };

    // Wait for all writes to complete
    let _ = handle1.await?;
    let _ = handle2.await?;
    let _ = handle3.await?;

    // Verify no orphaned temp files remain in cache directory
    let cache_dir = repo_root_path.join_component("cache");
    let temp_files: Vec<_> = std::fs::read_dir(cache_dir.as_std_path())?
        .filter_map(|e| e.ok())
        .filter(|e| e.file_name().to_string_lossy().ends_with(".tmp"))
        .collect();
    assert!(
        temp_files.is_empty(),
        "Orphaned temp files found after concurrent writes: {:?}",
        temp_files
    );

    // Verify exactly one archive file exists for the hash
    let archive_files: Vec<_> = std::fs::read_dir(cache_dir.as_std_path())?
        .filter_map(|e| e.ok())
        .filter(|e| {
            let name = e.file_name().to_string_lossy().to_string();
            name.contains(hash) && name.ends_with(".tar.zst")
        })
        .collect();
    assert_eq!(
        archive_files.len(),
        1,
        "Expected exactly one archive file, found: {:?}",
        archive_files
    );

    Ok(())
}

}

===== END FILE: crates/turborepo-cache

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.



Current State

The turborepo codebase has 18 confirmed performance bottlenecks across the critical turbo run path. GPT-5.2 Pro validated 7 of the original 15 areas directly from source code (items 1, 6, 7, 8, 11, 12, 14) and identified 3 additional new wins (items 16-18). The remaining items (2, 4, 5, 9, 10, 13, 15) are validated by prior code reading but weren't in the Oracle snapshot.

Key hot-path issues confirmed in code:

  • EngineBuilder resolves turbo.json configs serially per workspace inside for workspace in self.workspaces.iter()
  • Env wildcard matching recompiles regexes in wildcard_map_from_wildcards() every call, even when include/exclude lists are empty
  • TaskHashTracker uses a single RwLock for 5 independent HashMaps, serializing parallel rayon task-hash computation
  • Visitor hot path deep-clones EnvironmentVariableMap per task and clones TaskId 5+ times per task
  • GlobWatcher Hash/Eq allocate fresh BTreeSet/HashSet per file event in watch mode
  • Loose env mode deep-clones the entire process environment per task (EnvMode::Loose => Ok(self.env_at_execution_start.clone()))
  • Package graph builder clones PackageNode when adding it to the graph and allocates its HashMaps without capacity hints

Required Changes

Recommended Implementation Order (highest ROI first)

| Priority | Change | Effort | Risk | Expected Impact |
|---|---|---|---|---|
| 1 | #11 GlobWatcher hash/eq: store include_raw: BTreeSet<String> | Low | Low | Eliminates per-event allocations in watch mode |
| 2 | #16 Skip regex compilation for empty pattern lists | Low | Low | Saves wasted regex builds per task |
| 3 | #18 Remove raw_excludes.clone() in GlobSet::from_raw | Low | Low | Free allocation removal |
| 4 | #8 Remove unnecessary Arc<Mutex> in precompute_task_hashes | Low | Low | Eliminates mutex overhead in hash precompute |
| 5 | #12 Switch loose env to Arc<EnvironmentVariableMap> | Medium | Medium | Removes hundreds of thousands of string clones across 500-2000 tasks |
| 6 | #7 Replace single RwLock with DashMap per field | Medium | Medium | 10-40% faster hash phase on multi-core |
| 7 | #6 Extend compiled wildcards + cache for hashable env | Medium | Medium | Removes ~2000 regex compilations per run |
| 8 | #1 Parallelize workspace task resolution with rayon | Medium | Medium | 100-500ms saved on cold startup for 100-300 workspace repos |
| 9 | #14 Avoid cloning PackageNode, pre-size HashMaps | Medium | Low | Tens of ms saved on cold startup |
| 10 | #3 Compiled glob pattern LRU cache | Medium | Low | Hundreds of redundant regex compilations removed |
| 11 | #17 Env hashing: avoid format!("k=v") allocations | Medium | Medium/High | Tens of ms + lower RSS |
| 12 | #2 Multi-threaded zstd compression | Medium | Medium | 1.5-4x faster archive compression |
| 13 | #13 Cache multiplexer: stop double I/O on remote hits | High | High | 30-60% faster remote cache hits |
| 14 | #5 Lockfile transitive closure memoization | High | High | 100-800ms off startup for large lockfiles |
| 15 | #4 SCM package hash allocation reduction | High | High | 10-30% CPU time saved in manual hashing |
| 16 | #9 Change mapper: prefix index instead of O(n*m) | Medium | Medium | Seconds to hundreds of ms in watch mode |
| 17 | #10 Discovery: buffer_unordered(64) instead of .then() | Low | Low | 100s of ms saved on cold startup |
| 18 | #15 HTTP cache upload: use Bytes instead of Vec<u8> clone | Low | Low | Eliminates multi-MB memcpy per retry |

Key Implementation Diffs

#11 (GlobWatcher) — store canonical key set once:

 pub struct GlobSet {
     include: HashMap<String, wax::Glob<'static>>,
     exclude: Any<'static>,
     exclude_raw: BTreeSet<String>,
+    include_raw: BTreeSet<String>,
 }
 impl PartialEq for GlobSet {
     fn eq(&self, other: &Self) -> bool {
-        self.include.keys().collect::<HashSet<_>>() == other.include.keys().collect::<HashSet<_>>()
+        self.include_raw == other.include_raw && self.exclude_raw == other.exclude_raw
     }
 }
 impl Hash for GlobSet {
     fn hash<H: Hasher>(&self, state: &mut H) {
-        let include_raw: BTreeSet<_> = BTreeSet::from_iter(self.include.keys());
-        include_raw.hash(state);
+        self.include_raw.hash(state);
         self.exclude_raw.hash(state);
     }
 }

#8 (Visitor) — remove unnecessary Arc<Mutex> in precompute:

- let results: Arc<Mutex<HashMap<...>>> = Arc::new(Mutex::new(HashMap::new()));
+ let mut results: HashMap<...> = HashMap::with_capacity(sorted.len());
  // After rayon wave:
- let mut results = results.lock().map_err(|_| Error::Mutex)?;
- results.insert(task_id, (hash, env));
+ results.insert(task_id, (hash, env));

#12 (Loose env) — share via Arc:

- EnvMode::Loose => Ok(self.env_at_execution_start.clone()),
+ EnvMode::Loose => Ok(Arc::clone(&self.loose_env)),

#7 (TaskHashTracker) — sharded concurrent maps:

struct HashRecord {
    hash: Arc<str>,
    env_vars: DetailedMap,
    framework: Option<FrameworkSlug>,
}
pub struct TaskHashTracker {
    hot: DashMap<TaskId<'static>, HashRecord>,
    outputs: DashMap<TaskId<'static>, Vec<AnchoredSystemPathBuf>>,
    cache: DashMap<TaskId<'static>, CacheHitMetadata>,
    expanded: DashMap<TaskId<'static>, Arc<FileHashes>>,
}

#1 (Engine Builder) — parallelize with rayon:

- for workspace in self.workspaces.iter() {
-     let workspace_tasks = TaskInheritanceResolver::new(turbo_json_loader).resolve(workspace)?;
-     tasks_set.extend(workspace_tasks);
- }
+ let resolved: Vec<Result<Vec<TaskName<'static>>, BuilderError>> =
+     self.workspaces.par_iter()
+         .map(|ws| TaskInheritanceResolver::new(turbo_json_loader).resolve(ws))
+         .collect();
+ for r in resolved { tasks_set.extend(r?); }

Verification Framework (applies to ALL changes)

Micro-benchmarks:

# Per-crate criterion benchmarks
cargo bench -p turborepo-engine --bench engine_builder
cargo bench -p turborepo-task-hash --bench task_hash_tracker
cargo bench -p turborepo-env --bench wildcard_env
cargo bench -p turborepo-filewatch --bench globset_hash
cargo bench -p turborepo-repository --bench package_graph_build

End-to-end profiling:

turbo run build --profile --no-cache   # Compare phase times
cargo flamegraph -p turborepo-cli -- turbo run build
RUST_LOG=turborepo_engine=trace,turborepo_task_hash=trace turbo run build

Memory profiling:

# dhat for allocation counts in hot loops
# heaptrack on Linux for end-to-end allocation regression

Regression testing:

cargo test -p turborepo-engine
cargo test -p turborepo-env
cargo test -p turborepo-task-hash
cargo test -p turborepo-filewatch
cargo test -p turborepo-repository
# Plus full integration/e2e suite

Ripple Effects

  • #12 + #8 are tightly coupled: If you switch to Arc<EnvironmentVariableMap> for loose env, you must also change the visitor precompute map to store Arc values to avoid reintroducing deep clones downstream
  • #7 breaks snapshot semantics: Code expecting a single lock to imply "atomic snapshot" needs a snapshot() method to materialize TaskHashTrackerState for diagnostics
  • #6 introduces caching: Must cap caches to prevent unbounded memory growth in long-lived watch sessions
  • #1 increases filesystem concurrency: Can amplify FD pressure on large repos; use bounded thread pools or par_chunks
  • #17 format compatibility: If env hash format must match the Go implementation exactly, "k=v" format must be preserved (but preallocate Vec::with_capacity)

What This Unlocks

  • Faster cold start graph construction (engine build + discovery) — noticeably snappier turbo run on large repos
  • Parallel task-hash throughput that actually scales — --concurrency works as expected instead of stalling on shared locks
  • Dramatically lower allocation pressure in env handling (loose mode), visitor execution loop, and watch mode globset hashing
  • Cleaner primitives (compiled patterns, sharded trackers, Arc-wrapped environments) that make future optimizations like incremental hashing and smarter affected detection much easier to land
  • Cache pipeline improvements (streaming, multi-threaded compression, zero-copy uploads) that cut wall-clock time on every cache-miss and remote-hit scenario

Effort & Risk

Start here (1-2 days each, low risk): #11, #16, #18, #8, #10, #15 — pure refactors with clear before/after benchmarks

Next wave (2-5 days each, medium risk): #12, #7, #6, #1, #14, #3 — require API signature changes and careful testing but deliver the biggest wins

Final wave (1-2 weeks each, higher risk): #2, #13, #5, #4, #9 — involve cache correctness, cross-platform concerns, and lockfile semantics

Recommended sequence for maximum ROI: 11 → 16 → 8 → 12 → 7 → 6 → 1 → 14, then tackle the cache pipeline and lockfile/change-mapper work once the core hot loops stop allocating/locking themselves to death.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. Key findings: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs Maturity: stable Relevance: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. Key findings: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.
  • task-graph-visitor-allocations Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs, crates/turborepo-lib/src/task_graph/visitor/exec.rs Maturity: stable Relevance: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run. Key findings: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run.
  • change-mapper-linear-package-detection Files: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs Maturity: stable Relevance: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages Key findings: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages
  • discovery-sequential-turbo-json-stat Files: crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run. Key findings: I confirmed the key finding by reading both files. The strongest candidate is in discovery.rs: At line 157, LocalPackageDiscovery::discover_packages uses tokio_stream::iter(...).then(...) which processes each tokio::fs::try_exists() call sequentially. .then() awaits each future before starting the next. For independent filesystem stat calls across hundreds of packages, this serializes I/O that should be concurrent. The file even has the tokio async machinery in place — it just needs .map() + .buffer_unordered(N) instead of .then().
  • globwatcher-globset-hash-eq-allocs Files: crates/turborepo-filewatch/src/globwatcher.rs, crates/turborepo-filewatch/src/hash_watcher.rs Maturity: stable Relevance: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos. Key findings: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos.
  • task-hash-loose-env-clone Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/run/watch.rs, crates/turborepo-lib/src/run/mod.rs Maturity: stable Relevance: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate. Key findings: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate.
  • cache-multiplexer-redundant-restore-io Files: crates/turborepo-cache/src/multiplexer.rs, crates/turborepo-cache/src/http.rs, crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/restore.rs, crates/turborepo-cache/src/cache_archive/create.rs Maturity: stable Relevance: Remote cache hits trigger double I/O — files are extracted to disk via http.fetch(), then re-read and re-compressed by fs.put() to populate local cache, making every remote hit ~2-3x more expensive than necessary. Key findings: Remote cache hits trigger double I/O — files are extracted to disk via http.fetch(), then re-read and re-compressed by fs.put() to populate local cache, making every remote hit ~2-3x more expensive than necessary.
  • package-graph-builder-alloc-reduction Files: crates/turborepo-repository/src/package_graph/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs Maturity: stable Relevance: Package graph builder is on the critical startup path of every turbo run invocation and has multiple HashMaps allocated without capacity, redundant PackageName/string clones per workspace, and repeated to_string() path conversions that compound with monorepo size. Key findings: Package graph builder is on the critical startup path of every turbo run invocation and has multiple HashMaps allocated without capacity, redundant PackageName/string clones per workspace, and repeated to_string() path conversions that compound with monorepo size.
  • http-cache-upload-redundant-clones Files: crates/turborepo-cache/src/http.rs, crates/turborepo-api-client/src/lib.rs Maturity: stable Relevance: Multi-MB artifact bodies are cloned twice during token-refresh retry and file-path vectors are unnecessarily collected, multiplying memory pressure on the critical remote-cache upload path. Key findings: Multi-MB artifact bodies are cloned twice during token-refresh retry and file-path vectors are unnecessarily collected, multiplying memory pressure on the critical remote-cache upload path.

Exploration

AREA: http-cache-upload-redundant-clones FILES: crates/turborepo-cache/src/http.rs, crates/turborepo-api-client/src/lib.rs MATURITY: stable RELEVANCE: Multi-MB artifact bodies are cloned twice during token-refresh retry and file-path vectors are unnecessarily collected, multiplying memory pressure on the critical remote-cache upload path. NOTES: (unspecified)

Selected Code Context

===== FILE: crates/turborepo-cache/src/http.rs =====
use std::{
    backtrace::Backtrace,
    collections::HashMap,
    io::{Cursor, Write},
    sync::{Arc, Mutex},
};

use tokio_stream::StreamExt;
use tracing::{debug, warn};
use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf, AnchoredSystemPathBuf};
use turborepo_analytics::AnalyticsSender;
use turborepo_api_client::{
    APIAuth, APIClient, CacheClient, Response,
    analytics::{self, AnalyticsEvent},
};

use crate::{
    CacheError, CacheHitMetadata, CacheOpts, CacheSource,
    cache_archive::{CacheReader, CacheWriter},
    signature_authentication::ArtifactSignatureAuthenticator,
    upload_progress::{UploadProgress, UploadProgressQuery},
};

pub type UploadMap = HashMap<String, UploadProgressQuery<10, 100>>;

pub struct HTTPCache {
    client: APIClient,
    signer_verifier: Option<ArtifactSignatureAuthenticator>,
    repo_root: AbsoluteSystemPathBuf,
    api_auth: Arc<Mutex<APIAuth>>,
    analytics_recorder: Option<AnalyticsSender>,
    uploads: Arc<Mutex<UploadMap>>,
}

impl HTTPCache {

#[tracing::instrument(skip_all)]
pub fn new(
    client: APIClient,
    opts: &CacheOpts,
    repo_root: AbsoluteSystemPathBuf,
    api_auth: APIAuth,
    analytics_recorder: Option<AnalyticsSender>,
) -> HTTPCache {
    let signer_verifier = if opts
        .remote_cache_opts
        .as_ref()
        .is_some_and(|remote_cache_opts| remote_cache_opts.signature)
    {
        Some(ArtifactSignatureAuthenticator {
            team_id: api_auth
                .team_id
                .as_deref()
                .unwrap_or_default()
                .as_bytes()
                .to_vec(),
            secret_key_override: None,
        })
    } else {
        None
    };

    HTTPCache {
        client,
        signer_verifier,
        repo_root,
        uploads: Arc::new(Mutex::new(HashMap::new())),
        api_auth: Arc::new(Mutex::new(api_auth)),
        analytics_recorder,
    }
}

/// Attempts to refresh the auth token when a cache operation encounters a
/// 403 forbidden error. Returns true if the token was successfully
/// refreshed, false otherwise.
async fn try_refresh_token(&self) -> bool {
    match turborepo_auth::get_token_with_refresh().await {
        Ok(Some(new_token)) => {
            // Update the API auth with the new token
            if let Ok(mut auth) = self.api_auth.lock() {
                auth.token = new_token;
                debug!("Successfully refreshed auth token for cache operations");
                true
            } else {
                warn!("Failed to acquire lock for updating auth token");
                false
            }
        }
        Ok(None) => {
            debug!("No refresh token available or token doesn't support refresh");
            false
        }
        Err(e) => {
            warn!("Failed to refresh token: {:?}", e);
            false
        }
    }
}

/// Helper method to execute a cache operation with automatic token refresh
/// on 403 errors.
async fn execute_with_token_refresh<T, F, Fut>(
    &self,
    hash: &str,
    operation: F,
) -> Result<T, CacheError>
where
    F: Fn(APIAuth) -> Fut,
    Fut: std::future::Future<Output = Result<T, turborepo_api_client::Error>>,
{
    // Try the operation with the current token
    let api_auth = self.api_auth.lock().unwrap().clone();
    match operation(api_auth.clone()).await {
        Ok(result) => Ok(result),
        Err(turborepo_api_client::Error::UnknownStatus { code, .. }) if code == "forbidden" => {
            // Try to refresh the token
            if self.try_refresh_token().await {
                // Retry the operation with the refreshed token
                let refreshed_auth = self.api_auth.lock().unwrap().clone();
                operation(refreshed_auth)
                    .await
                    .map_err(|err| Self::convert_api_error(hash, err))
            } else {
                // Token refresh failed, return the original error
                Err(CacheError::ForbiddenRemoteCacheWrite)
            }
        }
        Err(e) => Err(Self::convert_api_error(hash, e)),
    }
}

#[tracing::instrument(skip_all)]
pub async fn put(
    &self,
    anchor: &AbsoluteSystemPath,
    hash: &str,
    files: &[AnchoredSystemPathBuf],
    duration: u64,
) -> Result<(), CacheError> {
    let mut artifact_body = Vec::new();
    self.write(&mut artifact_body, anchor, files).await?;
    let bytes = artifact_body.len();

    let tag = self
        .signer_verifier
        .as_ref()
        .map(|signer| signer.generate_tag(hash.as_bytes(), &artifact_body))
        .transpose()?;

    tracing::debug!("uploading {}", hash);

    // Use the helper method to handle token refresh on 403 errors
    let artifact_body_clone = artifact_body.clone(); // Store the artifact body for retry
    let tag_clone = tag.clone();
    let uploads_clone = self.uploads.clone();

    self.execute_with_token_refresh(hash, |api_auth| {
        let client = &self.client;
        let tag_ref = tag_clone.as_deref();
        let artifact_body_ref = artifact_body_clone.clone();
        let uploads_ref = uploads_clone.clone();

        async move {
            // Create the stream inside the closure so it can be used for retry
            let stream = tokio_util::codec::FramedRead::new(
                Cursor::new(artifact_body_ref),
                tokio_util::codec::BytesCodec::new(),
            )
            .map(|res| {
                res.map(|bytes| bytes.freeze())
                    .map_err(turborepo_api_client::Error::from)
            });

            let (progress, query) = UploadProgress::<10, 100, _>::new(stream, Some(bytes));

            {
                let mut uploads = uploads_ref.lock().unwrap();
                uploads.insert(hash.to_string(), query);
            }

            client
                .put_artifact(
                    hash,
                    progress,
                    bytes,
                    duration,
                    tag_ref,
                    &api_auth.token,
                    api_auth.team_id.as_deref(),
                    api_auth.team_slug.as_deref(),
                )
                .await
        }
    })
    .await?;

    tracing::debug!("uploaded {}", hash);
    Ok(())
}

#[tracing::instrument(skip_all)]
async fn write(
    &self,
    writer: impl Write,
    anchor: &AbsoluteSystemPath,
    files: &[AnchoredSystemPathBuf],
) -> Result<(), CacheError> {
    let mut cache_archive = CacheWriter::from_writer(writer, true)?;
    for file in files {
        cache_archive.add_file(anchor, file)?;
    }

    Ok(())
}

#[tracing::instrument(skip_all)]
pub async fn exists(&self, hash: &str) -> Result<Option<CacheHitMetadata>, CacheError> {
    let response = self
        .execute_with_token_refresh(hash, |api_auth| {
            let client = &self.client;
            async move {
                client
                    .artifact_exists(
                        hash,
                        &api_auth.token,
                        api_auth.team_id.as_deref(),
                        api_auth.team_slug.as_deref(),
                    )
                    .await
            }
        })
        .await?;

    let Some(response) = response else {
        return Ok(None);
    };

    let duration = Self::get_duration_from_response(&response)?;

    Ok(Some(CacheHitMetadata {
        source: CacheSource::Remote,
        time_saved: duration,
    }))
}

fn get_duration_from_response(response: &Response) -> Result<u64, CacheError> {
    if let Some(duration_value) = response.headers().get("x-artifact-duration") {
        let duration = duration_value
            .to_str()
            .map_err(|_| CacheError::InvalidDuration(Backtrace::capture()))?;

        duration
            .parse::<u64>()
            .map_err(|_| CacheError::InvalidDuration(Backtrace::capture()))
    } else {
        Ok(0)
    }
}

fn log_fetch(&self, event: analytics::CacheEvent, hash: &str, duration: u64) {
    // If analytics fails to record, it's not worth failing the cache
    if let Some(analytics_recorder) = &self.analytics_recorder {
        let analytics_event = AnalyticsEvent {
            session_id: None,
            source: analytics::CacheSource::Remote,
            event,
            hash: hash.to_string(),
            duration,
        };
        debug!("logging fetch: {analytics_event:?}");
        let _ = analytics_recorder.send(analytics_event);
    }
}

#[tracing::instrument(skip_all)]
pub async fn fetch(
    &self,
    hash: &str,
) -> Result<Option<(CacheHitMetadata, Vec<AnchoredSystemPathBuf>)>, CacheError> {
    let response = self
        .execute_with_token_refresh(hash, |api_auth| {
            let client = &self.client;
            async move {
                client
                    .fetch_artifact(
                        hash,
                        &api_auth.token,
                        api_auth.team_id.as_deref(),
                        api_auth.team_slug.as_deref(),
                    )
                    .await
            }
        })
        .await?;

    let Some(response) = response else {
        self.log_fetch(analytics::CacheEvent::Miss, hash, 0);
        return Ok(None);
    };

    let duration = Self::get_duration_from_response(&response)?;

    let body = if let Some(signer_verifier) = &self.signer_verifier {
        let expected_tag = response
            .headers()
            .get("x-artifact-tag")
            .ok_or(CacheError::ArtifactTagMissing(Backtrace::capture()))?;

        let expected_tag = expected_tag
            .to_str()
            .map_err(|_| CacheError::InvalidTag(Backtrace::capture()))?
            .to_string();

        let body = response.bytes().await.map_err(|e| {
            CacheError::ApiClientError(
                Box::new(turborepo_api_client::Error::ReqwestError(e)),
                Backtrace::capture(),
            )
        })?;
        let is_valid = signer_verifier.validate(hash.as_bytes(), &body, &expected_tag)?;

        if !is_valid {
            return Err(CacheError::InvalidTag(Backtrace::capture()));
        }

        body
    } else {
        response.bytes().await.map_err(|e| {
            CacheError::ApiClientError(
                Box::new(turborepo_api_client::Error::ReqwestError(e)),
                Backtrace::capture(),
            )
        })?
    };

    let files = Self::restore_tar(&self.repo_root, &body)?;

    self.log_fetch(analytics::CacheEvent::Hit, hash, duration);
    Ok(Some((
        CacheHitMetadata {
            source: CacheSource::Remote,
            time_saved: duration,
        },
        files,
    )))
}

pub fn requests(&self) -> Arc<Mutex<UploadMap>> {
    self.uploads.clone()
}

#[tracing::instrument(skip_all)]
pub(crate) fn restore_tar(
    root: &AbsoluteSystemPath,
    body: &[u8],
) -> Result<Vec<AnchoredSystemPathBuf>, CacheError> {
    let mut cache_reader = CacheReader::from_reader(body, true)?;
    cache_reader.restore(root)
}

fn convert_api_error(hash: &str, err: turborepo_api_client::Error) -> CacheError {
    match err {
        turborepo_api_client::Error::ReqwestError(e) if e.is_timeout() => {
            CacheError::TimeoutError(hash.to_string())
        }
        turborepo_api_client::Error::ReqwestError(e) if e.is_connect() => {
            CacheError::ConnectError
        }
        turborepo_api_client::Error::UnknownStatus { code, .. } if code == "forbidden" => {
            CacheError::ForbiddenRemoteCacheWrite
        }
        e => e.into(),
    }
}

}

#[cfg(test)]
mod test {

use std::{backtrace::Backtrace, time::Duration};

use anyhow::Result;
use futures::future::try_join_all;
use insta::assert_snapshot;
use tempfile::tempdir;
use turbopath::AbsoluteSystemPathBuf;
use turborepo_analytics::start_analytics;
use turborepo_api_client::{APIClient, SecretString, analytics};
use turborepo_vercel_api_mock::start_test_server;

use crate::{
    CacheOpts, CacheSource,
    http::{APIAuth, HTTPCache},
    test_cases::{TestCase, get_test_cases, validate_analytics},
};

#[tokio::test]
async fn test_http_cache() -> Result<()> {
    let port = port_scanner::request_open_port().unwrap();
    let (ready_tx, ready_rx) = tokio::sync::oneshot::channel();
    let handle = tokio::spawn(start_test_server(port, Some(ready_tx)));

    // Wait for the server to be ready (with timeout)
    tokio::time::timeout(Duration::from_secs(5), ready_rx)
        .await
        .map_err(|_| anyhow::anyhow!("Test server failed to start within timeout"))??;

    let test_cases = get_test_cases();

    try_join_all(
        test_cases
            .iter()
            .map(|test_case| round_trip_test(test_case, port)),
    )
    .await?;

    validate_analytics(&test_cases, analytics::CacheSource::Remote, port).await?;
    handle.abort();
    Ok(())
}

async fn round_trip_test(test_case: &TestCase, port: u16) -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path())?;
    test_case.initialize(&repo_root_path)?;

    let hash = test_case.hash;
    let files = &test_case.files;
    let duration = test_case.duration;

    let api_client = APIClient::new(
        format!("http://localhost:{port}"),
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        true,
    )?;
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };
    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("my-token".to_string()),
        team_slug: None,
    };
    let (analytics_recorder, analytics_handle) =
        start_analytics(api_auth.clone(), api_client.clone());

    let cache = HTTPCache::new(
        api_client,
        &opts,
        repo_root_path.to_owned(),
        api_auth,
        Some(analytics_recorder),
    );

    // Should be a cache miss at first
    let miss = cache.fetch(hash).await?;
    assert!(miss.is_none());

    let anchored_files: Vec<_> = files.iter().map(|f| f.path().to_owned()).collect();
    cache
        .put(&repo_root_path, hash, &anchored_files, duration)
        .await?;

    let cache_response = cache.exists(hash).await?.unwrap();

    assert_eq!(cache_response.time_saved, duration);
    assert_eq!(cache_response.source, CacheSource::Remote);

    let (cache_response, received_files) = cache.fetch(hash).await?.unwrap();

    assert_eq!(cache_response.time_saved, duration);

    for (test_file, received_file) in files.iter().zip(received_files) {
        assert_eq!(&*received_file, test_file.path());
        let file_path = repo_root_path.resolve(&received_file);
        if let Some(contents) = test_file.contents() {
            assert_eq!(std::fs::read_to_string(file_path)?, contents);
        } else {
            assert!(file_path.exists());
        }
    }

    analytics_handle.close_with_timeout().await;

    Ok(())
}

#[test]
fn test_forbidden_error() {
    let err = HTTPCache::convert_api_error(
        "hash",
        turborepo_api_client::Error::UnknownStatus {
            code: "forbidden".into(),
            message: "Not authorized".into(),
            backtrace: Backtrace::capture(),
        },
    );
    assert_snapshot!(err.to_string(), @"Insufficient permissions to write to remote cache. Please verify that your role has write access for Remote Cache Artifact at https://vercel.com/docs/accounts/team-members-and-roles/access-roles/team-level-roles?resource=Remote+Cache+Artifact");
}

#[test]
fn test_unknown_status() {
    let err = HTTPCache::convert_api_error(
        "hash",
        turborepo_api_client::Error::UnknownStatus {
            code: "unknown".into(),
            message: "Special message".into(),
            backtrace: Backtrace::capture(),
        },
    );
    assert_snapshot!(err.to_string(), @"failed to contact remote cache: Unknown status unknown: Special message");
}

#[test]
fn test_cache_disabled() {
    let err = HTTPCache::convert_api_error(
        "hash",
        turborepo_api_client::Error::CacheDisabled {
            status: turborepo_vercel_api::CachingStatus::Disabled,
            message: "Cache disabled".into(),
        },
    );
    assert_snapshot!(err.to_string(), @"failed to contact remote cache: Cache disabled");
}

#[tokio::test]
async fn test_token_refresh_on_403() {
    // This test verifies that the HTTPCache can handle token refresh when
    // encountering 403 errors. Note: This is an integration test that would
    // need a mock server setup to fully verify the token refresh flow, but
    // the logic structure is tested through the build validation.
    let repo_root = tempfile::tempdir().unwrap();
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path()).unwrap();

    let api_client = APIClient::new(
        "http://localhost:8000",
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        false,
    )
    .unwrap();
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };

    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("expired-token".to_string()),
        team_slug: None,
    };

    let cache = HTTPCache::new(api_client, &opts, repo_root_path, api_auth, None);

    // Verify that the cache has the token refresh capability
    // The actual token refresh would be tested in integration tests with a proper
    // mock server. The vca_ prefix check is now handled in the auth layer.
    // The result depends on whether there are any tokens available in the system
    //
    // The result can be true or false depending on system state, but the method
    // should not panic. The test will fail if it does.
    cache.try_refresh_token().await;
}

#[tokio::test]
async fn test_cache_token_update_after_refresh() {
    // Test that the cache properly updates its internal token after a successful
    // refresh
    let repo_root = tempfile::tempdir().unwrap();
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path()).unwrap();

    let api_client = APIClient::new(
        "http://localhost:8000",
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        false,
    )
    .unwrap();
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };

    let initial_api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("initial-token".to_string()),
        team_slug: None,
    };

    let cache = HTTPCache::new(api_client, &opts, repo_root_path, initial_api_auth, None);

    // Verify initial token
    let initial_auth = cache.api_auth.lock().unwrap().clone();
    assert_eq!(initial_auth.token.expose(), "initial-token");

    // Test the token refresh mechanism (without actual HTTP call)
    // In a real scenario, try_refresh_token would call
    // turborepo_auth::get_token_with_refresh and update the internal token
    // if successful
    let refresh_result = cache.try_refresh_token().await;

    // The result depends on system state - could be true or false
    let final_auth = cache.api_auth.lock().unwrap().clone();

    if refresh_result {
        // If refresh succeeded, token should have been updated
        assert_ne!(final_auth.token.expose(), "initial-token");
    } else {
        // If refresh failed, token should remain unchanged
        assert_eq!(final_auth.token.expose(), "initial-token");
    }
}

#[test]
fn test_cache_auth_mutex_thread_safety() {
    // Test that the Arc<Mutex<APIAuth>> is properly thread-safe
    use std::{sync::Arc, thread};

    let repo_root = tempfile::tempdir().unwrap();
    let repo_root_path = AbsoluteSystemPathBuf::try_from(repo_root.path()).unwrap();

    let api_client = APIClient::new(
        "http://localhost:8000",
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        false,
    )
    .unwrap();
    let opts = CacheOpts {
        cache_dir: ".turbo/cache".into(),
        cache: Default::default(),
        workers: 0,
        remote_cache_opts: None,
    };

    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("thread-test-token".to_string()),
        team_slug: None,
    };

    let cache = Arc::new(HTTPCache::new(
        api_client,
        &opts,
        repo_root_path,
        api_auth,
        None,
    ));

    // Test concurrent access to the auth mutex
    let handles: Vec<_> = (0..5)
        .map(|i| {
            let cache_clone = Arc::clone(&cache);
            thread::spawn(move || {
                let auth = cache_clone.api_auth.lock().unwrap();
                assert_eq!(auth.token.expose(), "thread-test-token");
                assert_eq!(auth.team_id, Some("my-team".to_string()));
                // Simulate some work
                thread::sleep(std::time::Duration::from_millis(10));
                format!("thread-{i}")
            })
        })
        .collect();

    // Wait for all threads to complete
    for handle in handles {
        let result = handle.join().unwrap();
        assert!(result.starts_with("thread-"));
    }
}

}
===== END FILE: crates/turborepo-cache/src/http.rs =====

===== FILE: crates/turborepo-api-client/src/lib.rs =====
//! HTTP client for interacting with the Remote Cache API.
//! Provides authentication, caching, and telemetry endpoints for Remote Cache
//! operations. By default configured for Vercel API

#![feature(error_generic_member_access)]
#![feature(assert_matches)]
// miette's derive macro causes false positives for this lint
#![allow(unused_assignments)]
#![deny(clippy::all)]

use std::{backtrace::Backtrace, env, future::Future, sync::LazyLock, time::Duration};

use regex::Regex;
pub use reqwest::Response;
use reqwest::{Body, Method, RequestBuilder, StatusCode};
use serde::Deserialize;
use turborepo_ci::{Vendor, is_ci};
pub use turborepo_types::SecretString;
use turborepo_vercel_api::{
    APIError, CachingStatus, CachingStatusResponse, PreflightResponse, Team, TeamsResponse,
    UserResponse, VerificationResponse, VerifiedSsoUser, token::ResponseTokenMetadata,
};
use url::Url;

pub use crate::error::{Error, Result};

pub mod analytics;
mod error;
mod retry;
pub mod telemetry;

pub use bytes::Bytes;
pub use tokio_stream::Stream;

static AUTHORIZATION_REGEX: LazyLock<Regex> =
    LazyLock::new(|| Regex::new(r"(?i)(?:^|,) *authorization *(?:,|$)").unwrap());

pub trait Client {
    fn get_user(&self, token: &SecretString) -> impl Future<Output = Result<UserResponse>> + Send;
    fn get_teams(&self, token: &SecretString) -> impl Future<Output = Result<TeamsResponse>> + Send;
    fn get_team(
        &self,
        token: &SecretString,
        team_id: &str,
    ) -> impl Future<Output = Result<Option<Team>>> + Send;
    fn add_ci_header(request_builder: RequestBuilder) -> RequestBuilder;
    fn verify_sso_token(
        &self,
        token: &SecretString,
        token_name: &str,
    ) -> impl Future<Output = Result<VerifiedSsoUser>> + Send;
    fn handle_403(response: Response) -> impl Future<Output = Error> + Send;
    fn make_url(&self, endpoint: &str) -> Result<Url>;
}

pub trait CacheClient {
    fn get_artifact(
        &self,
        hash: &str,
        token: &SecretString,
        team_id: Option<&str>,
        team_slug: Option<&str>,
        method: Method,
    ) -> impl Future<Output = Result<Option<Response>>> + Send;
    fn fetch_artifact(
        &self,
        hash: &str,
        token: &SecretString,
        team_id: Option<&str>,
        team_slug: Option<&str>,
    ) -> impl Future<Output = Result<Option<Response>>> + Send;
    #[allow(clippy::too_many_arguments)]
    fn put_artifact(
        &self,
        hash: &str,
        artifact_body: impl tokio_stream::Stream<Item = Result<bytes::Bytes>> + Send + Sync + 'static,
        body_len: usize,
        duration: u64,
        tag: Option<&str>,
        token: &SecretString,
        team_id: Option<&str>,
        team_slug: Option<&str>,
    ) -> impl Future<Output = Result<()>> + Send;
    fn artifact_exists(
        &self,
        hash: &str,
        token: &SecretString,
        team_id: Option<&str>,
        team_slug: Option<&str>,
    ) -> impl Future<Output = Result<Option<Response>>> + Send;
    fn get_caching_status(
        &self,
        token: &SecretString,
        team_id: Option<&str>,
        team_slug: Option<&str>,
    ) -> impl Future<Output = Result<CachingStatusResponse>> + Send;
}

pub trait TokenClient {
    fn get_metadata(
        &self,
        token: &SecretString,
    ) -> impl Future<Output = Result<ResponseTokenMetadata>> + Send;
    fn delete_token(&self, token: &SecretString) -> impl Future<Output = Result<()>> + Send;
}

#[derive(Clone)]
pub struct APIClient {
    client: reqwest::Client,
    base_url: String,
    user_agent: String,
    use_preflight: bool,
    timeout: Option<Duration>,
    upload_timeout: Option<Duration>,
}

#[derive(Clone)]
pub struct APIAuth {
    pub team_id: Option<String>,
    pub token: SecretString,
    pub team_slug: Option<String>,
}

impl std::fmt::Debug for APIAuth {
    fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
        f.debug_struct("APIAuth")
            .field("team_id", &self.team_id)
            .field("token", &self.token)
            .field("team_slug", &self.team_slug)
            .finish()
    }
}

pub fn is_linked(api_auth: &Option<APIAuth>) -> bool {
    api_auth
        .as_ref()
        .is_some_and(|api_auth| api_auth.is_linked())
}

impl Client for APIClient {

#[tracing::instrument(skip_all)]
async fn get_user(&self, token: &SecretString) -> Result<UserResponse> {
    let url = self.make_url("/v2/user")?;
    let request_builder = self
        .api_request(Method::GET, url)
        .header("User-Agent", self.user_agent.clone())
        .bearer_auth(token.expose())
        .header("Content-Type", "application/json");
    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout)
            .await?
            .into_response()
            .error_for_status()?;

    Ok(response.json().await?)
}

#[tracing::instrument(skip_all)]
async fn get_teams(&self, token: &SecretString) -> Result<TeamsResponse> {
    let request_builder = self
        .api_request(Method::GET, self.make_url("/v2/teams?limit=100")?)
        .header("User-Agent", self.user_agent.clone())
        .header("Content-Type", "application/json")
        .bearer_auth(token.expose());

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout)
            .await?
            .into_response()
            .error_for_status()?;

    Ok(response.json().await?)
}

#[tracing::instrument(skip_all)]
async fn get_team(&self, token: &SecretString, team_id: &str) -> Result<Option<Team>> {
    let endpoint = format!("/v2/teams/{team_id}");
    let response = self
        .api_request(Method::GET, self.make_url(&endpoint)?)
        .header("User-Agent", self.user_agent.clone())
        .header("Content-Type", "application/json")
        .bearer_auth(token.expose())
        .send()
        .await?
        .error_for_status()?;

    Ok(response.json().await?)
}
fn add_ci_header(mut request_builder: RequestBuilder) -> RequestBuilder {
    if is_ci()
        && let Some(vendor_constant) = Vendor::get_constant()
    {
        request_builder = request_builder.header("x-artifact-client-ci", vendor_constant);
    }

    request_builder
}

#[tracing::instrument(skip_all)]
async fn verify_sso_token(
    &self,
    token: &SecretString,
    token_name: &str,
) -> Result<VerifiedSsoUser> {
    let request_builder = self
        .api_request(Method::GET, self.make_url("/registration/verify")?)
        .query(&[("token", token.expose()), ("tokenName", token_name)])
        .header("User-Agent", self.user_agent.clone());

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout)
            .await?
            .into_response()
            .error_for_status()?;

    let verification_response: VerificationResponse = response.json().await?;

    Ok(VerifiedSsoUser {
        token: verification_response.token,
        team_id: verification_response.team_id,
    })
}

async fn handle_403(response: Response) -> Error {
    #[derive(Deserialize)]
    struct WrappedAPIError {
        error: APIError,
    }
    let body = match response.text().await {
        Ok(body) => body,
        Err(e) => return Error::ReqwestError(e),
    };

    let WrappedAPIError { error: api_error } = match serde_json::from_str(&body) {
        Ok(api_error) => api_error,
        Err(err) => {
            return Error::InvalidJson {
                err,
                text: body.clone(),
            };
        }
    };

    if let Some(status_string) = api_error.code.strip_prefix("remote_caching_") {
        let status = match status_string {
            "disabled" => CachingStatus::Disabled,
            "enabled" => CachingStatus::Enabled,
            "over_limit" => CachingStatus::OverLimit,
            "paused" => CachingStatus::Paused,
            _ => {
                return Error::UnknownCachingStatus(
                    status_string.to_string(),
                    Backtrace::capture(),
                );
            }
        };

        Error::CacheDisabled {
            status,
            message: api_error.message,
        }
    } else {
        Error::UnknownStatus {
            code: api_error.code,
            message: api_error.message,
            backtrace: Backtrace::capture(),
        }
    }
}

fn make_url(&self, endpoint: &str) -> Result<Url> {
    let url = format!("{}{}", self.base_url, endpoint);
    Url::parse(&url).map_err(|err| Error::InvalidUrl { url, err })
}

}

impl CacheClient for APIClient {

#[tracing::instrument(skip_all)]
async fn get_artifact(
    &self,
    hash: &str,
    token: &SecretString,
    team_id: Option<&str>,
    team_slug: Option<&str>,
    method: Method,
) -> Result<Option<Response>> {
    let mut request_url = self.make_url(&format!("/v8/artifacts/{hash}"))?;
    let mut allow_auth = true;

    if self.use_preflight {
        let preflight_response = self
            .do_preflight(
                token,
                request_url.clone(),
                "GET",
                "Authorization, User-Agent",
            )
            .await?;

        allow_auth = preflight_response.allow_authorization_header;
        request_url = preflight_response.location;
    };

    let mut request_builder = self
        .api_request(method, request_url)
        .header("User-Agent", self.user_agent.clone());

    if allow_auth {
        request_builder = request_builder.bearer_auth(token.expose());
    }

    request_builder = Self::add_team_params(request_builder, team_id, team_slug);

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout).await?;
    let response = response.into_response();

    match response.status() {
        StatusCode::FORBIDDEN => Err(Self::handle_403(response).await),
        StatusCode::NOT_FOUND => Ok(None),
        _ => Ok(Some(response.error_for_status()?)),
    }
}

#[tracing::instrument(skip_all)]
async fn artifact_exists(
    &self,
    hash: &str,
    token: &SecretString,
    team_id: Option<&str>,
    team_slug: Option<&str>,
) -> Result<Option<Response>> {
    self.get_artifact(hash, token, team_id, team_slug, Method::HEAD)
        .await
}

#[tracing::instrument(skip_all)]
async fn fetch_artifact(
    &self,
    hash: &str,
    token: &SecretString,
    team_id: Option<&str>,
    team_slug: Option<&str>,
) -> Result<Option<Response>> {
    self.get_artifact(hash, token, team_id, team_slug, Method::GET)
        .await
}

#[tracing::instrument(skip_all)]
async fn put_artifact(
    &self,
    hash: &str,
    artifact_body: impl tokio_stream::Stream<Item = Result<bytes::Bytes>> + Send + Sync + 'static,
    body_length: usize,
    duration: u64,
    tag: Option<&str>,
    token: &SecretString,
    team_id: Option<&str>,
    team_slug: Option<&str>,
) -> Result<()> {
    let mut request_url = self.make_url(&format!("/v8/artifacts/{hash}"))?;
    let mut allow_auth = true;

    if self.use_preflight {
        let preflight_response = self
            .do_preflight(
                token,
                request_url.clone(),
                "PUT",
                "Authorization, Content-Type, User-Agent, x-artifact-duration, x-artifact-tag",
            )
            .await?;

        allow_auth = preflight_response.allow_authorization_header;
        request_url = preflight_response.location.clone();
    }

    let stream = Body::wrap_stream(artifact_body);

    let mut request_builder = self
        .upload_request(Method::PUT, request_url)
        .header("Content-Type", "application/octet-stream")
        .header("x-artifact-duration", duration.to_string())
        .header("User-Agent", self.user_agent.clone())
        .header("Content-Length", body_length)
        .body(stream);

    if allow_auth {
        request_builder = request_builder.bearer_auth(token.expose());
    }

    request_builder = Self::add_team_params(request_builder, team_id, team_slug);

    request_builder = Self::add_ci_header(request_builder);

    if let Some(tag) = tag {
        request_builder = request_builder.header("x-artifact-tag", tag);
    }

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Connection)
            .await?
            .into_response();

    if response.status() == StatusCode::FORBIDDEN {
        return Err(Self::handle_403(response).await);
    }

    response.error_for_status()?;
    Ok(())
}

#[tracing::instrument(skip_all)]
async fn get_caching_status(
    &self,
    token: &SecretString,
    team_id: Option<&str>,
    team_slug: Option<&str>,
) -> Result<CachingStatusResponse> {
    let request_builder = self
        .api_request(Method::GET, self.make_url("/v8/artifacts/status")?)
        .header("User-Agent", self.user_agent.clone())
        .header("Content-Type", "application/json")
        .bearer_auth(token.expose());

    let request_builder = Self::add_team_params(request_builder, team_id, team_slug);

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout)
            .await?
            .into_response()
            .error_for_status()?;

    Ok(response.json().await?)
}

}

impl TokenClient for APIClient {

#[tracing::instrument(skip_all)]
async fn get_metadata(&self, token: &SecretString) -> Result<ResponseTokenMetadata> {
    let endpoint = "/v5/user/tokens/current";
    let url = self.make_url(endpoint)?;
    let request_builder = self
        .api_request(Method::GET, url)
        .header("User-Agent", self.user_agent.clone())
        .bearer_auth(token.expose())
        .header("Content-Type", "application/json");

    #[derive(Deserialize, Debug)]
    struct Response {
        #[serde(rename = "token")]
        metadata: ResponseTokenMetadata,
    }
    #[derive(Deserialize, Debug)]
    struct ErrorResponse {
        error: ErrorDetails,
    }
    #[derive(Deserialize, Debug)]
    struct ErrorDetails {
        message: String,
        #[serde(rename = "invalidToken", default)]
        invalid_token: bool,
    }

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout).await?;
    let response = response.into_response();
    let status = response.status();
    // Give a better error message for invalid tokens. This endpoint returns the
    // following statuses:
    // 200: OK
    // 400: Bad Request
    // 403: Forbidden
    // 404: Not Found
    match status {
        StatusCode::OK => Ok(response.json::<Response>().await?.metadata),
        // If we're forbidden, check to see if the token is invalid. If so, give back a nice
        // error message.
        StatusCode::FORBIDDEN => {
            let body = response.json::<ErrorResponse>().await?;
            if body.error.invalid_token {
                return Err(Error::InvalidToken {
                    status: status.as_u16(),
                    // Call make_url again since url is moved.
                    url: self.make_url(endpoint)?.to_string(),
                    message: body.error.message,
                });
            }
            Err(Error::ForbiddenToken {
                url: self.make_url(endpoint)?.to_string(),
            })
        }
        _ => Err(response.error_for_status().unwrap_err().into()),
    }
}

/// Invalidates the given token on the server.
#[tracing::instrument(skip_all)]
async fn delete_token(&self, token: &SecretString) -> Result<()> {
    let endpoint = "/v3/user/tokens/current";
    let url = self.make_url(endpoint)?;
    let request_builder = self
        .api_request(Method::DELETE, url)
        .header("User-Agent", self.user_agent.clone())
        .bearer_auth(token.expose())
        .header("Content-Type", "application/json");

    #[derive(Deserialize, Debug)]
    struct ErrorResponse {
        error: ErrorDetails,
    }
    #[derive(Deserialize, Debug)]
    struct ErrorDetails {
        message: String,
        #[serde(rename = "invalidToken", default)]
        invalid_token: bool,
    }

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout)
            .await?
            .into_response();
    let status = response.status();
    // Give a better error message for invalid tokens. This endpoint returns the
    // following statuses:
    // 200: OK
    // 400: Bad Request
    // 403: Forbidden
    // 404: Not Found
    match status {
        StatusCode::OK => Ok(()),
        // If we're forbidden, check to see if the token is invalid. If so, give back a nice
        // error message.
        StatusCode::FORBIDDEN => {
            let body = response.json::<ErrorResponse>().await?;
            if body.error.invalid_token {
                return Err(Error::InvalidToken {
                    status: status.as_u16(),
                    // Call make_url again since url is moved.
                    url: self.make_url(endpoint)?.to_string(),
                    message: body.error.message,
                });
            }
            Err(Error::ForbiddenToken {
                url: self.make_url(endpoint)?.to_string(),
            })
        }
        _ => Err(response.error_for_status().unwrap_err().into()),
    }
}

}

impl APIClient {

/// Create a new APIClient.
///
/// # Arguments
/// base_url - The base URL for the API.
/// timeout - The timeout for requests.
/// upload_timeout - If specified, uploading files will use timeout for
///                  the connection, and upload_timeout for the total.
///                  Otherwise, timeout will be used for the total.
/// version - The version of the client.
/// use_preflight - If true, use the preflight API for all requests.
pub fn new(
    base_url: impl AsRef<str>,
    timeout: Option<Duration>,
    upload_timeout: Option<Duration>,
    version: &str,
    use_preflight: bool,
) -> Result<Self> {
    let client = Self::build_http_client(timeout)?;
    let user_agent = build_user_agent(version);
    Ok(APIClient {
        client,
        base_url: base_url.as_ref().to_string(),
        user_agent,
        use_preflight,
        timeout,
        upload_timeout,
    })
}

/// Creates an APIClient using a pre-built `reqwest::Client`, avoiding
/// redundant TLS initialization when a client already exists.
pub fn new_with_client(
    client: reqwest::Client,
    base_url: impl AsRef<str>,
    timeout: Option<Duration>,
    upload_timeout: Option<Duration>,
    version: &str,
    use_preflight: bool,
) -> Self {
    let user_agent = build_user_agent(version);
    APIClient {
        client,
        base_url: base_url.as_ref().to_string(),
        user_agent,
        use_preflight,
        timeout,
        upload_timeout,
    }
}

/// Builds a shared HTTP client with optional connect timeout. This is
/// the single TLS initialization point — all consumers should share the
/// resulting client.
#[tracing::instrument(skip_all)]
pub fn build_http_client(connect_timeout: Option<Duration>) -> Result<reqwest::Client> {
    let mut builder = reqwest::Client::builder();
    if let Some(dur) = connect_timeout {
        builder = builder.connect_timeout(dur);
    }
    builder.build().map_err(Error::TlsError)
}

pub fn base_url(&self) -> &str {
    self.base_url.as_str()
}

pub fn with_base_url(&mut self, base_url: String) {
    self.base_url = base_url;
}

/// Creates a request builder with the standard API timeout applied.
fn api_request(&self, method: Method, url: impl reqwest::IntoUrl) -> RequestBuilder {
    let builder = self.client.request(method, url);
    match self.timeout {
        Some(dur) => builder.timeout(dur),
        None => builder,
    }
}

/// Creates a request builder with upload timeout semantics:
/// connect_timeout is set on the shared client, and the total
/// request timeout uses upload_timeout (falling back to api timeout).
fn upload_request(&self, method: Method, url: impl reqwest::IntoUrl) -> RequestBuilder {
    let builder = self.client.request(method, url);
    let effective_timeout = self.upload_timeout.or(self.timeout);
    match effective_timeout {
        Some(dur) => builder.timeout(dur),
        None => builder,
    }
}

#[tracing::instrument(skip_all)]
async fn do_preflight(
    &self,
    token: &SecretString,
    request_url: Url,
    request_method: &str,
    request_headers: &str,
) -> Result<PreflightResponse> {
    let request_builder = self
        .api_request(Method::OPTIONS, request_url)
        .header("User-Agent", self.user_agent.clone())
        .header("Access-Control-Request-Method", request_method)
        .header("Access-Control-Request-Headers", request_headers)
        .bearer_auth(token.expose());

    let response =
        retry::make_retryable_request(request_builder, retry::RetryStrategy::Timeout)
            .await?
            .into_response();

    let headers = response.headers();
    let location = if let Some(location) = headers.get("Location") {
        let location = location.to_str()?;

        match Url::parse(location) {
            Ok(location_url) => location_url,
            Err(url::ParseError::RelativeUrlWithoutBase) => Url::parse(&self.base_url)
                .map_err(|err| Error::InvalidUrl {
                    url: self.base_url.clone(),
                    err,
                })?
                .join(location)
                .map_err(|err| Error::InvalidUrl {
                    url: location.to_string(),
                    err,
                })?,
            Err(e) => {
                return Err(Error::InvalidUrl {
                    url: location.to_string(),
                    err: e,
                });
            }
        }
    } else {
        response.url().clone()
    };

    let allowed_headers = headers
        .get("Access-Control-Allow-Headers")
        .map_or("", |h| h.to_str().unwrap_or(""));

    let allow_auth = AUTHORIZATION_REGEX.is_match(allowed_headers);

    Ok(PreflightResponse {
        location,
        allow_authorization_header: allow_auth,
    })
}
/// Create a new request builder with the preflight check done,
/// team parameters added, CI header, and a content type of json.
pub(crate) async fn create_request_builder(
    &self,
    url: &str,
    api_auth: &APIAuth,
    method: Method,
) -> Result<RequestBuilder> {
    let mut url = self.make_url(url)?;
    let mut allow_auth = true;

    let APIAuth {
        token,
        team_id,
        team_slug,
    } = api_auth;

    if self.use_preflight {
        let preflight_response = self
            .do_preflight(
                token,
                url.clone(),
                method.as_str(),
                "Authorization, User-Agent",
            )
            .await?;

        allow_auth = preflight_response.allow_authorization_header;
        url = preflight_response.location;
    }

    let mut request_builder = self
        .api_request(method, url)
        .header("Content-Type", "application/json");

    if allow_auth {
        request_builder = request_builder.bearer_auth(token.expose());
    }

    request_builder =
        Self::add_team_params(request_builder, team_id.as_deref(), team_slug.as_deref());

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Now I have comprehensive data from all sources. Let me compile the final report.


Current State

The turborepo Rust codebase has already landed several major perf wins in recent canary releases (2.8.11-canary.24–27):

  • Task hash parallelization (b3c0f46da8): Tasks now hash in parallel topological waves via rayon — 270ms → 53ms on large repos
  • Engine builder config caching (e145bc6b74, b79b680f53): turbo.json chain caching + parallel preloading — build_engine 283ms → 43ms
  • Engine callback reordering (4b5410b397): Tracker bookkeeping moved off the hot path
  • Shallow wildcard fast path (db01cb4490): Fast path for common glob patterns in workspace discovery
  • SCM allocation reduction (fc19b667aa): Stack-allocated hex buffers, eliminated redundant subprocess spawns

Five open optimization opportunities remain, ranked by expected ROI:


Required Changes

1. Task Env Regex Caching (HIGH — est. 10-20% of hash phase)

File: crates/turborepo-env/src/lib.rs:303-308
File: crates/turborepo-task-hash/src/lib.rs (calculate_task_hash)

wildcard_map_from_wildcards() calls RegexBuilder::new().build() on every invocation. On a 1700-task repo where most tasks share the same env patterns (e.g., ["NODE_ENV", "CI"]), this compiles the same regex 1700 times.

Fix: Cache compiled wildcards by pattern set at the TaskHasher level:

// In TaskHasher struct:
wildcard_cache: HashMap<Vec<String>, CompiledWildcards>,

// In calculate_task_hash(), replace raw from_wildcards() with:
let key = task_definition.env().to_vec();
let compiled = self.wildcard_cache.entry(key)
    .or_insert_with(|| CompiledWildcards::compile(task_definition.env()).unwrap());

Alternatively, extend from_wildcards to accept pre-compiled patterns, similar to how pass_through_env_compiled already works for the pass-through path.
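The cache-by-pattern-set idea is independent of the regex details. A minimal std-only sketch, using a hypothetical `Compiled` stand-in (the real `CompiledWildcards` constructor isn't shown here, so the compilation step is stubbed out):

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Stand-in for the real CompiledWildcards; the actual RegexBuilder-based
// compilation lives in turborepo-env and is elided here.
struct Compiled {
    patterns: Vec<String>,
}

#[derive(Default)]
struct WildcardCache {
    // Key is the task's env pattern list; most tasks share a handful of
    // distinct sets, so the map stays tiny even on 1700-task repos.
    by_pattern_set: HashMap<Vec<String>, Arc<Compiled>>,
}

impl WildcardCache {
    // Compiles at most once per distinct pattern set, no matter how many
    // tasks share it. Lookup uses &[String] via the Borrow impl on Vec.
    fn get_or_compile(&mut self, patterns: &[String]) -> Arc<Compiled> {
        if let Some(hit) = self.by_pattern_set.get(patterns) {
            return Arc::clone(hit);
        }
        let compiled = Arc::new(Compiled {
            patterns: patterns.to_vec(),
        });
        self.by_pattern_set
            .insert(patterns.to_vec(), Arc::clone(&compiled));
        compiled
    }
}
```

Returning `Arc<Compiled>` keeps lookups cheap for callers that need to hold the compiled set across the hash computation without re-borrowing the cache.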

Verify:

# No-op (fully cached) run on large monorepo — this is where regex overhead dominates
TURBO_LOG_VERBOSITY=debug turbo run build --dry 2>&1 | grep -i regex
# Profile with tracing spans around wildcard_map_from_wildcards
turbo run build --profile  # check flamegraph for RegexBuilder::build

2. TaskHashTracker RwLock Split (MEDIUM — est. 5-10% under high parallelism)

File: crates/turborepo-task-hash/src/lib.rs:226-241

Single Arc<RwLock<TaskHashTrackerState>> guards 6 independent HashMaps. With rayon parallelizing task hashes across topological waves, write contention on one map blocks reads from unrelated maps.

Fix: Replace the monolithic state with per-concern locks:

pub struct TaskHashTracker {
    hashes: DashMap<TaskId<'static>, Arc<str>>,
    env_vars: DashMap<TaskId<'static>, DetailedMap>,
    framework: DashMap<TaskId<'static>, FrameworkSlug>,
    outputs: DashMap<TaskId<'static>, Vec<AnchoredSystemPathBuf>>,
    cache: DashMap<TaskId<'static>, CacheHitMetadata>,
    inputs: DashMap<TaskId<'static>, Arc<FileHashes>>,
}

Or keep RwLock but split into 6 separate Arc<RwLock<HashMap<...>>> fields — simpler refactor, same contention benefit.
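A minimal sketch of that simpler split-lock variant, with `String` keys standing in for `TaskId<'static>` and only two of the six maps shown (field names follow the DashMap sketch above and are assumptions, not the crate's actual layout):

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};

// One lock per map instead of one lock over a struct holding every map:
// a writer updating `hashes` no longer blocks readers of `env_vars`.
#[derive(Default, Clone)]
struct SplitTracker {
    hashes: Arc<RwLock<HashMap<String, String>>>,
    env_vars: Arc<RwLock<HashMap<String, Vec<String>>>>,
}

impl SplitTracker {
    fn insert_hash(&self, task: &str, hash: &str) {
        // Only the `hashes` lock is taken here.
        self.hashes
            .write()
            .expect("hashes lock poisoned")
            .insert(task.to_string(), hash.to_string());
    }

    fn hash(&self, task: &str) -> Option<String> {
        self.hashes
            .read()
            .expect("hashes lock poisoned")
            .get(task)
            .cloned()
    }

    fn env_vars(&self, task: &str) -> Option<Vec<String>> {
        // Readers here never contend with insert_hash writers.
        self.env_vars
            .read()
            .expect("env_vars lock poisoned")
            .get(task)
            .cloned()
    }
}
```

Because every accessor already goes through tracker methods, the caller-facing API stays identical; only the lock granularity changes.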

Verify:

# Scale test: vary rayon threads and measure lock wait
RAYON_NUM_THREADS=1 turbo run build --profile  # baseline
RAYON_NUM_THREADS=8 turbo run build --profile  # should scale better after
RAYON_NUM_THREADS=32 turbo run build --profile # most visible improvement
# Compare "precompute_task_hashes" span duration across thread counts

3. Parallel zstd Compression (MEDIUM — est. 20-40% on cache-miss builds)

File: crates/turborepo-cache/src/cache_archive/create.rs:115,151

zstd::Encoder::new(writer, 0) uses single-threaded compression. Every cache-miss task blocks on sequential compression of its output archive.

Fix:

let mut encoder = zstd::Encoder::new(file_buffer, 0)?;
// TODO: only enable the worker pool for archives above a size threshold
// to avoid paying thread startup costs on tiny outputs (see gotchas below)
encoder.multithread(std::thread::available_parallelism().map_or(4, |n| n.get() as u32))?;
let zw = encoder.auto_finish();

Gotchas:

  • zstd MT output is not byte-identical to single-threaded — verify that cache consumers only check decoded content, not raw archive bytes
  • Thread pool overhead may hurt for tiny archives; consider a size threshold (e.g., only enable MT for archives > 1MB)
  • Test with RAYON_NUM_THREADS to avoid over-subscribing CPU when rayon is also running tasks
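One possible shape for the size-threshold gotcha, assuming the caller only invokes `encoder.multithread(n)` when `n > 0`; the 1 MiB cutoff is a placeholder to be tuned against real archive-size distributions:

```rust
use std::thread;

// Hypothetical policy: skip the zstd worker pool for small archives,
// where spinning up workers costs more than the compression saves.
// 1 MiB is an assumed starting point, not a measured value.
const MT_SIZE_THRESHOLD: u64 = 1024 * 1024;

fn zstd_workers(estimated_archive_bytes: u64) -> u32 {
    if estimated_archive_bytes < MT_SIZE_THRESHOLD {
        // 0 means: leave the encoder in plain single-threaded mode.
        return 0;
    }
    // Fall back to 4 workers if parallelism can't be queried.
    thread::available_parallelism().map_or(4, |n| n.get() as u32)
}
```

Estimating archive size up front (e.g. from summed file sizes gathered while walking outputs) is cheap relative to compression itself.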

Verify:

# Cold build with cache cleared
rm -rf node_modules/.cache/turbo
time turbo run build --force  # before
# After patch:
time turbo run build --force  # compare archive creation time in profile

4. Globwalk Pattern Caching (LOW-MEDIUM — est. 10-20% of globwalk time)

File: crates/turborepo-globwalk/src/lib.rs:285-290

glob_with_contextual_error() calls Glob::new(raw).map(|g| g.into_owned()) every time with no memoization. In 100-package repos, dist/** gets compiled 100+ times.

Fix: Thread-local LRU cache:

use std::cell::RefCell;
use lru::LruCache;

fn glob_with_contextual_error<S: AsRef<str>>(raw: S) -> Result<Glob<'static>, WalkError> {
    thread_local! {
        static CACHE: RefCell<LruCache<String, Glob<'static>>> =
            RefCell::new(LruCache::new(std::num::NonZeroUsize::new(256).unwrap()));
    }
    let raw = raw.as_ref();
    CACHE.with(|c| {
        let mut cache = c.borrow_mut();
        if let Some(g) = cache.get(raw) {
            return Ok(g.clone());
        }
        let g = Glob::new(raw).map(|g| g.into_owned())
            .map_err(|e| WalkError::BadPattern(raw.to_string(), Box::new(e)))?;
        cache.put(raw.to_string(), g.clone());
        Ok(g)
    })
}

Verify:

# Add tracing: count Glob::new calls vs cache hits
turbo run build --profile  # look at globwalk spans

5. DashMap Pre-sizing in Lockfile Closure (LOW — marginal)

File: crates/turborepo-lockfiles/src/lib.rs:135

DashMap::new() starts at default capacity. Pre-sizing avoids rehash overhead.

Fix:

let estimated_packages: usize = workspaces.values().map(|deps| deps.len()).sum();
let resolve_cache: ResolveCache = DashMap::with_capacity(estimated_packages);

Verify: Only worth measuring if lockfile closure is >5% of total run time in profiles.


Ripple Effects

| Change | Files Touched | Breaking Risk |
| --- | --- | --- |
| Regex caching | turborepo-env/src/lib.rs, turborepo-task-hash/src/lib.rs | None — internal only |
| RwLock split | turborepo-task-hash/src/lib.rs | Low — all access goes through TaskHashTracker methods; interface unchanged |
| zstd MT | turborepo-cache/src/cache_archive/create.rs | Medium — archive bytes change; verify remote cache compatibility |
| Glob caching | turborepo-globwalk/src/lib.rs | None — pure internal optimization |
| DashMap pre-size | turborepo-lockfiles/src/lib.rs | None — behavior identical |

Key risk for zstd MT: If the remote cache stores raw archive bytes and compares them for dedup, switching to MT compression would produce different bytes for the same content. The cache key is the task hash (not archive hash), so this should be safe — but verify with turborepo-cache integration tests.


What This Unlocks

  • No-op runs (fully cached): Regex caching + RwLock split target the overhead that dominates when actual task execution is skipped. Expected: 15-25% faster no-op turbo run on 500+ package repos.
  • Cold builds (cache miss): Parallel zstd compression directly reduces per-task archive creation time. On Next.js monorepos with large .next outputs (100MB+), this could save 2-5 seconds per cache-miss task.
  • Scaling ceiling raised: RwLock split removes the serialization bottleneck that limits rayon's parallelism. Combined with the already-landed wave-based hashing, this means turbo can effectively utilize 16-32 cores during hash computation.
  • Consistent sub-second overhead: Together, these changes push the "turbo overhead" (time spent before/after actual task execution) well below 1 second even on large monorepos.

Effort & Risk

| # | Optimization | Effort | Risk | Expected Gain | Priority |
| --- | --- | --- | --- | --- | --- |
| 1 | Regex caching | Low (1-2 days) | Low — pattern equivalence easy to test | 10-20% of hash phase | P0 |
| 2 | RwLock split | Medium (2-3 days) | Low — methods stay the same, needs stress tests | 5-10% under high parallelism | P1 |
| 3 | zstd MT | Low (1 day) | Medium — archive byte compatibility, thread oversubscription | 20-40% of archive creation | P1 |
| 4 | Glob caching | Low (1 day) | Low — thread-local, no shared state | 10-20% of globwalk | P2 |
| 5 | DashMap pre-size | Trivial (1 hour) | None | <5% | P3 |

Recommended verification harness (applies to all changes):

# 1. No-op run (measures overhead: regex, locks, glob)
turbo run build --dry --profile   # 10 iterations, compare median

# 2. Cold build (measures compression, hashing)  
rm -rf node_modules/.cache/turbo && turbo run build --force --profile

# 3. Scale test (measures lock contention)
for threads in 1 2 4 8 16 32; do
  RAYON_NUM_THREADS=$threads turbo run build --dry --profile 2>&1 | \
    grep precompute_task_hashes
done

# 4. Correctness gate
cargo test -p turborepo-task-hash -p turborepo-env -p turborepo-cache -p turborepo-globwalk

Assumption: These estimates assume a 500+ package monorepo (like Vercel's internal monorepo or similar large Next.js setups). On smaller repos (<50 packages), only the zstd MT change would produce measurable improvement.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. Key findings: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs Maturity: stable Relevance: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. Key findings: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.

Exploration

AREA: task-hash-tracker-rwlock-contention FILES: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs MATURITY: stable RELEVANCE: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. NOTES: (unspecified)

Raw Exploration Notes: AREA: task-hash-tracker-rwlock-contention FILES: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs MATURITY: stable RELEVANCE: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.

Selected Code Context

===== FILE: crates/turborepo-task-hash/src/lib.rs =====
//! turborepo-task-hash: Task hashing utilities for Turborepo cache invalidation
//!
//! This crate provides the core task hashing logic for Turborepo. It computes
//! hashes for tasks based on their inputs (files, environment variables,
//! dependencies) to determine cache invalidation.

pub mod global_hash;

use std::{
    collections::{HashMap, HashSet},
    sync::{Arc, RwLock},
};

pub use global_hash::*;
use rayon::prelude::*;
use serde::Serialize;
use thiserror::Error;
use tracing::debug;
use turbopath::{
    AbsoluteSystemPath, AnchoredSystemPath, AnchoredSystemPathBuf, RelativeUnixPathBuf,
};
use turborepo_cache::CacheHitMetadata;
// Re-export turborepo_engine::TaskNode for convenience
pub use turborepo_engine::TaskNode;
use turborepo_env::{
    BUILTIN_PASS_THROUGH_ENV, BySource, CompiledWildcards, DetailedMap, EnvironmentVariableMap,
};
use turborepo_frameworks::{Slug as FrameworkSlug, infer_framework};
use turborepo_hash::{FileHashes, LockFilePackagesRef, TaskHashable, TurboHash};
use turborepo_repository::package_graph::{PackageInfo, PackageName};
use turborepo_scm::{RepoGitIndex, SCM};
use turborepo_task_id::TaskId;
use turborepo_telemetry::events::{generic::GenericEventBuilder, task::PackageTaskEventBuilder};
use turborepo_types::{
    EnvMode, HashTrackerCacheHitMetadata, HashTrackerDetailedMap, HashTrackerInfo,
    RunOptsHashInfo, TaskDefinitionHashInfo, TaskInputs,
};

#[derive(Debug, Error)]
pub enum Error {
    #[error("Missing pipeline entry: {0}")]
    MissingPipelineEntry(TaskId<'static>),
    #[error("Missing package.json for {0}.")]
    MissingPackageJson(String),
    #[error("Cannot find package-file hash for {0}.")]
    MissingPackageFileHash(String),
    #[error("Missing hash for dependent task {0}.")]
    MissingDependencyTaskHash(String),
    #[error("Cannot acquire lock for task hash tracker.")]
    Mutex,
    #[error("Missing environment variables for {0}.")]
    MissingEnvVars(TaskId<'static>),
    #[error(
        "Error processing environment patterns for task {task_id} (including global exclusions): {err}"
    )]
    EnvPattern {
        task_id: TaskId<'static>,
        #[source]
        err: turborepo_env::Error,
    },
    #[error(transparent)]
    Scm(#[from] turborepo_scm::Error),
    #[error(transparent)]
    Env(#[from] turborepo_env::Error),
    #[error(transparent)]
    Regex(#[from] regex::Error),
    #[error(transparent)]
    Path(#[from] turbopath::PathError),
}

#[derive(Debug, Default)]
pub struct PackageInputsHashes {
    hashes: HashMap<TaskId<'static>, String>,
    expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>,
}

impl PackageInputsHashes {

#[tracing::instrument(skip(
    all_tasks,
    workspaces,
    task_definitions,
    repo_root,
    scm,
    _telemetry,
    pre_built_index
))]
pub fn calculate_file_hashes<'a, T>(
    scm: &SCM,
    all_tasks: impl Iterator<Item = &'a TaskNode>,
    workspaces: HashMap<&PackageName, &PackageInfo>,
    task_definitions: &HashMap<TaskId<'static>, T>,
    repo_root: &AbsoluteSystemPath,
    _telemetry: &GenericEventBuilder,
    pre_built_index: Option<&RepoGitIndex>,
) -> Result<PackageInputsHashes, Error>
where
    T: TaskDefinitionHashInfo + Sync,
{
    tracing::trace!(scm_manual=%scm.is_manual(), "scm running in {} mode", if scm.is_manual() { "manual" } else { "git" });

    // Use the pre-built index if provided, otherwise build one on the spot.
    let owned_index;
    let repo_index = match pre_built_index {
        Some(idx) => Some(idx),
        None => {
            owned_index = scm.build_repo_index(workspaces.len());
            owned_index.as_ref()
        }
    };

    // Phase 1: Collect task metadata and group by (package_path, inputs) for dedup.
    // Multiple tasks in the same package with identical inputs produce the same
    // file hashes — no need to globwalk and hash the same files repeatedly.
    struct TaskInfo<'b> {
        task_id: TaskId<'static>,
        package_path: &'b AnchoredSystemPath,
        inputs: &'b TaskInputs,
    }

    let mut task_infos = Vec::new();
    for task in all_tasks {
        let TaskNode::Task(task_id) = task else {
            continue;
        };
        let task_definition = task_definitions
            .get(task_id)
            .ok_or_else(|| Error::MissingPipelineEntry(task_id.clone()))?;
        let workspace_name = task_id.to_workspace_name();
        let pkg = workspaces
            .get(&workspace_name)
            .ok_or_else(|| Error::MissingPackageJson(workspace_name.to_string()))?;
        let package_path = pkg
            .package_json_path
            .parent()
            .unwrap_or_else(|| AnchoredSystemPath::new("").unwrap());
        let inputs = task_definition.inputs();
        task_infos.push(TaskInfo {
            task_id: task_id.clone(),
            package_path,
            inputs,
        });
    }

    // Build dedup key: (package_path_str, globs, default)
    type HashKey = (AnchoredSystemPathBuf, Vec<String>, bool);
    let mut unique_keys: Vec<HashKey> = Vec::new();
    let mut key_indices: HashMap<HashKey, usize> = HashMap::new();
    let mut task_key_map: Vec<usize> = Vec::with_capacity(task_infos.len());

    for info in &task_infos {
        let key: HashKey = (
            info.package_path.to_owned(),
            info.inputs.globs.clone(),
            info.inputs.default,
        );
        let idx = match key_indices.get(&key) {
            Some(&idx) => idx,
            None => {
                let idx = unique_keys.len();
                key_indices.insert(key.clone(), idx);
                unique_keys.push(key);
                idx
            }
        };
        task_key_map.push(idx);
    }

    debug!(
        total_tasks = task_infos.len(),
        unique_hash_keys = unique_keys.len(),
        "file hash deduplication"
    );

    // Phase 2: Compute file hashes in parallel across unique keys.
    // EMFILE (too many open files) errors are handled via retry-with-backoff
    // in the globwalk and hash_objects layers, so we can safely parallelize
    // all keys on rayon without worrying about fd exhaustion.
    let file_hash_results: Vec<Result<Arc<FileHashes>, Error>> = unique_keys
        .into_par_iter()
        .map(|(package_path, globs, default)| {
            scm.get_package_file_hashes(
                repo_root,
                &package_path,
                &globs,
                default,
                None,
                repo_index,
            )
            .map(|h| {
                let mut v: Vec<_> = h.into_iter().collect();
                v.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
                Arc::new(FileHashes(v))
            })
            .map_err(Error::from)
        })
        .collect();

    let file_hash_results: Vec<Arc<FileHashes>> = file_hash_results
        .into_iter()
        .collect::<Result<Vec<_>, _>>()?;

    // Phase 3: Distribute shared results to individual tasks.
    let mut hashes = HashMap::with_capacity(task_infos.len());
    let mut expanded_hashes = HashMap::with_capacity(task_infos.len());

    for (i, info) in task_infos.into_iter().enumerate() {
        let key_idx = task_key_map[i];
        let file_hashes = &file_hash_results[key_idx];

        let hash = file_hashes.as_ref().hash();

        hashes.insert(info.task_id.clone(), hash);
        expanded_hashes.insert(info.task_id, Arc::clone(file_hashes));
    }

    Ok(PackageInputsHashes {
        hashes,
        expanded_hashes,
    })
}

}
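The three-phase shape above (collect keys, hash each unique key once, then fan the shared result back out through `Arc` clones) is a general dedup pattern. A minimal stdlib-only sketch, with a hypothetical `expensive_hash` standing in for the globwalk-and-hash step:

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Hypothetical stand-in for the expensive globwalk + file-hash step.
fn expensive_hash(key: &str) -> String {
    format!("hash-of-{key}")
}

fn hash_deduped(task_keys: &[(&str, &str)]) -> HashMap<String, Arc<String>> {
    // Phase 1: map each task to the index of its unique key.
    let mut unique_keys: Vec<&str> = Vec::new();
    let mut key_indices: HashMap<&str, usize> = HashMap::new();
    let task_key_map: Vec<usize> = task_keys
        .iter()
        .map(|&(_, key)| {
            *key_indices.entry(key).or_insert_with(|| {
                unique_keys.push(key);
                unique_keys.len() - 1
            })
        })
        .collect();

    // Phase 2: run the expensive step once per unique key
    // (rayon's `into_par_iter` would parallelize this loop).
    let results: Vec<Arc<String>> = unique_keys
        .iter()
        .map(|key| Arc::new(expensive_hash(key)))
        .collect();

    // Phase 3: fan shared results back out via cheap Arc refcount bumps.
    task_keys
        .iter()
        .zip(task_key_map)
        .map(|(&(task, _), idx)| (task.to_string(), Arc::clone(&results[idx])))
        .collect()
}

fn main() {
    let tasks = [("pkg#build", "k1"), ("pkg#test", "k1"), ("ui#build", "k2")];
    let hashes = hash_deduped(&tasks);
    // Two tasks with the same key share one computed hash allocation.
    assert!(Arc::ptr_eq(&hashes["pkg#build"], &hashes["pkg#test"]));
    assert_eq!(hashes["ui#build"].as_str(), "hash-of-k2");
    println!("ok");
}
```

The `Arc::ptr_eq` check demonstrates why Phase 3 is cheap: tasks sharing a key share the same allocation rather than re-hashing or cloning.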

#[derive(Default, Debug, Clone)]
pub struct TaskHashTracker {
    state: Arc<RwLock<TaskHashTrackerState>>,
}

#[derive(Default, Debug, Serialize)]
pub struct TaskHashTrackerState {
    #[serde(skip)]
    package_task_env_vars: HashMap<TaskId<'static>, DetailedMap>,
    package_task_hashes: HashMap<TaskId<'static>, Arc<str>>,
    #[serde(skip)]
    package_task_framework: HashMap<TaskId<'static>, FrameworkSlug>,
    #[serde(skip)]
    package_task_outputs: HashMap<TaskId<'static>, Vec<AnchoredSystemPathBuf>>,
    #[serde(skip)]
    package_task_cache: HashMap<TaskId<'static>, CacheHitMetadata>,
    #[serde(skip)]
    package_task_inputs_expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>,
}

/// Caches package-inputs hashes, and package-task hashes.
pub struct TaskHasher<'a, R> {
    hashes: HashMap<TaskId<'static>, String>,
    run_opts: &'a R,
    env_at_execution_start: &'a EnvironmentVariableMap,
    global_env: EnvironmentVariableMap,
    global_env_patterns: &'a [String],
    global_hash: &'a str,
    task_hash_tracker: TaskHashTracker,
    compiled_builtins: CompiledWildcards,
    external_deps_hash_cache: HashMap<String, String>,
}

impl<'a, R: RunOptsHashInfo> TaskHasher<'a, R> {

pub fn new(
    package_inputs_hashes: PackageInputsHashes,
    run_opts: &'a R,
    env_at_execution_start: &'a EnvironmentVariableMap,
    global_hash: &'a str,
    global_env: EnvironmentVariableMap,
    global_env_patterns: &'a [String],
) -> Self {
    let PackageInputsHashes {
        hashes,
        expanded_hashes,
    } = package_inputs_hashes;

    let compiled_builtins = CompiledWildcards::compile(BUILTIN_PASS_THROUGH_ENV)
        .unwrap_or_else(|_| {
            let empty: &[&str] = &[];
            CompiledWildcards::compile(empty).unwrap()
        });

    Self {
        hashes,
        run_opts,
        env_at_execution_start,
        global_hash,
        global_env,
        global_env_patterns,
        task_hash_tracker: TaskHashTracker::new(expanded_hashes),
        compiled_builtins,
        external_deps_hash_cache: HashMap::new(),
    }
}

/// Pre-compute and cache external dependency hashes for all packages.
/// Many tasks share the same package, so this avoids re-sorting
/// transitive dependencies for every task.
pub fn precompute_external_deps_hashes<'b>(
    &mut self,
    workspaces: impl Iterator<Item = (&'b PackageName, &'b PackageInfo)>,
) {
    if self.run_opts.single_package() {
        return;
    }
    let ws: Vec<_> = workspaces.collect();
    self.external_deps_hash_cache = ws
        .par_iter()
        .map(|(name, info)| {
            let hash = get_external_deps_hash(&info.transitive_dependencies);
            (name.as_str().to_owned(), hash)
        })
        .collect();
}

#[tracing::instrument(skip(self, task_definition, task_env_mode, workspace, dependency_set))]
pub fn calculate_task_hash<T: TaskDefinitionHashInfo>(
    &self,
    task_id: &TaskId<'static>,
    task_definition: &T,
    task_env_mode: EnvMode,
    workspace: &PackageInfo,
    dependency_set: &[&TaskNode],
    telemetry: PackageTaskEventBuilder,
) -> Result<String, Error> {
    let do_framework_inference = self.run_opts.framework_inference();
    let is_monorepo = !self.run_opts.single_package();

    let hash_of_files = self
        .hashes
        .get(task_id)
        .ok_or_else(|| Error::MissingPackageFileHash(task_id.to_string()))?;
    // See if we can infer a framework
    let framework = do_framework_inference
        .then(|| infer_framework(workspace, is_monorepo))
        .flatten()
        .inspect(|framework| {
            debug!("auto detected framework for {}", task_id.package());
            debug!(
                "framework: {}, env_prefix: {:?}",
                framework.slug(),
                framework.env(self.env_at_execution_start)
            );
            telemetry.track_framework(framework.slug().to_string());
        });
    let framework_slug = framework.map(|f| f.slug());

    let env_vars = if let Some(framework) = framework {
        let mut computed_wildcards = framework.env(self.env_at_execution_start);

        if let Some(exclude_prefix) = self
            .env_at_execution_start
            .get("TURBO_CI_VENDOR_ENV_KEY")
            .filter(|prefix| !prefix.is_empty())
        {
            let computed_exclude = format!("!{exclude_prefix}*");
            debug!(
                "excluding environment variables matching wildcard {}",
                computed_exclude
            );
            computed_wildcards.push(computed_exclude);
        }

        // Combine task-specific env patterns with global env exclusions
        // Global exclusions (patterns starting with !) should apply to framework
        // inference
        let combined_env_patterns: Vec<String> = task_definition
            .env()
            .iter()
            .chain(
                self.global_env_patterns
                    .iter()
                    .filter(|p| p.starts_with('!')),
            )
            .cloned()
            .collect();

        self.env_at_execution_start
            .hashable_task_env(&computed_wildcards, &combined_env_patterns)
            .map_err(|err| Error::EnvPattern {
                task_id: task_id.clone().into_owned(),
                err,
            })?
    } else {
        let all_env_var_map = self
            .env_at_execution_start
            .from_wildcards(task_definition.env())?;

        DetailedMap {
            by_source: BySource {
                explicit: all_env_var_map.clone(),
                matching: EnvironmentVariableMap::default(),
            },
            all: all_env_var_map,
        }
    };

    let hashable_env_pairs = env_vars.all.to_hashable();
    let outputs = task_definition.hashable_outputs(task_id);
    let task_dependency_hashes = self.calculate_dependency_hashes(dependency_set)?;
    let ext_hash_fallback;
    let external_deps_hash: Option<&str> = if !is_monorepo {
        None
    } else if let Some(cached) = self.external_deps_hash_cache.get(task_id.package()) {
        Some(cached.as_str())
    } else {
        ext_hash_fallback = get_external_deps_hash(&workspace.transitive_dependencies);
        Some(ext_hash_fallback.as_str())
    };

    if !hashable_env_pairs.is_empty() {
        debug!(
            "task hash env vars for {}:{}\n vars: {:?}",
            task_id.package(),
            task_id.task(),
            hashable_env_pairs
        );
    }

    let package_dir = workspace.package_path().to_unix();
    let is_root_package = package_dir.is_empty();
    // We wrap in an Option to mimic Go's serialization of nullable values
    let optional_package_dir = (!is_root_package).then_some(package_dir);

    let task_hashable = TaskHashable {
        global_hash: self.global_hash,
        task_dependency_hashes,
        package_dir: optional_package_dir,
        hash_of_files,
        external_deps_hash,
        task: task_id.task(),
        outputs,

        pass_through_args: self.run_opts.pass_through_args(),
        env: task_definition.env(),
        resolved_env_vars: hashable_env_pairs,
        pass_through_env: task_definition.pass_through_env().unwrap_or_default(),
        env_mode: task_env_mode,
    };

    let task_hash = task_hashable.calculate_task_hash();

    let task_hash_arc: Arc<str> = Arc::from(task_hash.as_str());
    self.task_hash_tracker.insert_hash(
        task_id.clone(),
        env_vars,
        task_hash_arc,
        framework_slug,
    );

    Ok(task_hash)
}

/// Gets the hashes of a task's dependencies. Because the visitor
/// receives the nodes in topological order, we know that all of
/// the dependencies have been processed before the current task.
///
/// # Arguments
///
/// * `dependency_set`: The dependencies of the current task
///
/// returns: Result<Vec<Arc<str>>, Error>
fn calculate_dependency_hashes(
    &self,
    dependency_set: &[&TaskNode],
) -> Result<Vec<Arc<str>>, Error> {
    let state = self
        .task_hash_tracker
        .state
        .read()
        .expect("hash tracker rwlock poisoned");

    let mut dependency_hash_list: Vec<Arc<str>> = Vec::with_capacity(dependency_set.len());
    for dependency_task in dependency_set {
        let TaskNode::Task(dependency_task_id) = dependency_task else {
            continue;
        };

        let dependency_hash = state
            .package_task_hashes
            .get(dependency_task_id)
            .ok_or_else(|| Error::MissingDependencyTaskHash(dependency_task.to_string()))?;
        dependency_hash_list.push(Arc::clone(dependency_hash));
    }
    drop(state);

    dependency_hash_list.sort_unstable();
    dependency_hash_list.dedup();

    Ok(dependency_hash_list)
}

pub fn into_task_hash_tracker_state(self) -> TaskHashTrackerState {
    let rwlock = Arc::into_inner(self.task_hash_tracker.state)
        .expect("multiple references to tracker state still exist");
    rwlock.into_inner().unwrap()
}

pub fn task_hash_tracker(&self) -> TaskHashTracker {
    self.task_hash_tracker.clone()
}

pub fn env<T: TaskDefinitionHashInfo>(
    &self,
    task_id: &TaskId,
    task_env_mode: EnvMode,
    task_definition: &T,
) -> Result<EnvironmentVariableMap, Error> {
    match task_env_mode {
        EnvMode::Strict => {
            let pass_through_env_vars = self.env_at_execution_start.pass_through_env_compiled(
                &self.compiled_builtins,
                &self.global_env,
                task_definition.pass_through_env().unwrap_or_default(),
            )?;

            let tracker_env = self
                .task_hash_tracker
                .env_vars(task_id)
                .ok_or_else(|| Error::MissingEnvVars(task_id.clone().into_owned()))?;

            let mut full_task_env = EnvironmentVariableMap::default();
            full_task_env.union(&pass_through_env_vars);
            full_task_env.union(&tracker_env.all);

            Ok(full_task_env)
        }
        EnvMode::Loose => Ok(self.env_at_execution_start.clone()),
    }
}

}
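The `Strict` branch of `env()` builds the final environment by unioning the pass-through vars first and the tracker's hashed vars second. Assuming `union` has last-write-wins semantics (like `HashMap::extend`), task-level values override pass-through values on key collisions. A stdlib sketch of that ordering:

```rust
use std::collections::HashMap;

// Build a strict-mode env: pass-through vars first, then the task's hashed
// vars, so task-level values win on key collisions (last-write-wins).
fn strict_env<'a>(
    pass_through: &HashMap<&'a str, &'a str>,
    tracker_env: &HashMap<&'a str, &'a str>,
) -> HashMap<&'a str, &'a str> {
    let mut full_task_env = HashMap::new();
    full_task_env.extend(pass_through.iter().map(|(k, v)| (*k, *v)));
    full_task_env.extend(tracker_env.iter().map(|(k, v)| (*k, *v)));
    full_task_env
}

fn main() {
    let pass_through = HashMap::from([("PATH", "/usr/bin"), ("NODE_ENV", "development")]);
    let tracker = HashMap::from([("NODE_ENV", "production"), ("API_URL", "https://example.com")]);
    let env = strict_env(&pass_through, &tracker);
    assert_eq!(env["NODE_ENV"], "production"); // task value overrides pass-through
    assert_eq!(env["PATH"], "/usr/bin");
    assert_eq!(env.len(), 3);
    println!("ok");
}
```

This is only an illustration of the union order; the real `EnvironmentVariableMap::union` operates on owned `String` keys and values.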

pub fn get_external_deps_hash(
    transitive_dependencies: &Option<HashSet<turborepo_lockfiles::Package>>,
) -> String {

let Some(transitive_dependencies) = transitive_dependencies else {
    return "".into();
};

// Collect references instead of cloning each Package (which has two Strings).
let mut transitive_deps: Vec<&turborepo_lockfiles::Package> =
    transitive_dependencies.iter().collect();

transitive_deps.sort_unstable_by(|a, b| match a.key.cmp(&b.key) {
    std::cmp::Ordering::Equal => a.version.cmp(&b.version),
    other => other,
});

LockFilePackagesRef(transitive_deps).hash()

}
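Collecting `&Package` references and sorting those, rather than cloning each `Package` (two owned `String`s) into a temporary `Vec`, is the allocation win here. A stdlib-only sketch with a hypothetical two-field struct standing in for `turborepo_lockfiles::Package`:

```rust
use std::collections::HashSet;

// Hypothetical stand-in for turborepo_lockfiles::Package (two owned Strings).
#[derive(Hash, PartialEq, Eq)]
struct Pkg {
    key: String,
    version: String,
}

// Sort references into deterministic (key, version) order without cloning
// any of the underlying Strings — only pointer-sized refs move.
fn sorted_keys(set: &HashSet<Pkg>) -> Vec<&str> {
    let mut refs: Vec<&Pkg> = set.iter().collect();
    refs.sort_unstable_by(|a, b| a.key.cmp(&b.key).then_with(|| a.version.cmp(&b.version)));
    refs.iter().map(|p| p.key.as_str()).collect()
}

fn main() {
    let set: HashSet<Pkg> = [
        Pkg { key: "react".into(), version: "18.0.0".into() },
        Pkg { key: "lodash".into(), version: "4.17.21".into() },
    ]
    .into_iter()
    .collect();
    // Deterministic order regardless of HashSet iteration order.
    assert_eq!(sorted_keys(&set), ["lodash", "react"]);
    println!("ok");
}
```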

pub fn get_internal_deps_hash(
    scm: &SCM,
    root: &AbsoluteSystemPath,
    package_dirs: Vec<&AnchoredSystemPath>,
    pre_built_index: Option<&RepoGitIndex>,
) -> Result<String, Error> {

if package_dirs.is_empty() {
    return Ok("".into());
}

let owned_index;
let repo_index = match pre_built_index {
    Some(idx) => Some(idx),
    None => {
        owned_index = scm.build_repo_index(package_dirs.len());
        owned_index.as_ref()
    }
};

let merged = package_dirs
    .into_par_iter()
    .map(|package_dir| {
        scm.get_package_file_hashes::<&str>(root, package_dir, &[], false, None, repo_index)
    })
    .reduce(
        || Ok(HashMap::new()),
        |acc, hashes| {
            let mut acc = acc?;
            let hashes = hashes?;
            acc.extend(hashes.into_iter());
            Ok(acc)
        },
    )?;

let mut file_hashes: Vec<_> = merged.into_iter().collect();
file_hashes.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
Ok(FileHashes(file_hashes).hash())

}
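The rayon `reduce` above merges the per-package hash maps pairwise; since package directories are disjoint, the merge never clobbers entries, and it is sequentially equivalent to a fold into one accumulator followed by a deterministic sort. A stdlib sketch of that equivalent:

```rust
use std::collections::HashMap;

// Sequential equivalent of the parallel reduce: fold every per-package map
// into one accumulator (package paths are disjoint, so extend never clobbers).
fn merge_and_sort(per_package: Vec<HashMap<String, String>>) -> Vec<(String, String)> {
    let merged = per_package
        .into_iter()
        .fold(HashMap::new(), |mut acc, hashes| {
            acc.extend(hashes);
            acc
        });
    // Sort so the final hash input is deterministic, as the real code does.
    let mut entries: Vec<_> = merged.into_iter().collect();
    entries.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
    entries
}

fn main() {
    let merged = merge_and_sort(vec![
        HashMap::from([("pkg-b/src/index.ts".to_string(), "bbb".to_string())]),
        HashMap::from([("pkg-a/src/index.ts".to_string(), "aaa".to_string())]),
    ]);
    assert_eq!(merged[0].0, "pkg-a/src/index.ts");
    assert_eq!(merged.len(), 2);
    println!("ok");
}
```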

impl TaskHashTracker {

pub fn new(input_expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>) -> Self {
    Self {
        state: Arc::new(RwLock::new(TaskHashTrackerState {
            package_task_inputs_expanded_hashes: input_expanded_hashes,
            ..Default::default()
        })),
    }
}

pub fn hash(&self, task_id: &TaskId) -> Option<Arc<str>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_hashes.get(task_id).cloned()
}

fn insert_hash(
    &self,
    task_id: TaskId<'static>,
    env_vars: DetailedMap,
    hash: Arc<str>,
    framework_slug: Option<FrameworkSlug>,
) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state
        .package_task_env_vars
        .insert(task_id.clone(), env_vars);
    if let Some(framework) = framework_slug {
        // Only pay for one extra clone when framework inference is active.
        state
            .package_task_framework
            .insert(task_id.clone(), framework);
    }
    state.package_task_hashes.insert(task_id, hash);
}

pub fn env_vars(&self, task_id: &TaskId) -> Option<DetailedMap> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_env_vars.get(task_id).cloned()
}

pub fn framework(&self, task_id: &TaskId) -> Option<FrameworkSlug> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_framework.get(task_id).cloned()
}

pub fn expanded_outputs(&self, task_id: &TaskId) -> Option<Vec<AnchoredSystemPathBuf>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_outputs.get(task_id).cloned()
}

pub fn insert_expanded_outputs(
    &self,
    task_id: TaskId<'static>,
    outputs: Vec<AnchoredSystemPathBuf>,
) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state.package_task_outputs.insert(task_id, outputs);
}

pub fn cache_status(&self, task_id: &TaskId) -> Option<CacheHitMetadata> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_cache.get(task_id).copied()
}

pub fn insert_cache_status(&self, task_id: TaskId<'static>, cache_status: CacheHitMetadata) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state.package_task_cache.insert(task_id, cache_status);
}

pub fn get_expanded_inputs(&self, task_id: &TaskId) -> Option<Arc<FileHashes>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state
        .package_task_inputs_expanded_hashes
        .get(task_id)
        .cloned()
}

}
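The accessor pattern above — a shared `std::sync::RwLock` that many readers hold concurrently while writes stay exclusive, handing out `Arc<str>` clones that are refcount bumps rather than string copies — can be sketched standalone:

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};

type SharedHashes = Arc<RwLock<HashMap<String, Arc<str>>>>;

// Read a task hash under a shared read lock; the Arc<str> clone is a
// refcount bump, not a string copy.
fn read_hash(state: &SharedHashes, task: &str) -> Option<Arc<str>> {
    state.read().expect("rwlock poisoned").get(task).cloned()
}

fn main() {
    let state: SharedHashes = Arc::new(RwLock::new(HashMap::new()));
    state
        .write()
        .expect("rwlock poisoned")
        .insert("pkg#build".to_string(), Arc::from("abc123"));

    // Many readers can hold the lock at once; only writes are exclusive.
    let handles: Vec<_> = (0..4)
        .map(|_| {
            let state = Arc::clone(&state);
            std::thread::spawn(move || read_hash(&state, "pkg#build"))
        })
        .collect();
    for h in handles {
        assert_eq!(h.join().unwrap().as_deref(), Some("abc123"));
    }
    println!("ok");
}
```

Note this still serializes writers against all readers, which is exactly the contention the plan calls out for `TaskHashTrackerState`'s single lock over six maps.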

// Implement HashTrackerInfo for TaskHashTracker to allow use with
// turborepo-run-summary. The trait is defined in turborepo-types to enable
// proper dependency direction (task-hash doesn't depend on run-summary).
impl HashTrackerInfo for TaskHashTracker {

fn hash(&self, task_id: &TaskId) -> Option<Arc<str>> {
    TaskHashTracker::hash(self, task_id)
}

fn env_vars(&self, task_id: &TaskId) -> Option<HashTrackerDetailedMap> {
    TaskHashTracker::env_vars(self, task_id).map(|detailed| HashTrackerDetailedMap {
        explicit: detailed.by_source.explicit.to_secret_hashable(),
        matching: detailed.by_source.matching.to_secret_hashable(),
    })
}

fn cache_status(&self, task_id: &TaskId) -> Option<HashTrackerCacheHitMetadata> {
    TaskHashTracker::cache_status(self, task_id).map(|status| {
        let (local, remote) = match status.source {
            turborepo_cache::CacheSource::Local => (true, false),
            turborepo_cache::CacheSource::Remote => (false, true),
        };
        HashTrackerCacheHitMetadata {
            local,
            remote,
            time_saved: status.time_saved,
        }
    })
}

fn expanded_outputs(&self, task_id: &TaskId) -> Option<Vec<AnchoredSystemPathBuf>> {
    TaskHashTracker::expanded_outputs(self, task_id)
}

fn framework(&self, task_id: &TaskId) -> Option<String> {
    TaskHashTracker::framework(self, task_id).map(|f| f.to_string())
}

fn expanded_inputs(&self, task_id: &TaskId) -> Option<Vec<(RelativeUnixPathBuf, String)>> {
    TaskHashTracker::get_expanded_inputs(self, task_id).map(|file_hashes| file_hashes.0.clone())
}

}

// Implement HashTrackerProvider for TaskHashTracker to allow use with
// turborepo-task-executor's TaskExecutor.
impl turborepo_task_executor::HashTrackerProvider for TaskHashTracker {

fn insert_cache_status(&self, task_id: TaskId<'static>, status: CacheHitMetadata) {
    TaskHashTracker::insert_cache_status(self, task_id, status)
}

fn insert_expanded_outputs(
    &self,
    task_id: TaskId<'static>,
    outputs: Vec<AnchoredSystemPathBuf>,
) {
    TaskHashTracker::insert_expanded_outputs(self, task_id, outputs)
}

}

#[cfg(test)]
mod test {

use super::*;

#[test]
fn test_hash_tracker_is_send_and_sync() {
    // We need the tracker to implement these traits as multiple tasks will query
    // and write to it
    fn assert_send<T: Send>() {}
    fn assert_sync<T: Sync>() {}
    assert_send::<TaskHashTracker>();
    assert_sync::<TaskHashTracker>();
}

#[test]
fn test_hash_tracker_concurrent_reads() {
    let tracker = TaskHashTracker::new(HashMap::new());
    let task_id: TaskId<'static> = TaskId::new("pkg", "build");
    tracker.insert_hash(
        task_id.clone(),
        DetailedMap::default(),
        Arc::from("abc123"),
        None,
    );

    // Multiple concurrent reads should not deadlock or panic with RwLock
    std::thread::scope(|s| {
        for _ in 0..8 {
            let tracker = &tracker;
            let task_id = &task_id;
            s.spawn(move || {
                for _ in 0..100 {
                    let h = tracker.hash(task_id);
                    assert_eq!(h.as_deref(), Some("abc123"));
                }
            });
        }
    });
}

#[test]
fn test_hash_tracker_concurrent_read_write() {
    let tracker = TaskHashTracker::new(HashMap::new());

    // Pre-create owned task IDs to avoid lifetime issues with TaskId borrows
    let task_ids: Vec<TaskId<'static>> = (0..50)
        .map(|i| TaskId::new("pkg", &format!("task-{i}")).into_owned())
        .collect();

    // One writer, many readers — verifies RwLock allows concurrent reads
    // while writes are exclusive, without deadlock.
    std::thread::scope(|s| {
        let tracker = &tracker;
        let task_ids = &task_ids;

        s.spawn(move || {
            for (i, task_id) in task_ids.iter().enumerate() {
                tracker.insert_hash(
                    task_id.clone(),
                    DetailedMap::default(),
                    Arc::from(format!("hash-{i}").as_str()),
                    None,
                );
            }
        });

        for _ in 0..4 {
            s.spawn(move || {
                for task_id in task_ids {
                    // May or may not find the hash depending on timing — that's fine,
                    // we're testing for absence of panics/deadlocks.
                    let _ = tracker.hash(task_id);
                    let _ = tracker.env_vars(task_id);
                    let _ = tracker.cache_status(task_id);
                }
            });
        }
    });
}

#[test]
fn test_expanded_inputs_returns_cloned_data() {
    use turborepo_types::HashTrackerInfo;

    let task_id: TaskId<'static> = TaskId::new("pkg", "build");
    // Sorted by key (the invariant FileHashes requires)
    let file_hashes = FileHashes(vec![
        (
            RelativeUnixPathBuf::new("package.json").unwrap(),
            "def456".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("src/index.ts").unwrap(),
            "abc123".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("src/utils/helper.ts").unwrap(),
            "ghi789".to_string(),
        ),
    ]);

    let mut input_hashes = HashMap::new();
    input_hashes.insert(task_id.clone(), Arc::new(file_hashes));
    let tracker = TaskHashTracker::new(input_hashes);

    // Via concrete method
    let arc_result = tracker.get_expanded_inputs(&task_id);
    assert!(arc_result.is_some());
    let arc_hashes = arc_result.unwrap();
    assert_eq!(arc_hashes.0.len(), 3);
    assert_eq!(arc_hashes.0[1].0.as_str(), "src/index.ts");
    assert_eq!(arc_hashes.0[1].1, "abc123");

    // Via trait method — returns sorted Vec
    let trait_result: Option<Vec<(RelativeUnixPathBuf, String)>> =
        HashTrackerInfo::expanded_inputs(&tracker, &task_id);
    assert!(trait_result.is_some());
    let trait_hashes = trait_result.unwrap();
    assert_eq!(trait_hashes.len(), 3);
    assert_eq!(trait_hashes[0].0.as_str(), "package.json");
    assert_eq!(trait_hashes[0].1, "def456");
    // Must be sorted by key
    assert!(
        trait_hashes.windows(2).all(|w| w[0].0 < w[1].0),
        "expanded_inputs should return sorted keys"
    );

    // Missing task returns None
    let missing = TaskId::new("other", "test");
    assert!(tracker.get_expanded_inputs(&missing).is_none());
    assert!(HashTrackerInfo::expanded_inputs(&tracker, &missing).is_none());
}

// Regression: expanded_inputs data must contain all entries and be sorted
// by key. This captures the invariant that must hold when switching the
// return type from BTreeMap to sorted Vec.
#[test]
fn test_expanded_inputs_sorted_and_complete() {
    use turborepo_types::HashTrackerInfo;

    let task_id: TaskId<'static> = TaskId::new("pkg", "build");
    // Sorted by key (FileHashes invariant)
    let file_hashes = FileHashes(vec![
        (
            RelativeUnixPathBuf::new("a/first.ts").unwrap(),
            "aaa".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("a/second.ts").unwrap(),
            "bbb".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("m/middle.ts").unwrap(),
            "mmm".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("z/last.ts").unwrap(),
            "zzz".to_string(),
        ),
    ]);

    let mut input_hashes = HashMap::new();
    input_hashes.insert(task_id.clone(), Arc::new(file_hashes));
    let tracker = TaskHashTracker::new(input_hashes);

    let result = HashTrackerInfo::expanded_inputs(&tracker, &task_id).unwrap();
    assert_eq!(result.len(), 4, "all entries must be present");

    // Entries must be sorted by key
    assert!(
        result.windows(2).all(|w| w[0].0 < w[1].0),
        "expanded_inputs must return keys in sorted order"
    );

    // Verify specific values
    assert_eq!(result[0].0.as_str(), "a/first.ts");
    assert_eq!(result[0].1, "aaa");
    assert_eq!(result[3].0.as_str(), "z/last.ts");
    assert_eq!(result[3].1, "zzz");
}

#[test]
fn test_external_deps_hash_deterministic() {
    use turborepo_lockfiles::Package;

    let deps: HashSet<Package> = vec![
        Package {
            key: "react".to_string(),
            version: "18.0.0".to_string(),
        },
        Package {
            key: "lodash".to_string(),
            version: "4.17.21".to_string(),
        },
        Package {
            key: "typescript".to_string(),
            version: "5.0.0".to_string(),
        },
    ]
    .into_iter()
    .collect();

    let hash1 = get_external_deps_hash(&Some(deps.clone()));
    let hash2 = get_external_deps_hash(&Some(deps));
    assert_eq!(hash1, hash2, "same deps should produce same hash");
    assert!(!hash1.is_empty(), "hash should be non-empty");
}

#[test]
fn test_external_deps_hash_empty() {
    let hash_none = get_external_deps_hash(&None);
    assert_eq!(hash_none, "", "None deps should produce empty hash");

    let hash_empty = get_external_deps_hash(&Some(HashSet::new()));
    assert!(
        !hash_empty.is_empty(),
        "empty set should produce non-empty hash"
    );
}

#[test]
fn test_external_deps_hash_order_independent() {
    use turborepo_lockfiles::Package;

    let deps1: HashSet<Package> = vec![
        Package {
            key: "a".to_string(),
            version: "1.0".to_string(),
        },
        Package {
            key: "b".to_string(),
            version: "2.0".to_string(),
        },
    ]
    .into_iter()
    .collect();

    let deps2: HashSet<Package> = vec![
        Package {
            key: "b".to_string(),
            version: "2.0".to_string(),
        },
        Package {
            key: "a".to_string(),
            version: "1.0".to_string(),
        },
    ]
    .into_iter()
    .collect();

    let hash1 = get_external_deps_hash(&Some(deps1));
    let hash2 = get_external_deps_hash(&Some(deps2));
    assert_eq!(
        hash1, hash2,
        "hash should be order-independent since we sort"
    );
}

#[test]
fn test_tracker_pre_sized_hashmaps() {
    let mut input_hashes = HashMap::new();
    for i in 0..100 {
        let task_id = TaskId::new("pkg", &format!("task-{i}")).into_owned();
        input_hashes.insert(task_id, Arc::new(FileHashes(Vec::new())));
    }
    let tracker = TaskHashTracker::new(input_hashes);

    // Insert hashes and verify pre-sizing didn't break anything
    for i in 0..100 {
        let task_id = TaskId::new("pkg", &format!("task-{i}")).into_owned();
        tracker.insert_hash(
            task_id.clone(),
            DetailedMap::default(),
            Arc::from(format!("hash-{i}").as_str()),
            None,
        );
        assert_eq!(
            tracker.hash(&task_id).as_deref(),
            Some(format!("hash-{i}").as_str())
        );
    }
}

// Validates that sort+dedup produces the same result as the previous
// HashSet→Vec→sort approach for dependency hash deduplication.
#[test]
fn test_sort_dedup_matches_hashset_behavior() {
    let inputs: Vec<Vec<&str>> = vec![
        vec!["abc", "def", "abc", "ghi", "def"],
        vec!["zzz", "aaa", "mmm"],
        vec!["same", "same", "same"],
        vec![],
        vec!["only-one"],
    ];

    for input in inputs {
        // New approach: sort + dedup
        let mut sort_dedup: Vec<String> = input.iter().map(|s| s.to_string()).collect();
        sort_dedup.sort_unstable();
        sort_dedup.dedup();

        // Old approach: HashSet → Vec → sort
        let hash_set: HashSet<String> = input.iter().map(|s| s.to_string()).collect();
        let mut hashset_sorted: Vec<String> = hash_set.into_iter().collect();
        hashset_sorted.sort();

        assert_eq!(
            sort_dedup, hashset_sorted,
            "sort+dedup and hashset+sort should produce identical results for: {input:?}"
        );
    }
}

}

===== END FILE: crates/turborepo-task-hash/src/lib.rs =====

===== FILE: crates/turborepo-lib/src/task_graph/visitor/mod.rs =====

mod command;
mod exec;

use std::{
    borrow::Cow,
    collections::{HashMap, HashSet},
    io::Write,
    sync::{Arc, Mutex},
};

use console::{Style, StyledObject};
use convert_case::{Case, Casing};
use exec::ExecContextFactory;
use futures::{stream::FuturesUnordered, StreamExt};
use itertools::Itertools;
use miette::{Diagnostic, NamedSource, SourceSpan};
use tokio::sync::mpsc;
use tracing::{debug, warn, Instrument, Span};
use turbopath::{AbsoluteSystemPath, AnchoredSystemPath};
use turborepo_ci::{Vendor, VendorBehavior};
use turborepo_engine::{TaskError, TaskWarning};
use turborepo_env::{platform::PlatformEnv, EnvironmentVariableMap};
use turborepo_errors::TURBO_SITE;
use turborepo_process::ProcessManager;
use turborepo_repository::package_graph::{PackageGraph, PackageName, ROOT_PKG_NAME};
use turborepo_run_summary::{self as summary, GlobalHashSummary, RunTracker};
use turborepo_scm::SCM;
// Re-export output types and shared functions from turborepo-task-executor
pub use turborepo_task_executor::{turbo_regex, StdWriter, TaskOutput};
use turborepo_task_id::TaskId;
use turborepo_telemetry::events::{
    generic::GenericEventBuilder, task::PackageTaskEventBuilder, EventBuilder, TrackedErrors,
};
use turborepo_types::{EnvMode, ResolvedLogOrder, ResolvedLogPrefix};
use turborepo_ui::{
    sender::UISender, ColorConfig, ColorSelector, OutputClient, OutputSink, PrefixedUI,
};

use crate::{
    engine::{Engine, ExecutionOptions},
    microfrontends::MicrofrontendsConfigs,
    opts::RunOpts,
    run::{task_access::TaskAccess, RunCache},
    task_hash::{
        self, GlobalHashableInputs, PackageInputsHashes, TaskHashTrackerState, TaskHasher,
    },
};

// This holds the whole world
pub struct Visitor<'a> {
    color_cache: ColorSelector,
    dry: bool,
    global_env_mode: EnvMode,
    manager: ProcessManager,
    run_opts: &'a RunOpts,
    package_graph: Arc<PackageGraph>,
    repo_root: &'a AbsoluteSystemPath,
    run_cache: Arc<RunCache>,
    run_tracker: RunTracker,
    task_access: &'a TaskAccess,
    sink: OutputSink,
    task_hasher: TaskHasher<'a>,
    color_config: ColorConfig,
    is_watch: bool,
    ui_sender: Option<UISender>,
    warnings: Arc<Mutex<Vec<TaskWarning>>>,
    micro_frontends_configs: Option<&'a MicrofrontendsConfigs>,
}

#[derive(Debug, thiserror::Error, Diagnostic)]
#[error(
    "Your package.json script looks like it invokes a Root Task ({task_name}), creating a loop
of turbo invocations. You likely have misconfigured your scripts and tasks or your package
manager's Workspace structure."
)]
#[diagnostic(
    code(recursive_turbo_invocations),
    url(
        "{}/messages/{}",
        TURBO_SITE,
        self.code().unwrap().to_string().to_case(Case::Kebab)
    )
)]
pub struct RecursiveTurboError {
    pub task_name: String,
    pub command: String,
    #[label("This script calls turbo, which calls the script, which calls turbo...")]
    pub span: Option<SourceSpan>,
    #[source_code]
    pub text: NamedSource<String>,
}

#[derive(Debug, thiserror::Error, Diagnostic)]
pub enum Error {
    #[error("Cannot find package {package_name} for task {task_id}.")]
    MissingPackage {
        package_name: PackageName,
        task_id: TaskId<'static>,
    },
    #[error(transparent)]
    #[diagnostic(transparent)]
    RecursiveTurbo(Box<RecursiveTurboError>),
    #[error("Could not find definition for task")]
    MissingDefinition,
    #[error("Error while executing engine: {0}")]
    Engine(#[from] crate::engine::ExecuteError),
    #[error(transparent)]
    TaskHash(#[from] task_hash::Error),
    #[error(transparent)]
    RunSummary(#[from] summary::Error),
    #[error("Internal errors encountered: {0}")]
    InternalErrors(String),
    #[error("Unable to find package manager binary: {0}")]
    Which(#[from] which::Error),
    #[error(transparent)]
    CommandProvider(#[from] turborepo_task_executor::CommandProviderError),
}

impl<'a> Visitor<'a> {

// Disabling this lint until we stop adding state to the visitor.
// Once we have the full picture we will go about grouping these pieces of data
// together
#[allow(clippy::too_many_arguments)]
pub async fn new(
    package_graph: Arc<PackageGraph>,
    run_cache: Arc<RunCache>,
    run_tracker: RunTracker,
    task_access: &'a TaskAccess,
    run_opts: &'a RunOpts,
    package_inputs_hashes: PackageInputsHashes,
    env_at_execution_start: &'a EnvironmentVariableMap,
    global_hash: &'a str,
    color_config: ColorConfig,
    manager: ProcessManager,
    repo_root: &'a AbsoluteSystemPath,
    global_env: EnvironmentVariableMap,
    global_env_patterns: &'a [String],
    ui_sender: Option<UISender>,
    is_watch: bool,
    micro_frontends_configs: Option<&'a MicrofrontendsConfigs>,
) -> Self {
    let (task_hasher, sink, color_cache) = {
        let _span = tracing::info_span!("visitor_new").entered();
        let mut task_hasher = TaskHasher::new(
            package_inputs_hashes,
            run_opts,
            env_at_execution_start,
            global_hash,
            global_env,
            global_env_patterns,
        );

        task_hasher.precompute_external_deps_hashes(package_graph.packages());

        let sink = Self::sink(run_opts);
        let color_cache = ColorSelector::default();
        (task_hasher, sink, color_cache)
    };

    // Set up correct size for underlying pty (requires .await, so outside span)
    if let Some(app) = ui_sender.as_ref() {
        if let Some(pane_size) = app.pane_size().await {
            manager.set_pty_size(pane_size.rows, pane_size.cols);
        }
    }

    Self {
        color_cache,
        dry: false,
        global_env_mode: run_opts.env_mode,
        manager,
        run_opts,
        package_graph,
        repo_root,
        run_cache,
        run_tracker,
        task_access,
        sink,
        task_hasher,
        color_config,
        ui_sender,
        is_watch,
        warnings: Default::default(),
        micro_frontends_configs,
    }
}

/// Pre-compute task hashes and execution environments for all tasks in
/// parallel. Tasks are processed in topological waves so dependency
/// hashes are always available when needed. Returns a map from TaskId
/// to (hash, execution_env).
fn precompute_task_hashes(
    &self,
    engine: &Engine,
    telemetry: &GenericEventBuilder,
) -> Result<HashMap<TaskId<'static>, (String, EnvironmentVariableMap)>, Error> {
    use petgraph::algo::toposort;
    use rayon::prelude::*;
    use turborepo_engine::TaskNode;

    let graph = engine.task_graph();
    let mut sorted = toposort(graph, None).map_err(|_| Error::MissingDefinition)?;
    // toposort returns dependents before dependencies (edges point
    // dependent→dependency via Outgoing). Reverse so dependencies
    // come first.
    sorted.reverse();

    // Compute depth (topological level) for each node so we can process
    // independent tasks in parallel within each wave. Dependencies
    // (Outgoing neighbors) must have lower depth.
    let mut depth: HashMap<petgraph::graph::NodeIndex, usize> = HashMap::new();
    for &node_idx in &sorted {
        let max_dep_depth = graph
            .neighbors_directed(node_idx, petgraph::Direction::Outgoing)
            .filter_map(|dep| depth.get(&dep))
            .max()
            .copied();
        let d = match max_dep_depth {
            Some(dd) => dd + 1,
            None => 0,
        };
        depth.insert(node_idx, d);
    }

    let max_depth = depth.values().max().copied().unwrap_or(0);

    // Group task nodes by depth level.
    let mut waves: Vec<Vec<petgraph::graph::NodeIndex>> = vec![Vec::new(); max_depth + 1];
    for &node_idx in &sorted {
        let d = depth[&node_idx];
        waves[d].push(node_idx);
    }

    let results: Arc<Mutex<HashMap<TaskId<'static>, (String, EnvironmentVariableMap)>>> =
        Arc::new(Mutex::new(HashMap::with_capacity(sorted.len())));

    // Process each wave in parallel. Within a wave, all dependencies
    // have already been hashed in earlier waves.
    for wave in &waves {
        type HashResult =
            Result<Option<(TaskId<'static>, String, EnvironmentVariableMap)>, Error>;
        let wave_results: Vec<HashResult> = wave
            .par_iter()
            .map(|&node_idx| {
                let node = &graph[node_idx];
                let TaskNode::Task(task_id) = node else {
                    return Ok(None);
                };

                let package_name = PackageName::from(task_id.package());
                let workspace_info = self
                    .package_graph
                    .package_info(&package_name)
                    .ok_or_else(|| Error::MissingPackage {
                        package_name: package_name.clone(),
                        task_id: task_id.clone(),
                    })?;

                let task_definition = engine
                    .task_definition(task_id)
                    .ok_or(Error::MissingDefinition)?;

                let task_env_mode = task_definition.env_mode.unwrap_or(self.global_env_mode);

                let dependency_set = engine
                    .dependencies(task_id)
                    .ok_or(Error::MissingDefinition)?;

                let package_task_event =
                    PackageTaskEventBuilder::new(task_id.package(), task_id.task())
                        .with_parent(telemetry);
                package_task_event.track_env_mode(&task_env_mode.to_string());

                let task_hash_telemetry = package_task_event.child();
                let task_hash = self.task_hasher.calculate_task_hash(
                    task_id,
                    task_definition,
                    task_env_mode,
                    workspace_info,
                    &dependency_set,
                    task_hash_telemetry,
                )?;

                let execution_env =
                    self.task_hasher
                        .env(task_id, task_env_mode, task_definition)?;

                Ok(Some((task_id.clone(), task_hash, execution_env)))
            })
            .collect();

        let mut map = results.lock().expect("precompute lock poisoned");
        for result in wave_results {
            if let Some((task_id, hash, env)) = result? {
                map.insert(task_id, (hash, env));
            }
        }
    }

    Ok(Arc::try_unwrap(results)
        .expect("all wave references dropped")
        .into_inner()
        .expect("mutex not poisoned"))
}

#[tracing::instrument(skip_all)]
pub async fn visit(
    &self,
    engine: Arc<Engine>,
    telemetry: &GenericEventBuilder,
) -> Result<Vec<TaskError>, Error> {
    for task in engine.tasks().sorted() {
        self.color_cache.color_for_key(&task.to_string());
    }

    // Pre-compute all task hashes and execution envs in parallel using
    // rayon. Tasks are grouped into topological waves so that each
    // task's dependency hashes are available before it is hashed.
    // This replaces the per-task serial hashing that was inside the
    // dispatch loop.
    let precomputed = {
        let _span = tracing::info_span!("precompute_task_hashes").entered();
        self.precompute_task_hashes(&engine, telemetry)?
    };

    let concurrency = self.run_opts.concurrency as usize;
    let (node_sender, mut node_stream) = mpsc::channel(concurrency);

    let e

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Now I have a thorough picture of the codebase. Let me compile the complete analysis.


Current State

The turborepo team has been on an aggressive performance sprint (Feb 20-23, 2026) with ~15 perf commits landing in 3 days. Key wins already shipped:

| Optimization | Before | After | Gain |
| --- | --- | --- | --- |
| Parallel task hash waves (rayon) | 268ms | 78ms | -71% |
| Shallow wildcard glob fast path | 54ms | 6ms | -89% |
| gix-index replacing libgit2 for status | 397ms | 296ms | -25% |
| turbo.json parallel preloading | 74ms | 43ms | -42% |
| Run summary parallel construction | 92ms | 10ms | -89% |
| TLS init deferred to background | 91ms wait | 0ms | -100% |
| Engine builder extends-chain caching | 283ms | 74ms | -74% |
| Engine callback sent before tracker | 845ms | 804ms | -5% |

The current bottleneck profile on a ~1000-package monorepo is approximately: startup/config ~100ms, git indexing ~300ms, engine build ~43ms, hash precompute ~78ms, task execution (I/O bound). Profiling infrastructure is mature (--profile flag outputs Chromium Trace JSON, turborepo-profile-md crate renders markdown summaries).

Assumption: Benchmarks below reference a large monorepo (~1000 packages, ~1700 tasks). Gains scale with workspace count.


Required Changes

Tier 1 — High Impact (estimated 10-25% wall-clock reduction)

1. Cache artifact pre-fetch during hash precomputation

  • File: crates/turborepo-lib/src/task_graph/visitor/mod.rs:320-323
  • Problem: Task hashes are precomputed in parallel (line 191-303), but cache lookups happen serially per-task in the dispatch loop (line 392-396). Each task blocks on task_cache_new before execution starts.
  • Fix: After precompute_task_hashes completes, spawn a background task that issues cache.exists(hash) or cache.fetch(hash) for all known hashes in parallel. Store results in an Arc<DashMap<String, CacheFetchResult>>. The dispatch loop checks this map instead of doing a blocking fetch.
  • Verify: turbo run build --profile out.json — compare task_cache_new span durations before/after. For remote-cache-heavy workloads, this should collapse N sequential HTTP requests into parallel fetches.
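A minimal std::thread sketch of the pre-fetch map, assuming the dispatch loop only needs a hash-keyed lookup table (the real implementation would use tokio tasks and a DashMap; `fetch_artifact` and `CacheFetchResult` are hypothetical stand-ins for the cache client):

```rust
use std::{
    collections::HashMap,
    sync::{Arc, Mutex},
    thread,
};

// Hypothetical stand-ins: the real lookup goes through RunCache / turborepo-cache.
#[derive(Clone, Debug, PartialEq)]
enum CacheFetchResult {
    Hit,
    Miss,
}

fn fetch_artifact(hash: &str) -> CacheFetchResult {
    // Toy predicate so the sketch is self-contained: even-length "hashes" hit.
    if hash.len() % 2 == 0 {
        CacheFetchResult::Hit
    } else {
        CacheFetchResult::Miss
    }
}

/// Kick off one fetch per known hash after precompute_task_hashes completes;
/// the dispatch loop later reads this map instead of blocking per task.
fn prefetch(hashes: Vec<String>) -> HashMap<String, CacheFetchResult> {
    let results = Arc::new(Mutex::new(HashMap::with_capacity(hashes.len())));
    let handles: Vec<_> = hashes
        .into_iter()
        .map(|hash| {
            let results = Arc::clone(&results);
            thread::spawn(move || {
                let outcome = fetch_artifact(&hash);
                results.lock().unwrap().insert(hash, outcome);
            })
        })
        .collect();
    for handle in handles {
        handle.join().unwrap();
    }
    Arc::try_unwrap(results).unwrap().into_inner().unwrap()
}
```

The join here is for the sketch only; in the real flow the dispatch loop would consult the map opportunistically and fall back to a normal fetch on a pending entry.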

2. Concurrent FS + HTTP cache fetch (speculative dual-read)

  • File: crates/turborepo-cache/src/multiplexer.rs:148-178
  • Problem: Cache fetch is sequential — FS first, then HTTP on miss. FS fetch is fast but not instant (~1-5ms per artifact). For FS misses with remote cache enabled, the HTTP request doesn't start until after FS confirms miss.
  • Fix: When both local.read and remote.read are enabled, launch FS and HTTP fetches concurrently via tokio::join!. Return whichever completes first with a hit. Cancel the other. On FS hit, the HTTP fetch overhead is just the future allocation (negligible).
  • Verify: Profile a fully-remote-cached build. The fetch span in multiplexer should show overlap between FS and HTTP checks. Measure with --profile on a repo where FS cache is cold but remote cache is warm.
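A sketch of the race using std threads and a channel standing in for `tokio::select!` (`Source`, `fetch`, and the timings are illustrative; the real code would race the two futures and cancel the loser):

```rust
use std::{sync::mpsc, thread, time::Duration};

#[derive(Debug, PartialEq)]
enum Source {
    Fs,
    Http,
}

// Hypothetical fetcher: sleeps to simulate I/O latency, then reports hit/miss.
fn fetch(source: Source, delay_ms: u64, hit: bool, tx: mpsc::Sender<(Source, bool)>) {
    thread::spawn(move || {
        thread::sleep(Duration::from_millis(delay_ms));
        let _ = tx.send((source, hit));
    });
}

/// Launch FS and HTTP lookups concurrently; return the first hit, or fall
/// through to the second result if the first responder missed.
fn dual_fetch() -> Option<Source> {
    let (tx, rx) = mpsc::channel();
    fetch(Source::Fs, 1, true, tx.clone()); // local FS: fast
    fetch(Source::Http, 50, true, tx); // remote HTTP: slower
    for _ in 0..2 {
        if let Ok((source, true)) = rx.recv() {
            return Some(source);
        }
    }
    None
}
```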

3. Preflight response caching per origin

  • File: crates/turborepo-api-client/src/lib.rs:291-303, 367-377
  • Problem: Every get_artifact and put_artifact call issues a separate do_preflight OPTIONS request (line 293). For a 500-task build, that's 500+ extra round-trips.
  • Fix: Cache the PreflightResponse per URL origin in an Arc<RwLock<HashMap<String, PreflightResponse>>> with a TTL (e.g., 5 minutes). Subsequent requests to the same origin skip the OPTIONS call.
  • Verify: Count HTTP requests with TURBO_LOG_VERBOSITY=debug. Before: 2N requests (N preflight + N artifact). After: N+1 requests (1 preflight + N artifact).
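A std-only sketch of the per-origin TTL cache (the real code would key on the request URL's origin and store the actual `PreflightResponse` type from turborepo-api-client, which is stubbed here):

```rust
use std::{
    collections::HashMap,
    sync::RwLock,
    time::{Duration, Instant},
};

// Stub for the real PreflightResponse in turborepo-api-client.
#[derive(Clone)]
struct PreflightResponse {
    allow_auth: bool,
}

struct PreflightCache {
    ttl: Duration,
    entries: RwLock<HashMap<String, (Instant, PreflightResponse)>>,
}

impl PreflightCache {
    fn new(ttl: Duration) -> Self {
        Self {
            ttl,
            entries: RwLock::new(HashMap::new()),
        }
    }

    /// Return a fresh cached response for `origin`, or run `fetch` (the
    /// OPTIONS round-trip) once and cache its result.
    fn get_or_fetch(
        &self,
        origin: &str,
        fetch: impl FnOnce() -> PreflightResponse,
    ) -> PreflightResponse {
        if let Some((at, resp)) = self.entries.read().unwrap().get(origin) {
            if at.elapsed() < self.ttl {
                return resp.clone();
            }
        }
        let resp = fetch();
        self.entries
            .write()
            .unwrap()
            .insert(origin.to_string(), (Instant::now(), resp.clone()));
        resp
    }

    /// Drop the cached entry, e.g. after a 403 response.
    fn invalidate(&self, origin: &str) {
        self.entries.write().unwrap().remove(origin);
    }
}
```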

4. Replace HashMap with BTreeMap for global file hash map in capnp serialization

  • File: crates/turborepo-hash/src/lib.rs:413-414, 431-432
  • Problem: Two sort_unstable_by calls on collected HashMap entries every time a global hash is serialized. Sorting is O(n log n) on every hash computation.
  • Fix: Use BTreeMap for global_file_hash_map and engines fields in the hashable struct. Iteration is already sorted.
  • Verify: Run cargo bench (if added) or --profile on a large repo. The LockfilePackages and GlobalHashable serialization spans should shrink.
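A small sketch contrasting the two iteration strategies (the entry data is illustrative, not the real hashable struct):

```rust
use std::collections::{BTreeMap, HashMap};

/// HashMap path: collect + sort on every serialization (what the code does today).
fn sorted_via_hashmap(entries: &[(&'static str, &'static str)]) -> Vec<&'static str> {
    let map: HashMap<_, _> = entries.iter().copied().collect();
    let mut pairs: Vec<_> = map.into_iter().collect();
    pairs.sort_unstable_by(|a, b| a.0.cmp(b.0)); // O(n log n) per hash computation
    pairs.into_iter().map(|(k, _)| k).collect()
}

/// BTreeMap path: iteration is already key-ordered, so the sort step disappears.
fn sorted_via_btreemap(entries: &[(&'static str, &'static str)]) -> Vec<&'static str> {
    let map: BTreeMap<_, _> = entries.iter().copied().collect();
    map.into_keys().collect()
}
```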

Tier 2 — Medium Impact (estimated 3-10% wall-clock reduction)

5. Use Arc<EnvironmentVariableMap> instead of cloning execution env

  • File: crates/turborepo-lib/src/task_graph/visitor/mod.rs:385-388
  • Problem: precomputed.get(&info).clone() clones the entire EnvironmentVariableMap (a HashMap<String, String>) for every task dispatch. For large env maps, this is expensive.
  • Fix: Store Arc<EnvironmentVariableMap> in the precomputed map. Change the dispatch code to Arc::clone() instead of deep clone. This requires adjusting exec_context() to accept Arc<EnvironmentVariableMap>.
  • Verify: Use --profile and check queue_task span duration. Should drop by the clone cost per task.
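A sketch of the ownership change, assuming EnvironmentVariableMap is map-like (the real type is a newtype in turborepo-env; the type alias here is a stand-in):

```rust
use std::{collections::HashMap, sync::Arc};

// Stand-in alias; the real EnvironmentVariableMap wraps a map of env vars.
type EnvironmentVariableMap = HashMap<String, String>;

/// Today: deep clone copies every key and value String on each task dispatch.
fn dispatch_owned(env: &EnvironmentVariableMap) -> EnvironmentVariableMap {
    env.clone()
}

/// Proposed: one atomic refcount bump, the underlying map is shared.
fn dispatch_shared(env: &Arc<EnvironmentVariableMap>) -> Arc<EnvironmentVariableMap> {
    Arc::clone(env)
}
```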

6. Upgrade api_auth Mutex to RwLock

  • File: crates/turborepo-cache/src/http.rs:30, 67
  • Problem: Arc<Mutex<APIAuth>> is locked on every cache read/write. Reads vastly outnumber writes (token refreshes are rare). Under high concurrency, this serializes cache operations.
  • Fix: Change to Arc<tokio::sync::RwLock<APIAuth>>. Read path uses .read(), token refresh uses .write().
  • Verify: Profile a highly concurrent build (e.g., --concurrency=32) and measure lock contention via tracing spans around the auth read.
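A std::sync::RwLock sketch of the read/write split (the real field would use tokio::sync::RwLock because the call sites are async; `ApiAuth` is a stub for the real APIAuth):

```rust
use std::sync::{Arc, RwLock};

// Stub; the real APIAuth in turborepo-cache carries the token and team info.
struct ApiAuth {
    token: String,
}

/// Hot path: many concurrent cache requests share the read lock.
fn auth_header(auth: &Arc<RwLock<ApiAuth>>) -> String {
    format!("Bearer {}", auth.read().unwrap().token)
}

/// Rare path: token refresh takes the exclusive write lock.
fn refresh_token(auth: &Arc<RwLock<ApiAuth>>, new_token: &str) {
    auth.write().unwrap().token = new_token.to_string();
}
```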

7. Batch cache existence checks

  • File: crates/turborepo-cache/src/multiplexer.rs:182-208, crates/turborepo-api-client/src/lib.rs:327-336
  • Problem: Each task checks cache existence individually. With remote cache, this is one HEAD request per task.
  • Fix: After hash precomputation, batch all hashes into a single "multi-exists" check. If the Vercel API doesn't support batch exists, implement client-side batching: launch N concurrent HEAD requests bounded by a semaphore.
  • Verify: Network traffic should show concurrent HEAD requests instead of sequential ones. Measure with tcpdump or API logs.
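A sketch of client-side bounded batching using a channel of tokens as a makeshift semaphore (the real code would use tokio::sync::Semaphore with async HEAD requests; `artifact_exists` is a stand-in):

```rust
use std::{
    sync::{mpsc, Arc, Mutex},
    thread,
};

// Hypothetical existence check; the real one is a HEAD request per hash.
fn artifact_exists(hash: &str) -> bool {
    hash.len() % 2 == 0
}

/// Check all hashes with at most `limit` checks in flight. Each worker must
/// receive a token before starting and returns it when done.
fn batch_exists(hashes: Vec<String>, limit: usize) -> Vec<(String, bool)> {
    let (token_tx, token_rx) = mpsc::channel();
    for _ in 0..limit {
        token_tx.send(()).unwrap();
    }
    let token_rx = Arc::new(Mutex::new(token_rx));
    let results = Arc::new(Mutex::new(Vec::new()));
    let handles: Vec<_> = hashes
        .into_iter()
        .map(|hash| {
            let token_rx = Arc::clone(&token_rx);
            let token_tx = token_tx.clone();
            let results = Arc::clone(&results);
            thread::spawn(move || {
                token_rx.lock().unwrap().recv().unwrap(); // acquire a slot
                let exists = artifact_exists(&hash);
                token_tx.send(()).unwrap(); // release the slot
                results.lock().unwrap().push((hash, exists));
            })
        })
        .collect();
    for handle in handles {
        handle.join().unwrap();
    }
    let mut out = Arc::try_unwrap(results).unwrap().into_inner().unwrap();
    out.sort();
    out
}
```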

8. Eliminate redundant to_string() allocations in hot paths

  • Files:
    • crates/turborepo-lib/src/task_graph/visitor/mod.rs:311-312 — task.to_string() for color cache
    • crates/turborepo-lib/src/task_graph/visitor/mod.rs:432 — info.to_string() for UI task
    • crates/turborepo-daemon/src/server.rs:630-631 — path to string in workspace loop
  • Problem: Allocates new String on every call in a loop iterated once per task.
  • Fix: Accept &str or Display in color cache API. Use Cow<str> where appropriate.
  • Verify: Flame graph should show reduced allocator activity in these functions.
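A sketch of a borrow-based cache API (`ColorCache` is a hypothetical stand-in for ColorSelector; the point is that `&str` lookups hit the map without allocating, and a `String` is built only the first time a key is seen):

```rust
use std::collections::HashMap;

struct ColorCache {
    assigned: HashMap<String, u8>,
    next: u8,
}

impl ColorCache {
    fn new() -> Self {
        Self {
            assigned: HashMap::new(),
            next: 0,
        }
    }

    /// Lookup borrows the key; allocation happens only on first insertion.
    fn color_for_key(&mut self, key: &str) -> u8 {
        if let Some(&color) = self.assigned.get(key) {
            return color;
        }
        let color = self.next;
        self.next = self.next.wrapping_add(1);
        self.assigned.insert(key.to_string(), color);
        color
    }
}
```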

Tier 3 — Low Impact but Low Risk (estimated 1-3% improvement)

9. Pre-allocate FuturesUnordered and HashMap capacity hints

  • File: crates/turborepo-lib/src/task_graph/visitor/mod.rs:332
  • Problem: FuturesUnordered::new() grows dynamically. HashMap::new() without capacity hint causes rehashing.
  • Fix: FuturesUnordered doesn't support with_capacity, but ensure all HashMap/Vec allocations in the hot path use with_capacity(sorted.len()).

10. Add jitter to retry backoff

  • File: crates/turborepo-api-client/src/retry.rs:69-71
  • Problem: Fixed exponential backoff (2s, 4s, 10s) causes thundering herd when multiple concurrent requests fail simultaneously.
  • Fix: Add random jitter: sleep_period + rand::random::<u64>() % 1000.
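A sketch of the jittered backoff, using clock nanoseconds as a stand-in entropy source (the real change would use the rand crate as the fix suggests; the base schedule here is illustrative, not the exact retry.rs constants):

```rust
use std::time::{Duration, SystemTime, UNIX_EPOCH};

/// Exponential backoff plus up to 1s of jitter so simultaneous failures
/// don't all retry in lockstep.
fn backoff_with_jitter(attempt: u32) -> Duration {
    let base_ms = 2000u64.saturating_mul(1 << attempt.min(4));
    // Cheap entropy source for the sketch; use rand::random in real code.
    let nanos = SystemTime::now()
        .duration_since(UNIX_EPOCH)
        .unwrap()
        .subsec_nanos() as u64;
    Duration::from_millis(base_ms + nanos % 1000)
}
```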

11. Use Arc<str> for task hash throughout (extend existing pattern)

  • File: crates/turborepo-lib/src/task_graph/visitor/mod.rs:195 — currently String
  • Problem: Task hashes are String cloned at lines 274, 287, 293, 385-388.
  • Fix: Store as Arc<str> in the precomputed map (commit 56329a6d0f already did this for dependency hashes; extend to task hashes).

Ripple Effects

| Change | Affected Crates | Risk |
| --- | --- | --- |
| Cache pre-fetch (#1) | turborepo-cache, turborepo-lib/task_graph | Medium — changes execution timing; tasks might see partial pre-fetch results. Needs careful cancellation on error. |
| Dual FS+HTTP fetch (#2) | turborepo-cache/multiplexer | Low — both paths already exist; just timing changes. Must handle case where FS fetch errors are swallowed if HTTP succeeds. |
| Preflight caching (#3) | turborepo-api-client | Low — additive change. TTL prevents stale auth. Must invalidate on 403. |
| BTreeMap for hashes (#4) | turborepo-hash | Low — pure data structure swap. Insertion is O(log n) vs O(1) amortized, but iteration wins dominate since hashing iterates every entry. |
| Arc<EnvironmentVariableMap> (#5) | turborepo-lib/task_graph, turborepo-task-executor | Medium — requires API changes to exec_context() and downstream consumers that currently expect owned maps. |
| RwLock for api_auth (#6) | turborepo-cache/http | Low — drop-in replacement. Behavior identical under no contention. |
| to_string() elimination (#8) | Multiple crates | Low — mechanical refactor. Tests catch any breakage. |

Critical correctness concern for #1 (cache pre-fetch): If a task's hash depends on a dependency task's output (not just its hash), pre-fetching cache artifacts before execution could race with dependency completion. However, since turborepo's cache keys are derived from inputs (file hashes, env vars, dep hashes) — not outputs — pre-fetching by hash is safe. The hash is known before execution, and the cache artifact is immutable for that hash.


What This Unlocks

  1. Sub-second dry runs on large monorepos: With the existing wave-parallel hashing + the proposed pre-fetch, turbo run build --dry should complete in <500ms for 1000+ packages (currently ~775ms).

  2. Near-zero remote cache latency for warm builds: Dual FS+HTTP fetch + preflight caching + batched existence checks means a fully-cached remote build's overhead drops from ~N×(preflight + HEAD + GET) to ~(1 preflight + N concurrent GETs). For 500 tasks at 50ms/request, this saves ~25 seconds.

  3. Profiling-driven continuous improvement: The existing --profile + turborepo-profile-md infrastructure makes it trivial to measure each change. Future contributors can follow the established pattern: profile → identify span → optimize → measure.

  4. Foundation for incremental/watch mode performance: Many of these optimizations (cached env maps, pre-sorted hashes, Arc-based sharing) reduce per-invocation overhead, which compounds in watch mode where turbo re-evaluates on every file change.


Effort & Risk

| # | Change | Effort | Risk | Verification |
| --- | --- | --- | --- | --- |
| 1 | Cache pre-fetch | 2-3 days | Medium | --profile before/after; integration tests with remote cache mock |
| 2 | Dual FS+HTTP fetch | 1 day | Low | Unit tests in multiplexer.rs; --profile with cold FS / warm HTTP |
| 3 | Preflight caching | 0.5 day | Low | Existing API client tests; count HTTP requests in debug log |
| 4 | BTreeMap for hash maps | 0.5 day | Low | Existing hash snapshot tests (assert_json_snapshot! in tests) |
| 5 | Arc<EnvironmentVariableMap> | 1 day | Medium | Existing visitor/executor tests; check queue_task span |
| 6 | RwLock for api_auth | 0.5 day | Low | Existing http.rs tests (lines 600+) |
| 7 | Batch cache checks | 1-2 days | Medium | New integration test; network traffic analysis |
| 8 | to_string() cleanup | 0.5 day | Low | Compilation + existing tests |
| 9 | Capacity hints | 0.5 day | Negligible | Compilation only |
| 10 | Retry jitter | 0.5 day | Negligible | Unit test in retry.rs |
| 11 | Arc<str> for task hash | 0.5 day | Low | Existing hash tests |

How to verify any perf change without breaking anything:

# 1. Profile baseline
turbo run build --dry --profile baseline.json
cat baseline.json.md  # human-readable summary

# 2. Apply change, profile again
turbo run build --dry --profile after.json
cat after.json.md

# 3. Compare specific spans
diff <(jq '.traceEvents[] | select(.name=="queue_task") | .dur' baseline.json | sort -n) \
     <(jq '.traceEvents[] | select(.name=="queue_task") | .dur' after.json | sort -n)

# 4. Run the full test suite to catch regressions
cargo test --workspace

# 5. Run integration tests specifically
cargo test -p turborepo-lib --test integration
cargo test -p turborepo-cache
cargo test -p turborepo-engine

# 6. For cache changes, run with remote cache enabled
TURBO_TEAM=... TURBO_TOKEN=... turbo run build --remote-only --profile remote.json

Recommended implementation order: #3 (preflight cache, easiest win) → #4 (BTreeMap, trivial) → #6 (RwLock) → #2 (dual fetch) → #5 (Arc env) → #1 (pre-fetch, highest impact but most complex) → #7 (batch checks).


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.

Exploration

AREA: engine-builder-parallel-config-loading
FILES: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs
MATURITY: stable
RELEVANCE: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
NOTES: (unspecified)


Selected Code Context

===== FILE: crates/turborepo-engine/src/builder.rs =====
//! Engine builder for constructing task graphs from turbo.json configurations.
//!
//! This module provides EngineBuilder which constructs task graphs by:
//! - Loading turbo.json configurations via the TurboJsonLoader trait
//! - Resolving task dependencies through the extends chain
//! - Validating task definitions and dependencies
//! - Building the final execution engine

use std::collections::{HashMap, HashSet, VecDeque};

use itertools::Itertools;
use miette::{NamedSource, SourceSpan};
use turbopath::{AbsoluteSystemPath, AnchoredSystemPathBuf, RelativeUnixPathBuf};
use turborepo_errors::Spanned;
use turborepo_graph_utils as graph;
use turborepo_repository::package_graph::{PackageGraph, PackageName, PackageNode, ROOT_PKG_NAME};
use turborepo_task_id::{TaskId, TaskName};
use turborepo_turbo_json::{
    FutureFlags, HasConfigBeyondExtends, ProcessedTaskDefinition, RawTaskDefinition, TurboJson,
    Validator,
};
use turborepo_types::TaskDefinition;

use crate::{
    BuilderError, Building, CyclicExtends, Engine, MissingPackageFromTaskError,
    MissingPackageTaskError, MissingRootTaskInTurboJsonError, MissingTaskError,
    MissingTurboJsonExtends, TaskDefinitionFromProcessed, TaskDefinitionResult, TurboJsonLoader,
    validate_task_name,
};

/// Builder for constructing a task execution engine.
///
/// The EngineBuilder is generic over L: TurboJsonLoader to allow different
/// implementations of configuration loading (filesystem, in-memory for tests,
/// etc.)
pub struct EngineBuilder<'a, L: TurboJsonLoader> {
    repo_root: &'a AbsoluteSystemPath,
    package_graph: &'a PackageGraph,
    turbo_json_loader: Option<&'a L>,
    is_single: bool,
    workspaces: Vec<PackageName>,
    tasks: Vec<Spanned<TaskName<'static>>>,
    root_enabled_tasks: HashSet<TaskName<'static>>,
    tasks_only: bool,
    add_all_tasks: bool,
    should_validate_engine: bool,
    validator: Validator,
}

impl<'a, L: TurboJsonLoader> EngineBuilder<'a, L> {

pub fn new(
    repo_root: &'a AbsoluteSystemPath,
    package_graph: &'a PackageGraph,
    turbo_json_loader: &'a L,
    is_single: bool,
) -> Self {
    Self {
        repo_root,
        package_graph,
        turbo_json_loader: Some(turbo_json_loader),
        is_single,
        workspaces: Vec::new(),
        tasks: Vec::new(),
        root_enabled_tasks: HashSet::new(),
        tasks_only: false,
        add_all_tasks: false,
        should_validate_engine: true,
        validator: Validator::new(),
    }
}

pub fn with_future_flags(mut self, future_flags: FutureFlags) -> Self {
    self.validator = self.validator.with_future_flags(future_flags);
    self
}

pub fn with_tasks_only(mut self, tasks_only: bool) -> Self {
    self.tasks_only = tasks_only;
    self
}

pub fn with_root_tasks<I: IntoIterator<Item = TaskName<'static>>>(mut self, tasks: I) -> Self {
    self.root_enabled_tasks = tasks
        .into_iter()
        .filter(|name| name.package() == Some(ROOT_PKG_NAME))
        .map(|name| name.into_non_workspace_task())
        .collect();
    self
}

pub fn with_workspaces(mut self, workspaces: Vec<PackageName>) -> Self {
    self.workspaces = workspaces;
    self
}

pub fn with_tasks<I: IntoIterator<Item = Spanned<TaskName<'static>>>>(
    mut self,
    tasks: I,
) -> Self {
    self.tasks = tasks.into_iter().collect();
    self
}

/// If set, we will include all tasks in the graph, even if they are not
/// specified
pub fn add_all_tasks(mut self) -> Self {
    self.add_all_tasks = true;
    self
}

pub fn do_not_validate_engine(mut self) -> Self {
    self.should_validate_engine = false;
    self
}

// Returns the set of allowed tasks that can be run if --only is used
// The set is exactly the product of the packages in filter and tasks specified
// by CLI
fn allowed_tasks(&self) -> Option<HashSet<TaskId<'static>>> {
    if self.tasks_only {
        Some(
            self.workspaces
                .iter()
                .cartesian_product(self.tasks.iter())
                .map(|(package, task_name)| {
                    task_name
                        .task_id()
                        .unwrap_or(TaskId::new(package.as_ref(), task_name.task()))
                        .into_owned()
                })
                .collect(),
        )
    } else {
        None
    }
}

pub fn build(mut self) -> Result<Engine<crate::Built, TaskDefinition>, BuilderError> {
    // If there are no affected packages, we don't need to go through all this work
    // we can just exit early.
    // TODO(mehulkar): but we still need to validate bad task names?
    if self.workspaces.is_empty() {
        return Ok(Engine::default().seal());
    }

    let turbo_json_loader = self
        .turbo_json_loader
        .take()
        .expect("engine builder cannot be constructed without TurboJsonLoader");
    let mut missing_tasks: HashMap<&TaskName<'_>, Spanned<()>> =
        HashMap::from_iter(self.tasks.iter().map(|spanned| spanned.as_ref().split()));
    let mut traversal_queue = VecDeque::with_capacity(1);
    let tasks: Vec<Spanned<TaskName<'static>>> = if self.add_all_tasks {
        let mut tasks_set = HashSet::new();

        // Collect tasks from root and its extends chain
        let root_tasks =
            TaskInheritanceResolver::new(turbo_json_loader).resolve(&PackageName::Root)?;
        tasks_set.extend(root_tasks);

        // Collect tasks from each workspace and its extends chain
        for workspace in self.workspaces.iter() {
            let workspace_tasks =
                TaskInheritanceResolver::new(turbo_json_loader).resolve(workspace)?;
            tasks_set.extend(workspace_tasks);
        }

        tasks_set.into_iter().map(Spanned::new).collect()
    } else {
        self.tasks.clone()
    };

    for (workspace, task) in self.workspaces.iter().cartesian_product(tasks.iter()) {
        let task_id = task
            .task_id()
            .unwrap_or_else(|| TaskId::new(workspace.as_ref(), task.task()));

        if Self::has_task_definition_in_run(turbo_json_loader, workspace, task, &task_id)? {
            missing_tasks.remove(task.as_inner());

            // Even if a task definition was found, we _only_ want to add it as an entry
            // point to the task graph (i.e. the traversalQueue), if
            // it's:
            // - A task from the non-root workspace (i.e. tasks from every other workspace)
            // - A task that we *know* is rootEnabled task (in which case, the root
            //   workspace is acceptable)
            if !matches!(workspace, PackageName::Root) || self.root_enabled_tasks.contains(task)
            {
                let task_id = task.to(task_id);
                traversal_queue.push_back(task_id);
            }
        }
    }

    {
        // We can encounter IO errors trying to load turbo.jsons which prevents using
        // `retain` in the standard way. Instead we store the possible error
        // outside of the loop and short circuit checks if we've encountered an error.
        let mut error = None;
        missing_tasks.retain(|task_name, _| {
            // If we've already encountered an error skip checking the rest.
            if error.is_some() {
                return true;
            }
            match Self::has_task_definition_in_repo(
                turbo_json_loader,
                self.package_graph,
                task_name,
            ) {
                Ok(has_defn) => !has_defn,
                Err(e) => {
                    error.get_or_insert(e);
                    true
                }
            }
        });
        if let Some(err) = error {
            return Err(err);
        }
    }

    if !missing_tasks.is_empty() {
        let missing_pkgs: HashMap<_, _> = missing_tasks
            .iter()
            .filter_map(|(task, _)| {
                let pkg = task.package()?;
                let missing_pkg = self
                    .package_graph
                    .package_info(&PackageName::from(pkg))
                    .is_none();
                missing_pkg.then(|| (task.to_string(), pkg.to_string()))
            })
            .collect();
        let mut missing_tasks = missing_tasks
            .into_iter()
            .map(|(task_name, span)| (task_name.to_string(), span))
            .collect::<Vec<_>>();
        // We sort the tasks mostly to keep it deterministic for our tests
        missing_tasks.sort_by(|a, b| a.0.cmp(&b.0));
        let errors = missing_tasks
            .into_iter()
            .map(|(name, span)| {
                if let Some(pkg) = missing_pkgs.get(&name) {
                    MissingTaskError::MissingPackage { name: pkg.clone() }
                } else {
                    let (span, text) = span.span_and_text("turbo.json");
                    MissingTaskError::MissingTaskDefinition { name, span, text }
                }
            })
            .collect();

        return Err(BuilderError::MissingTasks(errors));
    }

    let allowed_tasks = self.allowed_tasks();

    let mut visited = HashSet::new();
    let mut engine: Engine<Building, TaskDefinition> = Engine::default();
    let mut turbo_json_chain_cache: HashMap<PackageName, Vec<&TurboJson>> = HashMap::new();

    while let Some(task_id) = traversal_queue.pop_front() {
        {
            let (task_id, span) = task_id.clone().split();
            engine.add_task_location(task_id.into_owned(), span);
        }

        // Skip before doing expensive work if we've already processed this task.
        if visited.contains(task_id.as_inner()) {
            continue;
        }

        // For root tasks, verify they are either explicitly enabled OR (when using
        // add_all_tasks mode like devtools) have a definition in root turbo.json.
        // Tasks defined without the //#  prefix (like "transit") in root turbo.json
        // are valid root tasks when referenced as dependencies in add_all_tasks mode.
        if task_id.package() == ROOT_PKG_NAME
            && !self
                .root_enabled_tasks
                .contains(&task_id.as_non_workspace_task_name())
        {
            // In add_all_tasks mode (devtools), allow root tasks that have a definition
            // in turbo.json even if not explicitly in root_enabled_tasks
            let should_allow = if self.add_all_tasks {
                let task_name: TaskName<'static> =
                    TaskName::from(task_id.task().to_string()).into_owned();
                let task_id_owned = task_id.as_inner().clone().into_owned();
                Self::has_task_definition_in_run(
                    turbo_json_loader,
                    &PackageName::Root,
                    &task_name,
                    &task_id_owned,
                )?
            } else {
                false
            };

            if !should_allow {
                let (span, text) = task_id.span_and_text("turbo.json");
                return Err(BuilderError::MissingRootTaskInTurboJson(Box::new(
                    MissingRootTaskInTurboJsonError {
                        span,
                        text,
                        task_id: task_id.to_string(),
                    },
                )));
            }
        }

        validate_task_name(task_id.to(task_id.task()))?;

        if task_id.package() != ROOT_PKG_NAME
            && self
                .package_graph
                .package_json(&PackageName::from(task_id.package()))
                .is_none()
        {
            // If we have a pkg it should be in PackageGraph.
            // If we're hitting this error something has gone wrong earlier when building
            // PackageGraph or the package really doesn't exist and
            // turbo.json is misconfigured.
            let (span, text) = task_id.span_and_text("turbo.json");
            return Err(BuilderError::MissingPackageFromTask(Box::new(
                MissingPackageFromTaskError {
                    span,
                    text,
                    package: task_id.package().to_string(),
                    task_id: task_id.to_string(),
                },
            )));
        }

        let task_definition = self.task_definition_cached(
            turbo_json_loader,
            &task_id,
            &task_id.as_non_workspace_task_name(),
            &mut turbo_json_chain_cache,
        )?;

        visited.insert(task_id.as_inner().clone());

        // Note that the Go code has a whole if/else statement for putting
        // stuff into deps or calling e.AddDep. The bool cannot be true, so we
        // skip straight to just doing deps.
        let deps = task_definition
            .task_dependencies
            .iter()
            .map(|spanned| spanned.as_ref().split())
            .collect::<HashMap<_, _>>();
        let topo_deps = task_definition
            .topological_dependencies
            .iter()
            .map(|spanned| spanned.as_ref().split())
            .collect::<HashMap<_, _>>();

        // Don't ask why, but for some reason we refer to the source as "to"
        // and the target node as "from"
        let to_task_id = task_id.as_inner().clone().into_owned();
        let to_task_index = engine.get_index(&to_task_id);

        let dep_pkgs = self
            .package_graph
            .immediate_dependencies(&PackageNode::Workspace(to_task_id.package().into()));

        let mut has_deps = false;
        let mut has_topo_deps = false;

        topo_deps
            .iter()
            .cartesian_product(dep_pkgs.iter().flatten())
            .for_each(|((from, span), dependency_workspace)| {
                // We don't need to add an edge from the root node if we're in this branch
                if let PackageNode::Workspace(dependency_workspace) = dependency_workspace {
                    let from_task_id = TaskId::from_graph(dependency_workspace, from);
                    if let Some(allowed_tasks) = &allowed_tasks
                        && !allowed_tasks.contains(&from_task_id)
                    {
                        return;
                    }
                    let from_task_index = engine.get_index(&from_task_id);
                    has_topo_deps = true;
                    engine
                        .task_graph_mut()
                        .add_edge(to_task_index, from_task_index, ());
                    let from_task_id = span.to(from_task_id);
                    traversal_queue.push_back(from_task_id);
                }
            });

        for (sibling, span) in task_definition
            .with
            .iter()
            .flatten()
            .map(|s| s.as_ref().split())
        {
            let sibling_task_id = sibling
                .task_id()
                .unwrap_or_else(|| TaskId::new(to_task_id.package(), sibling.task()))
                .into_owned();
            traversal_queue.push_back(span.to(sibling_task_id));
        }

        for (dep, span) in deps {
            let from_task_id = dep
                .task_id()
                .unwrap_or_else(|| TaskId::new(to_task_id.package(), dep.task()))
                .into_owned();
            if let Some(allowed_tasks) = &allowed_tasks
                && !allowed_tasks.contains(&from_task_id)
            {
                continue;
            }
            has_deps = true;
            let from_task_index = engine.get_index(&from_task_id);
            engine
                .task_graph_mut()
                .add_edge(to_task_index, from_task_index, ());
            let from_task_id = span.to(from_task_id);
            traversal_queue.push_back(from_task_id);
        }

        engine.add_definition(task_id.as_inner().clone().into_owned(), task_definition);
        if !has_deps && !has_topo_deps {
            engine.connect_to_root(&to_task_id);
        }
    }

    graph::validate_graph(engine.task_graph_mut())?;

    Ok(engine.seal())
}
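// Illustrative sketch (hypothetical task ids, not from a real repo): with
// `web#build` declaring `"dependsOn": ["^build"]` and the `web` package
// depending on package `ui`, the topo-deps loop above ends up adding:
//
//   engine.task_graph_mut().add_edge(web_build_index, ui_build_index, ());
//
// i.e. the edge runs from the task being scheduled ("to") toward the task it
// waits on ("from"), matching the inverted naming noted in the loop.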

// Helper methods used when building the engine
/// Checks if there's a task definition somewhere in the repository
pub fn has_task_definition_in_repo(
    loader: &L,
    package_graph: &PackageGraph,
    task_name: &TaskName<'static>,
) -> Result<bool, BuilderError> {
    for (package, _) in package_graph.packages() {
        let task_id = task_name
            .task_id()
            .unwrap_or_else(|| TaskId::new(package.as_str(), task_name.task()));
        if Self::has_task_definition_in_run(loader, package, task_name, &task_id)? {
            return Ok(true);
        }
    }
    Ok(false)
}

/// Checks if there's a task definition in the current run
pub fn has_task_definition_in_run(
    loader: &L,
    workspace: &PackageName,
    task_name: &TaskName<'static>,
    task_id: &TaskId,
) -> Result<bool, BuilderError> {
    let result = Self::has_task_definition_in_run_inner(
        loader,
        workspace,
        task_name,
        task_id,
        &mut HashSet::new(),
    )?;
    Ok(result.has_definition())
}

fn has_task_definition_in_run_inner(
    loader: &L,
    workspace: &PackageName,
    task_name: &TaskName<'static>,
    task_id: &TaskId,
    visited: &mut HashSet<PackageName>,
) -> Result<TaskDefinitionResult, BuilderError> {
    // Avoid infinite loops from cyclic extends
    if visited.contains(workspace) {
        return Ok(TaskDefinitionResult::not_found());
    }
    visited.insert(workspace.clone());

    let turbo_json = loader.load(workspace).map_or_else(
        |err| {
            if err.is_no_turbo_json() && !matches!(workspace, PackageName::Root) {
                Ok(None)
            } else {
                Err(err)
            }
        },
        |turbo_json| Ok(Some(turbo_json)),
    )?;

    let Some(turbo_json) = turbo_json else {
        // If there was no turbo.json in the workspace, fallback to the root turbo.json
        return Self::has_task_definition_in_run_inner(
            loader,
            &PackageName::Root,
            task_name,
            task_id,
            visited,
        );
    };

    let task_id_as_name = task_id.as_task_name();

    // Helper to check task definition status based on extends configuration
    let check_task_def = |task_def: &RawTaskDefinition| -> TaskDefinitionResult {
        let has_extends_false = task_def
            .extends
            .as_ref()
            .map(|e| !*e.as_inner())
            .unwrap_or(false);

        if has_extends_false && !task_def.has_config_beyond_extends() {
            // Task is explicitly excluded via `extends: false` with no config
            TaskDefinitionResult::excluded()
        } else {
            // Task exists (either with `extends: false` + config, or normal definition)
            TaskDefinitionResult::found()
        }
    };

    // Check if this package's turbo.json has the task defined under various key
    // formats
    let base_task_name = TaskName::from(task_name.task());
    let check_base_task = matches!(workspace, PackageName::Root)
        || workspace == &PackageName::from(task_id.package());

    // Try task keys in order of specificity: task_id, task_name, base_task_name
    let task_def = turbo_json
        .tasks
        .get(&task_id_as_name)
        .or_else(|| turbo_json.tasks.get(task_name))
        .or_else(|| {
            if check_base_task {
                turbo_json.tasks.get(&base_task_name)
            } else {
                None
            }
        });

    if let Some(task_def) = task_def {
        return Ok(check_task_def(task_def));
    }

    // Check the extends chain for the task definition
    // Track if any package in the chain excluded this task
    for extend in turbo_json.extends.as_inner().iter() {
        let extend_package = PackageName::from(extend.as_str());
        let result = Self::has_task_definition_in_run_inner(
            loader,
            &extend_package,
            task_name,
            task_id,
            visited,
        )?;
        // If any package in the chain excluded this task, propagate that exclusion
        if result.is_excluded() {
            return Ok(TaskDefinitionResult::excluded());
        }
        if result.has_definition() {
            return Ok(TaskDefinitionResult::found());
        }
    }

    // This fallback only applies when there's no explicit `extends` field.
    // If `extends` is present (even if it only contains non-root packages),
    // we don't implicitly fall back to root since the validator ensures
    // the extends chain will eventually reach root.
    if turbo_json.extends.is_empty() && !matches!(workspace, PackageName::Root) {
        return Self::has_task_definition_in_run_inner(
            loader,
            &PackageName::Root,
            task_name,
            task_id,
            visited,
        );
    }

    Ok(TaskDefinitionResult::not_found())
}
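// Illustrative sketch (hypothetical turbo.jsons, not taken from a real repo)
// of how `extends: false` exclusions propagate:
//
//   root:   { "tasks": { "lint": {} } }
//   pkg-b:  { "extends": ["//"], "tasks": { "lint": { "extends": false } } }
//   pkg-a:  { "extends": ["pkg-b"] }
//
// Resolving `lint` for pkg-a recurses into pkg-b, which reports the task as
// excluded; that exclusion is returned immediately rather than falling back
// to root, so `has_task_definition_in_run` reports no definition for pkg-a.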

/// Resolves the merged `TaskDefinition` for a task, caching the turbo.json
/// chain per package. The chain only depends on the package name (not the
/// task), so multiple tasks in the same package share the cached chain.
fn task_definition_cached<'b>(
    &self,
    turbo_json_loader: &'b L,
    task_id: &Spanned<TaskId>,
    task_name: &TaskName,
    chain_cache: &mut HashMap<PackageName, Vec<&'b TurboJson>>,
) -> Result<TaskDefinition, BuilderError> {
    let processed_task_definition = ProcessedTaskDefinition::from_iter(
        self.task_definition_chain_cached(turbo_json_loader, task_id, task_name, chain_cache)?,
    );
    let path_to_root = self.path_to_root(task_id.as_inner())?;
    TaskDefinition::from_processed(processed_task_definition, &path_to_root)
}

/// Like `task_definition_chain` but caches the turbo.json chain per
/// package.
fn task_definition_chain_cached<'b>(
    &self,
    turbo_json_loader: &'b L,
    task_id: &Spanned<TaskId>,
    task_name: &TaskName,
    chain_cache: &mut HashMap<PackageName, Vec<&'b TurboJson>>,
) -> Result<Vec<ProcessedTaskDefinition>, BuilderError> {
    let package_name = PackageName::from(task_id.package());
    let turbo_json_chain = match chain_cache.get(&package_name) {
        Some(cached) => cached.clone(),
        None => {
            let chain = self.turbo_json_chain(turbo_json_loader, &package_name)?;
            chain_cache.insert(package_name, chain.clone());
            chain
        }
    };

    Self::resolve_task_definitions_from_chain(
        turbo_json_chain,
        task_id,
        task_name,
        self.is_single,
        self.should_validate_engine,
    )
}

pub fn task_definition_chain(
    &self,
    turbo_json_loader: &L,
    task_id: &Spanned<TaskId>,
    task_name: &TaskName,
) -> Result<Vec<ProcessedTaskDefinition>, BuilderError> {
    let package_name = PackageName::from(task_id.package());
    let turbo_json_chain = self.turbo_json_chain(turbo_json_loader, &package_name)?;
    Self::resolve_task_definitions_from_chain(
        turbo_json_chain,
        task_id,
        task_name,
        self.is_single,
        self.should_validate_engine,
    )
}

/// Given a resolved turbo.json chain for a package, extract the task
/// definitions for a specific task by walking the chain and handling
/// `extends: false`.
fn resolve_task_definitions_from_chain(
    turbo_json_chain: Vec<&TurboJson>,
    task_id: &Spanned<TaskId>,
    task_name: &TaskName,
    is_single: bool,
    should_validate_engine: bool,
) -> Result<Vec<ProcessedTaskDefinition>, BuilderError> {
    let mut task_definitions = Vec::new();

    // Find the first package in the chain (iterating in reverse from leaf to root)
    // that has `extends: false` for this task. This stops inheritance from earlier
    // packages.
    let mut extends_false_index: Option<usize> = None;
    for (index, turbo_json) in turbo_json_chain.iter().enumerate().rev() {
        if let Some(task_def) = turbo_json.tasks.get(task_name)
            && task_def
                .extends
                .as_ref()
                .map(|e| !*e.as_inner())
                .unwrap_or(false)
        {
            // Found `extends: false` for this task in this package
            extends_false_index = Some(index);
            break;
        }
    }

    // If we found extends: false, only process from that point onwards
    if let Some(index) = extends_false_index {
        if let Some(turbo_json) = turbo_json_chain.get(index)
            && let Some(local_def) = turbo_json.task(task_id, task_name)?
            && local_def.has_config_beyond_extends()
        {
            task_definitions.push(local_def);
        }
        // Process any packages after this one (towards the leaf)
        for turbo_json in turbo_json_chain.iter().skip(index + 1) {
            if let Some(workspace_def) = turbo_json.task(task_id, task_name)? {
                task_definitions.push(workspace_def);
            }
        }
        return Ok(task_definitions);
    }

    // Normal inheritance path
    let mut turbo_json_chain = turbo_json_chain.into_iter();

    if let Some(root_definition) = turbo_json_chain
        .next()
        .expect("root turbo.json is always in chain")
        .task(task_id, task_name)?
    {
        task_definitions.push(root_definition)
    }

    if is_single {
        return match task_definitions.is_empty() {
            true => {
                let (span, text) = task_id.span_and_text("turbo.json");
                Err(BuilderError::MissingRootTaskInTurboJson(Box::new(
                    MissingRootTaskInTurboJsonError {
                        span,
                        text,
                        task_id: task_id.to_string(),
                    },
                )))
            }
            false => Ok(task_definitions),
        };
    }

    for turbo_json in turbo_json_chain {
        if let Some(workspace_def) = turbo_json.task(task_id, task_name)? {
            task_definitions.push(workspace_def);
        }
    }

    if task_definitions.is_empty() && should_validate_engine {
        let (span, text) = task_id.span_and_text("turbo.json");
        return Err(BuilderError::MissingPackageTask(Box::new(
            MissingPackageTaskError {
                span,
                text,
                task_id: task_id.to_string(),
                task_name: task_name.to_string(),
            },
        )));
    }

    Ok(task_definitions)
}
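// Illustrative sketch (hypothetical chain, not from a real repo): for a
// root-first chain [root, pkg-b, pkg-a] where pkg-b declares
// `"build": { "extends": false, "cache": false }`, the reverse scan sets
// `extends_false_index = Some(1)`. Only pkg-b's fresh definition (it has
// config beyond `extends`) and any later override from pkg-a are collected;
// root's `build` config never enters the merge.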

/// Provides the chain of turbo.jsons to load to fully resolve all extends
/// for a package's turbo.json.
fn turbo_json_chain<'b>(
    &self,
    turbo_json_loader: &'b L,
    package_name: &PackageName,
) -> Result<Vec<&'b TurboJson>, BuilderError> {
    let validator = &self.validator;
    let mut turbo_jsons = Vec::with_capacity(2);

    enum ReadReq {
        // An inferred check we perform for each package to see if there is a package specific
        // turbo.json
        Infer(PackageName),
        // A specifically requested read from a package name being present in `extends`
        Request(Spanned<PackageName>),
    }

    impl ReadReq {
        fn package_name(&self) -> &PackageName {
            match self {
                ReadReq::Infer(package_name) => package_name,
                ReadReq::Request(package_name) => package_name.as_inner(),
            }
        }

        fn required(&self) -> Option<(Option<SourceSpan>, NamedSource<String>)> {
            match self {
                ReadReq::Infer(_) => None,
                ReadReq::Request(spanned) => Some(spanned.span_and_text("turbo.json")),
            }
        }
    }

    let mut read_stack = vec![(ReadReq::Infer(package_name.clone()), vec![])];
    let mut visited = std::collections::HashSet::new();

    while let Some((read_req, mut path)) = read_stack.pop() {
        let package_name = read_req.package_name();

        // Check for cycle by seeing if this package is already in the current path
        if let Some(cycle_index) = path.iter().position(|p: &PackageName| p == package_name) {
            // Found a cycle - build the cycle portion for error
            let mut cycle = path[cycle_index..]
                .iter()
                .map(|p| p.to_string())
                .collect::<Vec<_>>();
            cycle.push(package_name.to_string());

            let (span, text) = read_req
                .required()
                .unwrap_or_else(|| (None, NamedSource::new("turbo.json", String::new())));

            return Err(BuilderError::CyclicExtends(Box::new(CyclicExtends {
                cycle,
                span,
                text,
            })));
        }

        // Skip if we've already fully processed this package
        if visited.contains(package_name) {
            continue;
        }

        let turbo_json = turbo_json_loader
            .load(package_name)
            .map(Some)
            .or_else(|err| {
                if let Some((span, text)) = read_req.required() {
                    if err.is_no_turbo_json() {
                        Err(BuilderError::MissingTurboJsonExtends(Box::new(
                            MissingTurboJsonExtends {
                                package_name: read_req.package_name().to_string(),
                                span,
                                text,
                            },
                        )))
                    } else {
                        Err(err)
                    }
                } else if err.is_no_turbo_json() {
                    Ok(None)
                } else {
                    Err(err)
                }
            })?;
        if let Some(turbo_json) = turbo_json {
            BuilderError::from_validation(
                validator
                    .validate_turbo_json(package_name, turbo_json)
                    .into_iter()
                    .map(turborepo_config::Error::from)
                    .collect(),
            )?;
            turbo_jsons.push(turbo_json);
            visited.insert(package_name.clone());

            // Add current package to path for cycle detection
            path.push(package_name.clone());

            // Add the new turbo.json we are extending from
            let (extends, span) = turbo_json.extends.clone().split();
            for extend_package in extends {
                let extend_package_name = PackageName::from(extend_package);
                read_stack.push((
                    ReadReq::Request(span.clone().to(extend_package_name)),
                    path.clone(),
                ));
            }
        } else if turbo_jsons.is_empty() {
            // If there is no package turbo.json extend from root by default
            read_stack.push((ReadReq::Infer(PackageName::Root), path));
        }
    }

    Ok(turbo_jsons.into_iter().rev().collect())
}
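// Illustrative sketch (hypothetical packages, not from a real repo): if
// `pkg-a` extends `pkg-b` and `pkg-b` extends `//` (root), the stack walk
// collects [pkg-a, pkg-b, root] and the final `.rev()` returns the chain
// root-first:
//
//   [root turbo.json, pkg-b turbo.json, pkg-a turbo.json]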

/// Returns the path from a task's package directory to the repo root
pub fn path_to_root(&self, task_id: &TaskId) -> Result<RelativeUnixPathBuf, BuilderError> {
    let package_name = PackageName::from(task_id.package());
    let pkg_path = self
        .package_graph
        .package_dir(&package_name)
        .ok_or_else(|| BuilderError::MissingPackageJson {
            workspace: package_name,
        })?;
    Ok(AnchoredSystemPathBuf::relative_path_between(
        &self.repo_root.resolve(pkg_path),
        self.repo_root,
    )
    .to_unix())
}
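// Illustrative sketch (hypothetical layout): for a package checked out at
// `apps/web` under the repo root, the relative path between
// `<repo>/apps/web` and `<repo>` is `../..`, returned as a
// `RelativeUnixPathBuf`.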

}

/// Controls whether validation is performed during task inheritance
/// resolution.
///
/// This enum replaces a boolean flag to make the code's intent clearer at
/// call sites. Validation checks that tasks referenced with `extends: false`
/// actually exist in the extends chain.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ValidationMode {
    /// Validate that `extends: false` references existing tasks.
    /// Used at the entry point of resolution.
    Validate,
    /// Skip validation. Used in recursive calls where validation
    /// has already been performed at the entry point.
    Skip,
}

/// Resolves task inheritance through the extends chain.
///
/// This struct encapsulates the logic for collecting tasks from a turbo.json
/// and its extends chain, handling task-level `extends: false` which can:
/// - Exclude a task entirely (when no other config is provided)
/// - Create a fresh task definition (when other config is provided)
///
/// Task exclusions propagate through the extends chain. If package B
/// excludes a task from package C, and package A extends B, then A will
/// not see that task from C (unless A explicitly re-adds it).
pub struct TaskInheritanceResolver<'a, L: TurboJsonLoader> {
    loader: &'a L,
    /// Controls validation of `extends: false` usage.
    /// Set to `Validate` at entry point, `Skip` in recursive calls.
    validation_mode: ValidationMode,
}

/// Internal state for recursive resolution.
/// Separated from `TaskInheritanceResolver` to allow sharing the visited set
/// across the entire resolution without cloning.
struct ResolutionState {
    /// Tasks collected from the inheritance chain
    tasks: HashSet<TaskName<'static>>,
    /// Tasks that have been excluded via `extends: false`
    excluded_tasks: HashSet<TaskName<'static>>,
    /// Packages that have been visited to prevent infinite loops.
    /// This is shared across all recursive calls to avoid O(n²) cloning.
    visited: HashSet<PackageName>,
}

impl<'a, L: TurboJsonLoader> TaskInheritanceResolver<'a, L> {

/// Creates a new resolver for collecting tasks from a workspace.
pub fn new(loader: &'a L) -> Self {
    Self {
        loader,
        validation_mode: ValidationMode::Validate,
    }
}

/// Resolves all tasks from the given workspace and its extends chain.
pub fn resolve(
    self,
    workspace: &PackageName,
) -> Result<HashSet<TaskName<'static>>, BuilderError> {
    let mut state = ResolutionState {
        tasks: HashSet::new(),
        excluded_tasks: HashSet::new(),
        visited: HashSet::new(),
    };
    self.collect_from_workspace(workspace, &mut state)?;
    Ok(state.tasks)
}

/// Internal recursive collection that tracks exclusions.
/// Uses a shared mutable state to avoid cloning the visited set on each
/// iteration.
fn collect_from_workspace(
    &self,
    workspace: &PackageName,
    state: &mut ResolutionState,
) -> Result<(), BuilderError> {
    // Avoid infinite loops from cyclic extends
    if state.visited.contains(workspace) {
        return Ok(());
    }
    state.visited.insert(workspace.clone());

    let turbo_json = match self.loader.load(workspace) {
        Ok(json) => json,
        Err(err) if err.is_no_turbo_json() && !matches!(workspace, PackageName::Root) => {
            // If no turbo.json for this workspace, check root
            return self.collect_from_workspace(&PackageName::Root, state);
        }
        Err(err) => return Err(err),
    };

    // Collect inherited tasks from the extends chain
    let (inherited_tasks, chain_exclusions) =
        self.collect_from_extends_chain(turbo_json, state)?;

    // Process tasks from this turbo.json
    self.process_local_tasks(turbo_json, &inherited_tasks, state)?;

    // Add inherited tasks that aren't excluded
    Self::merge_inherited_tasks(inherited_tasks, &chain_exclusions, state);

    // Merge chain exclusions into our exclusions (they propagate up)
    state.excluded_tasks.extend(chain_exclusions);

    Ok(())
}

/// Collects tasks from the extends chain of a turbo.json.
/// Uses the shared visited set from state to avoid O(n²) cloning for deep
/// chains.
fn collect_from_extends_chain(
    &self,
    turbo_json: &TurboJson,
    state: &mut ResolutionState,
) -> Result<(HashSet<TaskName<'static>>, HashSet<TaskName<'static>>), BuilderError> {
    let mut inherited_tasks = HashSet::new();
    let mut chain_exclusions = HashSet::new();

    for extend in turbo_json.extends.as_inner().iter() {
        let extend_package = PackageName::from(extend.as_str());

        // Skip if already visited (cycle detection without cloning)
        if state.visited.contains(&extend_package) {
            continue;
        }

        // Create a child resolver that skips validation (only validate at entry point)
        let child_resolver = TaskInheritanceResolver {
            loader: self.loader,
            validation_mode: ValidationMode::Skip,
        };

        // Use separate state for child to collect its tasks/exclusions,
        // but share the visited set to avoid cloning
        let mut child_state = ResolutionState {
            tasks: HashSet::new(),
            excluded_tasks: HashSet::new(),
            // Take ownership of visited temporarily to avoid cloning
            visited: std::mem::take(&mut state.visited),
        };

        child_resolver.collect_from_workspace(&extend_package, &mut child_state)?;

        // Restore visited set (now includes all packages visited by child)
        state.visited = child_state.visited;

        inherited_tasks.extend(child_state.tasks);
        chain_exclusions.extend(child_state.excluded_tasks);
    }

    // Fallback to root if no explicit extends and not already at root
    if turbo_json.extends.is_empty() && !state.visited.contains(&PackageName::Root) {
        let child_resolver = TaskInheritanceResolver {
            loader: self.loader,
            validation_mode: ValidationMode::Skip,
        };

        // Use separate state for child, sharing visited set
        let mut child_state = ResolutionState {
            tasks: HashSet::new(),
            excluded_tasks: HashSet::new(),
            visited: std::mem::take(&mut state.visited),
        };

        child_resolver.collect_from_workspace(&PackageName::Root, &mut child_state)?;

        // Restore visited set
        state.visited = child_state.visited;

        inherited_tasks.extend(child_state.tasks);
        chain_exclusions.extend(child_state.excluded_tasks);
    }

    Ok((inherited_tasks, chain_exclusions))
}

/// Processes tasks defined in the local turbo.json.
fn process_local_tasks(
    &self,
    turbo_json: &TurboJson,
    inherited_tasks: &HashSet<TaskName<'static>>,
    state: &mut ResolutionState,
) -> Result<(), BuilderError> {
    for (task_name, task_def) in turbo_json.tasks.iter() {
        match task_def.extends.as_ref().map(|s| *s.as_inner()) {
            Some(false) => {
                self.handle_excluded_task(
                    turbo_json,
                    task_name,
                    task_def,
                    inherited_tasks,
                    state,
                )?;
            }
            _ => {
                // Normal task or explicit `extends: true` - add it
                state.tasks.insert(task_name.clone());
            }
        }
    }
    Ok(())
}

/// Handles a task with `extends: false`.
fn handle_excluded_task(
    &self,
    turbo_json: &TurboJson,
    task_name: &TaskName<'static>,
    task_def: &RawTaskDefinition,
    inherited_tasks: &HashSet<TaskName<'static>>,
    state: &mut ResolutionState,
) -> Result<(), BuilderError> {
    // Validate that the task exists in the extends chain (only at entry point)
    if self.validation_mode == ValidationMode::Validate && !inherited_tasks.contains(task_name)
    {
        let (span, text) = task_def
            .extends
            .as_ref()
            .unwrap()
            .span_and_text("turbo.json");
        let extends_chain = Self::format_extends_chain(turbo_json, inherited_tasks);
        return Err(BuilderError::TurboJson(
            turborepo_turbo_json::Error::TaskNotInExtendsChain {
                task_name: task_name.to_string(),
                extends_chain,
                span,
                text,
            },
        ));
    }

    if task_def.has_config_beyond_extends() {
        // Has other config - this is a fresh definition, add it
        state.tasks.insert(task_name.clone());
    }
    // Track as excluded (propagates to parent packages)
    state.excluded_tasks.insert(task_name.clone());
    Ok(())
}

/// Merges inherited tasks that aren't excluded.
fn merge_inherited_tasks(
    inherited_tasks: HashSet<TaskName<'static>>,
    chain_exclusions: &HashSet<TaskName<'static>>,
    state: &mut ResolutionState,
) {
    for task in inherited_tasks {
        if !state.excluded_tasks.contains(&task) && !chain_exclusions.contains(&task) {
            state.tasks.insert(task);
        }
    }
}

/// Formats the extends chain for error messages.
fn format_extends_chain(
    turbo_json: &TurboJson,
    available_tasks: &HashSet<TaskName<'static>>,
) -> String {
    let mut result = String::new();
    result.push_str("The extends chain includes:\n");

    let extends = turbo_json.extends.as_inner();
    if extends.is_empty() {
        result.push_str("  → // (root)\n");
    } else {
        for extend in extends {
            result.push_str(&format!("  → {}\n", extend));
        }
    }

    result.push_str("\nTasks available from extends chain:\n");
    if available_tasks.is_empty() {
        result.push_str("  (none)\n");
    } else {
        let mut sorted_tasks: Vec<_> = available_tasks.iter().collect();
        sorted_tasks.sort();
        for task in sorted_tasks {
            result.push_str(&format!("  • {}\n", task));
        }
    }

    result
}
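// Illustrative sketch of the formatted message (hypothetical values), built
// from the `format!` calls above:
//
//   The extends chain includes:
//     → pkg-b
//
//   Tasks available from extends chain:
//     • build
//     • lint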

}

#[cfg(test)]
mod test {
use std::collections::{HashMap, HashSet};

use insta::{assert_json_snapshot, assert_snapshot};
use pretty_assertions::assert_eq;
use serde_json::json;
use tempfile::TempDir;
use test_case::test_case;
use turbopath::AbsoluteSystemPathBuf;
use turborepo_errors::Spanned;
use turborepo_lockfiles::Lockfile;
use turborepo_repository::{
    discovery::PackageDiscovery,
    package_graph::{PackageGraph, PackageName},
    package_json::PackageJson,
    package_manager::PackageManager,
};
use turborepo_task_id::{TaskId, TaskName};
use turborepo_turbo_json::{
    FutureFlags, RawPackageTurboJson, RawRootTurboJson, RawTurboJson, TurboJson,
};
use turborepo_types::TaskDefinition;

use crate::{
    BuilderError, Built, CyclicExtends, EngineBuilder, TaskInheritanceResolver, TaskNode,
};

/// Test implementation of TurboJsonLoader that returns pre-configured
/// TurboJson structures without reading from disk.
struct TestTurboJsonLoader {
    turbo_jsons: HashMap<PackageName, TurboJson>,
}

impl TestTurboJsonLoader {
    fn new(turbo_jsons: HashMap<PackageName, TurboJson>) -> Self {
        Self { turbo_jsons }
    }
}

impl crate::TurboJsonLoader for TestTurboJsonLoader {
    fn load(&self, package: &PackageName) -> Result<&TurboJson, BuilderError> {
        self.turbo_jsons
            .get(package)
            .ok_or_else(|| BuilderError::TurboJson(turborepo_turbo_json::Error::NoTurboJSON))
    }
}

// Only used to prevent package graph construction from attempting to read
// lockfile from disk
#[derive(Debug)]
struct MockLockfile;
impl Lockfile for MockLockfile {
    fn resolve_package(
        &self,
        _workspace_path: &str,
        _name: &str,
        _version: &str,
    ) -> Result<Option<turborepo_lockfiles::Package>, turborepo_lockfiles::Error> {
        unreachable!()
    }

    fn all_dependencies(
        &self,
        _key: &str,
    ) -> Result<Option<std::borrow::Cow<'_, HashMap<String, String>>>, turborepo_lockfiles::Error>
    {
        unreachable!()
    }

    fn subgraph(
        &self,
        _workspace_packages: &[String],
        _packages: &[String],
    ) -> Result<Box<dyn Lockfile>, turborepo_lockfiles::Error> {
        unreachable!()
    }

    fn encode(&self) -> Result<Vec<u8>, turborepo_lockfiles::Error> {
        unreachable!()
    }

    fn global_change(&self, _other: &dyn Lockfile) -> bool {
        unreachable!()
    }

    fn turbo_version(&self) -> Option<String> {
        None
    }
}

struct MockDiscovery;
impl PackageDiscovery for MockDiscovery {
    async fn discover_packages(
        &self,
    ) -> Result<
        turborepo_repository::discovery::DiscoveryResponse,
        turborepo_repository::discovery::Error,
    > {
        Ok(turborepo_repository::discovery::DiscoveryResponse {
            package_manager: PackageManager::Npm,
            workspaces: vec![], // we don't care about this
        })
    }

    async fn discover_packages_blocking(
        &self,
    ) -> Result<
        turborepo_repository::discovery::DiscoveryResponse,
        turborepo_repository::discovery::Error,
    > {
        self.discover_packages().await
    }
}

macro_rules! package_jsons {
    {$root:expr, $($name:expr => $deps:expr),+} => {
        {
            let mut _map = HashMap::new();
            $(
                let path = $root.join_components(&["packages", $name, "package.json"]);
                let dependencies = Some($deps.iter().map(|dep: &&str| (dep.to_string(), "workspace:*".to_string())).collect());
                let package_json = PackageJson { name: Some(Spanned::new($name.to_string())), dependencies, ..Default::default() };
                _map.insert(path, package_json);
            )+
            _map
        }
    };
}

fn mock_package_grap

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

The analysis above covers every performance opportunity I found across the turborepo Rust codebase. The top 3 wins (parallel cache restore, avoiding HTTP body clones, concurrent FS+HTTP put) are all achievable with relatively low effort and would compound to meaningfully faster builds — especially on the cache-hit path that dominates CI builds.

Would you like me to dive deeper into implementing any of these, or save these findings to the memory/plan file?


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research
Previously explored areas:

  • task-hash-tracker-lock-contention
  • cache-archive-restore-pipeline

Exploration

AREA: cache-archive-restore-pipeline
FILES: crates/turborepo-cache/src/cache_archive/restore.rs, crates/turborepo-cache/src/cache_archive/restore_regular.rs, crates/turborepo-cache/src/cache_archive/restore_directory.rs
MATURITY: stable
RELEVANCE: Every cache hit funnels through a fully sequential tar-entry-to-disk loop in restore_entries; buffering decoded entries then writing files in parallel with rayon could cut cache-restore wall time 20-40% on large workspaces.

Raw Exploration Notes: Based on my exploration of the turborepo codebase, here's my recommendation:

AREA: cache-archive-restore-pipeline
FILES: crates/turborepo-cache/src/cache_archive/restore.rs, crates/turborepo-cache/src/cache_archive/restore_regular.rs, crates/turborepo-cache/src/cache_archive/restore_directory.rs
MATURITY: stable
RELEVANCE: Every cache hit funnels through a fully sequential tar-entry-to-disk loop in restore_entries; buffering decoded entries then writing files in parallel with rayon could cut cache-restore wall time 20-40% on large workspaces.

Key evidence from the code:

  • restore.rs:89-114 — The restore_entries loop iterates tr.entries() one-at-a-time, calling restore_entry (stat + mkdir + write) sequentially for each file. No parallelism at all.
  • The CachedDirTree optimization (line 82) avoids redundant lstat calls but doesn't help with the sequential write bottleneck.
  • By contrast, other hot paths are already parallelized: package.json parsing uses rayon::par_iter (builder.rs:344), git index stat uses par_iter (repo_index.rs:125), and file hashing uses into_par_iter (hash_object.rs:50).
  • The tar format requires sequential reading (streaming), but nothing prevents buffering entries in memory and then writing them to disk in parallel using rayon once the tar is fully decoded.
  • Compression uses the zstd default level (create.rs:115,151 — `Encoder::new(writer, 0)`), so decompression is cheap and decoded data becomes available quickly; the sequential filesystem writes behind it are the bottleneck.
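A minimal std-only sketch of the buffer-then-write idea (the real change would use rayon and the tar crate's entry type; `Entry` below is a hypothetical stand-in, and errors are surfaced through `io::Result` for brevity):

```rust
use std::{
    fs,
    io::{self, Write},
    path::{Path, PathBuf},
    thread,
};

// Hypothetical stand-in for a decoded tar entry: path plus contents already
// read from the (necessarily sequential) tar stream.
struct Entry {
    path: PathBuf,
    body: Vec<u8>,
}

// Phase 1 (done by the caller): drain the tar stream into `entries`.
// Phase 2 (here): write the buffered entries to disk from worker threads.
fn restore_parallel(root: &Path, entries: Vec<Entry>, workers: usize) -> io::Result<()> {
    fs::create_dir_all(root)?;
    let chunk = entries.len().div_ceil(workers.max(1)).max(1);
    thread::scope(|s| {
        let handles: Vec<_> = entries
            .chunks(chunk)
            .map(|batch| {
                s.spawn(move || -> io::Result<()> {
                    for e in batch {
                        let dest = root.join(&e.path);
                        if let Some(parent) = dest.parent() {
                            fs::create_dir_all(parent)?;
                        }
                        fs::File::create(dest)?.write_all(&e.body)?;
                    }
                    Ok(())
                })
            })
            .collect();
        handles.into_iter().try_for_each(|h| h.join().unwrap())
    })
}
```

The trade-off is peak memory: the whole archive is held decoded before the parallel write phase starts, so a streaming variant with a bounded channel may be preferable for very large caches.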

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

---

## Current State

Turborepo's build pipeline has **six distinct hot paths** where measurable time is spent on every `turbo run`:

1. **Lockfile transitive closure** (`crates/turborepo-lockfiles/src/lib.rs:130-149`) — `all_transitive_closures` uses rayon `par_iter` over workspaces with a shared `DashMap` resolve cache, but the DashMap is initialized with **no capacity hint** (line 135), `all_dependencies()` results are **never cached** (each workspace re-walks shared subgraphs), and `Package` structs are **cloned on every cache hit** (line 232). Called on the critical startup path from `PackageGraph::populate_transitive_dependencies`.

2. **Engine builder config loading** (`crates/turborepo-engine/src/builder.rs:157-165, 803-804`) — turbo.json configs are loaded **serially** per workspace during BFS graph construction. Each workspace's entire extends chain is resolved sequentially before moving to the next. The chain cache (line 259) is populated lazily during traversal, missing the opportunity to pre-load all needed configs in parallel.

3. **Cache archive compression** (`crates/turborepo-cache/src/cache_archive/create.rs:115,151`) — zstd compression is hardcoded to `Encoder::new(writer, 0)` with **no multi-thread configuration**. Files are added to tar archives sequentially with per-file `symlink_metadata()` + `open()` syscalls (lines 177-187). The write buffer is 1MB (line 146).

4. **Globwalk pattern recompilation** (`crates/turborepo-globwalk/src/lib.rs:478`) — `compile_globs()` is called on **every `globwalk()` invocation** with zero caching. In a 100-package monorepo where many packages share `dist/**` output patterns, the same regexes are compiled 100+ times across `save_outputs` and file-hashing paths.

5. **SCM hash allocations** (`crates/turborepo-scm/src/hash_object.rs:69`, `repo_index.rs:162,242,255`) — Per-file `String` allocation from hex encoding on every hash, per-file `RelativeUnixPathBuf` and hash clones when building per-package hash sets, and a **double directory walk** in manual mode (`manual.rs:119-161` then `166-202`) when both explicit inputs and defaults are needed.

6. **Lockfile `all_dependencies` implementations vary wildly** — pnpm pre-indexes at parse time (O(1) borrowed lookup), but npm generates 5-10 candidate keys per dependency with BTreeMap lookups, and berry iterates **all overrides** for every dependency (O(N×M)).

---

Required Changes

1. Pre-size DashMap and cache all_dependencies results

File: crates/turborepo-lockfiles/src/lib.rs

// Before (line 135):
let resolve_cache: ResolveCache = DashMap::new();

// After — estimate capacity from total dependency count:
let total_deps: usize = workspaces.values().map(|d| d.len()).sum();
let resolve_cache: ResolveCache = DashMap::with_capacity(total_deps * 4);

Add a second cache for all_dependencies results to avoid redundant lockfile lookups across workspace walks:

type DepsCache = DashMap<String, Option<Arc<HashMap<String, String>>>>;

In the walk() method (line 278), check this cache before calling self.lockfile.all_dependencies(&pkg.key).

Replace Package clones with Arc<Package> in the resolve cache to eliminate heap allocation on every cache hit (line 232).
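As an illustration of the cache-hit behavior, here is a std-only sketch (a plain `HashMap` behind `&mut` stands in for `DashMap`, and `get_or_resolve` is a hypothetical helper, not the real API): a hit hands back an `Arc` clone — a pointer bump — instead of deep-cloning the `Package` struct.

```rust
use std::{collections::HashMap, sync::Arc};

#[derive(Debug)]
struct Package {
    key: String,
    version: String,
}

// Hypothetical resolve cache keyed by the "<workspace>\0<name>\0<specifier>"
// string the real code builds in `make_cache_key`.
struct ResolveCache {
    inner: HashMap<String, Option<Arc<Package>>>,
}

impl ResolveCache {
    fn new() -> Self {
        Self { inner: HashMap::new() }
    }

    fn get_or_resolve(
        &mut self,
        cache_key: &str,
        resolve: impl FnOnce() -> Option<Package>,
    ) -> Option<Arc<Package>> {
        if let Some(hit) = self.inner.get(cache_key) {
            return hit.clone(); // cheap: clones the Arc, not the Package
        }
        let resolved = resolve().map(Arc::new);
        self.inner.insert(cache_key.to_string(), resolved.clone());
        resolved
    }
}
```

The design choice is that downstream consumers hold `Arc<Package>` instead of `Package`, which is why the type change ripples into the `HashSet<Package>` closures noted below in the effort table.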

2. Parallelize engine builder config loading

File: crates/turborepo-engine/src/builder.rs

Replace the serial loop at lines 157-165 with parallel pre-loading:

// Collect all workspace names
let all_names: Vec<_> = self.workspaces.iter().cloned().collect();
// Pre-load all chains in parallel using rayon
let chains: HashMap<_, _> = all_names
    .par_iter()
    .map(|ws| {
        let chain = self.turbo_json_chain(turbo_json_loader, ws)?;
        Ok((ws.clone(), chain))
    })
    .collect::<Result<_, _>>()?;

Pre-populate turbo_json_chain_cache before the BFS loop starts. This requires making turbo_json_loader thread-safe (wrap in Arc).
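A std-only sketch of the same shape, using scoped threads instead of rayon (`resolve_chain` is a placeholder for the real `turbo_json_chain`, which must be thread-safe for this to work; one thread per workspace is for brevity — rayon would chunk the work):

```rust
use std::{collections::HashMap, thread};

// Placeholder for resolving one workspace's extends chain: the key property
// is that each workspace's resolution is independent of the others.
fn resolve_chain(workspace: &str) -> Result<Vec<String>, String> {
    Ok(vec!["//".to_string(), workspace.to_string()])
}

// Compute all chains on worker threads, then merge into one map that can
// pre-populate the chain cache before the BFS starts.
fn preload_chains(workspaces: &[String]) -> Result<HashMap<String, Vec<String>>, String> {
    thread::scope(|s| {
        let handles: Vec<_> = workspaces
            .iter()
            .map(|ws| s.spawn(move || resolve_chain(ws).map(|chain| (ws.clone(), chain))))
            .collect();
        // Join in order; the first error (if any) aborts the collect.
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}
```

Because the chains are merged before graph construction begins, the BFS itself stays single-threaded and unchanged — only the I/O-bound config loading moves off the critical path.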

3. Enable multi-threaded zstd compression

File: crates/turborepo-cache/src/cache_archive/create.rs

// Before (line 151):
let zw = zstd::Encoder::new(file_buffer, 0)?.auto_finish();

// After:
let mut encoder = zstd::Encoder::new(file_buffer, 0)?;
encoder.multithread(std::thread::available_parallelism()
    .map(|n| n.get() as u32)
    .unwrap_or(1))?;
let zw = encoder.auto_finish();

Apply same change at line 115 for the from_writer path. Increase buffer to 4MB for large outputs:

let file_buffer = BufWriter::with_capacity(4 * 1024 * 1024, file);

4. Add compiled-pattern cache for globwalk

File: crates/turborepo-globwalk/src/lib.rs

Add a thread_local! or Arc<DashMap> cache keyed by (base_path, include_patterns, exclude_patterns) that stores Arc<CompiledGlobs>. On cache hit, skip compile_globs() entirely:

use std::sync::OnceLock;
static COMPILED_CACHE: OnceLock<DashMap<u64, Arc<CompiledGlobs>>> = OnceLock::new();

pub fn globwalk(...) -> Result<...> {
    let cache_key = hash_of(base_path, include, exclude);
    let cache = COMPILED_CACHE.get_or_init(DashMap::new);
    let compiled = match cache.get(&cache_key) {
        Some(hit) => hit.clone(),
        None => {
            let c = Arc::new(compile_globs(base_path, include, exclude)?);
            cache.insert(cache_key, c.clone());
            c
        }
    };
    retry_on_emfile(|| walk_compiled_globs(&compiled, walk_type, Default::default()))
}

5. Reduce SCM hash allocations

Files: hash_object.rs, repo_index.rs, manual.rs

  • Use Arc<str> instead of String for hex hashes — share the same 40-byte string across packages that reference the same file:

    // hash_object.rs line 69:
    let hash_str: Arc<str> = Arc::from(unsafe { std::str::from_utf8_unchecked(&hex_buf) });
  • Merge the double walk in manual.rs into a single traversal with a flag tracking whether each file came from explicit inputs or defaults, eliminating the second walk() call entirely.

  • Return borrowed slices from get_package_hashes() instead of cloning per-file paths and hashes when the package prefix matches the full index.
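The single-traversal merge described above can be sketched generically — one pass over the walked entries, with both matchers applied to each path (the closures below are hypothetical stand-ins for the compiled input globs and the default-inputs rules):

```rust
use std::collections::HashSet;

// One pass over the file list, classifying each path as an explicit input,
// a default input, or both — replacing the second directory walk.
fn classify<'a>(
    files: impl IntoIterator<Item = &'a str>,
    is_explicit_input: impl Fn(&str) -> bool,
    is_default_input: impl Fn(&str) -> bool,
) -> (HashSet<String>, HashSet<String>) {
    let mut explicit = HashSet::new();
    let mut defaults = HashSet::new();
    for path in files {
        // Both checks run against the same walked entry, so the directory
        // tree is only traversed once.
        if is_explicit_input(path) {
            explicit.insert(path.to_string());
        }
        if is_default_input(path) {
            defaults.insert(path.to_string());
        }
    }
    (explicit, defaults)
}
```

The real change also has to preserve the gitignore semantics that differ between the two walks, which is why the effort table below calls out "single-pass gitignore logic" as the prerequisite.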

6. Optimize npm/berry all_dependencies

  • npm: Pre-build a dependency index at parse time (like pnpm does) to avoid generating 5-10 candidate keys per dependency lookup.
  • berry: Pre-compute which overrides apply to which packages at parse time, replacing the O(N×M) override loop with O(1) lookup.
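For the npm case, the pre-index idea can be sketched against the `node_modules/...` key layout of `package-lock.json` (the helper name is illustrative and the map values are elided to `()`; the real implementation would store the parsed package entries):

```rust
use std::collections::{BTreeMap, HashMap};

// Hypothetical parse-time index: instead of generating several candidate
// keys per lookup and probing a BTreeMap for each, walk the `packages` map
// once and record, for every parent, which key each nested name resolves to.
fn build_dep_index(packages: &BTreeMap<String, ()>) -> HashMap<(String, String), String> {
    let mut index = HashMap::new();
    for key in packages.keys() {
        // `node_modules/a/node_modules/b` means `b` is nested under `a`.
        if let Some(pos) = key.rfind("/node_modules/") {
            let parent = key[..pos].to_string();
            let name = key[pos + "/node_modules/".len()..].to_string();
            index.insert((parent, name), key.clone());
        } else if let Some(name) = key.strip_prefix("node_modules/") {
            // Top-level packages are reachable from every importer; model
            // the root as an empty parent string.
            index.insert((String::new(), name.to_string()), key.clone());
        }
    }
    index
}
```

Lookup then becomes: try `(parent, name)`, then fall back to `(root, name)` — two O(1) probes instead of 5-10 BTreeMap candidate lookups.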

Ripple Effects

| Change | Affects | Risk |
|---|---|---|
| DashMap pre-sizing + deps cache | All lockfile formats (npm/pnpm/berry/bun) | Low — additive caching, fallback to existing behavior |
| Parallel config loading | Engine builder, turbo.json loader | Medium — requires thread-safety for `TurboJsonLoader`; must verify that turbo.json loading has no mutation side effects |
| Multi-threaded zstd | Every cache-miss task, remote cache upload | Low — zstd multithread is a single API call; verify compressed archives are byte-compatible with existing caches (they use the same zstd format) |
| Globwalk pattern cache | save_outputs, file hashing, globwalk consumers | Low — pure memoization; cache invalidation is automatic since patterns are deterministic |
| SCM hash allocation reduction | PackageInputsHashes, RepoIndex, manual fallback | Medium — changing `String` to `Arc<str>` ripples through the `FileHashes` type; need coordinated change across task-hash and scm crates |
| npm/berry all_dependencies indexing | Lockfile parsing for those formats | Medium — changes parse-time behavior; thorough lockfile round-trip tests exist |

Cache compatibility note: Multi-threaded zstd produces bit-identical decompression output but different compressed bytes. Remote cache hits from older turborepo versions will still decompress correctly. No cache invalidation needed.


What This Unlocks

Estimated wall-clock improvements (100-package monorepo, cold start):

| Optimization | Expected Speedup | Where Time Is Saved |
|---|---|---|
| DashMap pre-sizing + deps cache | 15-25% of all_transitive_closures | Startup / lockfile analysis |
| Parallel config loading | 50-80% of engine build phase | Startup / graph construction |
| Multi-threaded zstd | 2-4x faster cache writes | Every cache-miss task |
| Globwalk pattern caching | 10-30% of file hashing | Hash computation + cache save |
| SCM allocation reduction | 15-25% of file hashing | Hash computation |
| npm/berry dep indexing | 30-50% of all_dependencies for those formats | Startup / lockfile analysis |

Compound effect: In a large monorepo hitting mostly cache misses, these changes stack — faster startup (config + lockfile), faster hash computation (glob + SCM), faster cache writes (zstd) = potentially 30-50% faster end-to-end turbo run on cold builds.

For warm builds (all cache hits), the startup optimizations (lockfile + engine builder) are what matter most since task execution is skipped.
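The compound claim can be sanity-checked with Amdahl-style arithmetic; the phase fractions and per-phase speedups below are illustrative assumptions, not measurements of a real monorepo:

```rust
// Amdahl-style compound estimate. Each (fraction_of_wall_time, speedup) pair
// describes one phase; fractions should sum to 1.0. The result is the
// end-to-end speedup once every phase is accelerated independently.
fn compound_speedup(phases: &[(f64, f64)]) -> f64 {
    // New total time = sum of each phase's fraction divided by its speedup.
    let new_total: f64 = phases.iter().map(|(frac, s)| frac / s).sum();
    1.0 / new_total
}
```

With assumed fractions of 25% startup (1.5x faster), 30% hashing (1.25x), 20% cache writes (3x), and 25% untouched, the end-to-end win is about 1.38x; the 30-50% range above requires these phases to dominate more heavily, as they do on cold builds.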


Effort & Risk

| Change | Effort | Risk | Prerequisite |
|---|---|---|---|
| DashMap pre-sizing | 1-2 hours | Very low | None |
| all_dependencies result cache | 1-2 days | Low | Must handle Cow::Borrowed vs Cow::Owned across lockfile impls |
| `Arc<Package>` in resolve cache | 2-4 hours | Low | Type change ripples to `HashSet<Package>` consumers |
| Parallel config loading | 3-5 days | Medium | TurboJsonLoader must be Send + Sync; verify no config mutation |
| Multi-threaded zstd | 2-4 hours | Very low | Verify zstd crate feature flags are enabled |
| Globwalk pattern cache | 1-2 days | Low | Hash function for pattern identity; cache lifetime management |
| SCM `Arc<str>` migration | 2-3 days | Medium | Coordinate FileHashes type across turborepo-scm and turborepo-task-hash |
| Merge double walk in manual.rs | 1 day | Low | Single-pass gitignore logic |
| npm dep pre-indexing | 2-3 days | Medium | Must match current resolution semantics exactly |
| Berry override pre-indexing | 1-2 days | Medium | Override matching logic is nuanced |

How to verify each change doesn't break anything

  1. Run the existing test suite:

    cargo test -p turborepo-lockfiles
    cargo test -p turborepo-cache
    cargo test -p turborepo-scm
    cargo test -p turborepo-engine
  2. Run integration tests:

    # From turborepo-tests/integration/
    prysk tests/run/*.t
  3. Profile before/after with Chrome tracing:

    # Before changes:
    turbo run build --profile=before.trace
    cat before.trace.md  # Auto-generated markdown summary
    
    # After changes:
    turbo run build --profile=after.trace
    cat after.trace.md
    
    # Compare hot functions:
    diff before.trace.md after.trace.md
  4. Lockfile round-trip validation (for lockfile changes):

    cargo test -p turborepo-lockfiles -- --test-threads=1
    # Existing tests cover npm, pnpm, berry, bun, yarn1 lockfile parsing + serialization
  5. Cache archive compatibility (for zstd changes):

    # Create archive with old binary, restore with new binary (and vice versa)
    # Existing test in cache_archive/mod.rs covers this
    cargo test -p turborepo-cache
  6. SCM hash correctness (for allocation changes):

    # git_index_regression_tests.rs creates real repos and validates hash equivalence
    cargo test -p turborepo-scm -- git_index_regression

Recommended order: Start with DashMap pre-sizing (lowest risk, immediate win), then multi-threaded zstd (easy API change, big payoff), then globwalk caching, then tackle the larger refactors (parallel config loading, SCM types).


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research
Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.

Exploration

AREA: lockfile-transitive-closure
FILES: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs
MATURITY: stable
RELEVANCE: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
NOTES: (unspecified)

Raw Exploration Notes:
AREA: lockfile-transitive-closure
FILES: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs
MATURITY: stable
RELEVANCE: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.

Selected Code Context

===== FILE: crates/turborepo-lockfiles/src/lib.rs =====
//! Package manager lockfile parsing, analysis, and serialization
//!
//! Parsing and analysis are used to track which external packages a workspace
//! package depends on. This allows Turborepo to not perform a global
//! invalidation on a lockfile change, but instead only the packages which
//! depend on the changed external packages.
//!
//! Serialization is exclusively used by turbo prune and is far more error
//! prone than deserialization and analysis.

#![deny(clippy::all)]
// the pest proc macro adds an empty doc comment.
#![allow(clippy::empty_docs)]

mod berry;
mod bun;
mod error;
mod npm;
mod pnpm;
mod yarn1;

use std::{
    any::Any,
    borrow::Cow,
    collections::{HashMap, HashSet},
};

pub use berry::{Error as BerryError, *};
pub use bun::{BunLockfile, bun_global_change};
use dashmap::DashMap;
pub use error::Error;
pub use npm::*;
pub use pnpm::{PnpmLockfile, pnpm_global_change, pnpm_subgraph};
use rayon::prelude::*;
use serde::Serialize;
use turbopath::RelativeUnixPathBuf;
pub use yarn1::{Yarn1Lockfile, yarn_subgraph};

type ResolveCache = DashMap<String, Option<Package>>;

#[derive(Debug, PartialEq, Eq, Clone, PartialOrd, Ord, Hash, Serialize)] pub struct Package { pub key: String, pub version: String, }

/// A trait for exposing common operations for lockfile parsing, analysis, and
/// encoding.
///
/// External packages are identified by key strings which have no shared
/// structure other than being able to uniquely identify a package in the
/// corresponding lockfile. When programming against these keys they should be
/// viewed as a black box and any logic for handling them should live in the
/// specific lockfile implementation which might have additional understanding
/// of them. Using human_name can provide a version of the key that is
/// formatted for human viewing. The fact that keys are still represented as
/// Strings is a vestige of the translation from Go.
///
/// We cannot easily expose lockfiles as a standard
/// graph due to overrides that various lockfile formats support. A dependency
/// of "package": "1.0.0" might resolve to a different version depending on
/// how it is imported. See https://pnpm.io/settings#overrides
pub trait Lockfile: Send + Sync + Any + std::fmt::Debug {
/// Resolve a dependency declaration from a workspace package to a lockfile
/// key
fn resolve_package(
    &self,
    workspace_path: &str,
    name: &str,
    version: &str,
) -> Result<Option<Package>, Error>;

/// Given a lockfile key return all (prod/dev/optional) direct dependencies
/// of that package.
fn all_dependencies(
    &self,
    key: &str,
) -> Result<Option<Cow<'_, HashMap<String, String>>>, Error>;

/// Given a list of workspace packages and external packages that are
/// dependencies of the workspace packages, produce a lockfile that only
/// references said packages.
///
/// The caller is expected to have calculated the correct `packages` for the
/// provided `workspace_packages` using `resolve_package` and
/// `all_dependencies` as otherwise `subgraph` might fail or produce an
/// incorrect lockfile.
fn subgraph(
    &self,
    workspace_packages: &[String],
    packages: &[String],
) -> Result<Box<dyn Lockfile>, Error>;

/// Encode the lockfile to a string of bytes that can be written to disk
fn encode(&self) -> Result<Vec<u8>, Error>;

/// All patch files referenced in the lockfile
///
/// Useful for identifying any patch files that are referenced by the
/// lockfile
fn patches(&self) -> Result<Vec<RelativeUnixPathBuf>, Error> {
    Ok(Vec::new())
}

/// Determine if there's a global change between two lockfiles
///
/// This generally is only `true` across lockfile version changes or when a
/// setting changes where it is safer to view everything as changed rather
/// than try to understand the change.
fn global_change(&self, other: &dyn Lockfile) -> bool;

/// Return any turbo version found in the lockfile
///
/// Used for identifying which version of `turbo` the lockfile references if
/// no local `turbo` binary is found.
fn turbo_version(&self) -> Option<String>;

/// A human friendly version of a lockfile key.
/// Usually of the form `package@version`, but version might include
/// additional information to convey difference from other packages in
/// the lockfile e.g. differing peer dependencies.
#[allow(unused)]
fn human_name(&self, package: &Package) -> Option<String> {
    None
}

}

/// Takes a lockfile, and a map of workspace directory paths -> (package name,
/// version) and calculates the transitive closures for all of them
pub fn all_transitive_closures<L: Lockfile + ?Sized>(
    lockfile: &L,
    workspaces: HashMap<String, HashMap<String, String>>,
    ignore_missing_packages: bool,
) -> Result<HashMap<String, HashSet<Package>>, Error> {
    let resolve_cache: ResolveCache = DashMap::new();
    workspaces
        .into_par_iter()
        .map(|(workspace, unresolved_deps)| {
            let closure = transitive_closure_cached(
                lockfile,
                &workspace,
                unresolved_deps,
                ignore_missing_packages,
                &resolve_cache,
            )?;
            Ok((workspace, closure))
        })
        .collect()
}

#[tracing::instrument(skip_all)]
pub fn transitive_closure<L: Lockfile + ?Sized>(
    lockfile: &L,
    workspace_path: &str,
    unresolved_deps: HashMap<String, String>,
    ignore_missing_packages: bool,
) -> Result<HashSet<Package>, Error> {
    let resolve_cache: ResolveCache = DashMap::new();
    transitive_closure_cached(
        lockfile,
        workspace_path,
        unresolved_deps,
        ignore_missing_packages,
        &resolve_cache,
    )
}

fn transitive_closure_cached<L: Lockfile + ?Sized>(
    lockfile: &L,
    workspace_path: &str,
    unresolved_deps: HashMap<String, String>,
    ignore_missing_packages: bool,
    resolve_cache: &ResolveCache,
) -> Result<HashSet<Package>, Error> {
    let mut ctx = ClosureContext {
        lockfile,
        workspace_path,
        resolve_cache,
        key_buf: String::new(),
    };
    let mut transitive_deps = HashSet::new();
    ctx.walk(
        &unresolved_deps,
        &mut transitive_deps,
        ignore_missing_packages,
        true,
    )?;
    Ok(transitive_deps)
}

struct ClosureContext<'a, L: Lockfile + ?Sized> {
    lockfile: &'a L,
    workspace_path: &'a str,
    resolve_cache: &'a ResolveCache,
    key_buf: String,
}

impl<L: Lockfile + ?Sized> ClosureContext<'_, L> {
fn make_cache_key(&mut self, workspace_path: Option<&str>, name: &str, specifier: &str) {
    self.key_buf.clear();
    if let Some(wp) = workspace_path {
        self.key_buf
            .reserve(wp.len() + name.len() + specifier.len() + 2);
        self.key_buf.push_str(wp);
        self.key_buf.push('\0');
    } else {
        self.key_buf.reserve(name.len() + specifier.len() + 1);
    }
    self.key_buf.push_str(name);
    self.key_buf.push('\0');
    self.key_buf.push_str(specifier);
}

fn resolve_deps(
    &mut self,
    unresolved_deps: &HashMap<String, String>,
    ignore_missing_packages: bool,
    is_workspace_root_deps: bool,
) -> Result<Vec<Package>, Error> {
    let mut newly_resolved = Vec::new();

    for (name, specifier) in unresolved_deps {
        // For direct workspace dependencies, include workspace_path in the cache key
        // since resolution depends on the workspace's importer entry.
        // For transitive sub-dependencies, the resolution is workspace-independent
        // (the version is already a resolved lockfile key), so we omit workspace_path
        // to enable cross-workspace cache sharing.
        let wp = is_workspace_root_deps.then_some(self.workspace_path);
        self.make_cache_key(wp, name, specifier);

        let pkg = match self.resolve_cache.get(self.key_buf.as_str()) {
            Some(cached) => cached.clone(),
            None => {
                let result =
                    match self
                        .lockfile
                        .resolve_package(self.workspace_path, name, specifier)
                    {
                        Ok(pkg) => pkg,
                        Err(Error::MissingWorkspace(_)) if ignore_missing_packages => {
                            self.resolve_cache.insert(self.key_buf.clone(), None);
                            continue;
                        }
                        Err(e) => return Err(e),
                    };
                self.resolve_cache
                    .insert(self.key_buf.clone(), result.clone());
                result
            }
        };

        if let Some(pkg) = pkg {
            newly_resolved.push(pkg);
        }
    }

    Ok(newly_resolved)
}

fn walk(
    &mut self,
    unresolved_deps: &HashMap<String, String>,
    resolved_deps: &mut HashSet<Package>,
    ignore_missing_packages: bool,
    is_workspace_root_deps: bool,
) -> Result<(), Error> {
    let newly_resolved = self.resolve_deps(
        unresolved_deps,
        ignore_missing_packages,
        is_workspace_root_deps,
    )?;

    for pkg in newly_resolved {
        if resolved_deps.contains(&pkg) {
            continue;
        }

        let all_deps = self.lockfile.all_dependencies(&pkg.key)?;
        resolved_deps.insert(pkg);
        if let Some(deps) = all_deps {
            self.walk(&deps, resolved_deps, false, false)?;
        }
    }

    Ok(())
}

}
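The NUL-separated, buffer-reusing key scheme in `make_cache_key` can be exercised in isolation. `build_key` below is a hypothetical free-function version of the same logic (not the crate's API): NUL cannot appear in package names or specifiers, so the concatenation is unambiguous, and reusing one `String` avoids a fresh allocation per lookup.

```rust
// Hypothetical standalone version of the `make_cache_key` layout above.
fn build_key(buf: &mut String, workspace_path: Option<&str>, name: &str, specifier: &str) {
    buf.clear();
    if let Some(wp) = workspace_path {
        // Workspace-scoped key for direct dependencies
        buf.reserve(wp.len() + name.len() + specifier.len() + 2);
        buf.push_str(wp);
        buf.push('\0');
    } else {
        buf.reserve(name.len() + specifier.len() + 1);
    }
    buf.push_str(name);
    buf.push('\0');
    buf.push_str(specifier);
}

fn main() {
    let mut buf = String::new();
    build_key(&mut buf, Some("apps/web"), "react", "^18.0.0");
    assert_eq!(buf, "apps/web\0react\0^18.0.0");
    // The same buffer is reused for a workspace-independent transitive key
    build_key(&mut buf, None, "react", "^18.0.0");
    assert_eq!(buf, "react\0^18.0.0");
}
```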

impl Package {
    pub fn new(key: impl Into<String>, version: impl Into<String>) -> Self {
        let key = key.into();
        let version = version.into();
        Self { key, version }
    }
}

===== END FILE: crates/turborepo-lockfiles/src/lib.rs =====

===== FILE: crates/turborepo-lockfiles/src/berry/identifiers.rs =====

use std::{borrow::Cow, fmt, sync::OnceLock};

use regex::Regex;
use thiserror::Error;

fn ident() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"^(?:@([^/]+?)/)?([^@/]+)$").unwrap())
}

fn descriptor() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"^(?:@([^/]+?)/)?([^@/]+?)(?:@(.+))$").unwrap())
}

fn patch_ref() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"patch:(.+)#(?:\./)?([^:]+)(?:::)?.*$").unwrap())
}

fn multikey() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r" *, *").unwrap())
}

fn builtin() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"^(?:optional!)?builtin<([^>]+)>$").unwrap())
}

#[derive(Debug, Error)]
pub enum Error {
    #[error("Invalid identifier ({0})")]
    Ident(String),
    #[error("Invalid descriptor ({0})")]
    Descriptor(String),
    #[error("Invalid locator ({0})")]
    Locator(String),
}

/// A package scope and name
/// For example: typescript, @babel/core
#[derive(Debug, Clone, PartialEq, Eq, PartialOrd, Ord, Hash)]
pub struct Ident<'a> {
    scope: Option<Cow<'a, str>>,
    name: Cow<'a, str>,
}

/// An identifier with a semver range
/// For example: is-even@^1.0.0, next@npm:13.0.0
#[derive(Debug, Clone, PartialEq, Eq, PartialOrd, Ord, Hash)]
pub struct Descriptor<'a> {
    pub ident: Ident<'a>,
    pub range: Cow<'a, str>,
}

/// An identifier with a resolved version.
/// They are similar to descriptors except that descriptors can reference
/// multiple packages whereas a locator references exactly one.
/// For example: is-number@npm:1.0.0, web@workspace:*
#[derive(Debug, Clone, PartialEq, Eq, PartialOrd, Ord, Hash)]
pub struct Locator<'a> {
    pub ident: Ident<'a>,
    pub reference: Cow<'a, str>,
}

impl Ident<'_> {
    /// Clones underlying strings and changes lifetime to represent this
    pub fn to_owned(&self) -> Ident<'static> {
        let Ident { scope, name } = self;
        let scope = scope
            .as_ref()
            .map(|scope| scope.to_string())
            .map(Cow::Owned);
        let name = Cow::Owned(name.to_string());
        Ident { scope, name }
    }
}

// These TryFrom impls should be FromStr, but to avoid unnecessary copying we
// use TryFrom so we can use a lifetime.
impl<'a> TryFrom<&'a str> for Ident<'a> {
    type Error = Error;

fn try_from(value: &'a str) -> Result<Self, Self::Error> {
    let make_err = || Error::Ident(value.to_string());
    let captures = ident().captures(value).ok_or_else(make_err)?;
    let scope = captures.get(1).map(|m| Cow::Borrowed(m.as_str()));
    let name = Cow::Borrowed(captures.get(2).map(|m| m.as_str()).ok_or_else(make_err)?);
    Ok(Self { scope, name })
}

}
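For intuition, the grammar matched by the `ident()` regex — an optional `@scope/` prefix followed by a name containing neither `@` nor `/` — can be mimicked without `regex`. `parse_ident` below is a hypothetical regex-free sketch, not the crate's API:

```rust
// Regex-free sketch of the `^(?:@([^/]+?)/)?([^@/]+)$` ident grammar.
// Returns (scope, name) on success, mirroring the two capture groups.
fn parse_ident(s: &str) -> Option<(Option<&str>, &str)> {
    let (scope, name) = match s.strip_prefix('@') {
        // "@scope/name": everything between '@' and the first '/' is the scope
        Some(rest) => {
            let (scope, name) = rest.split_once('/')?;
            (Some(scope), name)
        }
        None => (None, s),
    };
    // Reject empty components and names containing '@' or '/'
    if scope.map_or(false, |s| s.is_empty()) || name.is_empty() {
        return None;
    }
    if name.contains('@') || name.contains('/') {
        return None;
    }
    Some((scope, name))
}

fn main() {
    assert_eq!(parse_ident("@babel/parser"), Some((Some("babel"), "parser")));
    assert_eq!(parse_ident("turbo"), Some((None, "turbo")));
    assert_eq!(parse_ident("@babel/a/b"), None);
}
```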

impl fmt::Display for Ident<'_> {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> std::fmt::Result {
        if let Some(scope) = self.scope.as_deref() {
            f.write_fmt(format_args!("@{scope}/"))?;
        }
        f.write_str(&self.name)
    }
}

impl<'a> TryFrom<&'a str> for Descriptor<'a> {
    type Error = Error;

fn try_from(value: &'a str) -> Result<Self, Self::Error> {
    let make_err = || Error::Descriptor(value.to_string());
    let captures = descriptor().captures(value).ok_or_else(make_err)?;
    let scope = captures.get(1).map(|m| Cow::Borrowed(m.as_str()));
    let name = Cow::Borrowed(captures.get(2).map(|m| m.as_str()).ok_or_else(make_err)?);
    let range = Cow::Borrowed(captures.get(3).map(|m| m.as_str()).ok_or_else(make_err)?);
    let ident = Ident { scope, name };
    Ok(Descriptor { ident, range })
}

}

impl fmt::Display for Descriptor<'_> {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_fmt(format_args!("{}@{}", self.ident, self.range))
    }
}

impl<'a> Descriptor<'a> {
pub fn new(ident: &'a str, range: &'a str) -> Result<Self, Error> {
    let ident = Ident::try_from(ident)?;
    let range = range.into();
    Ok(Self { ident, range })
}

/// Extracts all descriptors that are present in a lockfile entry key
pub fn from_lockfile_key(key: &'a str) -> impl Iterator<Item = Result<Descriptor<'a>, Error>> {
    multikey().split(key).map(Descriptor::try_from)
}

/// Removes the protocol from a version range
pub fn strip_protocol(range: &str) -> &str {
    range.split_once(':').map_or(range, |(_, rest)| rest)
}

pub fn into_owned(self) -> Descriptor<'static> {
    let Self { ident, range } = self;
    let range = Cow::Owned(range.to_string());
    Descriptor {
        ident: ident.to_owned(),
        range,
    }
}

/// Returns the protocol of the version range if one is present
pub fn protocol(&self) -> Option<&str> {
    self.range.split_once(':').map(|(protocol, _)| protocol)
}

/// Access the range based on the lifetime of the underlying string slice
/// this will return None if the underlying range is owned.
pub(crate) fn range(&self) -> Option<&'a str> {
    match self.range {
        Cow::Borrowed(s) => Some(s),
        _ => None,
    }
}

/// If the descriptor is a patch returns the version that the patch targets
pub fn primary_version(&self) -> Option<String> {
    let Locator { reference, .. } = Locator::from_patch_reference(&self.range)?;
    // This is always owned due to needing to replace '%3A' with ':' so
    // we extract the owned string.
    Some(reference.into_owned())
}

}

impl<'a> TryFrom<&'a str> for Locator<'a> {
    type Error = Error;

fn try_from(value: &'a str) -> Result<Self, Self::Error> {
    // Descriptors and locators have the same structure so we use the descriptor
    // parsing logic
    let Descriptor { ident, range } = Descriptor::try_from(value).map_err(|err| match err {
        Error::Descriptor(val) => Error::Locator(val),
        _ => err,
    })?;
    Ok(Locator {
        ident,
        reference: range,
    })
}

}

impl fmt::Display for Locator<'_> {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.write_fmt(format_args!("{}@{}", self.ident, self.reference))
    }
}

const WORKSPACE_PROTOCOL: &str = "workspace:";

impl<'a> Locator<'a> {
pub fn new(ident: &'a str, reference: &'a str) -> Result<Self, Error> {
    let ident = Ident::try_from(ident)?;
    Ok(Self {
        ident,
        reference: reference.into(),
    })
}

pub fn from_patch_reference(patch_reference: &'a str) -> Option<Self> {
    let caps = patch_ref().captures(patch_reference)?;
    let capture_group = caps.get(1)?;
    let Locator { ident, reference } = Locator::try_from(capture_group.as_str()).ok()?;
    // This might seem like a special case hack, but this is what yarn does
    let mut decoded_reference = reference.replace("npm%3A", "npm:");
    // Some older versions of yarn don't encode the npm protocol
    if !decoded_reference.starts_with("npm:") {
        decoded_reference.insert_str(0, "npm:");
    }
    Some(Locator {
        ident,
        reference: Cow::Owned(decoded_reference),
    })
}

pub fn is_patch_builtin(patch: &str) -> bool {
    patch.starts_with('~') || builtin().is_match(patch)
}

pub fn is_workspace_path(&self, workspace_path: &str) -> bool {
    // This is slightly awkward, but it allows us to avoid an allocation
    self.reference.starts_with(WORKSPACE_PROTOCOL)
        && &self.reference[WORKSPACE_PROTOCOL.len()..] == workspace_path
}

/// Converts a possibly borrowed Locator to one that must be owned
pub fn as_owned(&self) -> Locator<'static> {
    let Locator { ident, reference } = self;
    let ident = ident.to_owned();
    let reference = Cow::Owned(reference.to_string());
    Locator { ident, reference }
}

pub fn patch_file(&self) -> Option<&str> {
    patch_ref()
        .captures(&self.reference)
        .and_then(|caps| caps.get(2))
        .map(|m| {
            let s = m.as_str();
            s.strip_prefix("./")
                // Yarn 4 uses ~ to indicate the yarn root
                .or_else(|| s.strip_prefix("~/"))
                .unwrap_or(s)
        })
}

pub fn patched_locator(&self) -> Option<Locator<'_>> {
    // This has an issue of cutting off the last char
    Locator::from_patch_reference(&self.reference)
}

}

impl<'a> From<Locator<'a>> for Descriptor<'a> {
    fn from(value: Locator<'a>) -> Self {
        let Locator { ident, reference } = value;
        Descriptor {
            ident,
            range: reference,
        }
    }
}
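The normalization inside `from_patch_reference` — decode the percent-encoded `npm%3A`, then prepend `npm:` for older lockfiles that omit the protocol — is small enough to sketch standalone. `decode_patch_target` is a hypothetical helper reproducing just that step, not part of the crate:

```rust
// Sketch of the reference normalization done in `from_patch_reference`:
// yarn percent-encodes ':' in the inner locator, and some older versions
// omit the npm protocol entirely.
fn decode_patch_target(reference: &str) -> String {
    let mut decoded = reference.replace("npm%3A", "npm:");
    if !decoded.starts_with("npm:") {
        decoded.insert_str(0, "npm:");
    }
    decoded
}

fn main() {
    assert_eq!(decode_patch_target("npm%3A4.17.21"), "npm:4.17.21");
    // Older yarn: a bare semver range gets the protocol prepended
    assert_eq!(decode_patch_target("^4.5.2"), "npm:^4.5.2");
}
```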

#[cfg(test)]
mod test {
use pretty_assertions::assert_eq;

use super::*;

#[test]
fn test_parse_ident_with_scope() {
    assert_eq!(
        Ident::try_from("@babel/parser").unwrap(),
        Ident {
            scope: Some("babel".into()),
            name: "parser".into()
        }
    )
}

#[test]
fn test_parse_ident_without_scope() {
    assert_eq!(
        Ident::try_from("turbo").unwrap(),
        Ident {
            scope: None,
            name: "turbo".into(),
        }
    )
}

#[test]
fn test_ident_roundtrip() {
    for ident in ["turbo", "@babel/parser"] {
        assert_eq!(ident, Ident::try_from(ident).unwrap().to_string());
    }
}

#[test]
fn test_parse_descriptor() {
    assert_eq!(
        Descriptor::try_from("@babel/code-frame@npm:7.12.11").unwrap(),
        Descriptor {
            ident: Ident {
                scope: Some("babel".into()),
                name: "code-frame".into()
            },
            range: "npm:7.12.11".into(),
        }
    )
}

#[test]
fn test_locator_builtin_patch() {
    assert_eq!(
        Locator::try_from(
            "resolve@patch:resolve@npm%3A1.22.1#~builtin<compat/resolve>::version=1.22.1&\
             hash=07638b"
        )
        .unwrap(),
        Locator {
            ident: Ident {
                scope: None,
                name: "resolve".into()
            },
            reference: "patch:resolve@npm%3A1.22.1#~builtin<compat/resolve>::version=1.22.1&\
                        hash=07638b"
                .into()
        },
    );
}

#[test]
fn test_descriptor_roundtrip() {
    for descriptor in [
        "@babel/code-frame@npm:7.12.11",
        "lodash@patch:lodash@npm%3A4.17.21#./.yarn/patches/lodash-npm-4.17.21-6382451519.\
         patch::version=4.17.21&hash=2c6e9e&locator=berry-patch%40workspace%3A.",
    ] {
        assert_eq!(
            descriptor,
            Descriptor::try_from(descriptor).unwrap().to_string()
        )
    }
}

#[test]
fn test_locator_patch_file() {
    struct TestCase {
        locator: &'static str,
        file: Option<&'static str>,
    }
    let test_cases = [
        TestCase {
            locator: "lodash@patch:lodash@npm%3A4.17.21#./.yarn/patches/lodash-npm-4.17.\
                      21-6382451519.patch::version=4.17.21&hash=2c6e9e&locator=berry-patch%\
                      40workspace%3A.",
            file: Some(".yarn/patches/lodash-npm-4.17.21-6382451519.patch"),
        },
        TestCase {
            locator: "lodash@npm:4.17.21",
            file: None,
        },
        TestCase {
            locator: "resolve@patch:resolve@npm%3A2.0.0-next.4#~builtin<compat/\
                      resolve>::version=2.0.0-next.4&hash=07638b",
            file: Some("~builtin<compat/resolve>"),
        },
    ];
    for tc in test_cases {
        let locator = Locator::try_from(tc.locator).unwrap();
        assert_eq!(locator.patch_file(), tc.file);
    }
}

#[test]
fn test_locator_patch_original_locator() {
    let locator = Locator::try_from(
        "lodash@patch:lodash@npm%3A4.17.21#./.yarn/patches/lodash-npm-4.17.21-6382451519.\
         patch::version=4.17.21&hash=2c6e9e&locator=berry-patch%40workspace%3A.",
    )
    .unwrap();
    let original = locator.patched_locator().unwrap();
    assert_eq!(original, Locator::try_from("lodash@npm:4.17.21").unwrap())
}

#[test]
fn test_patch_primary_version() {
    struct TestCase {
        locator: &'static str,
        original: Option<&'static str>,
    }
    let test_cases = [
        TestCase {
            locator: "lodash@patch:lodash@npm%3A4.17.21#./.yarn/patches/lodash-npm-4.17.\
                      21-6382451519.patch::locator=berry-patch%40workspace%3A.",
            original: Some("lodash@npm:4.17.21"),
        },
        TestCase {
            locator: "typescript@patch:typescript@^4.5.2#~builtin<compat/typescript>",
            original: Some("typescript@npm:^4.5.2"),
        },
        TestCase {
            locator: "react@npm:18.2.0",
            original: None,
        },
        TestCase {
            locator: "resolve@patch:resolve@npm%3A1.22.1#~builtin<compat/resolve>::version=1.\
                      22.1&hash=07638b",
            original: Some("resolve@npm:1.22.1"),
        },
    ];
    for tc in test_cases {
        let locator = Locator::try_from(tc.locator).unwrap();
        let expected = tc.original.map(Locator::try_from).transpose().unwrap();
        let patch_locator = locator.patched_locator();
        assert_eq!(patch_locator, expected, "{}", tc.locator);
    }
}

}

===== END FILE: crates/turborepo-lockfiles/src/berry/identifiers.rs =====

===== FILE: crates/turborepo-lockfiles/src/berry/protocol_resolver.rs =====

use std::collections::HashMap;

use super::identifiers::{Descriptor, Ident};

/// A data structure for resolving descriptors when the protocol isn't known
#[derive(Debug, Clone, PartialEq, Eq, Default)]
pub struct DescriptorResolver {
    mapping: HashMap<Key, Entry>,
}

#[derive(Debug, Clone, Hash, PartialEq, Eq, PartialOrd, Ord)]
struct Key {
    ident: Ident<'static>,
    range: String,
}

#[derive(Debug, Clone, Hash, PartialEq, Eq, PartialOrd, Ord, Default)]
struct Entry {
    without: Option<String>,
    with: Option<RangeAndProtocol>,
}

#[derive(Debug, Clone, Hash, PartialEq, Eq, PartialOrd, Ord, Default)]
struct RangeAndProtocol {
    protocol: String,
    range: String,
}

impl DescriptorResolver {
/// Add a descriptor to the resolver
pub fn insert(&mut self, descriptor: &Descriptor) -> Option<String> {
    let key = Key::new(descriptor)?;
    let entry = self.mapping.entry(key).or_default();
    entry.insert_descriptor(descriptor)
}

/// If given a descriptor without a protocol it will return all matching
/// descriptors with a protocol
pub fn get(&self, descriptor: &Descriptor) -> Option<&str> {
    let key = Key::new(descriptor)?;
    self.mapping.get(&key).and_then(|e| e.get(descriptor))
}

}

impl Key {
    fn new(desc: &Descriptor) -> Option<Key> {
        let ident = desc.ident.to_owned();
        let range = Descriptor::strip_protocol(desc.range()?).to_string();
        Some(Key { ident, range })
    }
}

impl Entry {
// Insert the given descriptor's range into the correct slot depending if it is
// with or without a protocol
fn insert_descriptor(&mut self, descriptor: &Descriptor) -> Option<String> {
    let range = descriptor.range()?.to_string();
    match descriptor.protocol() {
        Some(protocol) => {
            // Yarn 4 made the default npm protocol explicit in the lockfile.
            // In order to return the more specific protocol we avoid overwriting other
            // protocols with the now explicit npm protocol.
            if protocol != "npm" || self.with.is_none() {
                match self.with.replace(RangeAndProtocol {
                    range,
                    protocol: protocol.to_string(),
                }) {
                    // We only return an ejected range if the protocol isn't the default
                    // npm protocol
                    Some(RangeAndProtocol { range, protocol }) if protocol != "npm" => {
                        Some(range)
                    }
                    _ => None,
                }
            } else {
                None
            }
        }
        None => self.without.replace(range),
    }
}

fn get(&self, descriptor: &Descriptor) -> Option<&str> {
    // We only return the without protocol range if `without` is present
    // and the given descriptor is also without a protocol
    if self.without.is_some() && descriptor.protocol().is_none() {
        self.without.as_deref()
    } else {
        self.with.as_ref().map(|x| x.range.as_str())
    }
}

}
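The two-slot bookkeeping above can be modeled in miniature. This sketch keys only on the protocol-stripped range (the real `Key` also includes the `Ident`) and skips the npm-overwrite special case; `Slot` and `record` are hypothetical names:

```rust
use std::collections::HashMap;

// Simplified model of `Entry`: one slot for the range as seen without a
// protocol, one for the protocol-qualified form, keyed by the stripped range.
#[derive(Default)]
struct Slot {
    without: Option<String>,
    with: Option<String>,
}

fn record(map: &mut HashMap<String, Slot>, range: &str) {
    // Same stripping rule as `Descriptor::strip_protocol`
    let stripped = range.split_once(':').map_or(range, |(_, rest)| rest);
    let slot = map.entry(stripped.to_string()).or_default();
    match range.contains(':') {
        true => slot.with = Some(range.to_string()),
        false => slot.without = Some(range.to_string()),
    }
}

fn main() {
    let mut map: HashMap<String, Slot> = HashMap::new();
    record(&mut map, "npm:^5.0.0");
    record(&mut map, "workspace:*");
    record(&mut map, "*");
    // A protocol-less lookup recovers the protocol-qualified range
    assert_eq!(map["^5.0.0"].with.as_deref(), Some("npm:^5.0.0"));
    // "*" was seen both bare and protocol-qualified
    assert_eq!(map["*"].without.as_deref(), Some("*"));
    assert_eq!(map["*"].with.as_deref(), Some("workspace:*"));
}
```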

#[cfg(test)]
mod test {
use super::*;

#[test]
fn test_descriptor_reconstruction() {
    let mut resolver = DescriptorResolver::default();
    let babel_npm = Descriptor::new("@babel/core", "npm:^5.0.0").unwrap();
    let babel_file = Descriptor::new("@babel/core", "file:4.5.0").unwrap();
    assert!(resolver.insert(&babel_npm).is_none());
    assert!(resolver.insert(&babel_file).is_none());
    let babel_5 = Descriptor::new("@babel/core", "^5.0.0").unwrap();
    let babel_4 = Descriptor::new("@babel/core", "4.5.0").unwrap();
    assert_eq!(resolver.get(&babel_5), Some("npm:^5.0.0"));
    assert_eq!(resolver.get(&babel_4), Some("file:4.5.0"));
}

#[test]
fn test_descriptors_without_protocols() {
    let mut resolver = DescriptorResolver::default();
    let workspace = Descriptor::new("internal-workspace", "*").unwrap();
    let workspace_with_protocol = Descriptor::new("internal-workspace", "workspace:*").unwrap();
    assert!(resolver.insert(&workspace).is_none());
    assert!(resolver.insert(&workspace_with_protocol).is_none());
    assert_eq!(resolver.get(&workspace), Some("*"));
    assert_eq!(resolver.get(&workspace_with_protocol), Some("workspace:*"));
}

}

===== END FILE: crates/turborepo-lockfiles/src/berry/protocol_resolver.rs =====

===== FILE: crates/turborepo-lockfiles/src/berry/ser.rs =====

use std::{borrow::Cow, fmt, sync::OnceLock};

use regex::Regex;

use super::{BerryPackage, DependencyMeta, LockfileData, Metadata};

fn simple_string() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| {
        Regex::new(r#"^[^-?:,\]\[{}#&*!|>'"%@` \t\r\n]([ \t]*[^,\]\[{}:# \t\r\n])*$"#).unwrap()
    })
}

const HEADER: &str = "# This file is generated by running \"yarn install\" inside your project.
# Manual changes might be lost - proceed with caution!
";

// We implement Display in order to produce a correctly serialized yarn.lock
// Since Berry is so particular about the contents we can't use the serde_yml
// serializer without forking it and heavy modifications. Implementing Display
// is more honest than writing a Serializer implementation since the serializer
// would only support a single type.
impl fmt::Display for LockfileData {
fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
    write!(f, "{HEADER}\n{}\n", self.metadata)?;

    for (key, entry) in &self.packages {
        let wrapped_key = wrap_string(key);
        // Yaml 1.2 spec says that keys over 1024 characters need to be prefixed with ?
        // and the : goes in a new line
        let key_line = match wrapped_key.len() <= 1024 {
            true => format!("{wrapped_key}:"),
            false => format!("? {wrapped_key}\n:"),
        };
        write!(f, "\n{key_line}\n{entry}\n")?;
    }

    Ok(())
}

}
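The YAML 1.2 long-key rule applied in `fmt` above can be isolated; `key_line` is a hypothetical extraction of just that branch:

```rust
// Keys over 1024 characters must use YAML's explicit-key form:
// `? <key>` with the `:` on the following line.
fn key_line(wrapped_key: &str) -> String {
    if wrapped_key.len() <= 1024 {
        format!("{wrapped_key}:")
    } else {
        format!("? {wrapped_key}\n:")
    }
}

fn main() {
    assert_eq!(key_line("debug@npm:4.3.4"), "debug@npm:4.3.4:");
    let long = "a".repeat(1025);
    assert!(key_line(&long).starts_with("? "));
    assert!(key_line(&long).ends_with("\n:"));
}
```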

impl fmt::Display for Metadata {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "__metadata:\n  version: {}", self.version)?;
        if let Some(cache_key) = &self.cache_key {
            write!(f, "\n  cacheKey: {}", wrap_string(cache_key))?;
        }
        Ok(())
    }
}

const SPACE: char = ' ';
const NEWLINE: char = '\n';
const INDENT: &str = "  ";

impl fmt::Display for BerryPackage {
fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
    // we only want to write a newline if there was something before
    let mut first = true;
    let mut write_line = |field: &str, whitespace: char, value: &str| -> fmt::Result {
        if !value.is_empty() {
            if !first {
                writeln!(f)?;
            }
            write!(f, "  {field}:{whitespace}{value}")?;
            first = false;
        }
        Ok(())
    };

    write_line("version", SPACE, &wrap_string(self.version.as_ref()))?;
    write_line("resolution", SPACE, &wrap_string(&self.resolution))?;
    if let Some(deps) = &self.dependencies {
        write_line(
            "dependencies",
            NEWLINE,
            &stringify_dependencies(deps.iter()),
        )?;
    }
    if let Some(peer_deps) = &self.peer_dependencies {
        write_line(
            "peerDependencies",
            NEWLINE,
            &stringify_dependencies(peer_deps.iter()),
        )?;
    }
    if let Some(deps_meta) = &self.dependencies_meta {
        write_line(
            "dependenciesMeta",
            NEWLINE,
            &stringify_dependencies_meta(deps_meta.iter()),
        )?;
    }
    if let Some(peer_deps_meta) = &self.peer_dependencies_meta {
        write_line(
            "peerDependenciesMeta",
            NEWLINE,
            &stringify_dependencies_meta(peer_deps_meta.iter()),
        )?;
    }
    if let Some(bin) = &self.bin {
        write_line("bin", NEWLINE, &stringify_dependencies(bin.iter()))?;
    }

    if let Some(checksum) = &self.checksum {
        write_line("checksum", SPACE, &wrap_string(checksum))?;
    }
    if let Some(conditions) = &self.conditions {
        write_line("conditions", SPACE, &wrap_string(conditions))?;
    }
    if let Some(language_name) = &self.language_name {
        write_line("languageName", SPACE, &wrap_string(language_name))?;
    }
    if let Some(link_type) = &self.link_type {
        write_line("linkType", SPACE, &wrap_string(link_type))?;
    }

    Ok(())
}

}

fn stringify_dependencies<I, S1, S2>(entries: I) -> String
where
    I: Iterator<Item = (S1, S2)>,
    S1: AsRef<str>,
    S2: AsRef<str>,
{
let mut string = String::new();
let mut first = true;
for (key, value) in entries {
    let key = key.as_ref();
    let value = value.as_ref();

    if !first {
        string.push('\n');
    }
    string.push_str(&format!("    {}: {}", wrap_string(key), wrap_string(value)));
    first = false;
}
string

}

fn stringify_dependencies_meta<'a, I, S>(metadata: I) -> String
where
    I: Iterator<Item = (S, &'a DependencyMeta)>,
    S: AsRef<str>,
{
struct Builder {
    s: String,
}

impl Builder {
    fn new() -> Self {
        Self { s: String::new() }
    }

    fn add_field(&mut self, field: &str, value: bool) {
        if !self.s.is_empty() {
            self.s.push('\n');
        }
        self.s.push_str(&format!(
            "{INDENT}{INDENT}{INDENT}{}: {value}",
            wrap_string(field)
        ));
    }

    fn add_dependency(&mut self, dependency: &str) {
        if !self.s.is_empty() {
            self.s.push('\n');
        }
        self.s
            .push_str(&format!("{INDENT}{INDENT}{}:", wrap_string(dependency),));
    }
}

let mut builder = Builder::new();

for (dependency, meta) in metadata {
    let dependency = dependency.as_ref();
    let mut first = true;
    let mut add_line = |field: &str, value: bool| {
        if first {
            builder.add_dependency(dependency);
        }
        builder.add_field(field, value);
        first = false;
    };
    if let Some(optional) = meta.optional {
        add_line("optional", optional);
    }
    if let Some(unplugged) = meta.unplugged {
        add_line("unplugged", unplugged);
    }
    if let Some(built) = meta.built {
        add_line("built", built);
    }
}

builder.s

}

fn wrap_string(s: &str) -> Cow<'_, str> {
    match simple_string().is_match(s) {
        // Simple strings require no wrapping
        true => Cow::from(s),
        // Complex strings require wrapping
        false => Cow::from(
            serde_json::to_string(s).unwrap_or_else(|_| panic!("Unable to encode '{s}'")),
        ),
    }
}

#[cfg(test)]
mod test {
use pretty_assertions::assert_eq;

use super::*;

#[test]
fn test_metadata_display() {
    let metadata = Metadata {
        version: "6".into(),
        cache_key: Some("8c0".to_string()),
    };
    assert_eq!(
        metadata.to_string(),
        "__metadata:\n  version: 6\n  cacheKey: 8c0"
    );
}

#[test]
fn test_wrap_string() {
    fn assert(input: &str, expected: &str) {
        assert_eq!(wrap_string(input), expected);
    }
    assert("debug@4.3.4", "debug@4.3.4");
    assert(
        "eslint-module-utils@npm:^2.7.3",
        "\"eslint-module-utils@npm:^2.7.3\"",
    );
    assert("@babel/core", "\"@babel/core\"");
}

#[test]
fn test_long_key_gets_wrapped() {
    let long_key = "a".repeat(1025);
    let lockfile = LockfileData {
        metadata: Metadata {
            version: "6".into(),
            cache_key: Some("8".into()),
        },
        packages: [(
            long_key.clone(),
            BerryPackage {
                version: "1.2.3".to_string(),
                ..Default::default()
            },
        )]
        .iter()
        .cloned()
        .collect(),
    };
    let serialized = lockfile.to_string();
    assert!(serialized.contains(&format!("? {long_key}\n")));
}

#[test]
fn test_stringify_dependencies_meta() {
    let metadata = DependencyMeta {
        optional: Some(false),
        unplugged: None,
        built: Some(true),
    };
    assert_eq!(
        stringify_dependencies_meta(Some(("turbo", &metadata)).into_iter()),
        "    turbo:\n      optional: false\n      built: true"
    );
}

#[test]
fn test_stringify_dependencies_meta_multi() {
    let foo = DependencyMeta {
        optional: Some(true),
        unplugged: None,
        built: None,
    };
    let bar = DependencyMeta {
        optional: None,
        unplugged: None,
        built: Some(true),
    };
    assert_eq!(
        stringify_dependencies_meta(vec![("foo", &foo), ("bar", &bar)].into_iter()),
        "    foo:\n      optional: true\n    bar:\n      built: true"
    );
}

#[test]
fn test_empty_dep_meta() {
    let metadata = DependencyMeta {
        optional: None,
        unplugged: None,
        built: None,
    };
    assert_eq!(
        stringify_dependencies_meta(Some(("turbo", &metadata)).into_iter()),
        ""
    );
}

}

===== END FILE: crates/turborepo-lockfiles/src/berry/ser.rs =====

===== FILE: crates/turborepo-lockfiles/src/berry/de.rs =====

use std::collections::BTreeMap;

use serde::Deserialize;

use super::{BerryPackage, DependencyMeta, LockfileData, Metadata};

const METADATA_KEY: &str = "__metadata";

/// Union type of yarn.lock metadata entry and package entries.
/// Only as a workaround for serde_yml behavior around parsing numbers as
/// strings.
// In the ideal world this would be an enum, but serde_yml currently has behavior
// where using #[serde(untagged)] or #[serde(flatten)] affects how it handles
// YAML numbers being parsed as Strings.
// If these macros are present, then it will refuse to parse 1 or 1.0 as a String
// and will instead only parse them as an int/float respectively.
// If these macros aren't present, then it will happily parse 1 or 1.0 as
// "1" and "1.0".
#[derive(Debug, Deserialize, Default)]
#[serde(rename_all = "camelCase")]
pub(crate) struct Entry {
    version: String,
    language_name: Option<String>,
    dependencies: Option<BTreeMap<String, String>>,
    peer_dependencies: Option<BTreeMap<String, String>>,
    dependencies_meta: Option<BTreeMap<String, DependencyMeta>>,
    peer_dependencies_meta: Option<BTreeMap<String, DependencyMeta>>,
    bin: Option<BTreeMap<String, String>>,
    link_type: Option<String>,
    resolution: Option<String>,
    checksum: Option<String>,
    conditions: Option<String>,
    cache_key: Option<String>,
}

#[derive(Debug, thiserror::Error)]
pub enum Error {
    #[error("Missing resolution for entry {0}.")]
    MissingResolution(String),
    #[error("Multiple entry {0} has fields that should only appear in metadata.")]
    InvalidMetadataFields(String),
    #[error("Lockfile missing {METADATA_KEY} entry.")]
    MissingMetadata,
}

impl TryFrom<BTreeMap<String, Entry>> for LockfileData {
    type Error = Error;

fn try_from(mut value: BTreeMap<String, Entry>) -> Result<Self, Self::Error> {
    let Entry {
        version, cache_key, ..
    } = value.remove(METADATA_KEY).ok_or(Error::MissingMetadata)?;
    let metadata = Metadata { version, cache_key };
    let mut packages = BTreeMap::new();
    for (key, entry) in value {
        let Entry {
            version,
            language_name,
            dependencies,
            peer_dependencies,
            dependencies_meta,
            peer_dependencies_meta,
            bin,
            link_type,
            resolution,
            checksum,
            conditions,
            cache_key,
        } = entry;
        if cache_key.is_some() {
            return Err(Error::InvalidMetadataFields(key));
        }
        let resolution = resolution.ok_or_else(|| Error::MissingResolution(key.clone()))?;
        packages.insert(
            key,
            BerryPackage {
                version,
                language_name,
                dependencies,
                peer_dependencies,
                dependencies_meta,
                peer_dependencies_meta,
                bin,
                link_type,
                resolution,
                checksum,
                conditions,
            },
        );
    }

    Ok(LockfileData { metadata, packages })
}

}

#[cfg(test)]
mod test {
use super::*;

#[test]
fn test_requires_metadata() {
    let data = BTreeMap::new();
    assert!(LockfileData::try_from(data).is_err());
}

#[test]
fn test_rejects_cache_key_in_packages() {
    let mut data = BTreeMap::new();
    data.insert(
        METADATA_KEY.to_string(),
        Entry {
            version: "1".into(),
            cache_key: Some("8".into()),
            ..Default::default()
        },
    );
    data.insert(
        "foo".to_string(),
        Entry {
            version: "1".into(),
            resolution: Some("resolved".into()),
            cache_key: Some("8".into()),
            ..Default::default()
        },
    );
    assert!(LockfileData::try_from(data).is_err());
}

#[test]
fn test_requires_resolution() {
    let mut data = BTreeMap::new();
    data.insert(
        METADATA_KEY.to_string(),
        Entry {
            version: "1".into(),
            cache_key: Some("8".into()),
            ..Default::default()
        },
    );
    data.insert(
        "foo".to_string(),
        Entry {
            version: "1".into(),
            resolution: None,
            ..Default::default()
        },
    );
    assert!(LockfileData::try_from(data).is_err());
}

}

===== END FILE: crates/turborepo-lockfiles/src/berry/de.rs =====

===== FILE: crates/turborepo-lockfiles/src/berry/mod.rs =====

mod de;
mod identifiers;
mod protocol_resolver;
mod resolution;
mod ser;

use std::{
    any::Any,
    collections::{HashMap, HashSet},
    iter,
    sync::Arc,
};

use de::Entry;
use identifiers::{Descriptor, Ident, Locator};
use protocol_resolver::DescriptorResolver;
use semver::Version;
use serde::Deserialize;
use thiserror::Error;
use turbopath::RelativeUnixPathBuf;

use self::resolution::{Resolution, parse_resolution};
use super::Lockfile;

#[derive(Debug, Error)]
pub enum Error {
    #[error("Unable to parse yaml: {0}")]
    Parse(#[from] serde_yaml_ng::Error),
    #[error("Unable to parse identifier: {0}")]
    Identifiers(#[from] identifiers::Error),
    #[error("Unable to find original package in patch locator {0}")]
    PatchMissingOriginalLocator(Locator<'static>),
    #[error("Unable to parse resolutions field: {0}")]
    Resolutions(#[from] resolution::Error),
    #[error("Unable to find entry for {0}")]
    MissingPackageForLocator(Locator<'static>),
    #[error("Unable to find any locator for {0}")]
    MissingLocator(Descriptor<'static>),
    #[error("Descriptor collision {descriptor} and {other}")]
    DescriptorCollision {
        descriptor: Descriptor<'static>,
        other: String,
    },
    #[error("Unable to parse as patch reference: {0}")]
    InvalidPatchReference(String),
    #[error("Package '{name}' not found in catalog '{catalog}'")]
    MissingCatalogEntry { name: String, catalog: String },
}

// We depend on BTree iteration being sorted for correct serialization
type Map<K, V> = std::collections::BTreeMap<K, V>;

type CatalogMap = Map<String, Map<String, String>>;

#[derive(Debug)]
pub struct BerryLockfile {
    data: LockfileData,
    resolutions: Map<Descriptor<'static>, Locator<'static>>,
    // A mapping from descriptors without protocols to a range with a protocol
    resolver: DescriptorResolver,
    locator_package: Map<Locator<'static>, BerryPackage>,
    // Map of regular locators to patch locators that apply to them
    patches: Map<Locator<'static>, Locator<'static>>,
    // Descriptors that come from default package extensions that ship with berry
    extensions: HashSet<Descriptor<'static>>,
    // Package overrides
    overrides: Map<Resolution, String>,
    // Map from workspace paths to package locators
    workspace_path_to_locator: HashMap<String, Locator<'static>>,
    // Yarn 4+ catalog support
    catalogs: Arc<CatalogMap>,
}

// This is the direct representation of the lockfile as it appears on disk.
// More internal tracking is required for effectively altering the lockfile
#[derive(Debug, Clone, Deserialize)]
#[serde(try_from = "Map<String, Entry>")]
pub struct LockfileData {
    metadata: Metadata,
    packages: Map<String, BerryPackage>,
}

#[derive(Debug, PartialEq, Eq, Hash, PartialOrd, Ord, Clone)]
struct Metadata {
    version: String,
    cache_key: Option<String>,
}

#[derive(Debug, PartialEq, Eq, Default, Clone)]
struct BerryPackage {
    version: String,
    language_name: Option<String>,
    dependencies: Option<Map<String, String>>,
    peer_dependencies: Option<Map<String, String>>,
    dependencies_meta: Option<Map<String, DependencyMeta>>,
    peer_dependencies_meta: Option<Map<String, DependencyMeta>>,
    // Structured metadata we need to persist
    bin: Option<Map<String, String>>,
    link_type: Option<String>,
    resolution: String,
    checksum: Option<String>,
    conditions: Option<String>,
}

#[derive(Debug, Deserialize, PartialEq, Eq, Hash, PartialOrd, Ord, Clone, Copy)]
struct DependencyMeta {
    optional: Option<bool>,
    unplugged: Option<bool>,
    built: Option<bool>,
}

#[derive(Debug, Clone, Deserialize)]
pub struct BerryManifest {
    resolutions: Option<Map<String, String>>,
    // Yarn 4+ catalog support - default catalog
    catalog: Option<Map<String, String>>,
    // Yarn 4+ catalog support - named catalogs
    catalogs: Option<Map<String, Map<String, String>>>,
}

impl BerryLockfile {
    pub fn load(contents: &[u8], manifest: Option<BerryManifest>) -> Result<Self, super::Error> {
        let data = LockfileData::from_bytes(contents)?;
        let lockfile = BerryLockfile::new(data, manifest)?;
        Ok(lockfile)
    }

pub fn new(lockfile: LockfileData, manifest: Option<BerryManifest>) -> Result<Self, Error> {
    let mut patches = Map::new();
    let mut locator_package = Map::new();
    let mut descriptor_locator = Map::new();
    let mut resolver = DescriptorResolver::default();
    let mut workspace_path_to_locator = HashMap::new();
    for (key, package) in &lockfile.packages {
        let locator = Locator::try_from(package.resolution.as_str())?;

        if locator.patch_file().is_some() {
            let original_locator = locator
                .patched_locator()
                .ok_or_else(|| Error::PatchMissingOriginalLocator(locator.as_owned()))?;
            patches.insert(original_locator.as_owned(), locator.as_owned());
        }

        locator_package.insert(locator.as_owned(), package.clone());

        if let Some(path) = locator.reference.strip_prefix("workspace:") {
            workspace_path_to_locator.insert(path.to_string(), locator.as_owned());
        }

        for descriptor in Descriptor::from_lockfile_key(key) {
            let descriptor = descriptor?;
            if let Some(other) = resolver.insert(&descriptor) {
                Err(Error::DescriptorCollision {
                    descriptor: descriptor.clone().into_owned(),
                    other,
                })?;
            }
            descriptor_locator.insert(descriptor.into_owned(), locator.as_owned());
        }
    }

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

---

## Current State

Turborepo has undergone an intensive perf sprint (15+ commits since early 2025) that parallelized task hash computation across topological waves, replaced git subprocesses with `gix-index`, added fast paths for shallow wildcard globs (54ms → 6ms), and reordered DAG callbacks to unblock dependent tasks sooner. On a ~1,200-package monorepo, cumulative `--dry-run` time dropped from ~600ms to ~455ms (~24%). However, several significant bottlenecks remain untouched, and the project lacks regression benchmarks (no criterion suite, no CI perf gates).

**Already optimized:**
- Git/SCM layer (3-stage evolution: subprocesses → libgit2 → gix-index)
- Glob workspace discovery (literal stat fast path + shallow wildcard readdir)
- Task hash computation (wave-parallel via rayon)
- Pre-execution startup (parallelized)
- DAG dispatch latency (callback reordering)

**Profiling infrastructure available:**
- `turbo run --profile output.json` → Chrome Trace Event Format
- `turborepo-profile-md` auto-generates LLM-friendly markdown summaries
- Optional `pprof` CPU/heap profiling behind feature flag
- No criterion benchmarks or CI perf regression detection

---

## Required Changes

### 1. Glob Pattern Compilation Cache (High leverage, low risk)

Files: crates/turborepo-globwalk/src/lib.rs (compile_globs fn), crates/turborepo-run-cache/src/lib.rs (save_outputs)

Problem: ValidatedGlob patterns are compiled from string → regex every time globwalk() is called. In a 100-package monorepo, the same dist/** pattern is regex-compiled 100+ times per build across save_outputs and file-hashing paths.

Change: Add a HashMap<String, Glob<'static>> compilation cache (thread-local or DashMap) inside compile_globs(). Key on the preprocessed glob string, return cached Glob<'static> on hit.
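A minimal std-only sketch of the idea. The real cache would hold `wax::Glob<'static>` values and live inside `compile_globs()`; `CompiledGlob`, `compile_cached`, and the closure standing in for the expensive `Glob::new` call are all hypothetical names for illustration:

```rust
use std::{
    collections::HashMap,
    sync::{Arc, Mutex, OnceLock},
};

/// Hypothetical stand-in for a compiled glob (wax::Glob in turborepo).
#[derive(Debug)]
struct CompiledGlob {
    pattern: String,
}

fn cache() -> &'static Mutex<HashMap<String, Arc<CompiledGlob>>> {
    static CACHE: OnceLock<Mutex<HashMap<String, Arc<CompiledGlob>>>> = OnceLock::new();
    CACHE.get_or_init(|| Mutex::new(HashMap::new()))
}

/// Compile `pattern`, consulting the process-wide cache first.
/// `compile` stands in for the real (expensive) compilation.
fn compile_cached(
    pattern: &str,
    compile: impl FnOnce(&str) -> CompiledGlob,
) -> Arc<CompiledGlob> {
    let mut map = cache().lock().unwrap();
    if let Some(hit) = map.get(pattern) {
        return Arc::clone(hit);
    }
    // Note: a production version might compile outside the lock
    // to avoid blocking other callers on a slow compilation.
    let compiled = Arc::new(compile(pattern));
    map.insert(pattern.to_string(), Arc::clone(&compiled));
    compiled
}

fn main() {
    let mut compilations = 0;
    for _ in 0..3 {
        let g = compile_cached("dist/**", |p| {
            compilations += 1;
            CompiledGlob { pattern: p.to_string() }
        });
        assert_eq!(g.pattern, "dist/**");
    }
    // Three lookups, one compilation.
    assert_eq!(compilations, 1);
    println!("compiled once for 3 lookups");
}
```

Keying on the preprocessed glob string keeps the cache correct even if callers pass equivalent but differently written patterns, since preprocessing normalizes them first.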

Verification:

```sh
# Before/after comparison with profiling
turbo run build --dry-run --profile before.json  # baseline
# Apply change
turbo run build --dry-run --profile after.json   # measure
# Compare compile_globs spans in the markdown summaries
diff before.json.md after.json.md
# Run existing tests
cargo nextest run -p turborepo-globwalk
```

Expected impact: 2-5% on repos with many packages sharing identical output globs.


### 2. Multi-threaded Zstd Compression for Cache Archives (High leverage, medium risk)

Files: crates/turborepo-cache/src/cache_archive/create.rs:115,151

Problem: zstd::Encoder::new(writer, 0) uses single-threaded compression at level 0. Every cache-miss task pays this cost. Large Next.js builds produce hundreds of output chunks.

Change: Enable zstd multi-threading via encoder.multithread(num_cpus::get() as u32)? after encoder creation. Keep level 0 for speed. The zstd crate supports this natively.
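The edit is essentially two lines. A sketch of the encoder setup (not compiled here — it depends on the external `zstd` crate, and `Encoder::multithread` requires the crate's `zstdmt` cargo feature, so verify that feature is enabled; `writer` is the archive's underlying file handle):

```rust
// Sketch only, assuming the `zstd` crate's Encoder API.
// Level 0 asks libzstd for its default level (currently 3).
let mut encoder = zstd::Encoder::new(writer, 0)?;
// Fan compression out across worker threads; 0 workers keeps current behavior.
// std's available_parallelism() is an alternative to num_cpus::get().
encoder.multithread(std::thread::available_parallelism()?.get() as u32)?;
// ... write tar entries through `encoder`, then:
let writer = encoder.finish()?;
```

If write time regresses on small archives (thread spin-up overhead), gate the worker count on the size of the output set.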

Verification:

```sh
# Time cache writes on a non-trivial build
turbo run build --force --profile before.json
# Apply change
turbo run build --force --profile after.json
# Look for cache_archive::create spans
# Also verify archives are byte-compatible:
cargo nextest run -p turborepo-cache
# Restore test: ensure old turbo can read new archives
```

Expected impact: 10-30% faster cache writes on multi-core CI machines, especially for large output sets.


### 3. Eliminate Double Directory Walk in Manual File Hashing (High leverage, low risk)

Files: crates/turborepo-scm/src/manual.rs:69-212

Problem: get_package_file_hashes_without_git() walks the directory tree twice when include_default_files=true — once for inputs, once for defaults with different gitignore settings.

Change: Merge into a single walk that tracks both sets. During the walk, classify each file as "input-matched" or "default-matched" based on the gitignore setting. Build both result sets from a single directory traversal.
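A std-only sketch of the single-pass classification. The assumption that inputs ignore gitignore while default files respect it is illustrative, as are the `Entry`, `classify`, and predicate names — the real walk would check each entry against both compiled glob sets:

```rust
// Hypothetical per-entry state from a single directory traversal.
#[derive(Clone, Copy)]
struct Entry<'a> {
    path: &'a str,
    ignored_by_gitignore: bool, // would come from the walker's ignore state
}

/// One pass over the walk, producing both result sets.
/// `matches_inputs` / `is_default_file` stand in for the real glob checks.
fn classify<'a>(
    entries: &[Entry<'a>],
    matches_inputs: impl Fn(&str) -> bool,
    is_default_file: impl Fn(&str) -> bool,
) -> (Vec<&'a str>, Vec<&'a str>) {
    let mut input_matched = Vec::new();
    let mut default_matched = Vec::new();
    for e in entries {
        // Both sets are decided from the same traversal
        // instead of walking the tree twice.
        if matches_inputs(e.path) {
            input_matched.push(e.path);
        }
        if !e.ignored_by_gitignore && is_default_file(e.path) {
            default_matched.push(e.path);
        }
    }
    (input_matched, default_matched)
}

fn main() {
    let entries = [
        Entry { path: "src/lib.rs", ignored_by_gitignore: false },
        Entry { path: "dist/out.js", ignored_by_gitignore: true },
    ];
    let (inputs, defaults) = classify(
        &entries,
        |p| p.starts_with("dist/"),
        |_| true,
    );
    assert_eq!(inputs, vec!["dist/out.js"]);
    assert_eq!(defaults, vec!["src/lib.rs"]);
    println!("single pass: {} input, {} default", inputs.len(), defaults.len());
}
```

The subtlety to test carefully is directory pruning: the merged walk must descend into any directory that either set needs, so pruning can only happen when both predicates agree.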

Verification:

```sh
cargo nextest run -p turborepo-scm
# Profile the manual hashing path specifically:
turbo run build --dry-run --profile profile.json
# Check get_package_file_hashes_without_git span durations
```

Expected impact: 15-25% faster manual (non-git) file hashing.


### 4. Parallel turbo.json Config Loading in Engine Builder (Medium leverage, medium risk)

Files: crates/turborepo-engine/src/builder.rs:261-425 (BFS loop), turbo_json_chain fn at lines 740-855

Problem: The BFS traversal in EngineBuilder::build() loads turbo.json configs serially per workspace. turbo_json_chain() uses a synchronous stack-based walk to resolve extends chains, doing disk I/O per package.

Change: Pre-load all turbo.json files in parallel (rayon par_iter over workspace packages) before entering the BFS loop. Store results in a HashMap<PackageName, Vec<TurboJson>> and look up from there during BFS.
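A dependency-free sketch of the preload step. The real change would use rayon's `par_iter`; plain threads and the `load_turbo_json` stand-in keep this runnable on its own, and the error-aggregation shape is the part that matters:

```rust
use std::{collections::HashMap, sync::mpsc, thread};

/// Hypothetical stand-in for loading and parsing one package's turbo.json.
fn load_turbo_json(pkg: &str) -> Result<String, String> {
    Ok(format!("config for {pkg}"))
}

/// Pre-load all configs in parallel before the BFS, so the BFS itself
/// becomes pure in-memory lookups.
fn preload(packages: &[&'static str]) -> Result<HashMap<String, String>, String> {
    let (tx, rx) = mpsc::channel();
    for &pkg in packages {
        let tx = tx.clone();
        thread::spawn(move || {
            let _ = tx.send((pkg.to_string(), load_turbo_json(pkg)));
        });
    }
    drop(tx); // close the channel so the receive loop terminates
    let mut configs = HashMap::new();
    for (pkg, result) in rx {
        // Surface worker errors instead of losing them on a thread.
        configs.insert(pkg, result?);
    }
    Ok(configs)
}

fn main() {
    let configs = preload(&["web", "docs", "ui"]).unwrap();
    assert_eq!(configs.len(), 3);
    assert_eq!(configs["web"], "config for web");
    println!("preloaded {} configs", configs.len());
}
```

Extends-chain cycle detection stays in the (now lookup-only) BFS, which keeps that logic sequential and easy to reason about.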

Verification:

```sh
cargo nextest run -p turborepo-engine
turbo run build --dry-run --profile profile.json
# Compare turbo_json_chain and build span durations before/after
```

Expected impact: Proportional to workspace count — 5-15% for 50+ packages, more for 200+.


### 5. Batch Retry Logic for File Hashing (Medium leverage, low risk)

Files: crates/turborepo-scm/src/hash_object.rs:8-94

Problem: hash_file_with_retry() does per-file exponential backoff with std::thread::sleep() on rayon worker threads. This blocks the worker, preventing it from stealing other work.

Change: Replace per-file retry with a batch retry strategy: if any file in the rayon batch hits EMFILE, pause the entire batch (or reduce parallelism), then retry only failed files. Use rayon's scope with backpressure instead of individual sleeps.
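A sketch of the round-based retry shape, with a closure standing in for the real hash function and a unit error standing in for EMFILE (`hash_batch` and its parameters are illustrative names, not the crate's API):

```rust
use std::{thread, time::Duration};

/// Retry a fallible per-file operation in batch rounds: attempt everything,
/// collect failures, back off once per round, then retry only the failures.
fn hash_batch<'a>(
    files: &[&'a str],
    mut op: impl FnMut(&str) -> Result<u64, ()>,
) -> Result<Vec<(&'a str, u64)>, ()> {
    let mut pending: Vec<&str> = files.to_vec();
    let mut done = Vec::new();
    for round in 0..4 {
        if round > 0 {
            // One backoff for the whole round, not one sleep per file
            // blocking a rayon worker thread.
            thread::sleep(Duration::from_millis(1 << round));
        }
        let mut failed = Vec::new();
        for f in pending {
            match op(f) {
                Ok(hash) => done.push((f, hash)),
                Err(()) => failed.push(f),
            }
        }
        if failed.is_empty() {
            return Ok(done);
        }
        pending = failed;
    }
    Err(()) // retries exhausted: propagate, don't swallow
}

fn main() {
    // Fail the first attempt on "b" to simulate transient EMFILE.
    let mut first = true;
    let result = hash_batch(&["a", "b"], |f| {
        if f == "b" && std::mem::take(&mut first) {
            Err(())
        } else {
            Ok(f.len() as u64)
        }
    })
    .unwrap();
    assert_eq!(result, vec![("a", 1), ("b", 1)]);
    println!("hashed {} files", result.len());
}
```

The key property is that work already completed in a round is kept, so a retry round touches only the files that actually failed.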

Verification:

```sh
cargo nextest run -p turborepo-scm
# Stress test: artificially lower ulimit and profile
ulimit -n 256 && turbo run build --dry-run --profile profile.json
```

Expected impact: 10-20% faster file hashing under file descriptor pressure (common on CI).


### 6. DashMap for Daemon times_saved (Low leverage, low risk)

Files: crates/turborepo-daemon/src/server.rs:338,416-417

Problem: Arc<Mutex<HashMap<String, u64>>> for times_saved serializes all notify_outputs_written RPC calls on the mutex.

Change: Replace with DashMap<String, u64> for lock-free concurrent reads/writes.
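DashMap is an external crate, but the core idea it implements — lock sharding, so writers on different keys contend on different locks — can be sketched with std alone (`ShardedMap` is a toy illustration, not the proposed code):

```rust
use std::{
    collections::{HashMap, hash_map::DefaultHasher},
    hash::{Hash, Hasher},
    sync::Mutex,
};

/// Toy sharded map: each key hashes to one of N independently locked shards.
struct ShardedMap<V> {
    shards: Vec<Mutex<HashMap<String, V>>>,
}

impl<V> ShardedMap<V> {
    fn new(n: usize) -> Self {
        Self {
            shards: (0..n).map(|_| Mutex::new(HashMap::new())).collect(),
        }
    }

    fn shard(&self, key: &str) -> &Mutex<HashMap<String, V>> {
        let mut h = DefaultHasher::new();
        key.hash(&mut h);
        &self.shards[(h.finish() as usize) % self.shards.len()]
    }

    fn insert(&self, key: &str, value: V) {
        self.shard(key).lock().unwrap().insert(key.to_string(), value);
    }

    fn get(&self, key: &str) -> Option<V>
    where
        V: Clone,
    {
        self.shard(key).lock().unwrap().get(key).cloned()
    }
}

fn main() {
    let m = ShardedMap::new(16);
    m.insert("app#build", 1200u64);
    assert_eq!(m.get("app#build"), Some(1200));
    assert_eq!(m.get("web#build"), None);
    println!("ok");
}
```

Using the real DashMap instead of hand-rolling this buys tested concurrency semantics; the thing to verify during review is that no caller iterates the map while another task mutates it.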

Verification:

```sh
cargo nextest run -p turborepo-daemon
# Profile with many concurrent tasks:
turbo run build --concurrency=100 --profile profile.json
```

Expected impact: 2-5ms per 100 concurrent tasks.


### 7. Pre-filter Excludes Before Hashing in package_deps (Medium leverage, low risk)

Files: crates/turborepo-scm/src/package_deps.rs:363-377

Problem: Exclude globs are compiled and matched after all files are hashed. Files matching exclusions are hashed unnecessarily, then retain() discards them.

Change: Compile exclude patterns once before hashing, then filter files before calling hash_objects().
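The reordering can be sketched in a few lines; `is_excluded` stands in for a compiled glob set and `hash` for the expensive per-file hash, both hypothetical names:

```rust
/// Filter with the (cheap) compiled exclude check first, so the
/// (expensive) hash runs only on survivors — instead of hashing
/// everything and discarding matches afterwards with `retain()`.
fn hash_filtered<'a>(
    files: &[&'a str],
    is_excluded: impl Fn(&str) -> bool,
    hash: impl Fn(&str) -> u64,
) -> Vec<(&'a str, u64)> {
    files
        .iter()
        .filter(|f| !is_excluded(f)) // cheap check first
        .map(|&f| (f, hash(f)))      // expensive work only on survivors
        .collect()
}

fn main() {
    let files = ["src/lib.rs", "src/lib.test.rs"];
    let hashed = hash_filtered(&files, |f| f.contains(".test."), |f| f.len() as u64);
    assert_eq!(hashed, vec![("src/lib.rs", 10)]);
    println!("hashed {} of {} files", hashed.len(), files.len());
}
```

A property-style test that asserts the old hash-then-retain path and this filter-then-hash path produce identical result sets is the cheapest way to lock in correctness.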

Verification:

```sh
cargo nextest run -p turborepo-scm
turbo run build --dry-run --profile profile.json
# Check get_package_file_hashes_from_inputs_and_index span
```

Expected impact: 3-8% for repos with broad excludes (e.g., !**/*.test.*).


### 8. Add Criterion Benchmark Suite (Meta: enables all future perf work)

Files: New crates/turborepo-benchmarks/ or benches/ in existing crates

Problem: No automated performance regression detection. Every perf PR relies on manual profiling.

Change: Add criterion benchmarks for the hot paths:

  • globwalk::globwalk() with 100/500/1000-package fixtures
  • compile_globs() pattern compilation
  • hash_objects() file hashing
  • CacheWriter::create() archive creation
  • EngineBuilder::build() graph construction
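A skeleton for one such bench (not compiled here — it assumes the `criterion` crate, a `[[bench]]` target with `harness = false` in Cargo.toml, and `bench_compile_globs` is a placeholder name):

```rust
// benches/compile_globs.rs — skeleton only.
use criterion::{Criterion, black_box, criterion_group, criterion_main};

fn bench_compile_globs(c: &mut Criterion) {
    c.bench_function("compile_globs/dist_doublestar", |b| {
        b.iter(|| {
            // call the real pattern-compilation entry point here
            black_box("dist/**");
        })
    });
}

criterion_group!(benches, bench_compile_globs);
criterion_main!(benches);
```

Criterion's `--save-baseline` / `--baseline` flags then give the before/after comparison the verification step relies on.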

Verification:

```sh
cargo bench -- globwalk  # Run glob benchmarks
# CI integration: use criterion's --save-baseline / --baseline flags
# Add to CI as a non-blocking check that reports regressions
```

Expected impact: Prevents regressions, enables confident optimization. Foundation for all future perf work.


## Ripple Effects

| Change | Affected subsystems | Risk vectors |
| --- | --- | --- |
| Glob pattern cache | globwalk, run-cache, scm | Memory growth if cache unbounded; stale entries if patterns mutated (they aren't) |
| Multi-threaded zstd | cache-archive, async-cache | Archive format unchanged (zstd frames are compatible), but older turbo versions must still read new archives — test cross-version restore |
| Single-pass manual walk | scm, task-hash | Behavioral parity required: same files must match under both gitignore settings — careful test coverage needed |
| Parallel config loading | engine builder, turbo-json loader | Errors from parallel loads must be aggregated correctly; extends chain cycles must still be detected |
| Batch retry | scm hash_object | Must preserve correctness under EMFILE; if retry fails globally, error propagation must still work |
| DashMap for times_saved | daemon server | Concurrent iteration semantics differ from Mutex — verify no iteration-during-mutation patterns |
| Pre-filter excludes | package_deps, scm | Must produce identical hash sets — add property-based tests comparing old vs new code paths |
| Criterion benchmarks | CI, developer workflow | Benchmark variance on CI runners can cause flaky alerts — use statistical thresholds (>5% regression) |

## What This Unlocks

  1. Glob pattern cache is the prerequisite for a broader "compiled session context" — once patterns are cached, the same cache can serve both save_outputs and get_package_file_hashes, eliminating redundant work across the hash→execute→cache pipeline.

  2. Multi-threaded zstd directly reduces cache-miss wall time, which is the dominant cost in CI cold starts. Combined with the existing shallow-wildcard glob optimization, this makes turbo run build on a fresh CI runner measurably faster for large monorepos.

  3. Criterion benchmarks are the foundation for all future performance work. Without them, every optimization is validated by ad-hoc profiling and risks silent regressions. Once in CI, they provide a performance regression gate that catches problems before merge.

  4. Single-pass manual walk + pre-filter excludes together eliminate the two largest sources of redundant I/O in the non-git file hashing path, which is the fallback for repos without git, shallow clones, and CI environments where git status is unreliable.

  5. Parallel config loading becomes increasingly valuable as monorepo size grows. For enterprise repos with 500+ packages each with turbo.json + extends chains, this is the difference between 200ms and 50ms for graph construction.


## Effort & Risk

| Change | Effort | Risk | Priority |
| --- | --- | --- | --- |
| Glob pattern cache | Small (1-2 days) | Low — additive cache, falls back to compilation on miss | P0 — high leverage, minimal risk |
| Multi-threaded zstd | Small (1 day) | Low — one-line API change, format compatible | P0 — immediate CI speedup |
| Criterion benchmarks | Medium (3-5 days) | None — additive infrastructure | P0 — enables all future work |
| Pre-filter excludes before hashing | Small (1-2 days) | Low — compare output sets in tests | P1 — straightforward |
| Eliminate double directory walk | Medium (2-3 days) | Medium — subtle gitignore interaction edge cases | P1 — significant I/O savings |
| Batch retry for file hashing | Small (1-2 days) | Low — EMFILE is already handled, just restructuring | P1 — CI-specific win |
| Parallel turbo.json loading | Medium (3-4 days) | Medium — error aggregation, cycle detection under parallelism | P2 — scales with repo size |
| DashMap for times_saved | Tiny (hours) | Low — drop-in replacement | P2 — minor but free |

Assumptions:

  • Profiling numbers cited are from the existing trace infrastructure (--profile flag). All "expected impact" percentages are estimates based on code analysis, not measured benchmarks — the Criterion suite (P0) is needed to validate them.
  • "Large monorepo" means 100+ packages. Impact scales roughly linearly with package count for items 1, 3, 4, 5, and 7.
  • CI machines have 4+ cores, making multi-threaded zstd worthwhile. Single-core environments see no benefit.
  • The existing test suite (cargo nextest run) provides correctness coverage for all changes. No new test fixtures are needed except for Criterion benchmarks.

## Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.

Exploration

AREA: globwalk-pattern-caching FILES: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs MATURITY: stable RELEVANCE: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. NOTES: (unspecified)

Raw Exploration Notes: AREA: globwalk-pattern-caching FILES: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs MATURITY: stable RELEVANCE: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.

Selected Code Context

===== FILE: crates/turborepo-globwalk/src/lib.rs =====
//! Glob pattern matching and directory walking
//! This is a layer on top of wax that performs some corrections to user
//! provided globs as well as escaping characters that wax considers special,
//! but we do not support.

#![feature(assert_matches)] #![deny(clippy::all)]

use std::{ borrow::Cow, collections::HashSet, path::{Path, PathBuf}, str::FromStr, sync::OnceLock, };

use itertools::Itertools; use path_clean::PathClean; use path_slash::PathExt; use rayon::prelude::*; use regex::Regex; use tracing::debug; use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf, PathError, RelativeUnixPath}; use wax::{ BuildError, Glob, Program, walk::{FileIterator, FilterAny}, };

#[derive(Debug, PartialEq, Clone, Copy)] pub enum WalkType { Files, Folders, All, }

pub use walkdir::Error as WalkDirError; use wax::walk::{Entry, EntryResidue};

#[derive(Debug, thiserror::Error)]
pub enum WalkError {
    // note: wax 0.5 has a lifetime in the BuildError, so we can't use it here
    #[error("bad pattern {0}: {1}")]
    BadPattern(String, Box<BuildError>),
    #[error("invalid path")]
    InvalidPath,
    #[error("walk error: {0}")]
    WalkError(#[from] walkdir::Error),
    #[error(transparent)]
    Path(#[from] PathError),
    #[error(transparent)]
    WaxWalk(#[from] wax::walk::WalkError),
    #[error("Internal error on glob {glob}: {error}")]
    InternalError { glob: String, error: String },
    #[error("IO Error: {0}")]
    IO(#[from] std::io::Error),
}

fn join_unix_like_paths(a: &str, b: &str) -> String { [a.trim_end_matches('/'), "/", b.trim_start_matches('/')].concat() }

fn glob_literals() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"(?P<literal>[?*$:<>()\[\]{},])").unwrap())
}

fn escape_glob_literals(literal_glob: &str) -> Cow<'_, str> {
    glob_literals().replace_all(literal_glob, "\\$literal")
}

#[tracing::instrument(skip(include, exclude))]
fn preprocess_paths_and_globs<S: AsRef<str>>(
    base_path: &AbsoluteSystemPath,
    include: &[S],
    exclude: &[S],
) -> Result<(PathBuf, Vec<String>, Vec<String>), WalkError> {
    let raw_slash = base_path
        .as_std_path()
        .to_slash()
        .ok_or(WalkError::InvalidPath)?;
    let base_path_slash = escape_glob_literals(&raw_slash);

let (include_paths, lowest_segment) = {
    let mut paths = Vec::with_capacity(include.len());
    let mut lowest = usize::MAX;
    for s in include {
        let mut fixed = fix_glob_pattern(s.as_ref()).into_owned();
        // We need to check inclusion globs before the join
        // as to_slash doesn't preserve Windows drive names.
        add_doublestar_to_dir(base_path, &mut fixed);
        let joined = join_unix_like_paths(&base_path_slash, &fixed);
        if let Some((collapsed, segment)) = collapse_path(&joined) {
            lowest = std::cmp::min(lowest, segment);
            paths.push(collapsed.into_owned());
        }
    }
    (paths, lowest)
};

let base_path = base_path
    .components()
    .take(
        // this can be usize::MAX if there are no include paths
        lowest_segment.saturating_add(1),
    )
    .collect::<PathBuf>();

let mut exclude_paths = Vec::with_capacity(exclude.len() * 2);
for s in exclude {
    let fixed = fix_glob_pattern(s.as_ref());
    let joined = join_unix_like_paths(&base_path_slash, fixed.as_ref());
    if let Some((collapsed, _)) = collapse_path(&joined) {
        add_trailing_double_star(&mut exclude_paths, &collapsed);
    }
}

Ok((base_path, include_paths, exclude_paths))

}

fn double_doublestar() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"\*\*(?:/\*\*)+").unwrap())
}

fn leading_doublestar() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"\*\*(?P<suffix>[^*/]+)").unwrap())
}

fn trailing_doublestar() -> &'static Regex {
    static RE: OnceLock<Regex> = OnceLock::new();
    RE.get_or_init(|| Regex::new(r"(?P<prefix>[^*/]+)\*\*").unwrap())
}

pub fn fix_glob_pattern(pattern: &str) -> Cow<'_, str> {
    // On Unix, Path::new(pattern).to_slash() is a no-op that returns Cow::Borrowed.
    // Skip the roundtrip entirely on Unix to avoid the overhead.
    #[cfg(not(windows))]
    let p0: Cow<'_, str> = Cow::Borrowed(pattern);

#[cfg(windows)]
let p0: Cow<'_, str> = {
    // path-slash strips trailing slashes from Windows paths, so we need to restore
    // them.
    let needs_trailing_slash = pattern.ends_with('/') || pattern.ends_with('\\');
    let converted = Path::new(pattern)
        .to_slash()
        .expect("failed to roundtrip through Path");
    if needs_trailing_slash && !converted.ends_with('/') {
        Cow::Owned(format!("{converted}/"))
    } else {
        converted
    }
};

// Chain regex replacements, taking advantage of Cow<str>:
// - If no match, replace() returns Cow::Borrowed pointing to the input
// - If match, replace() returns Cow::Owned with the replacement
// This avoids allocations when patterns don't need modification.
let p1 = double_doublestar().replace(&p0, "**");
let p2 = leading_doublestar().replace(&p1, "**/*$suffix");
let p3 = trailing_doublestar().replace(&p2, "$prefix*/**");

// Determine if we can return a borrowed reference to the original pattern.
// On Unix, if no regex matched, all Cows in the chain are Borrowed and
// transitively reference `pattern`. We can detect this by checking if
// p3's pointer equals pattern's pointer.
//
// On Windows, p0 may be Owned (if path conversion changed something), so
// we can only return Borrowed if p0 was also Borrowed.
match p3 {
    Cow::Borrowed(s) if std::ptr::eq(s, pattern) => {
        // No allocations occurred anywhere in the chain
        Cow::Borrowed(pattern)
    }
    Cow::Borrowed(s) => {
        // p3 is borrowed from an intermediate owned string (some regex matched).
        // We need to allocate to return an owned copy.
        Cow::Owned(s.to_string())
    }
    Cow::Owned(s) => Cow::Owned(s),
}

}

/// collapse a path, returning a new path with all the dots and dotdots removed
///
/// also returns the position in the path of the first encountered collapse,
/// for the purposes of calculating the new base path
fn collapse_path(path: &str) -> Option<(Cow<'_, str>, usize)> {
    let mut stack: Vec<&str> = vec![];
    let mut changed = false;
    let is_root = path.starts_with('/');

// the index of the lowest segment that was collapsed
// this is defined as the lowest stack size after a collapse
let mut lowest_index = None;

for segment in path.trim_start_matches('/').split('/') {
    match segment {
        ".." => {
            stack.pop()?;
            // Set this value post-pop so that we capture
            // the remaining prefix, and not the segment we're
            // about to remove. Note that this gets papered over
            // below when we compare against the current stack length.
            lowest_index.get_or_insert(stack.len());
            changed = true;
        }
        "." => {
            lowest_index.get_or_insert(stack.len());
            changed = true;
        }
        _ => stack.push(segment),
    }
    if let Some(lowest_index) = lowest_index.as_mut() {
        *lowest_index = (*lowest_index).min(stack.len());
    }
}

let lowest_index = lowest_index.unwrap_or(stack.len());
if !changed {
    Some((Cow::Borrowed(path), lowest_index))
} else {
    let string = if is_root {
        std::iter::once("").chain(stack).join("/")
    } else {
        stack.join("/")
    };

    Some((Cow::Owned(string), lowest_index))
}

}

fn add_trailing_double_star(exclude_paths: &mut Vec<String>, glob: &str) {
    if let Some(stripped) = glob.strip_suffix('/') {
        if stripped.ends_with("**") {
            exclude_paths.push(stripped.to_string());
        } else {
            exclude_paths.push(format!("{glob}**"));
        }
    } else if glob.ends_with("/**") {
        exclude_paths.push(glob.to_string());
    } else {
        // Match Go globby behavior. If the glob doesn't already end in /**, add it
        // We use the unix style operator as wax expects unix style paths
        exclude_paths.push(format!("{glob}/**"));
        exclude_paths.push(glob.to_string());
    }
}

fn add_doublestar_to_dir(base: &AbsoluteSystemPath, glob: &mut String) {
    // If the glob has a glob literal in it e.g. `*`
    // then skip trying to read it as a file path.
    if glob_literals().is_match(&*glob) {
        return;
    }

// Globs are given in unix style
let Ok(glob_path) = RelativeUnixPath::new(&*glob) else {
    // Glob isn't valid relative unix path so can't check if dir
    debug!("'{glob}' isn't valid path");
    return;
};

let path = base.join_unix_path(glob_path);

let Ok(metadata) = path.symlink_metadata() else {
    debug!("'{path}' doesn't have metadata");
    return;
};

if !metadata.is_dir() {
    return;
}

debug!("'{path}' is a directory");

// Glob points to a dir, must add **
if !glob.ends_with('/') {
    glob.push('/');
}
glob.push_str("**");

}

fn glob_with_contextual_error<S: AsRef<str>>(raw: S) -> Result<Glob<'static>, WalkError> {
    let raw = raw.as_ref();
    Glob::new(raw)
        .map(|g| g.into_owned())
        .map_err(|e| WalkError::BadPattern(raw.to_string(), Box::new(e)))
}

#[derive(Debug, thiserror::Error)] #[error("Invalid globwalking input {raw_input}: {reason}")] pub struct GlobError { raw_input: String, reason: String, }

#[derive(Debug, Default, Copy, Clone)]
pub struct Settings {
    /// Don't recurse into a directory if it contains a package.json file.
    /// NOTE: If globbing from the root of a workspace, this setting will cause
    /// us to not recurse into the individual packages. Therefore, only use this
    /// setting if you are globbing in an individual package and not at the
    /// workspace root.
    ignore_nested_packages: bool,
}

impl Settings { pub fn ignore_nested_packages(mut self) -> Self { self.ignore_nested_packages = true; self } }

/// ValidatedGlob.
///
/// Represents an input string that we have either validated or
/// modified to fit our constraints. It does not yet validate that the glob is
/// a valid glob pattern, just that we have checked for unix format, ':'s, clean
/// paths, etc.
#[derive(Clone, Debug)]
pub struct ValidatedGlob {
    inner: String,
}

impl ValidatedGlob { pub fn as_str(&self) -> &str { self.inner.as_str() } }

impl AsRef<str> for ValidatedGlob {
    fn as_ref(&self) -> &str {
        self.inner.as_str()
    }
}

impl FromStr for ValidatedGlob { type Err = GlobError;

fn from_str(s: &str) -> Result<Self, Self::Err> {
    // Valid globs:
    // 1. are to_slash'd,
    // 2. are relative
    // 3. directory traversals are leading or not at all
    // 4. single `.`s removed
    // 5. colons escaped on unix, error on windows

    #[cfg(not(windows))]
    {
        // Fast path: check if cleaning is needed by looking for `.` or `..` segments
        let needs_cleaning = needs_path_cleaning(s);

        let cleaned: Cow<'_, str> = if needs_cleaning {
            // Only allocate when we actually need to clean
            let path = Path::new(s);
            let cleaned_path = path.clean();
            Cow::Owned(cleaned_path.to_str().expect("valid utf-8").to_owned())
        } else {
            Cow::Borrowed(s)
        };

        // Strip leading slashes
        let without_leading_slash = cleaned.trim_start_matches('/');

        // Only allocate for colon escaping if colons are present (rare case)
        let result = if without_leading_slash.contains(':') {
            without_leading_slash.replace(':', "\\:")
        } else {
            without_leading_slash.to_owned()
        };

        Ok(Self { inner: result })
    }

    #[cfg(windows)]
    {
        // On Windows, we need to convert backslashes to forward slashes
        let needs_slash_conversion = s.contains('\\');
        let needs_cleaning = needs_path_cleaning(s);

        // If we need either conversion, we have to allocate
        let processed: Cow<'_, str> = if needs_slash_conversion || needs_cleaning {
            let path = Path::new(s);
            let cleaned_path = path.clean();
            let cleaned_str = cleaned_path.to_str().expect("valid utf-8");
            if needs_slash_conversion || cleaned_str.contains('\\') {
                Cow::Owned(cleaned_str.replace('\\', "/"))
            } else {
                Cow::Owned(cleaned_str.to_owned())
            }
        } else {
            Cow::Borrowed(s)
        };

        // Check for invalid ':' character on Windows
        if let Some(index) = processed.find(':') {
            return Err(GlobError {
                raw_input: s.to_owned(),
                reason: format!(
                    "Found invalid windows relative path character ':' at position {index}"
                ),
            });
        }

        Ok(Self {
            inner: processed.into_owned(),
        })
    }
}

}

/// Check if a path string needs cleaning (contains `.` or `..` segments).
/// This is a fast check to avoid allocations in the common case.
#[inline]
fn needs_path_cleaning(s: &str) -> bool {
    // We need to check for actual path segments, not just any dot.
    // A segment is defined by being surrounded by '/' or at start/end.
    let bytes = s.as_bytes();
    let len = bytes.len();

    let mut i = 0;
    while i < len {
        // Check for segment start (beginning of string or after '/')
        let at_segment_start = i == 0 || bytes[i - 1] == b'/';

        if at_segment_start && bytes[i] == b'.' {
            // Check for "." segment (single dot followed by '/' or end)
            if i + 1 == len || bytes[i + 1] == b'/' {
                return true;
            }
            // Check for ".." segment (double dot followed by '/' or end)
            if i + 1 < len && bytes[i + 1] == b'.' && (i + 2 == len || bytes[i + 2] == b'/') {
                return true;
            }
        }
        i += 1;
    }
    false
}

/// Returns true if the pattern contains glob metacharacters (*, ?, [, {).
/// Literal file paths return false.
pub fn is_glob_pattern(pattern: &str) -> bool {
    // Check for unescaped glob metacharacters
    let mut chars = pattern.chars().peekable();
    while let Some(c) = chars.next() {
        if c == '\\' {
            // Skip escaped character
            chars.next();
            continue;
        }
        if matches!(c, '*' | '?' | '[' | '{') {
            return true;
        }
    }
    false
}

pub fn globwalk_with_settings(
    base_path: &AbsoluteSystemPath,
    include: &[ValidatedGlob],
    exclude: &[ValidatedGlob],
    walk_type: WalkType,
    settings: Settings,
) -> Result<HashSet<AbsoluteSystemPathBuf>, WalkError> {
    let compiled = compile_globs(base_path, include, exclude)?;
    retry_on_emfile(|| walk_compiled_globs(&compiled, walk_type, settings))
}

pub fn globwalk(
    base_path: &AbsoluteSystemPath,
    include: &[ValidatedGlob],
    exclude: &[ValidatedGlob],
    walk_type: WalkType,
) -> Result<HashSet<AbsoluteSystemPathBuf>, WalkError> {
    let compiled = compile_globs(base_path, include, exclude)?;
    retry_on_emfile(|| walk_compiled_globs(&compiled, walk_type, Default::default()))
}

fn is_too_many_open_files(err: &WalkError) -> bool {
    // visit_file converts all walkdir/wax errors into WalkError::IO,
    // so this is the only variant we need to check.
    let WalkError::IO(e) = err else {
        return false;
    };

    #[cfg(unix)]
    {
        e.raw_os_error() == Some(24) // EMFILE
    }
    #[cfg(windows)]
    {
        e.raw_os_error() == Some(4) // ERROR_TOO_MANY_OPEN_FILES
    }
    #[cfg(not(any(unix, windows)))]
    {
        false
    }
}

fn retry_on_emfile<F>(mut f: F) -> Result<HashSet<AbsoluteSystemPathBuf>, WalkError>
where
    F: FnMut() -> Result<HashSet<AbsoluteSystemPathBuf>, WalkError>,
{
    const MAX_RETRIES: u32 = 10;
    const BASE_DELAY_MS: u64 = 10;
    const MAX_DELAY_MS: u64 = 1000;

    for attempt in 0..MAX_RETRIES {
        match f() {
            Ok(result) => return Ok(result),
            Err(err) if is_too_many_open_files(&err) => {
                let delay = std::cmp::min(BASE_DELAY_MS * 2u64.pow(attempt), MAX_DELAY_MS);
                debug!(
                    attempt = attempt + 1,
                    delay_ms = delay,
                    "too many open files, retrying globwalk"
                );
                std::thread::sleep(std::time::Duration::from_millis(delay));
            }
            Err(err) => return Err(err),
        }
    }

    // Final attempt — propagate whatever happens.
    f()
}

struct CompiledGlobs {
    base_path: PathBuf,
    include_patterns: Vec<Glob<'static>>,
    ex_patterns: Vec<Glob<'static>>,
    ex_filter: FilterAny,
}

#[tracing::instrument(skip(include, exclude))]
fn compile_globs<S: AsRef<str>>(
    base_path: &AbsoluteSystemPath,
    include: &[S],
    exclude: &[S],
) -> Result<CompiledGlobs, WalkError> {
    let (base_path_new, include_paths, exclude_paths) =
        preprocess_paths_and_globs(base_path, include, exclude)?;

    let ex_patterns: Vec<Glob<'static>> = exclude_paths
        .into_iter()
        .map(glob_with_contextual_error)
        .collect::<Result<_, _>>()?;

    let ex_filter = FilterAny::any(ex_patterns.clone())
        .map_err(|e| WalkError::BadPattern("exclusion".into(), Box::new(e)))?;

    let include_patterns = include_paths
        .into_par_iter()
        .map(glob_with_contextual_error)
        .collect::<Result<Vec<_>, _>>()?;

    Ok(CompiledGlobs {
        base_path: base_path_new,
        include_patterns,
        ex_patterns,
        ex_filter,
    })
}

/// Try to decompose a glob pattern relative to base_path into
/// (literal_prefix, literal_suffix) separated by a single * segment.
///
/// For example, with base /repo and a glob pattern matching
/// /repo/packages/*/package.json, this returns
/// Some(("/repo/packages", "package.json")).
///
/// Returns None if the pattern doesn't have exactly one * wildcard segment
/// or contains **, ?, [, or { metacharacters.
fn try_decompose_shallow_wildcard(
    base_path: &Path,
    glob: &Glob<'_>,
) -> Option<(PathBuf, PathBuf)> {
    // Wax glob patterns are relative to the base path that walk is called
    // with, but preprocess_paths_and_globs joins them with the absolute base
    // path before compilation. So glob.to_string() returns an absolute path
    // like /repo/packages/*/package.json. We need to work with the full
    // absolute path and extract the prefix/suffix around the *.
    let pattern = glob.to_string();

// Quick reject: skip patterns with complex metacharacters.
if pattern.contains("**")
    || pattern.contains('?')
    || pattern.contains('[')
    || pattern.contains('{')
{
    return None;
}

// Split on '/' and find segments containing `*`. Filter out empty segments
// (leading '/' on absolute paths creates one).
let segments: Vec<&str> = pattern.split('/').collect();
let star_positions: Vec<usize> = segments
    .iter()
    .enumerate()
    .filter(|(_, s)| !s.is_empty() && s.contains('*'))
    .map(|(i, _)| i)
    .collect();

// Must have exactly one segment containing `*`, and it must be exactly `*`.
if star_positions.len() != 1 {
    return None;
}
let star_idx = star_positions[0];
if segments[star_idx] != "*" {
    return None;
}

// Require at least one suffix segment after the star — patterns like `dir/*`
// match files directly and need the full walker.
if star_idx + 1 >= segments.len() {
    return None;
}

// Build the literal prefix from the segments before the star.
// For absolute paths like "/repo/packages", rejoin with '/' separator.
let prefix: PathBuf = segments[..star_idx].join("/").into();

// Build the literal suffix from segments after the star.
let suffix: PathBuf = segments[star_idx + 1..].iter().collect();

// The prefix must be an existing directory.
if !prefix.is_dir() {
    return None;
}

Some((prefix, suffix))

}

fn walk_compiled_globs(
    compiled: &CompiledGlobs,
    walk_type: WalkType,
    settings: Settings,
) -> Result<HashSet<AbsoluteSystemPathBuf>, WalkError> {
    // Partition globs into three categories for optimal dispatch:
    //
    // 1. Invariant (no wildcards) — resolved via a single stat syscall.
    // 2. Shallow wildcard (<prefix>/*/<suffix>) — expanded via readdir on the
    //    prefix dir, then each child is checked in parallel. This avoids the
    //    walkdir overhead for patterns like packages/*/package.json where a
    //    single glob walker would sequentially traverse hundreds of entries.
    // 3. General variant — full wax directory walk (one rayon task per pattern).
    let mut literal_results: Vec<AbsoluteSystemPathBuf> = Vec::new();
    let mut shallow_wildcards: Vec<(PathBuf, PathBuf)> = Vec::new();
    let mut variant_globs: Vec<&Glob<'static>> = Vec::new();

for glob in &compiled.include_patterns {
    if let Some(path) = glob.variance().path() {
        // Invariant: no wildcards at all — single stat.
        let full_path = compiled.base_path.join(path);
        let dominated = full_path
            .symlink_metadata()
            .ok()
            .is_some_and(|m| match walk_type {
                WalkType::Files => !m.is_dir(),
                WalkType::Folders => m.is_dir(),
                WalkType::All => true,
            });
        if dominated && let Ok(abs) = AbsoluteSystemPathBuf::try_from(full_path.as_path()) {
            literal_results.push(abs);
        }
    } else if let Some(decomposed) = try_decompose_shallow_wildcard(&compiled.base_path, glob) {
        shallow_wildcards.push(decomposed);
    } else {
        variant_globs.push(glob);
    }
}

// Expand shallow wildcards: readdir each prefix, then stat candidates in
// parallel. This turns one slow sequential walk (e.g. 54ms for 603 dirs)
// into a batch of parallel stat calls spread across all rayon workers.
let shallow_candidates: Vec<PathBuf> = shallow_wildcards
    .iter()
    .flat_map(|(prefix, suffix)| {
        let Ok(entries) = std::fs::read_dir(prefix) else {
            return Vec::new();
        };
        entries
            .filter_map(|e| e.ok())
            .map(|e| e.path().join(suffix))
            .collect::<Vec<_>>()
    })
    .collect();

let ex_patterns = &compiled.ex_patterns;
let shallow_results: Vec<AbsoluteSystemPathBuf> = shallow_candidates
    .par_iter()
    .filter_map(|candidate| {
        let meta = candidate.symlink_metadata().ok()?;
        let dominated = match walk_type {
            WalkType::Files => !meta.is_dir(),
            WalkType::Folders => meta.is_dir(),
            WalkType::All => true,
        };
        if !dominated {
            return None;
        }
        // Check exclusion patterns. Exclusion globs are compiled from
        // absolute paths (e.g. `/repo/**/node_modules/**`), so match
        // against the full candidate path using slash-normalized form for
        // cross-platform correctness.
        if !ex_patterns.is_empty() {
            let candidate_str = candidate.to_slash_lossy();
            if ex_patterns
                .iter()
                .any(|ex| ex.is_match(candidate_str.as_ref()))
            {
                return None;
            }
        }
        AbsoluteSystemPathBuf::try_from(candidate.as_path()).ok()
    })
    .collect();

let mut results: HashSet<AbsoluteSystemPathBuf> = variant_globs
    .par_iter()
    .flat_map_iter(|glob| {
        walk_glob(
            walk_type,
            &compiled.base_path,
            compiled.ex_filter.clone(),
            glob,
            settings,
        )
    })
    .collect::<Result<HashSet<_>, _>>()?;

results.extend(literal_results);
results.extend(shallow_results);
Ok(results)

}

#[tracing::instrument(skip_all)]
fn walk_glob(
    walk_type: WalkType,
    base_path_new: &Path,
    ex_filter: FilterAny,
    glob: &Glob<'static>,
    settings: Settings,
) -> Vec<Result<AbsoluteSystemPathBuf, WalkError>> {
    let iter = glob.walk(base_path_new).not_any(ex_filter);

    if settings.ignore_nested_packages {
        iter.filter_entry(|entry| {
            let path = entry.path();
            if path.is_dir() && path != base_path_new && path.join("package.json").exists() {
                return Some(EntryResidue::Tree);
            }

            None
        })
        .filter_map(|entry| visit_file(walk_type, entry))
        .collect::<Vec<_>>()
    } else {
        iter.filter_map(|entry| visit_file(walk_type, entry))
            .collect::<Vec<_>>()
    }
}

fn visit_file(
    walk_type: WalkType,
    entry: Result<wax::walk::GlobEntry, wax::walk::WalkError>,
) -> Option<Result<AbsoluteSystemPathBuf, WalkError>> {
    match entry {
        Ok(entry) if walk_type == WalkType::Files && entry.file_type().is_dir() => None,
        Ok(entry) => Some(AbsoluteSystemPathBuf::try_from(entry.path()).map_err(|e| e.into())),
        Err(e) => {
            let io_err = std::io::Error::from(e);
            match io_err.kind() {
                // Ignore missing file and permission errors
                std::io::ErrorKind::NotFound | std::io::ErrorKind::PermissionDenied => None,
                _ => Some(Err(io_err.into())),
            }
        }
    }
}

#[cfg(test)]
mod test {
    use std::{collections::HashSet, str::FromStr};

use itertools::Itertools;
use tempfile::TempDir;
use test_case::test_case;
use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf};

use crate::{
    Settings, ValidatedGlob, WalkError, WalkType, add_doublestar_to_dir, collapse_path,
    escape_glob_literals, fix_glob_pattern, globwalk, needs_path_cleaning,
};

#[cfg(unix)]
const ROOT: &str = "/";
#[cfg(windows)]
const ROOT: &str = "C:\\";
#[cfg(unix)]
const GLOB_ROOT: &str = "/";
#[cfg(windows)]
const GLOB_ROOT: &str = "C\\:/"; // in globs, expect an escaped ':' token

#[test_case("a", "a" ; "no change")]
#[test_case("**/**", "**")]
#[test_case("**/**/**", "**" ; "Triple doublestar")]
#[test_case("**token/foo", "**/*token/foo")]
#[test_case("**token**", "**/*token*/**")]
fn test_fix_glob_pattern(input: &str, expected: &str) {
    let output = fix_glob_pattern(input);
    assert_eq!(output.as_ref(), expected);
}

#[test]
#[cfg(not(windows))]
fn test_fix_glob_pattern_returns_borrowed_when_no_change() {
    use std::borrow::Cow;
    // On Unix, when no regex matches, we should get a borrowed reference
    // to the original string, avoiding allocation.
    let input = "packages/*/src/**/*.ts";
    let output = fix_glob_pattern(input);
    assert!(
        matches!(output, Cow::Borrowed(_)),
        "expected Cow::Borrowed for pattern that needs no modification"
    );
    // Verify it's actually the same memory
    assert!(std::ptr::eq(output.as_ref(), input));
}

#[test]
fn test_fix_glob_pattern_returns_owned_when_changed() {
    use std::borrow::Cow;
    // When a regex matches, we should get an owned string
    let input = "**/**";
    let output = fix_glob_pattern(input);
    assert!(
        matches!(output, Cow::Owned(_)),
        "expected Cow::Owned for pattern that needs modification"
    );
    assert_eq!(output.as_ref(), "**");
}

#[test_case("a/./././b", "a/b", 1 ; "test path with dot segments")]
#[test_case("a/../b", "b", 0 ; "test path with dotdot segments")]
#[test_case("a/./../b", "b", 0 ; "test path with mixed dot and dotdot segments")]
#[test_case("./a/b", "a/b", 0 ; "test path starting with dot segment")]
#[test_case("a/b/..", "a", 1 ; "test path ending with dotdot segment")]
#[test_case("a/b/.", "a/b", 2 ; "test path ending with dot segment")]
#[test_case("a/.././b", "b", 0 ; "test path with mixed and consecutive ./ and ../ segments")]
#[test_case("/a/./././b", "/a/b", 1 ; "test path with leading / and ./ segments")]
#[test_case("/a/../b", "/b", 0 ; "test path with leading / and dotdot segments")]
#[test_case("/a/./../b", "/b", 0 ; "test path with leading / and mixed dot and dotdot segments")]
#[test_case("/./a/b", "/a/b", 0 ; "test path with leading / and starting with dot segment")]
#[test_case("/a/b/..", "/a", 1 ; "test path with leading / and ending with dotdot segment")]
#[test_case("/a/b/.", "/a/b", 2 ; "test path with leading / and ending with dot segment")]
#[test_case("/a/.././b", "/b", 0 ; "test path with leading / and mixed and consecutive dot and dotdot segments")]
#[test_case("/a/b/c/../../d/e/f/g/h/i/../j", "/a/d/e/f/g/h/j", 1 ; "leading collapse followed by shorter one")]
fn test_collapse_path(glob: &str, expected: &str, earliest_collapsed_segment: usize) {
    let (glob, segment) = collapse_path(glob).unwrap();
    assert_eq!(glob, expected);
    assert_eq!(segment, earliest_collapsed_segment);
}

#[test_case("../a/b" ; "test path starting with ../ segment should return None")]
#[test_case("/../a" ; "test path with leading dotdotdot segment should return None")]
fn test_collapse_path_not(glob: &str) {
    assert_eq!(collapse_path(glob), None);
}

#[test_case("a/b/c/d", &["/e/../../../f"], &[], "a/b", None, None ; "can traverse beyond the root")]
#[test_case("a/b/c/d/", &["/e/../../../f"], &[], "a/b", None, None ; "can handle slash-trailing base path")]
#[test_case("a/b/c/d/", &["e/../../../f"], &[], "a/b", None, None ; "can handle no slash on glob")]
#[test_case("a/b/c/d", &["e/../../../f"], &[], "a/b", None, None ; "can handle no slash on either")]
#[test_case("a/b/c/d", &["/e/f/../g"], &[], "a/b/c/d", None, None ; "can handle no collapse")]
#[test_case("a/b/c/d", &["./././../.."], &[], "a/b", None, None ; "can handle dot followed by dotdot")]
#[test_case("a/b/c/d", &["**"], &["**/"], "a/b/c/d", None, Some(&["a/b/c/d/**"]) ; "can handle dot followed by dotdot and dot")]
#[test_case("a/b/c", &["**"], &["d/"], "a/b/c", None, Some(&["a/b/c/d/**"]) ; "will exclude all subfolders")]
#[test_case("a/b/c", &["**"], &["d"], "a/b/c", None, Some(&["a/b/c/d/**", "a/b/c/d"]) ; "will exclude all subfolders and file")]
fn preprocess_paths_and_globs(
    base_path: &str,
    include: &[&str],
    exclude: &[&str],
    base_path_exp: &str,
    include_exp: Option<&[&str]>,
    exclude_exp: Option<&[&str]>,
) {
    let raw_path = format!("{ROOT}{base_path}");
    let base_path = AbsoluteSystemPathBuf::new(raw_path).unwrap();
    let include = include.iter().map(|s| s.to_string()).collect_vec();
    let exclude = exclude.iter().map(|s| s.to_string()).collect_vec();

    let (result_path, include, exclude) =
        super::preprocess_paths_and_globs(&base_path, &include, &exclude).unwrap();

    let expected = format!(
        "{}{}",
        ROOT,
        base_path_exp.replace('/', std::path::MAIN_SEPARATOR_STR)
    );
    assert_eq!(result_path.to_string_lossy(), expected);

    if let Some(include_exp) = include_exp {
        assert_eq!(
            include,
            include_exp
                .iter()
                .map(|s| format!("{GLOB_ROOT}{s}"))
                .collect_vec()
                .as_slice()
        );
    }

    if let Some(exclude_exp) = exclude_exp {
        assert_eq!(
            exclude,
            exclude_exp
                .iter()
                .map(|s| format!("{GLOB_ROOT}{s}"))
                .collect_vec()
                .as_slice()
        );
    }
}

/// set up a globwalk test in a tempdir, returning the path to the tempdir
fn setup() -> tempfile::TempDir {
    let tmp = tempfile::TempDir::with_prefix("globwalk").unwrap();

    let directories = ["a/b/c", "a/c", "abc", "axbxcxdxe/xxx", "axbxcxdxexxx", "b"];

    let files = [
        "a/abc",
        "a/b/c/d",
        "a/c/b",
        "abc/b",
        "abcd",
        "abcde",
        "abxbbxdbxebxczzx",
        "abxbbxdbxebxczzy",
        "axbxcxdxe/f",
        "axbxcxdxe/xxx/f",
        "axbxcxdxexxx/f",
        "axbxcxdxexxx/fff",
        "a☺b",
        "b/c",
        "c",
        "x",
        "xxx",
        "z",
        "α",
        "abc/【test】.txt",
    ];

    for dir in directories.iter() {
        std::fs::create_dir_all(tmp.path().join(dir)).unwrap();
    }

    for file in files.iter() {
        std::fs::File::create(tmp.path().join(file)).unwrap();
    }

    #[cfg(unix)]
    {
        // these files/symlinks won't work on Windows
        std::fs::File::create(tmp.path().join("-")).unwrap();
        std::fs::File::create(tmp.path().join("]")).unwrap();

        std::os::unix::fs::symlink("../axbxcxdxe/", tmp.path().join("b/symlink-dir")).unwrap();
        std::os::unix::fs::symlink(
            "/tmp/nonexistant-file-20160902155705",
            tmp.path().join("broken-symlink"),
        )
        .unwrap();
        std::os::unix::fs::symlink("a/b", tmp.path().join("working-symlink")).unwrap();
    }
    tmp
}

#[test_case("a*/**", 22, 22 => matches None ; "wildcard followed by doublestar")]
#[test_case("**/*f", 4, 4 => matches None ; "leading doublestar expansion")]
#[test_case("**f", 4, 4 => matches None ; "transform leading doublestar")]
#[test_case("a**", 22, 22 => matches None ; "transform trailing doublestar")]
#[test_case("abc", 3, 3 => matches None ; "exact match")]
#[test_case("*", 19, 15 => matches None ; "single star match")]
#[test_case("*c", 2, 2 => matches None ; "single star suffix match")]
#[test_case("a*", 9, 9 => matches None ; "single star prefix match")]
#[test_case("a*/b", 2, 2 => matches None ; "single star prefix with suffix match")]
#[test_case("a*b*c*d*e*", 3, 3 => matches None ; "multiple single stars match")]
#[test_case("a*b*c*d*e*/f", 2, 2 => matches None ; "single star and double star match")]
#[test_case("a*b?c*x", 2, 2 => matches None ; "single star and question mark match")]
#[test_case("ab[c]", 1, 1 => matches None ; "character class match")]
#[test_case("ab[b-d]", 1, 1 => matches None ; "character class range match")]
#[test_case("ab[e-g]", 0, 0 => matches None ; "character class range mismatch")]
#[test_case("a?b", 1, 1 => matches None ; "question mark unicode match")]
#[test_case("a[!a]b", 1, 1 => matches None ; "negated character class unicode match 2")]
#[test_case("a???b", 0, 0 => matches None ; "insufficient question marks mismatch")]
#[test_case("a[^a][^a][^a]b", 0, 0 => matches None ; "multiple negated character classes mismatch")]
#[test_case("a?b", 1, 1 => matches None ; "question mark not matching slash")]
#[test_case("a*b", 1, 1 => matches None ; "single star not matching slash 2")]
#[test_case("[x-]", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "trailing dash in character class fail")]
#[test_case("[-x]", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "leading dash in character class fail")]
#[test_case("[a-b-d]", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "dash within character class range fail")]
#[test_case("[a-b-x]", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "dash within character class range fail 2")]
#[test_case("[", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "unclosed character class error")]
#[test_case("[^", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "unclosed negated character class error")]
#[test_case("[^bc", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "unclosed negated character class error 2")]
#[test_case("a[", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "unclosed character class error after pattern")]
#[test_case("ad[", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "unclosed character class error after pattern 2")]
#[test_case("*x", 4, 4 => matches None ; "star pattern match")]
#[test_case("[abc]", 3, 3 => matches None ; "single character class match")]
#[test_case("a/**", 7, 7 => matches None ; "a followed by double star match")]
#[test_case("**/c", 4, 4 => matches None ; "double star and single subdirectory match")]
#[test_case("a/**/b", 2, 2 => matches None ; "a followed by double star and single subdirectory match")]
#[test_case("a/**/c", 2, 2 => matches None ; "a followed by double star and multiple subdirectories match 2")]
#[test_case("a/**/d", 1, 1 => matches None ; "a followed by double star and multiple subdirectories with target match")]
#[test_case("a/b/c", 2, 2 => matches None ; "a followed by subdirectories and double slash mismatch")]
#[test_case("ab{c,d}", 1, 1 => matches None ; "pattern with curly braces match")]
#[test_case("ab{c,d,*}", 5, 5 => matches None ; "pattern with curly braces and wildcard match")]
#[test_case("ab{c,d}[", 0, 0 => matches Some(WalkError::BadPattern(_, _)))]
#[test_case("a{,bc}", 0, 0 => matches Some(WalkError::BadPattern(_, _)) ; "a followed by comma or b or c")]
#[test_case("a/{b/c,c/b}", 2, 2 => matches None)]
#[test_case("{a/{b,c},abc}", 3, 3 => matches None)]
#[test_case("{a/ab*}", 1, 1 => matches None)]
#[test_case("a/*", 3, 3 => matches None)]
#[test_case("{a/*}", 3, 3 => matches None ; "curly braces with single star match")]
#[test_case("{a/abc}", 1, 1 => matches None)]
#[test_case("{a/b,a/c}", 2, 2 => matches None)]
#[test_case("abc/**", 3, 3 => matches None ; "abc then doublestar")]
#[test_case("**/abc", 2, 2 => matches None)]
#[test_case("**/*.txt", 1, 1 => matches None)]
#[test_case("**/【*", 1, 1 => matches None ; "star with unicode")]
#[test_case("b/**/f", 0, 0 => matches None)]
fn glob_walk(
    pattern: &str,
    result_count: usize,
    result_count_windows: usize,
) -> Option<WalkError> {
    glob_walk_inner(
        pattern,
        if cfg!(windows) {
            result_count_windows
        } else {
            result_count
        },
    )
}

// these tests were configured to only run on unix, and not on windows
#[cfg(unix)]
// cannot use * as a path token on windows
#[test_case("a\\*b", 0 => matches None ; "escaped star mismatch")]
#[test_case("[\\]a]", 2 => matches None ; "escaped bracket match")]
#[test_case("[\\-]", 1  => matches None; "escaped dash match")]
#[test_case("[x\\-]", 2  => matches None; "escaped dash in character class match")]
#[test_case("[\\-x]", 2  => matches None; "escaped dash and character match")]
// #[test_case("[-]", Some(WalkError::BadPattern("[-]".into())), 0 ; "bare dash in character
// class match")] #[test_case("[x-]", Some(WalkError::BadPattern("[x-]".into())), 0 ;
// "trailing dash in character class match 2")] #[test_case("[-x]",
// Some(WalkError::BadPattern("[-x]".into())), 0 ; "leading dash in character class match 2")]
// #[test_case("[a-b-d]", Some(WalkError::BadPattern("[a-b-d]".into())), 0 ; "dash within
// character class range match 3")] #[test_case("\\",
// Some(WalkError::BadPattern("\\".into())), 0 ; "single backslash error")]
#[test_case("a/\\**", 0  => matches None; "a followed by escaped double star and subdirectories mismatch")]
#[test_case("a/\\[*\\]", 0  => matches None; "a followed by escaped character class and pattern mismatch")]
// in the go implementation, broken-symlink is yielded,
// however in symlink mode, walkdir yields broken symlinks as errors.
// Note that walkdir _always_ follows root symlinks. We handle this in the layer
// above wax.
#[test_case("broken-symlink", 1 => matches None ; "broken symlinks should be yielded")]
// globs that match across a symlink should not follow the symlink
#[test_case("working-symlink/c/*", 0 => matches None ; "working symlink should not be followed")]
#[test_case("working-sym*/*", 0 => matches None ; "working symlink should not be followed 2")]
fn glob_walk_unix(pattern: &str, result_count: usize) -> Option<WalkError> {
    glob_walk_inner(pattern, result_count)
}

fn glob_walk_inner(pattern: &str, result_count: usize) -> Option<WalkError> {
    let dir = setup();

    let path = AbsoluteSystemPathBuf::try_from(dir.path()).unwrap();
    let validated = ValidatedGlob::from_str(pattern).unwrap();
    let success = match super::globwalk(&path, &[validated], &[], crate::WalkType::All) {
        Ok(e) => e.into_iter(),
        Err(e) => return Some(e),
    };

    assert_eq!(
        success.len(),
        result_count,
        "{pattern}: expected {result_count} matches, but got {success:#?}"
    );

    None
}

#[test_case(
    &["/test.txt"],
    "/",
    &["*.txt"],
    &[],
    &["/test.txt"],
    &["/test.txt"],
    Default::default()
    ; "hello world"
)]
#[test_case(
    &["/test.txt", "/subdir/test.txt", "/other/test.txt"],
    "/",
    &["subdir/test.txt", "test.txt"],
    &[],
    &["/subdir/test.txt", "/test.txt"],
    &["/subdir/test.txt", "/test.txt"],
    Default::default()
    ; "bullet files"
)]
#[test_case(&[
        "/external/file.txt",
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/bower_components/readline/package.json",
        "/repos/some-app/examples/package.json",
        "/repos/some-app/node_modules/gulp/bower_components/readline/package.json",
        "/repos/some-app/node_modules/react/package.json",
        "/repos/some-app/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
        "/repos/some-app/test/mocks/kitchen-sink/package.json",
        "/repos/some-app/tests/mocks/kitchen-sink/package.json",
    ],
    "/repos/some-app/",
    &["packages/*/package.json", "apps/*/package.json"], &["**/node_modules/", "**/bower_components/", "**/test/", "**/tests/"],
    &[
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
    ],
    &[
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
    ],
    Default::default()
    ; "finding workspace package.json files"
)]
#[test_case(&[
        "/external/file.txt",
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/bower_components/readline/package.json",
        "/repos/some-app/examples/package.json",
        "/repos/some-app/node_modules/gulp/bower_components/readline/package.json",
        "/repos/some-app/node_modules/react/package.json",
        "/repos/some-app/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
        "/repos/some-app/test/mocks/spanish-inquisition/package.json",
        "/repos/some-app/tests/mocks/spanish-inquisition/package.json",
    ],
    "/repos/some-app/",
    &["**/package.json"],
    &["**/node_modules/", "**/bower_components/", "**/test/", "**/tests/"],
    &[
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/examples/package.json",
        "/repos/some-app/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
    ],
    &[
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/examples/package.json",
        "/repos/some-app/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
    ],
    Default::default()
    ; "excludes unexpected workspace package.json files"
)]
#[test_case(&[
        "/external/file.txt",
        "/repos/some-app/apps/docs/package.json",
        "/repos/some-app/apps/web/package.json",
        "/repos/some-app/bower_components/readline/package.json",
        "/repos/some-app/examples/package.json",
        "/repos/some-app/node_modules/gulp/bower_components/readline/package.json",
        "/repos/some-app/node_modules/react/package.json",
        "/repos/some-app/package.json",
        "/repos/some-app/packages/xzibit/package.json",
        "/repos/some-app/packages/xzibit/node_modules/street-legal/package.json",
        "/repos/some-app/packages/xzibit/node_modules/paint-colors/package.json",
        "/repos/some-app/packages/xzibit/packages/yo-dawg/package.json",
        "/repos/some-app/packages/xzibit/packages/yo-dawg/node_modules/meme/package.json",
        "/repos/some-app/packages/xzibit/packages/yo-dawg/node_modules/yo-dawg/package.json",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
        "/repos/some-app/test/mocks/spanish-inquisition/package.json",
        "/repos/some-app/tests/mocks/spanish-inquisition/package.json",
    ],
    "/repos/some-app/",
    &["packages/**/package.json"],
    &["**/node_modules/", "**/bower_components/", "**/test/", "**/tests/"],
    &[
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
        "/repos/some-app/packages/xzibit/package.json",
        "/repos/some-app/packages/xzibit/packages/yo-dawg/package.json",
    ],
    &[
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/faker/package.json",
        "/repos/some-app/packages/left-pad/package.json",
        "/repos/some-app/packages/xzibit/package.json",
        "/repos/some-app/packages/xzibit/packages/yo-dawg/package.json",
    ],
    Default::default()
    ; "nested packages work")]
#[test_case(&[
        "/external/file.txt",
        "/repos/some-app/package.json",
        "/repos/some-app/index.js",
        "/repos/some-app/just-a-dir/index.js",
        "/repos/some-app/packages/xzibit/package.json",
        "/repos/some-app/packages/xzibit/index.js",
        "/repos/some-app/packages/colors/package.json",
        "/repos/some-app/packages/colors/i

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Now I have all the data needed. Let me produce the comprehensive report.

Current State

Turborepo's Rust codebase has 11 identified performance bottlenecks spanning the entire turbo run critical path. Several recent commits (e.g., b3c0f46da8 parallelizing task hash computation, db01cb4490 fast-pathing shallow wildcard globs, 4b5410b397 reordering engine callbacks) show the team is actively optimizing, but significant wins remain. The codebase has strong tracing instrumentation (#[tracing::instrument] in 49+ files) and built-in Chrome trace profiling (--profile), but no benchmark suite: criterion targets, cargo bench setups, and [[bench]] sections are entirely absent.

Critical path of turbo run (cold start):

  1. Package discovery (sequential turbo.json stats)
  2. Lockfile transitive closure computation
  3. Engine construction (serial turbo.json chain loading)
  4. Task hash computation (wave-based parallel, but contended RwLock)
  5. Cache archive creation/restoration (single-threaded zstd)
  6. Task execution dispatch (redundant clones per task)

Required Changes

Tier 1: High Impact, Low Risk

1. Parallelize turbo.json existence checks in discovery (crates/turborepo-repository/src/discovery.rs:157)

  • Replace .then() with .map().buffer_unordered(64) on the tokio stream
  • Converts O(N) sequential stat calls into bounded-concurrency I/O: wall-clock cost drops from one stat round-trip per package to roughly one per batch of 64
  • Expected win: 50-200ms in 100+ package monorepos (stat latency × package count → stat latency × ⌈package count / 64⌉)
  • Verify:
    cargo build --profile release-turborepo
    # Before/after on a large monorepo:
    hyperfine --warmup 2 './target/release-turborepo/turbo build --dry=json 2>/dev/null'
    # Or with tracing:
    turbo build --profile=discovery-before.trace --dry=json
    # After change:
    turbo build --profile=discovery-after.trace --dry=json
    # Compare discover_packages span in the .trace.md summaries
  • Safety: cargo nextest run -p turborepo-repository; integration tests in turborepo-tests/integration/tests/run/

2. Pre-compile task env wildcards (crates/turborepo-env/src/lib.rs:273)

  • Extend CompiledWildcards usage to hashable_task_env() path — currently only pass_through_env uses pre-compiled regexes
  • The wildcard_map_from_wildcards() function recompiles regexes on every task invocation
  • Expected win: O(tasks × patterns) regex compilations → O(unique_pattern_sets) compilations
  • Verify:
    cargo nextest run -p turborepo-env -p turborepo-task-hash
    turbo build --profile=env-before.trace  # compare hashable_task_env span times
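The compile-once-reuse-everywhere idea can be sketched with standard-library types only. `CompiledWildcard` and `filter_env` below are hypothetical stand-ins: the real code compiles regexes via `CompiledWildcards`, while this sketch reduces a pattern to an exact or trailing-`*` prefix match.

```rust
use std::collections::HashMap;

// Hypothetical stand-in for a pre-compiled wildcard. The real code compiles
// a regex; here a trailing `*` becomes a prefix match so the sketch stays
// dependency-free. The point is the same: parse the pattern string once.
#[derive(Debug, Clone)]
enum CompiledWildcard {
    Exact(String),
    Prefix(String), // pattern ended in `*`
}

impl CompiledWildcard {
    fn compile(pattern: &str) -> Self {
        match pattern.strip_suffix('*') {
            Some(prefix) => Self::Prefix(prefix.to_string()),
            None => Self::Exact(pattern.to_string()),
        }
    }

    fn matches(&self, var: &str) -> bool {
        match self {
            Self::Exact(p) => var == p,
            Self::Prefix(p) => var.starts_with(p),
        }
    }
}

/// Compile once per unique pattern set, then filter any number of task
/// environments without re-parsing the pattern strings.
fn filter_env(
    compiled: &[CompiledWildcard],
    env: &HashMap<String, String>,
) -> HashMap<String, String> {
    env.iter()
        .filter(|(k, _)| compiled.iter().any(|w| w.matches(k)))
        .map(|(k, v)| (k.clone(), v.clone()))
        .collect()
}
```

The per-task hot loop then only calls `matches`, which is a string comparison rather than a regex build.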

3. Split TaskHashTracker into per-field locks (crates/turborepo-task-hash/src/lib.rs:226)

  • Replace single RwLock<TaskHashTrackerState> with individual DashMap or per-field RwLock for each of the 5 maps
  • Currently a write to package_task_hashes blocks reads from package_task_env_vars, etc.
  • Expected win: Eliminates cross-wave serialization in parallel hash computation; biggest impact on repos with deep dependency graphs where many tasks compute hashes simultaneously
  • Verify:
    cargo nextest run -p turborepo-task-hash
    # Profile with tracing to measure calculate_hashes span:
    turbo build --profile=tracker-before.trace
    # After: same, compare wave completion times in Chrome trace
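A minimal sketch of the lock split, using per-field `std::sync::RwLock`s. Field names are illustrative, not the real `TaskHashTrackerState` layout:

```rust
use std::collections::HashMap;
use std::sync::RwLock;

// Hypothetical two-field tracker: instead of one RwLock around a struct
// holding every map, each map gets its own lock, so a writer to one map
// no longer blocks readers of the others.
#[derive(Default)]
struct SplitTracker {
    task_hashes: RwLock<HashMap<String, String>>,
    task_env_vars: RwLock<HashMap<String, Vec<String>>>,
}

impl SplitTracker {
    fn insert_hash(&self, task: &str, hash: &str) {
        // Only the hash map is write-locked here...
        self.task_hashes
            .write()
            .unwrap()
            .insert(task.to_string(), hash.to_string());
    }

    fn env_vars(&self, task: &str) -> Option<Vec<String>> {
        // ...so this read can proceed concurrently with insert_hash.
        self.task_env_vars.read().unwrap().get(task).cloned()
    }
}
```

A DashMap per field would go one step further by sharding within each map, at the cost of a new dependency in the crate.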

4. Cache compiled glob patterns (crates/turborepo-globwalk/src/lib.rs)

  • Add a thread_local! or DashMap<Vec<String>, CompiledGlobs> cache keyed on raw pattern strings
  • Same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths
  • Expected win: Proportional to workspace count × shared patterns (common in monorepos where all packages use identical output globs)
  • Verify:
    cargo nextest run -p turborepo-globwalk -p turborepo-run-cache
    turbo build --profile=glob-before.trace  # measure globwalk spans
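The memoization can be sketched as a process-wide cache keyed on the raw pattern strings. `compile` here is a hypothetical stand-in for the real wax/regex compilation step, counted so the test can prove it runs once:

```rust
use std::collections::HashMap;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::{Arc, Mutex};

// Counts how many times the expensive step runs (for demonstration only).
static COMPILE_COUNT: AtomicUsize = AtomicUsize::new(0);

#[derive(Debug)]
struct CompiledGlobs {
    patterns: Vec<String>, // stand-in for the compiled matchers
}

fn compile(patterns: &[String]) -> CompiledGlobs {
    COMPILE_COUNT.fetch_add(1, Ordering::SeqCst);
    CompiledGlobs { patterns: patterns.to_vec() }
}

#[derive(Default)]
struct GlobCache {
    inner: Mutex<HashMap<Vec<String>, Arc<CompiledGlobs>>>,
}

impl GlobCache {
    /// Identical pattern sets share one compiled instance via Arc.
    fn get_or_compile(&self, patterns: &[String]) -> Arc<CompiledGlobs> {
        let mut map = self.inner.lock().unwrap();
        map.entry(patterns.to_vec())
            .or_insert_with(|| Arc::new(compile(patterns)))
            .clone()
    }
}
```

In the real crate this would likely be a `OnceLock`-initialized global or a field on an existing long-lived struct, so the cache survives across `globwalk()` calls.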

Tier 2: Medium Impact, Medium Risk

5. Multi-threaded zstd compression for cache archives (crates/turborepo-cache/src/cache_archive/create.rs:115,151)

  • zstd's Encoder supports multithread(N) — currently hardcoded to single-threaded at level 0
  • For large Next.js builds with hundreds of output chunks, this is a significant bottleneck on cache-miss tasks
  • Implementation:
    // Before:
    let mut zw = zstd::Encoder::new(writer, 0)?;
    // After:
    let mut zw = zstd::Encoder::new(writer, 0)?;
    zw.multithread(num_cpus::get() as u32)?;
  • Expected win: 2-4x faster cache archive creation for large outputs (Next.js .next/ dirs with 500+ files)
  • Verify:
    cargo nextest run -p turborepo-cache
    # Benchmark cache put on a real large output:
    turbo build --profile=zstd-before.trace --force  # force cache miss
    # After change, same command, compare cache_archive::create spans
  • Risk: Increased memory usage (zstd allocates per-thread buffers). Test with turbo --heap=heap.pb build using pprof feature.

6. Optimize lockfile transitive closures (crates/turborepo-lockfiles/src/lib.rs:130)

  • Pre-size the DashMap with DashMap::with_capacity(estimated_dep_count) instead of default
  • Memoize shared dependency subgraph walks — if packages A and B both depend on react, the DFS from react downward is walked twice
  • Expected win: Reduces startup time by 10-30% for monorepos with overlapping dependency trees
  • Verify:
    cargo nextest run -p turborepo-lockfiles
    turbo build --profile=lockfile-before.trace --dry=json
    # Compare all_transitive_closures span duration
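The memoized-walk idea can be sketched with standard-library maps over a toy graph (not the real lockfile types): each dependency's sub-closure is computed once and reused by every package that reaches it. The sketch assumes an acyclic graph.

```rust
use std::collections::{HashMap, HashSet};

// Hypothetical memoized DFS: if packages A and B both depend on `react`,
// the walk below `react` happens once and is served from `memo` afterward.
fn closure<'a>(
    node: &'a str,
    deps: &HashMap<&'a str, Vec<&'a str>>,
    memo: &mut HashMap<&'a str, HashSet<&'a str>>,
) -> HashSet<&'a str> {
    if let Some(cached) = memo.get(node) {
        return cached.clone();
    }
    let mut out = HashSet::new();
    for dep in deps.get(node).into_iter().flatten() {
        out.insert(*dep);
        out.extend(closure(dep, deps, memo));
    }
    memo.insert(node, out.clone());
    out
}
```

The real change would share the memo table across the per-workspace walks that `all_transitive_closures` already runs, turning overlapping subgraphs from repeated work into map lookups.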

7. Build a path trie for file-to-package detection (crates/turborepo-repository/src/change_mapper/package.rs:63)

  • Replace O(N×M) linear scan with a radix_trie::Trie (already a dependency — used in hash_watcher.rs)
  • For each changed file, trie lookup is O(path_length) instead of O(num_packages × path_length)
  • Expected win: Significant for --affected and watch mode with hundreds of packages
  • Verify:
    cargo nextest run -p turborepo-repository
    # Integration tests: turborepo-tests/integration/tests/run/
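The complexity win can be sketched even without the trie crate: a map of package roots plus an ancestor walk makes each lookup cost O(depth of the changed path) instead of O(number of packages). The real change would use the existing radix_trie dependency; this is a hypothetical illustration of the lookup shape.

```rust
use std::collections::HashMap;
use std::path::Path;

// Hypothetical lookup: walk up from the changed file toward the repo root
// and stop at the first ancestor that is a known package directory, so the
// deepest (most specific) package wins, as nested packages require.
fn package_for_file<'a>(
    packages_by_root: &'a HashMap<&'a Path, &'a str>,
    changed: &Path,
) -> Option<&'a str> {
    changed
        .ancestors()
        .skip(1) // the file itself is not a package root
        .find_map(|dir| packages_by_root.get(dir).copied())
}
```

A trie gives the same deepest-prefix semantics without hashing each ancestor, but either form removes the per-file linear scan over all packages.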

Tier 3: Targeted Optimizations

8. Reduce per-task EnvironmentVariableMap clones (crates/turborepo-lib/src/task_graph/visitor/mod.rs:389)

  • The dispatch loop clones (String, EnvironmentVariableMap) — a HashMap<String, String> — for every task
  • Wrap in Arc at precomputation time, share reference into execution
  • Expected win: Eliminates O(env_vars × tasks) string clones in the hot dispatch loop
  • Verify: cargo nextest run -p turborepo-lib
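A minimal sketch of the ownership change, with hypothetical names in place of the real visitor types: the map is built once, and each task receives an `Arc` clone (a reference-count bump) instead of a deep `HashMap` clone.

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Illustrative alias; the real type lives in turborepo-env.
type EnvironmentVariableMap = HashMap<String, String>;

fn precompute_env() -> Arc<EnvironmentVariableMap> {
    let mut env = EnvironmentVariableMap::new();
    env.insert("NODE_ENV".to_string(), "production".to_string());
    Arc::new(env)
}

fn dispatch_task(task: &str, env: Arc<EnvironmentVariableMap>) -> usize {
    // The task reads through the shared reference; no per-task deep clone
    // of every key/value String happens here.
    let _ = task;
    env.len()
}
```

The caveat is that any code path that currently mutates its per-task copy would need `Arc::make_mut` or an explicit clone at that one site.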

9. Reduce SCM hash allocations (crates/turborepo-scm/src/hash_object.rs)

  • The hex-encoded hash .to_string() at line 69 allocates a 40-byte String per file
  • Use a fixed-size [u8; 40] newtype that implements Hash + Eq to avoid heap allocation
  • Also: manual.rs does a double directory walk (once for listing, once for hashing) — merge into single walk
  • Expected win: Proportional to total file count across all packages
  • Verify: cargo nextest run -p turborepo-scm
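A sketch of the fixed-size newtype (the name `GitOid` is hypothetical): a git object hash is always 40 hex bytes, so it fits inline in an array, derives `Hash + Eq` for map keys, and needs no heap allocation per file.

```rust
// Hypothetical inline hash type: 40 ASCII hex digits stored by value.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
struct GitOid([u8; 40]);

impl GitOid {
    fn from_hex(s: &str) -> Option<Self> {
        let bytes = s.as_bytes();
        if bytes.len() != 40 || !bytes.iter().all(|b| b.is_ascii_hexdigit()) {
            return None;
        }
        let mut out = [0u8; 40];
        out.copy_from_slice(bytes);
        Some(Self(out))
    }

    fn as_str(&self) -> &str {
        // Safe: construction guarantees ASCII hex digits only.
        std::str::from_utf8(&self.0).unwrap()
    }
}
```

Call sites that previously held a `String` can hold a `GitOid` by value; the display form is recovered with `as_str` only where output is actually needed.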

10. Parallelize engine builder config loading (crates/turborepo-engine/src/builder.rs)

  • The BFS in build() is serial — turbo_json_chain_cache helps but loading itself is sequential
  • Pre-load all turbo.json files in parallel before BFS traversal using rayon::par_iter
  • Expected win: Reduces engine construction time from O(N × disk_latency) to O(disk_latency)
  • Verify: cargo nextest run -p turborepo-engine
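The preload phase can be sketched with scoped threads (the report suggests rayon::par_iter; `std::thread::scope` shows the same shape with no extra dependency). `load_config` is a hypothetical stand-in for reading turbo.json from disk.

```rust
use std::collections::HashMap;
use std::thread;

// Stand-in for the disk read; returns a fake config body.
fn load_config(workspace: &str) -> String {
    format!("{{\"workspace\":\"{workspace}\"}}")
}

/// Load every workspace config concurrently up front, then let the serial
/// BFS consult the in-memory map instead of hitting disk per node.
fn preload_configs(workspaces: &[&str]) -> HashMap<String, String> {
    thread::scope(|s| {
        let handles: Vec<_> = workspaces
            .iter()
            .map(|ws| s.spawn(move || (ws.to_string(), load_config(ws))))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}
```

Because the BFS order stays untouched and only the I/O is hoisted, extension resolution semantics should be unchanged, which is what the "loading order changes" risk note asks reviewers to confirm.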

11. Optimize GlobSet hash/eq allocations (crates/turborepo-filewatch/src/globwatcher.rs:56-65)

  • PartialEq::eq allocates HashSet on every comparison; Hash::hash allocates BTreeSet on every hash
  • Pre-sort and store the include keys as a Vec<String> (sorted) alongside the HashMap
  • Verify: cargo nextest run -p turborepo-filewatch
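A sketch of the pre-sorted key list, with a `()` placeholder where the real struct stores compiled globs: the sorted `Vec` is built once in the constructor, so `eq` and `hash` compare it directly with no per-call allocation.

```rust
use std::collections::HashMap;
use std::hash::{Hash, Hasher};

// Hypothetical slimmed-down GlobSet: only the parts relevant to Eq/Hash.
#[derive(Debug)]
struct GlobSetKeys {
    include: HashMap<String, ()>, // stand-in for HashMap<String, Glob>
    sorted_keys: Vec<String>,     // built once at construction time
}

impl GlobSetKeys {
    fn new(includes: Vec<String>) -> Self {
        let include: HashMap<String, ()> =
            includes.into_iter().map(|k| (k, ())).collect();
        let mut sorted_keys: Vec<String> = include.keys().cloned().collect();
        sorted_keys.sort();
        Self { include, sorted_keys }
    }
}

impl PartialEq for GlobSetKeys {
    fn eq(&self, other: &Self) -> bool {
        // Sorted Vec comparison replaces collecting two fresh HashSets.
        self.sorted_keys == other.sorted_keys
    }
}
impl Eq for GlobSetKeys {}

impl Hash for GlobSetKeys {
    fn hash<H: Hasher>(&self, state: &mut H) {
        // Replaces collecting a fresh BTreeSet on every hash call.
        self.sorted_keys.hash(state);
    }
}
```

The invariant to uphold is that `sorted_keys` is rebuilt whenever `include` changes; in the current code `GlobSet` is constructed once and never mutated, so the constructor is the only site to touch.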

Ripple Effects

| Change | Crates Affected | Risk of Breakage |
| --- | --- | --- |
| Discovery parallel stats | turborepo-repository, turborepo-lib | Low — pure I/O concurrency change |
| Env regex pre-compilation | turborepo-env, turborepo-task-hash | Low — extends existing pattern |
| TaskHashTracker lock split | turborepo-task-hash, turborepo-lib | Medium — concurrent semantics change |
| Glob pattern caching | turborepo-globwalk, turborepo-run-cache, turborepo-scm | Low — transparent memoization |
| Multi-threaded zstd | turborepo-cache | Medium — memory usage increase |
| Lockfile closure optimization | turborepo-lockfiles | Medium — affects all package managers |
| Path trie for change mapper | turborepo-repository | Low — drop-in replacement |
| EnvironmentVariableMap Arc | turborepo-lib | Low — ownership change only |
| SCM hash allocations | turborepo-scm, turborepo-task-hash | Low — internal type change |
| Engine parallel config | turborepo-engine | Medium — loading order changes |
| GlobSet hash/eq | turborepo-filewatch | Low — daemon watch mode only |

Cross-cutting concern: Changes 3, 5, and 6 all affect concurrent data access patterns. They should be landed separately with thorough integration test runs between each to isolate any concurrency bugs.


What This Unlocks

Immediate measurable wins:

  • Cold start (no cache): Changes 1, 2, 6, 10 target the serial startup path. Combined, they could reduce turbo run initialization from seconds to sub-second in large monorepos.
  • Cache-miss builds: Change 5 (multi-threaded zstd) directly reduces wall time for every task that writes to cache. For a Next.js app producing 500+ output chunks, this could cut cache write time by 2-4x.
  • Watch mode / daemon: Changes 7 and 11 specifically improve the hot paths in turbo watch and the daemon, making file-change-to-rebuild latency proportional to changed files rather than total packages.

Foundation for future work:

  • Adding criterion benchmarks (currently absent) alongside these changes would create a regression-detection framework
  • The --profile Chrome trace infrastructure is already excellent — these changes would show up as measurable span reductions in the existing tracing

Verification workflow for all changes:

# 1. Build release binary
cargo build --profile release-turborepo

# 2. Run full test suite
cargo nextest run --workspace

# 3. Profile before (on your target monorepo)
./target/release-turborepo/turbo build --profile=before.trace --force
./target/release-turborepo/turbo build --profile=before-cached.trace

# 4. Apply change, rebuild, re-profile
cargo build --profile release-turborepo
./target/release-turborepo/turbo build --profile=after.trace --force
./target/release-turborepo/turbo build --profile=after-cached.trace

# 5. Compare trace summaries
diff before.trace.md after.trace.md

# 6. Statistical benchmarking
hyperfine --warmup 3 \
  './before-turbo build --force 2>/dev/null' \
  './after-turbo build --force 2>/dev/null'

# 7. Integration tests
cargo build  # debug for integration tests
turbo run test --filter=turborepo-tests-integration -- "tests/run"

# 8. Memory profiling (for zstd multi-thread change)
cargo build --features pprof
./target/debug/turbo --heap=before.pb build --force
# After change:
./target/debug/turbo --heap=after.pb build --force

Effort & Risk

| # | Change | Effort | Risk | Impact | Priority |
| --- | --- | --- | --- | --- | --- |
| 1 | Parallel discovery stats | ~1 hour | Very Low | Medium | P0 |
| 2 | Pre-compile env wildcards | ~2 hours | Low | Medium | P0 |
| 3 | Split TaskHashTracker locks | ~4 hours | Medium | High | P1 |
| 4 | Cache compiled globs | ~3 hours | Low | Medium-High | P1 |
| 5 | Multi-threaded zstd | ~1 hour | Medium | High (cache-miss) | P1 |
| 6 | Optimize lockfile closures | ~4 hours | Medium | Medium-High | P1 |
| 7 | Path trie for change mapper | ~3 hours | Low | Medium (watch mode) | P2 |
| 8 | Arc EnvironmentVariableMap | ~1 hour | Very Low | Low-Medium | P2 |
| 9 | SCM hash alloc reduction | ~3 hours | Low | Medium | P2 |
| 10 | Parallel engine config | ~4 hours | Medium | Medium | P2 |
| 11 | GlobSet hash/eq optimization | ~1 hour | Very Low | Low (daemon only) | P3 |

Recommended implementation order: 1 → 2 → 8 → 4 → 5 → 3 → 6 → 9 → 7 → 10 → 11 (roughly ascending risk, with the easiest-to-verify changes first)

Key assumptions:

  • Impact estimates assume a monorepo with 50-200+ packages (typical Vercel customer scale)
  • "Hours" assumes familiarity with the codebase; first-time contributors should double estimates
  • The absence of criterion benchmarks means all verification relies on --profile traces and hyperfine — adding a benchmark suite should be a prerequisite for the medium-risk changes (3, 5, 6, 10)
  • The existing CachingPackageDiscovery (change 1) means the parallel stat improvement only matters on cold starts / non-daemon runs, but that's the most common developer-facing path

Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/.rs, crates/turborepo-lockfiles/src/pnpm/.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. Key findings: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs Maturity: stable Relevance: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. Key findings: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.
  • task-graph-visitor-allocations Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs, crates/turborepo-lib/src/task_graph/visitor/exec.rs Maturity: stable Relevance: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run. Key findings: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run.
  • change-mapper-linear-package-detection Files: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs Maturity: stable Relevance: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages Key findings: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages
  • discovery-sequential-turbo-json-stat Files: crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run. Key findings: I confirmed the key finding by reading both files. The strongest candidate is in discovery.rs: At line 157, LocalPackageDiscovery::discover_packages uses tokio_stream::iter(...).then(...) which processes each tokio::fs::try_exists() call sequentially.then() awaits each future before starting the next. For independent filesystem stat calls across hundreds of packages, this serializes I/O that should be concurrent. The file even has the tokio async machinery in place — it just needs .map() + .buffer_unordered(N) instead of .then().
  • globwatcher-globset-hash-eq-allocs Files: crates/turborepo-filewatch/src/globwatcher.rs, crates/turborepo-filewatch/src/hash_watcher.rs Maturity: stable Relevance: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos. Key findings: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos.

Exploration

AREA: globwatcher-globset-hash-eq-allocs FILES: crates/turborepo-filewatch/src/globwatcher.rs, crates/turborepo-filewatch/src/hash_watcher.rs MATURITY: stable RELEVANCE: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos. NOTES: (unspecified)

Raw Exploration Notes: AREA: globwatcher-globset-hash-eq-allocs FILES: crates/turborepo-filewatch/src/globwatcher.rs, crates/turborepo-filewatch/src/hash_watcher.rs MATURITY: stable RELEVANCE: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos.

Selected Code Context

===== FILE: crates/turborepo-filewatch/src/globwatcher.rs ===== use std::{ collections::{BTreeSet, HashMap, HashSet}, fmt::Display, future::IntoFuture, str::FromStr, time::Duration, };

use notify::Event; use thiserror::Error; use tokio::sync::{broadcast, mpsc, oneshot}; use tracing::{debug, warn}; use turbopath::{AbsoluteSystemPathBuf, RelativeUnixPath}; use wax::{Any, Glob, Program};

use crate::{ NotifyError, OptionalWatch, cookies::{CookieError, CookieWatcher, CookieWriter, CookiedRequest}, };

type Hash = String;

#[derive(Clone)] pub struct GlobSet { include: HashMap<String, wax::Glob<'static>>, exclude: Any<'static>, // Note that these globs do not include the leading '!' character exclude_raw: BTreeSet<String>, }

impl GlobSet { pub fn as_inputs(&self) -> Vec<String> { let mut inputs: Vec<String> = self.include.keys().cloned().collect(); inputs.extend(self.exclude_raw.iter().map(|s| format!("!{s}"))); inputs }

pub fn matches(&self, input: &RelativeUnixPath) -> bool {
    self.include.values().any(|glob| glob.is_match(input)) && !self.exclude.is_match(input)
}

}

impl std::fmt::Debug for GlobSet { fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result { f.debug_struct("GlobSet") .field("include", &self.include.keys()) .field("exclude", &self.exclude_raw) .finish() } }

impl PartialEq for GlobSet { fn eq(&self, other: &Self) -> bool { self.include.keys().collect::<HashSet<_>>() == other.include.keys().collect::<HashSet<_>>() && self.exclude_raw == other.exclude_raw } }

impl Eq for GlobSet {}

impl std::hash::Hash for GlobSet { fn hash<H: std::hash::Hasher>(&self, state: &mut H) { self.include.keys().collect::<BTreeSet<_>>().hash(state); self.exclude_raw.hash(state); } }

#[derive(Debug, Error)] pub struct GlobError { // Boxed to minimize error size underlying: Box<wax::BuildError>, raw_glob: String, }

impl Display for GlobError { fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result { write!(f, "{}: {}", self.underlying, self.raw_glob) } }

fn compile_glob(raw: &str) -> Result<Glob<'static>, GlobError> { Glob::from_str(raw) .map(|g| g.to_owned()) .map_err(|e| GlobError { underlying: Box::new(e), raw_glob: raw.to_owned(), }) }

impl GlobSet { pub fn from_raw( raw_includes: Vec<String>, raw_excludes: Vec<String>, ) -> Result<Self, GlobError> { let include = raw_includes .iter() .cloned() .map(|raw_glob| { let glob = compile_glob(&raw_glob)?; Ok((raw_glob, glob)) }) .collect::<Result<HashMap<_, _>, GlobError>>()?; let excludes = raw_excludes .clone() .iter() .map(|raw_glob| { let glob = compile_glob(raw_glob)?; Ok(glob) }) .collect::<Result<Vec<_>, GlobError>>()?; let exclude = wax::any(excludes) .map_err(|e| GlobError { underlying: Box::new(e), raw_glob: format!("{{{}}}", raw_excludes.join(",")), })? .to_owned(); Ok(Self { include, exclude, exclude_raw: BTreeSet::from_iter(raw_excludes), }) }

// delegates to from_raw, but filters the globs into inclusions and exclusions
// first
pub fn from_raw_unfiltered(raw: Vec<String>) -> Result<Self, GlobError> {
    let (includes, excludes): (Vec<_>, Vec<_>) = {
        let mut includes = vec![];
        let mut excludes = vec![];
        for pattern in raw {
            if let Some(exclude) = pattern.strip_prefix('!') {
                excludes.push(exclude.to_string());
            } else {
                includes.push(pattern);
            }
        }
        (includes, excludes)
    };
    Self::from_raw(includes, excludes)
}

pub fn is_package_local(&self) -> bool {
    self.include
        .keys()
        .all(|raw_glob| !raw_glob.starts_with("../"))
        && self
            .exclude_raw
            .iter()
            .all(|raw_glob| !raw_glob.starts_with("../"))
}

}

#[derive(Debug, Error)] pub enum Error { #[error(transparent)] CookieError(#[from] CookieError), #[error("Failed to send query to glob watcher: {0}")] SendError(#[from] mpsc::error::SendError<CookiedRequest<Query>>), #[error("Glob watcher has closed.")] Closed, #[error("Glob watcher request timed out.")] Timeout(#[from] tokio::time::error::Elapsed), #[error("Glob watching is unavailable.")] Unavailable, }

impl From<mpsc::error::SendError> for Error { fn from(_: mpsc::error::SendError) -> Self { Error::Closed } }

impl From<oneshot::error::RecvError> for Error { fn from(_: oneshot::error::RecvError) -> Self { Error::Closed } }

pub struct GlobWatcher { cookie_writer: CookieWriter, // _exit_ch exists to trigger a close on the receiver when an instance // of this struct is dropped. The task that is receiving events will exit, // dropping the other sender for the broadcast channel, causing all receivers // to be notified of a close. _exit_ch: oneshot::Sender<()>, query_ch_lazy: OptionalWatch<mpsc::Sender<CookiedRequest>>, }

#[derive(Debug)] pub enum Query { WatchGlobs { hash: Hash, glob_set: GlobSet, resp: oneshot::Sender<Result<(), Error>>, }, GetChangedGlobs { hash: Hash, candidates: HashSet<String>, resp: oneshot::Sender<Result<HashSet<String>, Error>>, }, }

struct GlobTracker { root: AbsoluteSystemPathBuf,

/// maintains the list of <GlobSet> to watch for a given hash
hash_globs: HashMap<Hash, GlobSet>,

/// maps a string glob to the compiled glob and the hashes for which this
/// glob hasn't changed
glob_statuses: HashMap<String, (Glob<'static>, HashSet<Hash>)>,

exit_signal: oneshot::Receiver<()>,

recv: broadcast::Receiver<Result<Event, NotifyError>>,

query_recv: mpsc::Receiver<CookiedRequest<Query>>,

cookie_watcher: CookieWatcher<Query>,

}

impl GlobWatcher { pub fn new( root: AbsoluteSystemPathBuf, cookie_writer: CookieWriter, mut recv: OptionalWatch<broadcast::Receiver<Result<Event, NotifyError>>>, ) -> Self { let (exit_ch, exit_signal) = tokio::sync::oneshot::channel(); let (query_ch_tx, query_ch_lazy) = OptionalWatch::new(); let cookie_root = cookie_writer.root().to_owned(); tokio::task::spawn(async move { let Ok(recv) = recv.get().await.map(|r| r.resubscribe()) else { // if this fails, it means that the filewatcher is not available // so starting the glob tracker is pointless return; };

        // if the receiver is closed, it means the glob watcher is closed and we
        // probably don't want to start the glob tracker
        let (query_ch, query_recv) = mpsc::channel(128);
        if query_ch_tx.send(Some(query_ch)).is_err() {
            tracing::debug!("no queryers for glob watcher, exiting");
            return;
        }

        GlobTracker::new(root, cookie_root, exit_signal, recv, query_recv)
            .watch()
            .await
    });
    Self {
        cookie_writer,
        _exit_ch: exit_ch,
        query_ch_lazy,
    }
}

/// Watch a set of globs for a given hash.
///
/// This function will return `Error::Unavailable` if the globwatcher is not
/// yet available.
pub async fn watch_globs(
    &self,
    hash: Hash,
    globs: GlobSet,
    timeout: Duration,
) -> Result<(), Error> {
    let (tx, rx) = oneshot::channel();
    let req = Query::WatchGlobs {
        hash,
        glob_set: globs,
        resp: tx,
    };
    self.send_request(req).await?;
    tokio::time::timeout(timeout, rx).await??
}

/// Get the globs that have changed for a given hash.
///
/// This function will return `Error::Unavailable` if the globwatcher is not
/// yet available.
pub async fn get_changed_globs(
    &self,
    hash: Hash,
    candidates: HashSet<String>,
    timeout: Duration,
) -> Result<HashSet<String>, Error> {
    let (tx, rx) = oneshot::channel();
    let req = Query::GetChangedGlobs {
        hash,
        candidates,
        resp: tx,
    };

    self.send_request(req).await?;
    tokio::time::timeout(timeout, rx).await??
}

async fn send_request(&self, req: Query) -> Result<(), Error> {
    let cookied_request = self.cookie_writer.cookie_request(req).await?;
    let mut query_ch = self.query_ch_lazy.clone();
    let query_ch = query_ch
        .get_immediate()
        .ok_or(Error::Unavailable)?
        .map(|ch| ch.clone())
        .map_err(|_| Error::Unavailable)?;

    query_ch.send(cookied_request).await?;
    Ok(())
}

}

#[derive(Debug, Error)] enum WatchError { #[error(transparent)] Recv(#[from] broadcast::error::RecvError), #[error(transparent)] Notify(#[from] NotifyError), }

impl GlobTracker { fn new( root: AbsoluteSystemPathBuf, cookie_root: AbsoluteSystemPathBuf, exit_signal: oneshot::Receiver<()>, recv: broadcast::Receiver<Result<Event, NotifyError>>, query_recv: mpsc::Receiver<CookiedRequest<Query>>, ) -> Self { Self { root, hash_globs: HashMap::new(), glob_statuses: HashMap::new(), exit_signal, recv, query_recv, cookie_watcher: CookieWatcher::new(cookie_root), } }

fn handle_cookied_query(&mut self, cookied_query: CookiedRequest<Query>) {
    if let Some(request) = self.cookie_watcher.check_request(cookied_query) {
        self.handle_query(request);
    }
}

fn handle_query(&mut self, query: Query) {
    match query {
        Query::WatchGlobs {
            hash,
            glob_set,
            resp,
        } => {
            debug!("watching globs {:?} for hash {}", glob_set, hash);
            // Assume cookie handling has happened external to this component.
            // Other tasks _could_ write to the
            // same output directories, however we are relying on task
            // execution dependencies to prevent that.
            for (glob_str, glob) in glob_set.include.iter() {
                let glob_str = glob_str.to_owned();
                let (_, hashes) = self
                    .glob_statuses
                    .entry(glob_str)
                    .or_insert_with(|| (glob.clone(), HashSet::new()));
                hashes.insert(hash.clone());
            }
            self.hash_globs.insert(hash.clone(), glob_set);
            let _ = resp.send(Ok(()));
        }
        Query::GetChangedGlobs {
            hash,
            mut candidates,
            resp,
        } => {
            // Assume cookie handling has happened external to this component.
            // Build a set of candidate globs that *may* have changed.
            // An empty set translates to all globs have not changed.
            if let Some(unchanged_globs) = self.hash_globs.get(&hash) {
                candidates.retain(|glob_str| {
                    // We are keeping the globs from candidates that
                    // we don't have a record of as unchanged.
                    // If we do have a record, drop it from candidates.
                    !unchanged_globs.include.contains_key(glob_str)
                });
            }
            // If the client has gone away, we don't care about the error
            let _ = resp.send(Ok(candidates));
        }
    }
}

fn handle_file_event(
    &mut self,
    file_event: Result<Result<Event, NotifyError>, broadcast::error::RecvError>,
) {
    match file_event {
        Err(broadcast::error::RecvError::Closed) => (),
        Err(e @ broadcast::error::RecvError::Lagged(_)) => self.on_error(e.into()),
        Ok(Err(error)) => self.on_error(error.into()),
        Ok(Ok(file_event)) => {
            for path in file_event.paths {
                let path = AbsoluteSystemPathBuf::try_from(path)
                    .expect("filewatching should produce absolute paths");
                if let Some(queries) = self
                    .cookie_watcher
                    .pop_ready_requests(file_event.kind, &path)
                {
                    for query in queries {
                        self.handle_query(query);
                    }
                    return;
                }
                let Ok(to_match) = self.root.anchor(path) else {
                    // irrelevant filesystem update
                    return;
                };
                self.handle_path_change(&to_match.to_unix());
            }
        }
    }
}

async fn watch(mut self) {
    loop {
        tokio::select! {
            _ = &mut self.exit_signal => return,
            Some(query) = self.query_recv.recv().into_future() => self.handle_cookied_query(query),
            file_event = self.recv.recv().into_future() => self.handle_file_event(file_event)
        }
    }
}

/// on_error takes the conservative approach of considering everything
/// changed in the event of any error related to filewatching
fn on_error(&mut self, err: WatchError) {
    warn!(
        "encountered filewatching error, flushing all globs: {}",
        err
    );
    self.hash_globs.clear();
    self.glob_statuses.clear();
}

fn handle_path_change(&mut self, path: &RelativeUnixPath) {
    self.glob_statuses
        .retain(|glob_str, (glob, hashes_for_glob)| {
            // If this is not a match, we aren't modifying this glob, bail early and mark
            // for retention.
            if !glob.is_match(path) {
                return true;
            }
            // We have a match. Check which hashes need invalidation.
            hashes_for_glob.retain(|hash| {
                let Some(glob_set) = self.hash_globs.get_mut(hash) else {
                    // This shouldn't ever happen, but if we aren't tracking this hash at
                    // all, we don't need to keep it in the set of hashes that are relevant
                    // for this glob.
                    debug_assert!(
                        false,
                        "A glob is referencing a hash that we are not tracking. This is most \
                         likely an internal bookkeeping error in globwatcher.rs"
                    );
                    return false;
                };
                // If we match an exclusion, don't invalidate this hash
                if glob_set.exclude.is_match(path) {
                    return true;
                }
                // We didn't match an exclusion, we can remove this glob
                debug!("file change at {} invalidated glob {}", path, glob_str);
                glob_set.include.remove(glob_str);

                // We removed the last include, we can stop tracking this hash
                if glob_set.include.is_empty() {
                    self.hash_globs.remove(hash);
                }

                false
            });
            !hashes_for_glob.is_empty()
        });
}

}

#[cfg(test)]
mod test {

use std::{
    collections::{HashMap, HashSet},
    str::FromStr,
    time::Duration,
};

use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf};
use wax::{Glob, any};

use crate::{
    FileSystemWatcher,
    cookies::CookieWriter,
    globwatcher::{GlobSet, GlobWatcher},
};

fn temp_dir() -> (AbsoluteSystemPathBuf, tempfile::TempDir) {
    let tmp = tempfile::tempdir().unwrap();
    let path = AbsoluteSystemPathBuf::try_from(tmp.path())
        .unwrap()
        .to_realpath()
        .unwrap();
    (path, tmp)
}

fn setup(repo_root: &AbsoluteSystemPath) {
    // Directory layout:
    // <repo_root>/
    //   .git/
    //   my-pkg/
    //     irrelevant
    //     dist/
    //       dist-file
    //       distChild/
    //         child-file
    //     .next/
    //       next-file
    //       cache/
    repo_root.join_component(".git").create_dir_all().unwrap();
    let pkg_path = repo_root.join_component("my-pkg");
    pkg_path.create_dir_all().unwrap();
    pkg_path
        .join_component("irrelevant")
        .create_with_contents("")
        .unwrap();
    let dist_path = pkg_path.join_component("dist");
    dist_path.create_dir_all().unwrap();
    let dist_child_path = dist_path.join_component("distChild");
    dist_child_path.create_dir_all().unwrap();
    dist_child_path
        .join_component("child-file")
        .create_with_contents("")
        .unwrap();
    dist_path
        .join_component("dist-file")
        .create_with_contents("")
        .unwrap();
    let next_path = pkg_path.join_component(".next");
    next_path.create_dir_all().unwrap();
    next_path
        .join_component("next-file")
        .create_with_contents("")
        .unwrap();
    next_path.join_component("cache").create_dir_all().unwrap();
}

fn make_includes(raw: &[&str]) -> HashMap<String, Glob<'static>> {
    raw.iter()
        .map(|raw_glob| {
            (
                raw_glob.to_string(),
                Glob::from_str(raw_glob).unwrap().to_owned(),
            )
        })
        .collect()
}

#[tokio::test]
async fn test_track_outputs() {
    let timeout = Duration::from_secs(2);
    let (repo_root, _tmp_dir) = temp_dir();
    setup(&repo_root);
    let cookie_dir = repo_root.join_component(".git");

    let watcher = FileSystemWatcher::new_with_default_cookie_dir(&repo_root).unwrap();
    let recv = watcher.watch();
    let cookie_writer = CookieWriter::new(&cookie_dir, Duration::from_secs(2), recv.clone());
    let glob_watcher = GlobWatcher::new(repo_root.clone(), cookie_writer, recv);

    let raw_includes = &["my-pkg/dist/**", "my-pkg/.next/**"];
    let raw_excludes = ["my-pkg/.next/cache/**"];
    let exclude = wax::any(raw_excludes).unwrap().to_owned();
    let globs = GlobSet {
        include: make_includes(raw_includes),
        exclude,
        exclude_raw: raw_excludes.iter().map(|s| s.to_string()).collect(),
    };

    let hash = "the-hash".to_string();

    glob_watcher
        .watch_globs(hash.clone(), globs, timeout)
        .await
        .unwrap();

    let candidates = HashSet::from_iter(raw_includes.iter().map(|s| s.to_string()));
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    // Make an irrelevant change
    repo_root
        .join_components(&["my-pkg", "irrelevant"])
        .create_with_contents("some bytes")
        .unwrap();
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    // Make an excluded change
    repo_root
        .join_components(&["my-pkg", ".next", "cache", "foo"])
        .create_with_contents("some bytes")
        .unwrap();
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    // Make a relevant change
    repo_root
        .join_components(&["my-pkg", "dist", "foo"])
        .create_with_contents("some bytes")
        .unwrap();
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    let expected = HashSet::from_iter(["my-pkg/dist/**".to_string()]);
    assert_eq!(results, expected);

    // Change a file matching the other glob
    repo_root
        .join_components(&["my-pkg", ".next", "foo"])
        .create_with_contents("some bytes")
        .unwrap();
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    let expected =
        HashSet::from_iter(["my-pkg/dist/**".to_string(), "my-pkg/.next/**".to_string()]);
    assert_eq!(results, expected);
}

#[tokio::test]
async fn test_track_multiple_hashes() {
    let timeout = Duration::from_secs(2);
    let (repo_root, _tmp_dir) = temp_dir();
    setup(&repo_root);
    let cookie_dir = repo_root.join_component(".git");

    let watcher = FileSystemWatcher::new_with_default_cookie_dir(&repo_root).unwrap();
    let recv = watcher.watch();
    let cookie_writer = CookieWriter::new(&cookie_dir, Duration::from_secs(2), recv.clone());

    let glob_watcher = GlobWatcher::new(repo_root.clone(), cookie_writer, recv);

    let raw_includes = &["my-pkg/dist/**", "my-pkg/.next/**"];
    let raw_excludes: [&str; 0] = [];
    let globs = GlobSet {
        include: make_includes(raw_includes),
        exclude: any(raw_excludes).unwrap(),
        exclude_raw: raw_excludes.iter().map(|s| s.to_string()).collect(),
    };

    let hash = "the-hash".to_string();

    glob_watcher
        .watch_globs(hash.clone(), globs, timeout)
        .await
        .unwrap();

    let candidates = HashSet::from_iter(raw_includes.iter().map(|s| s.to_string()));
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    let second_raw_includes = &["my-pkg/.next/**"];
    let second_raw_excludes = ["my-pkg/.next/cache/**"];
    let second_globs = GlobSet {
        include: make_includes(second_raw_includes),
        exclude: any(second_raw_excludes).unwrap(),
        exclude_raw: second_raw_excludes.iter().map(|s| s.to_string()).collect(),
    };
    let second_hash = "the-second-hash".to_string();
    glob_watcher
        .watch_globs(second_hash.clone(), second_globs, timeout)
        .await
        .unwrap();

    let second_candidates =
        HashSet::from_iter(second_raw_includes.iter().map(|s| s.to_string()));
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    let results = glob_watcher
        .get_changed_globs(second_hash.clone(), second_candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    // Make a change that is excluded in one of the hashes but not in the other
    repo_root
        .join_components(&["my-pkg", ".next", "cache", "foo"])
        .create_with_contents("hello")
        .unwrap();
    // expect one changed glob for the first hash
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    let expected = HashSet::from_iter(["my-pkg/.next/**".to_string()]);
    assert_eq!(results, expected);

    // The second hash which excludes the change should still not have any changed
    // globs
    let results = glob_watcher
        .get_changed_globs(second_hash.clone(), second_candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    // Make a change for second_hash
    repo_root
        .join_components(&["my-pkg", ".next", "bar"])
        .create_with_contents("hello")
        .unwrap();
    let results = glob_watcher
        .get_changed_globs(second_hash.clone(), second_candidates.clone(), timeout)
        .await
        .unwrap();
    assert_eq!(results, second_candidates);
}

#[tokio::test]
async fn test_watch_single_file() {
    let timeout = Duration::from_secs(2);
    let (repo_root, _tmp_dir) = temp_dir();
    setup(&repo_root);
    let cookie_dir = repo_root.join_component(".git");

    let watcher = FileSystemWatcher::new_with_default_cookie_dir(&repo_root).unwrap();
    let recv = watcher.watch();
    let cookie_writer = CookieWriter::new(&cookie_dir, Duration::from_secs(2), recv.clone());

    let glob_watcher = GlobWatcher::new(repo_root.clone(), cookie_writer, recv);

    // On windows, we expect different sanitization before the
    // globs are passed in, due to alternative data streams in files.
    #[cfg(windows)]
    let raw_includes = &["my-pkg/.next/next-file"];
    #[cfg(not(windows))]
    let raw_includes = &["my-pkg/.next/next-file\\:build"];
    let raw_excludes: [&str; 0] = [];
    let globs = GlobSet {
        include: make_includes(raw_includes),
        exclude: any(raw_excludes).unwrap(),
        exclude_raw: raw_excludes.iter().map(|s| s.to_string()).collect(),
    };

    let hash = "the-hash".to_string();

    glob_watcher
        .watch_globs(hash.clone(), globs, timeout)
        .await
        .unwrap();

    // A change to an irrelevant file
    repo_root
        .join_components(&["my-pkg", ".next", "foo"])
        .create_with_contents("hello")
        .unwrap();

    let candidates = HashSet::from_iter(raw_includes.iter().map(|s| s.to_string()));
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert!(results.is_empty());

    // Change the watched file
    let watched_file = repo_root.join_components(&["my-pkg", ".next", "next-file:build"]);
    watched_file.create_with_contents("hello").unwrap();
    let results = glob_watcher
        .get_changed_globs(hash.clone(), candidates.clone(), timeout)
        .await
        .unwrap();
    assert_eq!(results, candidates);
}

}

===== END FILE: crates/turborepo-filewatch/src/globwatcher.rs =====
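The invalidation bookkeeping in `handle_path_change` above (a forward `hash_globs` map plus a reverse `glob_statuses` index) can be sketched with the standard library alone. This is a simplified model, not the real implementation: `Tracker`, `glob_matches`, and the prefix-based matching are stand-ins for the actual `GlobTracker` and its `wax` glob matchers, and exclusion globs are omitted.

```rust
use std::collections::{HashMap, HashSet};

// Simplified model of GlobTracker's two maps. `hash_globs` maps a task hash
// to the globs still considered unchanged; `glob_statuses` is the reverse
// index consulted on file events.
struct Tracker {
    hash_globs: HashMap<String, HashSet<String>>,
    glob_statuses: HashMap<String, HashSet<String>>,
}

// Stand-in for glob matching: `prefix/**` matches anything under prefix.
fn glob_matches(glob: &str, path: &str) -> bool {
    match glob.strip_suffix("**") {
        Some(prefix) => path.starts_with(prefix),
        None => glob == path,
    }
}

impl Tracker {
    fn new() -> Self {
        Tracker {
            hash_globs: HashMap::new(),
            glob_statuses: HashMap::new(),
        }
    }

    // WatchGlobs: record the globs for a hash in both directions.
    fn watch(&mut self, hash: &str, globs: &[&str]) {
        for glob in globs {
            self.glob_statuses
                .entry(glob.to_string())
                .or_default()
                .insert(hash.to_string());
            self.hash_globs
                .entry(hash.to_string())
                .or_default()
                .insert(glob.to_string());
        }
    }

    // File event: every (glob, hash) pair whose glob matches the changed path
    // is dropped; a hash left with no globs stops being tracked entirely.
    fn handle_path_change(&mut self, path: &str) {
        let hash_globs = &mut self.hash_globs;
        self.glob_statuses.retain(|glob, hashes| {
            if !glob_matches(glob, path) {
                return true;
            }
            hashes.retain(|hash| {
                let now_empty = match hash_globs.get_mut(hash) {
                    Some(globs) => {
                        globs.remove(glob);
                        globs.is_empty()
                    }
                    None => false,
                };
                if now_empty {
                    hash_globs.remove(hash);
                }
                false
            });
            !hashes.is_empty()
        });
    }

    // GetChangedGlobs: any candidate no longer recorded as unchanged has
    // (potentially) changed.
    fn changed_globs(&self, hash: &str, candidates: &[&str]) -> HashSet<String> {
        let unchanged = self.hash_globs.get(hash);
        candidates
            .iter()
            .filter(|g| unchanged.map_or(true, |set| !set.contains(**g)))
            .map(|g| g.to_string())
            .collect()
    }
}

fn main() {
    let mut t = Tracker::new();
    t.watch("hash-a", &["my-pkg/dist/**", "my-pkg/.next/**"]);
    assert!(t.changed_globs("hash-a", &["my-pkg/dist/**"]).is_empty());
    t.handle_path_change("my-pkg/dist/foo");
    let changed = t.changed_globs("hash-a", &["my-pkg/dist/**", "my-pkg/.next/**"]);
    assert!(changed.contains("my-pkg/dist/**"));
    assert!(!changed.contains("my-pkg/.next/**"));
    println!("ok");
}
```

The two-map shape is what makes the hot path cheap: a file event only touches globs that match, and a hash is evicted as soon as its last unchanged include is invalidated.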

===== FILE: crates/turborepo-filewatch/src/hash_watcher.rs =====

use std::{
    collections::{HashMap, HashSet},
    sync::{
        Arc,
        atomic::{AtomicUsize, Ordering},
    },
    time::Duration,
};

use notify::Event;
use radix_trie::{Trie, TrieCommon};
use thiserror::Error;
use tokio::{
    select,
    sync::{broadcast, mpsc, oneshot, watch},
};
use tracing::{debug, trace};
use turbopath::{AbsoluteSystemPathBuf, AnchoredSystemPath, AnchoredSystemPathBuf};
use turborepo_repository::discovery::DiscoveryResponse;
use turborepo_scm::{Error as SCMError, GitHashes, SCM};

use crate::{
    NotifyError, OptionalWatch,
    debouncer::Debouncer,
    globwatcher::{GlobError, GlobSet},
    package_watcher::DiscoveryData,
    scm_resource::SCMResource,
};

pub struct HashWatcher {
    _exit_tx: oneshot::Sender<()>,
    _handle: tokio::task::JoinHandle<()>,
    query_tx: mpsc::Sender<Query>,
}

#[derive(Debug, Clone, PartialEq, Eq, Hash)]
pub enum InputGlobs {
    Default,
    DefaultWithExtras(GlobSet),
    Specific(GlobSet),
}

impl InputGlobs {

pub fn from_raw(raw: Vec<String>, include_default: bool) -> Result<Self, GlobError> {
    if raw.is_empty() {
        return Ok(Self::Default);
    }
    let glob_set = GlobSet::from_raw_unfiltered(raw)?;
    if include_default {
        Ok(Self::DefaultWithExtras(glob_set))
    } else {
        Ok(Self::Specific(glob_set))
    }
}

fn is_package_local(&self) -> bool {
    match self {
        InputGlobs::Default => true,
        InputGlobs::DefaultWithExtras(glob_set) => glob_set.is_package_local(),
        InputGlobs::Specific(glob_set) => glob_set.is_package_local(),
    }
}

fn as_inputs(&self) -> Vec<String> {
    match self {
        InputGlobs::Default => Vec::new(),
        InputGlobs::DefaultWithExtras(glob_set) | InputGlobs::Specific(glob_set) => {
            glob_set.as_inputs()
        }
    }
}

pub fn include_default_files(&self) -> bool {
    match self {
        InputGlobs::Default | InputGlobs::DefaultWithExtras(..) => true,
        InputGlobs::Specific(..) => false,
    }
}

}

#[derive(Clone, Debug, PartialEq, Eq, Hash)]
pub struct HashSpec {
    pub package_path: AnchoredSystemPathBuf,
    pub inputs: InputGlobs,
}

impl HashSpec {
    fn is_package_local(&self) -> bool {
        self.inputs.is_package_local()
    }
}

#[derive(Error, Debug)]
pub enum Error {
    #[error("Package hashing encountered an error: {0}")]
    HashingError(String),
    #[error("File hashing is not available: {0}")]
    Unavailable(String),
    #[error("Package not found: {} {:?}", .0.package_path, .0.inputs)]
    UnknownPackage(HashSpec),
    #[error("Unsupported: glob traverses out of the package")]
    UnsupportedGlob,
}

// Communication errors that all funnel to Unavailable

impl From<watch::error::RecvError> for Error {
    fn from(e: watch::error::RecvError) -> Self {
        Self::Unavailable(e.to_string())
    }
}

impl From<oneshot::error::RecvError> for Error {
    fn from(e: oneshot::error::RecvError) -> Self {
        Self::Unavailable(e.to_string())
    }
}

impl From<mpsc::error::SendError<Query>> for Error {
    fn from(e: mpsc::error::SendError<Query>) -> Self {
        Self::Unavailable(e.to_string())
    }
}

impl HashWatcher {

pub fn new(
    repo_root: AbsoluteSystemPathBuf,
    package_discovery: watch::Receiver<Option<DiscoveryData>>,
    file_events: OptionalWatch<broadcast::Receiver<Result<Event, NotifyError>>>,
    scm: SCM,
) -> Self {
    let (exit_tx, exit_rx) = oneshot::channel();
    let (query_tx, query_rx) = mpsc::channel(16);
    let subscriber = Subscriber::new(repo_root, package_discovery, scm, query_rx);
    let handle = tokio::spawn(subscriber.watch(exit_rx, file_events));
    Self {
        _exit_tx: exit_tx,
        _handle: handle,
        query_tx,
    }
}

// Note that this does not wait for any sort of ready signal. The watching
// process won't respond until filewatching is ready, but there is no
// guarantee that package data or file hashing will be done before
// responding. Both package discovery and file hashing can fail depending on the
// state of the filesystem, so clients will need to be robust to receiving
// errors.
pub async fn get_file_hashes(&self, hash_spec: HashSpec) -> Result<GitHashes, Error> {
    let (tx, rx) = oneshot::channel();
    self.query_tx.send(Query::GetHash(hash_spec, tx)).await?;
    rx.await?
}

}

struct Subscriber {
    repo_root: AbsoluteSystemPathBuf,
    package_discovery: watch::Receiver<Option<DiscoveryData>>,
    query_rx: mpsc::Receiver<Query>,
    scm: SCMResource,
    next_version: AtomicUsize,
}

#[derive(Debug)]
enum Query {
    GetHash(HashSpec, oneshot::Sender<Result<GitHashes, Error>>),
}

// Version is a type that exists to stamp an asynchronous hash computation
// with a version so that we can ignore completion of outdated hash
// computations.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
struct Version(usize);

enum HashState {
    Hashes(GitHashes),
    Pending(
        Version,
        Arc<Debouncer>,
        Vec<oneshot::Sender<Result<GitHashes, Error>>>,
    ),
    Unavailable(String),
}

// We use a radix_trie to store hashes so that we can quickly match a file path
// to a package without having to iterate over the set of all packages. We
// expect file changes to be the highest volume of events that this service
// handles, so we want to ensure we're efficient in deciding if a given change
// is relevant or not.
//
// Our Trie keys off of a String because of the orphan rule. Keys are required
// to be TrieKey, but this crate doesn't own TrieKey or AnchoredSystemPathBuf.
// We could implement TrieKey in AnchoredSystemPathBuf and avoid the String
// conversion, if we decide we want to add the radix_trie dependency to
// turbopath.
struct FileHashes(Trie<String, HashMap<InputGlobs, HashState>>);

impl FileHashes {

fn new() -> Self {
    Self(Trie::new())
}

fn drop_matching<F>(&mut self, mut f: F, reason: &str)
where
    F: FnMut(&AnchoredSystemPath) -> bool,
{
    let mut previous = std::mem::take(&mut self.0);

    // radix_trie doesn't have an into_iter() implementation, so we have a slightly
    // inefficient method for removing matching values. Fortunately, we only
    // need to do this when the package layout changes. It's O(n) in the
    // number of packages, on top of the trie internals.
    let keys = previous.keys().map(|k| k.to_owned()).collect::<Vec<_>>();
    for key in keys {
        let previous_value = previous
            .remove(&key)
            .expect("this key was pulled from previous");
        let path_key =
            AnchoredSystemPath::new(&key).expect("keys are valid AnchoredSystemPaths");
        if !f(path_key) {
            // keep it, we didn't match the key.
            self.0.insert(key, previous_value);
        } else {
            for state in previous_value.into_values() {
                if let HashState::Pending(_, _, txs) = state {
                    for tx in txs {
                        let _ = tx.send(Err(Error::Unavailable(reason.to_string())));
                    }
                }
            }
        }
    }
}

fn get_changed_specs(&self, file_path: &AnchoredSystemPath) -> HashSet<HashSpec> {
    self.0
        .get_ancestor(file_path.as_str())
        // verify we have a key
        .and_then(|subtrie| subtrie.key().map(|key| (key, subtrie)))
        // convert key to AnchoredSystemPath, and verify we have a value
        .and_then(|(package_path, subtrie)| {
            let package_path = AnchoredSystemPath::new(package_path)
                .expect("keys are valid AnchoredSystemPaths");
            // handle scenarios where even though we've found an ancestor, it might be a
            // sibling file or directory that starts with the same prefix,
            // e.g. an update to apps/foo_decoy when the package path is
            // apps/foo.
            if let Some(package_path_to_file) = file_path.strip_prefix(package_path) {
                // Pass along the path to the package, the path _within_ the package to this
                // change, in unix format, and the set of input specs that
                // we're tracking.
                subtrie
                    .value()
                    .map(|specs| (package_path, package_path_to_file.to_unix(), specs))
            } else {
                None
            }
        })
        // now that we have a path and a set of specs, filter the specs to the relevant ones
        .map(|(package_path, change_in_package, input_globs)| {
            input_globs
                .keys()
                .filter_map(|input_globs| match input_globs {
                    InputGlobs::Default => Some(HashSpec {
                        package_path: package_path.to_owned(),
                        inputs: InputGlobs::Default,
                    }),
                    inputs @ InputGlobs::DefaultWithExtras(_) => Some(HashSpec {
                        package_path: package_path.to_owned(),
                        inputs: inputs.clone(),
                    }),
                    inputs @ InputGlobs::Specific(glob_set)
                        if glob_set.matches(&change_in_package) =>
                    {
                        Some(HashSpec {
                            package_path: package_path.to_owned(),
                            inputs: inputs.clone(),
                        })
                    }
                    _ => None,
                })
                .collect::<HashSet<_>>()
        })
        .unwrap_or_default()
}

fn drain(&mut self, reason: &str) {
    // funnel through drop_matching even though we could just swap with a new trie.
    // We want to ensure we respond to any pending queries.
    self.drop_matching(|_| true, reason);
}

fn contains_key(&self, key: &HashSpec) -> bool {
    self.0
        .get(key.package_path.as_str())
        .and_then(|states| states.get(&key.inputs))
        .is_some()
}

fn insert(&mut self, key: HashSpec, value: HashState) {
    if let Some(states) = self.0.get_mut(key.package_path.as_str()) {
        states.insert(key.inputs, value);
    } else {
        let mut states = HashMap::new();
        states.insert(key.inputs, value);
        self.0.insert(key.package_path.as_str().to_owned(), states);
    }
}

fn get_mut(&mut self, key: &HashSpec) -> Option<&mut HashState> {
    self.0
        .get_mut(key.package_path.as_str())
        .and_then(|states| states.get_mut(&key.inputs))
}

}

struct HashUpdate {
    spec: HashSpec,
    version: Version,
    result: Result<GitHashes, SCMError>,
}

impl Subscriber {

fn new(
    repo_root: AbsoluteSystemPathBuf,
    package_discovery: watch::Receiver<Option<DiscoveryData>>,
    scm: SCM,
    query_rx: mpsc::Receiver<Query>,
) -> Self {
    Self {
        repo_root,
        package_discovery,
        scm: SCMResource::new(scm),
        query_rx,
        next_version: AtomicUsize::new(0),
    }
}

async fn watch(
    mut self,
    mut exit_rx: oneshot::Receiver<()>,
    mut file_events: OptionalWatch<broadcast::Receiver<Result<Event, NotifyError>>>,
) {
    debug!("starting file hash watcher");
    let mut file_events_recv = match file_events.get().await {
        Ok(r) => r.resubscribe(),
        Err(e) => {
            debug!("file hash watcher exited: {:?}", e);
            return;
        }
    };
    let (hash_update_tx, mut hash_update_rx) = mpsc::channel::<HashUpdate>(16);
    let mut hashes = FileHashes::new();

    let mut package_data = self.package_discovery.borrow().to_owned();
    self.handle_package_data_update(&package_data, &mut hashes, &hash_update_tx);
    // We've gotten the ready signal from filewatching, and *some* state from
    // package discovery, but there is no guarantee that package discovery
    // is ready. This means that initial queries may be returned with errors
    // until we've completed package discovery and then hashing.
    //
    // This is the main event loop for the hash watcher. It receives file events,
    // updates to the package discovery state, and queries for hashes. It does
    // not use filesystem cookies, as it is expected that the client will
    // synchronize itself first before issuing a series of queries, one per
    // task that in the task graph for a run, and we don't want to block on
    // the filesystem for each query. This is analogous to running without
    // the daemon, where we assume a static filesystem for the duration of
    // generating task hashes.
    loop {
        select! {
            biased;
            _ = &mut exit_rx => {
                debug!("file hash watcher exited");
                return;
            },
            _ = self.package_discovery.changed() => {
                self.package_discovery.borrow().clone_into(&mut package_data);
                self.handle_package_data_update(&package_data, &mut hashes, &hash_update_tx);
            },
            file_event = file_events_recv.recv() => {
                match file_event {
                    Ok(Ok(event)) => {
                        self.handle_file_event(event, &mut hashes, &hash_update_tx);
                    },
                    Ok(Err(e)) => {
                        debug!("file watcher error: {:?}", e);
                        self.flush_and_rehash(&mut hashes, &hash_update_tx, &package_data, &format!("file watcher error: {e}"));
                    },
                    Err(broadcast::error::RecvError::Closed) => {
                        debug!("file watcher closed");
                        hashes.drain("file watcher closed");
                        return;
                    },
                    Err(broadcast::error::RecvError::Lagged(_)) => {
                        debug!("file watcher lagged");
                        self.flush_and_rehash(&mut hashes, &hash_update_tx, &package_data, "file watcher lagged");
                    },
                }
            },
            hash_update = hash_update_rx.recv() => {
                if let Some(hash_update) = hash_update {
                    self.handle_hash_update(hash_update, &mut hashes);
                } else {
                    // note that we only ever lend out hash_update_tx, so this should be impossible
                    unreachable!("hash update channel closed, but we have a live reference to it");
                }
            },
            Some(query) = self.query_rx.recv() => {
                self.handle_query(query, &mut hashes, &hash_update_tx);
            }
        }
    }
}

fn flush_and_rehash(
    &self,
    hashes: &mut FileHashes,
    hash_update_tx: &mpsc::Sender<HashUpdate>,
    package_data: &Option<Result<DiscoveryResponse, String>>,
    reason: &str,
) {
    // We need to send errors to any RPCs that are pending, and having an empty set
    // of hashes will cause handle_package_data_update to consider all
    // packages as new and rehash them.
    hashes.drain(reason);
    self.handle_package_data_update(package_data, hashes, hash_update_tx);
}

// We currently only support a single query, getting hashes for a given
// HashSpec.
fn handle_query(
    &self,
    query: Query,
    hashes: &mut FileHashes,
    hash_update_tx: &mpsc::Sender<HashUpdate>,
) {
    //trace!("handling query {query:?}");
    match query {
        Query::GetHash(spec, tx) => {
            // We don't currently support inputs that are not package-local. Adding this
            // support would require tracking arbitrary file paths and
            // mapping them back to packages. It is doable if we want to
            // attempt it in the future.
            if !spec.is_package_local() {
                let _ = tx.send(Err(Error::UnsupportedGlob));
                trace!("unsupported glob in query {:?}", spec);
                return;
            }
            if let Some(state) = hashes.get_mut(&spec) {
                match state {
                    HashState::Hashes(hashes) => {
                        tx.send(Ok(hashes.clone())).unwrap();
                    }
                    HashState::Pending(_, _, txs) => {
                        txs.push(tx);
                    }
                    HashState::Unavailable(e) => {
                        let _ = tx.send(Err(Error::HashingError(e.clone())));
                    }
                }
            } else if !matches!(spec.inputs, InputGlobs::Default)
                && hashes.contains_key(&HashSpec {
                    package_path: spec.package_path.clone(),
                    inputs: InputGlobs::Default,
                })
            {
                // in this scenario, we know the package exists, but we aren't tracking these
                // particular inputs. Queue a hash request for them.
                let (version, debouncer) = self.queue_package_hash(&spec, hash_update_tx, true);
                // this request will likely time out. However, if the client has asked for
                // this spec once, they might ask again, and we can start tracking it.
                hashes.insert(spec, HashState::Pending(version, debouncer, vec![tx]));
            } else {
                // We don't know anything about this package.
                let _ = tx.send(Err(Error::UnknownPackage(spec)));
            }
        }
    }
}

fn handle_hash_update(&self, update: HashUpdate, hashes: &mut FileHashes) {
    let HashUpdate {
        spec,
        version,
        result,
    } = update;
    // If we have a pending hash computation, update the state. If we don't, ignore
    // this update
    if let Some(state) = hashes.get_mut(&spec) {
        // We need mutable access to 'state' to update it, as well as being able to
        // extract the pending state, so we need two separate if statements
        // to pull the value apart.
        if let HashState::Pending(existing_version, _, pending_queries) = state
            && *existing_version == version
        {
            match result {
                Ok(hashes) => {
                    for pending_query in pending_queries.drain(..) {
                        // We don't care if the client has gone away
                        let _ = pending_query.send(Ok(hashes.clone()));
                    }
                    *state = HashState::Hashes(hashes);
                }
                Err(e) => {
                    let error = e.to_string();
                    for pending_query in pending_queries.drain(..) {
                        // We don't care if the client has gone away
                        let _ = pending_query.send(Err(Error::HashingError(error.clone())));
                    }
                    *state = HashState::Unavailable(error);
                }
            }
        }
    }
}

fn queue_package_hash(
    &self,
    spec: &HashSpec,
    hash_update_tx: &mpsc::Sender<HashUpdate>,
    immediate: bool,
) -> (Version, Arc<Debouncer>) {
    let version = Version(self.next_version.fetch_add(1, Ordering::SeqCst));
    let tx = hash_update_tx.clone();
    let spec = spec.clone();
    let repo_root = self.repo_root.clone();
    let scm = self.scm.clone();
    let debouncer = if immediate {
        Debouncer::new(Duration::from_millis(0))
    } else {
        Debouncer::default()
    };
    let debouncer = Arc::new(debouncer);
    let debouncer_copy = debouncer.clone();
    tokio::task::spawn(async move {
        debouncer_copy.debounce().await;
        let scm_permit = scm.acquire_scm().await;
        // We awkwardly copy the actual SCM instance since we're sending it to a
        // different thread which requires it be 'static.
        let scm_instance = scm_permit.clone();
        // Package hashing involves blocking IO calls, so run on a blocking thread.
        let blocking_handle = tokio::task::spawn_blocking(move || {
            let telemetry = None;
            let inputs = spec.inputs.as_inputs();
            let result = scm_instance.get_package_file_hashes(
                &repo_root,
                &spec.package_path,
                &inputs,
                spec.inputs.i

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Now I have all the data needed. Here's the comprehensive report:


Current State

Turborepo's build pipeline has received a burst of performance work in recent commits (Feb 2026). The engine builder BFS was optimized from 283ms to 74ms, task hash computation was parallelized across topological waves, shallow wildcard glob patterns got a fast path (54ms → 6ms), and lockfile reading now runs concurrently with workspace discovery. The codebase is already well-optimized at the macro level — remaining wins are in the cache I/O layer, SCM hashing hot paths, and fine-grained parallelism gaps.

Pipeline stages and their current state:

| Stage | Current Speed | Parallelized? | Bottleneck Type |
| --- | --- | --- | --- |
| Workspace discovery | Good (shallow wildcard fast path) | Partial | I/O (readdir + stat) |
| Package.json parsing | Good | Yes (rayon) | I/O (JSON parse) |
| Lockfile reading | Good | Yes (concurrent with discovery) | I/O |
| Engine construction | Good (283→74ms) | No (serial BFS, cached) | CPU |
| File hash computation | Good | Yes (rayon, deduplicated) | I/O + CPU |
| Task hash precomputation | Good | Partial (wave-by-wave) | CPU |
| Task execution | Good | Yes (semaphore-controlled) | External process |
| Cache archive creation | Slow | No | I/O + CPU (single-threaded zstd) |
| Cache archive restore | Slow | No | I/O (8KB buffer, no BufWriter) |

Required Changes

1. Enable Multi-Threaded zstd Compression (HIGH impact, LOW effort)

Files: crates/turborepo-cache/src/cache_archive/create.rs:115,151

Problem: zstd Encoder::new(writer, 0) uses single-threaded compression at default level. The zstd crate (v0.13.3) supports multi-threaded encoding via multithread().

Change:

// Before (create.rs:151)
let zw = zstd::Encoder::new(file_buffer, 0)?.auto_finish();

// After
let mut zw = zstd::Encoder::new(file_buffer, 0)?;
// Requires the `zstdmt` cargo feature on the zstd crate.
zw.multithread(num_cpus::get() as u32)?;  // or rayon::current_num_threads()
let zw = zw.auto_finish();

Expected gain: 2-4x faster cache writes for large output directories (Next.js .next/ with hundreds of chunks). Compression is per-task so this hits every cache-miss execution.

Verification:

# Build turborepo
cargo build --release -p turbo

# Benchmark cache write time on a real monorepo with tracing:
TURBO_LOG_VERBOSITY=debug turbo run build --force 2>&1 | grep -E 'cache (save|restore|archive)'

# Compare archive sizes before/after (ensure ratio stays reasonable):
ls -la node_modules/.cache/turbo/*.zst | awk '{sum+=$5} END {print sum}'

2. Add BufWriter to Cache Restore + Increase SHA Buffer (HIGH impact, LOW effort)

Files: crates/turborepo-cache/src/cache_archive/restore.rs:39,49

Problem: Restore writes files without buffering (raw io::copy to File), and SHA computation uses an 8KB buffer. The create path uses a 1MB BufWriter — this asymmetry means restores are I/O-bound on syscall overhead.

Change for SHA buffer (restore.rs:49):

// Before
let mut buffer = [0; 8192];
// After
let mut buffer = [0; 131072];  // 128KB

Change for restore writes (restore_regular.rs:39):

// Before
io::copy(entry, &mut file)?;
// After
let mut buffered = BufWriter::with_capacity(1 << 20, file);  // 1MB
io::copy(entry, &mut buffered)?;
buffered.flush()?;  // surface write errors; BufWriter's implicit flush on Drop ignores them

Expected gain: 5-15% faster cache restores, especially for tasks with many small output files.

Verification:

# Run with warm cache to measure restore path:
turbo run build  # first run (populates cache)
turbo run build  # second run (all cache hits — times the restore path)

# Use hyperfine for statistical comparison:
hyperfine --warmup 1 'turbo run build --force' --prepare 'turbo run build'

3. Eliminate encode_hex::<String>() Allocation in Manual Hashing (MEDIUM impact, LOW effort)

File: crates/turborepo-scm/src/manual.rs:38

Problem: git_like_hash_file() calls result.encode_hex::<String>() which heap-allocates a 40-byte String on every file. The git2 path (hash_object.rs:67-69) already uses a stack buffer — manual mode should match.

Change:

// Before (manual.rs:38)
Ok(result.encode_hex::<String>())

// After
let mut hex_buf = [0u8; 40];
hex::encode_to_slice(result.as_slice(), &mut hex_buf).unwrap();
// SAFETY: hex output is always valid ASCII
Ok(unsafe { std::str::from_utf8_unchecked(&hex_buf) }.to_string())

Expected gain: Eliminates one heap allocation per file in the manual hashing path. In a 10,000-file monorepo, that's 10,000 fewer allocations per build. The final .to_string() is still needed (returning String), but this avoids the intermediate hex crate allocation.

Verification:

# Run turborepo-scm tests:
cargo test -p turborepo-scm --features git2

# Benchmark with DHAT or a custom benchmark:
cargo bench -p turborepo-scm  # if benches exist

4. Eliminate Double Directory Walk in Manual Mode (MEDIUM impact, MEDIUM effort)

File: crates/turborepo-scm/src/manual.rs:108-165

Problem: get_package_file_hashes_without_git() walks the directory tree twice when include_default_files is true — once for user-specified inputs (without gitignore), once for defaults (with gitignore). Both walks do full metadata checks and hashing.

Change: Merge into a single walk that tracks which files matched user inputs vs. defaults, then applies gitignore filtering only to the default set post-walk.
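To pin down the intended semantics, here is a minimal model of the merged classification pass. `classify_single_walk`, the `(path, is_gitignored)` entry shape, and the matcher closures are hypothetical stand-ins for the real walker in `manual.rs` — the point is only the invariant: user-specified inputs bypass gitignore (today's first walk), defaults respect it (today's second walk), and both are decided in one traversal.

```rust
use std::path::{Path, PathBuf};

/// One pass over walked entries instead of two directory walks.
/// User-input matches bypass gitignore; default matches are filtered by it.
fn classify_single_walk(
    entries: &[(PathBuf, bool)], // (path, is_gitignored) — hypothetical shape
    matches_user_input: impl Fn(&Path) -> bool,
    matches_default: impl Fn(&Path) -> bool,
) -> Vec<PathBuf> {
    let mut to_hash = Vec::new();
    for (path, gitignored) in entries {
        if matches_user_input(path) {
            // User-specified inputs ignore .gitignore (current first-walk behavior).
            to_hash.push(path.clone());
        } else if matches_default(path) && !*gitignored {
            // Default files respect .gitignore (current second-walk behavior).
            to_hash.push(path.clone());
        }
    }
    to_hash
}

fn main() {
    let entries = vec![
        (PathBuf::from("src/main.rs"), false),
        (PathBuf::from("dist/out.js"), true),  // gitignored build output
        (PathBuf::from(".env.local"), true),   // gitignored, but user asked for it
    ];
    let hashed = classify_single_walk(&entries, |p| p.ends_with(".env.local"), |_| true);
    assert_eq!(
        hashed,
        vec![PathBuf::from("src/main.rs"), PathBuf::from(".env.local")]
    );
}
```

The equivalence argument for the edge cases called out under Ripple Effects reduces to this per-file decision table, which is why nested-`.gitignore` tests matter: the `gitignored` bit must be computed identically to the second walk's matcher.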

Expected gain: 30-50% faster for packages with include_default_files: true in non-git environments (CI containers without git, Docker builds).

Verification:

cargo test -p turborepo-scm --features git2 -- manual
# Integration test on a repo without git:
cd /tmp && mkdir -p test-pkg && echo '{}' > test-pkg/package.json
# Run turborepo hash computation and compare times

5. Cache Compiled Glob Patterns Across Packages (MEDIUM impact, MEDIUM effort)

File: crates/turborepo-globwalk/src/lib.rs

Problem: Every call to globwalk() recompiles glob patterns from scratch. In a 100-package monorepo where most packages share the same output patterns (e.g., dist/**, .next/**), the same regex compilation happens 100+ times across save_outputs and file-hashing paths.

Change: Add a thread_local! or DashMap<String, CompiledGlob> cache keyed by pattern string:

use std::sync::OnceLock;
use dashmap::DashMap;

static GLOB_CACHE: OnceLock<DashMap<String, Glob<'static>>> = OnceLock::new();

fn cached_glob(pattern: &str) -> Result<Glob<'static>, WalkError> {
    let cache = GLOB_CACHE.get_or_init(DashMap::new);
    if let Some(g) = cache.get(pattern) {
        return Ok(g.clone());
    }
    let g = Glob::new(pattern)?.into_owned();
    cache.insert(pattern.to_string(), g.clone());
    Ok(g)
}

Expected gain: 5-15ms saved in repos with 100+ packages sharing common output patterns. The wax::Glob regex compilation is non-trivial (~50-100μs per pattern).

Verification:

cargo test -p turborepo-globwalk
# Profile with tracing:
TURBO_LOG_VERBOSITY=debug turbo run build 2>&1 | grep globwalk

6. Parallel Config Loading in Engine Builder BFS (LOW impact, MEDIUM effort)

File: crates/turborepo-engine/src/builder.rs:261-425

Problem: The BFS loop loads turbo.json configs serially per workspace. While preload_all() warms the cache before the BFS, any cache misses during BFS (e.g., dynamically-discovered workspaces) hit the filesystem synchronously.

Current mitigation: TurboJsonLoader::preload_all() already parallelizes warmup via rayon. The BFS itself is fast (74ms after recent optimizations). This is low-priority unless preload misses are common.

Assumption: Most repos won't see misses after preload_all(). Only relevant for repos with dynamic workspace discovery or conditional turbo.json files.

Verification: Add tracing to detect cache misses in the BFS loop:

if chain_cache.get(&package_name).is_none() {
    tracing::warn!("turbo.json cache miss during BFS for {}", package_name);
}

7. Wave-to-Queue Task Hash Precomputation (LOW impact, HIGH effort)

File: crates/turborepo-lib/src/task_graph/visitor/mod.rs:191-303

Problem: Task hashes are precomputed wave-by-wave (all tasks at topological depth N must finish before depth N+1 starts). For deep, narrow dependency graphs, this serializes hashing unnecessarily.

Current state: Already good for wide graphs (common in monorepos). For a graph of depth 20 with 1 task per level, this means 20 sequential rounds instead of pipelining.

Change: Replace wave-based iteration with a dependency-aware queue (similar to the execution engine's approach) where tasks are hashed as soon as all their dependencies are hashed.
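A minimal model of the queue approach, using indegree counting over a toy task map — `hash_in_dependency_order` and the string task IDs are illustrative, not the visitor's actual types. The property it demonstrates: a task becomes eligible the moment its own dependencies are hashed, so tasks at different depths can be in flight simultaneously instead of draining a full wave barrier at each depth.

```rust
use std::collections::{HashMap, VecDeque};

/// Kahn-style ready queue: emit each task as soon as its last dependency
/// is done, rather than waiting for the whole topological wave.
fn hash_in_dependency_order(
    deps: &HashMap<&'static str, Vec<&'static str>>, // task -> its dependencies
) -> Vec<&'static str> {
    // Number of unhashed dependencies per task.
    let mut indegree: HashMap<&'static str, usize> =
        deps.iter().map(|(t, d)| (*t, d.len())).collect();
    // Reverse edges: dependency -> tasks waiting on it.
    let mut dependents: HashMap<&'static str, Vec<&'static str>> = HashMap::new();
    for (task, ds) in deps {
        for d in ds {
            dependents.entry(*d).or_default().push(*task);
        }
    }
    let mut ready: VecDeque<&'static str> = indegree
        .iter()
        .filter_map(|(t, n)| (*n == 0).then_some(*t))
        .collect();
    let mut hashed_order = Vec::new();
    while let Some(task) = ready.pop_front() {
        // Real version: dispatch the hash computation for `task` here;
        // completions unlock dependents individually, not per-wave.
        hashed_order.push(task);
        if let Some(waiting) = dependents.get(task) {
            for w in waiting {
                let n = indegree.get_mut(w).unwrap();
                *n -= 1;
                if *n == 0 {
                    ready.push_back(*w);
                }
            }
        }
    }
    hashed_order
}

fn main() {
    let deps = HashMap::from([
        ("app#build", vec!["lib#build"]),
        ("lib#build", vec!["core#build"]),
        ("core#build", vec![]),
    ]);
    let order = hash_in_dependency_order(&deps);
    assert_eq!(order, vec!["core#build", "lib#build", "app#build"]);
}
```

The synchronization risk noted in Ripple Effects lives in the decrement step: in a concurrent version the indegree counters must be atomic (or mutated under the completion handler's lock) so a task is enqueued exactly once.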

Expected gain: 20-40% faster hashing for deep dependency graphs. Minimal improvement for wide graphs (already parallelized within waves).

Verification:

# Use a repo with deep dependency chains:
turbo run build --dry=json | jq '.tasks | length'
# Compare precompute_task_hashes timing with tracing spans

Ripple Effects

| Change | Risk | What Could Break |
| --- | --- | --- |
| Multi-threaded zstd | Low | Increased memory usage during compression (~1MB per thread). Archives remain compatible (decompression is format-level, not thread-aware). |
| BufWriter on restore | Very Low | None — purely additive buffering. Drop flushes automatically. |
| Hex encoding stack buffer | Very Low | None — identical output, just fewer allocations. |
| Single-walk manual mode | Medium | Gitignore behavior must exactly match the current two-walk approach. Edge cases with .gitignore in nested directories need careful testing. |
| Glob pattern cache | Low | Cache invalidation is not needed (patterns are pure functions of input strings). Memory growth is bounded by unique pattern count. |
| Wave-to-queue hashing | Medium | Must preserve the invariant that a task's dependencies are all hashed before the task itself. Race conditions possible if the queue isn't properly synchronized. |

Cross-cutting concern: All changes in the cache layer affect both local and remote cache paths. The async cache layer (async_cache.rs) wraps these operations, so changes propagate automatically. Remote cache uploads/downloads use the same zstd format, so compression changes are transparent.


What This Unlocks

  1. Faster CI cold starts: Multi-threaded zstd + buffered restores directly reduce the time spent on cache operations, which dominates CI wall-clock time when most tasks are cache misses.

  2. Better scaling for large monorepos: Glob pattern caching and single-walk manual mode reduce overhead that scales linearly with package count. A 500-package monorepo would save 50-75ms on glob compilation alone.

  3. Profiling infrastructure: Adding tracing spans to cache create/restore operations (which the codebase already does in some paths) enables data-driven optimization. The recent commits show this pattern working well — build_engine went from 283ms to 74ms by profiling first.

  4. Foundation for streaming cache: Buffered I/O on both create and restore sides is a prerequisite for future streaming cache support (writing to remote cache while compressing, rather than write-local-then-upload).


Effort & Risk

| # | Change | Effort | Risk | Expected Gain | Verification |
| --- | --- | --- | --- | --- | --- |
| 1 | Multi-threaded zstd | 1-2 hours | Low | 2-4x cache write speed | cargo test -p turborepo-cache + hyperfine benchmark |
| 2 | BufWriter + SHA buffer | 30 min | Very Low | 5-15% restore speed | cargo test -p turborepo-cache + before/after timing |
| 3 | Stack hex encoding | 30 min | Very Low | ~10K fewer allocs/build | cargo test -p turborepo-scm --features git2 |
| 4 | Single-walk manual mode | 4-8 hours | Medium | 30-50% manual hash speed | cargo test -p turborepo-scm --features git2 -- manual |
| 5 | Glob pattern cache | 2-4 hours | Low | 5-15ms per run | cargo test -p turborepo-globwalk |
| 6 | Engine BFS cache-miss logging | 1 hour | None | Diagnostic only | cargo test -p turborepo-engine |
| 7 | Wave-to-queue hashing | 8-16 hours | Medium | 20-40% for deep graphs | Full integration test suite |

Recommended order: 1 → 2 → 3 → 5 → 4 → 6 → 7 (highest ROI first, lowest risk first)

Holistic verification for all changes:

# Full test suite (must pass before and after):
cargo test --workspace

# Integration tests:
cargo test -p turborepo-tests-integration

# End-to-end benchmark (use a real monorepo):
# cold run ('--force') measures hashing + execution + cache write;
# warm run measures cache restore.
hyperfine --warmup 2 \
  'turbo run build --force' \
  'turbo run build'

# Tracing-based profiling:
TURBO_LOG_VERBOSITY=debug turbo run build --force 2>&1 | \
  grep -E '(build_engine|hash_scope|cache|globwalk)' | \
  sort -t= -k2 -n

Assumptions:

  • The zstd 0.13.3 crate exposes multithread() only when its `zstdmt` cargo feature is enabled — confirm the feature is on in turborepo-cache's dependency declaration before relying on this change
  • Manual hashing mode (get_package_file_hashes_without_git) is used in CI environments without git — if this path is rare, change #4 has lower priority
  • The wax::Glob compilation cost (~50-100μs per pattern) is based on typical regex complexity; extremely simple patterns may compile faster, reducing the glob cache benefit

Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975) db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972) d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974) e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973) 4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970) 75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971) b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969) 69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:

Exploration

AREA: scm-package-hash-alloc-reduction FILES: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs MATURITY: stable RELEVANCE: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. NOTES: (unspecified)

Raw Exploration Notes: Based on the research across all three explorations, here's the highest-leverage untapped area:

AREA: scm-package-hash-alloc-reduction FILES: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs MATURITY: stable RELEVANCE: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size.

Selected Code Context

===== FILE: crates/turborepo-scm/src/hash_object.rs ===== #![cfg(feature = "git2")] use rayon::prelude::*; use tracing::debug; use turbopath::{AbsoluteSystemPath, AnchoredSystemPathBuf, RelativeUnixPath, RelativeUnixPathBuf};

use crate::{Error, GitHashes};

const MAX_RETRIES: u32 = 10; const BASE_DELAY_MS: u64 = 10; const MAX_DELAY_MS: u64 = 1000;

fn hash_file_with_retry(path: &AbsoluteSystemPath) -> Result<git2::Oid, git2::Error> { for attempt in 0..MAX_RETRIES { match git2::Oid::hash_file(git2::ObjectType::Blob, path) { Ok(oid) => return Ok(oid), Err(e) if is_too_many_open_files(&e) => { let delay = std::cmp::min(BASE_DELAY_MS * 2u64.pow(attempt), MAX_DELAY_MS); debug!( attempt = attempt + 1, delay_ms = delay, "too many open files, retrying hash_file" ); std::thread::sleep(std::time::Duration::from_millis(delay)); } Err(e) => return Err(e), } } git2::Oid::hash_file(git2::ObjectType::Blob, path) }

fn is_too_many_open_files(e: &git2::Error) -> bool { if e.class() != git2::ErrorClass::Os { return false; } let msg = e.message(); msg.contains("Too many open files") || msg.contains("EMFILE") }

#[tracing::instrument(skip(git_root, hashes, to_hash))] pub(crate) fn hash_objects( git_root: &AbsoluteSystemPath, pkg_path: &AbsoluteSystemPath, to_hash: Vec, hashes: &mut GitHashes, ) -> Result<(), Error> { let pkg_prefix = git_root.anchor(pkg_path).ok().map(|a| a.to_unix());

hashes.reserve(to_hash.len());
let results: Vec<Result<Option<(RelativeUnixPathBuf, String)>, Error>> = to_hash
    .into_par_iter()
    .map(|filename| {
        let full_file_path = git_root.join_unix_path(&filename);
        match hash_file_with_retry(&full_file_path) {
            Ok(hash) => {
                let package_relative_path = pkg_prefix
                    .as_ref()
                    .and_then(|prefix| {
                        RelativeUnixPath::strip_prefix(&filename, prefix)
                            .ok()
                            .map(|stripped| stripped.to_owned())
                    })
                    .unwrap_or_else(|| {
                        AnchoredSystemPathBuf::relative_path_between(pkg_path, &full_file_path)
                            .to_unix()
                    });
                let mut hex_buf = [0u8; 40];
                hex::encode_to_slice(hash.as_bytes(), &mut hex_buf).unwrap();
                // SAFETY: hex output is always valid ASCII
                let hash_str = unsafe { std::str::from_utf8_unchecked(&hex_buf) }.to_string();
                Ok(Some((package_relative_path, hash_str)))
            }
            Err(e) => {
                if e.class() == git2::ErrorClass::Os
                    && full_file_path
                        .symlink_metadata()
                        .map(|md| md.is_symlink())
                        .unwrap_or(false)
                {
                    Ok(None)
                } else {
                    Err(Error::git2_error_context(e, full_file_path.to_string()))
                }
            }
        }
    })
    .collect();

for result in results {
    if let Some((path, hash)) = result? {
        hashes.insert(path, hash);
    }
}
Ok(())

}

#[cfg(test)] mod test { use turbopath::{AbsoluteSystemPathBuf, RelativeUnixPathBuf, RelativeUnixPathBufTestExt};

use super::hash_objects;
use crate::{GitHashes, find_git_root};

#[test]
fn test_read_object_hashes() {
    // Note that cwd can be different based on where the test suite is running from
    // or if the test is launched in debug mode from VSCode
    let cwd = std::env::current_dir().unwrap();
    let cwd = AbsoluteSystemPathBuf::try_from(cwd).unwrap();
    let git_root = find_git_root(&cwd).unwrap();
    let fixture_path = git_root.join_components(&[
        "crates",
        "turborepo-scm",
        "fixtures",
        "01-git-hash-object",
    ]);

    let fixture_child_path = fixture_path.join_component("child");
    let git_root = find_git_root(&fixture_path).unwrap();

    // paths for files here are relative to the package path.
    let tests: Vec<(Vec<(&str, &str)>, &AbsoluteSystemPathBuf)> = vec![
        (vec![], &fixture_path),
        (
            vec![
                ("../root.json", "e69de29bb2d1d6434b8b29ae775ad8c2e48c5391"),
                ("child.json", "e69de29bb2d1d6434b8b29ae775ad8c2e48c5391"),
                (
                    "grandchild/grandchild.json",
                    "e69de29bb2d1d6434b8b29ae775ad8c2e48c5391",
                ),
            ],
            &fixture_child_path,
        ),
    ];

    for (to_hash, pkg_path) in tests {
        let file_hashes: Vec<(RelativeUnixPathBuf, String)> = to_hash
            .into_iter()
            .map(|(raw, hash)| (RelativeUnixPathBuf::new(raw).unwrap(), String::from(hash)))
            .collect();

        let git_to_pkg_path = git_root.anchor(pkg_path).unwrap();
        let pkg_prefix = git_to_pkg_path.to_unix();

        let expected_hashes = GitHashes::from_iter(file_hashes);
        let mut hashes = GitHashes::new();
        let to_hash = expected_hashes.keys().map(|k| pkg_prefix.join(k)).collect();
        hash_objects(&git_root, pkg_path, to_hash, &mut hashes).unwrap();
        assert_eq!(hashes, expected_hashes);
    }

    // paths for files here are relative to the package path.
    let error_tests: Vec<(Vec<&str>, &AbsoluteSystemPathBuf)> = vec![
        // skipping test for outside of git repo, we now error earlier in the process
        (vec!["nonexistent.json"], &fixture_path),
    ];

    for (to_hash, pkg_path) in error_tests {
        let git_to_pkg_path = git_root.anchor(pkg_path).unwrap();
        let pkg_prefix = git_to_pkg_path.to_unix();

        let to_hash = to_hash
            .into_iter()
            .map(|k| pkg_prefix.join(&RelativeUnixPathBuf::new(k).unwrap()))
            .collect();

        let mut hashes = GitHashes::new();
        let result = hash_objects(&git_root, pkg_path, to_hash, &mut hashes);
        assert!(result.is_err());
    }
}

} ===== END FILE: crates/turborepo-scm/src/hash_object.rs =====

===== FILE: crates/turborepo-scm/src/repo_index.rs ===== #![cfg(feature = "git2")]

use tracing::{debug, trace}; use turbopath::RelativeUnixPathBuf;

use crate::{Error, GitHashes, GitRepo, ls_tree::SortedGitHashes, status::RepoStatusEntry};

/// Pre-computed repo-wide git index that caches file hashes and working-tree /// status so they can be filtered per-package without spawning additional /// subprocesses. /// /// Both collections are sorted by path so that per-package lookups can use /// partition_point (binary search) for range queries. This gives O(log n) /// lookup cost with good cache locality on contiguous memory. pub struct RepoGitIndex { ls_tree_hashes: SortedGitHashes, /// Sorted by path so per-package filtering can use binary-search range /// queries instead of linear scans. status_entries: Vec, }

impl RepoGitIndex { /// Build the index using the gix-index path when available, falling back to /// the libgit2 ls-tree + status path otherwise. #[tracing::instrument(skip(git))] pub fn new(git: &GitRepo) -> Result<Self, Error> { #[cfg(feature = "gix")] { match Self::new_from_gix_index(git) { Ok(index) => return Ok(index), Err(e) => { debug!("gix-index path failed: {}. Falling back to libgit2.", e); } } }

    Self::new_from_libgit2(git)
}

/// Build the index by running `git ls-tree` and `git status` via libgit2
/// on separate threads.
fn new_from_libgit2(git: &GitRepo) -> Result<Self, Error> {
    let (ls_tree_hashes, status_entries) = std::thread::scope(|s| {
        let ls_tree = s.spawn(|| git.git_ls_tree_repo_root_sorted());
        let status = s.spawn(|| git.git_status_repo_root());
        (
            ls_tree.join().expect("ls-tree thread panicked"),
            status.join().expect("status thread panicked"),
        )
    });
    let ls_tree_hashes = ls_tree_hashes?;
    let mut status_entries = status_entries?;

    status_entries.sort_by(|a, b| a.path.cmp(&b.path));

    debug!(
        "built repo git index (libgit2): ls_tree_count={}, status_count={}",
        ls_tree_hashes.len(),
        status_entries.len(),
    );
    Ok(Self {
        ls_tree_hashes,
        status_entries,
    })
}

/// Build the index by reading `.git/index` directly via gix-index.
///
/// This replaces both `git ls-tree` and `git status` with a single
/// operation: reading the index file gives us committed blob OIDs, and
/// stat-comparing each entry against the filesystem tells us which files
/// are modified or deleted. Untracked files are detected by a parallel
/// walk of the working tree respecting .gitignore.
///
/// Racy-git entries (where mtime >= index timestamp, so we can't trust
/// the stat comparison) are deferred to per-package hashing rather than
/// content-hashed inline. This avoids reading every file from disk on
/// freshly cloned/checked-out repos.
#[cfg(feature = "gix")]
#[tracing::instrument(skip(git))]
fn new_from_gix_index(git: &GitRepo) -> Result<Self, Error> {
    use rayon::prelude::*;

    let git_dir = git.root.join_component(".git");
    let index_path = git_dir.join_component("index");

    if !index_path.exists() {
        return Err(Error::git_error("no .git/index file found"));
    }

    let index = gix_index::File::at(
        index_path.as_std_path(),
        gix_index::hash::Kind::Sha1,
        true, // skip_hash: don't verify the index checksum (2x faster)
        gix_index::decode::Options::default(),
    )
    .map_err(|e| Error::git_error(format!("failed to read git index: {}", e)))?;

    let stat_opts = gix_index::entry::stat::Options {
        trust_ctime: true,
        check_stat: true,
        // Nanosecond precision reduces false racy-git entries on modern
        // filesystems (macOS APFS, Linux ext4/btrfs).
        use_nsec: true,
        use_stdev: false,
    };

    let index_timestamp = index.timestamp();

    // The index is sorted by path. rayon's indexed collect preserves
    // order, and our sequential collection loop preserves order, so
    // ls_tree_hashes will be sorted without an explicit sort.
    let num_entries = index
        .entries()
        .iter()
        .filter(|e| !e.mode.is_submodule())
        .count();

    // Classify entries in parallel: stat each file, compare with index,
    // and carry the raw ObjectId (20 bytes, Copy) instead of a heap-allocated
    // hex String. Hex conversion uses a thread-local stack buffer to avoid
    // allocator contention across rayon threads.
    let classified: Vec<Result<EntryClassification, Error>> = index
        .entries()
        .par_iter()
        .filter(|e| !e.mode.is_submodule())
        .map(|e| {
            let path_bytes = e.path(&index);
            let path_str = std::str::from_utf8(path_bytes).map_err(|err| {
                Error::git_error(format!("invalid utf8 in index path: {}", err))
            })?;
            let rel_path = RelativeUnixPathBuf::new(path_str)?;
            let abs_path = git.root.join_unix_path(&rel_path);

            match gix_index::fs::Metadata::from_path_no_follow(abs_path.as_std_path()) {
                Ok(fs_meta) => {
                    let fs_stat = gix_index::entry::Stat::from_fs(&fs_meta).map_err(|err| {
                        Error::git_error(format!(
                            "failed to convert stat for {}: {}",
                            path_str, err
                        ))
                    })?;

                    let stat_matches = e.stat.matches(&fs_stat, stat_opts);

                    if !stat_matches {
                        return Ok(EntryClassification::Modified { path: rel_path });
                    }

                    let is_racy = e.stat.is_racy(index_timestamp, stat_opts);
                    if is_racy {
                        return Ok(EntryClassification::Modified { path: rel_path });
                    }

                    // Clean: hex-encode the OID using a stack buffer to
                    // avoid the intermediate HexDisplay allocation from
                    // to_hex().to_string().
                    let mut hex_buf = [0u8; 40];
                    hex::encode_to_slice(e.id.as_bytes(), &mut hex_buf).unwrap();
                    // SAFETY: hex output is always valid ASCII/UTF-8.
                    let oid_str =
                        unsafe { std::str::from_utf8_unchecked(&hex_buf) }.to_string();
                    Ok(EntryClassification::Clean {
                        path: rel_path,
                        oid: oid_str,
                    })
                }
                Err(_) => Ok(EntryClassification::Deleted { path: rel_path }),
            }
        })
        .collect();

    let mut ls_tree_hashes = SortedGitHashes::with_capacity(num_entries);
    let mut status_entries = Vec::new();

    for result in classified {
        match result? {
            EntryClassification::Clean { path, oid } => {
                ls_tree_hashes.push((path, oid));
            }
            EntryClassification::Modified { path } => {
                status_entries.push(RepoStatusEntry {
                    path,
                    is_delete: false,
                });
            }
            EntryClassification::Deleted { path } => {
                status_entries.push(RepoStatusEntry {
                    path,
                    is_delete: true,
                });
            }
        }
    }

    // ls_tree_hashes is already sorted (git index is sorted, rayon
    // preserves order for indexed iterators, sequential loop preserves
    // order). status_entries from Modified/Deleted are also in index order
    // (sorted). Sort once now so find_untracked_files can binary search
    // directly on &[RepoStatusEntry] without cloning paths into Strings.
    status_entries.sort_by(|a, b| a.path.cmp(&b.path));

    let untracked = find_untracked_files(git, &ls_tree_hashes, &status_entries)?;
    for path in untracked {
        status_entries.push(RepoStatusEntry {
            path,
            is_delete: false,
        });
    }

    status_entries.sort_by(|a, b| a.path.cmp(&b.path));

    debug!(
        "built repo git index (gix-index): clean_count={}, status_count={}",
        ls_tree_hashes.len(),
        status_entries.len(),
    );

    Ok(Self {
        ls_tree_hashes,
        status_entries,
    })
}

/// Extract hashes for a single package from the cached repo-wide data.
///
/// Returns `(hashes, to_hash)` where:
/// - `hashes` contains committed file hashes keyed by package-relative
///   paths
/// - `to_hash` contains git-root-relative paths for files that need hashing
///   (modified/untracked files within the package)
pub fn get_package_hashes(
    &self,
    pkg_prefix: &RelativeUnixPathBuf,
) -> Result<(GitHashes, Vec<RelativeUnixPathBuf>), Error> {
    let prefix_str = pkg_prefix.as_str();
    let prefix_is_empty = prefix_str.is_empty();

    let mut hashes = if prefix_is_empty {
        let mut h = GitHashes::with_capacity(self.ls_tree_hashes.len());
        for (path, hash) in &self.ls_tree_hashes {
            h.insert(path.clone(), hash.clone());
        }
        h
    } else {
        // '0' is the ASCII character immediately after '/', so the
        // half-open range [prefix + "/", prefix + "0") selects exactly
        // the paths under the prefix directory.
        let range_start = RelativeUnixPathBuf::new(format!("{}/", prefix_str)).unwrap();
        let range_end = RelativeUnixPathBuf::new(format!("{}0", prefix_str)).unwrap();
        let lo = self
            .ls_tree_hashes
            .partition_point(|(k, _)| *k < range_start);
        let hi = self.ls_tree_hashes.partition_point(|(k, _)| *k < range_end);
        let mut h = GitHashes::with_capacity(hi - lo);
        for (path, hash) in &self.ls_tree_hashes[lo..hi] {
            if let Ok(stripped) = path.strip_prefix(pkg_prefix) {
                h.insert(stripped, hash.clone());
            }
        }
        h
    };

    let mut to_hash = Vec::new();
    let status_entries = if prefix_is_empty {
        &self.status_entries[..]
    } else {
        let range_start = RelativeUnixPathBuf::new(format!("{}/", prefix_str)).unwrap();
        let range_end = RelativeUnixPathBuf::new(format!("{}0", prefix_str)).unwrap();
        let lo = self
            .status_entries
            .partition_point(|e| e.path < range_start);
        let hi = self.status_entries.partition_point(|e| e.path < range_end);
        &self.status_entries[lo..hi]
    };
    for entry in status_entries {
        if entry.is_delete {
            if let Ok(stripped) = entry.path.strip_prefix(pkg_prefix) {
                hashes.remove(&stripped);
            }
        } else {
            to_hash.push(entry.path.clone());
        }
    }

    trace!(
        "filtered repo index for package: pkg_prefix={:?}, ls_tree_matched={}, \
         to_hash_count={}",
        prefix_str,
        hashes.len(),
        to_hash.len(),
    );

    Ok((hashes, to_hash))
}

}
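
The sorted-slice range query in `get_package_hashes` hinges on `'0'` being the ASCII character immediately after `'/'`. A minimal std-only sketch of the same technique (the `package_range` helper and its data are illustrative, not from the Turborepo codebase):

```rust
// Because '0' sorts immediately after '/', the half-open range
// [prefix + "/", prefix + "0") covers exactly the entries under the
// prefix directory, excluding siblings like "apps/web-admin" when
// querying "apps/web".
fn package_range<'a>(
    sorted: &'a [(String, String)],
    prefix: &str,
) -> &'a [(String, String)] {
    let start = format!("{prefix}/");
    let end = format!("{prefix}0");
    let lo = sorted.partition_point(|(k, _)| k.as_str() < start.as_str());
    let hi = sorted.partition_point(|(k, _)| k.as_str() < end.as_str());
    &sorted[lo..hi]
}

fn main() {
    // Entries must be sorted by path, as the git index already is.
    // Note "apps/web-admin" sorts *before* "apps/web/" ('-' < '/').
    let entries = vec![
        ("apps/docs/README.md".to_string(), "aaa".to_string()),
        ("apps/web-admin/index.ts".to_string(), "bbb".to_string()),
        ("apps/web/index.ts".to_string(), "ccc".to_string()),
        ("apps/web/lib.ts".to_string(), "ddd".to_string()),
    ];
    let matched = package_range(&entries, "apps/web");
    assert_eq!(matched.len(), 2);
    assert!(matched.iter().all(|(k, _)| k.starts_with("apps/web/")));
    println!("matched {} entries", matched.len());
}
```

Two `partition_point` calls on a sorted `Vec` give the same answer as `BTreeMap::range`, with better cache locality and no per-node allocation.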

/// Walk the working tree to find untracked files (files on disk that are
/// not in the git index). Uses the ignore crate's parallel walker to
/// respect .gitignore rules. Binary searches directly on the sorted
/// ls_tree_hashes and status_entries slices — no intermediate
/// allocations needed.
///
/// IMPORTANT: status_entries must be sorted by path before calling.
///
/// Each walker thread accumulates results in a thread-local Vec and
/// batch-sends them through a channel, avoiding per-file mutex contention.
#[cfg(feature = "gix")]
#[tracing::instrument(skip(git, ls_tree_hashes, status_entries))]
fn find_untracked_files(
    git: &GitRepo,
    ls_tree_hashes: &SortedGitHashes,
    status_entries: &[RepoStatusEntry],
) -> Result<Vec<RelativeUnixPathBuf>, Error> {
use std::sync::mpsc;

use ignore::WalkBuilder;

let (tx, rx) = mpsc::channel::<Vec<RelativeUnixPathBuf>>();
let root = git.root.as_std_path();

let walker = WalkBuilder::new(root)
    .follow_links(false)
    .git_ignore(true)
    .require_git(true)
    .hidden(false)
    .filter_entry(|entry| {
        // Never descend into .git/ — the ignore crate may walk it when
        // hidden(false) is set because .git is a hidden directory.
        !(entry.file_type().is_some_and(|ft| ft.is_dir()) && entry.file_name() == ".git")
    })
    .threads(rayon::current_num_threads().min(8))
    .build_parallel();

struct FlushOnDrop {
    buf: Vec<RelativeUnixPathBuf>,
    tx: mpsc::Sender<Vec<RelativeUnixPathBuf>>,
}

impl Drop for FlushOnDrop {
    fn drop(&mut self) {
        if !self.buf.is_empty() {
            let batch = std::mem::take(&mut self.buf);
            let _ = self.tx.send(batch);
        }
    }
}

walker.run(|| {
    let mut guard = FlushOnDrop {
        buf: Vec::new(),
        tx: tx.clone(),
    };

    Box::new(move |entry| {
        let entry = match entry {
            Ok(e) => e,
            Err(_) => return ignore::WalkState::Continue,
        };

        if entry.file_type().is_some_and(|ft| ft.is_dir()) {
            return ignore::WalkState::Continue;
        }
        if entry.file_type().is_some_and(|ft| ft.is_symlink()) {
            return ignore::WalkState::Continue;
        }

        let abs_path = entry.into_path();
        let rel_path = match abs_path.strip_prefix(root) {
            Ok(rel) => rel,
            Err(_) => return ignore::WalkState::Continue,
        };

        let unix_str = match rel_path.to_str() {
            Some(s) => s,
            None => return ignore::WalkState::Continue,
        };

        #[cfg(windows)]
        let unix_str_owned = unix_str.replace('\\', "/");
        #[cfg(windows)]
        let unix_str: &str = &unix_str_owned;

        // Binary search directly on the borrowed slices — no cloned Strings.
        let in_ls_tree = ls_tree_hashes
            .binary_search_by(|(p, _)| p.as_str().cmp(unix_str))
            .is_ok();
        let in_status = status_entries
            .binary_search_by(|e| e.path.as_str().cmp(unix_str))
            .is_ok();

        if !in_ls_tree
            && !in_status
            && let Ok(path) = RelativeUnixPathBuf::new(unix_str)
        {
            guard.buf.push(path);
        }

        ignore::WalkState::Continue
    })
});
drop(tx);

let mut untracked = Vec::new();
for batch in rx.iter() {
    untracked.extend(batch);
}

Ok(untracked)

}
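
The batching pattern above, where each worker accumulates into a thread-local `Vec` and sends one message per worker rather than one per file, can be sketched with std threads alone. This is a self-contained illustration; `collect_batched` and the even-number filter are stand-ins, not code from the Turborepo tree:

```rust
use std::sync::mpsc;
use std::thread;

// Each worker collects results locally and sends a single batch, so the
// channel (and any downstream consumer) sees one message per worker
// rather than one per item — no shared lock is touched per item.
fn collect_batched(items: Vec<u32>, workers: usize) -> Vec<u32> {
    let (tx, rx) = mpsc::channel::<Vec<u32>>();
    let chunk_size = items.len().div_ceil(workers.max(1)).max(1);
    let chunks: Vec<Vec<u32>> = items.chunks(chunk_size).map(|c| c.to_vec()).collect();

    let handles: Vec<_> = chunks
        .into_iter()
        .map(|chunk| {
            let tx = tx.clone();
            thread::spawn(move || {
                // Thread-local accumulation, then one batched send.
                let batch: Vec<u32> =
                    chunk.into_iter().filter(|n| n % 2 == 0).collect();
                if !batch.is_empty() {
                    let _ = tx.send(batch);
                }
            })
        })
        .collect();

    // Drop the original sender so rx.iter() terminates once every
    // worker-held clone has been dropped.
    drop(tx);
    let mut out: Vec<u32> = rx.iter().flatten().collect();
    for handle in handles {
        let _ = handle.join();
    }
    out.sort_unstable();
    out
}

fn main() {
    let evens = collect_batched((1..=10).collect(), 3);
    assert_eq!(evens, vec![2, 4, 6, 8, 10]);
    println!("{evens:?}");
}
```

The `FlushOnDrop` guard in the real function plays the role of the final batched `send` here: it guarantees a worker's buffer is flushed even if the walker callback is dropped early.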

#[cfg(feature = "gix")]
enum EntryClassification {
    Clean {
        path: RelativeUnixPathBuf,
        oid: String,
    },
    Modified {
        path: RelativeUnixPathBuf,
    },
    Deleted {
        path: RelativeUnixPathBuf,
    },
}

#[cfg(test)]
mod tests {
use std::collections::BTreeMap;

use turbopath::RelativeUnixPathBuf;

use super::*;

fn path(s: &str) -> RelativeUnixPathBuf {
    RelativeUnixPathBuf::new(s).unwrap()
}

fn make_index(ls_tree: Vec<(&str, &str)>, status: Vec<(&str, bool)>) -> RepoGitIndex {
    let mut ls_tree_hashes: SortedGitHashes = ls_tree
        .into_iter()
        .map(|(p, h)| (path(p), h.to_string()))
        .collect::<Vec<_>>();
    ls_tree_hashes.sort_by(|(a, _), (b, _)| a.cmp(b));
    let mut status_entries: Vec<RepoStatusEntry> = status
        .into_iter()
        .map(|(p, is_delete)| RepoStatusEntry {
            path: path(p),
            is_delete,
        })
        .collect();
    status_entries.sort_by(|a, b| a.path.cmp(&b.path));
    RepoGitIndex {
        ls_tree_hashes,
        status_entries,
    }
}

#[test]
fn test_empty_prefix_returns_all_files() {
    let index = make_index(
        vec![
            ("apps/web/src/index.ts", "aaa"),
            ("packages/ui/button.tsx", "bbb"),
            ("root-file.json", "ccc"),
        ],
        vec![],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("")).unwrap();
    assert_eq!(hashes.len(), 3);
    assert!(to_hash.is_empty());
}

#[test]
fn test_prefix_filters_to_package_and_strips_prefix() {
    let index = make_index(
        vec![
            ("apps/web/src/index.ts", "aaa"),
            ("apps/web/package.json", "bbb"),
            ("apps/docs/README.md", "ccc"),
            ("packages/ui/button.tsx", "ddd"),
        ],
        vec![],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("apps/web")).unwrap();
    assert_eq!(hashes.len(), 2);
    assert_eq!(hashes.get(&path("src/index.ts")).unwrap(), "aaa");
    assert_eq!(hashes.get(&path("package.json")).unwrap(), "bbb");
    assert!(to_hash.is_empty());
}

#[test]
fn test_prefix_does_not_match_sibling_with_shared_prefix() {
    let index = make_index(
        vec![
            ("apps/web/index.ts", "aaa"),
            ("apps/web-admin/index.ts", "bbb"),
        ],
        vec![],
    );
    let (hashes, _) = index.get_package_hashes(&path("apps/web")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert!(hashes.contains_key(&path("index.ts")));
}

#[test]
fn test_status_modified_file_added_to_to_hash() {
    let index = make_index(
        vec![("my-pkg/file.ts", "aaa")],
        vec![("my-pkg/new-file.ts", false)],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("my-pkg")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert_eq!(to_hash, vec![path("my-pkg/new-file.ts")]);
}

#[test]
fn test_status_deleted_file_removed_from_hashes() {
    let index = make_index(
        vec![("my-pkg/keep.ts", "aaa"), ("my-pkg/deleted.ts", "bbb")],
        vec![("my-pkg/deleted.ts", true)],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("my-pkg")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert!(hashes.contains_key(&path("keep.ts")));
    assert!(to_hash.is_empty());
}

#[test]
fn test_status_entries_for_other_packages_ignored() {
    let index = make_index(
        vec![("pkg-a/file.ts", "aaa")],
        vec![("pkg-b/new.ts", false), ("pkg-b/gone.ts", true)],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("pkg-a")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert!(to_hash.is_empty());
}

#[test]
fn test_empty_prefix_with_status() {
    let index = make_index(
        vec![("file.ts", "aaa")],
        vec![("new.ts", false), ("file.ts", true)],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("")).unwrap();
    assert!(hashes.is_empty());
    assert_eq!(to_hash, vec![path("new.ts")]);
}

#[test]
fn test_sorted_status_binary_search_matches_linear_scan() {
    let status = vec![
        ("apps/docs/new.ts", false),
        ("apps/web/changed.ts", false),
        ("apps/web-admin/added.ts", false),
        ("apps/web/deleted.ts", true),
        ("packages/ui/modified.ts", false),
        ("root-new.ts", false),
    ];
    let index = make_index(
        vec![
            ("apps/docs/index.ts", "aaa"),
            ("apps/web/index.ts", "bbb"),
            ("apps/web/deleted.ts", "ccc"),
            ("apps/web-admin/index.ts", "ddd"),
            ("packages/ui/button.tsx", "eee"),
        ],
        status,
    );

    let (hashes, to_hash) = index.get_package_hashes(&path("apps/web")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert!(hashes.contains_key(&path("index.ts")));
    assert_eq!(to_hash, vec![path("apps/web/changed.ts")]);

    let (_, to_hash) = index.get_package_hashes(&path("apps/web-admin")).unwrap();
    assert_eq!(to_hash, vec![path("apps/web-admin/added.ts")]);

    let (_, to_hash) = index.get_package_hashes(&path("")).unwrap();
    assert_eq!(to_hash.len(), 5);
}

#[test]
fn test_range_query_equivalence_with_binary_search() {
    let ls_tree_data = vec![
        ("apps/docs/README.md", "aaa"),
        ("apps/docs/package.json", "bbb"),
        ("apps/web-admin/index.ts", "ccc"),
        ("apps/web/package.json", "ddd"),
        ("apps/web/src/index.ts", "eee"),
        ("apps/web/src/utils.ts", "fff"),
        ("packages/ui/button.tsx", "ggg"),
        ("packages/ui/package.json", "hhh"),
        ("root.json", "iii"),
    ];
    let index = make_index(ls_tree_data.clone(), vec![]);

    let (hashes, _) = index.get_package_hashes(&path("apps/web")).unwrap();
    assert_eq!(hashes.len(), 3);

    let (hashes, _) = index.get_package_hashes(&path("apps/docs")).unwrap();
    assert_eq!(hashes.len(), 2);

    let (hashes, _) = index.get_package_hashes(&path("packages/ui")).unwrap();
    assert_eq!(hashes.len(), 2);

    let (hashes, _) = index.get_package_hashes(&path("nonexistent")).unwrap();
    assert_eq!(hashes.len(), 0);

    let sorted_vec: Vec<(RelativeUnixPathBuf, String)> = ls_tree_data
        .iter()
        .map(|(p, h)| (path(p), h.to_string()))
        .collect();
    assert!(sorted_vec.windows(2).all(|w| w[0].0 < w[1].0));

    let prefix = "apps/web";
    let range_start = path(&format!("{prefix}/"));
    let range_end = path(&format!("{prefix}0"));
    let lo = sorted_vec.partition_point(|(k, _)| *k < range_start);
    let hi = sorted_vec.partition_point(|(k, _)| *k < range_end);
    let vec_results: Vec<_> = sorted_vec[lo..hi]
        .iter()
        .map(|(p, h)| (p.clone(), h.clone()))
        .collect();

    let btree: BTreeMap<RelativeUnixPathBuf, String> = ls_tree_data
        .iter()
        .map(|(p, h)| (path(p), h.to_string()))
        .collect();
    let btree_results: Vec<_> = btree
        .range(range_start..range_end)
        .map(|(p, h)| (p.clone(), h.clone()))
        .collect();
    assert_eq!(vec_results, btree_results);
}

#[test]
fn test_full_copy_preserves_all_entries() {
    let ls_tree_data = vec![("a.ts", "111"), ("b/c.ts", "222"), ("d/e/f.ts", "333")];
    let index = make_index(ls_tree_data, vec![]);
    let (hashes, to_hash) = index.get_package_hashes(&path("")).unwrap();
    assert_eq!(hashes.len(), 3);
    assert_eq!(hashes.get(&path("a.ts")).unwrap(), "111");
    assert!(to_hash.is_empty());
}

#[test]
fn test_status_binary_search_matches_linear_scan() {
    let index = make_index(
        vec![
            ("apps/docs/README.md", "aaa"),
            ("apps/web-admin/index.ts", "bbb"),
            ("apps/web/index.ts", "ccc"),
            ("apps/web/lib.ts", "ddd"),
            ("packages/ui/button.tsx", "eee"),
        ],
        vec![
            ("apps/docs/new-doc.md", false),
            ("apps/web-admin/deleted.ts", true),
            ("apps/web/dirty.ts", false),
            ("apps/web/index.ts", true),
            ("packages/ui/new-component.tsx", false),
            ("root-level-file.ts", false),
        ],
    );

    let (hashes, to_hash) = index.get_package_hashes(&path("apps/web")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert_eq!(to_hash, vec![path("apps/web/dirty.ts")]);

    let (hashes, to_hash) = index.get_package_hashes(&path("apps/web-admin")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert!(to_hash.is_empty());

    let (hashes, to_hash) = index.get_package_hashes(&path("apps/docs")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert_eq!(to_hash, vec![path("apps/docs/new-doc.md")]);

    let (hashes, to_hash) = index.get_package_hashes(&path("packages/ui")).unwrap();
    assert_eq!(hashes.len(), 1);
    assert_eq!(to_hash, vec![path("packages/ui/new-component.tsx")]);

    let (hashes, to_hash) = index.get_package_hashes(&path("")).unwrap();
    assert_eq!(hashes.len(), 4);
    assert_eq!(to_hash.len(), 4);
}

#[test]
fn test_status_substring_prefix_not_matched() {
    let index = make_index(
        vec![("pkg/file.ts", "aaa"), ("pkg-extra/file.ts", "bbb")],
        vec![("pkg-extra/dirty.ts", false), ("pkg/dirty.ts", false)],
    );

    let (_, to_hash) = index.get_package_hashes(&path("pkg")).unwrap();
    assert_eq!(to_hash, vec![path("pkg/dirty.ts")]);

    let (_, to_hash) = index.get_package_hashes(&path("pkg-extra")).unwrap();
    assert_eq!(to_hash, vec![path("pkg-extra/dirty.ts")]);
}

#[test]
fn test_status_binary_search_empty_status() {
    let index = make_index(vec![("pkg/a.ts", "aaa"), ("pkg/b.ts", "bbb")], vec![]);
    let (hashes, to_hash) = index.get_package_hashes(&path("pkg")).unwrap();
    assert_eq!(hashes.len(), 2);
    assert!(to_hash.is_empty());
}

#[test]
fn test_status_all_entries_same_package() {
    let index = make_index(
        vec![("pkg/a.ts", "aaa")],
        vec![("pkg/b.ts", false), ("pkg/c.ts", false), ("pkg/a.ts", true)],
    );
    let (hashes, to_hash) = index.get_package_hashes(&path("pkg")).unwrap();
    assert!(hashes.is_empty(), "a.ts was deleted");
    assert_eq!(to_hash, vec![path("pkg/b.ts"), path("pkg/c.ts")]);
}

}

===== END FILE: crates/turborepo-scm/src/repo_index.rs =====

===== FILE: crates/turborepo-scm/src/manual.rs =====

// This module doesn't require git2, but it is only used by modules that require
// git2.
#![cfg(feature = "git2")]
use std::{
    collections::HashSet,
    io::{ErrorKind, Read},
};

use globwalk::fix_glob_pattern;
use hex::ToHex;
use ignore::WalkBuilder;
use sha1::{Digest, Sha1};
use turbopath::{AbsoluteSystemPath, AnchoredSystemPath, IntoUnix};
use wax::{Glob, Program, any};

use crate::{Error, GitHashes};

fn git_like_hash_file(path: &AbsoluteSystemPath) -> Result<String, Error> {
    let mut hasher = Sha1::new();
    let mut f = path.open()?;
    // Pre-allocate the buffer based on file metadata to avoid repeated
    // reallocations during read_to_end. The +1 accounts for read_to_end's
    // probe read that confirms EOF.
    let estimated_size = f.metadata().map(|m| m.len() as usize + 1).unwrap_or(0);
    let mut buffer = Vec::with_capacity(estimated_size);
    // Note that read_to_end reads the target if f is a symlink. Currently, this can
    // happen when we are hashing a specific set of files, which in turn only
    // happens for handling dotEnv files. It is likely that in the future we
    // will want to ensure that the target is better accounted for in the set of
    // inputs to the task. Manual hashing, as well as global deps and other
    // places that support globs all ignore symlinks.
    let size = f.read_to_end(&mut buffer)?;
    hasher.update("blob ".as_bytes());
    hasher.update(size.to_string().as_bytes());
    hasher.update([b'\0']);
    hasher.update(buffer.as_slice());
    let result = hasher.finalize();
    Ok(result.encode_hex::<String>())
}

fn to_glob(input: &str) -> Result<Glob<'_>, Error> {
    let glob = fix_glob_pattern(input).into_unix();
    let g = Glob::new(glob.as_str()).map(|g| g.into_owned())?;

    Ok(g)
}

pub(crate) fn hash_files(
    root_path: &AbsoluteSystemPath,
    files: impl Iterator<Item = impl AsRef<AnchoredSystemPath>>,
    allow_missing: bool,
) -> Result<GitHashes, Error> {
    let mut hashes = GitHashes::new();
    for file in files.into_iter() {
        let path = root_path.resolve(file.as_ref());
        match git_like_hash_file(&path) {
            Ok(hash) => hashes.insert(file.as_ref().to_unix(), hash),
            Err(Error::Io(ref io_error, _))
                if allow_missing && io_error.kind() == ErrorKind::NotFound =>
            {
                continue;
            }
            Err(e) => return Err(e),
        };
    }
    Ok(hashes)
}

pub(crate) fn get_package_file_hashes_without_git<S: AsRef<str>>(
    turbo_root: &AbsoluteSystemPath,
    package_path: &AnchoredSystemPath,
    inputs: &[S],
    include_default_files: bool,
) -> Result<GitHashes, Error> {
let full_package_path = turbo_root.resolve(package_path);
let mut hashes = GitHashes::new();
let mut default_file_hashes = GitHashes::new();
let mut excluded_file_paths = HashSet::new();

let mut walker_builder = WalkBuilder::new(&full_package_path);
let mut includes = Vec::new();
let mut excludes = Vec::new();
for pattern in inputs {
    let pattern = pattern.as_ref();
    if let Some(exclusion) = pattern.strip_prefix('!') {
        let g = to_glob(exclusion)?;
        excludes.push(g);
    } else {
        let g = to_glob(pattern)?;
        includes.push(g);
    }
}
let include_pattern = if includes.is_empty() {
    None
} else {
    // Add in package.json and turbo.json to input patterns. Both file paths are
    // relative to pkgPath
    //
    // - package.json is an input because if the `scripts` in the package.json
    //   change (i.e. the tasks that turbo executes), we want a cache miss, since
    //   any existing cache could be invalid.
    // - turbo.json because it's the definition of the tasks themselves. The root
    //   turbo.json is similarly included in the global hash. This file may not
    //   exist in the workspace, but that is ok, because it will get ignored
    //   downstream.
    let turbo_g = to_glob("package.json")?;
    let package_g = to_glob("turbo.json")?;
    includes.push(turbo_g);
    includes.push(package_g);

    Some(any(includes)?)
};
let exclude_pattern = if excludes.is_empty() {
    None
} else {
    Some(any(excludes)?)
};

let walker = walker_builder
    .follow_links(false)
    // if inputs have been provided manually, we shouldn't skip ignored files to mimic the
    // regular behavior
    .git_ignore(inputs.is_empty())
    .require_git(false)
    .hidden(false) // this results in yielding hidden files (e.g. .gitignore)
    .build();

for dirent in walker {
    let dirent = dirent?;
    let metadata = dirent.metadata()?;
    // We need to do this here, rather than as a filter, because the root
    // directory is always yielded and not subject to the supplied filter.
    if metadata.is_dir() {
        continue;
    }

    let path = AbsoluteSystemPath::from_std_path(dirent.path())?;
    let relative_path = full_package_path.anchor(path)?;
    let relative_path = relative_path.to_unix();

    // if we have includes, and this path doesn't match any of them, skip it
    if let Some(include_pattern) = include_pattern.as_ref()
        && !include_pattern.is_match(relative_path.as_str())
    {
        continue;
    }

    // if we have excludes, and this path matches one of them, skip it
    if let Some(exclude_pattern) = exclude_pattern.as_ref()
        && exclude_pattern.is_match(relative_path.as_str())
    {
        continue;
    }

    // FIXME: we don't hash symlinks...
    if metadata.is_symlink() {
        continue;
    }
    let hash = git_like_hash_file(path)?;
    hashes.insert(relative_path, hash);
}

// If we're including default files, we need to walk again, but this time with
// git_ignore enabled
if include_default_files {
    let walker = walker_builder
        .follow_links(false)
        .git_ignore(true)
        .require_git(false)
        .hidden(false) // this results in yielding hidden files (e.g. .gitignore)
        .build();

    for dirent in walker {
        let dirent = dirent?;
        let metadata = dirent.metadata()?;
        // We need to do this here, rather than as a filter, because the root
        // directory is always yielded and not subject to the supplied filter.
        if metadata.is_dir() || metadata.is_symlink() {
            continue;
        }

        let path = AbsoluteSystemPath::from_std_path(dirent.path())?;
        let relative_path = full_package_path.anchor(path)?;
        let relative_path = relative_path.to_unix();

        if let Some(exclude_pattern) = exclude_pattern.as_ref()
            && exclude_pattern.is_match(relative_path.as_str())
        {
            // Track excluded paths — no need to hash since we only use the
            // path for filtering.
            excluded_file_paths.insert(relative_path);
            continue;
        }

        // Skip files already hashed in the first walk to avoid redundant I/O.
        if hashes.contains_key(&relative_path) {
            continue;
        }

        let hash = git_like_hash_file(path)?;
        default_file_hashes.insert(relative_path, hash);
    }
}

// merge default with all hashes
hashes.extend(default_file_hashes);
// remove excluded files
if !excluded_file_paths.is_empty() {
    hashes.retain(|key, _| !excluded_file_paths.contains(key));
}

Ok(hashes)

}

#[cfg(test)]
mod tests {
use std::assert_matches::assert_matches;

use test_case::test_case;
use turbopath::{
    AbsoluteSystemPathBuf, AnchoredSystemPathBuf, RelativeUnixPath, RelativeUnixPathBuf,
};

use super::*;

fn tmp_dir() -> (tempfile::TempDir, AbsoluteSystemPathBuf) {
    let tmp_dir = tempfile::tempdir().unwrap();
    let dir = AbsoluteSystemPathBuf::try_from(tmp_dir.path().to_path_buf())
        .unwrap()
        .to_realpath()
        .unwrap();
    (tmp_dir, dir)
}

#[test_case(&["non-existent-file.txt"], true, false ; "allow_missing, all missing")]
#[test_case(&["non-existent-file.txt", "existing-file.txt"], true, false ; "allow_missing, some missing, some not")]
#[test_case(&["existing-file.txt"], true, false ; "allow_missing, none missing")]
#[test_case(&["non-existent-file.txt"], false, true ; "don't allow_missing, all missing")]
#[test_case(&["non-existent-file.txt", "existing-file.txt"], false, true ; "don't allow_missing, some missing, some not")]
#[test_case(&["existing-file.txt"], false, false ; "don't allow_missing, none missing")]
fn test_hash_files(files: &[&str], allow_missing: bool, want_err: bool) {
    let (_tmp, turbo_root) = tmp_dir();
    let test_file = turbo_root.join_component("existing-file.txt");
    test_file.create_with_contents("").unwrap();

    let expected = {
        let mut expected = GitHashes::new();
        if files.contains(&"existing-file.txt") {
            expected.insert(
                RelativeUnixPathBuf::new("existing-file.txt").unwrap(),
                "e69de29bb2d1d6434b8b29ae775ad8c2e48c5391".to_string(),
            );
        }
        expected
    };

    let files = files
        .iter()
        .map(|s| AnchoredSystemPathBuf::from_raw(s).unwrap());
    match hash_files(&turbo_root, files, allow_missing) {
        Err(e) => assert!(want_err, "unexpected error {e}"),
        Ok(hashes) => assert_eq!(hashes, expected),
    }
}

#[test]
fn test_hash_symlink() {
    let (_tmp, turbo_root) = tmp_dir();
    let from_to_file = turbo_root.join_component("symlink-from-to-file");
    let from_to_dir = turbo_root.join_component("symlink-from-to-dir");
    let broken = turbo_root.join_component("symlink-broken");

    let to_file = turbo_root.join_component("the-file-target");
    to_file.create_with_contents("contents").unwrap();

    let to_dir = turbo_root.join_component("the-dir-target");
    to_dir.create_dir_all().unwrap();

    from_to_file.symlink_to_file(to_file.to_string()).unwrap();
    from_to_dir.symlink_to_dir(to_dir.to_string()).unwrap();
    broken.symlink_to_file("does-not-exist").unwrap();

    // Symlink to file.
    let out = hash_files(
        &turbo_root,
        [AnchoredSystemPathBuf::from_raw("symlink-from-to-file").unwrap()].iter(),
        true,
    )
    .unwrap();
    let from_to_file_hash = out
        .get(&RelativeUnixPathBuf::new("symlink-from-to-file").unwrap())
        .unwrap();
    assert_eq!(
        from_to_file_hash,
        "0839b2e9412b314cb8bb9a20f587aa13752ae310"
    );

    // Symlink to dir, allow_missing = true.
    #[cfg(not(windows))]
    {
        let out = hash_files(
            &turbo_root,
            [AnchoredSystemPathBuf::from_raw("symlink-from-to-dir").unwrap()].iter(),
            true,
        );
        match out.err().unwrap() {
            Error::Io(io_error, _) => assert_eq!(io_error.kind(), ErrorKind::IsADirectory),
            _ => panic!("wrong error"),
        };
    }

    // Symlink to dir, allow_missing = false.
    let out = hash_files(
        &turbo_root,
        [AnchoredSystemPathBuf::from_raw("symlink-from-to-dir").unwrap()].iter(),
        false,
    );
    #[cfg(windows)]
    let expected_err_kind = ErrorKind::PermissionDenied;
    #[cfg(not(windows))]
    let expected_err_kind = ErrorKind::IsADirectory;
    assert_matches!(out.unwrap_err(), Error::Io(io_error, _) if io_error.kind() == expected_err_kind);

    // Broken symlink with allow_missing = true.
    let out = hash_files(
        &turbo_root,
        [AnchoredSystemPathBuf::from_raw("symlink-broken").unwrap()].iter(),
        true,
    )
    .unwrap();
    let broken_hash = out.get(&RelativeUnixPathBuf::new("symlink-broken").unwrap());
    assert_eq!(broken_hash, None);

    // Broken symlink with allow_missing = false.
    let out = hash_files(
        &turbo_root,
        [AnchoredSystemPathBuf::from_raw("symlink-broken").unwrap()].iter(),
        false,
    );
    match out.err().unwrap() {
        Error::Io(io_error, _) => assert_eq!(io_error.kind(), ErrorKind::NotFound),
        _ => panic!("wrong error"),
    };
}

#[test]
fn test_get_package_file_hashes_from_processing_gitignore() {
    let root_ignore_contents = ["ignoreme", "ignorethisdir/"].join("\n");
    let pkg_ignore_contents = ["pkgignoreme", "pkgignorethisdir/"].join("\n");

    let (_tmp, turbo_root) = tmp_dir();

    let pkg_path = AnchoredSystemPathBuf::from_raw("child-dir/libA").unwrap();
    let unix_pkg_path = pkg_path.to_unix();
    let mut file_hash: Vec<(&str, &str, Option<&str>)> = vec![
        ("turbo.json", "turbo.json-file-contents", None),
        ("package.json", "root-package.json-file-contents", None),
        ("top-level-file", "top-level-file-contents", None),
        ("other-dir/other-dir-file", "other-dir-file-contents", None),
        ("ignoreme", "anything", None),
        (
            "child-dir/libA/turbo.json",
            "lib-turbo.json-content",
            Some("ca4dbb95c0829676756c6decae728252d4aa4911"),
        ),
        (
            "child-dir/libA/package.json",
            "lib-package.json-content",
            Some("55d57df9acc1b37d0cfc2c1c70379dab48f3f7e1"),
        ),
        (
            "child-dir/libA/some-file",
            "some-file-contents",
            Some("7e59c6a6ea9098c6d3beb00e753e2c54ea502311"),
        ),
        (
            "child-dir/libA/some-dir/other-file",
            "some-file-contents",
            Some("7e59c6a6ea9098c6d3beb00e753e2c54ea502311"),
        ),
        (
            "child-dir/libA/some-dir/another-one",
            "some-file-contents",
            Some("7e59c6a6ea9098c6d3beb00e753e2c54ea502311"),
        ),
        (
            "child-dir/libA/some-dir/excluded-file",
            "some-file-contents",
            Some("7e59c6a6ea9098c6d3beb00e753e2c54ea502311"),
        ),
        ("child-dir/libA/ignoreme", "anything", None),
        ("child-dir/libA/ignorethisdir/anything", "anything", None),
        ("child-dir/libA/pkgignoreme", "anything", None),
        ("child-dir/libA/pkgignorethisdir/file", "anything", None),
    ];

    let root_ignore_file = turbo_root.join_component(".gitignore");
    root_ignore_file
        .create_with_contents(root_ignore_contents)
        .unwrap();
    let pkg_ignore_file = turbo_root.resolve(&pkg_path).join_component(".gitignore");
    pkg_ignore_file.ensure_dir().unwrap();
    pkg_ignore_file
        .create_with_contents(pkg_ignore_contents)
        .unwrap();

    let mut expected = GitHashes::new();
    for (raw_unix_path, contents, expected_hash) in file_hash.iter() {
        let unix_path = RelativeUnixPath::new(raw_unix_path).unwrap();
        let file_path = turbo_root.join_unix_path(unix_path);
        file_path.ensure_dir().unwrap();

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Now I have a complete picture. Here's the comprehensive performance analysis:


Current State

Turborepo's Rust codebase has seen active performance work recently (parallel task hashing, shallow wildcard fast paths, tracing spans), but 12 distinct optimization opportunities remain across the critical turbo run hot path. The codebase has excellent profiling infrastructure (--profile emits Chrome Trace Format, turborepo-profile-md generates summaries) but no formal benchmark suite (no criterion/bencher). Testing runs via cargo nextest in CI with coverage tracking.

The hot path for every turbo run is roughly:

  1. Package discovery (sequential turbo.json stats)
  2. Lockfile transitive closures (redundant DFS walks)
  3. Engine/graph construction (serial config loading)
  4. Task hashing (RwLock contention, regex recompilation, env cloning)
  5. Task execution (EnvironmentVariableMap double-clones)
  6. Cache archiving (single-threaded zstd)

Required Changes

Tier 1 — High Impact, Low Risk

1. Concurrent turbo.json discovery (discovery.rs:157) Replace .then() (sequential) with .map().buffer_unordered(64) for tokio::fs::try_exists calls. Every non-daemon turbo run blocks on this.

// Before (sequential):
iter(package_paths).then(|path| async move { ... }).collect()

// After (concurrent):
iter(package_paths)
    .map(|path| async move { ... })
    .buffer_unordered(64)
    .collect()

Verify: turbo run build --profile=before.json on a 100+ package monorepo, compare discover_packages span duration before/after. Run cargo nextest run -p turborepo-repository to confirm no regressions.


2. Wrap env_at_execution_start in Arc to eliminate loose-env clones (task-hash/src/lib.rs:522)

EnvMode::Loose (the default) deep-clones the entire process env HashMap per task:

EnvMode::Loose => Ok(self.env_at_execution_start.clone()),  // line 522

Change env_at_execution_start from &'a EnvironmentVariableMap to Arc<EnvironmentVariableMap> in TaskHasher, and return Arc::clone() instead. Affects visitor/mod.rs:130 and run/mod.rs where it's threaded through.

Verify: Profile a 200-task run — the env span's allocation count should drop to near-zero. Run cargo nextest run -p turborepo-task-hash -p turborepo-lib.
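A minimal std-only sketch of the ownership change. EnvironmentVariableMap is modeled here as a plain HashMap<String, String> (an assumed shape for illustration, not the real type), and the point is that Loose mode hands out a refcount bump instead of deep-copying every key/value pair:

```rust
use std::{collections::HashMap, sync::Arc};

// Stand-in for turborepo's EnvironmentVariableMap (assumed shape).
type EnvironmentVariableMap = HashMap<String, String>;

struct TaskHasher {
    // Before: env_at_execution_start: &'a EnvironmentVariableMap
    // After: shared ownership via Arc.
    env_at_execution_start: Arc<EnvironmentVariableMap>,
}

impl TaskHasher {
    // Loose-mode lookup: Arc::clone bumps a refcount instead of
    // deep-copying the entire process environment per task.
    fn loose_env(&self) -> Arc<EnvironmentVariableMap> {
        Arc::clone(&self.env_at_execution_start)
    }
}

fn main() {
    let mut env = EnvironmentVariableMap::new();
    env.insert("PATH".into(), "/usr/bin".into());
    let hasher = TaskHasher {
        env_at_execution_start: Arc::new(env),
    };

    let a = hasher.loose_env();
    let b = hasher.loose_env();
    // Both handles point at the same allocation: no per-task deep clone.
    assert!(Arc::ptr_eq(&a, &b));
    assert_eq!(a.get("PATH").map(String::as_str), Some("/usr/bin"));
}
```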


3. Pre-compile task env regex patterns (turborepo-env/src/lib.rs:354-383)

hashable_task_env calls wildcard_map_from_wildcards, which recompiles its wildcard regexes on every task. The CompiledWildcards type already exists for the pass-through env path — extend it to cover the task env path:

// In TaskHasher::new(), pre-compile common task env patterns:
let compiled_task_envs: HashMap<Vec<String>, CompiledWildcards> = ...;

Verify: cargo nextest run -p turborepo-env. Profile with --profile and check hashable_task_env span drops proportionally.


4. Split TaskHashTracker RwLock into per-field locks (task-hash/src/lib.rs:177-192)

Single RwLock over 6 HashMaps serializes all writes across parallel rayon task-hash computation:

// Before:
struct TaskHashTracker { state: Arc<RwLock<TaskHashTrackerState>> }

// After: use DashMap or individual RwLock per map
struct TaskHashTrackerState {
    package_task_hashes: DashMap<TaskId<'static>, Arc<str>>,
    package_task_env_vars: DashMap<TaskId<'static>, DetailedMap>,
    // ... etc
}

Verify: The existing tests at test_hash_tracker_concurrent_read_write validate correctness. Profile parallel task hash computation — contention should disappear in the trace.
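A std-only sketch of the per-field split (a DashMap would additionally shard within each map). Field names mirror the snippet above; the String value types are simplified stand-ins, not the real tracker types:

```rust
use std::{collections::HashMap, sync::RwLock};

// One lock per independent map, so a hash write no longer blocks
// readers of the other five maps.
#[derive(Default)]
struct TaskHashTrackerState {
    package_task_hashes: RwLock<HashMap<String, String>>,
    package_task_framework: RwLock<HashMap<String, String>>,
}

impl TaskHashTrackerState {
    fn insert_hash(&self, task: &str, hash: &str) {
        self.package_task_hashes
            .write()
            .unwrap()
            .insert(task.to_owned(), hash.to_owned());
    }

    // Reading framework info only touches its own lock.
    fn framework(&self, task: &str) -> Option<String> {
        self.package_task_framework
            .read()
            .unwrap()
            .get(task)
            .cloned()
    }
}

fn main() {
    let state = TaskHashTrackerState::default();
    state.insert_hash("web#build", "abc123");
    // The framework map is untouched by hash writes.
    assert_eq!(state.framework("web#build"), None);
    let hashes = state.package_task_hashes.read().unwrap();
    assert_eq!(hashes.get("web#build").map(String::as_str), Some("abc123"));
}
```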


Tier 2 — Medium Impact, Medium Risk

5. Enable multi-threaded zstd compression (cache_archive/create.rs:115,151)

Currently single-threaded at level 0:

let zw = zstd::Encoder::new(writer, 0)?.auto_finish();

Add .multithread(num_cpus::get() as u32)? or at minimum .multithread(4)?. This directly reduces wall-clock time for every cache-miss task writing large output archives.

Verify: Time cache-miss builds with large outputs (Next.js .next/ directory). Compare archive creation span duration. Run cargo nextest run -p turborepo-cache.


6. Cache compiled glob patterns (turborepo-globwalk/src/lib.rs:539-566)

compile_globs() recompiles the same dist/** pattern 100+ times in a 100-package monorepo. Add a HashMap<Vec<String>, Arc<CompiledGlobs>> cache keyed by the pattern list.

Verify: Profile calculate_file_hashes span on a large monorepo — glob compilation time should become negligible. Run cargo nextest run -p turborepo-globwalk.
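A sketch of the cache shape, assuming the compiled form can be shared behind an Arc. CompiledGlobs and GlobCache are illustrative names, and the "compile" step here is a stand-in for the real regex-building work:

```rust
use std::{
    collections::HashMap,
    sync::{Arc, Mutex},
};

// Hypothetical stand-in for a compiled glob set.
struct CompiledGlobs {
    patterns: Vec<String>,
}

// Cache keyed by the raw pattern list: the expensive compile runs once
// per distinct pattern set instead of once per package.
#[derive(Default)]
struct GlobCache {
    inner: Mutex<HashMap<Vec<String>, Arc<CompiledGlobs>>>,
}

impl GlobCache {
    fn get_or_compile(&self, patterns: &[String]) -> Arc<CompiledGlobs> {
        let mut map = self.inner.lock().unwrap();
        if let Some(hit) = map.get(patterns) {
            return Arc::clone(hit);
        }
        // Stand-in for the real compilation step.
        let compiled = Arc::new(CompiledGlobs {
            patterns: patterns.to_vec(),
        });
        map.insert(patterns.to_vec(), Arc::clone(&compiled));
        compiled
    }
}

fn main() {
    let cache = GlobCache::default();
    let patterns = vec!["dist/**".to_string()];
    let a = cache.get_or_compile(&patterns);
    let b = cache.get_or_compile(&patterns);
    // Second lookup is a cache hit: same compiled object, no recompilation.
    assert!(Arc::ptr_eq(&a, &b));
    assert_eq!(a.patterns, patterns);
}
```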


7. Reduce EnvironmentVariableMap double-clone in visitor (visitor/mod.rs:287,388)

The task visitor clones (String, EnvironmentVariableMap) once into the precomputed map (line 287), then clones again on retrieval (line 388). Use Arc<EnvironmentVariableMap> or .remove() instead of .get().clone().

Verify: cargo nextest run -p turborepo-lib. Profile allocation counts in the execute_task spans.
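The .remove() half of this change can be sketched in isolation (the key and value types here are simplified stand-ins for the visitor's precomputed map). Since each entry is consumed exactly once when its task executes, ownership can move out instead of being cloned out:

```rust
use std::collections::HashMap;

// Before: precomputed.get(task_id).cloned() deep-copies every pair.
// After: move the value out; the entry is dead after dispatch anyway.
fn take_task_env(
    precomputed: &mut HashMap<String, Vec<(String, String)>>,
    task_id: &str,
) -> Option<Vec<(String, String)>> {
    precomputed.remove(task_id)
}

fn main() {
    let mut precomputed = HashMap::new();
    precomputed.insert(
        "web#build".to_string(),
        vec![("CI".to_string(), "true".to_string())],
    );
    let env = take_task_env(&mut precomputed, "web#build");
    assert_eq!(env, Some(vec![("CI".to_string(), "true".to_string())]));
    // The entry was consumed: no second copy ever existed.
    assert_eq!(take_task_env(&mut precomputed, "web#build"), None);
}
```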


8. Use trie/prefix-tree for file-to-package detection (change_mapper/package.rs:63-81)

detect_package does O(N) linear scan over all packages per file. Build a path-prefix trie at startup for O(depth) lookups:

// At startup:
let pkg_trie = PathTrie::from(packages.iter().map(|(name, info)| (info.path(), name)));
// At query time:
let package = pkg_trie.longest_prefix_match(file);  // O(path_depth)

Verify: Measure --affected and watch-mode response times on 500+ package repos. Run cargo nextest run -p turborepo-repository.
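A minimal sketch of the trie, assuming package paths are /-separated strings. PathTrie and longest_prefix_match are the illustrative names from the snippet above, not existing turborepo APIs:

```rust
use std::collections::HashMap;

// One node per path component; a node carries a package name if a
// package root ends at that component.
#[derive(Default)]
struct PathTrie {
    children: HashMap<String, PathTrie>,
    package: Option<String>,
}

impl PathTrie {
    fn insert(&mut self, pkg_path: &str, package: &str) {
        let mut node = self;
        for comp in pkg_path.split('/').filter(|c| !c.is_empty()) {
            node = node.children.entry(comp.to_owned()).or_default();
        }
        node.package = Some(package.to_owned());
    }

    // Walk the file's components, remembering the deepest package seen:
    // O(path depth) instead of O(number of packages).
    fn longest_prefix_match(&self, file: &str) -> Option<&str> {
        let mut node = self;
        let mut best = node.package.as_deref();
        for comp in file.split('/').filter(|c| !c.is_empty()) {
            match node.children.get(comp) {
                Some(child) => {
                    node = child;
                    if node.package.is_some() {
                        best = node.package.as_deref();
                    }
                }
                None => break,
            }
        }
        best
    }
}

fn main() {
    let mut trie = PathTrie::default();
    trie.insert("apps/web", "web");
    trie.insert("packages/ui", "ui");
    assert_eq!(trie.longest_prefix_match("apps/web/src/page.tsx"), Some("web"));
    assert_eq!(trie.longest_prefix_match("packages/ui/index.ts"), Some("ui"));
    assert_eq!(trie.longest_prefix_match("README.md"), None);
}
```

Nested packages fall out naturally: the deepest matching node wins, which is the semantics a linear scan needs extra care to get right.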


9. Optimize all_transitive_closures DashMap usage (lockfiles/src/lib.rs:130-149)

  • Pre-size the DashMap with estimated capacity
  • Use entry() API instead of get() + insert() to avoid double-lookup
  • Wrap Package in Arc to eliminate clones during cache hits

Verify: cargo nextest run -p turborepo-lockfiles. Profile the all_transitive_closures span on repos with 1000+ transitive deps.
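The entry()-plus-Arc pattern can be sketched with a plain std HashMap (DashMap's entry API has the same shape). Package and closure_entry are illustrative stand-ins for the lockfile types:

```rust
use std::{collections::HashMap, sync::Arc};

// Stand-in for a lockfile Package entry.
struct Package {
    key: String,
}

// entry() resolves hit-or-insert with a single hash lookup, and Arc lets
// cache hits hand back a pointer copy instead of cloning the Package.
fn closure_entry(
    cache: &mut HashMap<String, Arc<Package>>,
    key: &str,
) -> Arc<Package> {
    Arc::clone(
        cache
            .entry(key.to_owned())
            .or_insert_with(|| Arc::new(Package { key: key.to_owned() })),
    )
}

fn main() {
    // Pre-size with an estimated capacity so the closure walk doesn't rehash.
    let mut cache: HashMap<String, Arc<Package>> = HashMap::with_capacity(1024);
    let a = closure_entry(&mut cache, "lodash@4.17.21");
    let b = closure_entry(&mut cache, "lodash@4.17.21");
    // Cache hit returns a pointer copy of the same Package, not a clone.
    assert!(Arc::ptr_eq(&a, &b));
    assert_eq!(a.key, "lodash@4.17.21");
    assert_eq!(cache.len(), 1);
}
```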


Tier 3 — Lower Impact, Targeted Wins

10. Parallelize engine builder config loading (engine/src/builder.rs:605-615)

The BFS loads turbo.json configs serially per workspace. Pre-load all configs in parallel before the BFS walk, or use Arc<Vec<&TurboJson>> in the chain cache to eliminate redundant clones.

Verify: cargo nextest run -p turborepo-engine. Profile EngineBuilder::build span.
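A std-only sketch of the pre-load step, assuming each workspace's config can be loaded independently. load_config and preload_configs are hypothetical stand-ins for the real turbo.json loader, and the real code would likely use tokio or rayon rather than raw threads:

```rust
use std::{collections::HashMap, thread};

// Stand-in for reading and parsing a workspace's turbo.json.
fn load_config(workspace: &str) -> String {
    format!("config for {workspace}")
}

// Pre-load every workspace's config concurrently before the BFS walk,
// so the walk itself only does map lookups in topological order.
fn preload_configs(workspaces: &[&str]) -> HashMap<String, String> {
    thread::scope(|s| {
        let handles: Vec<_> = workspaces
            .iter()
            .map(|ws| s.spawn(move || (ws.to_string(), load_config(ws))))
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    })
}

fn main() {
    let configs = preload_configs(&["web", "docs", "ui"]);
    assert_eq!(configs.len(), 3);
    assert_eq!(configs["web"], "config for web");
}
```

Separating the parallel I/O phase from the ordered BFS phase is what sidesteps the ordering sensitivity noted in the risk table: the walk's order is unchanged, only its blocking reads disappear.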


11. Fix hex encoding allocation in SCM (scm/src/hash_object.rs:69)

.to_string() on stack buffer creates unnecessary heap allocation per file. Use a fixed-size ArrayString<40> or similar inline string to avoid the allocation entirely. This is called once per file per task.

Verify: cargo nextest run -p turborepo-scm. Measure with dhat or similar allocator profiling.
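The same effect is achievable without the arrayvec dependency by encoding into a caller-provided stack buffer and borrowing the result (hex_into is an illustrative helper, not existing code):

```rust
// Encode a 20-byte SHA-1 digest as lowercase hex into a fixed stack
// buffer; the caller borrows the &str, so no per-file heap allocation.
fn hex_into<'a>(digest: &[u8; 20], out: &'a mut [u8; 40]) -> &'a str {
    const HEX: &[u8; 16] = b"0123456789abcdef";
    for (i, byte) in digest.iter().enumerate() {
        out[i * 2] = HEX[(byte >> 4) as usize];
        out[i * 2 + 1] = HEX[(byte & 0x0f) as usize];
    }
    // Every byte written above is ASCII, so this cannot fail.
    std::str::from_utf8(out).unwrap()
}

fn main() {
    let mut buf = [0u8; 40];
    let digest = [0xabu8; 20];
    assert_eq!(hex_into(&digest, &mut buf), "ab".repeat(20));
}
```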


12. Pre-sort GlobSet include keys for hash/eq (filewatch/src/globwatcher.rs:61-66,52-57)

Store a pre-sorted Vec<String> alongside the include HashMap. Derive Hash/Eq from it instead of allocating BTreeSet/HashSet on every call (thousands/minute in watch mode).

Verify: cargo nextest run -p turborepo-filewatch. Measure allocation rates in watch mode with DHAT.


Ripple Effects

  • Arc<EnvironmentVariableMap> (changes 2, 7): Touches TaskHasher, Visitor, Run — the signature change propagates through 3-4 crates. Must verify that serialization for run summaries still works (the DetailedMap exposed through the HashTrackerInfo trait).
  • TaskHashTracker split (change 4): Every consumer of TaskHashTracker (run-summary, task-executor, visitor) accesses state — switching to DashMap changes the API from state.read().package_task_hashes.get() to self.package_task_hashes.get(). The existing concurrent tests (test_hash_tracker_concurrent_read_write) must continue passing.
  • Multi-threaded zstd (change 5): May increase peak memory usage. Ensure CI test runners and user machines have sufficient RAM. The zstd crate's zstdmt feature must be enabled.
  • Trie for change_mapper (change 8): Adds a new data structure dependency. The trie must be rebuilt when the package graph changes (watch mode rediscovery).

What This Unlocks

Implementing all 12 changes targets every phase of turbo run:

| Phase | Changes | Expected Improvement |
| --- | --- | --- |
| Startup / Discovery | 1, 10 | 2-5x faster package discovery on 100+ workspace repos |
| Lockfile Processing | 9 | 20-40% reduction in all_transitive_closures time |
| Task Hashing | 2, 3, 4, 7, 11 | Eliminates per-task regex recompilation, env cloning, lock contention |
| Cache I/O | 5, 6 | 2-4x faster cache-miss archive creation (large outputs) |
| Watch Mode | 8, 12 | O(1) vs O(N) file-to-package detection, fewer allocations per event |

The combined effect on a cold turbo run build in a 200-package monorepo should be measurable via:

# Before/after comparison:
hyperfine --warmup 1 --runs 5 \
  'turbo run build --force --profile=before.json' \
  'turbo run build --force --profile=after.json'

# Detailed span comparison:
turbo run build --force --profile=trace.json
# Open trace.json in chrome://tracing or Perfetto UI

Effort & Risk

| # | Change | Effort | Risk | Files Touched |
| --- | --- | --- | --- | --- |
| 1 | Concurrent discovery | S (1 line) | Low — drop-in replacement | 1 |
| 2 | Arc env_at_execution_start | M (signature threading) | Low — ref-counted sharing | 3-4 |
| 3 | Pre-compile task env regex | M | Low — additive cache | 2 |
| 4 | Split TaskHashTracker locks | M | Medium — API change, concurrent correctness | 3-4 |
| 5 | Multi-threaded zstd | S (1 line + feature flag) | Low — well-tested upstream | 1-2 |
| 6 | Glob pattern cache | M | Low — additive cache | 2 |
| 7 | Visitor env clone reduction | M | Low — swap clone for Arc | 2 |
| 8 | Path-prefix trie | L | Medium — new data structure | 2-3 |
| 9 | DashMap optimization | S | Low — API-level change | 1 |
| 10 | Parallel config loading | M | Medium — ordering sensitivity | 1-2 |
| 11 | Hex encoding inline string | S | Low — localized change | 1 |
| 12 | Pre-sorted GlobSet keys | S | Low — localized change | 1 |

Recommended attack order: 1 → 2 → 5 → 3 → 4 → 9 → 6 → 7 → 11 → 12 → 8 → 10 (sorted by impact/effort ratio, lowest risk first).

Assumptions:

  • "100+ package monorepo" is the target scale — smaller repos will see proportionally smaller gains for items 1, 8, 9
  • EnvMode::Loose is the default mode most users run (confirmed by code inspection)
  • The zstd crate's multi-threading support is available on all target platforms (it is, via zstd-sys)
  • No formal benchmarks exist, so --profile traces and hyperfine are the primary verification tools

Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • engine-builder-parallel-config-loading Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count. Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs Maturity: stable Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks. Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs Maturity: stable Relevance: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win. Key findings: Glob patterns are recompiled from scratch on every globwalk() call with zero caching—in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths, and adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: Per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size. Key findings: Based on the research across all three explorations, here's the highest-leverage untapped area:
  • lockfile-transitive-closure Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs Maturity: stable Relevance: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos. Key findings: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs Maturity: stable Relevance: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. Key findings: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.
  • task-hash-tracker-rwlock-contention Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/task_graph/visitor/mod.rs Maturity: stable Relevance: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave. Key findings: Single RwLock guards five independent HashMaps in TaskHashTracker, causing write-lock contention that serializes parallel rayon task-hash computation across every topological wave.
  • task-graph-visitor-allocations Files: crates/turborepo-lib/src/task_graph/visitor/mod.rs, crates/turborepo-lib/src/task_graph/visitor/exec.rs Maturity: stable Relevance: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run. Key findings: The task visitor's main loop double-clones EnvironmentVariableMap per task and redundantly clones TaskId/info 5+ times per task execution, all in the hottest path of every turbo run.
  • change-mapper-linear-package-detection Files: crates/turborepo-repository/src/change_mapper/package.rs, crates/turborepo-repository/src/change_mapper/mod.rs Maturity: stable Relevance: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages Key findings: O(n×m) file-to-package detection in detect_package() blocks --affected and watch-mode scaling in large monorepos with hundreds of packages
  • discovery-sequential-turbo-json-stat Files: crates/turborepo-repository/src/discovery.rs Maturity: stable Relevance: Local package discovery serializes per-package turbo.json existence checks via .then(), adding O(n) filesystem latency on the critical path of every non-daemon run. Key findings: I confirmed the key finding by reading both files. The strongest candidate is in discovery.rs: At line 157, LocalPackageDiscovery::discover_packages uses tokio_stream::iter(...).then(...) which processes each tokio::fs::try_exists() call sequentially.then() awaits each future before starting the next. For independent filesystem stat calls across hundreds of packages, this serializes I/O that should be concurrent. The file even has the tokio async machinery in place — it just needs .map() + .buffer_unordered(N) instead of .then().
  • globwatcher-globset-hash-eq-allocs Files: crates/turborepo-filewatch/src/globwatcher.rs, crates/turborepo-filewatch/src/hash_watcher.rs Maturity: stable Relevance: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos. Key findings: GlobSet::hash() and eq() allocate fresh BTreeSet/HashSet collections on every file event and hash query in watch mode, causing thousands of throwaway allocations per minute in active monorepos.
  • task-hash-loose-env-clone Files: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/run/watch.rs, crates/turborepo-lib/src/run/mod.rs Maturity: stable Relevance: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate. Key findings: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate.

Exploration

AREA: task-hash-loose-env-clone FILES: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/run/watch.rs, crates/turborepo-lib/src/run/mod.rs MATURITY: stable RELEVANCE: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate. NOTES: (unspecified)

Raw Exploration Notes: AREA: task-hash-loose-env-clone FILES: crates/turborepo-task-hash/src/lib.rs, crates/turborepo-lib/src/run/watch.rs, crates/turborepo-lib/src/run/mod.rs MATURITY: stable RELEVANCE: Every task in loose env mode (the default) deep-clones the entire process environment HashMap, causing thousands of redundant string allocations per run that an Arc wrapper would eliminate.

Selected Code Context

===== FILE: crates/turborepo-task-hash/src/lib.rs =====
//! turborepo-task-hash: Task hashing utilities for Turborepo cache invalidation
//!
//! This crate provides the core task hashing logic for Turborepo. It computes
//! hashes for tasks based on their inputs (files, environment variables,
//! dependencies) to determine cache invalidation.

pub mod global_hash;

use std::{ collections::{HashMap, HashSet}, sync::{Arc, RwLock}, };

pub use global_hash::*;
use rayon::prelude::*;
use serde::Serialize;
use thiserror::Error;
use tracing::debug;
use turbopath::{
    AbsoluteSystemPath, AnchoredSystemPath, AnchoredSystemPathBuf, RelativeUnixPathBuf,
};
use turborepo_cache::CacheHitMetadata;
// Re-export turborepo_engine::TaskNode for convenience
pub use turborepo_engine::TaskNode;
use turborepo_env::{
    BUILTIN_PASS_THROUGH_ENV, BySource, CompiledWildcards, DetailedMap, EnvironmentVariableMap,
};
use turborepo_frameworks::{Slug as FrameworkSlug, infer_framework};
use turborepo_hash::{FileHashes, LockFilePackagesRef, TaskHashable, TurboHash};
use turborepo_repository::package_graph::{PackageInfo, PackageName};
use turborepo_scm::{RepoGitIndex, SCM};
use turborepo_task_id::TaskId;
use turborepo_telemetry::events::{generic::GenericEventBuilder, task::PackageTaskEventBuilder};
use turborepo_types::{
    EnvMode, HashTrackerCacheHitMetadata, HashTrackerDetailedMap, HashTrackerInfo,
    RunOptsHashInfo, TaskDefinitionHashInfo, TaskInputs,
};

#[derive(Debug, Error)]
pub enum Error {
    #[error("Missing pipeline entry: {0}")]
    MissingPipelineEntry(TaskId<'static>),
    #[error("Missing package.json for {0}.")]
    MissingPackageJson(String),
    #[error("Cannot find package-file hash for {0}.")]
    MissingPackageFileHash(String),
    #[error("Missing hash for dependent task {0}.")]
    MissingDependencyTaskHash(String),
    #[error("Cannot acquire lock for task hash tracker.")]
    Mutex,
    #[error("Missing environment variables for {0}.")]
    MissingEnvVars(TaskId<'static>),
    #[error(
        "Error processing environment patterns for task {task_id} (including global exclusions):\n{err}"
    )]
    EnvPattern {
        task_id: TaskId<'static>,
        #[source]
        err: turborepo_env::Error,
    },
    #[error(transparent)]
    Scm(#[from] turborepo_scm::Error),
    #[error(transparent)]
    Env(#[from] turborepo_env::Error),
    #[error(transparent)]
    Regex(#[from] regex::Error),
    #[error(transparent)]
    Path(#[from] turbopath::PathError),
}

#[derive(Debug, Default)]
pub struct PackageInputsHashes {
    hashes: HashMap<TaskId<'static>, String>,
    expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>,
}

impl PackageInputsHashes {
#[tracing::instrument(skip(
    all_tasks,
    workspaces,
    task_definitions,
    repo_root,
    scm,
    _telemetry,
    pre_built_index
))]
pub fn calculate_file_hashes<'a, T>(
    scm: &SCM,
    all_tasks: impl Iterator<Item = &'a TaskNode>,
    workspaces: HashMap<&PackageName, &PackageInfo>,
    task_definitions: &HashMap<TaskId<'static>, T>,
    repo_root: &AbsoluteSystemPath,
    _telemetry: &GenericEventBuilder,
    pre_built_index: Option<&RepoGitIndex>,
) -> Result<PackageInputsHashes, Error>
where
    T: TaskDefinitionHashInfo + Sync,
{
    tracing::trace!(scm_manual=%scm.is_manual(), "scm running in {} mode", if scm.is_manual() { "manual" } else { "git" });

    // Use the pre-built index if provided, otherwise build one on the spot.
    let owned_index;
    let repo_index = match pre_built_index {
        Some(idx) => Some(idx),
        None => {
            owned_index = scm.build_repo_index(workspaces.len());
            owned_index.as_ref()
        }
    };

    // Phase 1: Collect task metadata and group by (package_path, inputs) for dedup.
    // Multiple tasks in the same package with identical inputs produce the same
    // file hashes — no need to globwalk and hash the same files repeatedly.
    struct TaskInfo<'b> {
        task_id: TaskId<'static>,
        package_path: &'b AnchoredSystemPath,
        inputs: &'b TaskInputs,
    }

    let mut task_infos = Vec::new();
    for task in all_tasks {
        let TaskNode::Task(task_id) = task else {
            continue;
        };
        let task_definition = task_definitions
            .get(task_id)
            .ok_or_else(|| Error::MissingPipelineEntry(task_id.clone()))?;
        let workspace_name = task_id.to_workspace_name();
        let pkg = workspaces
            .get(&workspace_name)
            .ok_or_else(|| Error::MissingPackageJson(workspace_name.to_string()))?;
        let package_path = pkg
            .package_json_path
            .parent()
            .unwrap_or_else(|| AnchoredSystemPath::new("").unwrap());
        let inputs = task_definition.inputs();
        task_infos.push(TaskInfo {
            task_id: task_id.clone(),
            package_path,
            inputs,
        });
    }

    // Build dedup key: (package_path_str, globs, default)
    type HashKey = (AnchoredSystemPathBuf, Vec<String>, bool);
    let mut unique_keys: Vec<HashKey> = Vec::new();
    let mut key_indices: HashMap<HashKey, usize> = HashMap::new();
    let mut task_key_map: Vec<usize> = Vec::with_capacity(task_infos.len());

    for info in &task_infos {
        let key: HashKey = (
            info.package_path.to_owned(),
            info.inputs.globs.clone(),
            info.inputs.default,
        );
        let idx = match key_indices.get(&key) {
            Some(&idx) => idx,
            None => {
                let idx = unique_keys.len();
                key_indices.insert(key.clone(), idx);
                unique_keys.push(key);
                idx
            }
        };
        task_key_map.push(idx);
    }

    debug!(
        total_tasks = task_infos.len(),
        unique_hash_keys = unique_keys.len(),
        "file hash deduplication"
    );

    // Phase 2: Compute file hashes in parallel across unique keys.
    // EMFILE (too many open files) errors are handled via retry-with-backoff
    // in the globwalk and hash_objects layers, so we can safely parallelize
    // all keys on rayon without worrying about fd exhaustion.
    let file_hash_results: Vec<Result<Arc<FileHashes>, Error>> = unique_keys
        .into_par_iter()
        .map(|(package_path, globs, default)| {
            scm.get_package_file_hashes(
                repo_root,
                &package_path,
                &globs,
                default,
                None,
                repo_index,
            )
            .map(|h| {
                let mut v: Vec<_> = h.into_iter().collect();
                v.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
                Arc::new(FileHashes(v))
            })
            .map_err(Error::from)
        })
        .collect();

    let file_hash_results: Vec<Arc<FileHashes>> = file_hash_results
        .into_iter()
        .collect::<Result<Vec<_>, _>>()?;

    // Phase 3: Distribute shared results to individual tasks.
    let mut hashes = HashMap::with_capacity(task_infos.len());
    let mut expanded_hashes = HashMap::with_capacity(task_infos.len());

    for (i, info) in task_infos.into_iter().enumerate() {
        let key_idx = task_key_map[i];
        let file_hashes = &file_hash_results[key_idx];

        let hash = file_hashes.as_ref().hash();

        hashes.insert(info.task_id.clone(), hash);
        expanded_hashes.insert(info.task_id, Arc::clone(file_hashes));
    }

    Ok(PackageInputsHashes {
        hashes,
        expanded_hashes,
    })
}

}

#[derive(Default, Debug, Clone)]
pub struct TaskHashTracker {
    state: Arc<RwLock<TaskHashTrackerState>>,
}

#[derive(Default, Debug, Serialize)]
pub struct TaskHashTrackerState {
    #[serde(skip)]
    package_task_env_vars: HashMap<TaskId<'static>, DetailedMap>,
    package_task_hashes: HashMap<TaskId<'static>, Arc<String>>,
    #[serde(skip)]
    package_task_framework: HashMap<TaskId<'static>, FrameworkSlug>,
    #[serde(skip)]
    package_task_outputs: HashMap<TaskId<'static>, Vec<AnchoredSystemPathBuf>>,
    #[serde(skip)]
    package_task_cache: HashMap<TaskId<'static>, CacheHitMetadata>,
    #[serde(skip)]
    package_task_inputs_expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>,
}

/// Caches package-inputs hashes, and package-task hashes.
pub struct TaskHasher<'a, R> {
    hashes: HashMap<TaskId<'static>, String>,
    run_opts: &'a R,
    env_at_execution_start: &'a EnvironmentVariableMap,
    global_env: EnvironmentVariableMap,
    global_env_patterns: &'a [String],
    global_hash: &'a str,
    task_hash_tracker: TaskHashTracker,
    compiled_builtins: CompiledWildcards,
    external_deps_hash_cache: HashMap<String, String>,
}

impl<'a, R: RunOptsHashInfo> TaskHasher<'a, R> {
pub fn new(
    package_inputs_hashes: PackageInputsHashes,
    run_opts: &'a R,
    env_at_execution_start: &'a EnvironmentVariableMap,
    global_hash: &'a str,
    global_env: EnvironmentVariableMap,
    global_env_patterns: &'a [String],
) -> Self {
    let PackageInputsHashes {
        hashes,
        expanded_hashes,
    } = package_inputs_hashes;

    let compiled_builtins = CompiledWildcards::compile(BUILTIN_PASS_THROUGH_ENV)
        .unwrap_or_else(|_| {
            let empty: &[&str] = &[];
            CompiledWildcards::compile(empty).unwrap()
        });

    Self {
        hashes,
        run_opts,
        env_at_execution_start,
        global_hash,
        global_env,
        global_env_patterns,
        task_hash_tracker: TaskHashTracker::new(expanded_hashes),
        compiled_builtins,
        external_deps_hash_cache: HashMap::new(),
    }
}

/// Pre-compute and cache external dependency hashes for all packages.
/// Many tasks share the same package, so this avoids re-sorting
/// transitive dependencies for every task.
pub fn precompute_external_deps_hashes<'b>(
    &mut self,
    workspaces: impl Iterator<Item = (&'b PackageName, &'b PackageInfo)>,
) {
    if self.run_opts.single_package() {
        return;
    }
    let ws: Vec<_> = workspaces.collect();
    self.external_deps_hash_cache = ws
        .par_iter()
        .map(|(name, info)| {
            let hash = get_external_deps_hash(&info.transitive_dependencies);
            (name.as_str().to_owned(), hash)
        })
        .collect();
}

#[tracing::instrument(skip(self, task_definition, task_env_mode, workspace, dependency_set))]
pub fn calculate_task_hash<T: TaskDefinitionHashInfo>(
    &self,
    task_id: &TaskId<'static>,
    task_definition: &T,
    task_env_mode: EnvMode,
    workspace: &PackageInfo,
    dependency_set: &[&TaskNode],
    telemetry: PackageTaskEventBuilder,
) -> Result<String, Error> {
    let do_framework_inference = self.run_opts.framework_inference();
    let is_monorepo = !self.run_opts.single_package();

    let hash_of_files = self
        .hashes
        .get(task_id)
        .ok_or_else(|| Error::MissingPackageFileHash(task_id.to_string()))?;
    // See if we can infer a framework
    let framework = do_framework_inference
        .then(|| infer_framework(workspace, is_monorepo))
        .flatten()
        .inspect(|framework| {
            debug!("auto detected framework for {}", task_id.package());
            debug!(
                "framework: {}, env_prefix: {:?}",
                framework.slug(),
                framework.env(self.env_at_execution_start)
            );
            telemetry.track_framework(framework.slug().to_string());
        });
    let framework_slug = framework.map(|f| f.slug());

    let env_vars = if let Some(framework) = framework {
        let mut computed_wildcards = framework.env(self.env_at_execution_start);

        if let Some(exclude_prefix) = self
            .env_at_execution_start
            .get("TURBO_CI_VENDOR_ENV_KEY")
            .filter(|prefix| !prefix.is_empty())
        {
            let computed_exclude = format!("!{exclude_prefix}*");
            debug!(
                "excluding environment variables matching wildcard {}",
                computed_exclude
            );
            computed_wildcards.push(computed_exclude);
        }

        // Combine task-specific env patterns with global env exclusions
        // Global exclusions (patterns starting with !) should apply to framework
        // inference
        let combined_env_patterns: Vec<String> = task_definition
            .env()
            .iter()
            .chain(
                self.global_env_patterns
                    .iter()
                    .filter(|p| p.starts_with('!')),
            )
            .cloned()
            .collect();

        self.env_at_execution_start
            .hashable_task_env(&computed_wildcards, &combined_env_patterns)
            .map_err(|err| Error::EnvPattern {
                task_id: task_id.clone().into_owned(),
                err,
            })?
    } else {
        let all_env_var_map = self
            .env_at_execution_start
            .from_wildcards(task_definition.env())?;

        DetailedMap {
            by_source: BySource {
                explicit: all_env_var_map.clone(),
                matching: EnvironmentVariableMap::default(),
            },
            all: all_env_var_map,
        }
    };

    let hashable_env_pairs = env_vars.all.to_hashable();
    let outputs = task_definition.hashable_outputs(task_id);
    let task_dependency_hashes = self.calculate_dependency_hashes(dependency_set)?;
    let ext_hash_fallback;
    let external_deps_hash: Option<&str> = if !is_monorepo {
        None
    } else if let Some(cached) = self.external_deps_hash_cache.get(task_id.package()) {
        Some(cached.as_str())
    } else {
        ext_hash_fallback = get_external_deps_hash(&workspace.transitive_dependencies);
        Some(ext_hash_fallback.as_str())
    };

    if !hashable_env_pairs.is_empty() {
        debug!(
            "task hash env vars for {}:{}\n vars: {:?}",
            task_id.package(),
            task_id.task(),
            hashable_env_pairs
        );
    }

    let package_dir = workspace.package_path().to_unix();
    let is_root_package = package_dir.is_empty();
    // We wrap in an Option to mimic Go's serialization of nullable values
    let optional_package_dir = (!is_root_package).then_some(package_dir);

    let task_hashable = TaskHashable {
        global_hash: self.global_hash,
        task_dependency_hashes,
        package_dir: optional_package_dir,
        hash_of_files,
        external_deps_hash,
        task: task_id.task(),
        outputs,

        pass_through_args: self.run_opts.pass_through_args(),
        env: task_definition.env(),
        resolved_env_vars: hashable_env_pairs,
        pass_through_env: task_definition.pass_through_env().unwrap_or_default(),
        env_mode: task_env_mode,
    };

    let task_hash = task_hashable.calculate_task_hash();

    let task_hash_arc: Arc<str> = Arc::from(task_hash.as_str());
    self.task_hash_tracker.insert_hash(
        task_id.clone(),
        env_vars,
        task_hash_arc,
        framework_slug,
    );

    Ok(task_hash)
}

/// Gets the hashes of a task's dependencies. Because the visitor
/// receives the nodes in topological order, we know that all of
/// the dependencies have been processed before the current task.
///
/// # Arguments
///
/// * `dependency_set`: The dependencies of the current task
///
/// returns: Result<Vec<String, Global>, Error>
fn calculate_dependency_hashes(
    &self,
    dependency_set: &[&TaskNode],
) -> Result<Vec<Arc<str>>, Error> {
    let state = self
        .task_hash_tracker
        .state
        .read()
        .expect("hash tracker rwlock poisoned");

    let mut dependency_hash_list: Vec<Arc<str>> = Vec::with_capacity(dependency_set.len());
    for dependency_task in dependency_set {
        let TaskNode::Task(dependency_task_id) = dependency_task else {
            continue;
        };

        let dependency_hash = state
            .package_task_hashes
            .get(dependency_task_id)
            .ok_or_else(|| Error::MissingDependencyTaskHash(dependency_task.to_string()))?;
        dependency_hash_list.push(Arc::clone(dependency_hash));
    }
    drop(state);

    dependency_hash_list.sort_unstable();
    dependency_hash_list.dedup();

    Ok(dependency_hash_list)
}

pub fn into_task_hash_tracker_state(self) -> TaskHashTrackerState {
    let rwlock = Arc::into_inner(self.task_hash_tracker.state)
        .expect("multiple references to tracker state still exist");
    rwlock.into_inner().unwrap()
}

pub fn task_hash_tracker(&self) -> TaskHashTracker {
    self.task_hash_tracker.clone()
}

pub fn env<T: TaskDefinitionHashInfo>(
    &self,
    task_id: &TaskId,
    task_env_mode: EnvMode,
    task_definition: &T,
) -> Result<EnvironmentVariableMap, Error> {
    match task_env_mode {
        EnvMode::Strict => {
            let pass_through_env_vars = self.env_at_execution_start.pass_through_env_compiled(
                &self.compiled_builtins,
                &self.global_env,
                task_definition.pass_through_env().unwrap_or_default(),
            )?;

            let tracker_env = self
                .task_hash_tracker
                .env_vars(task_id)
                .ok_or_else(|| Error::MissingEnvVars(task_id.clone().into_owned()))?;

            let mut full_task_env = EnvironmentVariableMap::default();
            full_task_env.union(&pass_through_env_vars);
            full_task_env.union(&tracker_env.all);

            Ok(full_task_env)
        }
        EnvMode::Loose => Ok(self.env_at_execution_start.clone()),
    }
}

}

pub fn get_external_deps_hash(
    transitive_dependencies: &Option<HashSet<turborepo_lockfiles::Package>>,
) -> String {
    let Some(transitive_dependencies) = transitive_dependencies else {
        return "".into();
    };

    // Collect references instead of cloning each Package (which has two Strings).
    let mut transitive_deps: Vec<&turborepo_lockfiles::Package> =
        transitive_dependencies.iter().collect();

    transitive_deps.sort_unstable_by(|a, b| match a.key.cmp(&b.key) {
        std::cmp::Ordering::Equal => a.version.cmp(&b.version),
        other => other,
    });

    LockFilePackagesRef(transitive_deps).hash()
}

pub fn get_internal_deps_hash(
    scm: &SCM,
    root: &AbsoluteSystemPath,
    package_dirs: Vec<&AnchoredSystemPath>,
    pre_built_index: Option<&RepoGitIndex>,
) -> Result<String, Error> {
    if package_dirs.is_empty() {
        return Ok("".into());
    }

    let owned_index;
    let repo_index = match pre_built_index {
        Some(idx) => Some(idx),
        None => {
            owned_index = scm.build_repo_index(package_dirs.len());
            owned_index.as_ref()
        }
    };

    let merged = package_dirs
        .into_par_iter()
        .map(|package_dir| {
            scm.get_package_file_hashes::<&str>(root, package_dir, &[], false, None, repo_index)
        })
        .reduce(
            || Ok(HashMap::new()),
            |acc, hashes| {
                let mut acc = acc?;
                let hashes = hashes?;
                acc.extend(hashes.into_iter());
                Ok(acc)
            },
        )?;

    let mut file_hashes: Vec<_> = merged.into_iter().collect();
    file_hashes.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
    Ok(FileHashes(file_hashes).hash())
}

impl TaskHashTracker {
pub fn new(input_expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>) -> Self {
    Self {
        state: Arc::new(RwLock::new(TaskHashTrackerState {
            package_task_inputs_expanded_hashes: input_expanded_hashes,
            ..Default::default()
        })),
    }
}

pub fn hash(&self, task_id: &TaskId) -> Option<Arc<str>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_hashes.get(task_id).cloned()
}

fn insert_hash(
    &self,
    task_id: TaskId<'static>,
    env_vars: DetailedMap,
    hash: Arc<str>,
    framework_slug: Option<FrameworkSlug>,
) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state
        .package_task_env_vars
        .insert(task_id.clone(), env_vars);
    if let Some(framework) = framework_slug {
        // Only pay for one extra clone when framework inference is active.
        state
            .package_task_framework
            .insert(task_id.clone(), framework);
    }
    state.package_task_hashes.insert(task_id, hash);
}

pub fn env_vars(&self, task_id: &TaskId) -> Option<DetailedMap> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_env_vars.get(task_id).cloned()
}

pub fn framework(&self, task_id: &TaskId) -> Option<FrameworkSlug> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_framework.get(task_id).cloned()
}

pub fn expanded_outputs(&self, task_id: &TaskId) -> Option<Vec<AnchoredSystemPathBuf>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_outputs.get(task_id).cloned()
}

pub fn insert_expanded_outputs(
    &self,
    task_id: TaskId<'static>,
    outputs: Vec<AnchoredSystemPathBuf>,
) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state.package_task_outputs.insert(task_id, outputs);
}

pub fn cache_status(&self, task_id: &TaskId) -> Option<CacheHitMetadata> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_cache.get(task_id).copied()
}

pub fn insert_cache_status(&self, task_id: TaskId<'static>, cache_status: CacheHitMetadata) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state.package_task_cache.insert(task_id, cache_status);
}

pub fn get_expanded_inputs(&self, task_id: &TaskId) -> Option<Arc<FileHashes>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state
        .package_task_inputs_expanded_hashes
        .get(task_id)
        .cloned()
}

}

// Implement HashTrackerInfo for TaskHashTracker to allow use with
// turborepo-run-summary. The trait is defined in turborepo-types to enable
// proper dependency direction (task-hash doesn't depend on run-summary).
impl HashTrackerInfo for TaskHashTracker {
fn hash(&self, task_id: &TaskId) -> Option<Arc<str>> {
    TaskHashTracker::hash(self, task_id)
}

fn env_vars(&self, task_id: &TaskId) -> Option<HashTrackerDetailedMap> {
    TaskHashTracker::env_vars(self, task_id).map(|detailed| HashTrackerDetailedMap {
        explicit: detailed.by_source.explicit.to_secret_hashable(),
        matching: detailed.by_source.matching.to_secret_hashable(),
    })
}

fn cache_status(&self, task_id: &TaskId) -> Option<HashTrackerCacheHitMetadata> {
    TaskHashTracker::cache_status(self, task_id).map(|status| {
        let (local, remote) = match status.source {
            turborepo_cache::CacheSource::Local => (true, false),
            turborepo_cache::CacheSource::Remote => (false, true),
        };
        HashTrackerCacheHitMetadata {
            local,
            remote,
            time_saved: status.time_saved,
        }
    })
}

fn expanded_outputs(&self, task_id: &TaskId) -> Option<Vec<AnchoredSystemPathBuf>> {
    TaskHashTracker::expanded_outputs(self, task_id)
}

fn framework(&self, task_id: &TaskId) -> Option<String> {
    TaskHashTracker::framework(self, task_id).map(|f| f.to_string())
}

fn expanded_inputs(&self, task_id: &TaskId) -> Option<Vec<(RelativeUnixPathBuf, String)>> {
    TaskHashTracker::get_expanded_inputs(self, task_id).map(|file_hashes| file_hashes.0.clone())
}

}

// Implement HashTrackerProvider for TaskHashTracker to allow use with
// turborepo-task-executor's TaskExecutor.
impl turborepo_task_executor::HashTrackerProvider for TaskHashTracker {
fn insert_cache_status(&self, task_id: TaskId<'static>, status: CacheHitMetadata) {
    TaskHashTracker::insert_cache_status(self, task_id, status)
}

fn insert_expanded_outputs(
    &self,
    task_id: TaskId<'static>,
    outputs: Vec<AnchoredSystemPathBuf>,
) {
    TaskHashTracker::insert_expanded_outputs(self, task_id, outputs)
}

}

#[cfg(test)]
mod test {
use super::*;

#[test]
fn test_hash_tracker_is_send_and_sync() {
    // We need the tracker to implement these traits as multiple tasks will query
    // and write to it
    fn assert_send<T: Send>() {}
    fn assert_sync<T: Sync>() {}
    assert_send::<TaskHashTracker>();
    assert_sync::<TaskHashTracker>();
}

#[test]
fn test_hash_tracker_concurrent_reads() {
    let tracker = TaskHashTracker::new(HashMap::new());
    let task_id: TaskId<'static> = TaskId::new("pkg", "build");
    tracker.insert_hash(
        task_id.clone(),
        DetailedMap::default(),
        Arc::from("abc123"),
        None,
    );

    // Multiple concurrent reads should not deadlock or panic with RwLock
    std::thread::scope(|s| {
        for _ in 0..8 {
            let tracker = &tracker;
            let task_id = &task_id;
            s.spawn(move || {
                for _ in 0..100 {
                    let h = tracker.hash(task_id);
                    assert_eq!(h.as_deref(), Some("abc123"));
                }
            });
        }
    });
}

#[test]
fn test_hash_tracker_concurrent_read_write() {
    let tracker = TaskHashTracker::new(HashMap::new());

    // Pre-create owned task IDs to avoid lifetime issues with TaskId borrows
    let task_ids: Vec<TaskId<'static>> = (0..50)
        .map(|i| TaskId::new("pkg", &format!("task-{i}")).into_owned())
        .collect();

    // One writer, many readers — verifies RwLock allows concurrent reads
    // while writes are exclusive, without deadlock.
    std::thread::scope(|s| {
        let tracker = &tracker;
        let task_ids = &task_ids;

        s.spawn(move || {
            for (i, task_id) in task_ids.iter().enumerate() {
                tracker.insert_hash(
                    task_id.clone(),
                    DetailedMap::default(),
                    Arc::from(format!("hash-{i}").as_str()),
                    None,
                );
            }
        });

        for _ in 0..4 {
            s.spawn(move || {
                for task_id in task_ids {
                    // May or may not find the hash depending on timing — that's fine,
                    // we're testing for absence of panics/deadlocks.
                    let _ = tracker.hash(task_id);
                    let _ = tracker.env_vars(task_id);
                    let _ = tracker.cache_status(task_id);
                }
            });
        }
    });
}

#[test]
fn test_expanded_inputs_returns_cloned_data() {
    use turborepo_types::HashTrackerInfo;

    let task_id: TaskId<'static> = TaskId::new("pkg", "build");
    // Sorted by key (the invariant FileHashes requires)
    let file_hashes = FileHashes(vec![
        (
            RelativeUnixPathBuf::new("package.json").unwrap(),
            "def456".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("src/index.ts").unwrap(),
            "abc123".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("src/utils/helper.ts").unwrap(),
            "ghi789".to_string(),
        ),
    ]);

    let mut input_hashes = HashMap::new();
    input_hashes.insert(task_id.clone(), Arc::new(file_hashes));
    let tracker = TaskHashTracker::new(input_hashes);

    // Via concrete method
    let arc_result = tracker.get_expanded_inputs(&task_id);
    assert!(arc_result.is_some());
    let arc_hashes = arc_result.unwrap();
    assert_eq!(arc_hashes.0.len(), 3);
    assert_eq!(arc_hashes.0[1].0.as_str(), "src/index.ts");
    assert_eq!(arc_hashes.0[1].1, "abc123");

    // Via trait method — returns sorted Vec
    let trait_result: Option<Vec<(RelativeUnixPathBuf, String)>> =
        HashTrackerInfo::expanded_inputs(&tracker, &task_id);
    assert!(trait_result.is_some());
    let trait_hashes = trait_result.unwrap();
    assert_eq!(trait_hashes.len(), 3);
    assert_eq!(trait_hashes[0].0.as_str(), "package.json");
    assert_eq!(trait_hashes[0].1, "def456");
    // Must be sorted by key
    assert!(
        trait_hashes.windows(2).all(|w| w[0].0 < w[1].0),
        "expanded_inputs should return sorted keys"
    );

    // Missing task returns None
    let missing = TaskId::new("other", "test");
    assert!(tracker.get_expanded_inputs(&missing).is_none());
    assert!(HashTrackerInfo::expanded_inputs(&tracker, &missing).is_none());
}

// Regression: expanded_inputs data must contain all entries and be sorted
// by key. This captures the invariant that must hold when switching the
// return type from BTreeMap to sorted Vec.
#[test]
fn test_expanded_inputs_sorted_and_complete() {
    use turborepo_types::HashTrackerInfo;

    let task_id: TaskId<'static> = TaskId::new("pkg", "build");
    // Sorted by key (FileHashes invariant)
    let file_hashes = FileHashes(vec![
        (
            RelativeUnixPathBuf::new("a/first.ts").unwrap(),
            "aaa".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("a/second.ts").unwrap(),
            "bbb".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("m/middle.ts").unwrap(),
            "mmm".to_string(),
        ),
        (
            RelativeUnixPathBuf::new("z/last.ts").unwrap(),
            "zzz".to_string(),
        ),
    ]);

    let mut input_hashes = HashMap::new();
    input_hashes.insert(task_id.clone(), Arc::new(file_hashes));
    let tracker = TaskHashTracker::new(input_hashes);

    let result = HashTrackerInfo::expanded_inputs(&tracker, &task_id).unwrap();
    assert_eq!(result.len(), 4, "all entries must be present");

    // Entries must be sorted by key
    assert!(
        result.windows(2).all(|w| w[0].0 < w[1].0),
        "expanded_inputs must return keys in sorted order"
    );

    // Verify specific values
    assert_eq!(result[0].0.as_str(), "a/first.ts");
    assert_eq!(result[0].1, "aaa");
    assert_eq!(result[3].0.as_str(), "z/last.ts");
    assert_eq!(result[3].1, "zzz");
}

#[test]
fn test_external_deps_hash_deterministic() {
    use turborepo_lockfiles::Package;

    let deps: HashSet<Package> = vec![
        Package {
            key: "react".to_string(),
            version: "18.0.0".to_string(),
        },
        Package {
            key: "lodash".to_string(),
            version: "4.17.21".to_string(),
        },
        Package {
            key: "typescript".to_string(),
            version: "5.0.0".to_string(),
        },
    ]
    .into_iter()
    .collect();

    let hash1 = get_external_deps_hash(&Some(deps.clone()));
    let hash2 = get_external_deps_hash(&Some(deps));
    assert_eq!(hash1, hash2, "same deps should produce same hash");
    assert!(!hash1.is_empty(), "hash should be non-empty");
}

#[test]
fn test_external_deps_hash_empty() {
    let hash_none = get_external_deps_hash(&None);
    assert_eq!(hash_none, "", "None deps should produce empty hash");

    let hash_empty = get_external_deps_hash(&Some(HashSet::new()));
    assert!(
        !hash_empty.is_empty(),
        "empty set should produce non-empty hash"
    );
}

#[test]
fn test_external_deps_hash_order_independent() {
    use turborepo_lockfiles::Package;

    let deps1: HashSet<Package> = vec![
        Package {
            key: "a".to_string(),
            version: "1.0".to_string(),
        },
        Package {
            key: "b".to_string(),
            version: "2.0".to_string(),
        },
    ]
    .into_iter()
    .collect();

    let deps2: HashSet<Package> = vec![
        Package {
            key: "b".to_string(),
            version: "2.0".to_string(),
        },
        Package {
            key: "a".to_string(),
            version: "1.0".to_string(),
        },
    ]
    .into_iter()
    .collect();

    let hash1 = get_external_deps_hash(&Some(deps1));
    let hash2 = get_external_deps_hash(&Some(deps2));
    assert_eq!(
        hash1, hash2,
        "hash should be order-independent since we sort"
    );
}

#[test]
fn test_tracker_pre_sized_hashmaps() {
    let mut input_hashes = HashMap::new();
    for i in 0..100 {
        let task_id = TaskId::new("pkg", &format!("task-{i}")).into_owned();
        input_hashes.insert(task_id, Arc::new(FileHashes(Vec::new())));
    }
    let tracker = TaskHashTracker::new(input_hashes);

    // Insert hashes and verify pre-sizing didn't break anything
    for i in 0..100 {
        let task_id = TaskId::new("pkg", &format!("task-{i}")).into_owned();
        tracker.insert_hash(
            task_id.clone(),
            DetailedMap::default(),
            Arc::from(format!("hash-{i}").as_str()),
            None,
        );
        assert_eq!(
            tracker.hash(&task_id).as_deref(),
            Some(format!("hash-{i}").as_str())
        );
    }
}

// Validates that sort+dedup produces the same result as the previous
// HashSet→Vec→sort approach for dependency hash deduplication.
#[test]
fn test_sort_dedup_matches_hashset_behavior() {
    let inputs: Vec<Vec<&str>> = vec![
        vec!["abc", "def", "abc", "ghi", "def"],
        vec!["zzz", "aaa", "mmm"],
        vec!["same", "same", "same"],
        vec![],
        vec!["only-one"],
    ];

    for input in inputs {
        // New approach: sort + dedup
        let mut sort_dedup: Vec<String> = input.iter().map(|s| s.to_string()).collect();
        sort_dedup.sort_unstable();
        sort_dedup.dedup();

        // Old approach: HashSet → Vec → sort
        let hash_set: HashSet<String> = input.iter().map(|s| s.to_string()).collect();
        let mut hashset_sorted: Vec<String> = hash_set.into_iter().collect();
        hashset_sorted.sort();

        assert_eq!(
            sort_dedup, hashset_sorted,
            "sort+dedup and hashset+sort should produce identical results for: {input:?}"
        );
    }
}

}

===== END FILE: crates/turborepo-task-hash/src/lib.rs =====

===== FILE: crates/turborepo-lib/src/run/watch.rs =====

use std::{
    collections::HashSet,
    ops::DerefMut as _,
    sync::{Arc, Mutex},
};

use futures::{future::join_all, StreamExt};
use miette::{Diagnostic, SourceSpan};
use thiserror::Error;
use tokio::{select, sync::Notify, task::JoinHandle};
use tracing::{instrument, trace};
use turborepo_daemon::{proto, DaemonConnector, DaemonConnectorError, DaemonError, Paths};
use turborepo_repository::package_graph::PackageName;
use turborepo_signals::{listeners::get_signal, SignalHandler};
use turborepo_telemetry::events::command::CommandEventBuilder;
use turborepo_ui::sender::UISender;

use crate::{
    commands::CommandBase,
    config::resolve_turbo_config_path,
    engine::{EngineExt, TaskNode},
    get_version, opts,
    run::{self, builder::RunBuilder, scope::target_selector::InvalidSelectorError, Run},
};

#[derive(Debug)]
enum ChangedPackages {
    All,
    Some(HashSet<PackageName>),
}

impl Default for ChangedPackages {
fn default() -> Self {
    ChangedPackages::Some(HashSet::new())
}
}

impl ChangedPackages {
pub fn is_empty(&self) -> bool {
    match self {
        ChangedPackages::All => false,
        ChangedPackages::Some(pkgs) => pkgs.is_empty(),
    }
}
}

pub struct WatchClient {
    run: Arc<Run>,
    watched_packages: HashSet<PackageName>,
    persistent_tasks_handle: Option<RunHandle>,
    active_runs: Vec<RunHandle>,
    connector: DaemonConnector,
    base: CommandBase,
    telemetry: CommandEventBuilder,
    handler: SignalHandler,
    ui_sender: Option<UISender>,
    ui_handle: Option<JoinHandle<Result<(), turborepo_ui::Error>>>,
    experimental_write_cache: bool,
}

struct RunHandle {
    stopper: run::RunStopper,
    run_task: JoinHandle<Result<i32, run::Error>>,
}

#[derive(Debug, Error, Diagnostic)]
pub enum Error {
    #[error("Failed to connect to daemon.")]
    #[diagnostic(transparent)]
    Daemon(#[from] DaemonError),
    #[error("Failed to connect to daemon.")]
    DaemonConnector(#[from] DaemonConnectorError),
    #[error("Failed to decode message from daemon.")]
    Decode(#[from] prost::DecodeError),
    #[error("Could not get current executable.")]
    CurrentExe(std::io::Error),
    #[error("Could not start turbo.")]
    Start(std::io::Error),
    #[error(transparent)]
    #[diagnostic(transparent)]
    Run(#[from] run::Error),
    #[error("--since is not supported in Watch Mode.")]
    SinceNotSupported,
    #[error(transparent)]
    Opts(#[from] opts::Error),
    #[error("Invalid filter pattern")]
    InvalidSelector(#[from] InvalidSelectorError),
    #[error("Filter cannot contain a git range in Watch Mode.")]
    GitRangeInFilter {
        #[source_code]
        filter: String,
        #[label]
        span: SourceSpan,
    },
    #[error("Daemon connection closed.")]
    ConnectionClosed,
    #[error("Failed to subscribe to signal handler. Shutting down.")]
    NoSignalHandler,
    #[error("Watch interrupted due to signal.")]
    SignalInterrupt,
    #[error("Package change error.")]
    PackageChange(#[from] tonic::Status),
    #[error(transparent)]
    UI(#[from] turborepo_ui::Error),
    #[error("Could not connect to UI thread: {0}")]
    UISend(String),
    #[error("Invalid config: {0}")]
    Config(#[from] crate::config::Error),
    #[error(transparent)]
    SignalListener(#[from] turborepo_signals::listeners::Error),
}

impl WatchClient {
pub async fn new(
    base: CommandBase,
    experimental_write_cache: bool,
    telemetry: CommandEventBuilder,
) -> Result<Self, Error> {
    let signal = get_signal()?;
    let handler = SignalHandler::new(signal);

    let standard_config_path = resolve_turbo_config_path(&base.repo_root)?;

    let new_base = base.clone();
    let (run, _analytics) = RunBuilder::new(new_base, None)?
        .build(&handler, telemetry.clone())
        .await?;
    let run = Arc::new(run);

    let watched_packages = run.get_relevant_packages();

    let (ui_sender, ui_handle) = run.start_ui()?.unzip();

    // Determine if we're using a custom turbo.json path
    let custom_turbo_json_path =
        if base.opts.repo_opts.root_turbo_json_path != standard_config_path {
            tracing::info!(
                "Using custom turbo.json path: {} (standard: {})",
                base.opts.repo_opts.root_turbo_json_path,
                standard_config_path
            );
            Some(base.opts.repo_opts.root_turbo_json_path.clone())
        } else {
            None
        };

    let connector = DaemonConnector {
        can_start_server: true,
        can_kill_server: true,
        paths: Paths::from_repo_root(&base.repo_root),
        custom_turbo_json_path,
    };

    Ok(Self {
        base,
        run,
        watched_packages,
        connector,
        handler,
        telemetry,
        experimental_write_cache,
        persistent_tasks_handle: None,
        active_runs: Vec::new(),
        ui_sender,
        ui_handle,
    })
}

pub async fn start(&mut self) -> Result<(), Error> {
    let connector = self.connector.clone();
    let mut client = connector.connect().await?;

    let mut events = client.package_changes().await?;

    let signal_subscriber = self.handler.subscribe().ok_or(Error::NoSignalHandler)?;

    // A std::sync::Mutex is fine here: the lock is only held for short,
    // synchronous critical sections and is never held across an .await,
    // so it cannot block the tokio runtime or deadlock.
    let changed_packages = Mutex::new(ChangedPackages::default());
    let notify_run = Arc::new(Notify::new());
    let notify_event = notify_run.clone();

    let event_fut = async {
        while let Some(event) = events.next().await {
            let event = event?;
            Self::handle_change_event(&changed_packages, event.event.unwrap())?;
            notify_event.notify_one();
        }

        Err(Error::ConnectionClosed)
    };

    let run_fut = async {
        loop {
            notify_run.notified().await;
            let some_changed_packages = {
                let mut changed_packages_guard =
                    changed_packages.lock().expect("poisoned lock");
                (!changed_packages_guard.is_empty())
                    .then(|| std::mem::take(changed_packages_guard.deref_mut()))
            };

            if let Some(mut changed_packages) = some_changed_packages {
                // Clean up currently running tasks
                self.active_runs.retain(|h| !h.run_task.is_finished());

                match changed_packages {
                    ChangedPackages::Some(pkgs) => {
                        let impacted = self.stop_impacted_tasks(&pkgs).await;
                        changed_packages = ChangedPackages::Some(impacted);
                    }
                    ChangedPackages::All => {
                        for handle in self.active_runs.drain(..) {
                            handle.stopper.stop().await;
                            let _ = handle.run_task.await;
                        }
                    }
                }
                let new_run = self.execute_run(changed_packages).await?;
                self.active_runs.push(new_run);
            }
        }
    };

    select! {
        biased;
        _ = signal_subscriber.listen() => {
            tracing::info!("shutting down");
            Err(Error::SignalInterrupt)
        }
        result = event_fut => {
            result
        }
        run_result = run_fut => {
            run_result
        }
    }
}

#[instrument(skip(changed_packages))]
fn handle_change_event(
    changed_packages: &Mutex<ChangedPackages>,
    event: proto::package_change_event::Event,
) -> Result<(), Error> {
    // Should we recover here?
    match event {
        proto::package_change_event::Event::PackageChanged(proto::PackageChanged {
            package_name,
        }) => {
            let package_name = PackageName::from(package_name);

            match changed_packages.lock().expect("poisoned lock").deref_mut() {
                ChangedPackages::All => {
                    // If we've already changed all packages, ignore
                }
                ChangedPackages::Some(ref mut pkgs) => {
                    pkgs.insert(package_name);
                }
            }
        }
        proto::package_change_event::Event::RediscoverPackages(_) => {
            *changed_packages.lock().expect("poisoned lock") = ChangedPackages::All;
        }
        proto::package_change_event::Event::Error(proto::PackageChangeError { message }) => {
            return Err(DaemonError::Unavailable(message).into());
        }
    }

    Ok(())
}

async fn stop_impacted_tasks(&self, pkgs: &HashSet<PackageName>) -> HashSet<PackageName> {
    let engine = self.run.engine();

    let impacted_nodes = engine.tasks_impacted_by_packages(pkgs);

    // Extract task IDs from task nodes (filtering out Root nodes)
    let task_ids: Vec<_> = impacted_nodes
        .iter()
        .filter_map(|node| match node {
            TaskNode::Task(task_id) => Some(task_id.clone()),
            TaskNode::Root => None,
        })
        .collect();

    // Collect unique impacted packages
    let impacted_packages: HashSet<_> = task_ids
        .iter()
        .map(|task_id| PackageName::from(task_id.package()))
        .collect();

    join_all(
        self.active_runs
            .iter()
            .map(|handle| handle.stopper.stop_tasks(&task_ids)),
    )
    .await;

    impacted_packages
}

/// Shut down any resources that run as part of watch.
pub async fn shutdown(&mut self) {
    if let Some(sender) = &self.ui_sender {
        sender.stop().await;
    }
    for handle in self.active_runs.drain(..) {
        handle.stopper.stop().await;
        let _ = handle.run_task.await;
    }
    if let Some(RunHandle { stopper, run_task }) = self.persistent_tasks_handle.take() {
        // Shut down the tasks for the run
        stopper.stop().await;
        // Run should exit shortly after we stop all child tasks, wait for it to finish
        // to ensure all messages are flushed.
        let _ = run_task.await;
    }
}

/// Executes a run with the given changed packages. Splits the run into two
/// parts:
/// 1. The persistent tasks that are not allowed to be interrupted
/// 2. The non-persistent tasks and the persistent tasks that are allowed to
///    be interrupted
///
/// Returns a handle to the task running (2)
async fn execute_run(&mut self, changed_packages: ChangedPackages) -> Result<RunHandle, Error> {
    // Should we recover here?
    trace!("handling run with changed packages: {changed_packages:?}");
    match changed_packages {
        ChangedPackages::Some(packages) => {
            let packages = packages
                .into_iter()
                .filter(|pkg| {
                    // If not in the watched packages set, ignore
                    self.watched_packages.contains(pkg)
                })
                .collect();

            let mut opts = self.base.opts().clone();
            if !self.experimental_write_cache {
                opts.cache_opts.cache.remote.write = false;
                opts.cache_opts.cache.remote.read = false;
            }

            let new_base = CommandBase::from_opts(
                opts,
                self.base.repo_root.clone(),
                get_

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Current State

Turborepo already does the "big rocks" correctly — task hashing is parallelized by topological waves via rayon (#11969 shipped), and file-hash work is deduplicated by (package_path, inputs) groups. However, several critical-path hotspots cap scalability:

  • Lock contention in TaskHashTracker (single RwLock over 5 HashMaps) serializes parallel hash computation
  • Regex recompilation for env filtering happens per task
  • Unnecessary cloning of large EnvironmentVariableMaps in the visitor loop
  • Sequential filesystem I/O for turbo.json existence checks during discovery
  • Avoidable I/O and allocations in cache/SCM code paths

Required Changes

P0 — Land First (High Impact, Low-Medium Risk)

| # | Area | Fix | Impact (100pkg/500task) | Risk |
|---|------|-----|-------------------------|------|
| 7 | TaskHashTracker RwLock | Replace single `RwLock<TaskHashTrackerState>` with per-field `DashMap`s | 30-70% reduction in hash precompute time (hundreds of ms to seconds) | M — test dependency ordering across waves |
| 2 | Cache zstd compression | Add `encoder.multithread(min(8, available_parallelism))` | 20-60% faster cache-miss archiving (hundreds of ms) | L — different archive bytes but restore works fine |
| 10 | Discovery sequential stat | Replace `.then()` with `.map().buffer_unordered(64)` | 50-300ms cold start savings | L — bound concurrency, preserve ordering |
| 8 | Visitor clone→remove | `precomputed.remove(&info)` instead of `.get().clone()`, drop `Arc<Mutex>` wrapper | Tens to low hundreds of ms on 500 tasks | L — just ownership change |
| 5 | Lockfile transitive closure | Pre-size DashMap + add `DepCache` for `all_dependencies()` results | Hundreds of ms to seconds on startup | M — cache correctness depends on `all_dependencies` being pure |
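The per-field split behind win #7 can be sketched with std locks; a DashMap is essentially a sharded version of the same idea. The types and field names below are illustrative, not the actual turborepo-task-hash definitions:

```rust
use std::collections::HashMap;
use std::sync::RwLock;

// Coarse design: one lock guards every map, so a writer to `hashes`
// stalls readers of the unrelated outputs map.
#[allow(dead_code)]
struct CoarseTracker {
    state: RwLock<(HashMap<String, String>, HashMap<String, Vec<String>>)>,
}

// Split design: each independently-accessed map gets its own lock
// (in turborepo this would be one DashMap per field for sharded writes).
struct PerFieldTracker {
    hashes: RwLock<HashMap<String, String>>,
    expanded_outputs: RwLock<HashMap<String, Vec<String>>>,
}

impl PerFieldTracker {
    fn new() -> Self {
        Self {
            hashes: RwLock::new(HashMap::new()),
            expanded_outputs: RwLock::new(HashMap::new()),
        }
    }

    // Writing a hash no longer contends with output reads.
    fn insert_hash(&self, task: &str, hash: &str) {
        self.hashes.write().unwrap().insert(task.to_string(), hash.to_string());
    }

    fn hash(&self, task: &str) -> Option<String> {
        self.hashes.read().unwrap().get(task).cloned()
    }
}

fn main() {
    let t = PerFieldTracker::new();
    t.insert_hash("web#build", "abc123");
    assert_eq!(t.hash("web#build").as_deref(), Some("abc123"));
    assert!(t.expanded_outputs.read().unwrap().is_empty());
}
```

With the coarse design, every `insert_hash()` takes the single write lock and blocks all concurrent readers; after the split, contention is limited to the one map actually being touched.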

P1 — Remove Hot-Path Overhead

| # | Area | Fix | Impact | Risk |
|---|------|-----|--------|------|
| 13 | Cache write-through | Tier 1: `spawn_blocking` for `fs.put()`. Tier 2: pass archive bytes directly | Tens of ms (T1) to seconds (T2) per remote hit | M — format compatibility for T2 |
| 6 | Env regex caching | Add `hashable_task_env_compiled()` + `wildcard_map_from_compiled()` using existing `CompiledWildcards` | 50-300ms across 500 tasks | M — preserve Windows case-insensitive behavior |
| 9 | Change mapper lookup | Build `HashMap<package_root, PackageName>`, walk ancestors instead of O(n×m) scan | 10-200ms → 1-10ms for --affected/watch | L-M — handle nested packages |
| 4 | SCM hash allocations | Store `GitHash([u8; 20])` instead of hex String; stream file content; single directory walk | Tens to hundreds of ms + peak memory reduction | M — hash stability risk |
| 1 | Engine config loading | Parallelize turbo.json IO with `buffer_unordered(64)` + optional on-disk mtime cache | 100-500ms cold start (possibly seconds with cache) | M — cache invalidation bugs |

P2 — Cleanup & Deferred

| # | Area | Fix | Impact | Risk |
|---|------|-----|--------|------|
| 12 | Loose env clone | Overlay pattern: `ExecutionEnv { base: &Map, overrides: SmallVec }` instead of full clone | 50-300ms on 500 tasks | M-H — invasive API change |
| 3 | Glob pattern cache | `CompiledGlobWalker` type or bounded `moka::Cache<u64, Arc<Program>>` | Tens to hundreds of ms | M — cache key must include all options |
| 11 | GlobSet hash/eq | Store `include_raw: BTreeSet<String>` at construction, hash that | Microseconds to low ms (watch mode cleanup) | L |
| 14 | Package graph allocs | Pre-size HashMaps, intern PackageName as `Arc<str>` | 10-50ms | L-M |
| 15 | Parallel task hash | Already shipped | N/A | N/A |
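The overlay pattern for win #12 replaces a full `EnvironmentVariableMap` clone per task with a borrowed base plus a small per-task override list. A minimal stdlib sketch (`ExecutionEnv` here is a hypothetical stand-in; a real version would use `SmallVec` for the overrides and match the crate's case-sensitivity rules):

```rust
use std::collections::HashMap;

// Overlay view: lookups consult the per-task overrides first, then fall
// back to the shared base map. No per-task deep clone of the base.
struct ExecutionEnv<'a> {
    base: &'a HashMap<String, String>,
    overrides: Vec<(String, String)>, // SmallVec in a real implementation
}

impl<'a> ExecutionEnv<'a> {
    fn get(&self, key: &str) -> Option<&str> {
        self.overrides
            .iter()
            .rev() // later overrides win
            .find(|(k, _)| k == key)
            .map(|(_, v)| v.as_str())
            .or_else(|| self.base.get(key).map(String::as_str))
    }
}

fn main() {
    let mut base = HashMap::new();
    base.insert("PATH".to_string(), "/usr/bin".to_string());
    base.insert("NODE_ENV".to_string(), "development".to_string());

    // One task overrides NODE_ENV; the base map is borrowed, not cloned.
    let env = ExecutionEnv {
        base: &base,
        overrides: vec![("NODE_ENV".into(), "production".into())],
    };
    assert_eq!(env.get("NODE_ENV"), Some("production"));
    assert_eq!(env.get("PATH"), Some("/usr/bin"));
}
```

The trade-off flagged as M-H risk above: every consumer that currently expects an owned map must switch to this two-layer lookup, which is an invasive API change even though each call site is simple.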

Ripple Effects

  • #7 is prerequisite for #15's full benefit — parallel hashing is shipped but serialized by tracker lock contention
  • #6 and #12 hit the same phase — fixing only one makes the other the new bottleneck
  • #13 interacts with #2 — if you switch to "store archive bytes directly," both remote and local must use the same tar+zstd format
  • #4 and #5 can affect hash stability — any representation change (raw bytes, Arc) must preserve exact ordering/normalization
  • Windows/macOS differences — env filtering is case-insensitive on Windows; caching keys must include platform flag

What This Unlocks

Aggregate for a 100-package / 500-task / 10K-file repo:

  • Cold start: 0.3–2.0s faster (#10 + #5 + #1 + #14)
  • Task hash precompute: 0.3–1.5s faster (#7 + #6 + #8 + #12)
  • Cache miss archiving: Hundreds of ms to seconds (#2)
  • Remote cache hit: Up to seconds faster with less disk churn (#13)

Operationally: better CI scaling, watch-mode/--affected stops growing linearly with package count, more predictable latency.

Effort & Risk

Dependency Ordering (land in this sequence)

  1. Quick wins (reduce noise for measuring rest): #8, #10, #2, #11
  2. Unlock parallel hashing: #7 → re-measure #15 benefits
  3. Hot-path per-task overhead: #6, #12
  4. Startup/affected/watch: #5, #9, then #1/#14
  5. Cache I/O: #13 Tier 1 → Tier 2

Benchmark Suite Recipe

A) Criterion microbenchmarks:

  • turborepo-env/benches/env_wildcards.rs — 200 vars, 20 patterns, compiled vs uncompiled
  • turborepo-lockfiles/benches/transitive_closure.rs — real lockfile fixture, 100 workspaces
  • turborepo-cache/benches/archive_create.rs — 10K files, single vs multi-thread zstd
  • turborepo-repository/benches/discovery.rs — 100 packages, sequential vs buffer_unordered
  • turborepo-repository/benches/change_mapper.rs — 100 packages, 10K file paths
  • turborepo-task-hash/benches/task_hash.rs — 500 tasks, RwLock vs DashMap tracker

B) End-to-end (hyperfine + generated fixture monorepo):

# Cold start
rm -rf .turbo && hyperfine -w 3 './target/release/turbo run build --dry-run'

# Hash scaling
RAYON_NUM_THREADS=1 vs =N comparison

# Cache miss
turbo run build --force (clear cache, measure archive creation)

# Remote hit write-through
# Local stub remote cache server, measure latency before/after

C) Tracing spans to standardize:

  • engine.config_load.*, repository.discovery.*, lockfiles.transitive_closure.*
  • task_hash.wave.*, task_hash.calculate.*
  • cache.archive.create.*, cache.fetch.remote.*, cache.write_through.*
  • affected.detect_package.*

Oracle Session (inlined)

Plan Markdown Repair

The previous draft failed validation against the required output format. Rewrite it to satisfy the exact contract below while preserving useful details.

Mode: research

Output Format Contract (REQUIRED)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Use the exact section headings above and include substantive content in each section.

Validation Errors

  • Missing required section heading: ## Required Changes

Invalid Draft

Here's the comprehensive GPT-5.2 Pro analysis of every performance optimization opportunity in turborepo, synthesized from 45 minutes of extended thinking:

---

## Current State

Turborepo already does the "big rocks" correctly — task hashing is parallelized by topological waves via rayon (#11969 shipped), and file-hash work is deduplicated by `(package_path, inputs)` groups. However, several critical-path hotspots cap scalability:

- **Lock contention** in TaskHashTracker (single `RwLock` over 5 HashMaps) serializes parallel hash computation
- **Regex recompilation** for env filtering happens per task
- **Unnecessary cloning** of large EnvironmentVariableMaps in the visitor loop
- **Sequential filesystem I/O** for turbo.json existence checks during discovery
- **Avoidable I/O and allocations** in cache/SCM code paths

---

## Required Changes (15 Areas)

### P0 — Land First (High Impact, Low-Medium Risk)

| # | Area | Fix | Impact (100pkg/500task) | Risk |
|---|------|-----|------------------------|------|
| **7** | TaskHashTracker RwLock | Replace single `RwLock<TaskHashTrackerState>` with per-field `DashMap`s | **30-70% reduction** in hash precompute time (hundreds of ms to seconds) | M — test dependency ordering across waves |
| **2** | Cache zstd compression | Add `encoder.multithread(min(8, available_parallelism))` | **20-60% faster** cache-miss archiving (hundreds of ms) | L — different archive bytes but restore works fine |
| **10** | Discovery sequential stat | Replace `.then()` with `.map().buffer_unordered(64)` | **50-300ms** cold start savings | L — bound concurrency, preserve ordering |
| **8** | Visitor clone→remove | `precomputed.remove(&info)` instead of `.get().clone()`, drop `Arc<Mutex>` wrapper | **Tens to low hundreds of ms** on 500 tasks | L — just ownership change |
| **5** | Lockfile transitive closure | Pre-size DashMap + add `DepCache` for `all_dependencies()` results | **Hundreds of ms to seconds** on startup | M — cache correctness depends on `all_dependencies` being pure |

### P1 — Remove Hot-Path Overhead

| # | Area | Fix | Impact | Risk |
|---|------|-----|--------|------|
| **13** | Cache write-through | Tier 1: `spawn_blocking` for `fs.put()`. Tier 2: pass archive bytes directly | **Tens of ms (T1) to seconds (T2)** per remote hit | M — format compatibility for T2 |
| **6** | Env regex caching | Add `hashable_task_env_compiled()` + `wildcard_map_from_compiled()` using existing `CompiledWildcards` | **50-300ms** across 500 tasks | M — preserve Windows case-insensitive behavior |
| **9** | Change mapper lookup | Build `HashMap<package_root, PackageName>`, walk ancestors instead of O(n×m) scan | **10-200ms → 1-10ms** for --affected/watch | L-M — handle nested packages |
| **4** | SCM hash allocations | Store `GitHash([u8; 20])` instead of hex String; stream file content; single directory walk | **Tens to hundreds of ms** + peak memory reduction | M — hash stability risk |
| **1** | Engine config loading | Parallelize turbo.json IO with `buffer_unordered(64)` + optional on-disk mtime cache | **100-500ms** cold start (possibly seconds with cache) | M — cache invalidation bugs |

### P2 — Cleanup & Deferred

| # | Area | Fix | Impact | Risk |
|---|------|-----|--------|------|
| **12** | Loose env clone | Overlay pattern: `ExecutionEnv { base: &Map, overrides: SmallVec }` instead of full clone | **50-300ms** on 500 tasks | M-H — invasive API change |
| **3** | Glob pattern cache | `CompiledGlobWalker` type or bounded `moka::Cache<u64, Arc<Program>>` | **Tens to hundreds of ms** | M — cache key must include all options |
| **11** | GlobSet hash/eq | Store `include_raw: BTreeSet<String>` at construction, hash that | **Microseconds to low ms** (watch mode cleanup) | L |
| **14** | Package graph allocs | Pre-size HashMaps, intern PackageName as `Arc<str>` | **10-50ms** | L-M |
| **15** | Parallel task hash | Already shipped | N/A | N/A |

---

## Ripple Effects

- **#7 is prerequisite for #15's full benefit** — parallel hashing is shipped but serialized by tracker lock contention
- **#6 and #12 hit the same phase** — fixing only one makes the other the new bottleneck
- **#13 interacts with #2** — if you switch to "store archive bytes directly," both remote and local must use the same tar+zstd format
- **#4 and #5 can affect hash stability** — any representation change (raw bytes, Arc<str>) must preserve exact ordering/normalization
- **Windows/macOS differences** — env filtering is case-insensitive on Windows; caching keys must include platform flag

---

## What This Unlocks

Aggregate for a 100-package / 500-task / 10K-file repo:

- **Cold start**: **0.3–2.0s faster** (#10 + #5 + #1 + #14)
- **Task hash precompute**: **0.3–1.5s faster** (#7 + #6 + #8 + #12)
- **Cache miss archiving**: **Hundreds of ms to seconds** (#2)
- **Remote cache hit**: **Up to seconds faster** with less disk churn (#13)

Operationally: better CI scaling, watch-mode/--affected stops growing linearly with package count, more predictable latency.

---

## Effort & Risk

### Dependency Ordering (land in this sequence)

1. **Quick wins** (reduce noise for measuring rest): #8, #10, #2, #11
2. **Unlock parallel hashing**: #7 → re-measure #15 benefits
3. **Hot-path per-task overhead**: #6, #12
4. **Startup/affected/watch**: #5, #9, then #1/#14
5. **Cache I/O**: #13 Tier 1 → Tier 2

### Benchmark Suite Recipe

**A) Criterion microbenchmarks:**
- `turborepo-env/benches/env_wildcards.rs` — 200 vars, 20 patterns, compiled vs uncompiled
- `turborepo-lockfiles/benches/transitive_closure.rs` — real lockfile fixture, 100 workspaces
- `turborepo-cache/benches/archive_create.rs` — 10K files, single vs multi-thread zstd
- `turborepo-repository/benches/discovery.rs` — 100 packages, sequential vs buffer_unordered
- `turborepo-repository/benches/change_mapper.rs` — 100 packages, 10K file paths
- `turborepo-task-hash/benches/task_hash.rs` — 500 tasks, RwLock vs DashMap tracker

**B) End-to-end (hyperfine + generated fixture monorepo):**
```bash
# Cold start
rm -rf .turbo && hyperfine -w 3 './target/release/turbo run build --dry-run'

# Hash scaling
RAYON_NUM_THREADS=1 vs =N comparison

# Cache miss
turbo run build --force (clear cache, measure archive creation)

# Remote hit write-through
# Local stub remote cache server, measure latency before/after
```

**C) Tracing spans to standardize:**

- `engine.config_load.*`, `repository.discovery.*`, `lockfiles.transitive_closure.*`
- `task_hash.wave.*`, `task_hash.calculate.*`
- `cache.archive.create.*`, `cache.fetch.remote.*`, `cache.write_through.*`
- `affected.detect_package.*`

Return only the repaired markdown.

The investigation is complete. Here's a summary of the findings:

Key Findings: 20+ Performance Optimizations

Tier 1 — High Impact (4 wins)

  1. TaskHashTracker coarse RwLock — Single lock around 6 independent HashMaps serializes all concurrent writes. Replace with DashMap or per-field locks. (~1 day)
  2. Engine::execute spawns per-task — Thousands of Tokio tasks instead of a fixed worker pool. ~80% reduction in scheduling overhead. (~2 days)
  3. RepoGitIndex full clone for root package — Clones every (path, hash) pair when prefix is empty. Use Arc<str> or borrowed views. (~2 days)
  4. Wax glob compilation per call — Exclude globs recompiled every invocation instead of cached per task definition. (~1 day)

Tier 2 — Medium Impact (8 wins)

Wins 5-12 cover: per-task env Vec allocation, the CI vendor exclude string, expensive dedup key cloning, a std::Mutex in the async cancellation path, HTTP cache double-buffering of artifact bodies, deep clones of expanded_inputs, String vs Arc for hashes, and sequential cache archive creation.

Tier 3 — Easy Quick Wins (8 wins)

Wins 13-20 cover: fixed-size bytes for the dependency hash sort, a double sort of status_entries, OID string interning, Vec capacity hints, the error collector mutex, and SHA buffer sizing.

Architectural Wins

  • Streaming cache upload (no full in-memory buffer)
  • Parallel cache prefetch + restore
  • Content-addressable OID store

Full report with file locations, code examples, effort estimates, and risk assessments written to investigation.md.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975) db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972) d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974) e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973) 4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970) 75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971) b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969) 69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research Previously explored areas:

  • task-hash-tracker-lock-contention

Exploration

AREA: task-hash-tracker-lock-contention FILES: crates/turborepo-task-hash/src/lib.rs MATURITY: stable RELEVANCE: The coarse RwLock wrapping five independent HashMaps serializes all concurrent task hash insertions under heavy parallelism, making it the highest-contention bottleneck in the per-task hot path.


Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

Here is the repaired markdown with all required sections:


Current State

Turborepo's turbo run hot path has 9 distinct performance bottlenecks spanning config loading, dependency resolution, hashing, task scheduling, and caching. Recent PRs (#11969, #11970, #11972) landed parallel task hash computation and shallow wildcard fast paths, but significant gains remain.

Already optimized: Package graph construction (rayon), package.json parsing (rayon), workspace discovery (async), file hashing (rayon par_iter), shallow wildcard expansion (readdir + parallel stat), task hash precomputation (topological waves), globwalk regex patterns (OnceLock).

Required Changes

| # | Bottleneck | Location | Impact | Effort |
|---|-----------|----------|--------|--------|
| 1 | Serial turbo.json config loading in engine BFS | `engine/builder.rs:156-166` | 2-3x on config phase | Medium |
| 2 | Regex recompilation per task in env hashing | `turborepo-env/lib.rs:273-316` | 100-200ms/run saved | Low |
| 3 | Single RwLock on 6 independent HashMaps | `task-hash/lib.rs:225-242` | 20-30% less contention | Low |
| 4 | Redundant DFS in lockfile transitive closure | `lockfiles/lib.rs:260-286` | 30-50% fewer lookups | Medium |
| 5 | Single-threaded zstd compression at default level | `cache_archive/create.rs:115,151` | 2-4x faster compression | Low |
| 6 | Redundant String clones in SCM filtering | `scm/repo_index.rs:240-259` | Scales with repo size | Low |
| 7 | Double directory walk in manual mode | `scm/manual.rs:69-212` | 2x I/O reduction | Low |
| 8 | Unsized DashMap in lockfile cache | `lockfiles/lib.rs:135` | 5-10% on startup | Trivial |
| 9 | Per-task EnvironmentVariableMap clone | `visitor/mod.rs` precomputed `.clone()` | O(tasks) alloc savings | Trivial |

  • Biggest quick win: Zstd tuning (#5) — adjust compression level or add encoder.multithread(num_cpus) for 2-4x faster cache archive creation with a one-line change.
  • Biggest structural win: Parallel config loading (#1) — 2-3x on engine build phase for 50+ workspace repos by parallelizing TaskInheritanceResolver across workspaces with rayon.
  • Lowest risk: Pre-size DashMap (#8) and wrap env map in Arc (#9) — trivial changes with zero/low risk.

Ripple Effects

| # | Change | Propagation Scope | Affected Crates / APIs |
|---|--------|-------------------|------------------------|
| 1 | Parallel config loading | `TurboJsonLoader` trait must become Send + Sync; `tasks_set` needs synchronization (e.g. DashSet or post-merge collect). Test helper `TestTurboJsonLoader` in `builder.rs:1319+` needs update. | turborepo-engine |
| 2 | Pre-compiled regex for env hashing | `hashable_task_env()` signature changes to accept `&CompiledWildcards`; callers in `TaskHasher::calculate_task_hash()` must pre-compile patterns once. `CompiledWildcards` already exists — this extends its usage. | turborepo-env, turborepo-task-hash |
| 3 | Per-field DashMap instead of single RwLock | `TaskHashTrackerState` struct dissolves into individual DashMap fields; `into_task_hash_tracker_state()` (line 487) changes from `Arc::into_inner()` to per-map extraction. `#[derive(Serialize)]` on the state struct needs rework (fields are already `#[serde(skip)]`). `HashTrackerProvider` trait in turborepo-task-executor unaffected. | turborepo-task-hash |
| 4 | Cached sub-tree DFS in lockfile closure | Requires a `DashMap<String, Arc<HashSet<Package>>>` shared across rayon threads. Concurrent traversal of the same package by different threads needs careful ordering to avoid deadlock with the DashMap entry API. Each lockfile implementation (npm, pnpm, yarn1, berry, bun) tested indirectly. | turborepo-lockfiles |
| 5 | Zstd level / multithread | `CacheWriter::create()` and `from_writer()` gain a compression parameter or separate constructor. Caller in `fs.rs` passes config. Existing archives remain readable — zstd decompression is level-agnostic. | turborepo-cache |
| 6 | `Arc<str>` for git hashes | `GitHashes = HashMap<RelativeUnixPathBuf, String>` becomes `HashMap<RelativeUnixPathBuf, Arc<str>>`. Propagates through `scm/package_deps.rs`, `scm/repo_index.rs`, and into turborepo-task-hash consumers. The gix-index path (line 162) still allocates owned hex strings for OID encoding. | turborepo-scm, turborepo-task-hash |
| 7 | Merge double directory walk | The `ignore` crate's `WalkBuilder` doesn't expose per-file gitignore status separately from its filter. Merging requires either a custom override set or a single walk with post-hoc gitignore classification. Existing tests in `manual.rs:499-540` guard `include_default_files=true` behavior. | turborepo-scm |
| 8 | Pre-size DashMap | One-line change: `DashMap::with_capacity(workspaces.len())`. No API changes. | turborepo-lockfiles |
| 9 | `Arc<EnvironmentVariableMap>` | `exec_context()` factory and turborepo-task-executor must accept `Arc<EnvironmentVariableMap>` instead of owned. In `EnvMode::Loose` (common case), all tasks share one Arc; in `EnvMode::Strict`, each task gets its own. The `.clone()` at line 388 becomes a cheap ref-count bump. | turborepo-lib, turborepo-task-executor |
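Row 9's `Arc<EnvironmentVariableMap>` change is worth a sketch: in Loose mode every task can share one map, so the per-task clone becomes a ref-count bump instead of copying hundreds of entries. The names below are illustrative, not the real visitor API:

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Stand-in alias for the real turborepo-env type.
type EnvironmentVariableMap = HashMap<String, String>;

// Hands each task a handle to the shared env map. Previously this was an
// owned `.clone()` of the whole map per task (a deep copy).
fn dispatch_tasks(shared_env: Arc<EnvironmentVariableMap>, task_count: usize) -> usize {
    let mut entries_seen = 0;
    for _ in 0..task_count {
        let per_task = Arc::clone(&shared_env); // cheap ref-count bump
        entries_seen += per_task.len();
    }
    entries_seen
}

fn main() {
    let mut env = EnvironmentVariableMap::new();
    env.insert("PATH".into(), "/usr/bin".into());
    env.insert("HOME".into(), "/home/ci".into());
    let shared = Arc::new(env);

    assert_eq!(dispatch_tasks(Arc::clone(&shared), 500), 1000);
    // All per-task handles were dropped; only our reference remains.
    assert_eq!(Arc::strong_count(&shared), 1);
}
```

Strict mode still needs a per-task map, so the win applies mainly to the Loose default path, consistent with the "common case" note in the table.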

What This Unlocks

  • Sub-second config loading for large monorepos (50+ workspaces): Parallel TaskInheritanceResolver (#1) removes the last serial bottleneck in the engine build phase, enabling turbo run startup to scale with available cores instead of workspace count.
  • Flat per-task overhead regardless of env complexity: Pre-compiled regex (#2) + Arc env maps (#9) make per-task cost O(1) instead of O(env_vars × regex_patterns). This is critical for CI where 100+ tasks run in a single invocation.
  • Fully concurrent task hash tracking: Per-field DashMap (#3) eliminates cross-map contention, allowing rayon's topological wave scheduler to saturate all cores during the hash computation phase — the current bottleneck after #11970's precomputation landed.
  • Faster cache writes without sacrificing correctness: Zstd tuning (#5) directly reduces wall-clock time for cache misses — the most expensive path in turbo run. Combined with #8 (pre-sized DashMap), lockfile resolution also becomes tighter.
  • Foundation for incremental SCM hashing: Reducing String clones (#6) and eliminating double walks (#7) in the SCM layer are prerequisites for future incremental file hashing — where only changed files are re-hashed between runs. Without these, the allocation overhead of the current approach would dominate any incremental savings.
  • Better scaling curve: Collectively, these changes shift turbo run's performance profile from "O(workspaces × tasks)" to closer to "O(max(workspaces, tasks) / cores)" — a meaningful difference for repos with 100+ packages.

Effort & Risk

| # | Change | Effort | Risk | Mitigation |
|---|--------|--------|------|------------|
| 1 | Parallel config loading | Medium (2-3 days) | Medium — `TurboJsonLoader` must be thread-safe; cycle detection in `ResolutionState` needs per-thread isolation | Verify concrete `TurboJsonLoader` impls are Send+Sync; existing tests in `builder.rs:1180+` cover resolver logic |
| 2 | Pre-compiled regex | Low (half day) | Low — `CompiledWildcards` already exists with identical logic; extending it to `hashable_task_env` is a known pattern | Existing tests for `CompiledWildcards` in `lib.rs:640+` validate correctness |
| 3 | Per-field DashMap | Low (1 day) | Medium — `into_task_hash_tracker_state()` extraction and serde derive need rework | Concurrent access tests at `lib.rs:762+` serve as regression guards |
| 4 | Cached DFS sub-trees | Medium (2-3 days) | Medium-High — concurrent sub-tree traversal by rayon threads risks deadlock with the DashMap entry API | Start with a simpler `Mutex<HashMap>` cache; each lockfile type has its own test suite |
| 5 | Zstd level/multithread | Low (hours) | Low — zstd decompression is format-agnostic; existing archives remain readable | Round-trip tests in `test_cases.rs` and `fs.rs` exercise cache read/write transparently |
| 6 | `Arc<str>` for git hashes | Low (1 day) | Medium — `GitHashes` type change propagates through SCM and task-hash crates | Package-level SCM tests in `package_deps.rs:390+` validate the interface |
| 7 | Merge double walk | Low-Medium (1-2 days) | Medium — `ignore` crate doesn't expose per-file gitignore status; merging semantics are subtle | Strong test suite in `manual.rs:215-540+` with test_case attributes covers `include_default_files` behavior |
| 8 | Pre-size DashMap | Trivial (minutes) | None — purely additive, no behavioral change | N/A |
| 9 | `Arc<EnvironmentVariableMap>` | Trivial (hours) | Low — localized to visitor and exec_context factory; `Arc::clone()` is a ref-count bump | End-to-end integration tests cover the run pipeline |

Recommended execution order: #8 → #9 → #5 → #2 → #3 → #6 → #1 → #7 → #4 (ascending risk, maximizing early wins)

Verification strategy: Chrome tracing (turbo run build --profile), full test suite (cargo nextest run --workspace), integration tests (turbo run test --filter=turborepo-tests-integration), and cache correctness checks (zstd -t on archives).


Oracle Session (inlined)

Plan Markdown Repair

The previous draft failed validation against the required output format. Rewrite it to satisfy the exact contract below while preserving useful details.

Mode: research

Output Format Contract (REQUIRED)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Use the exact section headings above and include substantive content in each section.

Validation Errors

  • Missing required section heading: ## Ripple Effects
  • Missing required section heading: ## What This Unlocks
  • Missing required section heading: ## Effort & Risk

Invalid Draft

## Current State

Turborepo's `turbo run` hot path has **9 distinct performance bottlenecks** spanning config loading, dependency resolution, hashing, task scheduling, and caching. Recent PRs (#11969, #11970, #11972) landed parallel task hash computation and shallow wildcard fast paths, but significant gains remain.

**Already optimized:** Package graph construction (rayon), package.json parsing (rayon), workspace discovery (async), file hashing (rayon par_iter), shallow wildcard expansion (readdir + parallel stat), task hash precomputation (topological waves), globwalk regex patterns (OnceLock).

## Required Changes

| # | Bottleneck | Location | Impact | Effort |
|---|-----------|----------|--------|--------|
| 1 | Serial turbo.json config loading in engine BFS | `engine/builder.rs:156-166` | 2-3x on config phase | Medium |
| 2 | Regex recompilation per task in env hashing | `turborepo-env/lib.rs:354-383` | 100-200ms/run saved | Low |
| 3 | Single RwLock on 6 independent HashMaps | `task-hash/lib.rs:225-242` | 20-30% less contention | Low |
| 4 | Redundant DFS in lockfile transitive closure | `lockfiles/lib.rs:260-286` | 30-50% fewer lookups | Medium |
| 5 | Single-threaded zstd compression | `cache_archive/create.rs:115,151` | 2-4x faster compression | Low |
| 6 | Redundant String clones in SCM filtering | `scm/repo_index.rs:240-259` | Scales with repo size | Low |
| 7 | Double directory walk in manual mode | `scm/manual.rs:69-212` | 2x I/O reduction | Low |
| 8 | Unsized DashMap in lockfile cache | `lockfiles/lib.rs:135` | 5-10% on startup | Trivial |
| 9 | Per-task EnvironmentVariableMap clone | `visitor/mod.rs` precomputed `.clone()` | O(tasks) alloc savings | Trivial |

The full analysis with exact code changes, verification commands, ripple effects, and risk assessment is in the output file. Key highlights:

- **Biggest quick win:** Multi-threaded zstd (#5) — essentially a one-liner (`encoder.multithread(4)`) for 2-4x faster cache archive creation
- **Biggest structural win:** Parallel config loading (#1) — 2-3x on engine build phase for 50+ workspace repos
- **Lowest risk:** Pre-size DashMap (#8) and remove env map clone (#9) — trivial changes, zero/low risk

**Verification strategy:** Chrome tracing (`turbo run build --profile`), full test suite (`cargo nextest run --workspace`), integration tests (`turbo run test --filter=turborepo-tests-integration`), and cache correctness checks (`zstd -t` on archives).

Return only the repaired markdown.

The Oracle session failed due to a browser issue. No matter — I have comprehensive research from all exploration agents. Let me compile the final analysis.


Current State

The turborepo codebase has seen aggressive recent optimization (7+ perf commits in the last 2 weeks). The critical turbo run path follows this pipeline:

CLI parse → Package graph build → Engine build → Parallel hash scope
  → Topological task hash precompute → DAG walker → Task dispatch → Execution

Already optimized (recent commits):

  • Task hash computation parallelized across topological waves: 270ms → 53ms on 1700 tasks (b3c0f46da8)
  • Engine builder: turbo_json_chain_cache + FixedMap caching: 283ms → 74ms (e145bc6b74)
  • Engine callback sent before tracker bookkeeping: -41ms (4b5410b397)
  • File hash deduplication by (package_path, globs, default): 1.48x speedup (57cf69c2a4)
  • Shallow wildcard fast paths in globwalk: 54ms → 6ms per walk (db01cb4490)
  • Literal input paths resolved via stat instead of glob walk (b0048b034f)
  • TLS initialization deferred to background thread (f1d487fff3)
  • SCM allocation reduction: hex buffer reuse, BufReader 8KB→64KB, binary-search status lookups (d16f081679, a0e90cb1c5)

Remaining bottlenecks ranked by expected wall-clock impact on ~1000 packages / 1700 tasks:

Required Changes

1. Task Env Regex Recompilation (HIGH — ~20-50ms savings)

Problem: hashable_task_env at crates/turborepo-env/src/lib.rs:354 calls from_wildcards (line 362) and wildcard_map_from_wildcards_unresolved (line 364), each invoking wildcard_map_from_wildcards (line 273) which recompiles regex via RegexBuilder::new().build() on every single task. With 1700 tasks, that's 3400+ regex compilations per build.

CompiledWildcards already exists (line 173) and is used for the pass-through path (pass_through_env_compiled at line 426), but NOT for the hash path.

Fix:

  1. Add a hashable_task_env_compiled method to EnvironmentVariableMap accepting &CompiledWildcards for the computed_wildcards parameter
  2. Add a wildcard_map_from_compiled_wildcards_unresolved method for the task_env parameter
  3. In TaskHasher::new() (crates/turborepo-task-hash/src/lib.rs:258), pre-compile framework wildcards per unique pattern set
  4. Cache compiled task-env patterns at the TaskHasher level keyed by pattern set

Verification:

# Existing tests cover correctness parity:
cargo test -p turborepo-env -- test_compiled_wildcards
cargo test -p turborepo-env -- test_hashable_env
cargo test -p turborepo-task-hash

# Perf verification — profile with tracing on large monorepo:
turbo run build --profile 2>&1 | grep -E "hashable_task_env|calculate_task_hash"
# Compare median of 5 warm runs before/after

2. Lockfile Transitive Closure Pre-sizing & Subgraph Memoization (MEDIUM — ~10-30ms savings)

Problem: all_transitive_closures at crates/turborepo-lockfiles/src/lib.rs:130 creates DashMap::new() (line 135) with default capacity. With 1000+ packages each having 50-200 transitive deps, the DashMap undergoes repeated resizing. Additionally, when workspaces A and B both depend on package X, X's entire subgraph is walked twice independently — only individual resolution lookups are cached, not subtree closures.

Fix:

  1. Pre-size the DashMap: DashMap::with_capacity(lockfile_entry_count) at line 135
  2. Add a memoization cache for complete transitive closures: DashMap<Package, HashSet<Package>> — when walk() (line 260) completes a package's closure, store it so subsequent walks skip recomputation
  3. For npm lockfile (crates/turborepo-lockfiles/src/npm.rs:84): build a dependency index during parse (like pnpm already does at pnpm/data.rs:230) instead of recomputing all_dependencies() dynamically
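The subtree memoization in step 2 can be sketched as a DFS that caches each package's completed closure behind an `Arc`, so overlapping workspace walks compute each subtree once. This single-threaded, stdlib-only version assumes an acyclic graph; the real walk runs under rayon with a concurrent map and must tolerate cycles:

```rust
use std::collections::{HashMap, HashSet};
use std::sync::Arc;

// deps: package -> direct dependencies. `memo` stores each package's full
// transitive set once computed; shared subtrees are never walked twice.
fn closure(
    pkg: &str,
    deps: &HashMap<&str, Vec<&str>>,
    memo: &mut HashMap<String, Arc<HashSet<String>>>,
) -> Arc<HashSet<String>> {
    if let Some(cached) = memo.get(pkg) {
        return Arc::clone(cached);
    }
    let mut out = HashSet::new();
    for &d in deps.get(pkg).into_iter().flatten() {
        out.insert(d.to_string());
        out.extend(closure(d, deps, memo).iter().cloned());
    }
    let out = Arc::new(out);
    memo.insert(pkg.to_string(), Arc::clone(&out));
    out
}

fn main() {
    let deps: HashMap<&str, Vec<&str>> = HashMap::from([
        ("app-a", vec!["x"]),
        ("app-b", vec!["x"]),
        ("x", vec!["y"]),
        ("y", vec![]),
    ]);
    let mut memo = HashMap::new();
    let a = closure("app-a", &deps, &mut memo);
    let b = closure("app-b", &deps, &mut memo); // reuses the memoized "x" subtree
    assert!(a.contains("y") && b.contains("y"));
    assert_eq!(memo.len(), 4); // each package's closure computed exactly once
}
```

The deadlock risk noted later (threads racing on the DashMap entry API for the same subtree) is exactly what this sequential sketch sidesteps; a first production cut could use a `Mutex<HashMap>` as the ripple-effects table suggests.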

Verification:

cargo test -p turborepo-lockfiles
# Correctness: transitive closures must be identical before/after
# Perf: trace `all_transitive_closures` span in --profile output

3. Multi-threaded zstd Cache Compression (MEDIUM — ~5-20ms per cache-miss task)

Problem: crates/turborepo-cache/src/cache_archive/create.rs:115 creates zstd::Encoder::new(writer, 0) — single-threaded compression. For large Next.js builds (hundreds of output chunks, 50-200MB archives), compression is the bottleneck on cache-miss tasks.

Fix:

  1. Enable multi-threaded zstd: encoder.multithread(num_cpus::get() as u32)? after creating the encoder
  2. Consider raising compression level to 1-3 for better ratio with minimal speed cost (level 0 = default = ~3, so this may already be fine)
  3. For decompression (crates/turborepo-cache/src/cache_archive/restore.rs:26): the zstd streaming API doesn't support multi-threaded decode, but the 8KB restore buffer could be enlarged (e.g. to 64-128KB) to cut per-read overhead
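The zstd crate's `Encoder` does expose `multithread(n_workers)`; the main judgment call is the worker count. A std-only sketch of that choice (`zstd_workers` is a hypothetical helper — the actual change would call `encoder.multithread(zstd_workers(cap))?` after constructing the encoder):

```rust
use std::thread;

/// Choose a zstd worker count: available cores, capped so small
/// archives don't pay thread spin-up cost, falling back to 1 when
/// parallelism can't be queried.
fn zstd_workers(cap: u32) -> u32 {
    thread::available_parallelism()
        .map(|n| n.get() as u32)
        .unwrap_or(1)
        .min(cap)
}

fn main() {
    let workers = zstd_workers(8);
    assert!((1..=8).contains(&workers));
    println!("zstd workers: {workers}");
}
```

Capping matters because each zstd worker holds its own compression window, so peak memory grows with the worker count.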

Verification:

cargo test -p turborepo-cache
# Perf: benchmark archive creation on a large output set:
# Create test archive, measure wall time before/after
# Verify archive is bit-for-bit decompressible (content hash matches)

Caveat: This only helps cache-miss tasks. On full-cache-hit runs, compression doesn't execute. The win scales with output size per task.

4. SCM Hash String Allocation Reduction (LOW-MEDIUM — ~5-15ms savings)

Problem: crates/turborepo-scm/src/hash_object.rs:69, ls_tree.rs:74, and repo_index.rs:158-162 all allocate heap Strings from stack-based [u8; 40] hex buffers for every file. In a 1000-package monorepo with ~100K tracked files, that's ~100K String allocations.

Additionally, repo_index.rs:240-244 clones the entire ls_tree_hashes HashMap (all paths and hashes) when the prefix is empty.

Fix:

  1. Use a compact 20-byte binary hash representation ([u8; 20]) instead of 40-char hex strings internally, converting to hex only at serialization boundaries
  2. In get_package_hashes (repo_index.rs:240), when prefix is empty, return a borrowed view or Cow instead of cloning the entire map
  3. Use Arc<str> for hash strings to enable cheap cloning across shared substructures
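Fix 1 hinges on keeping hashes as 20 raw bytes internally and converting at the edges. A std-only sketch of the boundary conversions (helper names are hypothetical; production code could use the hex crate the codebase already uses elsewhere):

```rust
use std::fmt::Write;

/// Parse a 40-char git SHA-1 hex string into 20 raw bytes. Storing the
/// binary form internally halves memory and avoids per-file String
/// allocations; hex appears only at serialization boundaries.
fn hex_to_bytes(hex: &str) -> Option<[u8; 20]> {
    if hex.len() != 40 || !hex.is_ascii() {
        return None;
    }
    let mut out = [0u8; 20];
    for (i, byte) in out.iter_mut().enumerate() {
        *byte = u8::from_str_radix(&hex[i * 2..i * 2 + 2], 16).ok()?;
    }
    Some(out)
}

fn bytes_to_hex(bytes: &[u8; 20]) -> String {
    let mut s = String::with_capacity(40);
    for b in bytes {
        let _ = write!(s, "{b:02x}");
    }
    s
}

fn main() {
    let hex = "2fd4e1c67a2d28fced849ee1bb76e7391b93eb12";
    let bin = hex_to_bytes(hex).expect("valid 40-char hex");
    assert_eq!(bytes_to_hex(&bin), hex); // lossless round-trip
}
```

Because `[u8; 20]` is `Copy`, passing hashes between packages becomes a register-sized copy instead of a heap clone, which is where the ~100K-allocation win comes from.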

Verification:

cargo test -p turborepo-scm
# Perf: measure with large repo, compare allocation counts via DHAT or similar

5. Global Hash Env Var Regex (LOW — ~1-3ms savings)

Problem: get_global_hashable_env_vars at crates/turborepo-env/src/lib.rs:493 calls from_wildcards(DEFAULT_ENV_VARS), recompiling the DEFAULT_ENV_VARS wildcard regexes (currently two patterns) on every run. Minor since it's only called once per run, but the compiled form could live in a lazy static.

Fix: Use CompiledWildcards::compile(DEFAULT_ENV_VARS) as a once_cell::sync::Lazy static.
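A minimal sketch of the proposed lazy static, using std's `LazyLock` in place of `once_cell::sync::Lazy` (equivalent since Rust 1.80). The `Vec<String>` stands in for the real `CompiledWildcards`, and the counter exists only to demonstrate single initialization:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::LazyLock;

// Counts initializations; exists only to demonstrate the once-ness.
static COMPILE_COUNT: AtomicUsize = AtomicUsize::new(0);

/// Stand-in for a `CompiledWildcards` built from DEFAULT_ENV_VARS:
/// the closure runs exactly once, on first access, however many call
/// sites read the static afterwards.
static DEFAULT_ENV_PATTERNS: LazyLock<Vec<String>> = LazyLock::new(|| {
    COMPILE_COUNT.fetch_add(1, Ordering::SeqCst);
    ["VERCEL_ANALYTICS_ID", "VERCEL_TARGET_ENV"]
        .iter()
        .map(|s| s.to_string())
        .collect()
});

fn main() {
    for _ in 0..3 {
        assert_eq!(DEFAULT_ENV_PATTERNS.len(), 2); // first read initializes
    }
    assert_eq!(COMPILE_COUNT.load(Ordering::SeqCst), 1);
}
```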

Verification:

cargo test -p turborepo-env -- test_global_env

Ripple Effects

| Change | Affected Crates | Risk |
| --- | --- | --- |
| Env regex caching | turborepo-env, turborepo-task-hash | Low — additive API, existing tests cover parity |
| Lockfile pre-sizing | turborepo-lockfiles | Low — pure optimization, no behavioral change |
| Lockfile subtree memoization | turborepo-lockfiles | Medium — must ensure memoized closures are identical to non-memoized; concurrent mutation via DashMap needs care |
| Multi-threaded zstd | turborepo-cache | Medium — must verify archives remain compatible; multi-threaded mode changes internal framing but output is still valid zstd |
| SCM alloc reduction | turborepo-scm, turborepo-task-hash | Medium — changing hash representation from String to binary is a cross-cutting concern; needs careful type migration |
| npm lockfile indexing | turborepo-lockfiles | Low — pnpm already uses this pattern successfully |

Cross-cutting concerns:

  • Multi-threaded zstd increases peak memory usage (each thread needs a window)
  • Rayon thread pool is already heavily used; adding more parallelism could cause contention on smaller machines
  • fd limits: already handled via retry-with-backoff in globwalk/hash_objects layers

What This Unlocks

  • Env regex fix (#1): Removes the last O(N tasks) regex compilation from the critical path, making precompute_task_hashes even faster (currently ~53ms on 1700 tasks, could drop to ~30-40ms)
  • Lockfile memoization (#2): Makes startup time sublinear in shared dependency count — large enterprise monorepos with deep shared dependency trees benefit most
  • Multi-threaded zstd (#3): Directly reduces the "first build" and "cache-miss" experience, which is the most frustrating user-facing latency
  • Combined: On a 1000-package monorepo with 1700 tasks, estimated 30-80ms total reduction on warm cached runs, 100-500ms on cache-miss runs (from zstd parallelism)

Effort & Risk

| # | Change | Effort | Risk | Impact |
| --- | --- | --- | --- | --- |
| 1 | Task env regex caching | Small (1-2 days) — pattern already exists in pass_through_env_compiled | Low — parity tests exist | High — hot path, O(N) → O(unique_patterns) |
| 2 | Lockfile DashMap pre-sizing | Tiny (hours) — single line change | Minimal | Medium — eliminates rehashing |
| 2b | Lockfile subtree memoization | Medium (2-3 days) — needs careful concurrent cache design | Medium — correctness-critical | Medium — scales with shared deps |
| 2c | npm dependency indexing | Small (1-2 days) — follow pnpm's existing pattern | Low | Medium (npm users only) |
| 3 | Multi-threaded zstd | Tiny (hours) — one method call | Low-Medium — verify archive compat | High on cache-miss, zero on cache-hit |
| 4 | SCM binary hash repr | Large (3-5 days) — cross-crate type change | Medium — wide blast radius | Low-Medium — marginal per-file savings |
| 5 | Global env lazy static | Tiny (hours) | Minimal | Low — one-time cost |

Recommended priority order: #1 → #3 → #2 → #2c → #5 → #4

Assumption: These estimates are for a ~1000-package monorepo. Smaller repos (5-50 packages) will see proportionally smaller gains from #1 and #2 but similar gains from #3. The existing optimizations already address the largest bottlenecks — remaining wins are incremental but compound meaningfully at scale.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research

Previously explored areas:

  • engine-builder-parallel-config-loading — Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs. Maturity: stable. Key finding: the EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression — Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs. Maturity: stable. Key finding: cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
  • globwalk-pattern-caching — Files: crates/turborepo-globwalk/src/lib.rs, crates/turborepo-run-cache/src/lib.rs, crates/turborepo-scm/src/package_deps.rs, crates/turborepo-types/src/lib.rs. Maturity: stable. Key finding: glob patterns are recompiled from scratch on every globwalk() call with zero caching — in a 100-package monorepo the same dist/** pattern gets regex-compiled 100+ times per build across save_outputs and file-hashing paths; adding a compiled-pattern cache is a low-risk, high-leverage win.
  • scm-package-hash-alloc-reduction — Files: crates/turborepo-scm/src/hash_object.rs, crates/turborepo-scm/src/repo_index.rs, crates/turborepo-scm/src/manual.rs, crates/turborepo-scm/src/ls_tree.rs, crates/turborepo-task-hash/src/lib.rs. Maturity: stable. Key finding: per-file String allocations from hex encoding, redundant hash clones across packages, and a double directory walk in manual mode create measurable overhead on every build — these are hot-path wins that scale with repo size.
  • lockfile-transitive-closure — Files: crates/turborepo-lockfiles/src/lib.rs, crates/turborepo-lockfiles/src/berry/*.rs, crates/turborepo-lockfiles/src/pnpm/*.rs, crates/turborepo-lockfiles/src/npm.rs, crates/turborepo-lockfiles/src/bun.rs. Maturity: stable. Key finding: all_transitive_closures sits on the critical startup path — every turbo run blocks on it, and an unsized DashMap plus redundant per-workspace DFS walks over shared dependency subgraphs leave significant time on the table for large monorepos.
  • task-env-regex-recompilation — Files: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs. Maturity: stable. Key finding: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop.

Exploration

AREA: task-env-regex-recompilation FILES: crates/turborepo-env/src/lib.rs, crates/turborepo-task-hash/src/lib.rs MATURITY: stable RELEVANCE: hashable_task_env recompiles regex via wildcard_map_from_wildcards on every task despite CompiledWildcards already existing for the pass-through path — extending it here eliminates O(tasks) regex compilations in the hot loop. NOTES: (unspecified)


Selected Code Context

===== FILE: crates/turborepo-env/src/lib.rs =====

//! Environment variable filtering for tasks and hashing for cache keys.

#![deny(clippy::all)]

use std::{ collections::HashMap, env, ops::{Deref, DerefMut}, };

use regex::{Regex, RegexBuilder}; use serde::Serialize; use sha2::{Digest, Sha256}; use thiserror::Error;

pub mod platform;

const DEFAULT_ENV_VARS: &[&str] = ["VERCEL_ANALYTICS_ID", "VERCEL_TARGET_ENV"].as_slice();

pub const BUILTIN_PASS_THROUGH_ENV: &[&str] = &[
    "HOME", "USER", "TZ", "LANG", "SHELL", "PWD", "XDG_RUNTIME_DIR", "XAUTHORITY",
    "DBUS_SESSION_BUS_ADDRESS", "CI", "NODE_OPTIONS", "COREPACK_HOME",
    "LD_LIBRARY_PATH", "DYLD_FALLBACK_LIBRARY_PATH", "LIBPATH", "LD_PRELOAD",
    "DYLD_INSERT_LIBRARIES", "COLORTERM", "TERM", "TERM_PROGRAM", "DISPLAY",
    "TMP", "TEMP",
    // Windows
    "WINDIR", "ProgramFiles", "ProgramFiles(x86)",
    // VSCode IDE
    "VSCODE_*", "ELECTRON_RUN_AS_NODE",
    // Docker
    "DOCKER_*", "BUILDKIT_*",
    // Docker compose
    "COMPOSE_*",
    // Jetbrains IDE
    "JB_IDE_*", "JB_INTERPRETER", "JETBRAINS_TEST_RUNNER_RUN_SCOPE_TYPE",
    // Vercel specific
    "VERCEL", "VERCEL_*", "NEXT_*", "USE_OUTPUT_FOR_EDGE_FUNCTIONS", "NOW_BUILDER",
    "VC_MICROFRONTENDS_CONFIG_FILE_NAME",
    // GitHub Actions
    "GITHUB_*", "RUNNER_*",
    // Command Prompt casing of env variables
    "APPDATA", "PATH", "PROGRAMDATA", "SYSTEMROOT", "SYSTEMDRIVE", "USERPROFILE",
    "HOMEDRIVE", "HOMEPATH", "PNPM_HOME", "NPM_CONFIG_STORE_DIR",
];

#[derive(Clone, Debug, Error)] pub enum Error { #[error("Failed to parse regex: {0}")] Regex(#[from] regex::Error), }

// TODO: Consider using immutable data structures here
#[derive(Clone, Debug, Default, Serialize, PartialEq)]
#[serde(transparent)]
pub struct EnvironmentVariableMap(HashMap<String, String>);

impl EnvironmentVariableMap {
    // Returns a deterministically sorted set of EnvironmentVariablePairs
    // from an EnvironmentVariableMap.
    // This is the value that is used upstream as a task hash input,
    // so we need it to be deterministic
    pub fn to_hashable(&self) -> EnvironmentVariablePairs {
        let mut list: Vec<_> = self.iter().map(|(k, v)| format!("{k}={v}")).collect();
        list.sort();

    list
}

pub fn names(&self) -> Vec<String> {
    let mut names: Vec<_> = self.keys().cloned().collect();
    names.sort();

    names
}

// Returns a deterministically sorted set of EnvironmentVariablePairs
// from an EnvironmentVariableMap
// This is the value used to print out the task hash input,
// so the values are cryptographically hashed
pub fn to_secret_hashable(&self) -> EnvironmentVariablePairs {
    let mut pairs: Vec<String> = self
        .iter()
        .map(|(k, v)| {
            if !v.is_empty() {
                let mut hasher = Sha256::new();
                hasher.update(v.as_bytes());
                let hash = hasher.finalize();
                let hexed_hash = hex::encode(hash);
                format!("{k}={hexed_hash}")
            } else {
                format!("{k}=")
            }
        })
        .collect();
    // Make it deterministic to facilitate comparisons
    pairs.sort();
    pairs
}

}

// BySource contains a map of environment variables broken down by the source
#[derive(Debug, Default, Serialize, Clone)]
pub struct BySource {
    pub explicit: EnvironmentVariableMap,
    pub matching: EnvironmentVariableMap,
}

// DetailedMap contains the composite and the detailed maps of environment
// variables. All is used as a taskhash input (taskhash.CalculateTaskHash)
// BySource is used by dry runs and run summaries
#[derive(Debug, Default, Serialize, Clone)]
pub struct DetailedMap {
    pub all: EnvironmentVariableMap,
    pub by_source: BySource,
}

// A list of "k=v" strings for env variables and their values
pub type EnvironmentVariablePairs = Vec<String>;

// WildcardMaps is a pair of EnvironmentVariableMaps.
#[derive(Debug)]
pub struct WildcardMaps {
    pub inclusions: EnvironmentVariableMap,
    pub exclusions: EnvironmentVariableMap,
}

impl WildcardMaps {
    // Resolve collapses a WildcardSet into a single EnvironmentVariableMap.
    fn resolve(self) -> EnvironmentVariableMap {
        let mut output = self.inclusions;
        output.difference(&self.exclusions);
        output
    }
}

/// Pre-compiled include/exclude wildcard regexes. Compile once and reuse
/// across tasks that share the same wildcard patterns.
pub struct CompiledWildcards {
    include_regex: Option<Regex>,
    exclude_regex: Option<Regex>,
}

impl CompiledWildcards {
    pub fn compile(wildcard_patterns: &[impl AsRef<str>]) -> Result<Self, Error> {
        let mut include_patterns = Vec::new();
        let mut exclude_patterns = Vec::new();

    for wildcard_pattern in wildcard_patterns {
        let wildcard_pattern = wildcard_pattern.as_ref();
        if let Some(rest) = wildcard_pattern.strip_prefix('!') {
            exclude_patterns.push(wildcard_to_regex_pattern(rest));
        } else if wildcard_pattern.starts_with("\\!") {
            include_patterns.push(wildcard_to_regex_pattern(&wildcard_pattern[1..]));
        } else {
            include_patterns.push(wildcard_to_regex_pattern(wildcard_pattern));
        }
    }

    let case_insensitive = cfg!(windows);

    let include_regex = if include_patterns.is_empty() {
        None
    } else {
        let pattern = format!("^({})$", include_patterns.join("|"));
        Some(
            RegexBuilder::new(&pattern)
                .case_insensitive(case_insensitive)
                .build()?,
        )
    };

    let exclude_regex = if exclude_patterns.is_empty() {
        None
    } else {
        let pattern = format!("^({})$", exclude_patterns.join("|"));
        Some(
            RegexBuilder::new(&pattern)
                .case_insensitive(case_insensitive)
                .build()?,
        )
    };

    Ok(CompiledWildcards {
        include_regex,
        exclude_regex,
    })
}

}

impl From<HashMap<String, String>> for EnvironmentVariableMap { fn from(map: HashMap<String, String>) -> Self { EnvironmentVariableMap(map) } }

impl Deref for EnvironmentVariableMap { type Target = HashMap<String, String>;

fn deref(&self) -> &Self::Target {
    &self.0
}

}

impl DerefMut for EnvironmentVariableMap { fn deref_mut(&mut self) -> &mut Self::Target { &mut self.0 } }

impl EnvironmentVariableMap { pub fn infer() -> Self { EnvironmentVariableMap(env::vars().collect()) }

pub fn into_inner(self) -> HashMap<String, String> {
    self.0
}

// Takes another EnvironmentVariableMap and adds it into `self`
// Overwrites values if they already exist.
pub fn union(&mut self, another: &EnvironmentVariableMap) {
    for (key, value) in &another.0 {
        self.0.insert(key.clone(), value.clone());
    }
}

// Takes another EnvironmentVariableMap and removes matching keys
// from `self`
pub fn difference(&mut self, another: &EnvironmentVariableMap) {
    for key in another.0.keys() {
        self.0.remove(key);
    }
}

// returns a WildcardMaps after processing wildcards against it.
fn wildcard_map_from_wildcards(
    &self,
    wildcard_patterns: &[impl AsRef<str>],
) -> Result<WildcardMaps, Error> {
    let mut output = WildcardMaps {
        inclusions: EnvironmentVariableMap::default(),
        exclusions: EnvironmentVariableMap::default(),
    };

    let mut include_patterns = Vec::new();
    let mut exclude_patterns = Vec::new();

    for wildcard_pattern in wildcard_patterns {
        let wildcard_pattern = wildcard_pattern.as_ref();
        if let Some(rest) = wildcard_pattern.strip_prefix('!') {
            let exclude_pattern = wildcard_to_regex_pattern(rest);
            exclude_patterns.push(exclude_pattern);
        } else if wildcard_pattern.starts_with("\\!") {
            let include_pattern = wildcard_to_regex_pattern(&wildcard_pattern[1..]);
            include_patterns.push(include_pattern);
        } else {
            let include_pattern = wildcard_to_regex_pattern(wildcard_pattern);
            include_patterns.push(include_pattern);
        }
    }

    let include_regex_string = format!("^({})$", include_patterns.join("|"));
    let exclude_regex_string = format!("^({})$", exclude_patterns.join("|"));

    let case_insensitive = cfg!(windows);
    let include_regex = RegexBuilder::new(&include_regex_string)
        .case_insensitive(case_insensitive)
        .build()?;
    let exclude_regex = RegexBuilder::new(&exclude_regex_string)
        .case_insensitive(case_insensitive)
        .build()?;
    for (env_var, env_value) in &self.0 {
        if !include_patterns.is_empty() && include_regex.is_match(env_var) {
            output.inclusions.insert(env_var.clone(), env_value.clone());
        }
        if !exclude_patterns.is_empty() && exclude_regex.is_match(env_var) {
            output.exclusions.insert(env_var.clone(), env_value.clone());
        }
    }

    Ok(output)
}

// Returns an EnvironmentVariableMap containing the variables
// in the environment which match an array of wildcard patterns.
pub fn from_wildcards(
    &self,
    wildcard_patterns: &[impl AsRef<str>],
) -> Result<EnvironmentVariableMap, Error> {
    if wildcard_patterns.is_empty() {
        return Ok(EnvironmentVariableMap::default());
    }

    let resolved_set = self.wildcard_map_from_wildcards(wildcard_patterns)?;
    Ok(resolved_set.resolve())
}

// FromWildcardsUnresolved returns a wildcardSet specifying the inclusions and
// exclusions discovered from a set of wildcard patterns. This is used to ensure
// that user exclusions have primacy over inferred inclusions.
pub fn wildcard_map_from_wildcards_unresolved(
    &self,
    wildcard_patterns: &[impl AsRef<str>],
) -> Result<WildcardMaps, Error> {
    if wildcard_patterns.is_empty() {
        return Ok(WildcardMaps {
            inclusions: EnvironmentVariableMap::default(),
            exclusions: EnvironmentVariableMap::default(),
        });
    }

    self.wildcard_map_from_wildcards(wildcard_patterns)
}

/// Return a detailed map for which environment variables are factored into
/// the task's hash
pub fn hashable_task_env(
    &self,
    computed_wildcards: &[String],
    task_env: &[String],
) -> Result<DetailedMap, Error> {
    let mut explicit_env_var_map = EnvironmentVariableMap::default();
    let mut all_env_var_map = EnvironmentVariableMap::default();
    let mut matching_env_var_map = EnvironmentVariableMap::default();
    let inference_env_var_map = self.from_wildcards(computed_wildcards)?;

    let user_env_var_set = self.wildcard_map_from_wildcards_unresolved(task_env)?;

    all_env_var_map.union(&user_env_var_set.inclusions);
    all_env_var_map.union(&inference_env_var_map);
    all_env_var_map.difference(&user_env_var_set.exclusions);

    explicit_env_var_map.union(&user_env_var_set.inclusions);
    explicit_env_var_map.difference(&user_env_var_set.exclusions);

    matching_env_var_map.union(&inference_env_var_map);
    matching_env_var_map.difference(&user_env_var_set.exclusions);

    Ok(DetailedMap {
        all: all_env_var_map,
        by_source: BySource {
            explicit: explicit_env_var_map,
            matching: matching_env_var_map,
        },
    })
}

/// Constructs an environment map that contains pass through environment
/// variables
pub fn pass_through_env(
    &self,
    builtins: &[&str],
    global_env: &Self,
    task_pass_through: &[impl AsRef<str>],
) -> Result<Self, Error> {
    let mut pass_through_env = EnvironmentVariableMap::default();
    let default_env_var_pass_through_map = self.from_wildcards(builtins)?;
    let task_pass_through_env =
        self.wildcard_map_from_wildcards_unresolved(task_pass_through)?;

    pass_through_env.union(&default_env_var_pass_through_map);
    pass_through_env.union(global_env);
    pass_through_env.union(&task_pass_through_env.inclusions);
    pass_through_env.difference(&task_pass_through_env.exclusions);

    Ok(pass_through_env)
}

/// Like `from_wildcards` but uses pre-compiled regexes.
pub fn from_compiled_wildcards(&self, compiled: &CompiledWildcards) -> EnvironmentVariableMap {
    let mut output = EnvironmentVariableMap::default();
    for (env_var, env_value) in &self.0 {
        let included = compiled
            .include_regex
            .as_ref()
            .is_some_and(|re| re.is_match(env_var));
        let excluded = compiled
            .exclude_regex
            .as_ref()
            .is_some_and(|re| re.is_match(env_var));
        if included && !excluded {
            output.insert(env_var.clone(), env_value.clone());
        }
    }
    output
}

/// Like `pass_through_env` but uses pre-compiled builtin wildcards.
pub fn pass_through_env_compiled(
    &self,
    compiled_builtins: &CompiledWildcards,
    global_env: &Self,
    task_pass_through: &[impl AsRef<str>],
) -> Result<Self, Error> {
    let default_env_var_pass_through_map = self.from_compiled_wildcards(compiled_builtins);
    let task_pass_through_env =
        self.wildcard_map_from_wildcards_unresolved(task_pass_through)?;

    let mut pass_through_env = EnvironmentVariableMap::default();
    pass_through_env.union(&default_env_var_pass_through_map);
    pass_through_env.union(global_env);
    pass_through_env.union(&task_pass_through_env.inclusions);
    pass_through_env.difference(&task_pass_through_env.exclusions);

    Ok(pass_through_env)
}

}

const WILDCARD: char = '*';
const WILDCARD_ESCAPE: char = '\\';
const REGEX_WILDCARD_SEGMENT: &str = ".*";

fn wildcard_to_regex_pattern(pattern: &str) -> String {
    let mut regex_string = Vec::new();
    let mut previous_index = 0;
    let mut previous_char: Option<char> = None;

for (i, char) in pattern.char_indices() {
    if char == WILDCARD {
        if previous_char == Some(WILDCARD_ESCAPE) {
            // Found a literal *
            // Replace the trailing "\*" with just "*" before adding the segment.
            regex_string.push(regex::escape(&format!(
                "{}*",
                &pattern[previous_index..(i - 1)]
            )));
        } else {
            // Found a wildcard
            // Add in the static segment since the last wildcard. Can be zero length.
            regex_string.push(regex::escape(&pattern[previous_index..i]));

            // Add a dynamic segment if it isn't adjacent to another dynamic segment.
            if let Some(last_segment) = regex_string.last()
                && last_segment != REGEX_WILDCARD_SEGMENT
            {
                regex_string.push(REGEX_WILDCARD_SEGMENT.to_string());
            }
        }

        // Advance the pointer.
        previous_index = i + 1;
    }
    previous_char = Some(char);
}

// Add the last static segment. Can be zero length.
regex_string.push(regex::escape(&pattern[previous_index..]));

regex_string.join("")

}

pub fn get_global_hashable_env_vars(
    env_at_execution_start: &EnvironmentVariableMap,
    global_env: &[String],
) -> Result<DetailedMap, Error> {
    let default_env_var_map = env_at_execution_start.from_wildcards(DEFAULT_ENV_VARS)?;

let user_env_var_set =
    env_at_execution_start.wildcard_map_from_wildcards_unresolved(global_env)?;

let mut all_env_var_map = EnvironmentVariableMap::default();
all_env_var_map.union(&user_env_var_set.inclusions);
all_env_var_map.union(&default_env_var_map);
all_env_var_map.difference(&user_env_var_set.exclusions);

let mut explicit_env_var_map = EnvironmentVariableMap::default();
explicit_env_var_map.union(&user_env_var_set.inclusions);
explicit_env_var_map.difference(&user_env_var_set.exclusions);

let mut matching_env_var_map = EnvironmentVariableMap::default();
matching_env_var_map.union(&default_env_var_map);
matching_env_var_map.difference(&explicit_env_var_map);

Ok(DetailedMap {
    all: all_env_var_map,
    by_source: BySource {
        explicit: explicit_env_var_map,
        matching: matching_env_var_map,
    },
})

}

#[cfg(test)] mod tests { use test_case::test_case;

use super::*;

#[test_case("LITERAL_\\*", "LITERAL_\\*" ; "literal star")]
#[test_case("\\*LEADING", "\\*LEADING" ; "leading literal star")]
#[test_case("\\!LEADING", "\\\\!LEADING" ; "leading literal bang")]
#[test_case("!LEADING", "!LEADING" ; "leading bang")]
#[test_case("*LEADING", ".*LEADING" ; "leading star")]
fn test_wildcard_to_regex_pattern(pattern: &str, expected: &str) {
    let actual = super::wildcard_to_regex_pattern(pattern);
    assert_eq!(actual, expected);
}

#[test]
fn test_case_sensitivity() {
    let start = EnvironmentVariableMap(
        vec![("Turbo".to_string(), "true".to_string())]
            .into_iter()
            .collect(),
    );
    let actual = start.from_wildcards(&["TURBO"]).unwrap();
    if cfg!(windows) {
        assert_eq!(actual.get("Turbo").map(|s| s.as_str()), Some("true"));
    } else {
        assert_eq!(actual.get("Turbo"), None);
    }
}

#[test_case(&[], &["VERCEL_ANALYTICS_ID", "VERCEL_TARGET_ENV"] ; "defaults")]
#[test_case(&["!VERCEL*"], &[] ; "removing defaults")]
#[test_case(&["FOO*", "!FOOD"], &["FOO", "FOOBAR", "VERCEL_ANALYTICS_ID", "VERCEL_TARGET_ENV"] ; "intersecting globs")]
fn test_global_env(inputs: &[&str], expected: &[&str]) {
    let env_at_start = EnvironmentVariableMap(
        vec![
            ("VERCEL_TARGET_ENV", "prod"),
            ("VERCEL_ANALYTICS_ID", "1"),
            ("FOO", "bar"),
            ("FOOBAR", "baz"),
            ("FOOD", "cheese"),
        ]
        .into_iter()
        .map(|(k, v)| (k.to_owned(), v.to_owned()))
        .collect(),
    );
    let inputs = inputs.iter().map(|s| s.to_string()).collect::<Vec<_>>();
    let actual = get_global_hashable_env_vars(&env_at_start, &inputs).unwrap();
    let mut actual = actual.all.keys().map(|s| s.as_str()).collect::<Vec<_>>();
    actual.sort();
    assert_eq!(actual, expected);
}

#[test_case(&["FOO*"], &["BAR"], &["BAR", "FOO", "FOOBAR", "FOOD"] ; "wildcard")]
#[test_case(&["FOO*", "!FOOBAR"], &["BAR"], &["BAR", "FOO", "FOOD"] ; "omit wild")]
#[test_case(&["FOO*"], &["!FOOBAR"], &["FOO", "FOOD"] ; "omit task")]
#[test_case(&["FOO*"], &["!FOO*"], &[] ; "exclude all framework vars")]
fn test_hashable_env(wildcards: &[&str], task: &[&str], expected: &[&str]) {
    let env_at_start = EnvironmentVariableMap(
        vec![
            ("FOO", "bar"),
            ("FOOBAR", "baz"),
            ("FOOD", "cheese"),
            ("BAR", "nuts"),
        ]
        .into_iter()
        .map(|(k, v)| (k.to_owned(), v.to_owned()))
        .collect(),
    );
    let wildcards: Vec<_> = wildcards.iter().map(|s| s.to_string()).collect();
    let task: Vec<_> = task.iter().map(|s| s.to_string()).collect();
    let output = env_at_start.hashable_task_env(&wildcards, &task).unwrap();
    let mut actual: Vec<_> = output.all.keys().map(|s| s.as_str()).collect();
    actual.sort();
    assert_eq!(actual, expected);
}

#[test_case(&["FOO*"], &["FOO", "FOOBAR", "FOOD", "PATH"] ; "folds 3 sources")]
#[test_case(&["!FOO"], &["PATH"] ; "remove global")]
#[test_case(&["!PATH"], &["FOO"] ; "remove builtin")]
#[test_case(&["FOO*", "!FOOD"], &["FOO", "FOOBAR", "PATH"] ; "mixing negations")]
fn test_pass_through_env(task: &[&str], expected: &[&str]) {
    let env_at_start = EnvironmentVariableMap(
        vec![
            ("PATH", "of"),
            ("FOO", "bar"),
            ("FOOBAR", "baz"),
            ("FOOD", "cheese"),
            ("BAR", "nuts"),
        ]
        .into_iter()
        .map(|(k, v)| (k.to_owned(), v.to_owned()))
        .collect(),
    );
    let global_env = EnvironmentVariableMap(
        vec![("FOO", "bar")]
            .into_iter()
            .map(|(k, v)| (k.to_owned(), v.to_owned()))
            .collect(),
    );
    let output = env_at_start
        .pass_through_env(&["PATH"], &global_env, task)
        .unwrap();
    let mut actual: Vec<_> = output.keys().map(|s| s.as_str()).collect();
    actual.sort();
    assert_eq!(actual, expected);
}

#[test_case(&["FOO*"], &["FOO", "FOOBAR", "FOOD", "PATH"] ; "folds 3 sources")]
#[test_case(&["!FOO"], &["PATH"] ; "remove global")]
#[test_case(&["!PATH"], &["FOO"] ; "remove builtin")]
#[test_case(&["FOO*", "!FOOD"], &["FOO", "FOOBAR", "PATH"] ; "mixing negations")]
fn test_pass_through_env_compiled_matches_original(task: &[&str], expected: &[&str]) {
    let env_at_start = EnvironmentVariableMap(
        vec![
            ("PATH", "of"),
            ("FOO", "bar"),
            ("FOOBAR", "baz"),
            ("FOOD", "cheese"),
            ("BAR", "nuts"),
        ]
        .into_iter()
        .map(|(k, v)| (k.to_owned(), v.to_owned()))
        .collect(),
    );
    let global_env = EnvironmentVariableMap(
        vec![("FOO", "bar")]
            .into_iter()
            .map(|(k, v)| (k.to_owned(), v.to_owned()))
            .collect(),
    );
    let builtins: &[&str] = &["PATH"];
    let compiled = CompiledWildcards::compile(builtins).unwrap();
    let output = env_at_start
        .pass_through_env_compiled(&compiled, &global_env, task)
        .unwrap();
    let mut actual: Vec<_> = output.keys().map(|s| s.as_str()).collect();
    actual.sort();
    assert_eq!(actual, expected);
}

#[test]
fn test_compiled_wildcards_matches_from_wildcards() {
    let env = EnvironmentVariableMap(
        vec![
            ("HOME", "/home/user"),
            ("PATH", "/usr/bin"),
            ("VSCODE_PID", "12345"),
            ("DOCKER_HOST", "tcp://localhost"),
            ("GITHUB_TOKEN", "ghp_xxx"),
            ("NEXT_PUBLIC_API", "https://api"),
            ("RANDOM_VAR", "value"),
            ("CI", "true"),
            ("VERCEL", "1"),
            ("VERCEL_URL", "example.vercel.app"),
        ]
        .into_iter()
        .map(|(k, v)| (k.to_owned(), v.to_owned()))
        .collect(),
    );

    let original = env.from_wildcards(BUILTIN_PASS_THROUGH_ENV).unwrap();
    let compiled = CompiledWildcards::compile(BUILTIN_PASS_THROUGH_ENV).unwrap();
    let from_compiled = env.from_compiled_wildcards(&compiled);

    let mut orig_keys: Vec<_> = original.keys().cloned().collect();
    let mut comp_keys: Vec<_> = from_compiled.keys().cloned().collect();
    orig_keys.sort();
    comp_keys.sort();

    assert_eq!(
        orig_keys, comp_keys,
        "compiled and original wildcard matching must produce identical keys"
    );

    for key in &orig_keys {
        assert_eq!(
            original.get(key),
            from_compiled.get(key),
            "values differ for key {key}"
        );
    }
}

#[test]
fn test_builtin_pass_through_env_compiles() {
    CompiledWildcards::compile(BUILTIN_PASS_THROUGH_ENV)
        .expect("BUILTIN_PASS_THROUGH_ENV should compile without error");
}

#[test]
fn test_compiled_wildcards_with_excludes() {
    let env = EnvironmentVariableMap(
        vec![("FOO", "1"), ("FOOBAR", "2"), ("FOOD", "3"), ("BAR", "4")]
            .into_iter()
            .map(|(k, v)| (k.to_owned(), v.to_owned()))
            .collect(),
    );

    let patterns: &[&str] = &["FOO*", "!FOOD"];
    let original = env.from_wildcards(patterns).unwrap();
    let compiled = CompiledWildcards::compile(patterns).unwrap();
    let from_compiled = env.from_compiled_wildcards(&compiled);

    let mut orig_keys: Vec<_> = original.keys().cloned().collect();
    let mut comp_keys: Vec<_> = from_compiled.keys().cloned().collect();
    orig_keys.sort();
    comp_keys.sort();

    assert_eq!(orig_keys, comp_keys);
    assert_eq!(orig_keys, vec!["FOO", "FOOBAR"]);
}

#[test]
fn test_compiled_wildcards_empty_patterns() {
    let env = EnvironmentVariableMap(
        vec![("FOO", "bar")]
            .into_iter()
            .map(|(k, v)| (k.to_owned(), v.to_owned()))
            .collect(),
    );

    let empty: &[&str] = &[];
    let compiled = CompiledWildcards::compile(empty).unwrap();
    let result = env.from_compiled_wildcards(&compiled);
    assert!(result.is_empty(), "empty patterns should match nothing");
}

}
===== END FILE: crates/turborepo-env/src/lib.rs =====

===== FILE: crates/turborepo-task-hash/src/lib.rs =====
//! turborepo-task-hash: Task hashing utilities for Turborepo cache invalidation
//!
//! This crate provides the core task hashing logic for Turborepo. It computes
//! hashes for tasks based on their inputs (files, environment variables,
//! dependencies) to determine cache invalidation.

pub mod global_hash;

use std::{
    collections::{HashMap, HashSet},
    sync::{Arc, RwLock},
};

pub use global_hash::*;
use rayon::prelude::*;
use serde::Serialize;
use thiserror::Error;
use tracing::debug;
use turbopath::{
    AbsoluteSystemPath, AnchoredSystemPath, AnchoredSystemPathBuf, RelativeUnixPathBuf,
};
use turborepo_cache::CacheHitMetadata;
// Re-export turborepo_engine::TaskNode for convenience
pub use turborepo_engine::TaskNode;
use turborepo_env::{
    BUILTIN_PASS_THROUGH_ENV, BySource, CompiledWildcards, DetailedMap, EnvironmentVariableMap,
};
use turborepo_frameworks::{Slug as FrameworkSlug, infer_framework};
use turborepo_hash::{FileHashes, LockFilePackagesRef, TaskHashable, TurboHash};
use turborepo_repository::package_graph::{PackageInfo, PackageName};
use turborepo_scm::{RepoGitIndex, SCM};
use turborepo_task_id::TaskId;
use turborepo_telemetry::events::{generic::GenericEventBuilder, task::PackageTaskEventBuilder};
use turborepo_types::{
    EnvMode, HashTrackerCacheHitMetadata, HashTrackerDetailedMap, HashTrackerInfo,
    RunOptsHashInfo, TaskDefinitionHashInfo, TaskInputs,
};

#[derive(Debug, Error)]
pub enum Error {
    #[error("Missing pipeline entry: {0}")]
    MissingPipelineEntry(TaskId<'static>),
    #[error("Missing package.json for {0}.")]
    MissingPackageJson(String),
    #[error("Cannot find package-file hash for {0}.")]
    MissingPackageFileHash(String),
    #[error("Missing hash for dependent task {0}.")]
    MissingDependencyTaskHash(String),
    #[error("Cannot acquire lock for task hash tracker.")]
    Mutex,
    #[error("Missing environment variables for {0}.")]
    MissingEnvVars(TaskId<'static>),
    #[error(
        "Error processing environment patterns for task {task_id} (including global \
         exclusions): {err}"
    )]
    EnvPattern {
        task_id: TaskId<'static>,
        #[source]
        err: turborepo_env::Error,
    },
    #[error(transparent)]
    Scm(#[from] turborepo_scm::Error),
    #[error(transparent)]
    Env(#[from] turborepo_env::Error),
    #[error(transparent)]
    Regex(#[from] regex::Error),
    #[error(transparent)]
    Path(#[from] turbopath::PathError),
}

#[derive(Debug, Default)]
pub struct PackageInputsHashes {
    hashes: HashMap<TaskId<'static>, String>,
    expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>,
}

impl PackageInputsHashes {
    #[tracing::instrument(skip(
        all_tasks,
        workspaces,
        task_definitions,
        repo_root,
        scm,
        _telemetry,
        pre_built_index
    ))]
    pub fn calculate_file_hashes<'a, T>(
        scm: &SCM,
        all_tasks: impl Iterator<Item = &'a TaskNode>,
        workspaces: HashMap<&PackageName, &PackageInfo>,
        task_definitions: &HashMap<TaskId<'static>, T>,
        repo_root: &AbsoluteSystemPath,
        _telemetry: &GenericEventBuilder,
        pre_built_index: Option<&RepoGitIndex>,
    ) -> Result<PackageInputsHashes, Error>
    where
        T: TaskDefinitionHashInfo + Sync,
    {
        tracing::trace!(
            scm_manual = %scm.is_manual(),
            "scm running in {} mode",
            if scm.is_manual() { "manual" } else { "git" }
        );

    // Use the pre-built index if provided, otherwise build one on the spot.
    let owned_index;
    let repo_index = match pre_built_index {
        Some(idx) => Some(idx),
        None => {
            owned_index = scm.build_repo_index(workspaces.len());
            owned_index.as_ref()
        }
    };

    // Phase 1: Collect task metadata and group by (package_path, inputs) for dedup.
    // Multiple tasks in the same package with identical inputs produce the same
    // file hashes — no need to globwalk and hash the same files repeatedly.
    struct TaskInfo<'b> {
        task_id: TaskId<'static>,
        package_path: &'b AnchoredSystemPath,
        inputs: &'b TaskInputs,
    }

    let mut task_infos = Vec::new();
    for task in all_tasks {
        let TaskNode::Task(task_id) = task else {
            continue;
        };
        let task_definition = task_definitions
            .get(task_id)
            .ok_or_else(|| Error::MissingPipelineEntry(task_id.clone()))?;
        let workspace_name = task_id.to_workspace_name();
        let pkg = workspaces
            .get(&workspace_name)
            .ok_or_else(|| Error::MissingPackageJson(workspace_name.to_string()))?;
        let package_path = pkg
            .package_json_path
            .parent()
            .unwrap_or_else(|| AnchoredSystemPath::new("").unwrap());
        let inputs = task_definition.inputs();
        task_infos.push(TaskInfo {
            task_id: task_id.clone(),
            package_path,
            inputs,
        });
    }

    // Build dedup key: (package_path_str, globs, default)
    type HashKey = (AnchoredSystemPathBuf, Vec<String>, bool);
    let mut unique_keys: Vec<HashKey> = Vec::new();
    let mut key_indices: HashMap<HashKey, usize> = HashMap::new();
    let mut task_key_map: Vec<usize> = Vec::with_capacity(task_infos.len());

    for info in &task_infos {
        let key: HashKey = (
            info.package_path.to_owned(),
            info.inputs.globs.clone(),
            info.inputs.default,
        );
        let idx = match key_indices.get(&key) {
            Some(&idx) => idx,
            None => {
                let idx = unique_keys.len();
                key_indices.insert(key.clone(), idx);
                unique_keys.push(key);
                idx
            }
        };
        task_key_map.push(idx);
    }

    debug!(
        total_tasks = task_infos.len(),
        unique_hash_keys = unique_keys.len(),
        "file hash deduplication"
    );

    // Phase 2: Compute file hashes in parallel across unique keys.
    // EMFILE (too many open files) errors are handled via retry-with-backoff
    // in the globwalk and hash_objects layers, so we can safely parallelize
    // all keys on rayon without worrying about fd exhaustion.
    let file_hash_results: Vec<Result<Arc<FileHashes>, Error>> = unique_keys
        .into_par_iter()
        .map(|(package_path, globs, default)| {
            scm.get_package_file_hashes(
                repo_root,
                &package_path,
                &globs,
                default,
                None,
                repo_index,
            )
            .map(|h| {
                let mut v: Vec<_> = h.into_iter().collect();
                v.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
                Arc::new(FileHashes(v))
            })
            .map_err(Error::from)
        })
        .collect();

    let file_hash_results: Vec<Arc<FileHashes>> = file_hash_results
        .into_iter()
        .collect::<Result<Vec<_>, _>>()?;

    // Phase 3: Distribute shared results to individual tasks.
    let mut hashes = HashMap::with_capacity(task_infos.len());
    let mut expanded_hashes = HashMap::with_capacity(task_infos.len());

    for (i, info) in task_infos.into_iter().enumerate() {
        let key_idx = task_key_map[i];
        let file_hashes = &file_hash_results[key_idx];

        let hash = file_hashes.as_ref().hash();

        hashes.insert(info.task_id.clone(), hash);
        expanded_hashes.insert(info.task_id, Arc::clone(file_hashes));
    }

    Ok(PackageInputsHashes {
        hashes,
        expanded_hashes,
    })
}

}

#[derive(Default, Debug, Clone)]
pub struct TaskHashTracker {
    state: Arc<RwLock<TaskHashTrackerState>>,
}

#[derive(Default, Debug, Serialize)]
pub struct TaskHashTrackerState {
    #[serde(skip)]
    package_task_env_vars: HashMap<TaskId<'static>, DetailedMap>,
    package_task_hashes: HashMap<TaskId<'static>, Arc<str>>,
    #[serde(skip)]
    package_task_framework: HashMap<TaskId<'static>, FrameworkSlug>,
    #[serde(skip)]
    package_task_outputs: HashMap<TaskId<'static>, Vec<AnchoredSystemPathBuf>>,
    #[serde(skip)]
    package_task_cache: HashMap<TaskId<'static>, CacheHitMetadata>,
    #[serde(skip)]
    package_task_inputs_expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>,
}

/// Caches package-inputs hashes, and package-task hashes.
pub struct TaskHasher<'a, R> {
    hashes: HashMap<TaskId<'static>, String>,
    run_opts: &'a R,
    env_at_execution_start: &'a EnvironmentVariableMap,
    global_env: EnvironmentVariableMap,
    global_env_patterns: &'a [String],
    global_hash: &'a str,
    task_hash_tracker: TaskHashTracker,
    compiled_builtins: CompiledWildcards,
    external_deps_hash_cache: HashMap<String, String>,
}

impl<'a, R: RunOptsHashInfo> TaskHasher<'a, R> {
    pub fn new(
        package_inputs_hashes: PackageInputsHashes,
        run_opts: &'a R,
        env_at_execution_start: &'a EnvironmentVariableMap,
        global_hash: &'a str,
        global_env: EnvironmentVariableMap,
        global_env_patterns: &'a [String],
    ) -> Self {
        let PackageInputsHashes {
            hashes,
            expanded_hashes,
        } = package_inputs_hashes;

    let compiled_builtins = CompiledWildcards::compile(BUILTIN_PASS_THROUGH_ENV)
        .unwrap_or_else(|_| {
            let empty: &[&str] = &[];
            CompiledWildcards::compile(empty).unwrap()
        });

    Self {
        hashes,
        run_opts,
        env_at_execution_start,
        global_hash,
        global_env,
        global_env_patterns,
        task_hash_tracker: TaskHashTracker::new(expanded_hashes),
        compiled_builtins,
        external_deps_hash_cache: HashMap::new(),
    }
}

/// Pre-compute and cache external dependency hashes for all packages.
/// Many tasks share the same package, so this avoids re-sorting
/// transitive dependencies for every task.
pub fn precompute_external_deps_hashes<'b>(
    &mut self,
    workspaces: impl Iterator<Item = (&'b PackageName, &'b PackageInfo)>,
) {
    if self.run_opts.single_package() {
        return;
    }
    let ws: Vec<_> = workspaces.collect();
    self.external_deps_hash_cache = ws
        .par_iter()
        .map(|(name, info)| {
            let hash = get_external_deps_hash(&info.transitive_dependencies);
            (name.as_str().to_owned(), hash)
        })
        .collect();
}

#[tracing::instrument(skip(self, task_definition, task_env_mode, workspace, dependency_set))]
pub fn calculate_task_hash<T: TaskDefinitionHashInfo>(
    &self,
    task_id: &TaskId<'static>,
    task_definition: &T,
    task_env_mode: EnvMode,
    workspace: &PackageInfo,
    dependency_set: &[&TaskNode],
    telemetry: PackageTaskEventBuilder,
) -> Result<String, Error> {
    let do_framework_inference = self.run_opts.framework_inference();
    let is_monorepo = !self.run_opts.single_package();

    let hash_of_files = self
        .hashes
        .get(task_id)
        .ok_or_else(|| Error::MissingPackageFileHash(task_id.to_string()))?;
    // See if we can infer a framework
    let framework = do_framework_inference
        .then(|| infer_framework(workspace, is_monorepo))
        .flatten()
        .inspect(|framework| {
            debug!("auto detected framework for {}", task_id.package());
            debug!(
                "framework: {}, env_prefix: {:?}",
                framework.slug(),
                framework.env(self.env_at_execution_start)
            );
            telemetry.track_framework(framework.slug().to_string());
        });
    let framework_slug = framework.map(|f| f.slug());

    let env_vars = if let Some(framework) = framework {
        let mut computed_wildcards = framework.env(self.env_at_execution_start);

        if let Some(exclude_prefix) = self
            .env_at_execution_start
            .get("TURBO_CI_VENDOR_ENV_KEY")
            .filter(|prefix| !prefix.is_empty())
        {
            let computed_exclude = format!("!{exclude_prefix}*");
            debug!(
                "excluding environment variables matching wildcard {}",
                computed_exclude
            );
            computed_wildcards.push(computed_exclude);
        }

        // Combine task-specific env patterns with global env exclusions
        // Global exclusions (patterns starting with !) should apply to framework
        // inference
        let combined_env_patterns: Vec<String> = task_definition
            .env()
            .iter()
            .chain(
                self.global_env_patterns
                    .iter()
                    .filter(|p| p.starts_with('!')),
            )
            .cloned()
            .collect();

        self.env_at_execution_start
            .hashable_task_env(&computed_wildcards, &combined_env_patterns)
            .map_err(|err| Error::EnvPattern {
                task_id: task_id.clone().into_owned(),
                err,
            })?
    } else {
        let all_env_var_map = self
            .env_at_execution_start
            .from_wildcards(task_definition.env())?;

        DetailedMap {
            by_source: BySource {
                explicit: all_env_var_map.clone(),
                matching: EnvironmentVariableMap::default(),
            },
            all: all_env_var_map,
        }
    };

    let hashable_env_pairs = env_vars.all.to_hashable();
    let outputs = task_definition.hashable_outputs(task_id);
    let task_dependency_hashes = self.calculate_dependency_hashes(dependency_set)?;
    let ext_hash_fallback;
    let external_deps_hash: Option<&str> = if !is_monorepo {
        None
    } else if let Some(cached) = self.external_deps_hash_cache.get(task_id.package()) {
        Some(cached.as_str())
    } else {
        ext_hash_fallback = get_external_deps_hash(&workspace.transitive_dependencies);
        Some(ext_hash_fallback.as_str())
    };

    if !hashable_env_pairs.is_empty() {
        debug!(
            "task hash env vars for {}:{}\n vars: {:?}",
            task_id.package(),
            task_id.task(),
            hashable_env_pairs
        );
    }

    let package_dir = workspace.package_path().to_unix();
    let is_root_package = package_dir.is_empty();
    // We wrap in an Option to mimic Go's serialization of nullable values
    let optional_package_dir = (!is_root_package).then_some(package_dir);

    let task_hashable = TaskHashable {
        global_hash: self.global_hash,
        task_dependency_hashes,
        package_dir: optional_package_dir,
        hash_of_files,
        external_deps_hash,
        task: task_id.task(),
        outputs,

        pass_through_args: self.run_opts.pass_through_args(),
        env: task_definition.env(),
        resolved_env_vars: hashable_env_pairs,
        pass_through_env: task_definition.pass_through_env().unwrap_or_default(),
        env_mode: task_env_mode,
    };

    let task_hash = task_hashable.calculate_task_hash();

    let task_hash_arc: Arc<str> = Arc::from(task_hash.as_str());
    self.task_hash_tracker.insert_hash(
        task_id.clone(),
        env_vars,
        task_hash_arc,
        framework_slug,
    );

    Ok(task_hash)
}

/// Gets the hashes of a task's dependencies. Because the visitor
/// receives the nodes in topological order, we know that all of
/// the dependencies have been processed before the current task.
///
/// # Arguments
///
/// * `dependency_set`: The dependencies of the current task
///
/// returns: Result<Vec<String, Global>, Error>
fn calculate_dependency_hashes(
    &self,
    dependency_set: &[&TaskNode],
) -> Result<Vec<Arc<str>>, Error> {
    let state = self
        .task_hash_tracker
        .state
        .read()
        .expect("hash tracker rwlock poisoned");

    let mut dependency_hash_list: Vec<Arc<str>> = Vec::with_capacity(dependency_set.len());
    for dependency_task in dependency_set {
        let TaskNode::Task(dependency_task_id) = dependency_task else {
            continue;
        };

        let dependency_hash = state
            .package_task_hashes
            .get(dependency_task_id)
            .ok_or_else(|| Error::MissingDependencyTaskHash(dependency_task.to_string()))?;
        dependency_hash_list.push(Arc::clone(dependency_hash));
    }
    drop(state);

    dependency_hash_list.sort_unstable();
    dependency_hash_list.dedup();

    Ok(dependency_hash_list)
}

pub fn into_task_hash_tracker_state(self) -> TaskHashTrackerState {
    let rwlock = Arc::into_inner(self.task_hash_tracker.state)
        .expect("multiple references to tracker state still exist");
    rwlock.into_inner().unwrap()
}

pub fn task_hash_tracker(&self) -> TaskHashTracker {
    self.task_hash_tracker.clone()
}

pub fn env<T: TaskDefinitionHashInfo>(
    &self,
    task_id: &TaskId,
    task_env_mode: EnvMode,
    task_definition: &T,
) -> Result<EnvironmentVariableMap, Error> {
    match task_env_mode {
        EnvMode::Strict => {
            let pass_through_env_vars = self.env_at_execution_start.pass_through_env_compiled(
                &self.compiled_builtins,
                &self.global_env,
                task_definition.pass_through_env().unwrap_or_default(),
            )?;

            let tracker_env = self
                .task_hash_tracker
                .env_vars(task_id)
                .ok_or_else(|| Error::MissingEnvVars(task_id.clone().into_owned()))?;

            let mut full_task_env = EnvironmentVariableMap::default();
            full_task_env.union(&pass_through_env_vars);
            full_task_env.union(&tracker_env.all);

            Ok(full_task_env)
        }
        EnvMode::Loose => Ok(self.env_at_execution_start.clone()),
    }
}

}

pub fn get_external_deps_hash(
    transitive_dependencies: &Option<HashSet<turborepo_lockfiles::Package>>,
) -> String {
    let Some(transitive_dependencies) = transitive_dependencies else {
        return "".into();
    };

// Collect references instead of cloning each Package (which has two Strings).
let mut transitive_deps: Vec<&turborepo_lockfiles::Package> =
    transitive_dependencies.iter().collect();

transitive_deps.sort_unstable_by(|a, b| match a.key.cmp(&b.key) {
    std::cmp::Ordering::Equal => a.version.cmp(&b.version),
    other => other,
});

LockFilePackagesRef(transitive_deps).hash()

}

pub fn get_internal_deps_hash(
    scm: &SCM,
    root: &AbsoluteSystemPath,
    package_dirs: Vec<&AnchoredSystemPath>,
    pre_built_index: Option<&RepoGitIndex>,
) -> Result<String, Error> {
    if package_dirs.is_empty() {
        return Ok("".into());
    }

let owned_index;
let repo_index = match pre_built_index {
    Some(idx) => Some(idx),
    None => {
        owned_index = scm.build_repo_index(package_dirs.len());
        owned_index.as_ref()
    }
};

let merged = package_dirs
    .into_par_iter()
    .map(|package_dir| {
        scm.get_package_file_hashes::<&str>(root, package_dir, &[], false, None, repo_index)
    })
    .reduce(
        || Ok(HashMap::new()),
        |acc, hashes| {
            let mut acc = acc?;
            let hashes = hashes?;
            acc.extend(hashes.into_iter());
            Ok(acc)
        },
    )?;

let mut file_hashes: Vec<_> = merged.into_iter().collect();
file_hashes.sort_unstable_by(|(a, _), (b, _)| a.cmp(b));
Ok(FileHashes(file_hashes).hash())

}

impl TaskHashTracker {
    pub fn new(input_expanded_hashes: HashMap<TaskId<'static>, Arc<FileHashes>>) -> Self {
        Self {
            state: Arc::new(RwLock::new(TaskHashTrackerState {
                package_task_inputs_expanded_hashes: input_expanded_hashes,
                ..Default::default()
            })),
        }
    }

pub fn hash(&self, task_id: &TaskId) -> Option<Arc<str>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_hashes.get(task_id).cloned()
}

fn insert_hash(
    &self,
    task_id: TaskId<'static>,
    env_vars: DetailedMap,
    hash: Arc<str>,
    framework_slug: Option<FrameworkSlug>,
) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state
        .package_task_env_vars
        .insert(task_id.clone(), env_vars);
    if let Some(framework) = framework_slug {
        // Only pay for one extra clone when framework inference is active.
        state
            .package_task_framework
            .insert(task_id.clone(), framework);
    }
    state.package_task_hashes.insert(task_id, hash);
}

pub fn env_vars(&self, task_id: &TaskId) -> Option<DetailedMap> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_env_vars.get(task_id).cloned()
}

pub fn framework(&self, task_id: &TaskId) -> Option<FrameworkSlug> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_framework.get(task_id).cloned()
}

pub fn expanded_outputs(&self, task_id: &TaskId) -> Option<Vec<AnchoredSystemPathBuf>> {
    let state = self.state.read().expect("hash tracker rwlock poisoned");
    state.package_task_outputs.get(task_id).cloned()
}

pub fn insert_expanded_outputs(
    &self,
    task_id: TaskId<'static>,
    outputs: Vec<AnchoredSystemPathBuf>,
) {
    let mut state = self.state.write().expect("hash tracker rwlock poisoned");
    state.package_task_outputs.insert(task_id, outputs);
}

pub f

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.

Current State

Turborepo's hot path has four sequential phases: Startup/Discovery (100-600ms), Engine Construction (10-100ms), Hash Computation (50-500ms), and Task Execution (variable). Recent canary commits have already parallelized task hashing across topological waves, added shallow wildcard glob fast paths, and moved engine callbacks before tracker bookkeeping. Package.json parsing, lockfile reading, turbo.json preloading, and dependency splitting are already parallelized with rayon/tokio.

Required Changes

Tier 1 — High Impact

  1. Multi-threaded zstd compression (cache_archive/create.rs:115,151) — Currently single-threaded at level 0. Adding zw.multithread(4) and bumping to level 1 gives 2-4x faster writes and 30-60% smaller archives. ~5 LoC change.

  2. Wave-based graph walker (walker.rs:34-125) — Currently spawns one tokio task + broadcast channel per graph node. A 10k-task graph = 10k spawned tasks. In-degree counting with a wave queue eliminates this. Larger rewrite but high payoff at scale.

  3. Batch edge additions in engine builder (builder.rs:367-387) — Cartesian product adds edges one-at-a-time with mutable graph access. Collecting edges first then batching cuts 10-30% of engine construction.

  4. Arc-wrap immutable engine collections (lib.rs:299-301) — Three full HashMap clones on every subgraph creation. Wrapping in Arc makes clone cost O(1). Critical for turbo watch.
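
The wave-based walker in item 2 is the largest change. A dependency-free sketch of the core idea (names are illustrative, not the actual walker.rs types): track each node's unresolved in-degree and release whole waves as counts reach zero, instead of spawning a tokio task plus broadcast channel per node.

```rust
use std::collections::HashMap;

/// Kahn-style wave scheduling: nodes are released in topological "waves"
/// once all of their dependencies have completed, instead of one spawned
/// task + broadcast channel per graph node.
fn topological_waves(num_nodes: usize, edges: &[(usize, usize)]) -> Vec<Vec<usize>> {
    // edges are (dependency, dependent) pairs
    let mut in_degree = vec![0usize; num_nodes];
    let mut dependents: HashMap<usize, Vec<usize>> = HashMap::new();
    for &(dep, node) in edges {
        in_degree[node] += 1;
        dependents.entry(dep).or_default().push(node);
    }

    // Wave 0: everything with no dependencies.
    let mut current: Vec<usize> = (0..num_nodes).filter(|&n| in_degree[n] == 0).collect();
    let mut waves = Vec::new();
    while !current.is_empty() {
        let mut next = Vec::new();
        for &finished in &current {
            // "Completing" a node decrements its dependents' in-degrees;
            // any dependent hitting zero joins the next wave.
            for &d in dependents.get(&finished).map(Vec::as_slice).unwrap_or(&[]) {
                in_degree[d] -= 1;
                if in_degree[d] == 0 {
                    next.push(d);
                }
            }
        }
        waves.push(std::mem::take(&mut current));
        current = next;
    }
    waves
}

fn main() {
    // 0 -> 1, 0 -> 2, 1 -> 3, 2 -> 3: waves are [0], [1, 2], [3]
    let waves = topological_waves(4, &[(0, 1), (0, 2), (1, 3), (2, 3)]);
    assert_eq!(waves, vec![vec![0], vec![1, 2], vec![3]]);
}
```

In the real walker each wave would be handed to the executor as a batch, which is also what makes per-wave concurrency tuning possible later.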

Tier 2 — Medium Impact, Low Effort

  1. Process spawn outside mutex (process/lib.rs:105-135) — Lock held during kernel spawn call serializes concurrent task starts.

  2. Raise rayon thread cap from 8 to 16 (repo_index.rs:329) — Underutilizes 16+ core machines for stat-heavy I/O.

  3. Lock-free hash aggregation (visitor/mod.rs:233-297) — Replace Arc<Mutex<HashMap>> with thread-local collect + merge.

  4. Overlap env inference with turbo.json preload (builder.rs:400-407) — These independent ops currently run sequentially; running them under rayon::join hides the shorter one's latency.
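
The thread-local collect + merge pattern from item 3 can be sketched with std threads (the real change would use rayon's fold/reduce; hash_all and the toy hash below are illustrative, not existing code): each worker fills a private map, and the maps are merged once after the join, so no per-insert lock is taken.

```rust
use std::collections::HashMap;
use std::thread;

/// Each worker hashes its own slice into a private HashMap; the maps are
/// merged single-threaded after the threads join, so there is zero
/// contention on the hot insert path.
fn hash_all(tasks: &[(String, u64)], workers: usize) -> HashMap<String, u64> {
    let chunk = tasks.len().div_ceil(workers).max(1);
    thread::scope(|s| {
        let handles: Vec<_> = tasks
            .chunks(chunk)
            .map(|slice| {
                s.spawn(move || {
                    slice
                        .iter()
                        // Stand-in for the real per-task hash computation.
                        .map(|(id, seed)| (id.clone(), seed.wrapping_mul(31).wrapping_add(7)))
                        .collect::<HashMap<_, _>>()
                })
            })
            .collect();
        // Single merge pass: O(total inserts), no Mutex<HashMap>.
        let mut merged = HashMap::new();
        for h in handles {
            merged.extend(h.join().unwrap());
        }
        merged
    })
}

fn main() {
    let tasks: Vec<_> = (0..8).map(|i| (format!("task{i}"), i as u64)).collect();
    let out = hash_all(&tasks, 4);
    assert_eq!(out.len(), 8);
    assert_eq!(out["task2"], 69); // 2 * 31 + 7
}
```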

Tier 3 — Quick Wins

  1. Parallel file metadata collection for tar archives (fs.rs:175-177)
  2. Increase async cache channel buffer from max_workers to max_workers * 4 (async_cache.rs:55)
  3. Build reverse task definition index to avoid O(p) scan (builder.rs:439-447)
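
Item 3's reverse index is a one-pass precompute that turns each O(p) package scan into an O(1) lookup. A generic sketch with hypothetical names (the real builder.rs types differ):

```rust
use std::collections::HashMap;

/// Before: finding which package defines a task scans all packages (O(p)
/// per query). After: one pass builds task -> package, then each lookup
/// is a hash probe.
fn build_reverse_index<'a>(
    packages: &'a [(&'a str, Vec<&'a str>)],
) -> HashMap<&'a str, &'a str> {
    let mut index = HashMap::new();
    for (package, tasks) in packages {
        for task in tasks {
            // Last writer wins on duplicates here; real code would
            // resolve precedence explicitly.
            index.insert(*task, *package);
        }
    }
    index
}

fn main() {
    let packages = vec![
        ("web", vec!["build", "lint"]),
        ("docs", vec!["build", "dev"]),
    ];
    let index = build_reverse_index(&packages);
    assert_eq!(index["lint"], "web");
    assert_eq!(index["dev"], "docs");
    // "build" resolves to the last package that defined it.
    assert_eq!(index["build"], "docs");
}
```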

Ripple Effects

  • Multi-threaded zstd increases CPU during writes but outputs compatible archives — gate thread count on available cores
  • Walker rewrite changes within-wave scheduling order — test output ordering carefully
  • Process spawn outside lock introduces a race window with kill_all — register-after-spawn with a check
  • Raised thread cap could hit fd limits on constrained systems — test with low ulimit

What This Unlocks

  • Smaller zstd archives make remote cache uploads faster (network-bound improvement)
  • Wave-based walker enables adaptive per-wave concurrency tuning
  • Arc engine collections enable zero-copy engine sharing in watch mode
  • Adding criterion microbenchmarks (currently absent) would make all future perf work measurable
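
Until criterion benches land, a coarse std-only harness can still catch gross regressions in hot functions (time_median and the workload below are illustrative, not existing code):

```rust
use std::time::{Duration, Instant};

/// Coarse regression guard: time a closure over many iterations and
/// report the median, which is more robust to scheduler noise than the
/// mean. A real setup would use criterion instead.
fn time_median<F: FnMut()>(mut f: F, iters: usize) -> Duration {
    let mut samples: Vec<Duration> = (0..iters)
        .map(|_| {
            let start = Instant::now();
            f();
            start.elapsed()
        })
        .collect();
    samples.sort();
    samples[samples.len() / 2]
}

fn main() {
    // Stand-in workload; a real harness would call e.g. the hashing hot path.
    let data: Vec<u64> = (0..10_000).collect();
    let median = time_median(
        || {
            let s: u64 = data.iter().sum();
            assert!(s > 0);
        },
        100,
    );
    println!("median: {median:?}");
}
```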

Effort & Risk

| Change | Effort | Risk | Verify with |
| --- | --- | --- | --- |
| Multi-threaded zstd | XS | Low | cargo test -p turborepo-cache + compare archive sizes |
| Wave-based walker | L | Medium | cargo test -p turborepo-graph-utils + --profile comparison |
| Batch edges | S | Low | cargo test -p turborepo-engine |
| Arc collections | M | Low | cargo test -p turborepo-engine + watch smoke test |
| Spawn outside lock | S | Medium | cargo test -p turborepo-process |
| Thread cap raise | XS | Low | --profile on 16-core machine |
| Lock-free hashes | S | Low | cargo test -p turborepo-lib |
| Overlap env+preload | XS | Low | --profile startup spans |

Verification for all changes:

# Baseline
turbo run build --force --profile=baseline.json
# After change
turbo run build --force --profile=after.json
# Compare spans in .md auto-summary or chrome://tracing
# Key spans: "pkg_dep_graph_build", "hash_scope", "engine::execute", "cache worker: cache PUT"
# Full regression check
cargo test --workspace

Recommended order: Start with XS/S changes (1, 6, 8, 10) for immediate wins, then tackle structural changes (3, 4), and save the walker rewrite (2) for last.


Oracle Session (inlined)

Research Investigation

Answer the investigation question using the provided context. Keep recommendations specific to this repository and call out assumptions.

Question: Find every possible perf win with the goal of making turborepo builds as fast as possible and show how to verify the perf improvements and that they're not breaking anything

Project Context

Recent Commits

09e25577a7 release(turborepo): 2.8.11-canary.27 (#11975)
db01cb4490 perf: Fast path for shallow wildcard glob patterns in workspace discovery (#11972)
d82c6919d1 fix: Resolve git_root to worktree root in linked worktrees (#11974)
e2bc393cec release(turborepo): 2.8.11-canary.26 (#11973)
4b5410b397 perf: Send engine callback before tracker bookkeeping and add tracing spans (#11970)
75406f62d0 release(turborepo): 2.8.11-canary.25 (#11971)
b3c0f46da8 perf: Parallelize task hash computation across topological waves (#11969)
69a89b33a4 release(turborepo): 2.8.11-canary.24 (#11968)

Memory Context

Mode: research
Previously explored areas:

  • engine-builder-parallel-config-loading
    Files: crates/turborepo-engine/src/builder.rs, crates/turborepo-repository/src/package_graph/mod.rs, crates/turborepo-repository/src/discovery.rs
    Maturity: stable
    Relevance: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
    Key findings: The EngineBuilder BFS loads turbo.json configs serially per workspace during every turbo run cold start — parallelizing this and caching resolved configs would cut graph construction time proportionally to workspace count.
  • cache-archive-parallel-compression
    Files: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs
    Maturity: stable
    Relevance: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
    Key findings: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.

Exploration

AREA: cache-archive-parallel-compression
FILES: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs
MATURITY: stable
RELEVANCE: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.
NOTES: (unspecified)

Raw Exploration Notes:
AREA: cache-archive-parallel-compression
FILES: crates/turborepo-cache/src/fs.rs, crates/turborepo-cache/src/cache_archive/create.rs, crates/turborepo-cache/src/cache_archive/mod.rs
MATURITY: stable
RELEVANCE: Cache archive creation uses single-threaded zstd at level 0 with sequential file reads — enabling multi-threaded zstd compression and parallel I/O would directly cut wall-clock time on every cache-miss task, especially for large Next.js builds with hundreds of output chunks.

Selected Code Context

===== FILE: crates/turborepo-cache/src/fs.rs =====
use std::backtrace::Backtrace;

use camino::Utf8Path;
use serde::{Deserialize, Serialize};
use tracing::debug;
use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf, AnchoredSystemPathBuf};
use turborepo_analytics::AnalyticsSender;
use turborepo_api_client::{analytics, analytics::AnalyticsEvent};

use crate::{
    CacheError, CacheHitMetadata, CacheSource,
    cache_archive::{CacheReader, CacheWriter},
};

pub struct FSCache {
    cache_directory: AbsoluteSystemPathBuf,
    analytics_recorder: Option<AnalyticsSender>,
}

#[derive(Debug, Deserialize, Serialize)]
struct CacheMetadata {
    hash: String,
    duration: u64,
}

impl CacheMetadata {
    fn read(path: &AbsoluteSystemPath) -> Result<CacheMetadata, CacheError> {
        serde_json::from_str(&path.read_to_string()?)
            .map_err(|e| CacheError::InvalidMetadata(e, Backtrace::capture()))
    }
}

impl FSCache {
    fn resolve_cache_dir(
        repo_root: &AbsoluteSystemPath,
        cache_dir: &Utf8Path,
    ) -> AbsoluteSystemPathBuf {
        AbsoluteSystemPathBuf::from_unknown(repo_root, cache_dir)
    }

#[tracing::instrument(skip_all)]
pub fn new(
    cache_dir: &Utf8Path,
    repo_root: &AbsoluteSystemPath,
    analytics_recorder: Option<AnalyticsSender>,
) -> Result<Self, CacheError> {
    debug!(
        "FSCache::new called with cache_dir={}, repo_root={}",
        cache_dir, repo_root
    );
    let cache_directory = Self::resolve_cache_dir(repo_root, cache_dir);
    debug!("FSCache resolved cache_directory={}", cache_directory);
    cache_directory.create_dir_all()?;

    Ok(FSCache {
        cache_directory,
        analytics_recorder,
    })
}

fn log_fetch(&self, event: analytics::CacheEvent, hash: &str, duration: u64) {
    // If analytics fails to record, it's not worth failing the cache
    if let Some(analytics_recorder) = &self.analytics_recorder {
        let analytics_event = AnalyticsEvent {
            session_id: None,
            source: analytics::CacheSource::Local,
            event,
            hash: hash.to_string(),
            duration,
        };

        let _ = analytics_recorder.send(analytics_event);
    }
}

#[tracing::instrument(skip_all)]
pub fn fetch(
    &self,
    anchor: &AbsoluteSystemPath,
    hash: &str,
) -> Result<Option<(CacheHitMetadata, Vec<AnchoredSystemPathBuf>)>, CacheError> {
    let uncompressed_cache_path = self.cache_directory.join_component(&format!("{hash}.tar"));
    let compressed_cache_path = self
        .cache_directory
        .join_component(&format!("{hash}.tar.zst"));

    debug!(
        "FSCache::fetch looking for cache artifacts at {} or {}",
        uncompressed_cache_path, compressed_cache_path
    );

    let cache_path = if uncompressed_cache_path.exists() {
        uncompressed_cache_path
    } else if compressed_cache_path.exists() {
        compressed_cache_path
    } else {
        debug!(
            "FSCache::fetch cache miss for hash {} in {}",
            hash, self.cache_directory
        );
        self.log_fetch(analytics::CacheEvent::Miss, hash, 0);
        return Ok(None);
    };

    let mut cache_reader = CacheReader::open(&cache_path)?;

    let restored_files = cache_reader.restore(anchor)?;

    let meta = CacheMetadata::read(
        &self
            .cache_directory
            .join_component(&format!("{hash}-meta.json")),
    )?;

    self.log_fetch(analytics::CacheEvent::Hit, hash, meta.duration);

    Ok(Some((
        CacheHitMetadata {
            time_saved: meta.duration,
            source: CacheSource::Local,
        },
        restored_files,
    )))
}

#[tracing::instrument(skip_all)]
pub(crate) fn exists(&self, hash: &str) -> Result<Option<CacheHitMetadata>, CacheError> {
    let cache_dir = self.cache_directory.as_str();
    let mut buf = String::with_capacity(cache_dir.len() + 1 + hash.len() + "-meta.json".len());
    buf.push_str(cache_dir);
    buf.push(std::path::MAIN_SEPARATOR);
    buf.push_str(hash);
    let prefix_len = buf.len();

    buf.push_str(".tar");
    let uncompressed_exists = std::path::Path::new(&buf).exists();

    buf.push_str(".zst");
    let compressed_exists = std::path::Path::new(&buf).exists();

    if !uncompressed_exists && !compressed_exists {
        return Ok(None);
    }

    buf.truncate(prefix_len);
    buf.push_str("-meta.json");

    let duration = CacheMetadata::read(
        &AbsoluteSystemPathBuf::try_from(buf.as_str())
            .map_err(|_| CacheError::ConfigCacheInvalidBase)?,
    )
    .map(|meta| meta.duration)
    .unwrap_or(0);

    Ok(Some(CacheHitMetadata {
        time_saved: duration,
        source: CacheSource::Local,
    }))
}

#[tracing::instrument(skip_all)]
pub fn put(
    &self,
    anchor: &AbsoluteSystemPath,
    hash: &str,
    files: &[AnchoredSystemPathBuf],
    duration: u64,
) -> Result<(), CacheError> {
    let cache_path = self
        .cache_directory
        .join_component(&format!("{hash}.tar.zst"));

    let mut cache_item = CacheWriter::create(&cache_path)?;

    for file in files {
        cache_item.add_file(anchor, file)?;
    }

    // Finish the archive (performs atomic rename from temp to final path)
    cache_item.finish()?;

    // Write metadata file atomically using write-to-temp-then-rename pattern
    let metadata_path = self
        .cache_directory
        .join_component(&format!("{hash}-meta.json"));

    let meta = CacheMetadata {
        hash: hash.to_string(),
        duration,
    };

    let meta_json = serde_json::to_string(&meta)
        .map_err(|e| CacheError::InvalidMetadata(e, Backtrace::capture()))?;

    // Write to temporary file then atomically rename
    let temp_metadata_path = self
        .cache_directory
        .join_component(&format!(".{hash}-meta.json.{}.tmp", std::process::id()));

    temp_metadata_path.create_with_contents(&meta_json)?;
    temp_metadata_path.rename(&metadata_path)?;

    Ok(())
}

}

#[cfg(test)]
mod test {
    use std::time::Duration;

use anyhow::Result;
use futures::future::try_join_all;
use tempfile::tempdir;
use turbopath::AnchoredSystemPath;
use turborepo_analytics::start_analytics;
use turborepo_api_client::{APIAuth, APIClient, SecretString};
use turborepo_vercel_api_mock::start_test_server;

use super::*;
use crate::test_cases::{TestCase, get_test_cases, validate_analytics};

#[tokio::test]
async fn test_fs_cache() -> Result<()> {
    let port = port_scanner::request_open_port().unwrap();
    let (ready_tx, ready_rx) = tokio::sync::oneshot::channel();
    tokio::spawn(start_test_server(port, Some(ready_tx)));

    // Wait for server to be ready
    tokio::time::timeout(Duration::from_secs(5), ready_rx)
        .await
        .map_err(|_| anyhow::anyhow!("Test server failed to start"))??;

    let test_cases = get_test_cases();

    try_join_all(
        test_cases
            .iter()
            .map(|test_case| round_trip_test(test_case, port)),
    )
    .await?;

    validate_analytics(&test_cases, analytics::CacheSource::Local, port).await?;
    Ok(())
}

async fn round_trip_test(test_case: &TestCase, port: u16) -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;
    test_case.initialize(repo_root_path)?;

    let api_client = APIClient::new(
        format!("http://localhost:{port}"),
        Some(Duration::from_secs(200)),
        None,
        "2.0.0",
        true,
    )?;
    let api_auth = APIAuth {
        team_id: Some("my-team".to_string()),
        token: SecretString::new("my-token".to_string()),
        team_slug: None,
    };
    let (analytics_sender, analytics_handle) =
        start_analytics(api_auth.clone(), api_client.clone());

    let cache = FSCache::new(
        Utf8Path::new(""),
        repo_root_path,
        Some(analytics_sender.clone()),
    )?;

    let expected_miss = cache.fetch(repo_root_path, test_case.hash)?;
    assert!(expected_miss.is_none());

    let files: Vec<_> = test_case
        .files
        .iter()
        .map(|f| f.path().to_owned())
        .collect();
    cache.put(repo_root_path, test_case.hash, &files, test_case.duration)?;

    let (status, files) = cache.fetch(repo_root_path, test_case.hash)?.unwrap();

    assert_eq!(
        status,
        CacheHitMetadata {
            time_saved: test_case.duration,
            source: CacheSource::Local
        }
    );

    assert_eq!(files.len(), test_case.files.len());
    for (expected, actual) in test_case.files.iter().zip(files.iter()) {
        let actual: &AnchoredSystemPath = actual;
        assert_eq!(expected.path(), actual);
        let actual_file = repo_root_path.resolve(actual);
        if let Some(contents) = expected.contents() {
            assert_eq!(contents, actual_file.read_to_string()?);
        } else {
            assert!(actual_file.exists());
        }
    }

    analytics_handle.close_with_timeout().await;
    Ok(())
}

/// Test that multiple concurrent writes to the same hash don't corrupt the
/// cache. This tests the atomic write pattern
/// (write-to-temp-then-rename).
#[tokio::test]
async fn test_concurrent_writes_same_hash() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "concurrent_write_test";
    let duration = 100;

    // Create multiple caches pointing to the same directory
    let cache1 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache2 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache3 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;

    // Perform concurrent writes
    let handle1 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache1.put(&repo_root, hash, &files, duration) })
    };
    let handle2 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache2.put(&repo_root, hash, &files, duration) })
    };
    let handle3 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache3.put(&repo_root, hash, &files, duration) })
    };

    // All writes should succeed (or at least not corrupt the cache)
    let _ = handle1.await?;
    let _ = handle2.await?;
    let _ = handle3.await?;

    // The cache should be readable
    let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let result = cache.fetch(repo_root_path, hash)?;
    assert!(
        result.is_some(),
        "Cache should be readable after concurrent writes"
    );

    Ok(())
}

/// Test that reads during writes don't fail.
/// A read should either return the old content, new content, or a miss -
/// never corrupted data.
#[tokio::test]
async fn test_read_during_write() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("original content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "read_during_write_test";
    let duration = 100;

    // First write to establish the cache
    let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    cache.put(repo_root_path, hash, &files, duration)?;

    // Update the source file
    test_file.create_with_contents("updated content")?;

    // Perform concurrent read and write
    let cache_write = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache_read = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;

    let write_handle = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache_write.put(&repo_root, hash, &files, duration + 1) })
    };

    // Perform multiple reads while write is happening
    for _ in 0..10 {
        let result = cache_read.fetch(repo_root_path, hash);
        // Should either succeed with valid data or fail cleanly - no corruption
        if let Ok(Some((metadata, _))) = result {
            // Duration should be either old or new value
            assert!(
                metadata.time_saved == duration || metadata.time_saved == duration + 1,
                "Unexpected duration: {}",
                metadata.time_saved
            );
        }
    }

    write_handle.await??;

    Ok(())
}

/// Test that multiple concurrent reads don't interfere with each other.
#[tokio::test]
async fn test_concurrent_reads() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "concurrent_read_test";
    let duration = 100;

    // Write to cache first
    let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    cache.put(repo_root_path, hash, &files, duration)?;

    // Perform concurrent reads
    let mut handles = Vec::new();
    for _ in 0..10 {
        let cache = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
        let repo_root = repo_root_path.to_owned();
        handles.push(tokio::spawn(async move { cache.fetch(&repo_root, hash) }));
    }

    // All reads should succeed
    for handle in handles {
        let result = handle.await??;
        assert!(result.is_some(), "Concurrent read should succeed");
        let (metadata, _) = result.unwrap();
        assert_eq!(metadata.time_saved, duration);
    }

    Ok(())
}

/// Test that temp files are cleaned up after concurrent writes.
#[tokio::test]
async fn test_concurrent_writes_cleanup_temp_files() -> Result<()> {
    let repo_root = tempdir()?;
    let repo_root_path = AbsoluteSystemPath::from_std_path(repo_root.path())?;

    // Create test files
    let test_file = repo_root_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let files = vec![AnchoredSystemPathBuf::from_raw("test.txt")?];
    let hash = "temp_cleanup_test";
    let duration = 100;

    // Perform concurrent writes
    let cache1 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache2 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;
    let cache3 = FSCache::new(Utf8Path::new("cache"), repo_root_path, None)?;

    let handle1 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache1.put(&repo_root, hash, &files, duration) })
    };
    let handle2 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache2.put(&repo_root, hash, &files, duration) })
    };
    let handle3 = {
        let files = files.clone();
        let repo_root = repo_root_path.to_owned();
        tokio::spawn(async move { cache3.put(&repo_root, hash, &files, duration) })
    };

    // Wait for all writes to complete
    let _ = handle1.await?;
    let _ = handle2.await?;
    let _ = handle3.await?;

    // Verify no orphaned temp files remain in cache directory
    let cache_dir = repo_root_path.join_component("cache");
    let temp_files: Vec<_> = std::fs::read_dir(cache_dir.as_std_path())?
        .filter_map(|e| e.ok())
        .filter(|e| e.file_name().to_string_lossy().ends_with(".tmp"))
        .collect();
    assert!(
        temp_files.is_empty(),
        "Orphaned temp files found after concurrent writes: {:?}",
        temp_files
    );

    // Verify exactly one archive file exists for the hash
    let archive_files: Vec<_> = std::fs::read_dir(cache_dir.as_std_path())?
        .filter_map(|e| e.ok())
        .filter(|e| {
            let name = e.file_name().to_string_lossy().to_string();
            name.contains(hash) && name.ends_with(".tar.zst")
        })
        .collect();
    assert_eq!(
        archive_files.len(),
        1,
        "Expected exactly one archive file, found: {:?}",
        archive_files
    );

    Ok(())
}

}
===== END FILE: crates/turborepo-cache/src/fs.rs =====

===== FILE: crates/turborepo-cache/src/cache_archive/create.rs =====
use std::{
    backtrace::Backtrace,
    fs,
    fs::OpenOptions,
    io::{BufWriter, Read, Write},
    path::Path,
    sync::atomic::{AtomicU64, Ordering},
};

use tar::{EntryType, Header};
use turbopath::{AbsoluteSystemPath, AbsoluteSystemPathBuf, AnchoredSystemPath, IntoUnix};

use crate::CacheError;

/// Atomic counter to ensure unique temp filenames within a single process.
/// Combined with PID, this guarantees uniqueness across concurrent tasks.
static TEMP_FILE_COUNTER: AtomicU64 = AtomicU64::new(0);

/// Generate a unique temporary filename in the same directory as the target.
///
/// Uses process ID and an atomic counter to ensure uniqueness both across
/// processes and within concurrent tasks in the same process.
fn generate_temp_path(final_path: &AbsoluteSystemPath) -> AbsoluteSystemPathBuf {
    let file_name = final_path.file_name().unwrap_or("cache").to_string();
    let unique_id = TEMP_FILE_COUNTER.fetch_add(1, Ordering::Relaxed);
    let temp_name = format!(".{}.{}.{}.tmp", file_name, std::process::id(), unique_id);
    final_path
        .parent()
        .expect("cache path must have parent")
        .join_component(&temp_name)
}

/// A writer for creating cache archives with atomic writes.
///
/// Uses write-to-temp-then-rename pattern for concurrent safety. When created
/// via [CacheWriter::create], writes go to a temporary file which is
/// atomically renamed to the final path on [CacheWriter::finish].
///
/// # Resource Management
///
/// Implements [Drop] to clean up temporary files if finish() is not called.
/// This ensures no orphaned temp files remain on disk after errors or panics.
pub struct CacheWriter<'a> {
    builder: tar::Builder<Box<dyn Write + 'a>>,
    /// The temporary path where the archive is being written.
    /// On finish(), this will be atomically renamed to final_path.
    temp_path: Option<AbsoluteSystemPathBuf>,
    /// The final destination path for the archive.
    final_path: Option<AbsoluteSystemPathBuf>,
}

impl Drop for CacheWriter<'_> {
    fn drop(&mut self) {
        // Clean up temp file if finish() was not called (e.g., due to error or panic).
        // We take() the path to avoid double-cleanup if Drop is called multiple times.
        if let Some(temp_path) = self.temp_path.take() {
            // Best-effort cleanup - ignore errors since we may be in a panic
            // or the file may have already been cleaned up/moved.
            let _ = temp_path.remove_file();
        }
    }
}

impl<'a> CacheWriter<'a> {
    // Appends data to tar builder.
    fn append_data(
        &mut self,
        header: &mut Header,
        path: impl AsRef<Path>,
        body: impl Read,
    ) -> Result<(), CacheError> {
        Ok(self.builder.append_data(header, path, body)?)
    }

fn append_link(
    &mut self,
    header: &mut Header,
    path: impl AsRef<Path>,
    target: impl AsRef<Path>,
) -> Result<(), CacheError> {
    Ok(self.builder.append_link(header, path, target)?)
}

/// Finish writing the archive.
///
/// If the archive was created with `create()`, this will atomically rename
/// the temporary file to the final destination path. This ensures that
/// concurrent readers will either see the complete old file or the complete
/// new file, never a partially written file.
///
/// After calling this method, the `Drop` implementation will not attempt
/// to clean up the temp file since it has been successfully renamed.
pub fn finish(mut self) -> Result<(), CacheError> {
    // Finish the tar archive - this writes the tar footer and flushes data.
    // The underlying zstd encoder (if used) has `auto_finish()` which ensures
    // compression is finalized when the encoder is dropped.
    self.builder.finish()?;

    // Take the paths before the rename. If rename succeeds, Drop won't try
    // to clean up. If rename fails, we return the error and Drop will clean up.
    if let (Some(temp_path), Some(final_path)) = (self.temp_path.take(), self.final_path.take())
    {
        // Atomically rename temp file to final destination.
        // The builder (and its file handle) will be dropped when `self` goes
        // out of scope at the end of this function.
        temp_path.rename(&final_path)?;
    }

    // Drop runs here - temp_path is None so no cleanup attempted
    Ok(())
}

pub fn from_writer(writer: impl Write + 'a, use_compression: bool) -> Result<Self, CacheError> {
    if use_compression {
        let zw = zstd::Encoder::new(writer, 0)?.auto_finish();
        Ok(CacheWriter {
            builder: tar::Builder::new(Box::new(zw)),
            temp_path: None,
            final_path: None,
        })
    } else {
        Ok(CacheWriter {
            builder: tar::Builder::new(Box::new(writer)),
            temp_path: None,
            final_path: None,
        })
    }
}

// Makes a new CacheArchive at the specified path
// Wires up the chain of writers:
// tar::Builder -> zstd::Encoder (optional) -> BufWriter -> File
//
// Uses atomic write pattern: writes to a temporary file, then renames
// to the final path on `finish()`. This ensures concurrent safety when
// multiple processes may be writing to the same cache location.
pub fn create(path: &AbsoluteSystemPath) -> Result<Self, CacheError> {
    let temp_path = generate_temp_path(path);

    let mut options = OpenOptions::new();
    options.write(true).create(true).truncate(true);

    let file = temp_path.open_with_options(options)?;

    // Flush to disk in 1mb chunks.
    let file_buffer = BufWriter::with_capacity(2usize.pow(20), file);

    let is_compressed = path.extension() == Some("zst");

    if is_compressed {
        let zw = zstd::Encoder::new(file_buffer, 0)?.auto_finish();

        Ok(CacheWriter {
            builder: tar::Builder::new(Box::new(zw)),
            temp_path: Some(temp_path),
            final_path: Some(path.to_owned()),
        })
    } else {
        Ok(CacheWriter {
            builder: tar::Builder::new(Box::new(file_buffer)),
            temp_path: Some(temp_path),
            final_path: Some(path.to_owned()),
        })
    }
}

// Adds a user-cached item to the tar
pub(crate) fn add_file(
    &mut self,
    anchor: &AbsoluteSystemPath,
    file_path: &AnchoredSystemPath,
) -> Result<(), CacheError> {
    // Resolve the fully-qualified path to the file to read it.
    let source_path = anchor.resolve(file_path);

    // Grab the file info to construct the header.
    let file_info = source_path.symlink_metadata()?;

    // Normalize the path within the cache
    let mut file_path = file_path.to_unix();
    file_path.make_canonical_for_tar(file_info.is_dir());

    let mut header = Self::create_header(&file_info)?;

    if matches!(header.entry_type(), EntryType::Regular) && file_info.len() > 0 {
        let file = source_path.open()?;
        self.append_data(&mut header, file_path.as_str(), file)?;
    } else if matches!(header.entry_type(), EntryType::Symlink) {
        // We convert to a Unix path because all paths in tar should be
        // Unix-style. This will get restored to a system path.
        let target = source_path.read_link()?.into_unix();
        self.append_link(&mut header, file_path.as_str(), &target)?;
    } else {
        self.append_data(&mut header, file_path.as_str(), &mut std::io::empty())?;
    }

    Ok(())
}

fn create_header(file_info: &fs::Metadata) -> Result<Header, CacheError> {
    let mut header = Header::new_gnu();

    let mode: u32;
    #[cfg(unix)]
    {
        use std::os::unix::fs::MetadataExt;
        mode = file_info.mode();
    }
    #[cfg(windows)]
    {
        // Windows makes up 0o666 for files, which in the Go code
        // we do: (0o666 & 0o755) | 0o111 which produces 0o755
        mode = 0o755
    }
    header.set_mode(mode);

    if file_info.is_symlink() {
        // We do *not* set the linkname here because it could be too long
        // Instead we set it when we add the file to the archive
        header.set_entry_type(EntryType::Symlink);
        header.set_size(0);
    } else if file_info.is_dir() {
        header.set_size(0);
        header.set_entry_type(EntryType::Directory);
    } else if file_info.is_file() {
        header.set_entry_type(EntryType::Regular);
        header.set_size(file_info.len());
    } else {
        // Throw an error if trying to create a cache that contains a type we don't
        // support.
        return Err(CacheError::CreateUnsupportedFileType(Backtrace::capture()));
    }

    // Consistent creation
    header.set_uid(0);
    header.set_gid(0);
    header.as_gnu_mut().unwrap().set_atime(0);
    header.set_mtime(0);
    header.as_gnu_mut().unwrap().set_ctime(0);

    Ok(header)
}

}

#[cfg(test)]
mod tests {
    use std::path::PathBuf;

use anyhow::Result;
use tempfile::tempdir;
use test_case::test_case;
use turbopath::{AbsoluteSystemPathBuf, AnchoredSystemPathBuf};

use super::*;
use crate::cache_archive::restore::CacheReader;

#[derive(Debug)]
enum FileType {
    Dir,
    Symlink { linkname: String },
    Fifo,
    File,
}

#[derive(Debug)]
struct CreateFileDefinition {
    path: AnchoredSystemPathBuf,
    mode: u32,
    file_type: FileType,
}

fn create_entry(anchor: &AbsoluteSystemPath, file: &CreateFileDefinition) -> Result<()> {
    match &file.file_type {
        FileType::Dir => create_dir(anchor, file),
        FileType::Symlink { linkname } => create_symlink(anchor, file, linkname),
        FileType::Fifo => create_fifo(anchor, file),
        FileType::File => create_file(anchor, file),
    }
}

fn create_dir(anchor: &AbsoluteSystemPath, file: &CreateFileDefinition) -> Result<()> {
    let path = anchor.resolve(&file.path);
    path.create_dir_all()?;

    #[cfg(unix)]
    {
        path.set_mode(file.mode & 0o777)?;
    }

    Ok(())
}

fn create_symlink(
    anchor: &AbsoluteSystemPath,
    file: &CreateFileDefinition,
    linkname: &str,
) -> Result<()> {
    let path = anchor.resolve(&file.path);
    path.symlink_to_file(linkname)?;

    Ok(())
}

#[cfg(unix)]
fn create_fifo(anchor: &AbsoluteSystemPath, file: &CreateFileDefinition) -> Result<()> {
    use std::ffi::CString;

    let path = anchor.resolve(&file.path);
    let path_cstr = CString::new(path.as_str())?;

    unsafe {
        libc::mkfifo(path_cstr.as_ptr(), 0o644);
    }

    Ok(())
}

#[cfg(windows)]
fn create_fifo(_: &AbsoluteSystemPath, _: &CreateFileDefinition) -> Result<()> {
    Err(CacheError::CreateUnsupportedFileType(Backtrace::capture()).into())
}

fn create_file(anchor: &AbsoluteSystemPath, file: &CreateFileDefinition) -> Result<()> {
    let path = anchor.resolve(&file.path);
    fs::write(&path, b"file contents")?;
    #[cfg(unix)]
    {
        path.set_mode(file.mode & 0o777)?;
    }

    Ok(())
}

#[test_case(
  vec![
     CreateFileDefinition {
       path: AnchoredSystemPathBuf::from_raw("hello world.txt").unwrap(),
       mode: 0o644,
       file_type: FileType::File,
     }
  ],
  None
  ; "create regular file"
)]
#[test_case(
    vec![
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("one").unwrap(),
            mode: 0o777,
            file_type: FileType::Symlink { linkname: "two".to_string() },
        },
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("two").unwrap(),
            mode: 0o777,
            file_type: FileType::Symlink { linkname: "three".to_string() },
        },
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("three").unwrap(),
            mode: 0o777,
            file_type: FileType::Symlink { linkname: "real".to_string() },
        },
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("real").unwrap(),
            mode: 0o777,
            file_type: FileType::File,
        }
    ],
    None
    ; "create symlinks"
)]
#[test_case(
    vec![
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("parent").unwrap(),
            mode: 0o777,
            file_type: FileType::Dir,
        },
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("parent/child").unwrap(),
            mode: 0o644,
            file_type: FileType::File,
        },
    ],
    None
    ; "create directory"
)]
#[test_case(
    vec![
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("one").unwrap(),
            mode: 0o644,
            file_type: FileType::Symlink { linkname: "two".to_string() },
        },
    ],
    None
    ; "create broken symlink"
)]
#[test_case(
    vec![
        CreateFileDefinition {
            path: AnchoredSystemPathBuf::from_raw("one").unwrap(),
            mode: 0o644,
            file_type: FileType::Fifo,
        }
    ],
    Some("attempted to create unsupported file type")
    ; "create unsupported"
)]
fn test_create(
    files: Vec<CreateFileDefinition>,
    #[allow(unused_variables)] expected_err: Option<&str>,
) -> Result<()> {
    'outer: for compressed in [false, true] {
        let input_dir = tempdir()?;
        let archive_dir = tempdir()?;
        let input_dir_path = AbsoluteSystemPathBuf::try_from(input_dir.path())?;
        let archive_path = if compressed {
            AbsoluteSystemPathBuf::try_from(archive_dir.path().join("out.tar.zst"))?
        } else {
            AbsoluteSystemPathBuf::try_from(archive_dir.path().join("out.tar"))?
        };

        let mut cache_archive = CacheWriter::create(&archive_path)?;

        for file in files.iter() {
            let result = create_entry(&input_dir_path, file);
            if let Err(err) = result {
                assert!(expected_err.is_some());
                assert_eq!(err.to_string(), expected_err.unwrap());
                continue 'outer;
            }

            let result = cache_archive.add_file(&input_dir_path, &file.path);
            if let Err(err) = result {
                assert!(expected_err.is_some());
                assert_eq!(err.to_string(), expected_err.unwrap());
                continue 'outer;
            }
        }

        cache_archive.finish()?;
    }

    Ok(())
}

#[test]
#[cfg(unix)]
fn test_add_trailing_slash_unix() {
    let mut path = PathBuf::from("foo/bar");
    assert_eq!(path.to_string_lossy(), "foo/bar");
    path.push("");
    assert_eq!(path.to_string_lossy(), "foo/bar/");

    // Confirm that this is idempotent
    path.push("");
    assert_eq!(path.to_string_lossy(), "foo/bar/");
}

#[test]
#[cfg(windows)]
fn test_add_trailing_slash_windows() {
    let mut path = PathBuf::from("foo\\bar");
    assert_eq!(path.to_string_lossy(), "foo\\bar");
    path.push("");
    assert_eq!(path.to_string_lossy(), "foo\\bar\\");

    // Confirm that this is idempotent
    path.push("");
    assert_eq!(path.to_string_lossy(), "foo\\bar\\");
}

#[test]
fn create_tar_with_really_long_name() -> Result<()> {
    let archive_dir = tempdir()?;
    let archive_dir_path = AbsoluteSystemPath::new(archive_dir.path().to_str().unwrap())?;

    let tar_dir = tempdir()?;
    let tar_dir_path = AbsoluteSystemPath::new(tar_dir.path().to_str().unwrap())?;

    let tar_path = tar_dir_path.join_component("test.tar");
    let mut archive = CacheWriter::create(&tar_path)?;
    let base = "this-is-a-really-really-really-long-path-like-so-very-long-that-i-can-list-all-of-my-favorite-directors-like-edward-yang-claire-denis-lucrecia-martel-wong-kar-wai-even-kurosawa";
    let file_name = format!("{base}.txt");
    let dir_symlink_name = format!("{base}-dir");
    let really_long_file = AnchoredSystemPath::new(&file_name).unwrap();
    let really_long_dir = AnchoredSystemPath::new(base).unwrap();
    let really_long_symlink = AnchoredSystemPath::new("this-is-a-really-really-really-long-symlink-like-so-very-long-that-i-can-list-all-of-my-other-favorite-directors-like-jim-jarmusch-michelangelo-antonioni-and-terrence-malick-symlink").unwrap();
    let really_long_dir_symlink = AnchoredSystemPath::new(&dir_symlink_name).unwrap();

    let really_long_path = archive_dir_path.resolve(really_long_file);
    really_long_path.create_with_contents("The End!")?;

    let really_long_symlink_path = archive_dir_path.resolve(really_long_symlink);
    really_long_symlink_path.symlink_to_file(really_long_file.as_str())?;

    let really_long_dir_path = archive_dir_path.resolve(really_long_dir);
    really_long_dir_path.create_dir_all()?;

    let really_long_dir_symlink_path = archive_dir_path.resolve(really_long_dir_symlink);
    really_long_dir_symlink_path.symlink_to_dir(really_long_dir.as_str())?;

    archive.add_file(archive_dir_path, really_long_file)?;
    archive.add_file(archive_dir_path, really_long_dir_symlink)?;
    archive.add_file(archive_dir_path, really_long_dir)?;
    archive.add_file(archive_dir_path, really_long_symlink)?;

    archive.finish()?;

    let restore_dir = tempdir()?;
    let restore_dir_path = AbsoluteSystemPath::new(restore_dir.path().to_str().unwrap())?;

    let mut restore = CacheReader::open(&tar_path)?;
    let files = restore.restore(restore_dir_path)?;
    assert_eq!(files.len(), 4);
    assert_eq!(files[0].as_str(), really_long_file.as_str());
    assert_eq!(files[1].as_str(), really_long_dir.as_str());
    assert_eq!(files[2].as_str(), really_long_symlink.as_str());
    assert_eq!(files[3].as_str(), really_long_dir_symlink.as_str());
    Ok(())
}

#[test]
fn test_compression() -> Result<()> {
    let mut buffer = Vec::new();
    let mut encoder = zstd::Encoder::new(&mut buffer, 0)?.auto_finish();
    encoder.write_all(b"hello world")?;
    // Should finish encoding on drop
    drop(encoder);

    let mut decoder = zstd::Decoder::new(&buffer[..])?;
    let mut out = String::new();
    decoder.read_to_string(&mut out)?;

    assert_eq!(out, "hello world");

    Ok(())
}

/// Test that temp files are cleaned up when CacheWriter is dropped without
/// calling finish().
#[test]
fn test_cachewriter_cleanup_on_drop() -> Result<()> {
    let archive_dir = tempdir()?;
    let archive_path =
        AbsoluteSystemPathBuf::try_from(archive_dir.path().join("test.tar.zst"))?;

    {
        // Create a CacheWriter but don't call finish()
        let _writer = CacheWriter::create(&archive_path)?;
        // Writer is dropped here without finish()
    }

    // Verify no orphaned temp files remain
    let entries: Vec<_> = std::fs::read_dir(archive_dir.path())?
        .filter_map(|e| e.ok())
        .filter(|e| e.file_name().to_string_lossy().ends_with(".tmp"))
        .collect();
    assert!(
        entries.is_empty(),
        "Temp files were not cleaned up: {:?}",
        entries
    );

    Ok(())
}

/// Test that temp files are cleaned up when CacheWriter errors during
/// writing.
#[test]
fn test_cachewriter_cleanup_on_error() -> Result<()> {
    let archive_dir = tempdir()?;
    let archive_path =
        AbsoluteSystemPathBuf::try_from(archive_dir.path().join("test.tar.zst"))?;

    let input_dir = tempdir()?;
    let input_dir_path = AbsoluteSystemPathBuf::try_from(input_dir.path())?;

    {
        let mut writer = CacheWriter::create(&archive_path)?;

        // Try to add a file that doesn't exist - this will error
        let nonexistent_file = AnchoredSystemPathBuf::from_raw("nonexistent.txt")?;
        let result = writer.add_file(&input_dir_path, &nonexistent_file);
        assert!(result.is_err());

        // Writer is dropped here without finish()
    }

    // Verify no orphaned temp files remain
    let entries: Vec<_> = std::fs::read_dir(archive_dir.path())?
        .filter_map(|e| e.ok())
        .filter(|e| e.file_name().to_string_lossy().ends_with(".tmp"))
        .collect();
    assert!(
        entries.is_empty(),
        "Temp files were not cleaned up after error: {:?}",
        entries
    );

    Ok(())
}

/// Test that temp file paths are unique even when generated concurrently.
#[test]
fn test_generate_temp_path_uniqueness() -> Result<()> {
    let archive_dir = tempdir()?;
    let base_path = AbsoluteSystemPathBuf::try_from(archive_dir.path().join("hash.tar.zst"))?;

    // Generate many temp paths and verify uniqueness
    let paths: Vec<_> = (0..100).map(|_| generate_temp_path(&base_path)).collect();

    let unique_count = paths.iter().collect::<std::collections::HashSet<_>>().len();
    assert_eq!(
        unique_count,
        paths.len(),
        "Temp paths should be unique, but found duplicates"
    );

    Ok(())
}

/// Test that successful finish() properly renames temp file to final path.
#[test]
fn test_cachewriter_finish_renames_file() -> Result<()> {
    let archive_dir = tempdir()?;
    let archive_path =
        AbsoluteSystemPathBuf::try_from(archive_dir.path().join("test.tar.zst"))?;

    let input_dir = tempdir()?;
    let input_dir_path = AbsoluteSystemPathBuf::try_from(input_dir.path())?;

    // Create a test file
    let test_file = input_dir_path.join_component("test.txt");
    test_file.create_with_contents("test content")?;

    let mut writer = CacheWriter::create(&archive_path)?;
    let file_path = AnchoredSystemPathBuf::from_raw("test.txt")?;
    writer.add_file(&input_dir_path, &file_path)?;
    writer.finish()?;

    // Verify final file exists
    assert!(archive_path.exists(), "Final archive should exist");

    // Verify no temp files remain
    let temp_files: Vec<_> = std::fs::read_dir(archive_dir.path())?
        .filter_map(|e| e.ok())
        .filter(|e| e.file_name().to_string_lossy().ends_with(".tmp"))
        .collect();
    assert!(
        temp_files.is_empty(),
        "No temp files should remain after finish(): {:?}",
        temp_files
    );

    Ok(())
}

}
===== END FILE: crates/turborepo-cache/src/cache_archive/create.rs =====

===== FILE: crates/turborepo-cache/src/cache_archive/mod.rs =====
#![allow(dead_code)]

mod create;
mod restore;
mod restore_directory;
mod restore_regular;
mod restore_symlink;

pub use create::CacheWriter;
pub use restore::CacheReader;
===== END FILE: crates/turborepo-cache/src/cache_archive/mod.rs =====

Output Format (REQUIRED — your response MUST contain these exact sections)

Current State

Required Changes

Ripple Effects

What This Unlocks

Effort & Risk

IMPORTANT: Structure your response using the exact section headings above.
