gcp

Gist by @davehowell, last active January 16, 2025.

Execution details

select * from `region-here`.INFORMATION_SCHEMA.JOBS_TIMELINE_BY_PROJECT
where job_id = "bquxjob_blah_blah"

list tables

select
  table_type, table_name
  --, count(*)
from `<project AKA catalog>.<dataset AKA schema>`.INFORMATION_SCHEMA.TABLES
where table_name like '%_raw'
--group by table_type

list tables with size

select * from `region-australia-southeast1`.INFORMATION_SCHEMA.TABLE_STORAGE_BY_PROJECT
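As a sketch, that same view can be sorted to surface the biggest tables first; the byte columns are assumed from the TABLE_STORAGE schema, and the region qualifier is a placeholder for wherever your data lives:

```sql
-- Largest tables by logical size in one region
SELECT
  table_schema AS dataset,
  table_name,
  ROUND(total_logical_bytes / POW(1024, 3), 2)  AS logical_gib,
  ROUND(total_physical_bytes / POW(1024, 3), 2) AS physical_gib
FROM `region-australia-southeast1`.INFORMATION_SCHEMA.TABLE_STORAGE_BY_PROJECT
ORDER BY total_logical_bytes DESC
LIMIT 20;
```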

apply clustering with table swap

https://cloud.google.com/bigquery/docs/manage-partition-cluster-recommendations#apply_cluster_recommendations
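A minimal sketch of the table-swap approach (all names are placeholders; note that partitioning must be re-declared on the new table or it is lost):

```sql
-- 1. Rebuild the table with the desired clustering
CREATE TABLE `my_project.my_dataset.my_table_new`
PARTITION BY DATE(event_ts)
CLUSTER BY customer_id
AS SELECT * FROM `my_project.my_dataset.my_table`;

-- 2. Swap: drop (or rename away) the old table, then rename the new one
DROP TABLE `my_project.my_dataset.my_table`;
ALTER TABLE `my_project.my_dataset.my_table_new` RENAME TO `my_table`;
```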

Architecture

  • Capacitor - proprietary columnar format
  • Colossus - distributed storage
  • Dremel - distributed processing engine
  • Jupiter - petabit network

Query lifecycle

API request management -> Lexing & parsing SQL -> Query planning (<-> Catalog resolution) -> Query plan execution (<-> Scheduling and dynamic planning) -> Finalize results

Query optimisation

Summary of https://youtu.be/iz6lxi9BczA?feature=shared

  • fewer columns: minimize columnstore bytes scanned, e.g. use `SELECT * EXCEPT (...)`

  • ensure predicates align to partitions & clustering; i.e. partition and cluster on common join & where-clause columns

    • clustering improves efficiency even more than partitioning
  • consider late aggregation (shuffle as few times and as late as possible)

    • although filtering early is often better
  • avoid group by (and therefore shuffle) by nesting repeated values, e.g. use array_length instead of counts

  • sometimes repeating a where-clause filter for the same column on different tables helps the query planner; don't rely on the join to do this (i.e. star-join pre-filter optimization works on BQ)

  • use the largest table first in the joins, then decreasing size; this helps with broadcast joins

  • put the most selective filter (the one that eliminates the most data) first in the where clause

  • order by & limit happen last, on the final single slot, so avoid using them

    • use the preview option instead of a limit query when exploring the data
  • use execution details to look at the plan & graph; look for

    • dominating stages

    • skew (consider rebalancing partitions & clustering)

    • cpu time (consider approximate functions)
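A couple of the tips above, sketched as BigQuery SQL (table and column names are hypothetical):

```sql
-- Fewer columns: skip wide payload columns rather than SELECT *
SELECT * EXCEPT (raw_payload, debug_blob)
FROM `my_project.my_dataset.events`
WHERE event_date = '2025-01-01';   -- predicate aligned to the partition column

-- Approximate functions: cheaper than an exact COUNT(DISTINCT ...)
SELECT
  event_date,
  APPROX_COUNT_DISTINCT(user_id) AS approx_users
FROM `my_project.my_dataset.events`
WHERE event_date >= '2025-01-01'
GROUP BY event_date;
```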

partitioning & clustering

Partitions should be > 1GB; partition on a low-cardinality column. Types allowed for partitioning:

  • Ingestion timestamp
  • A date/time column, down to 1-hour granularity
  • Integer range

Cluster on up to 4 columns; it physically sorts rows (within partitions). BigQuery automatically reclusters to maintain sort order.
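A minimal DDL sketch of a partitioned & clustered table (all names are placeholders):

```sql
CREATE TABLE `my_project.my_dataset.events`
(
  event_ts    TIMESTAMP,
  customer_id STRING,
  country     STRING
)
PARTITION BY DATE(event_ts)          -- daily partitions from the timestamp
CLUSTER BY customer_id, country;     -- up to 4 clustering columns; order matters
```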

Materialized view

could reduce data scanned, unless underlying table is frequently changing - this is probably better for performance than cost
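A sketch of a materialized view over a hypothetical events table, pre-aggregating a daily count so queries against it scan far less data:

```sql
CREATE MATERIALIZED VIEW `my_project.my_dataset.daily_events` AS
SELECT
  DATE(event_ts) AS event_date,
  COUNT(*) AS event_count
FROM `my_project.my_dataset.events`
GROUP BY event_date;
```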

check gcloud logins

gcloud auth list

To login, run:

    gcloud auth login <email-account>

get GKE temporary creds

    gcloud container clusters get-credentials --project=<project_id> --region=<region> <cluster-name>
## Compute Engine
MIG - managed instance groups (for autoscaling)
- lift n shift, where you don't want to refactor
## Kubernetes Engine GKE
- managed, cloud-native K8s
- supports complex configurations
- capable, but complex, e.g. complex manifests
- autopilot helps scale-on-demand
- Pods, Deployments, ReplicaSets, DaemonSets
## Cloud Functions
## App Engine
- GCP's first serverless offering (so it's kind of legacy)
- limited language runtime support
- particular structure and boilerplate required
## Cloud Run
- fully managed, for simpler container use cases
- narrow set of use cases, limited
- good for synchronous event-driven applications and microservices
- also for async, events from http, gRPC, Pub/Sub, Cloud Scheduler, Cloud Tasks
- Stateless, i.e. if you need db or backend then use GKE instead
## Google Cloud SQL
- Managed
- MySQL, Postgres, MSSQL
## Google Cloud Spanner
- Compute nodes + distributed storage using Colossus (distributed, replicated file system)
- multilang support: C#, C++, Golang, Java, Node.js, PHP, Python, Ruby
- handles burst traffic
- proprietary, write API is not just SQL
- Scalable, but expensive
- minimum of 3 nodes
## BigTable
- Good for timeseries, IoT
## BigQuery
- Built on... see https://cloud.google.com/blog/products/data-analytics/new-blog-series-bigquery-explained-overview
- Dremel (compute), sql -> exec trees. Leaves are "slots", branches are "mixers" (aggregations and calculations)
- Jupiter (petabit network for shuffling)
- Colossus (global columnar storage optimised with nested type compression), successor to GFS. Also used by Spanner. "Curator" (scalable metadata)
- internal format is "Capacitor" see https://cloud.google.com/blog/products/bigquery/inside-capacitor-bigquerys-next-generation-columnar-storage-format
- Borg (orchestrator, a precursor to Kubernetes), allocates hardware resources
## Anthos
- orchestration and management layer: scalable, managed k8s (GKE) & VMs
- also supports hybrid & multi-cloud (Google Distributed Cloud) - onprem / edge / cloud , includes hardware & software
## BeyondCorp
- zero trust security model
- not just network-based, includes MFA and managed device policy awareness
- enables security outside a VPN (really? ... if it's good enough for GOOG then I guess)
## Sensitive Data Protection SDP
- Cloud Data Loss Prevention and the DLP API are now part of SDP
- auto profiling on bigquery
- can feed to "Chronicle" and Security Command Center
- classify & de-identify / masking
## Storage
- blob - Cloud Storage (GCS) - global edge caching, cheap archiving
- block (persistent disk, or local SSD)
- filestore (share between apps, managed, scalable, predictable)
- this is mountable NFS, used for k8s or where shared proper filesystem required
- Firestore (mobile apps), the successor to Cloud Datastore
- artifact registry ( containers, OS & library packages)
- workspace storage ( google workspace essentials, i.e. g-suite stuff)
- data transfer ( services (DTS) or transfer appliance )
- Storage tiers: standard, nearline (access monthly), coldline (access quarterly), archive (access yearly)
## Security foundations blueprint
- defense in depth, at scale, by default
- BeyondProd (the service-to-service counterpart of BeyondCorp, which covers users & devices)
- shared fate relationship (not just shared responsibility)
## Networking
Edge Nodes / Edge point of presence
Zonal
Regional (at least 3 zones)
Multi-Regional
Global
## Connectivity
- Cloud interconnect (3 options):
- Dedicated interconnect - private IP address space connectivity, you manage the equipment at a colo G PoP
- Partner interconnect - private IP address space connectivity, ISP manages equipment at colo
- Cross-cloud - between clouds e.g. Azure <-> GCP
- Peering - for google workspace applications (AKA G-Suite: gmail, drive, docs, sheets, etc)
- Carrier peering - public IP connectivity, ISP manages the equipment at colo
- Direct Peering
- MPLS
- CE customer edge
- PE provider edge
- AC attachment circuit - physical or virtual circuit attaching a CE to a PE
- PW pseudowire, a bidirectional virtual connection between two PEs
## Identity, role, privilege
Members (Principals/Entity)
|- user account
|- service account
Roles
|- basic
|- predefined
|- custom
Members have Roles which provide privileges for resources within a project
CLIs - gcloud, gsutil, bq, kubectl, etc

Anonymous users from all over the world access a public health information website hosted in an on-premises EHR data center. The servers that host this website are older, and users are complaining about sluggish response times. There has also been a recent increase of distributed denial-of-service attacks toward the website. The attacks always come from the same IP address ranges. EHR management has identified the public health information website as an easy, low risk application to migrate to Google Cloud. You need to improve access latency and provide a security solution that will prevent the denial-of-service traffic from entering your Virtual Private Cloud (VPC) network. What should you do?

A. Deploy an external HTTP(S) load balancer, configure VPC firewall rules, and move the applications onto Compute Engine virtual machines.

B. Deploy an external HTTP(S) load balancer, configure Google Cloud Armor, and move the application onto Compute Engine virtual machines.

C. Containerize the application and move it into Google Kubernetes Engine (GKE). Create a GKE service to expose the pods within the cluster, and set up a GKE network policy.

D. Containerize the application and move it into Google Kubernetes Engine (GKE). Create an internal load balancer to expose the pods outside the cluster, and configure Identity-Aware Proxy (IAP) for access.

A is not correct because firewall rules do not block malicious traffic into a VPC but rather block it at the VM level.

B is correct because the external HTTP(S) load balancer will improve access latency and Cloud Armor can be configured to block the Distributed Denial-of-Service (DDoS) attack.

C is not correct because a GKE service does not expose a set of pods outside of a cluster and a GKE network policy only filters traffic between pods and services.

D is not correct because a GKE internal load balancer will not load balance external traffic and anonymous users need access to the website so IAP is not a fit.

Format bytes

Spreadsheet custom number formats only support up to 3 conditional sections:

    [<1000000]0.00," KB";[<1000000000]0.00,," MB";0.00,,," GB"

    [<100000]0.00," KB";[<100000000]0.00,," MB";0.00,,," GB"
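As a rough Python sketch of what those format strings do (each trailing comma in a section scales the value down by 1,000; the thresholds below follow the first format string above):

```python
def format_bytes(n: float) -> str:
    """Mimic the 3-section format:
    [<1000000]0.00," KB";[<1000000000]0.00,," MB";0.00,,," GB"
    """
    if n < 1_000_000:
        return f"{n / 1_000:.2f} KB"        # one comma: divide by 1,000
    if n < 1_000_000_000:
        return f"{n / 1_000_000:.2f} MB"    # two commas: divide by 1,000,000
    return f"{n / 1_000_000_000:.2f} GB"    # three commas: divide by 1e9
```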
