This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| """ | |
| RunPod A100 Batch Embedding Script — v3 (File-Based Pipeline) | |
| Eliminates bore tunnel bottleneck by decoupling DB I/O from GPU. | |
| STRATEGY: Pull → Embed → Push (in chunks) | |
| 1. PULL: Stream N docs from remote DB to local CSV (one-time bore cost) | |
| 2. EMBED: Read local CSV, encode on GPU at full speed (no network!) | |
| 3. PUSH: Bulk-write embeddings back to remote DB (one-time bore cost) | |
| 4. Repeat for next chunk |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| """ | |
| RunPod A100 Batch Embedding Script — v3 (File-Based Pipeline) | |
| Eliminates bore tunnel bottleneck by decoupling DB I/O from GPU. | |
| STRATEGY: Pull → Embed → Push (in chunks) | |
| 1. PULL: Stream N docs from remote DB to local CSV (one-time bore cost) | |
| 2. EMBED: Read local CSV, encode on GPU at full speed (no network!) | |
| 3. PUSH: Bulk-write embeddings back to remote DB (one-time bore cost) | |
| 4. Repeat for next chunk |