Skip to content

Instantly share code, notes, and snippets.

View harshavardhana's full-sized avatar
🌚
I may be slow to respond.

Harshavardhana harshavardhana

🌚
I may be slow to respond.
View GitHub Profile
@harshavardhana
harshavardhana / INSTALL-RDMA-PACKAGES.md
Last active March 30, 2026 06:54
MinIO AIStor RDMA RPM Installation Guide (EDGE)

MinIO AIStor RDMA RPM Installation Guide (EDGE)

Installing MinIO AIStor RDMA RPM Package

Modes of Operation

MinIO AIStor has two independent communication layers, each with an HTTP and RDMA mode. They are controlled separately and can be mixed in any combination.

 β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”

MinIO AIStor vs SeaweedFS: Comprehensive Comparison

Overview

MinIO AIStor SeaweedFS
Latest Release RELEASE.2026-03-20T23-11-32Z (Mar 20, 2026) v4.17 (Mar 11, 2026)
License Proprietary (enterprise) Apache 2.0 (OSS) / Commercial ($1/TB/mo)
Architecture Single binary, no external dependencies 3-tier: Master + Volume + Filer servers, requires external metadata database
Primary Author MinIO Inc. (experienced engineering team) Primarily Chris Lu (~9,018 of ~11,000 commits)

MinIO AIStor vs Dell AI Data Platform: A Hardware-Level Comparison

In response to Michael Dell's LinkedIn post claiming: "12x faster vector indexing, 3x faster processing with Lightning FS (the fastest parallel file system in the world), feeding GPUs at 150 GB/s per rack."

Let's look at the actual hardware behind both platforms and compare published numbers.


The Hardware Dell Is Talking About

@harshavardhana
harshavardhana / storj-vs-minio-aistor.md
Created March 10, 2026 22:01
Storj S3 vs MinIO AIStor: Technical Comparison

Storj S3 vs MinIO AIStor: Technical Comparison

The Elephant in the Room: Storj's S3 Gateway IS MinIO

storj/gateway-st/miniogw implements MinIO's ObjectLayer interface, routing S3 API calls to the Storj network instead of local disks. Storj's entire S3-compatible surface area is a MinIO fork β€” they are not building an S3 implementation; they are building a storage backend adapter for MinIO.

gateway-mt (the hosted, multi-tenant variant) is the same pattern wrapped in a MultiTenancyLayer. When you connect to Storj's S3 endpoint, you are talking to a MinIO process.

This is not incidental. It means Storj's S3 compatibility ceiling is bounded by whatever version of MinIO they have forked and are maintaining β€” and their own issue tracker (edge#27: "Replace Minio fork with most recent Apache2 version") shows they are perpetually behind upstream.

@harshavardhana
harshavardhana / README.md
Last active March 8, 2026 06:20
PySpark parquet overwrite pattern β€” tests partition prefix visibility after overwrite on S3/MinIO

PySpark Parquet Overwrite β€” Partition Prefix Visibility Test

Tests that after Spark overwrites partitioned Parquet files on S3/MinIO, the date-level partition prefixes remain visible in delimited ListObjectsV2 so that Spark's partition discovery still works correctly.

What the test does

  1. Generates two sample CSV files (batch1.csv, batch2.csv) with the same schema and same date partitions but different values.
@harshavardhana
harshavardhana / minio-aistor-vs-oss-final.md
Last active January 27, 2026 07:22
MinIO AIStor vs MinIO OSS - Complete Technical Comparison (13,061 commits analyzed)

MinIO AIStor vs MinIO OSS - Complete Technical Comparison (13,061 commits analyzed)

MinIO AIStor vs MinIO OSS - Complete Technical Comparison

Analysis based on full commit history review of 13,061 commits


Table of Contents

@harshavardhana
harshavardhana / minio-aistor-vs-oss-comprehensive.md
Created January 24, 2026 08:30
MinIO AIStor vs MinIO OSS - Comprehensive Technical Comparison (2800+ commits analyzed)

MinIO AIStor vs MinIO OSS - Comprehensive Technical Comparison

Analysis based on full commit history review of 2800+ commits


Table of Contents

  1. Executive Summary
  2. Codebase Statistics
@harshavardhana
harshavardhana / replication-dataflow.md
Last active December 23, 2025 19:58
MinIO AIStor: Synchronous vs Asynchronous Replication Dataflow

MinIO: Synchronous vs Asynchronous Replication

Synchronous Replication

Client waits for replication to complete before receiving response.

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”         β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”         β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚        β”‚  PUT    β”‚      SOURCE CLUSTER         β”‚         β”‚   REMOTE   β”‚
β”‚ Client β”œβ”€β”€β”€β”€β”€β”€β”€β”€β–Ίβ”‚                             β”‚         β”‚   TARGET   β”‚
## template:jinja
{#
This file (/etc/cloud/templates/hosts.debian.tmpl) is only utilized
if enabled in cloud-config. Specifically, in order to enable it
you need to add the following to config:
manage_etc_hosts: True
-#}
# Your system has configured 'manage_etc_hosts' as True.
# As a result, if you wish for changes to this file to persist
# then you will need to either
#!/bin/bash
docker system prune -af --filter "until=8h"