Skip to content

Instantly share code, notes, and snippets.

View akki's full-sized avatar
🇮🇳
.

Akshesh Doshi akki

🇮🇳
.
View GitHub Profile
@akki
akki / mongo-hadoop-core-2.0.2.jar
Created June 1, 2018 11:55
JAR files for MongoDB-Hadoop connector
" zip.vim version v27
" Browsing zipfile /home/hadoopuser/hadoop/lib/mongo-hadoop-core-2.0.2.jar
" Select a file with cursor and press ENTER
META-INF/
META-INF/MANIFEST.MF
com/
com/mongodb/
com/mongodb/hadoop/
com/mongodb/hadoop/MongoOutputFormat.class
[Unit]
Requires=zookeeper.service
After=zookeeper.service
[Service]
Type=simple
User=kafka
ExecStart=/bin/sh -c '/home/kafka/kafka/bin/kafka-server-start.sh /home/kafka/kafka/config/server.properties > /home/kafka/kafka/kafka.log 2>&1'
ExecStop=/home/kafka/kafka/bin/kafka-server-stop.sh
Restart=on-abnormal
@akki
akki / stack.yml
Last active March 2, 2020 04:08
Docker-Swarm-Operator-for-Airflow-blog
version: '3'
services:
postgres:
image: postgres:9.6-alpine
environment:
- POSTGRES_USER=test
- POSTGRES_PASSWORD=test
- POSTGRES_DB=test
pod:
@akki
akki / benchmark-spark-magic-committer.py
Last active March 28, 2024 14:22
Spark app with magic committer
# These extra-configs enable the magic committer.
# A _SUCCESS file with data is generated if these extra configs are passed. The file would otherwise be zero-byte in size, indicating that magic commiter did NOT get enabled.
extra_configs = {
"spark.sql.sources.commitProtocolClass": "org.apache.spark.internal.io.cloud.PathOutputCommitProtocol",
"spark.sql.parquet.output.committer.class": "org.apache.hadoop.mapreduce.lib.output.BindingPathOutputCommitter",
"spark.hadoop.mapreduce.outputcommitter.factory.scheme.s3a": "org.apache.hadoop.fs.s3a.commit.S3ACommitterFactory",
"spark.hadoop.fs.s3a.committer.name": "magic",
"spark.hadoop.fs.s3a.committer.magic.enabled": "true",
}