Skip to content

Instantly share code, notes, and snippets.

View devender-yadav's full-sized avatar

Devender Yadav devender-yadav

View GitHub Profile
@devender-yadav
devender-yadav / code_quality.md
Created July 21, 2019 19:35
My python notes

Static code analysis for python application

Pylint

How to install

pip install pylint

How to run

@devender-yadav
devender-yadav / mongo.md
Created May 22, 2019 17:28
Merge mongo data distributed on 2 locations

Let us assume, initally mongo is using /data/db1 dbpath and then /data/db2

Start mongo /data/db1

sudo mongod --dbpath=/data/db1

Export data from mongo /data/db1

mongoexport --db test_db --collection collection1 --out tb_collection1_db1.json

package com.dev.util.aws;
import java.io.File;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.List;
import com.amazonaws.auth.AWSCredentials;
@devender-yadav
devender-yadav / pyspark-jdbc-intro.md
Last active June 19, 2021 04:41
Pyspark JDBC samples
@devender-yadav
devender-yadav / spark_tutorial.md
Last active October 10, 2022 10:12
Introduction to Spark
@devender-yadav
devender-yadav / sqoop.md
Last active December 21, 2018 05:33
To make sqoop job efficient

How to choose split by column?

Idea is to use a uniformly distributed numeric column. So, we should prefer primary key and then any numeric column and we should avoid using text column for splitting.

How to determine the number of Mappers?

We should consider:

  • Number of rows
  • Number of tasks that can be run in parallel in Hadoop
@devender-yadav
devender-yadav / gitlab.md
Last active November 27, 2018 09:54
Access postgres database of gitlab
sudo -u gitlab-psql /opt/gitlab/embedded/bin/psql -h /var/opt/gitlab/postgresql/ gitlabhq_production

Sql query to fetch discussions

 SELECT n.id,
 mr.iid AS merge_request_iid,
@devender-yadav
devender-yadav / AmazonTranslateUtil.java
Created July 28, 2018 20:00
Translate non-English text to English using AWS Translate
import com.amazonaws.auth.AWSCredentials;
import com.amazonaws.auth.AWSStaticCredentialsProvider;
import com.amazonaws.auth.BasicAWSCredentials;
import com.amazonaws.regions.Regions;
import com.amazonaws.services.translate.AmazonTranslate;
import com.amazonaws.services.translate.AmazonTranslateClientBuilder;
import com.amazonaws.services.translate.model.TranslateTextRequest;
import com.amazonaws.services.translate.model.TranslateTextResult;
public class AmazonTranslateUtil {