Skip to content

Instantly share code, notes, and snippets.

https://archive.org/download/nycTaxiTripData2013
http://www.kfirlavi.com/blog/2012/11/06/elegant-locking-of-bash-program/
@yashk
yashk / nc
Created May 3, 2017 04:21
nc
while true
while> do
while> nc -l -p 9999 localhost
while> done
cat test.yml| nc -c localhost 9999
@yashk
yashk / tibco_jobs.png
Last active April 26, 2017 03:26
openings at tibco
test
Role Overview:
We have an immediate need for a senior-level Big data Engineer with core Java skills to join our TIBCO Mashery team. You'll have an opportunity to work on the Mashery Datamart that leverages Big data and Cloud technologies such as Kafka, Spark, Hadoop, AWS, etc. Mashery Datamart is a critical component of the Mashery API Management product that provides data to customers that can be used for performance and usage measurements and for billing functions.
Responsibilities:
Creating complex data processing pipelines, using apache spark/hadoop.
Maintain and expand batch aggregations/reports to existing data mart processing using Hive/Oozie.
Maintain and expand real-time real-time aggregations using Spark streaming,Cassandra,Graphite, Elasticsearch.
Writing Unit tests, integration tests and help in defining and executing performance Tests for Datamart components.
Deploying data pipelines in production based on Continuous Delivery practices.
Create Dashboards/Alerts exhibiting critical metrics and
jmap -F -dump:format=b,file=<pid>_hd.hprof <pid>
@yashk
yashk / hiveanalysis.md
Last active January 20, 2017 14:18
Which table HiveMetaStoreClient.getPartition queries and does that table have index on partitions

This [code][1] is exectuted to get parition list

private MPartition getMPartition(String dbName, String tableName, List<String> part_vals)
     .........
     ........
     query =
          pm.newQuery(MPartition.class,
              "table.tableName == t1 && table.database.name == t2 && partitionName == t3");
[merge]
tool = intellij
[mergetool "intellij"]
cmd = idea merge $(cd $(dirname "$LOCAL") && pwd)/$(basename "$LOCAL") $(cd $(dirname "$REMOTE") && pwd)/$(basename "$REMOTE") $(cd $(dirname "$BASE") && pwd)/$(basename "$BASE") $(cd $(dirname "$MERGED") && pwd)/$(basename "$MERGED")
trustExitCode = true
[diff]
tool = intellij
[difftool "intellij"]
cmd = idea diff $(cd $(dirname "$LOCAL") && pwd)/$(basename "$LOCAL") $(cd $(dirname "$REMOTE") && pwd)/$(basename "$REMOTE")
cat ~/Downloads/fsync.log | jq -r '.results[] | [.time, .host , (.message | split("ms")[0] | split(" ")[8])] | @csv'