@jibing57
jibing57 / aws_elb_access_log_parser.rb
Last active May 26, 2020 18:27
A simple Ruby script that parses AWS ELB access logs into a CSV file
require 'csv'
require 'optparse'
class AWSELBAccessLogParser
  def initialize
    # Space-separated field names of one ELB access log line, in order.
    @@elb_access_log_format = %Q(timestamp elb client:port backend:port request_processing_time backend_processing_time response_processing_time elb_status_code backend_status_code received_bytes sent_bytes "request" "user_agent" ssl_cipher ssl_protocol)
    # puts "elb_access_log_format is #{@@elb_access_log_format.split(" ")}"
    @@line_regex = /
@NickCrews
NickCrews / coalesce_parquet.py
Last active January 10, 2024 03:48
Coalesce parquet files
"""coalesce_parquets.py
gist of how to coalesce small row groups into larger row groups.
Solves the problem described in https://issues.apache.org/jira/browse/PARQUET-1115
"""
from __future__ import annotations
from pathlib import Path
from typing import Callable, Iterable, TypeVar
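
The preview above is cut off before the implementation, so the gist's actual code is not shown here. As a rough illustration of the idea in the docstring (merging many small units into fewer large ones), the following is a minimal pure-Python sketch. The names `coalesce`, `max_size`, and `sizer` are hypothetical and not taken from the gist; a parquet version would presumably apply the same grouping to small row groups and write each batch out as one larger row group.

```python
from __future__ import annotations

from typing import Callable, Iterable, Iterator, TypeVar

T = TypeVar("T")


def coalesce(
    items: Iterable[T],
    max_size: int,
    sizer: Callable[[T], int] = len,
) -> Iterator[list[T]]:
    """Group consecutive items into batches of total size <= max_size.

    Hypothetical helper, not the gist's implementation. An item that is
    larger than max_size on its own is still emitted, alone in its batch.
    """
    batch: list[T] = []
    batch_size = 0
    for item in items:
        size = sizer(item)
        # Flush the current batch if adding this item would overflow it.
        if batch and batch_size + size > max_size:
            yield batch
            batch, batch_size = [], 0
        batch.append(item)
        batch_size += size
    # Emit whatever is left over after the input is exhausted.
    if batch:
        yield batch


if __name__ == "__main__":
    # Four small "row groups" of 2, 1, 3, and 1 rows, merged up to 4 rows each.
    small_groups = [[1, 2], [3], [4, 5, 6], [7]]
    for merged in coalesce(small_groups, max_size=4):
        print(merged)
```

Because the function is a generator, it only holds one batch in memory at a time, which is the property that matters when the inputs are large tables rather than short lists.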