jaysoncena / pyspark-apache-logs.py
Created March 10, 2016 13:43
PySpark code to analyze Tomcat logs (can also be used with Apache HTTPd logs)
%pyspark
import re
from pyspark.sql.types import *
from pyspark.sql import Row
from datetime import datetime
access = "ABCD"