Using PySpark and AWS Glue to analyze multi-line log files
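Log file analytics with Spark can be tricky. One common problem is multi-line logs: Spark reads text input line by line by default, so a log entry that spans several lines (a stack trace, for example) ends up split across several records. In this post I explain how you can use PySpark to get your multi-line logs into a structured data frame.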