Custom Python App on EMR Serverless

Environment Reference I had to jump through a few hoops to get a PySpark application running on EMR Serverless. Below are the steps I followed, along with final functioning configuration, and at the bottom of this post is a few errors I encountered along the way. Steps 1. Setup Build Environment For a packaged application … Continue reading “Custom Python App on EMR Serverless”

Default Argument Value Does Not Refresh Between Function Calls

Something struck me as unexpected today while working in Python. I had a function to take a datetime object and convert it into epoch milliseconds: import datetime import time this_tz = ‘US/Eastern’ def get_epch_ms(dttm=datetime.datetime.now(pytz.timezone(this_tz))): # Returns milliseconds since epoch for datetime object passed. # If no argument is passed, uses *now* as time basis. # … Continue reading “Default Argument Value Does Not Refresh Between Function Calls”