Engineering Director specializing in SRE, Data Engineering, and MLOps. I build reliable data platforms at scale, lead high-performing teams, and optimize cloud costs. Former Databricks Solutions Architect. Open source contributor.
Pinned Loading
-
spark-bestfit
spark-bestfit PublicEfficiently fit ~90 scipy.stats distributions to your data using Spark's parallel processing with optimized Pandas UDFs and broadcast variables.
-
spark-pipeline-framework
spark-pipeline-framework PublicA configuration-driven framework for building Spark pipelines with HOCON config files and PureConfig.
Scala 2
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.





