dwsmith1983/README.md

About Me

Engineering Director specializing in SRE, Data Engineering, and MLOps. I build reliable data platforms at scale, lead high-performing teams, and optimize cloud costs. Former Databricks Solutions Architect and open-source contributor.

  • Currently leading SRE for Data & Analytics at Techcombank
  • Based in Ha Noi, Viet Nam
  • LinkedIn
  • Website

Tech Stack

Languages: Python, Scala, SQL

Data Engineering: Apache Spark, Databricks, Delta Lake, Kafka, Airflow

Cloud & Infrastructure: AWS, GCP, Docker, Kubernetes

SRE & Observability: Prometheus, Grafana, CloudWatch

MLOps: MLflow, TensorFlow, TFX

Certifications

Databricks, GCP

Pinned

  1. spark-bestfit (Public)

    Efficiently fit ~90 scipy.stats distributions to your data using Spark's parallel processing with optimized Pandas UDFs and broadcast variables.

    Python 1 2

  2. spark-pipeline-framework (Public)

    A configuration-driven framework for building Spark pipelines with HOCON config files and PureConfig.

    Scala 2