- Supports multiple cloud providers (AWS EKS, IBM IKS), managing around 7 terabytes of log ingestion per month across thousands of k8s nodes in 12 datacenters.
- Deploys software using Terraform, Helm and IBM Razee.
- Manages reliability and patching of MongoDB, Redis and Elasticsearch clusters running as large-scale StatefulSets in Kubernetes.
- Sponsored the adoption of the Linkerd service mesh (across multiple milestones) on Kubernetes to address endpoint security and, eventually, end-to-end observability concerns.
- Wrote an email deliverability performance collection tool for email vendor monitoring, tracking key SLIs for system performance.
- Expands the access tools in the internal Python CLI client (logdnactl) to give the SRE team better integration with backend systems.
- Regularly contributes to internal tooling that uses the k8s API and pymongo libraries to manage administrative operations across the product.
- Built a management dashboard for the support team, integrating Flask/Rebrow Redis blueprints into the app along with Python-eve (REST toolkit) for full-search MongoORM via REST, all behind python-authlib and OpenID Connect/Okta for RBAC.
- Rewrote the Ansible integration for the LogDNA logging library to deliver a variety of new features for our customers.
- Added functionality to the support dashboard for inspecting Elasticsearch field mappings to troubleshoot index limits and indicate growth needs for customers.
- Developed a proxy request tool for webhooks so support can easily debug webhook payloads.