Senior Cloud Platform Engineer with 9+ years of experience specializing in cloud infrastructure, container orchestration, and platform engineering. Proven track record of optimizing infrastructure costs (₹10L+ annual savings) while improving performance (10X compression ratios, 5K+ logs/sec throughput). Expert in Kubernetes (CKA/CKAD certified - 91% score), multi-cloud environments (AWS, GCP, Azure), and building self-service developer platforms. Led critical infrastructure migrations and automation projects reducing manual work by 38%. Published technical thought leadership on Medium.
Manual Work Reduced
MongoDB Automation PlatformAnnual Cost Savings
ClickHouse MigrationDevelopers Enabled
Project Blackbox PlatformCKA Score
Top 9th PercentileMongoDB | Dec 2023 - Present (2 years)
Platform Engineering Projects:
Atlas Infrastructure Operations:
NoBroker.com | Aug 2021 - Dec 2023 (2 years 5 months)
NoBroker.com | Jun 2020 - Jul 2021 (1 year 2 months)
NoBrokerHood - Society Management Platform:
Project Blackbox - Self-Service Staging Platform:
DXC Technology | Apr 2019 - Jun 2020 (1 year 3 months)
Cognizant Technology Solutions | Jun 2017 - Apr 2019 (1 year 11 months)
Cognizant Technology Solutions | Jun 2015 - Jun 2017 (2 years 1 month)
Tech Stack: Golang, RabbitMQ, MongoDB, Jira APIs, Kubernetes, Prometheus, Grafana, Splunk
Impact: 38% manual work reduction replacing 3 years of repetitive processes
Built Golang automation service with node-based execution graph using modular executor pattern. System fetches Jira tickets, compares Atlas cluster goal state vs current state, automatically applies fixes from playbooks or escalates to customers.
Tech Stack: Golang, MongoDB Changestreams, Temporal Framework, TTL
Impact: Eliminated race conditions, reduced alert processing time, simplified architecture
Eliminated RabbitMQ dependency by building single gateway service. Used MongoDB Changestreams for event-driven processing and Temporal framework for idempotent child workflows.
Tech Stack: Temporal, MongoDB, Prometheus, Sliding Window Algorithm
Impact: Proactive detection with confidence scoring, auto-incident creation
Built multi-source correlation engine using sliding window algorithm with 1-hour lookback. Correlates cluster signals with cloud provider health dashboards, produces confidence scores and auto-creates tracking incidents.
Tech Stack: ClickHouse, Fluent Bit, Redash, Grafana, LZ4HC
Impact: ₹10L+ annual savings, 10:1 compression, 5,000+ logs/sec throughput
Led Elasticsearch to ClickHouse migration. Researched ClickBench benchmarks, studied Zerodha/Cloudflare deployments, implemented zero-downtime migration. Published article on Medium reaching 1,000+ engineers.
Tech Stack: Docker Swarm, Jenkins (Groovy), Perl, Traefik, Portainer, Prometheus, Loki
Impact: 100+ developers enabled, <5min Git push to HTTPS deployment
Built complete self-service staging platform. Automated workflow: Git → Jenkins → Nexus → Portainer → Swarm → Traefik with Let's Encrypt SSL and dynamic subdomains. GitHub: vicknesh22/blackbox-swarm
Tech Stack: Kubernetes, Nginx, Automation Scripts
Impact: Scalable platform enabling application growth
Architected initial Kubernetes infrastructure for society management application. Built automation scripts, nginx configurations, and scalable environments for platform expansion.
Score: 91% (Top 9th Percentile)
The Linux Foundation
The Linux Foundation
Google Cloud
Amazon Web Services
Amazon Web Services
Pondicherry Engineering College | 2011 - 2015
Medium | December 2023 | 1,000+ readers
Detailed technical article on migrating from Elasticsearch to ClickHouse, achieving 10:1 compression and ₹10L+ cost savings. Covers architecture decisions, implementation challenges, and performance optimization.
Read Article →Medium (NoBroker Engineering) | September 2020 | 30+ engagement
Architecture guide for building self-service developer environments using Docker Swarm, Jenkins, and modern DevOps tools. Covers automation workflows and scalability patterns.
Read Article →