Over 12 years of operations and site reliability engineering (SRE) experience, including 7 years working with cloud services (AWS/Aliyun) and production Kubernetes environments. Demonstrated expertise in building system observability, managing incident response processes, and optimizing platform stability. Strong background in Kubernetes cluster management.
AWS
Kubernetes
Prometheus/VictoriaMetrics
Hobbies: cycling, robotics, watching football and F1 races