Summary
Overview
Work History
Education
Skills
Certification
Hobbies
Timeline
Generic
WANG YAN

WANG YAN

Staff SRE
Singapore

Summary

Over 12 years of operations and site reliability engineering (SRE) experience, including 7 years working with cloud services (AWS/Aliyun) and production Kubernetes environments. Demonstrated expertise in building system observability, managing incident response processes, and optimizing platform stability. Strong background in Kubernetes cluster management.

Overview

12
12
years of professional experience
4
4
years of post-secondary education
2
2
Certifications

Work History

Staff SRE

OKX
01.2024 - Current
  • Built and maintained a full-stack Kubernetes monitoring and alerting platform, integrated with incident response management (IRM)
  • Led Kubernetes cluster upgrades from 1.22 to 1.30 across ACK and EKS
  • Achieved average node provisioning times under 60 seconds during scaling
  • Designed hybrid node management using MNG and Karpenter for stability and elasticity

Senior SRE

Bluehelix
01.2018 - 01.2023
  • Managed the operations and maintenance of multiple company accounts and environments, including WAF, RDS, ELB, Kubernetes clusters, and the monitoring and logging systems. Collaborated with the app team to develop a CDN route monitoring and automatic switching service.
  • During the company’s transformation into a SaaS exchange, re-architected system ingress, caching databases, and logging/monitoring components with a sharded design to meet new business requirements. Enabled the infrastructure to achieve essentially unlimited horizontal scalability.
  • Implemented a Kubernetes security framework covering base image hardening, API access controls, container network policy enforcement, and secret encryption.

Senior SRE

Yiren Digital Ltd.
01.2018 - 10.2018
  • mainly responsible for the construction and operation of the company's containerized platform. Through the customized development of Kubernetes istio, combined with the company's existing publishing system and CMDB, creates a new generation of operation and maintenance platform.

Operations Engineer

Jingdong Finance
07.2013 - 10.2015
  • In Jingdong Finance (formerly: Online Banking Online Technology Co., Ltd.) engaged in application operation and maintenance engineers, mainly responsible for the operation and maintenance of financial products, in the early stage of the group's business development, to ensure the rapid iteration of the new system, rapid capacity expansion, and system stability.

Education

B.S. - Software Engineering

Harbin University of Science And Technology
01.2009 - 01.2013

Skills

AWS

Kubernetes

Prometheus/VictoriaMetrics

undefined

Certification

AWS Certified Solutions Architect - Professional

Hobbies

Hobbies: cycling, robotics, watching football and F1 races

Timeline

Staff SRE

OKX
01.2024 - Current

Senior SRE

Yiren Digital Ltd.
01.2018 - 10.2018

Senior SRE

Bluehelix
01.2018 - 01.2023

Operations Engineer

Jingdong Finance
07.2013 - 10.2015

B.S. - Software Engineering

Harbin University of Science And Technology
01.2009 - 01.2013
WANG YANStaff SRE