Dynamic and results-driven professional with over 20 years of experience in the software development life cycle (SDLC), specializing in the implementation and management of software applications. Expertise includes 11 years in Big Data platform administration, overseeing environments with more than 1,000 nodes and petabytes of data, along with proficiency in managing Azure cloud environments for HDInsight clusters and data lake flows. Proven track record in data warehouse projects utilizing Datastage, Oracle, Db2, and Teradata, complemented by hands-on experience in maintaining Hadoop ecosystems while implementing robust security measures. Skilled in establishing standards and processes across multiple clusters, ensuring high availability and efficient resource management within Hadoop environments.
Certifications
Expertise in big data platform engineering & SRE
Migrated the datalake platforms from a physicalized infra to complete virtualized environment and segregation of Compute and storage resources for an independent scaling
Proficient in streamlining processes through automations and minimizing the manual tasks
Experienced in cloud environment administration and migrating the platforms from onprem to Cloud
Handled the platforms security remediation programmes for zero-day vulernabilties , CVE's and it's patches
Skilled in managing Cloudera and Hortonworks environments
Hadoop Eco System: HDFS , Atlas, Kerberos, Knox, Ranger,MapReduce2,YARN,Tez, Hive, HBase, Sqoop, Oozie, ZooKeeper, Falcon, Kafka, Spark, Ni-Fi & Solr
Big Data Distributions: Cloudera and Horton works (HDP) Apache distribution
SQL query engine : Presto
Hadoop Replication ; WANdisco, Cloudera Replication Manager, ISILON SynqIQ
Databases: Db2 9x,10x, Oracle 11G , Teradata 1510
Platforms : Windows, Redhat, Suse Linux IBM AIX
ETL Tool : IBM InfoSphere Information Server 85
Replication Tool : IBM Change Data Capture 113x
Reporting Tools : MictroStrategy 103, Tableau 9x
Job Scheduler : BMC ControlM 81, Apache Airflow
Query Tools: AQT v9, Toad, SQL Developer, SQuirreL, Teradata Studio, DB Visualizer