I have 7 years of experience as a Big Data Engineer and Administrator, with expertise in ETL processes and in tools and platforms including Cloudera Data Science Workbench (CDSW), Neo4j, Unravel, Cloudera Data Platform (CDP), Cloudera Distributed Hadoop (CDH), Hortonworks Data Platform (HDP), Apache Airflow, and cloud services such as AWS and GCP. I have strong knowledge of Hadoop administration and a deep understanding of Hadoop ecosystem components such as HDFS, MapReduce, YARN, Pig, Sentry, Sqoop, Hive, HBase, Oozie, ZooKeeper, and Ranger. I am a self-motivated, responsible professional who works well both independently and in a team, with strong communication skills.
Big Data frameworks: Hadoop, HDFS, MapReduce, YARN, Hive, Pig, HBase, Sqoop, Ranger, Spark, PySpark, Impala, Sentry, Hue, Oozie
Data platforms: Cloudera Data Platform (CDP), Cloudera Distributed Hadoop (CDH), Hortonworks Data Platform (HDP)
Operating systems: Windows, macOS, Linux, Ubuntu
Scripting and programming: Ansible, Shell Scripting, Python
Project and version control tools: MS Office, MS Project, MS Visio, MS Visual Studio, PowerPoint, Git, GitHub, Bitbucket, GitLab, AWS CodeCommit
CI/CD and containerization tools: Jenkins, Docker
Cloud platforms: AWS, GCP
Scheduling and monitoring tools: Control-M, Autosys, Geneos, Grafana
Ticketing and workflow management tools: Jira, ServiceNow, Symphony SummitAI, Airflow
ETL processes