About me

 A technocrat with strong business acumen & technical expertise in Big data/Hadoop
 Possess over 3.5 years of experience in IT industry including 3 year experience in Big data/Hadoop with extensive focus on Hadoop ecosystem: Map Reduce coding, Hive, Impala, Pig, HBase, Sqoop, HDFS, Oozie and Apache Spark
 Adept in writing ETL Operation & Map Reduce using Twitter API Scalding in Scala
 Innate ability to provide complete Data Warehousing Solution such as Initial loading, incremental load, capturing CDC (change data capture)
 Expert in writing Hive query to create table on data ware house layer, query layer in Avro and Parquet format and provided access using Impala to make query faster
 Proven competency in the Installation of Hadoop Cluster
 Hands on experience in working with:
o Cloudera Distribution of Hadoop 5.3
o Spark, Spark SQL, MLlib, Spark Streaming using Kafka (POC)
o Linux environment
o Analytics and Reporting Tool, SAP BO

 Accredited for providing data warehouse solution and Hadoop ETL operation using Scalding (Twitter API)
 Successfully implemented:
o Data warehouse functionality using scalding (code written in Scala) and Hive as well in two different project
o SCD TYPE 2 while loading data from landing zone to query layer in HDFS
 Able to acquire new skills within short time & adapt to rapidly changing work practices
 Won Second Prize in Wipro PMP training

Currently No Recommendations
On the web
Perm, Freelance, Temp-Perm, Part time, Remote
Skill Level
Creativepool member since 22 July 2016