• A Big Data Associate Architect & Developer with 11+ Yrs IT consulting, 3+ Yrs into Hadoop/Big data, Spark-Scala development, Hadoop and 8+ Yrs into SAP & Salesforce.com design and development experience. • Expertise in Hadoop eco-system like HDFS, Hive, Spark, Scala, SparkSQL, Map Reduce, HBase, Cassandra, Sqoop, Kafka, Implala, Oozie, exposure on Flume for Big Data Analytics in various domains includes Finance, Telecom and Manufacturing. • Expert knowledge of Hadoop Architecture including YARN, HDFS, Resource Manager, Node Manager, Name Node, Data Node and MR v1 & v2 concepts. • Experienced in importing and exporting data from sources like Machine data, MySQL, RDBMS into staging data in HDFS and Hive using SQOOP for further analysis. • Extensive experience in building hive tables in different format like parquet with compression and applied several hive optimization techniques in solving performance related issues. • Implemented batch processing solutions for structured/unstructured and large volume of data by using Spark, Scala and Hadoop MapReduce framework. • Hands on experience with complex mappings from varied transformation logics like Map, flatMap, groupBy, AggregateBy, ReduceBy, Distinct, Joins, Unconnected and Connected lookups, Router, Aggregator, Joiner, Update Strategy, Normalizer Transformation and re-usable transformations. • Good Hands on knowledge on developing Hadoop/Spark applications on AWS EC2 machines with cluster scalability and using S3 for storage and analytics. • Excellent understanding of object oriented concepts in Java with good programming knowledge and agile project development.
©