Hi, I have 2+ year of experience in writing map& reduce code and 2+ years of experience in java. I have completed several project. I have also developed some of data mining algorithm like Apriori algorithm. I mostly work on Linux environment. I have developed testing tool which compares two data file in hdfs location and generate a report about data and metadata difference between the two file using shell script, python and pig Script. I am also working on hive and pig.
I have experienced in Map reduce and hive QL. Writing Hive query and scheduling the job through oozie.
Recently, I and my friend has provided support for a client in Melbourne (Australia). We are working in MNC. Currently I am writing Hive query for delta processing (creating snapshot from historical and ongoing table, taking latest record set from changed table and removing the record which are present in delete table) and processing hive table and metadata using Hcatalog.