Souvik Ghosh is a result oriented professional with over 3 years of Freelancing experience and 4 years of post-degree Corporate experience. Started his career in web development and gradually shifted to the Hadoop ecosystem. Co-founded and run a console game sharing startup for 1 year. Currently seeking exciting big data challenges where one can solve problems as well as learn new technologies. Additonally I have extensive experience in data visualization tools like Tableau and I am also looking forward to challenging problems to solve in the field of data visualization.
Souvik was excellent. Great communication. Delivered ahead of schedule. Was creative in his approach to solve my requirements. Was understanding when there were delays on my side. Very professional . I would highly recommend his services.
Thanks to Souvik I have an app (Executor) selling on the app store. Souvik is a very good developer, delivering what I asked in a timely manner.
I am hoping to use him for other projects now.
Highly recommended.
Project# : Loggers Analysis for site Users
Analysis of hourly site usage of Afrihost users from a set of Apache HTTP Server logs.. All the Legal procedures, complains are maintained and recorded in this system. Previously various downstream applications were implemented in Afrihost to fetch and amend the data.
Client African Hosting Giant
Technologies Map Reduce Algorithm, Hive , HDFS,PIG, Sqoop
Database MySQL
Duration May 2014 – till date
Role Tech Lead
Team Size 4
Responsibilities:
• Create different Hive schema
• Extract tables from database using SQOOP and load them into HDFS
• Create HIVE partition as required
• Join different table to generate different reports
• Implementation using Map Reduce
5月, 2014 - 現在
•
10 , 8
Specialist - Technology
8月, 2012 - 2月, 2013
•
6
123greetings.com
8月, 2012 - 2月, 2013
•
6
Project# : Overhaul of Reporting Section and Adhoc user engagement analysis
Overhauling of the reporting section for 123Stores to accommodate exponential growth in database size. Implementation of Hadoop ecosystem as well as user log data analysis for
users of 123greetings.
Technologies Map Reduce Algorithm, Hive , HDFS,PIG, Hbase, Oozie, Sqoop
Database MySQL
Duration Aug 2012 – Feb 2013
Role Senior Programmer
Team Size 5
Responsibilities:
• Developed MapReduce programs to parse the raw data, populate staging tables and store the refined data in partitioned tables in the EDW
• Created Hive queries that helped market analysts spot emerging trends by comparing fresh data with EDW reference tables and historical metrics.
• Enabled speedy reviews and first mover advantages by using Oozie to automate data loading into the Hadoop Distributed File System and PIG to pre-process the data.
• Shared responsibility for administration of Hadoop, Hive and Pig.
8月, 2012 - 2月, 2013
•
6
Software Engineer
10月, 2010 - 8月, 2012
•
1 , 10
Zensar
10月, 2010 - 8月, 2012
•
1 , 10
Project# : Device Fault Prediction
Cisco’s support team on a day-to-day basis deals with huge volumes of issues related to their network products like routers, switches etc. The support teams have been operating on a reactive model i.e. based on the customer tickets/queries being raised. Hence, to improve customer satisfaction, they would like the system to predict network faults based on the logs being generated by various network devices i.e. by loading them into Hadoop cluster and analyzing them using some of the machine learning algorithms implemented in Apache Mahout or custom built.
Client Cisco
Technologies Map Reduce Algorithm, Hive , HDFS
Database MySQL
Duration Jan 2012 – Aug 2012
Role Developer
Team Size 7
Responsibilities:
• Involved infrastructure Verification(File System, OS Compatibility Check, Java Version Verification)
• Analyzing the requirement to setup a cluster
• Created two different users (hduser for performing hdfs operations and mapred user for
performing Map Reduce operations only)
• Setting-up the Hadoop-cluster on 10 VMs
• Ensured NFS is configured for name node
• Setup Hive with MySQL as a Remote Metastore
• Moved all log files generated by various network devices into HDFS location
• Written Map Reduce code that will take input as log files and parse the logs and structure them in
tabular format to facilitate effective querying on the log data.
• Created External Hive Table on top of parsed data
Project# : Internal Ticketing System
An internal ticketing system for Cisco to manage, log, monitor all issues received by its tech support team.
Client Cisco
Technologies PHP, Mysql, Html, Css, Jquery, Javascript
Database MySQL
Duration Jan 2011 – Dec 2011
Role Developer
Team Size 15
Responsibilities:
• Involved in database schema design and architecture
• Ensured that Cisco specific coding standards are met including nomenclature
• Optimized Mysql queries to lower fetching times
• Responsible for Unit Testing and Integration Testing