I have 10+ years of industry experience, 4.5+ of them building solutions for distributed systems with Big Data and web technologies, including web development and web scraping.
Big Data tech stack:
• Spark Core, Spark Streaming, Kafka, Hadoop, Hive, Tez, MapReduce, Pig, Oozie, Flume, Sqoop, Presto
• Scala, Java, Python, ScalaTest, TestNG, Gradle, Shell scripting, AngularJS, IntelliJ IDEA, Struts, Django, Selenium, Linux
• Cassandra, ArangoDB, Druid, Elasticsearch, Kibana, Solr, Couchbase
My expertise is in developing large-scale distributed systems on scalable Big Data architectures that combine batch and real-time data processing, most notably the Lambda and Kappa architectures. I have successfully set up or scaled up Hadoop clusters at three notable companies. Currently, I am designing the next-generation customer segmentation and targeting framework.