Kaushik Prakash Vir Sharma

Porto, Portugal

Summary

Accomplished professional with 17 years of IT experience and
expertise in Data Architecture & Engineering, ETL & Data Pipeline
Development, and Data Governance & Security. Proficient in Cloud
& Big Data Technologies, Database & Data Modeling, and
Technical Leadership. Demonstrated success in leveraging Big
Data & Cloud Platforms to drive data processing and engineering
initiatives. Skilled in managing Databases & Data Warehouses,
Infrastructure & Automation, and BI & Analytics Tools to deliver
robust Data Solutions.

Overview

17 years of professional experience
1 Certification

Work History

Principal Data Engineer / Data Architect

Jumia
Porto, Portugal
10.2018 - 03.2025
  • Architected and developed scalable, high-performance data pipelines to handle terabytes of e-commerce data, improving system efficiency by 30%
  • Designed and optimized data models for structured and unstructured data using Iceberg & Parquet, enabling efficient storage and retrieval
  • Implemented AWS-based Data Lake migration from Hadoop, reducing operational costs by 40%
  • Built real-time analytics pipelines using Kafka + Spark Streaming, enhancing seller performance insights
  • Led Data Governance & Security Strategy, ensuring compliance with GDPR & data privacy policies
  • Migrated ML Pipelines from Dataiku to AWS SageMaker, cutting model deployment time by 50%
  • Developed complex data models to support business decision-making

Senior Big Data Consultant

CenturyLink
India
05.2017 - 08.2018
  • Led a 20-member team managing 8 enterprise data engineering projects in banking, finance, and healthcare
  • Designed scalable ETL workflows using Apache Airflow & AWS Glue, reducing processing time by 25%
  • Integrated data sources from multiple vendors, ensuring seamless ingestion and processing

Big Data Administrator

HCL Technologies
Noida, India
03.2014 - 05.2017
  • Designed & implemented a Hadoop-based data lake for Deutsche Bank, enabling advanced analytics
  • Developed real-time data ingestion pipelines using NiFi and Kafka




Unix & Storage Administrator

Fujitsu Consulting
India
09.2008 - 03.2012
  • Led data infrastructure projects for PricewaterhouseCoopers and Harvard University, ensuring high availability and security
  • Optimized storage and Unix-based data processing systems, improving data access speeds by 30%


Education

B.E. - Electronics & Communication

Rajiv Gandhi University

Skills

  • Data Architecture & Engineering
  • ETL & Data Pipeline Development
  • Data Governance & Security
  • Cloud & Big Data Technologies
  • Database & Data Modeling
  • Technical Leadership
  • Big Data & Cloud Platforms
  • Data Processing & Engineering
  • Databases & Data Warehouses
  • Infrastructure & Automation
  • BI & Analytics Tools

Languages

English
Proficient (C2)
Hindi
Proficient (C2)
Portuguese
Intermediate

Certification

  • AWS Certified Solutions Architect – Associate
  • Microsoft Certified Azure Data Engineer

Tools & Technologies

  • AWS (S3, Glue, EMR, Lambda, Step Functions, Kinesis, Athena, Redshift)
  • Apache Spark, Apache Kafka, Apache Iceberg, Apache Airflow, NiFi, PostgreSQL, MySQL, Hive, Druid, Terraform, Ansible, Kubernetes, Docker, CI/CD (Jenkins, GitLab), Apache Atlas, Ranger, Power BI, Tableau, Apache Superset
  • Azure (ADLS Gen2, Synapse Analytics, ADF, Azure Databricks)
  • Snowflake, dbt
  • GDPR, HIPAA compliance

Personal Information

Timeline

Principal Data Engineer / Data Architect

Jumia
10.2018 - 03.2025

Senior Big Data Consultant

CenturyLink
05.2017 - 08.2018

Big Data Administrator

HCL Technologies
03.2014 - 05.2017

Unix & Storage Administrator

Fujitsu Consulting
09.2008 - 03.2012

B.E. - Electronics & Communication

Rajiv Gandhi University