Extensive experience in data analytics and ETL migration projects to Google Cloud Platform (GCP) using tools like BigQuery, Cloud DataProc, Cloud Storage, and Composer. Proficient in data modeling concepts (Star and Snowflake schemas), SQL (Presto, Hive) and programming with Python and PySpark. Skilled in building robust airflow data pipelines using bash scripting on Unix/Linux systems and developing python Packages for ETL processes. Hands-on experience with Sqoop for transferring data between RDBMS, HDFS, and Hive, and working with file formats like Avro, ORC, and Parquet. Expertise in Spark-SQL, Pyspark for data transformations and Spark Streaming for real-time processing. Strong skills in data preparation, modeling, and visualization using Power BI and Tableau to create impactful dashboards and reports. Experienced in all phases of the SDLC, including analysis, design, development, testing, and deployment.