Data Engineering

  • Introduction to AWS and Cloud setup
  • Introduction to Hadoop and MapReduce Programming
  • Data Management and Relational Database Modelling
  • NoSQL Databases and Apache HBase
  • Data Warehousing
  • Data Ingestion and Apache Sqoop and Apache Flume
  • Hands on with Apache Hive
  • Amazon Redshift
  • Introduction to Apache Spark
  • Data Processing with PySpark and Flink
  • Real time data streaming with Apache Kafka
  • Real time data processing using Spark Streaming and Apache Flink
  • Building Automated Data Pipelines with Airflow and Apache Spark
  • Analytics using PySpark
Duration

90 hours, 2 hours daily thrice a week (3 months approx)

For More Details