Data Engineering is the practice that enables data-driven decision-making by collecting, transforming, and publishing data. Data Engineers design, build, operationalize, secure, and monitor data processing systems. They focus on security and compliance, scalability and efficiency, reliability and fidelity, and flexibility and portability.

This training program has been developed for talented junior specialists who have basic knowledge of Java or Python and are eager to build a successful career in Data Engineering.


During the course, you will learn the characteristics of big data and its sources. We will introduce you to architectural requirements, the principles of big data infrastructures, and the intersection of cloud computing with big data. We will also provide an overview of the most popular big data technologies, such as core Hadoop, NoSQL databases, Apache Spark, and Apache Kafka.


The program consists of two stages: 

  • The first stage is a self-paced training course. You will study the theoretical materials and then check your knowledge through assigned tasks. Built-in instructions will help you navigate the program.
  • If you complete all training modules with a final score above 70% and your English test result is B2 or higher, we will invite you for an interview. Based on its outcome, you may continue your studies with a mentor at the second stage, EPAM Data Lab, which usually lasts three months.


Once you have studied all the materials, completed the assigned tasks, and passed the final assessment, you will be invited to a technical interview.

Do you have questions? Contact us