Data Engineering is the practice that enables data-driven decision-making by collecting, transforming, and publishing data. Data Engineers design, build, operationalize, secure, and monitor data processing systems. They focus on security and compliance, scalability and efficiency, reliability and fidelity, and flexibility and portability.

This training program has been developed for talented Juniors with knowledge of Java or Python.


During the course, you will learn the characteristics of big data and its sources. We will introduce you to architectural requirements, the principles of big data infrastructures, and the intersection of cloud computing with big data. We will also provide an overview of the most popular big data technologies, such as core Hadoop, NoSQL databases, Apache Spark, and Apache Kafka.


The program consists of a self-paced training course and final testing:

  • During the self-paced part of the program, you will study theoretical materials and test your knowledge with assigned tasks. The program's built-in instructions will help you navigate and complete it.
  • Then you will move on to the final testing. The success criteria are a score of 70% or higher and an English level of B2 or higher.

After completing this program, successful candidates will have an opportunity to continue their path within EPAM according to their level, skills, and the opportunities available for future juniors.

Do you have questions? Contact us