Apache Spark scales complex data analysis and machine learning applications across a computing cluster. The software has established itself as a standard tool for analyzing large amounts of data. Python programs can access the Spark engine through the PySpark API.
In the two-day online course "Big data analysis with PySpark", you will receive a thorough introduction to the Spark framework through many practical exercises. You will learn to develop production-ready, scalable Python applications based on Spark. The course also covers Spark SQL for working with tabular data, the Spark Streaming API, GraphX, and Spark ML for machine learning.
The online course will take place on September 14 and 15 via video conference. Participants should have a solid basic knowledge of Python. Throughout the course, you can exchange ideas with the speaker and the other participants at any time. If you book by August 21, you will receive a 10 percent early-bird discount.
Further information and registration: