Current course offerings

Data engineering: How to build industry-strength data lakes and processing platforms

Location: Online or in Stockholm Price: 22,000 SEK + VAT Duration: Five days

There is no course instance currently planned. If you are interested in taking the course, please fill out this form or send an email to info@scling.com.

Summary

This course teaches practical data engineering, how to build industry-strength data lakes and data processing platforms, and how to use them to build robust, scalable, and high-performing data processing applications.

The course focus is practical engineering, with real-life examples. We will cover architecture and development end-to-end, from data collection through batch and stream data processing, to exporting and serving data artifacts to users. In selected key areas, we will go down and cover implementation in detail. The course includes theoretical lectures as well as practical exercises that teach programming of scalable data processing frameworks. The contents are vendor neutral, but we will present a recommended selection of open source and public cloud components that serve as a starting point for a complete technology stack.

Whereas the first two-course instances were three full days, this instance is five days, but only six hours per day. Feedback from earlier instances indicates shorter days are preferred, and new material has been added to extend to five days.

Past instances of this course have received a net promoter score of 9.0/10 and 9.4/10. See the bottom of this page for more course evaluation metrics.

Target audience

The course targets professionals requiring a hands-on understanding of state-of-the-art data engineering practices, such as data engineers, backend engineers, data scientists, business insights developers, database admins, and managers of those roles. Participants are expected to have at least three years of technical work experience.

Prerequisites

Course participants are supposed to be proficient in either a major object-oriented or functional programming language, e.g. Java, Python, C++, Scala, or to be proficient in data modelling and SQL. For the practical exercises, it is recommended that participants work in pairs, ideally one person with developer experience and one person with data modelling experience.

Practical exercises will be done in Scala and Python. Participants who do not know Scala or Python in advance need it not fear, however. Advanced language features are not needed, so it is sufficient to go through tutorials in advance. Links will be provided.

Participants need to bring their own laptops and download the exercise source code a few days in advance. Links and instructions will be provided. The exercises depend on open source libraries, downloaded as part of the preparations.

Lecturer

Lars Albertsson Lars is founder of Scling, providing data-factory-as-a-service - a partnership solution for creating business value from data. He is a frequent conference speaker on big data technology and privacy protection. Before founding Scling, Lars has worked at Google, Spotify, and as an independent consultant, helping organisations create value with data processing technology. As independent consultant, his clients have ranged from startups to banks. LinkedIn profile at https://www.linkedin.com/in/larsalbertsson. Lars’ conference presentations are listed here.

Contents

The following topics will be covered in the lectures. Practical exercises will be interleaved with the lectures.

Other info

The course is held in English, unless there is unanimous agreement for holding it in Swedish. The course material is in English.

Consider registering in pairs, coupling developer experience and data-modelling experience.

The Maximum number of participants is 24, and the minimum for holding the course is eight.

Lunch and coffee are not included in the online edition, but breaks will be provided. :-)

Registration

Register your interest to attend the next course instance in this form or email info@scling.com.

Cancellation policy

Cancellations more than 14 days before the course are refunded in full. Cancellations 8-14 days in advance are refunded by 50%. Cancellations 7 days or less in advance are not refunded. A registration can be transferred to another participant without charge, however.

If the number of participants is too low, Scling retains the right to cancel any activity up until 14 days prior to its start. In case of emergencies with limited time for Scling to respond and make changes, or unexpected unavailability of the lecturer, e.g. due to illness, Scling retains the right to cancel closer to the event than 14 days. Scling is responsible for informing all participants registered for the activity as soon as possible. Scling is not liable for any costs accrued by the participants or their organisations in regard to cancelled events. Scling is also not liable for any costs when the activity is cancelled due to factors Scling cannot control, such as pandemics, strikes, blockades, lockouts, war, governmental regulation, police or legal court orders and other occurrences that are considered force majeure. Scling will refund the registration for a cancelled event.

Past courses

Data engineering: How to build industry-strength data lakes and processing platforms (online edition)

Dates: May 26-28, 2020
Location: Online

This three-day course teaches practical data engineering, how to build industry-strength data lakes and data processing platforms, and how to use them to build robust, scalable, and high-performing data processing applications.

Course evaluation results

Net promoter score - Would you recommend the course to colleagues?     9.4/10
Was it time well spent?     9.2/10
Did you learn things useful for your work?     9.4/10
Did you have fun?     9.0/10
How well did the video conferencing work?     8.1/10
How well did the other communication channels work?     7.8/10

Evaluation response rate: 8 out of 21 course participants.

Data engineering: How to build industry-strength data lakes and processing platforms

Dates: November 18 - 20, 2019
Location: Stockholm

This three-day course teaches practical data engineering, how to build industry-strength data lakes and data processing platforms, and how to use them to build robust, scalable, and high-performing data processing applications.

Course evaluation results

Net promoter score - Would you recommend the course to colleagues?     9.0/10
Was it time well spent?     8.8/10
Did you learn things useful for your work?     8.5/10
Did you have fun?     9.2/10
How well did the onsite practical arrangements work?     7.8/10

Evaluation response rate: 12 out of 22 course participants.