Courses.

Current course offerings

Data engineering: How to build industry-strength data lakes and processing platforms

Next offering planned in autumn, online or in Stockholm depending on the pandemic. Contact us (see bottom) if you want to participate.

 

Summary
This three day course teaches practical data engineering, how to build industry-strength data lakes and data processing platforms, and how to use them to build robust, scalable, and high-performing data processing applications.

The course focus is practical engineering, with real-life examples. We will cover architecture and development end-to-end, from data collection through batch and stream data processing, to exporting and serving data artifacts to users. In selected key areas, we will go down and cover implementation in detail. The course includes theoretical lectures as well as practical exercises that teach programming of scalable data processing frameworks. The contents are vendor neutral, but we will present a recommended selection of open source and public cloud components that serve as a starting point for a complete technology stack. 

The first instance of this course received a net promoter score of 9.0 out of 10. See the bottom of this page for more course evaluation metrics.

Target audience

The course targets professionals requiring a hands-on understanding of state-of-the-art data engineering practices, such as data engineers, backend engineers, data scientists, business insights developers, database admins, and managers of those roles. Participants are expected to have at least three years of technical work experience.

Prerequisites

Course participants are supposed to be proficient in either a major object-oriented or functional programming language, e.g. Java, Python, C++, Scala, or to be proficient in data modelling and SQL. For the practical exercises, it is recommended that participants work in pairs, ideally one person with developer experience and one person with data modelling experience.

 

Practical exercises will be done in Scala. Participants who do not know Scala in advance need not fear, however. Advanced language features are not needed, so it is sufficient to go through a tutorial in advance. Links will be provided.

 

Participants need to bring own laptops, and download the exercise source code a few days in advance. Links and instructions will be provided. The exercises depend on open source libraries, downloaded as part of the preparations.

 

Lecturer

 

Lars Albertsson

Lars is founder of Scling, providing data-value-as-a-service - a partnership solution for creating business value from data. He is a frequent conference speaker on big data technology and privacy protection. Before founding Scling, Lars has worked at Google, Spotify, and as an independent consultant, helping organisations create value with data processing technology. As independent consultant, his clients have ranged from startups to banks. LinkedIn profile at https://www.linkedin.com/in/larsalbertsson

 

Contents

 

The following topics will be covered in the lectures. Practical exercises will be interleaved with the lectures.

  • Overview and motivation. Why building a data platform, and how to use it.

  • Data collection. Gathering data into a data platform.

  • Batch processing. How to process data with scalable frameworks, such as MapReduce, Spark, Flink, etc.

  • Intro into serving and NoSQL. How to export data from a data platform, and how to serve data-driven applications.

  • Workflow orchestration. Connecting batch processing flows into robust pipelines.

  • Real-time processing. Data processing with scalable stream processing frameworks.

  • Deployment. Deploying batch processing applications in production.

  • DataOps and quality assurance. Testing, continuous deployment, error handling, and engineering data quality.

  • Lifecycle, evolution, schemas. How to evolve data pipelines over time without breaking applications.

  • Privacy by design. Architecting data processing in order to comply with privacy regulations.

 

Other info

 

The course is held in English, unless there is unanimous agreement for holding it in Swedish. The course material is in English.  

 

Consider registering in pairs, coupling developer experience and data-modelling experience.

Maximum number of participants is 24, and the minimum for holding the course is eight. 

Lunch and coffee is not included in the online edition, but breaks will be provided. :-)

Registration

Instructions will be published later.

Cancellation policy

 

Cancellations more than 14 days before the course are refunded in full. Cancellations 8-14 days in advance are refunded by 50%. Cancellations 7 days or less in advance are not refunded. A registration can be transferred to another participant without charge, however.

 

If the number of participants is too low, Scling retains the right to cancel any activity up until 14 days prior to its start. In case of emergencies with limited time for Scling to respond and make changes, or unexpected unavailability of the lecturer e.g. due to illness, Scling retains the right to cancel closer to the event than 14 days. Scling is responsible for informing all participants registered for the activity as soon as possible. Scling is not liable for any costs accrued by the participant or their organisations, with regards to cancelled events. Scling is also not liable for any costs when the activity is cancelled due to factors Scling cannot control, such as strikes, blockades, lockouts, war, governmental regulation, police or legal court orders and other occurrences that are considered force majeure. Scling will refund the registration for a cancelled event.

Past courses

Data engineering: How to build industry-strength data lakes and processing platforms (online edition)

May 26-28, 2020

Location: Stockholm

Price: 11,000 SEK (50% deduction due to first online instance)

This three day course teaches practical data engineering, how to build industry-strength data lakes and data processing platforms, and how to use them to build robust, scalable, and high-performing data processing applications.

Course evaluation results

Net promoter score - Would you recommend the course to colleagues?    9.4/10

Was it time well spent?    9.2/10

Did you learn things useful for your work?    9.2/10

Did you have fun?    9.4/10

How well did the video conferencing work?    8.4/10

How well did the other communication channels work?    8.0/10

Evaluation response rate: 5 out of 21 course participants. 

Data engineering: How to build industry-strength data lakes and processing platforms

November 18 - 20, 2019

Location: Stockholm

Price: 22,000 SEK

This three day course teaches practical data engineering, how to build industry-strength data lakes and data processing platforms, and how to use them to build robust, scalable, and high-performing data processing applications.

Course evaluation results

Net promoter score - Would you recommend the course to colleagues?    9.0/10

Was it time well spent?    8.8/10

Did you learn things useful for your work?    8.5/10

Did you have fun?    9.2/10

How well did the onsite practical arrangements work?    7.8/10

Evaluation response rate: 12 out of 22 course participants. 

 Address. c/o RISE, Isafjordsgatan 22, 164 29 Kista, Sweden

Tel. +46 70 7687109

This site was built with Wix, which sets cookies. We do not show an annoying banner,

but encourage everyone to install Privacy Badger or similar tools to limit third-party cookies.