Chocolate, ice cream and games are perhaps 3 of the most popular universally understood words that can bring joy to anyone between 5-60 years of age!
InnoGames is one of the world's leading developers and providers of online games and at InnoGames we not only have all three of those things but in addition we build up a powerful data infrastructure because it's expensive to run your business blind. And being able to evaluate key performance indicators fast to make good decisions and deliver personalized and relevant content to each and every gamer is essential to be successful and it is how a customer becomes a fan.
Our data infrastructure mainly consists of a data pipeline that covers the streaming part and a data platform to perform batch processing. The latter is based on the Hadoop ecosystem using technologies such as Hive, Spark, Hue, R and more to give our data scientists a high flexibility. There were several evolutions of the data pipeline, starting with Kestrel and custom streaming applications. Later on we switched the base technologies to Apache Kafka and Apache Storm. Last year we recreated our streaming infrastructure based on Apache Flink which is an open-source stream processing framework for distributed, high-performing, always-available, and accurate data streaming applications.
Because having fun is the best way to learn, after a quick introduction to Flink and the Flink ecosystem this talk will focus on real-world use cases and transports the idea of those projects to live examples. This way, the audience will be part of a Flink based experiment to internalize the experience we gained with Flink.