Skip to main content

Move a petabyte Hadoop cluster to K8s, Data event online/Warsaw

Hey!

We recently worked on a massive migration project: migrating a
petabyte-scale Hadoop cluster to Kubernetes, which is fully open-source and
built for scale.

This wasn't just a lift-and-shift. It involved running Spark, Trino,
Airflow, JupyterHub, Superset, and HDFS (yes, still!) across a K8s-based
platform with full automation via ArgoCD. It had to support multiple teams,
secure access, and be easy to maintain.

We did this for Play, Poland's largest telco, and shared the story at last
year's Data & AI Warsaw Tech Summit. The full write-up is now live:
👉
https://getindata.com/blog/play-case-migrating-hadoop-cluster-kubernetes-open-source/

If you're exploring similar challenges or just curious about how we handled
orchestration, resource management, or multi-tenant deployments, you'll
probably find some takeaways.

Also—this year's Data & AI Summit is around the corner. If you want to hear
more like this (or chat with us), we'll be there:
📅 April 8–10, 2025
📍 Warsaw + Online

🎟️ Use promo code Getindata20 for 20% off your ticket at dataiwarsaw.tech!

If I got your atteniont, 2 more topics.

DATA PILL - this is a weekly newsletter with the best articles, tutorials
about data, ML and AI selected by the data community. Only content
recommended by our community members.

Subscribe: https://datapill.tech/

AI MONITOR - a global market study examining how organizations are
approaching data and AI. It takes 5 minutes, and you can contribute to the
data world. It's annonymous and participants will receive the full report
featuring key trends, tooling, and strategic insights. Also there is a
lottery with nice gifts (LEGO etc)

Take the survey: https://forms.office.com/e/JrxLRiRbSE

Hope to see you there,
[image: avatar]
[image: Getindata]
Sylwia Kołpuć
Senior Marketing Specialist
E: sylwia.kolpuc@getindata.com
A: Puławska 39/20, Warsaw

Website <http://www.getindata.com/>Blog <https://getindata.com/blog/>
LinkedIn <https://www.linkedin.com/company/getindata/>

Comments