I would like to share my company's experience running large Hadoop clusters on Docker, shipped to different cloud providers (AWS, Azure, Google Cloud, DigitalOcean). “Cloudbreak” is our cloud-agnostic, open-source Hadoop-as-a-Service API, supporting autoscaling, blueprint-based declarative clusters, and service discovery.