churn-prediction/ ├── docker/ │ ├── docker-compose.yml # Full cluster definition │ └── hadoop.env # HDFS configuration ├── data/ │ ├── raw/ # Original Kaggle CSV │ ├── cleaned/ # After Spark cleaning ...