Try to preprocess the TITANIC dataset available on kaggle. Fixed a predictive model (for example SVM, k-NN, etc.) verify if there are significant variations in the prediction accuracy:
https://colab.research.google.com/drive/1_PG2cOMmT9IuhtG4_U2f67UsmM2QOaxO?usp=sharing