Here you can find the preprocessing I did on the Titanic dataset available on Kaggle. At the end of the notebook, I compare the accuracy obtained using the dirty dataset with the accuracy obtained using the cleaned one.
https://colab.research.google.com/drive/1WJyGFI1StCvXWNc54wjri6qmOWrHFuli?usp=sharing
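
For readers who want a feel for this kind of workflow before opening the notebook, here is a minimal sketch in plain Python. It is not the notebook's code: the rows are made-up toy values that only borrow the Kaggle column names (Age, Sex, Survived), and the median fill, the 0/1 encoding and the "predict survival for women" rule are all assumptions chosen for illustration.

```python
from statistics import median

# Toy rows using Kaggle Titanic column names; the values are made up for illustration.
rows = [
    {"Age": 22.0, "Sex": "male", "Survived": 0},
    {"Age": None, "Sex": "female", "Survived": 1},
    {"Age": 38.0, "Sex": "female", "Survived": 1},
    {"Age": None, "Sex": "male", "Survived": 0},
    {"Age": 30.0, "Sex": "male", "Survived": 0},
]


def clean(rows):
    """Return new rows with missing Age filled by the median and Sex encoded as 0/1."""
    known_ages = [r["Age"] for r in rows if r["Age"] is not None]
    fill = median(known_ages)
    return [
        {
            **r,
            "Age": r["Age"] if r["Age"] is not None else fill,
            "Sex": 1 if r["Sex"] == "female" else 0,
        }
        for r in rows
    ]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


cleaned = clean(rows)

# Hypothetical baseline rule, not a trained model: predict survival for women.
predictions = [r["Sex"] for r in cleaned]
score = accuracy([r["Survived"] for r in cleaned], predictions)
```

The same `accuracy` helper could be applied to predictions made from the dirty and the cleaned data to put the two results side by side, which is the kind of comparison the notebook ends with.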