


Make sure that you have Java installed. If you follow the steps, you should be able to install PySpark without any problems. My machine runs Ubuntu 18.04, and I am using Java 8 along with Anaconda3. So this is just a small effort of mine to put everything together.

I went through a lot of Medium articles and Stack Overflow answers, but no single answer or post solved my problems.

Source: Basic set-up for distributed machine learning

After struggling for a few hours, I finally installed Java 8 and Spark and configured all the environment variables.
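Before going further, it can help to check what your shell already knows about Java and Spark. The snippet below is only a rough sketch of such a check: the function name `check_prerequisites` is my own, and I am assuming the conventional variable names `JAVA_HOME` and `SPARK_HOME`, which your setup may or may not use.

```python
import os
import shutil


def check_prerequisites():
    """Collect what the current environment exposes for Java and Spark.

    JAVA_HOME and SPARK_HOME are assumed, conventional variable names;
    adjust them if your setup uses something else.
    """
    return {
        # Full path to the `java` executable, or None if it is not on PATH.
        "java_on_path": shutil.which("java"),
        # Environment variables, or None if they are not set.
        "JAVA_HOME": os.environ.get("JAVA_HOME"),
        "SPARK_HOME": os.environ.get("SPARK_HOME"),
    }


if __name__ == "__main__":
    for name, value in check_prerequisites().items():
        print(f"{name}: {value if value else 'not found / not set'}")
```

If any entry comes back as "not found / not set", that is probably the piece to fix first. Run it from a fresh terminal so that recent changes to your shell profile are picked up.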
