Blog: Connect Jupyter Notebook to AWS Glue Endpoint

Posted on Thu 12 November 2020 in blogs • Tagged with devops

If I am not wrong, then almost everyone in data engineering industry have heard of Apache Spark and if not (highly unlikely) then you are just one google search away for ample number resources. This post will revolve around Spark, AWS Glue, notebook and binding these tools for optimal results …


Continue reading

Blog: AWS GluePyspark Locally

Posted on Sat 16 May 2020 in blogs • Tagged with devops

Download and install maven

  1. Download maven from https://aws-glue-etl-artifacts.s3.amazonaws.com/glue-common/apache-maven-3.6.0-bin.tar.gz
  2. untar the content to respective folder For example, mv apache-maven-3.6.0 {HOME}/Documents/opt/apache-maven
  3. Add mvn to your path

    bash echo 'export PATH=$PATH:/Users/bhavintandel/Documents/opt/apache-maven/bin' …


Continue reading