Skip to content

Latest commit

 

History

History

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

README.md

Cheat Sheets - Java and Scala

Table of Content (ToC)

Created by gh-md-toc

Overview

This cheat sheet explains how to install and to maintain a few tools pertaining to programming with Java and Scala, in particular for Spark-powered data processing.

References

Data Engineering helpers

SDKMan

  • If Java needs to be installed (e.g., on systems not packaging it natively), it is advised to install and use SDKMan
    • Once SDKMan has been installed, installing in parallel a specific version of Java becomes as easy as sdk install 11.0.21-amzn (here, for the Amazon-supported Corretto OpenJDK 11)
    • On MacOS, Java may simply be installed with HomeBrew: brew install openjdk

JAR packages on Maven Central

Specific JAR packages

Hadoop

Hive Metastore

PostgreSQL JDBC drivers

$ wget https://repo1.maven.org/maven2/org/postgresql/postgresql/42.6.0/postgresql-42.6.0.jar

Spark

Delta

$ wget https://repo1.maven.org/maven2/io/delta/delta-spark_2.12/3.0.0/delta-spark_2.12-3.0.0.jar
$ wget https://repo1.maven.org/maven2/io/delta/delta-standalone_2.12/3.0.0/delta-standalone_2.12-3.0.0.jar