CentOSLinuxTutorials

How To Install Apache Zeppelin on CentOS 7

Install Apache Zeppelin on CentOS 7

In this tutorial, we will show you how to install Apache Zeppelin on CentOS 7. For those of you who didn’t know, Apache Zeppelin is an online open-source laptop and collaborative application for interactive data ingestion, discovery, analytics, and visualization. Zeppelin supports 20+ languages, including Apache Spark, SQL, R, Elasticsearch, and many more. Apache Zeppelin allows you to make beautiful data-driven documents and see the results of your analytics.

This article assumes you have at least basic knowledge of Linux, know how to use the shell, and most importantly, you host your site on your own VPS. The installation is quite simple and assumes you are running in the root account, if not you may need to add ‘sudo‘ to the commands to get root privileges. I will show you the step-by-step install Apache Zeppelin on CentOS 7 server.

Prerequisites

  • A server running one of the following operating systems: CentOS 7.
  • It’s recommended that you use a fresh OS install to prevent any potential issues.
  • SSH access to the server (or just open Terminal if you’re on a desktop).
  • A non-root sudo user or access to the root user. We recommend acting as a non-root sudo user, however, as you can harm your system if you’re not careful when acting as the root.

Install Apache Zeppelin on CentOS 7

Step 1. First, let’s start by ensuring your system is up-to-date.

yum clean all
yum -y update

Step 2. Installing Java.

At the time of writing this tutorial, the latest Java JDK version was JDK 8u45. First, let us download the latest Java SE Development Kit 8 release from its official download page or use the following commands to download from a shell:

cd /opt/
wget --no-cookies --no-check-certificate --header "Cookie: gpw_e24=http%3A%2F%2Fwww.oracle.com%2F; oraclelicense=accept-securebackup-cookie" "http://download.oracle.com/otn-pub/java/jdk/8u45-b14/jdk-8u45-linux-x64.tar.gz"
tar xzf jdk-8u45-linux-x64.tar.gz

After extracting the archive file use the alternatives command to install it. alternatives command is available in chkconfig the package:

cd /opt/jdk1.8.0_45/
alternatives --install /usr/bin/java java /opt/jdk1.8.0_45/bin/java 2
alternatives --config java
There are 3 programs which provide 'java'.

  Selection    Command
-----------------------------------------------
*  1           /opt/jdk1.7.0_71/bin/java
 + 2           /opt/jdk1.8.0_25/bin/java
   3           /opt/jdk1.8.0_45/bin/java

Enter to keep the current selection[+], or type selection number: 3

At this point, JAVA 8 (JDK 8u45) has been successfully installed on your system. We also recommend to setup java and jar commands paths using alternatives:

alternatives --install /usr/bin/jar jar /opt/jdk1.8.0_45/bin/jar 2
alternatives --install /usr/bin/javac javac /opt/jdk1.8.0_45/bin/javac 2
alternatives --set jar /opt/jdk1.8.0_45/bin/jar
alternatives --set javac /opt/jdk1.8.0_45/bin/javac

Checking the Installed java version:

root@idroot.us ~# java -version
java version "1.8.0_45"
Java(TM) SE Runtime Environment (build 1.8.0_45-b14)
Java HotSpot(TM) 64-Bit Server VM (build 25.45-b02, mixed mode)

We can easily set the environment variables using the export command as shown below:

Setup JAVA_HOME Variable:

export JAVA_HOME=/opt/jdk1.8.0_45

Setup JRE_HOME Variable:

export JRE_HOME=/opt/jdk1.8.0_45/jre

Setup PATH Variable:

export PATH=$PATH:/opt/jdk1.8.0_45/bin:/opt/jdk1.8.0_45/jre/bin

Step 3. Installing Zeppelin.

First, download the Zeppelin binary on your system. You can always find the latest version of the application on the Zeppelin download page:

wget http://www-us.apache.org/dist/zeppelin/zeppelin-0.7.3/zeppelin-0.10.1-bin-all.tgz
tar xf zeppelin-*-bin-all.tgz -C /opt

Rename the directory for sake of convenience:

mv /opt/zeppelin-*-bin-all /opt/zeppelin

Step 4. Configure Systemd service for Apache Zeppelin.

We will set up a Systemd unit file for the Zeppelin application:

adduser -d /opt/zeppelin -s /sbin/nologin zeppelin

Provide ownership of the files to the newly created Zeppelin user:

chown -R zeppelin:zeppelin /opt/zeppelin

Next, Create a new Systemd service unit file:

nano /etc/systemd/system/zeppelin.service
[Unit]
Description=Zeppelin service
After=syslog.target network.target

[Service]
Type=forking
ExecStart=/opt/zeppelin/bin/zeppelin-daemon.sh start
ExecStop=/opt/zeppelin/bin/zeppelin-daemon.sh stop
ExecReload=/opt/zeppelin/bin/zeppelin-daemon.sh reload
User=zeppelin
Group=zeppelin
Restart=always

[Install]
WantedBy=multi-user.target

Then, start the application:

systemctl start zeppelin
systemctl enable zeppelin

Step 5. Configure Reverse Proxy Nginx.

By default, the Zeppelin server listens to localhost on port 8080. In this tutorial, we will use Nginx as a reverse proxy so that the application can be accessed via standard HTTP and HTTPS ports:

yum install certbot
yum install nginx

Start Nginx and enable it to automatically start at boot time:

sudo systemctl start nginx
sudo systemctl enable nginx

Next, Generate the SSL certificates:

certbot certonly --webroot -w /usr/share/nginx/html -d zeppelin.idroot.us

The generated certificates are likely to be stored in /etc/letsencrypt/live/zeppelin.idroot.us/. The SSL certificate will be stored as fullchain.pem and private key will be stored as privkey.pem.

Set up auto-renewal of the certificates Let’s Encrypt using cron jobs:

sudo crontab -e
30 5 * * * /usr/bin/certbot renew --quiet

The next steps create a new server block file for the Zeppelin site:

nano /etc/nginx/conf.d/zeppelin.idroot.us.conf
upstream zeppelin {
server 127.0.0.1:8080;
}
server {
 listen 80;
 server_name zeppelin.idroot.us;
 return 301 https://$host$request_uri;
}

server {
 listen 443;
 server_name zeppelin.idroot.us;

ssl_certificate /etc/letsencrypt/live/zeppelin.idroot.us/fullchain.pem;
 ssl_certificate_key /etc/letsencrypt/live/zeppelin.idroot.us/privkey.pem;

ssl on;
 ssl_session_cache builtin:1000 shared:SSL:10m;
 ssl_protocols TLSv1 TLSv1.1 TLSv1.2;
 ssl_ciphers HIGH:!aNULL:!eNULL:!EXPORT:!CAMELLIA:!DES:!MD5:!PSK:!RC4;
 ssl_prefer_server_ciphers on;

access_log /var/log/nginx/zeppelin.access.log;

location / {
 proxy_pass http://zeppelin;
 proxy_set_header X-Real-IP $remote_addr;
 proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
 proxy_set_header Host $http_host;
 proxy_set_header X-NginX-Proxy true;
 proxy_redirect off;
 }
location /ws {
 proxy_pass http://zeppelin/ws;
 proxy_http_version 1.1;
 proxy_set_header Upgrade websocket;
 proxy_set_header Connection upgrade;
 proxy_read_timeout 86400;
 }
 }

Restart Nginx so that the changes can take effect:

systemctl restart nginx

Step 6. Accessing Apache Zeppelin.

Apache Zeppelin will be available on HTTP port 80 by default. Open your favorite browser and navigate to https://zeppelin.idroot.us and complete the required steps to finish the installation. If you are using a firewall, please open port 80 to enable access to the control panel.

Congratulations! You have successfully installed Apache Zeppelin. Thanks for using this tutorial for installing Apache Zeppelin on CentOS 7 systems. For additional help or useful information, we recommend you check the official Apache Zeppelin website.

VPS Manage Service Offer
If you don’t have time to do all of this stuff, or if this is not your area of expertise, we offer a service to do “VPS Manage Service Offer”, starting from $10 (Paypal payment). Please contact us to get the best deal!

r00t

r00t is a seasoned Linux system administrator with a wealth of experience in the field. Known for his contributions to idroot.us, r00t has authored numerous tutorials and guides, helping users navigate the complexities of Linux systems. His expertise spans across various Linux distributions, including Ubuntu, CentOS, and Debian. r00t's work is characterized by his ability to simplify complex concepts, making Linux more accessible to users of all skill levels. His dedication to the Linux community and his commitment to sharing knowledge makes him a respected figure in the field.
Back to top button