Welcome!
So "What are Zeppelin Notebooks?"
In this video we'll be looking at Zeppelin
Notebooks, one of the open-source, web-based
tools in Data Scientist Workbench.
Zeppelin Notebooks are multi-purpose notebooks
that can handle all your analytics needs:
From Data ingestion, data discovery, data
analytics, to data Visualization and collaboration.
The Zeppelin interpreter concept allows any
language or data-processing-backend to be
plugged into Zeppelin.
Currently Apache Zeppelin supports many interpreters
such as Apache Spark, Python, JDBC, Markdown
and Shell.
Apache Zeppelin, in particular, provides built-in "Apache
Spark"integration.
You don't need to build a separate module,
plugin or library for it.
Apache Zeppelin with Spark integration provides
a number of great features, including.
Automatic SparkContext and SQLContext injection
Runtime jar dependency loading from local
filesystem or maven repository, as well as
Canceling job and progress display
For further information about Apache Spark
in Apache Zeppelin, take a look at the Apache
Spark in Zeppelin Notebooks video.
For data visualization, here are some basic
charts already included in Apache Zeppelin.
Visualizations are not limited to SparkSQL
query in fact, any output from any language
backend can be recognized and visualized.
For Pivot charts, Apache Zeppelin aggregates
values and displays them in a pivot chart
with simple drag and drop.
You can easily create charts with multiple
aggregated values including sum, count, average,
minimum, and maximum.
For Dynamic forms, Apache Zeppelin can dynamically
create some input forms in your notebook.
Apache Zeppelin is Apache2 Licensed software.
Zeppelin Notebooks are 100% open source.
So please check out the source repository and how
to contribute.
In fact, Apache Zeppelin has a very active
development community.
Please feel free to join our Mailing list and
report issues, if you'd like, on Jira
Issue tracker.
This brings us to the end of this video.
Thanks for watching!
1
