In this blog, we will cover 3 different open source BI tools that you can use to do SQL and reporting on Apache Cassandra. This blog is part 4 and wraps up our series on “Doing SQL and Reporting on Apache Cassandra with Open Source Tools”. Parts 1-3 will be linked at the end of the blog if you want to check them out as well. Also, a webinar will be embedded below if you want to watch a video version of this blog; as well as, a live demo using one of the tools we discuss!
In this blog, we will introduce 3 different open source BI tools that we can use to do SQL and report on Apache Cassandra. The tools we will be discussing are:
Metabase is an open source tool designed to be the simplest and fastest way to get business intelligence and analytics to everyone in your company.
Metabase also allows users to hit their Query API directly from Javascript to integrate the simple analytics they provide with your own application or third-party services to do things like:
Metabase does provide a paid option where you can host on their cloud, but the self-hosted option is free and open source. Pricing can be seen here. In the video embedded below, there is a live demo where we quickly spin up Metabase on Docker and connect to Cassandra via Presto. If you are not familiar with Presto, you can check out Part 1 of the “Doing SQL and Reporting on Apache Cassandra with Open Source Tools” series here (also linked below in a list with parts 1-3).
Redash is an open source tool designed to enable anyone, regardless of the level of technical sophistication, to harness the power of data big and small. SQL users leverage Redash to explore, query, visualize, and share data from any data sources. Their work in turn enables anybody in their organization to use the data.
Like Metabase, Redash also offers a paid version where you can host using Redash itself; however, like Metabase, it is free to host on your own. If you want to host on your own, you can check out this page, where you quickly set up an instance of Redash using AWS EC2 AMI, DigitalOcean, Google Compute Engine Image, and Docker.
Apache Superset is an open source modern data exploration and visualization platform.
Unlike Metabase and Redash, there is no paid service / hosting for Apache Superset, so you will need to figure out where to host it. Superset can be set-up using Docker, and more on getting started with Superset can be found here. A word of notice: Superset is not officially supported on Windows, unfortunately. The best option for Windows users to try out Superset locally is to install an Ubuntu Desktop VM via VirtualBox and proceed with the Docker on Linux instructions inside of that VM.
And that wraps up our series on “Doing SQL and Reporting on Apache Cassandra with Open Source Tools” with open source BI tools and reporting on Apache Cassandra. As mentioned above, a video form of this blog is embedded below, which includes a live demo of using Metabase with Presto to connect to Cassandra. Also, parts 1-3 of this series are also linked below!
Cassandra.Link is a knowledge base that we created for all things Apache Cassandra. Our goal with Cassandra.Link was to not only fill the gap of Planet Cassandra, but to bring the Cassandra community together. Feel free to reach out if you wish to collaborate with us on this project in any capacity.
We are a technology company that specializes in building business platforms. If you have any questions about the tools discussed in this post or about any of our services, feel free to send us an email!