Business Platform Team

Anant Corporation Blog: Our research, knowledge, thoughts, and recommendations about building and managing online business platforms.

Monthly Archives: December 2020


Running a databricks notebook against datastax astra

Running A Databricks Notebook Against DataStax Astra

In this blog, we will cover running a Databricks notebook against DataStax Astra. We will use Databricks community edition and use DataStax Astra’s free tier to show you how you can run a Databricks notebook against DataStax Astra without a credit card. Additionally, a YouTube video is embedded below if you want to watch a live demo of this process, so be sure to check it out.

Continue reading
Connect databricks and datastax astra

Connect Databricks and DataStax Astra

In this blog, we will cover how to connect Databricks and DataStax Astra. We will use Databricks community edition and use DataStax Astra’s free tier to show you how you can connect Databricks to DataStax Astra without a credit card. Additionally, a YouTube video is embedded below if you want to watch a live demo of this process, so be sure to check it out.

Continue reading
Data Engineer's Lunch #6: Common Data Formats Used in Data Engineering

Data Engineer’s Lunch #6: Common Data Formats Used in Data Engineering

In Data Engineer’s Lunch #6: Common Data Formats Used in Data Engineering, we discuss common data storage formats used in data engineering. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion, is also embedded below in case you were not able to attend live. If you would like to attend Data Engineer’s Lunch in person, it is hosted every Monday at 12 PM EST. Register here now!

Continue reading

Apache Spark Companion Technologies: Distributed Machine Learning Frameworks

One of Apache Spark’s main core features is Spark MLLib, a library for doing machine learning in Spark. Most data science education relies on specific machine learning libraries, like Sci-Kit Learn. Having data scientists retrain to use Spark MLLib can be an extra cost on top of the data engineering work that needs to be done in the first place, just to use Spark. Databricks offers distributed versions of some of these Machine Learning frameworks as part of the Databricks platform.

Continue reading

Apache Cassandra Lunch #32: Cassandra Data Operations – Common Ways to Move Data in Cassandra

In case you missed it, this blog post is a recap of Cassandra Lunch #30, covering the basics of Cassandra Data Operations. We discuss the various ways of moving data into and out of Cassandra clusters. The live recording of Cassandra Lunch, which includes a more in-depth discussion, is also embedded below in case you were not able to attend live. If you would like to attend Apache Cassandra Lunch live, it is hosted every Wednesday at 12 PM EST. Register here now!

Continue reading