Business Platform Team

Anant Corporation Blog: Our research, knowledge, thoughts, and recommendations about building and managing online business platforms.

Monthly Archives: March 2021


Cover slide for the NoSQL part 3: data store types webinar

Data Engineer’s Lunch #17: NoSQL Part 3: Data Store Types

In Data Engineer’s Lunch #17: NoSQL Part 3: Data Store Types, we discussed the four different types of data stores that underlie NoSQL databases. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion, is also embedded below in case you were not able to attend live. If you would like to attend a Data Engineer’s Lunch live, it is hosted every Monday at noon EST. Register here now!

Continue reading

Alpakka Cassandra and Twitter: Using Alpakka to read and write tweets from/to Cassandra

Alpakka is an open-source project designed to implement stream-aware and reactive integration pipelines for Java/Scala which is built on top of Akka Streams. This blog talks specifically about using Alpakka Cassandra and Akka Streams together with Twitter4S (Twitter client written in Scala) to pull new Tweets from Twitter for a given hashtag (or set of hashtags) using Twitter API v1.1 and write them into a local Cassandra database.

Continue reading

Apache Cassandra Lunch #42: SSTable Files with SSTableloader

In case you missed it, this blog post is a recap of Cassandra Lunch #42, covering SSTable files. It also covers their relation to SSTableLoader. We also walk through an example using SSTableloader to load data taken from a cluster to a new, empty cluster. The live recording of Cassandra Lunch, which includes a more in-depth discussion, is also embedded below in case you were not able to attend live. If you would like to attend Apache Cassandra Lunch live, it is hosted every Wednesday at 12 PM EST. Register here now!

Continue reading
Introduction to awk for data engineering

Data Engineer’s Lunch #16: Introduction to awk for Data Engineering

In Data Engineer’s Lunch #16: Introduction to awk for Data Engineering, we introduce awk, a domain-specific language and text processor, and how we can use this tool for data engineering. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion and live walkthrough, is also embedded below in case you were not able to attend live. If you would like to attend Data Engineer’s Lunch in person, it is hosted every Monday at 12 PM EST. Register here now!

Continue reading

Using TableAnalyzer for Data Model Review

This post will cover the installation of TableAnalyzer and the use of the tool for data model review. TableAnalyzer is useful for troubleshooting problems with Cassandra tables. We can then tie those problems to particular issues with the data model and use it to drive changes. Check out Using TableAnalyzer – Anant’s Tool for Analysis of Cassandra Tables as well!

Continue reading