Business Platform Team

Anant Corporation Blog: Our research, knowledge, thoughts, and recommendations about building and managing online business platforms.

Author Archives: Obioma Anomnachi


Cover image for Petl for Data Engineering presentation

Data Engineer’s Lunch #28: Petl for Data Engineering

In Data Engineer’s Lunch #28: Petl for Data Engineering, we discussed Petl as part of our ongoing series on python ETL tools. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion, is also embedded below in case you were not able to attend live. If you would like to attend a Data Engineer’s Lunch live, it is hosted every Monday at noon EST. Register here now!

Continue reading
Cover image with the title Spark Script Dependency Management

Spark Script Dependency Management

In this blog post, we will discuss a number of ways of doing dependency management when running spark scripts. This particular post is not a part of any of our ongoing series. We often discuss using spark during our Data Engineer’s Lunch events every Monday. If you would like to attend a Data Engineer’s Lunch live, it is hosted every Monday at noon EST. Register here now! We last discussed Spark at a recent Cassandra Lunch. The topic was ETL in Cassandra with Airflow and Spark, Our most recent discussion of Spark can be found here.

Continue reading
Cover slide for the Airflow and Cassandra for Cluster Management webinar

Apache Cassandra Lunch #52: Airflow and Cassandra for Cluster Management

In Apache Cassandra Lunch #52: Airflow and Cassandra for Cluster Management, we discussed using Airflow to schedule tasks on a Cassandra cluster beyond what could be accomplished with the Cassandra provider package. The live recording of Cassandra Lunch, which includes a more in-depth discussion and a demo, is embedded below in case you were not able to attend live. If you would like to attend Apache Cassandra Lunch live, it is hosted every Wednesday at 12 PM EST. Register here now!

Continue reading
Cover slide for the Pandas for Data Engineering webinar

Data Engineer’s Lunch #24: Pandas for Data Engineering

In Data Engineer’s Lunch #24: Pandas for Data Engineering, we discussed using Pandas for performing Data Engineering tasks in Python. This topic is part of our ongoing series on Python ETL tools. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion, is also embedded below in case you were not able to attend live. If you would like to attend a Data Engineer’s Lunch live, it is hosted every Monday at noon EST. Register here now!

Continue reading
Airflow and Cassandra

Apache Cassandra Lunch #48: Airflow and Cassandra

In Apache Cassandra Lunch #48: Airflow and Cassandra, we discussed using Airflow to manage interactions with Cassandra. Specifically this week we covered Airflow Operators, and how they could be used to interact with a Cassandra cluster on the level of the various language drivers. The live recording of Cassandra Lunch, which includes a more in-depth discussion and a demo, is embedded below in case you were not able to attend live. If you would like to attend Apache Cassandra Lunch live, it is hosted every Wednesday at 12 PM EST. Register here now!

Continue reading