Anant Corporation Blog: Our research, knowledge, thoughts, and recommendations about building and managing online business platforms.
In Data Engineer’s Lunch #16: Introduction to awk for Data Engineering, we introduce awk, a domain-specific language and text processor, and show how to use it for data engineering. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion and a live walkthrough, is embedded below in case you were not able to attend live. If you would like to attend Data Engineer’s Lunch in person, it is hosted every Monday at 12 PM EST. Register here now!
In case you missed it, the fourth installment of our weekly Data Engineer’s Lunch was presented by guest speaker Will Angel. It covered using Airflow, a scheduling tool for managing data pipelines, for data engineering. The live recording of the Data Engineer’s Lunch, which includes a more in-depth discussion, is embedded below in case you were not able to attend live. If you would like to attend a Data Engineer’s Lunch live, it is hosted every Monday at noon EST. Register here now!
The first part of any machine learning project is to gather data. This sounds easy, and you may think that having data in relational databases puts you in the perfect position to work with it. In some circumstances that may be correct. However, most of the ways that we store data in databases for business platforms are suboptimal for machine learning: they require more work before we can get the insights we want out of our data.
This is the second part of our “Diving deep into Gartner’s Top 10 Data and Analytics Technology Trends for 2019” blog series. In this series, we’ll explore the top 10 Data & Analytics trends for 2019 identified by Gartner. If you haven’t yet seen the first post, on Augmented Analytics, check it out here.
Asynchronous data is data that is not combined or aligned in time when it is sent or received. In this type of transmission, signals are sent between computers and external systems, in either direction, asynchronously. But why does this matter, and how can people benefit from it?