Spark Scala Examples: Your baby steps to Big Data

baby walking on the beach

This post elaborates on Apache Spark transformation and action operations by providing a step by step walk through of Spark examples in Scala. Before you dive into these examples, make sure you know some of the basic Apache Spark Concepts. Below examples are in no particular sequence and is the first part of our five-part […]

SQL Find Duplicates Like a Pro: 3 Guaranteed Techniques

dogs representing duplicate records

Aah…. duplicates! They are everywhere! Look around you – multiple charger cables, headphones, pictures in your smartphone! But we are not here to talk about those duplicates. No, Sir! We are here to address the duplicates in sql, how to find them and possibly resolve them in your SQL code. In this SQL find duplicates […]

10 Redshift Create Table Examples to make you an expert

redshift create table

In this blog post, let us look at some Redshift Create Table Examples, 10 to be exact ! However, before we get started, what exactly is Redshift? Amazon Redshift is a cloud based data warehouse service by AWS. For those of us who have used PostgreSQL, you will be surprised to know that Redshift is […]

Spark SQL Date Functions – Complete list with examples

spark sql date functons

In this post we will address Spark SQL Date Functions, its syntax and what it does. There are 28 Spark SQL Date functions, meant to address string to date, date to timestamp, timestamp to date, date additions, subtractions and current date conversions. Spark SQL is the Apache Spark module for processing structured data. There are […]