Using Amazon RedShift with the AWS .NET API Part 10: RedShift in Big Data
April 16, 2015
Introduction
In the previous post we discussed how to calculate the more complex parts of the aggregation script: the median and nth percentile of the URL response time.
This post will take up the Big Data thread where we left off at the end of the series on Amazon S3. We’ll also refer to what we built at the end of the series on Elastic MapReduce. That post showed how to run an aggregation job via the AWS .NET SDK on an available EMR cluster. Familiarity with those topics is therefore a prerequisite for following the code examples in this post.
In this post our goal is to show an alternative to EMR. We’ll also see how to import the raw data source from S3 into RedShift.