Skip to content
Tech Stuff, BigData & more

Primary Navigation Menu

Menu
  • Home
  • BigData
    • Spark
    • AWS
  • Tech Staff
    • Scrappy
    • Solr
  • Project Management

Redshift Spectrum

Analyzing Redshift Spectrum ‘Patronus’

2017-11-19
By: Albert Franzi
On: 19th November 2017
In: AWS

Redshift Spectrum ‘Patronus’ This post aims to analyze how Redshift Spectrum works and how we can take advantage of using it. I will try to load data from S3 such as Sessions (Parquet) & Raw Data (JSON). First of all, we will follow the Getting started using spectrum guide. To use Redshift Spectrum, the cluster needs to be at version 1.0.1294 or later. We can validate that executing select version();

1
2
3
4
dwh_sch=# select version();
                                                         version
--------------------------------------------------------------------------------------------------------------------------
PostgreSQL 8.0.2 on i686-pc-linux-gnu, compiled by GCC gcc (GCC) 3.4.2 20041017 (Red Hat 3.4.2-6.fc3), Redshift 1.0.1499

To keep in mind – Redshift Pricing. Price Per Redshift Spectrum Query With Redshift Spectrum, you are billed at $5 per terabyte of data scanned, rounded up to the next megabyte, with a 10 megabyte minimum per query. For example, if you scan 10Read More →

Recent Posts

  • Analyzing Redshift Spectrum ‘Patronus’
  • Creating a Redshift Sandbox for our Analysts
  • Learning Session – AWS Athena
  • Checking your Redshift users
  • Tableau fed by Presto with S3 Parquets

Categories

  • AWS
  • BigData
  • MapReduce
  • Project Management
  • Solr
  • Spark
  • Tech Staff
  • Uncategorised

Meta

  • Log in
  • Entries RSS
  • Comments RSS
  • WordPress.org

© 2015 - 2019 Efimeres