Questions tagged [redshift]

Amazon Redshift is a petabyte-scale data warehousing service that lets you analyze data with existing business intelligence tools. Redshift is a column-oriented database based on ParAccel (MPP over PostgreSQL 8.0).

5 questions
4
votes
1 answer

To data warehouse or not to data warehouse?

I was wondering if you would be so kind as to assist me with a quick question (I will be happy to explain more if you are willing to...). I am researching and setting up a system to do a machine learning job (training) to find correlations between…
3
votes
1 answer

Big Data - Data Warehouse Solutions?

I have a dozen databases that store different data, and each of them is 100 TB in size. All of the data is stored in AWS services such as RDS, Aurora and Dynamo. Many times I find myself needing to perform "joins" across databases, for example a…
user1157751
  • 709
  • 1
  • 8
  • 22
1
vote
0 answers

Finding the maximum change in a value using Redshift

The following is the problem I want to solve, but I don't know how to implement it. I am using Redshift to store data. The following is the format of the data stored in Redshift. It is the sales history for every product, for every year, by month. ProductId Year …
Pavan Kumar
  • 111
  • 1
1
vote
1 answer

Out of Memory Error when Selecting Data from Redshift Table

I am selecting data from an Amazon Redshift table with 500 million rows. I have 64-bit Python installed. Code: import psycopg2 from sqlalchemy import create_engine import pandas as pd engine =…
TigSh
  • 243
  • 1
  • 5
  • 14
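For readers who hit the same out-of-memory problem, here is a minimal sketch of one common approach: pulling rows in fixed-size batches through a DB-API cursor instead of materializing the whole result at once. It is not the asker's code. The `iter_rows` helper, the `sales` table, and its columns are hypothetical, and an in-memory `sqlite3` database stands in for Redshift so the example runs anywhere. With psycopg2, a named (server-side) cursor would likely also be needed, since a default client-side cursor buffers the full result set on `execute`.

```python
import sqlite3


def iter_rows(cursor, batch_size=10_000):
    """Yield rows one at a time while fetching them in batches.

    Works with any DB-API 2.0 cursor; only `batch_size` rows are held
    in this generator at a time.
    """
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        yield from batch


# Stand-in database (assumption: sqlite3 replaces the Redshift connection).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (product_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [(i, i * 1.5) for i in range(25)],
)

cur = conn.execute("SELECT product_id, amount FROM sales ORDER BY product_id")

# Aggregate incrementally instead of loading every row into a DataFrame.
count = 0
total = 0.0
for product_id, amount in iter_rows(cur, batch_size=10):
    count += 1
    total += amount

print(count, total)
```

The same idea applies when the goal is a pandas workflow: process each batch and keep only the reduced result, rather than concatenating every batch back into one large frame.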
1
vote
1 answer

Using regex in Redshift to find dollar values

I have a field in a Redshift table that has user-generated text. The field is where users can say how much they think something costs. Ideally it'd just be a decimal, but it's varchar. So users can type "I think this is worth $25", or "I'd pay…
ScottieB
  • 333
  • 1
  • 2
  • 8
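As a rough illustration of the kind of pattern this question is after, here is a small Python sketch that pulls dollar amounts out of free text. The pattern and the `extract_dollar_values` helper are assumptions made for illustration, not something from the question. Redshift has its own regex functions (for example `REGEXP_SUBSTR`), and its pattern syntax may differ from Python's `re`, so treat this as a way to prototype the pattern rather than as a drop-in query.

```python
import re
from decimal import Decimal

# Hypothetical pattern: a dollar sign, optional spaces, then either
# comma-grouped digits (1,250) or plain digits (2500), with optional cents.
DOLLAR_RE = re.compile(r"\$\s*(\d{1,3}(?:,\d{3})+|\d+)(\.\d{1,2})?")


def extract_dollar_values(text):
    """Return every dollar amount found in `text` as a Decimal."""
    values = []
    for match in DOLLAR_RE.finditer(text):
        whole = match.group(1).replace(",", "")  # drop thousands separators
        cents = match.group(2) or ""             # ".50" or empty
        values.append(Decimal(whole + cents))
    return values


print(extract_dollar_values("I think this is worth $25"))
print(extract_dollar_values("Somewhere between $1,250.50 and $3"))
print(extract_dollar_values("no idea"))
```

Amounts written without a dollar sign ("25 bucks", "twenty dollars") are deliberately not matched here; whether to handle them depends on how messy the real field is.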