Amazon Redshift is a petabyte-scale data warehousing service using existing business intelligence tools to analyze the data. Redshift is a column-oriented database based on ParAccel (MPP over PostgreSQL 8.0).
Questions tagged [redshift]
5 questions
4
votes
1 answer
To datawarehouse or not to data warehouse?
I was wondering if you will be as so kind to assist me with a quick question (will to be happy to explain more if you are willing to...). I am researching and setting up a system to do a machine learning job (training) to find correlations between…
Yonathan Mizrachi
- 41
- 1
3
votes
1 answer
Big Data - Data Warehouse Solutions?
I have a dozen of databases that stores different data, and each of them are 100TBs in size. All of the data is stored in AWS services such as RDS, Aurora and Dynamo.
Many times I find myself need to perform "joins" across databases, for example a…
user1157751
- 709
- 1
- 8
- 22
1
vote
0 answers
Finding change maximum change in the value using Redshift
Following is the problem I want to solve. But I don't know how to implement it. I am using Redshift to store data.
Following is the format of the data stored in Redshift. It is sales history for every product for all year by month.
ProductId Year …
Pavan Kumar
- 111
- 1
1
vote
1 answer
Out of Memory Error when Selecting Data from Redshift Table
I am selecting data from Amazon Redshift Table with 500 millions rows. I have 64bit python installed.
code
import psycopg2
from sqlalchemy import create_engine
import pandas as pd
engine =…
TigSh
- 243
- 1
- 5
- 14
1
vote
1 answer
Using regex in redshift to find dollar values
I have a field in a Redshift table that has user-generated text. The field is where users can say how much they think something costs.
Ideally it'd just be a decimal, but it's varchar. So users can type
"I think this is worth \$25", or "I'd pay…
ScottieB
- 333
- 1
- 2
- 8