Amazon Web Services - Redshift
Amazon Redshift is a fully managed data warehouse service in the cloud. It handles datasets ranging from a few hundred gigabytes to a petabyte or more. To create a data warehouse, you first launch a set of compute resources called nodes, which are organized into a group called a cluster. After that, you can run your queries against the cluster.
How to Set Up Amazon Redshift?
Following are the steps to set up Amazon Redshift.
Step 1 − Sign in and launch a Redshift Cluster using the following steps.
Sign in to the AWS Management Console and open the Amazon Redshift console.
Select the region where the cluster is to be created using the Region menu in the top right corner of the screen.
Click the Launch Cluster button.
The Cluster Details page opens. Provide the required details and click the Continue button on each page until you reach the review page.
A confirmation page opens. Click the Close button to finish so that the cluster is visible in the Clusters list.
Select the cluster in the list and review its Cluster Status information.
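The console steps above can also be scripted. The following is a minimal sketch, assuming the third-party boto3 library and valid AWS credentials are available. The helper names, the cluster identifier, the node type dc2.large, and the user name are placeholders chosen for illustration and do not come from this tutorial. The first function only builds the request, so it can be inspected without calling AWS.

```python
def build_create_cluster_request(cluster_id, node_type, master_user,
                                 master_password, db_name="dev",
                                 number_of_nodes=1, port=5439):
    """Build keyword arguments for boto3's Redshift create_cluster call."""
    if number_of_nodes < 1:
        raise ValueError("number_of_nodes must be at least 1")
    request = {
        "ClusterIdentifier": cluster_id,
        "NodeType": node_type,
        "MasterUsername": master_user,
        "MasterUserPassword": master_password,
        "DBName": db_name,
        "Port": port,  # 5439 is the Redshift default port
    }
    if number_of_nodes == 1:
        request["ClusterType"] = "single-node"
    else:
        # NumberOfNodes is only sent for multi-node clusters
        request["ClusterType"] = "multi-node"
        request["NumberOfNodes"] = number_of_nodes
    return request


def launch_cluster(region, request):
    """Send the request to AWS. Requires boto3 and configured credentials."""
    import boto3  # third-party; imported here so the builder works without it
    client = boto3.client("redshift", region_name=region)
    return client.create_cluster(**request)


if __name__ == "__main__":
    # Placeholder values for illustration only
    example = build_create_cluster_request(
        "examplecluster", "dc2.large", "awsuser", "<your-password>",
        number_of_nodes=2,
    )
    print(example)
```

Calling launch_cluster("us-west-2", example) would start a billable cluster, so review the request first.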
Step 2 − Configure a security group to authorize client connections to the cluster. How access to Redshift is authorized depends on whether the cluster was launched on the EC2-VPC or the EC2-Classic platform.
Follow these steps to configure the security group on the EC2-VPC platform.
Open the Amazon Redshift Console and click Clusters in the navigation pane.
Select the desired cluster. Its Configuration tab opens.
Click the security group.
Once the Security group page opens, click the Inbound tab.
Click the Edit button. Set the fields as shown below and click the Save button.
Type − Custom TCP Rule.
Protocol − TCP.
Port Range − Type the same port number used while launching the cluster. The default port for Amazon Redshift is 5439.
Source − Select Custom IP, then type 0.0.0.0/0.
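The inbound rule above can be expressed as data. The following is a minimal sketch that builds the rule in the shape accepted by the IpPermissions parameter of boto3's EC2 authorize_security_group_ingress call. The helper names and the 203.0.113.0/24 range are illustrative assumptions, not part of this tutorial. Note that a source of 0.0.0.0/0 admits connections from any IPv4 address, so the sketch includes a check that flags it.

```python
import ipaddress

REDSHIFT_DEFAULT_PORT = 5439


def build_inbound_rule(cidr, port=REDSHIFT_DEFAULT_PORT):
    """Build one TCP ingress rule (an IpPermissions entry) for a CIDR source."""
    network = ipaddress.ip_network(cidr)  # raises ValueError on a malformed CIDR
    if network.version != 4:
        raise ValueError("this sketch only handles IPv4 ranges")
    if not 1 <= port <= 65535:
        raise ValueError("port must be between 1 and 65535")
    return {
        "IpProtocol": "tcp",
        "FromPort": port,
        "ToPort": port,
        "IpRanges": [{"CidrIp": str(network)}],
    }


def is_open_to_world(rule):
    """Return True if any source range in the rule covers every IPv4 address."""
    return any(
        ipaddress.ip_network(r["CidrIp"]).prefixlen == 0
        for r in rule["IpRanges"]
    )


def authorize(group_id, rule, region):
    """Apply the rule. Requires boto3 and configured AWS credentials."""
    import boto3  # third-party; imported here so the helpers work without it
    ec2 = boto3.client("ec2", region_name=region)
    return ec2.authorize_security_group_ingress(
        GroupId=group_id, IpPermissions=[rule]
    )


if __name__ == "__main__":
    rule = build_inbound_rule("0.0.0.0/0")
    print(rule, "open to world:", is_open_to_world(rule))
```

Outside of a quick test, a narrower source range such as your own network's CIDR is usually the safer choice.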
Step 3 − Connect to Redshift Cluster.
There are two ways to connect to Redshift Cluster − Directly or via SSL.
Following are the steps to connect directly.
Connect to the cluster using a SQL client tool. Redshift supports SQL client tools that are compatible with PostgreSQL JDBC or ODBC drivers.
Use the following links to download the drivers − JDBC
ODBC
or http://ftp.postgresql.org/pub/odbc/versions/msi/psqlodbc_09_00_0101x64.zip for 64-bit machines
Use the following steps to get the connection string.
Open Amazon Redshift Console and select Cluster in the Navigation pane.
Select the cluster of choice and click the Configuration tab.
A page opens with the JDBC URL listed under Cluster Database Properties. Copy the URL.
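The copied JDBC URL packs the cluster endpoint, port, and database name into one string. The following is a minimal sketch that splits such a URL into its parts using only the Python standard library. It assumes the URL has the form jdbc:redshift://host:port/database (or the older jdbc:postgresql:// prefix). The function name and the endpoint in the example are made up for illustration.

```python
from urllib.parse import urlsplit


def parse_jdbc_url(jdbc_url):
    """Split a Redshift JDBC URL into host, port, and database name."""
    prefix = "jdbc:"
    if not jdbc_url.startswith(prefix):
        raise ValueError("expected a URL starting with 'jdbc:'")
    parts = urlsplit(jdbc_url[len(prefix):])
    if parts.scheme not in ("redshift", "postgresql"):
        raise ValueError("unsupported JDBC sub-protocol: %r" % parts.scheme)
    if not parts.hostname:
        raise ValueError("no host found in URL")
    return {
        "host": parts.hostname,
        "port": parts.port or 5439,  # fall back to the Redshift default port
        "dbname": parts.path.lstrip("/"),
    }


if __name__ == "__main__":
    # Hypothetical endpoint for illustration only
    url = ("jdbc:redshift://examplecluster.abc123xyz789"
           ".us-west-2.redshift.amazonaws.com:5439/dev")
    print(parse_jdbc_url(url))
```

The returned keys follow the host, port, and dbname naming used by PostgreSQL-compatible client libraries, so they can be combined with a user name and password when connecting from a script instead of a GUI tool.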
Use the following steps to connect the Cluster with SQL Workbench/J.
Open SQL Workbench/J.
Select File and click Connect window.
Select Create a new connection profile and fill in the required details, such as the profile name.
Click Manage Drivers. The Manage Drivers dialog box opens.
Click the Create a new entry button and fill in the required details.
Click the folder icon and navigate to the driver location. Finally, click the Open button.
Leave the Classname box and Sample URL box blank. Click OK.
Choose the driver from the list.
In the URL field, paste the JDBC URL you copied.
Enter the username and password in their respective fields.
Select the Autocommit box and click Save profile list.
Features of Amazon Redshift
Following are the features of Amazon Redshift −
Supports VPC − Users can launch Redshift within a VPC and control access to the cluster through the virtual networking environment.
Encryption − Data stored in Redshift can be encrypted. Encryption is configured at the cluster level when the cluster is created.
SSL − SSL encryption is used to encrypt connections between clients and Redshift.
Scalable − With a few clicks, the number of nodes in your Redshift data warehouse can be scaled up or down as required. It also lets you scale storage capacity without any loss in performance.
Cost-effective − Amazon Redshift is a cost-effective alternative to traditional data warehousing practices. There are no up-front costs and no long-term commitments, and pricing is on-demand.