- It is a fast and powerful, fully managed, petabyte-scale data warehouse service in the cloud.
- Customers can start small for just $0.25 per hour with no commitments or upfront costs and scale to petabyte or more for $1000 per terabyte per year, less than 10th of most other data warehousing solutions.
- Redshift configuration:
- Single Node (160 GB)
- Multi-Node
- Leader Node (manages client connections and receives queries)
- Compute Node (store data and perform queries and computations). Up to 128 compute nodes.
- Columnar Data Storage:
- Instead of storing data in series of rows, Redshift organises the data by column. Unlike row-based systems, which are ideal for transaction processing, column-based systems are ideal for data warehousing and analytic, where queries often involve aggregates performed over large datasets. Since only the columns involved in the query are processed and columnar data is stored sequentially on the storage media, column-based systems require far fewer I/Os, greatly improving query performance.
- Advanced Compression
- Columnar data stores can be compressed much more than row-based data stores because similar data is stored sequentially on disk. Redshift employs multiple compression techniques and can often achieve significant compression relative to traditional relational datastores. In addition, Redshift doesn’t require indexes or materialized views and so use less space than traditional relational database systems. When loading data into an empty table, Redshift automatically samples your data and selects the most appropriate compression technique.
- Massively Parallel Processing
- Redshift automatically distributes data and query load across all nodes. Redshift makes it easy to add nodes to your warehouse and enables you to maintain fast query performance as your data warehouse grows.
- Pricing:
- Compute node hours
- Backup
- Data transfer (within a VPN, not outside)
- Redshift Security:
- Encrypted in transit using SSL
- Encrypted at rest using AES-256 encryption.
- By default, Redshift take care of Key Management.
- Manage your own keys through HSM
- AWS Key Management Service
- Availability:
- Currently only available in 1 AZ
- Can restore snapshots to new AZ’s in the event of an outage.
Subscribe to:
Post Comments (Atom)
0 comments:
Post a Comment