Parallel Database for OLTP and OLAP

This is a survey of materials on parallel database products and
technologies for OLTP/OLAP applications. It mainly covers the major
commercial and academic efforts to develop parallel DBMSs that cope
with the ever-growing volume of relational data to be processed.

 

Part I - Parallel DBMSs

1.1 Parallel Database for OLAP (Shared-Nothing/MPP)

Teradata

- Teradata Home

- Teradata DBC/1012 Paper

- NCR Teradata VS Oracle Exadata (Teradata's perspective)

Vertica

- Vertica Home

- The original research project: C-Store

ParAccel

- ParAccel Home

- MPP Based Architecture

- Columnar Based Storage

- Flash Based Storage

DATAllegro (now MS Madison)

- Design Choices in MPP Data Warehousing: Lessons from DATAllegro V3

- Microsoft SQL Server Parallel Data Warehousing

Netezza

- Netezza Home

- Acquired by IBM

- Hadoop & Netezza: Synergy in Data Analytics (Part 1, Part 2) 

- Netezza TwinFin VS Oracle Exadata (eBook, Blog, Netezza's perspective)

Greenplum:

- Greenplum Home

- Combined: PostgreSQL/ZFS/MapReduce

- Acquired by EMC

Oracle Exadata:

- Exadata Home (Technical Overview, White Paper)

    - OLTP & OLAP Hybrid Orientation

    - 1 * RAC + N * Exadata Cells (Storage Nodes) + InfiniBand Network

    - Exadata Cell: Flash Cache + Disk Array + Data Filtering Logic (partial SQL execution)

- Oracle Exadata VS Netezza TwinFin (Oracle Engineer's perspective)
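
The "Data Filtering Logic (partial SQL execution)" item above is easier to picture with a toy model. The sketch below is only an illustration of the general idea of pushing a filter and a column projection down to storage nodes; it is not Oracle's implementation or API. The names StorageCell, scan and offloaded_query, as well as the sample rows, are hypothetical and invented for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Row = Dict[str, object]


@dataclass
class StorageCell:
    """Hypothetical storage node holding one slice of a table."""
    rows: List[Row]

    def scan(self, predicate: Callable[[Row], bool], columns: List[str]) -> List[Row]:
        # Filter and project at the storage layer, so only matching rows
        # and the requested columns are sent back to the database node.
        return [{c: r[c] for c in columns} for r in self.rows if predicate(r)]


def offloaded_query(cells: List[StorageCell],
                    predicate: Callable[[Row], bool],
                    columns: List[str]) -> List[Row]:
    # The database node only concatenates the pre-filtered partial results.
    result: List[Row] = []
    for cell in cells:
        result.extend(cell.scan(predicate, columns))
    return result


# Made-up sample data spread over two cells (assumption for illustration).
cells = [
    StorageCell([{"id": 1, "region": "EU", "amount": 40},
                 {"id": 2, "region": "US", "amount": 75}]),
    StorageCell([{"id": 3, "region": "EU", "amount": 90},
                 {"id": 4, "region": "EU", "amount": 10}]),
]

# Roughly: SELECT id, amount FROM t WHERE region = 'EU' AND amount > 30
big_eu = offloaded_query(
    cells,
    lambda r: r["region"] == "EU" and r["amount"] > 30,
    ["id", "amount"],
)
print(big_eu)
```

In this toy setup two of the four rows, and two of the three columns, cross the "network" between the cells and the database node. Reducing that traffic is the motivation usually given for filtering inside the storage tier.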

IBM DB2 Data Partitioning Feature (can work with both OLAP/OLTP)

- formerly known as DB2 Parallel Edition (A Shorter Overview)

- DB2 At a Glance - Data Partitioning Feature

- Simulating Massively Parallel Database Processing on Linux

AsterData:

- Supercharging Analytics with SQL-MapReduce

- Aster Data brings Applications inside an MPP Database 

Misc Articles:

- What's MPP?

- Comparison of Oracle to IBM DB2 UDB and NCR Teradata for Data Warehousing

- SMP or MPP for Data Warehouse

- Dividing the Data Warehousing Work among MPP Nodes

- SANs vs. DAS in MPP Data Warehousing

- Three ways Oracle or Microsoft could go MPP

1.2 Parallel Database for OLTP (Shared-Disk/SMP)

Oracle Real Application Clusters

- Oracle RAC Concepts

- Oracle Parallel Database Server Concepts

- Oracle RAC Case Study on 16-Node Linux Cluster

IBM DB2 for z/OS (with Sysplex Technology)

- Shared Disk and Shared Nothing for IBM DB2

- What's DB2 Data Sharing?

IBM DB2 for LUW (with pureScale Technology)

- IBM DB2 pureScale: The Next Big Thing or a Solution Looking for a Problem?

- What is DB2 pureScale?

- DB2 pureScale Scalability (section 1, section 2)

Part II - Academic Readings

2.1 Overview

1). Parallel Database Systems: The Future of High Performance Database Processing

2). Survey of Architecture of Parallel Database System

3). The Case for Shared Nothing

4). Much Ado About Shared-Nothing

2.2 Research Systems

1). XPS: A High Performance Parallel Database Server

2). The Design of XPRS

3). Prototyping Bubba, A Highly Parallel Database System

4). The Gamma Database Machine Project

5). NonStop SQL, A Distributed, High-Performance, High-Availability Implementation of SQL

6). Parallel Query Processing in Shared Disk Database Systems

7). Architecture of SDC, the Super Database Computer

2.3 Commercial Systems

1). A Study of A Parallel Database Machine and Its Performance - The NCR/TERADATA DBC/1012

2). A Practical Implementation of the Database Machine - Teradata DBC/1012

3). DB2 Parallel Edition

4). Parallel SQL Execution in Oracle 10g

5). Shared Cache - The Future of Parallel Database

6). Cache Fusion: Extending Shared-Disk Clusters with Shared Caches