Database Optimization: When to Shard, Partition, or Index
As data volumes grow and application requirements become more demanding, the efficiency of database operations becomes critical to system performance. This guide explores three fundamental database optimization strategies: indexing, partitioning, and sharding. Each approach solves specific performance challenges and comes with its own tradeoffs.
Understanding when to apply each strategy is crucial for database architects and developers. This guide will help you navigate the complex decision-making process by examining:
- The core concepts and principles behind each optimization strategy
- Specific scenarios where each approach shines
- Performance implications and trade-offs
- Implementation considerations and best practices
- Real-world examples and case studies
These strategies are not mutually exclusive – many high-performance systems combine indexing, partitioning, and sharding to achieve optimal results. The art lies in understanding when and how to apply each technique based on your specific requirements and constraints.
Understanding Data Access Patterns
Before diving into specific optimization strategies, it's essential to understand your application's data access patterns. These patterns determine which strategy will be most effective for your use case. Analyzing your workload is the first and most critical step in selecting the right optimization approach.
Common Access Patterns
| Access Pattern | Description | Recommended Optimization |
|---|---|---|
| Point Queries | Single-record lookup by specific key (e.g., find user by ID) | Indexing, Sharding |
| Range Queries | Multiple records based on value ranges (e.g., orders between dates) | Indexing, Partitioning |
| Full Table Scans | Processing all records in a table (e.g., batch operations) | Partitioning |
| Aggregations | Statistical operations like SUM, AVG, COUNT (e.g., reporting) | Partitioning, Sharding |
| Write-Heavy Workloads | High volume of inserts/updates (e.g., logging, event streams) | Sharding |
| Mixed Workloads | Balanced read/write with various query types (e.g., OLTP systems) | Indexing + Partitioning |
Use database monitoring tools to analyze your most frequent queries and their execution plans. Look for patterns in data access and bottlenecks in performance. Most database systems provide query analyzers that can help identify slow queries and suggest optimization strategies.
Key Performance Metrics
Before implementing any optimization strategy, establish baselines for the following key metrics to measure improvement and make data-driven decisions:
- Query response time: How long it takes to execute a query and return results (measured in milliseconds or seconds)
- Throughput: Number of transactions per second (TPS) or queries per second (QPS) the system can handle
- Resource utilization: CPU, memory, disk I/O, and network usage percentages during peak loads
- Scalability: How performance changes as data volume or user load increases (linear vs. non-linear scaling)
- Availability: Percentage of time the system is operational (often measured in "nines" - 99.9%, 99.99%, etc.)
- Latency percentiles: Response times at different percentiles (p50, p95, p99) to understand outliers
- Write amplification: The ratio of data written to storage compared to the original data size (important for write-heavy workloads)
These metrics will help you determine which optimization strategy to prioritize and provide a basis for measuring improvements after implementation. Document baseline performance before making changes to accurately measure the impact of your optimizations.
When measuring performance, consider both average-case and worst-case scenarios. Applications often fail not because of average performance but because of unexpected spikes or edge cases. Monitor your 95th and 99th percentile response times in addition to averages.
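As a minimal sketch of how such a baseline might be computed, the following Python snippet derives average and percentile latencies from raw query timings using the nearest-rank method. The timing values are invented sample data, not measurements from a real system.

```python
# Sketch: computing baseline latency percentiles from raw query timings.
# The timings list is illustrative sample data, not from a real system.

def percentile(samples, pct):
    """Return the value at the given percentile (0-100) using
    nearest-rank on a sorted copy of the samples."""
    ordered = sorted(samples)
    # Nearest-rank: ceil(pct/100 * n) gives the 1-based rank.
    rank = max(1, -(-len(ordered) * pct // 100))  # ceiling division
    return ordered[rank - 1]

# Response times in milliseconds for 20 hypothetical queries.
timings_ms = [12, 15, 14, 13, 18, 22, 11, 250, 16, 14,
              13, 17, 19, 15, 12, 480, 14, 16, 13, 15]

p50 = percentile(timings_ms, 50)
p95 = percentile(timings_ms, 95)
p99 = percentile(timings_ms, 99)
avg = sum(timings_ms) / len(timings_ms)

print(f"avg={avg:.1f}ms p50={p50}ms p95={p95}ms p99={p99}ms")
```

Note how the average (about 50ms) hides the two slow outliers, while p95 and p99 expose them, which is exactly why percentile monitoring matters.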
Database Indexing
Indexing is the most fundamental database optimization technique. It creates auxiliary data structures that allow the database engine to locate rows more efficiently without scanning the entire table. Think of an index as the index in a book—it helps you find information without reading every page.
When to Use Indexing
Indexes are ideal for the following scenarios:
- Frequent queries on specific columns (especially in WHERE clauses)
- Tables with high cardinality (many unique values)
- JOIN operations that need to be optimized
- WHERE clauses with equality or range conditions
- ORDER BY and GROUP BY operations
- When query performance is more critical than write performance
Advantages
- Dramatically improves query performance for specific operations (often 10-1000x faster)
- No application changes required (transparent to application code)
- Relatively simple to implement and maintain
- Works well with existing queries without restructuring
- Supports both equality and range-based queries
- Can be created and dropped dynamically as workloads change
Disadvantages
- Increases storage requirements (typically 10-20% per index)
- Slows down write operations (INSERT, UPDATE, DELETE)
- Requires ongoing maintenance as data changes
- Not effective for low-cardinality columns (e.g., boolean fields)
- Too many indexes can degrade overall performance
- Index management adds operational complexity
Types of Indexes
| Index Type | Description | Best For |
|---|---|---|
| B-Tree | Balanced tree structure that maintains sorted data (standard in most databases) | General-purpose indexing, range queries, point lookups |
| Hash | Uses hash function for key-based lookups | Exact match queries (equality), high-performance key-value lookups |
| Bitmap | Compressed index using bit arrays | Low-cardinality columns, multiple AND/OR conditions, data warehousing |
| Full-text | Specialized index for text search with tokenization and relevance ranking | Text search, document databases, content management systems |
| Spatial | Optimized for geometric data with spatial algorithms (R-tree, Quadtree) | Geographic queries, location-based services, GIS applications |
| Covering | Index that includes all columns needed by a query (no table lookup required) | High-performance queries where all data is in indexed columns |
For composite indexes (multiple columns), place the most selective columns first to maximize efficiency. Consider the cardinality of columns (the number of unique values: high cardinality means many distinct values, such as IDs, while low cardinality means few, such as a gender field) and common query patterns. Monitor index usage with database tools and be prepared to drop unused indexes to reduce overhead.
```sql
-- Example index creation statements (standard SQL; the INCLUDE clause
-- is supported in PostgreSQL 11+ and SQL Server)

-- Simple index on a single column
CREATE INDEX idx_users_email ON users(email);

-- Composite index on multiple columns
CREATE INDEX idx_orders_customer_date ON orders(customer_id, order_date);

-- Covering index that includes additional non-key columns
CREATE INDEX idx_products_category_price ON products(category_id, price) INCLUDE (name, stock);

-- Unique index that enforces data integrity
CREATE UNIQUE INDEX idx_employees_employee_id ON employees(employee_id);
```
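Whether an index is actually used is worth verifying rather than assuming. As a minimal sketch, the snippet below uses SQLite (via Python's sqlite3 module) to compare query plans before and after creating an index; the table, data, and index names are illustrative.

```python
# Sketch (SQLite via Python's sqlite3): create an index and confirm the
# planner actually uses it by inspecting EXPLAIN QUERY PLAN output.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, name TEXT)")
conn.executemany("INSERT INTO users (email, name) VALUES (?, ?)",
                 [(f"user{i}@example.com", f"user{i}") for i in range(1000)])

query = "SELECT name FROM users WHERE email = ?"

# Without an index: the plan is a full table scan.
plan_before = conn.execute("EXPLAIN QUERY PLAN " + query,
                           ("user42@example.com",)).fetchall()

conn.execute("CREATE INDEX idx_users_email ON users(email)")

# With the index: the plan switches to an index search.
plan_after = conn.execute("EXPLAIN QUERY PLAN " + query,
                          ("user42@example.com",)).fetchall()

print("before:", plan_before[-1][-1])  # e.g. "SCAN users"
print("after: ", plan_after[-1][-1])   # e.g. "SEARCH users USING INDEX idx_users_email (email=?)"
```

The same check applies in other databases through their own EXPLAIN facilities; the point is to confirm the plan changed, not just that the index exists.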
Database Partitioning
Partitioning divides a large table or index into smaller, more manageable pieces called partitions, while maintaining a single logical view of the data. Each partition can be stored separately, potentially on different physical media. Partitioning helps manage large tables and improves query performance through partition pruning, where the database engine skips scanning partitions that are not relevant to a query, reducing I/O operations.
When to Use Partitioning
Partitioning is particularly effective in these scenarios:
- Very large tables that exceed practical management limits (typically hundreds of GB or TB)
- Tables with time-based data where older records are accessed less frequently
- Workloads that benefit from parallel query execution
- When you need to improve maintenance operations (backups, index rebuilds)
- Data archiving requirements where older partitions can be easily moved or purged
- When queries frequently filter on the partition key column
Advantages
- Improves query performance through partition pruning (only scanning relevant partitions)
- Enables parallel query operations across partitions
- Simplifies maintenance of large datasets through partition-level operations
- Enhances backup and recovery operations (partition-level backups)
- Facilitates data lifecycle management (archiving, purging)
- Can improve write performance by distributing writes across partitions
Disadvantages
- Adds complexity to database design and management
- Can decrease performance for queries that span multiple partitions
- Requires careful planning of partition keys
- May require schema changes in existing applications
- Potential for uneven data distribution ("skew") affecting performance
- Joins across partitions may be less efficient
Partitioning Strategies
| Strategy | Description | Example Use Case |
|---|---|---|
| Range Partitioning | Divides data based on a range of values (e.g., dates, IDs, numeric ranges) | Time-series data with monthly or yearly partitions |
| List Partitioning | Divides data based on discrete values or sets of values | Geographic regions, product categories, departments |
| Hash Partitioning | Distributes data evenly using a hash function on the partition key | Even distribution when no natural dividing key exists |
| Composite Partitioning | Combines multiple strategies (e.g., range + hash) with subpartitioning | Date range primary with customer ID subpartitioning |
| Key Partitioning | Similar to hash but uses database's internal hashing algorithm | When even distribution is needed but you want the DB to manage hashing |
| Column Partitioning | Stores different columns in different partitions (columnar storage) | Analytics workloads with selective column access patterns |
Choose partition keys that align with your most common query patterns. For time-series data, consider a rolling window partitioning strategy where you maintain a fixed number of recent partitions and archive older ones. Test partition elimination by examining query execution plans to ensure your database is actually skipping irrelevant partitions.
```sql
-- Example partitioning statements (PostgreSQL syntax)

-- Range partitioning by date
CREATE TABLE sales (
    id SERIAL,
    sale_date DATE NOT NULL,
    amount DECIMAL(10,2),
    customer_id INTEGER
) PARTITION BY RANGE (sale_date);

-- Create partitions for specific date ranges
CREATE TABLE sales_2023_q1 PARTITION OF sales
    FOR VALUES FROM ('2023-01-01') TO ('2023-04-01');
CREATE TABLE sales_2023_q2 PARTITION OF sales
    FOR VALUES FROM ('2023-04-01') TO ('2023-07-01');

-- List partitioning by region
CREATE TABLE customers (
    id SERIAL,
    name TEXT,
    region TEXT
) PARTITION BY LIST (region);

CREATE TABLE customers_na PARTITION OF customers
    FOR VALUES IN ('USA', 'Canada', 'Mexico');
CREATE TABLE customers_eu PARTITION OF customers
    FOR VALUES IN ('UK', 'France', 'Germany', 'Spain', 'Italy');
```
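The pruning behavior that declarative partitioning provides can be illustrated with a small application-level sketch: rows live in per-quarter buckets, and a date-range query only touches the buckets whose ranges overlap the request. The data and bucket layout here are hypothetical.

```python
# Sketch: the idea behind partition pruning, modeled at the application
# level. Rows are stored in per-quarter buckets; a date-range query only
# scans the buckets that overlap the requested range.
from datetime import date

# Quarterly partitions, keyed by (start, end) half-open date ranges.
partitions = {
    (date(2023, 1, 1), date(2023, 4, 1)): [("sale-a", date(2023, 2, 10))],
    (date(2023, 4, 1), date(2023, 7, 1)): [("sale-b", date(2023, 5, 3))],
    (date(2023, 7, 1), date(2023, 10, 1)): [("sale-c", date(2023, 8, 21))],
}

def query_range(lo, hi):
    """Return sales in [lo, hi), scanning only overlapping partitions."""
    scanned, results = [], []
    for (start, end), rows in partitions.items():
        if end <= lo or start >= hi:
            continue  # pruned: partition cannot contain matching rows
        scanned.append((start, end))
        results.extend(r for r in rows if lo <= r[1] < hi)
    return scanned, results

scanned, results = query_range(date(2023, 4, 15), date(2023, 6, 1))
print("partitions scanned:", len(scanned))  # 1 of 3; the rest are pruned
print("rows:", results)
```

A real database performs this elimination inside the planner, which is why queries that filter on the partition key benefit while queries on other columns may still scan every partition.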
Use a phased approach when implementing database optimizations. Start with the simplest solution (typically indexing) and measure its impact before moving to more complex strategies. This iterative approach helps avoid unnecessary complexity while ensuring you only implement what's needed for your specific workload.
Comparison: Indexing vs. Partitioning vs. Sharding
| Feature | Indexing | Partitioning | Sharding |
|---|---|---|---|
| Implementation Complexity | Low | Medium | High |
| Data Distribution | Single location | Single server, multiple segments | Multiple servers |
| Impact on Reads | Significant improvement | Moderate improvement | Scalable improvement |
| Impact on Writes | Slight degradation | Moderate improvement | Significant improvement |
| Query Complexity | Transparent | Mostly transparent | Can be complex |
| Application Changes | None | Minimal | Extensive |
| Data Size Scalability | Limited | Medium | Virtually unlimited |
| Transaction Support | Full | Full (with some limitations) | Limited across shards |
| Maintenance Overhead | Low | Medium | High |
| Fault Tolerance | Single point of failure | Single point of failure | High (failure isolation) |
| Hardware Requirements | Standard | Higher-end single server | Multiple servers |
| Cost | Low | Medium | High |
Database Examples
Different database systems offer varying capabilities for indexing, partitioning, and sharding. Here's how some popular database systems implement these optimization strategies:
MySQL
Indexing: B-Tree, Hash, Full-Text, Spatial
Partitioning: Range, List, Hash, Key, Columns
Sharding: Application-level or via Vitess middleware
Best for: Both OLTP and OLAP workloads with built-in partitioning
PostgreSQL
Indexing: B-Tree, Hash, GiST, GIN, BRIN, SP-GiST
Partitioning: Range, List, Hash (declarative since v10)
Sharding: Via extensions like Citus or application-level
Best for: Complex queries and advanced indexing needs
MongoDB
Indexing: Single field, Compound, Multi-key, Text, Geospatial
Partitioning: Via hashed sharding (similar concept)
Sharding: Native sharding with built-in router (mongos)
Best for: Document data with horizontal scaling requirements
Apache Cassandra
Indexing: Primary indexes, Secondary indexes
Partitioning: Hash-based on partition key
Sharding: Native through consistent hashing and tokens
Best for: High-write workloads requiring linear scalability
SQL Server
Indexing: Clustered, Non-clustered, Columnstore, Spatial
Partitioning: Range, Function-based, Partition switching
Sharding: Via Elastic Database tools (Azure SQL) or application-level sharding
Best for: Enterprise workloads with comprehensive tools
Oracle Database
Indexing: B-tree, Bitmap, Function-based, Domain
Partitioning: Range, List, Hash, Composite, Interval
Sharding: Oracle Sharding (since 12c)
Best for: Large enterprise applications with advanced partitioning
Case Studies
E-commerce Product Catalog
An e-commerce platform with millions of products needed to optimize its product search functionality:
- Challenge: Slow search response times (averaging 3-5 seconds) and category browsing performance issues as product catalog grew to over 10 million items
- Solution: Combined indexing (for search attributes like name, description, SKU) with partitioning by product category (horizontal) and attribute type (vertical for text vs. numeric data)
- Implementation: Created specialized full-text indexes for search terms while partitioning the catalog by main category (electronics, clothing, home goods, etc.)
- Result: 10x faster search times (down to 300-500ms) and improved browse experience with better resource utilization
Social Media Platform
A rapidly growing social network needed to manage billions of user interactions:
- Challenge: Scaling to handle exponential user growth from 1 million to 50 million users in 18 months with massive data volume increases
- Solution: Implemented user-based sharding with consistent hashing for user data, activity feeds, and social connections
- Implementation: Divided users across 128 logical shards distributed on 32 physical database servers, with a central shard router service
- Result: Achieved linear scaling with user growth while maintaining sub-100ms response times for feed generation and social graph queries
Financial Transaction System
A payment processor dealing with historical transaction data:
- Challenge: Fast access to recent transaction data (last 90 days) while maintaining seven years of history for compliance and reporting
- Solution: Time-based partitioning with daily partitions for the most recent 30 days, monthly for the last year, and yearly for historical data
- Implementation: Automated partition management with scripts for creating new partitions and archiving old ones to lower-cost storage
- Result: Current transaction queries remained fast (under 50ms) while enabling efficient archiving and reducing storage costs by 60%
IoT Sensor Network
A smart city implementation with thousands of environmental sensors:
- Challenge: Handling 100,000+ sensors reporting data every minute (over 140 million readings per day) with real-time analysis requirements
- Solution: Combined sharding by sensor type and geographic location with time-based partitioning for historical data
- Implementation: Used a time-series optimized database with automated partition rotation and tiered storage policies
- Result: Achieved consistent write throughput of 50,000+ metrics per second with sub-second query times for real-time dashboards
Implementation Best Practices
Indexing Implementation
- Start with analyzing query patterns using database profiling tools to identify the most common and slowest queries
- Create indexes only for columns frequently used in WHERE, JOIN, or ORDER BY clauses
- Consider the selectivity of columns when creating indexes (high-cardinality columns benefit more)
- Use covering indexes for queries that access only indexed columns to avoid table lookups
- Regularly monitor index usage and remove unused indexes to reduce overhead
- Be cautious with indexing in write-heavy applications (consider async index updates if supported)
- Test index performance with realistic data volumes and query patterns
Partitioning Implementation
- Define a clear partitioning strategy based on data access patterns and business requirements
- Test partition pruning for your common queries by examining query execution plans
- Plan for partition maintenance operations (splitting, merging, archiving, purging)
- Consider local indexes vs. global indexes based on query needs and database capabilities
- Implement automated partition management for time-based strategies
- Estimate partition sizes in advance to avoid overly large or small partitions
- Plan for data growth and establish a retention policy for historical data
Sharding Implementation
- Define shard keys that distribute data evenly and align with access patterns
- Implement a robust shard routing layer or use an existing framework/middleware
- Plan for cross-shard queries and transactions (avoid them when possible)
- Consider implementing caching to reduce cross-shard operations
- Develop a strategy for resharding as data grows without application downtime
- Implement automated monitoring and alerting for shard health and balance
- Consider data locality for geographically distributed applications
- Test failure scenarios and recovery procedures
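A minimal sketch of the shard-routing layer described above, using consistent hashing with virtual nodes so that adding or removing a shard relocates only a fraction of keys. Shard names, the virtual-node count, and the key format are illustrative assumptions.

```python
# Sketch of a shard-routing layer using a consistent-hash ring.
# Shard names, vnode count, and key format are hypothetical examples.
import bisect
import hashlib

class ConsistentHashRouter:
    def __init__(self, shards, vnodes=100):
        # Place `vnodes` virtual points per shard on a hash ring so that
        # keys spread evenly and resharding moves only a fraction of keys.
        self._ring = sorted(
            (self._hash(f"{shard}#{i}"), shard)
            for shard in shards for i in range(vnodes)
        )
        self._points = [h for h, _ in self._ring]

    @staticmethod
    def _hash(key):
        # 64-bit integer position on the ring, derived from MD5.
        return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big")

    def shard_for(self, key):
        """Route a key to the first ring point at or after its hash."""
        idx = bisect.bisect(self._points, self._hash(key)) % len(self._ring)
        return self._ring[idx][1]

router = ConsistentHashRouter([f"shard-{n}" for n in range(4)])
print(router.shard_for("user:12345"))  # deterministic for a given key

counts = {}
for i in range(10000):
    s = router.shard_for(f"user:{i}")
    counts[s] = counts.get(s, 0) + 1
print(counts)  # roughly even distribution across the four shards
```

Production routers (for example mongos, or the Vitess vtgate) add replica awareness, health checks, and resharding support on top of this core idea.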
Before implementing any optimization strategy, consider the following:
- Benchmark current performance to establish a baseline
- Test the solution in a non-production environment first
- Create a rollback plan in case of issues
- Document the implementation details for future reference
- Plan for ongoing monitoring and maintenance
- Consider both immediate and long-term data growth
Monitoring and Optimization
Regardless of the strategy chosen, continuous monitoring is essential to maintain optimal performance. Here are key aspects to monitor for each optimization technique:
Index Monitoring
- Track index usage statistics to identify unused or rarely used indexes
- Monitor index fragmentation levels and schedule maintenance accordingly
- Analyze query execution plans to verify indexes are being used as expected
- Check for missing indexes based on query performance and database advisor recommendations
- Monitor disk space usage for indexes in relation to table size
Partition Monitoring
- Verify partition elimination is working correctly for common queries
- Monitor partition size distribution to identify imbalances
- Track query performance across partition boundaries vs. within partitions
- Monitor partition maintenance operations and their impact on performance
- Check storage utilization across partitions and storage tiers
Shard Monitoring
- Monitor data distribution and skew across shards
- Track cross-shard queries and their performance impact
- Monitor shard router/manager performance and availability
- Check replication lag for replicated shards
- Monitor connection pooling and resource utilization per shard
- Track latency differences between shards
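Skew can be reduced to a single number for alerting. The sketch below computes a max-to-mean ratio from per-shard row counts; the counts and the 1.5 alert threshold are invented examples, and a real check would pull counts from each shard's statistics tables.

```python
# Sketch: quantifying shard skew from per-shard row counts. The counts
# and alert threshold are hypothetical examples.

def skew_ratio(counts):
    """Max shard size divided by the mean; 1.0 means perfectly balanced."""
    mean = sum(counts.values()) / len(counts)
    return max(counts.values()) / mean

shard_rows = {"shard-0": 1_000_000, "shard-1": 1_050_000,
              "shard-2": 980_000, "shard-3": 2_400_000}

ratio = skew_ratio(shard_rows)
print(f"skew ratio: {ratio:.2f}")
if ratio > 1.5:  # alert threshold is an arbitrary example
    print("shard imbalance detected; consider resharding")
```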
Establish performance baselines before implementing any optimization strategy. This allows you to quantify improvements and make data-driven decisions about further optimizations. Set up automated alerting for performance degradation beyond established thresholds, and regularly review metrics to identify trends before they become problems.
Recommended Monitoring Tools
General Database Monitoring
- Prometheus + Grafana
- Datadog
- New Relic
- Dynatrace
- SolarWinds Database Performance Monitor
Database-Specific Tools
- MySQL: MySQL Enterprise Monitor, PMM
- PostgreSQL: pg_stat_statements, pg_stat_monitor
- MongoDB: MongoDB Atlas monitoring
- SQL Server: SQL Server Management Studio
- Oracle: Enterprise Manager, AWR reports
Key Metrics to Monitor
- Query performance (execution time, frequency)
- Resource utilization (CPU, memory, I/O, network)
- Cache hit ratios
- Lock contention
- Index usage statistics
- Partition/shard balance
Maintenance Considerations
Index Maintenance
- Schedule regular index rebuilds/reorganizations to address fragmentation (typically during off-peak hours)
- Update statistics to ensure the query optimizer has accurate information for query planning
- Consider online index operations to minimize downtime when rebuilding indexes
- Periodically review and remove unused indexes based on usage statistics
- Plan for index growth as table data increases
- Test new indexes in non-production environments before deploying to production
Partition Maintenance
- Implement procedures for adding/removing partitions as needed (automation is key)
- Plan for data archiving and purging strategies for older partitions
- Consider partition-level backup and restore procedures for faster recovery
- Monitor partition sizes and rebalance if necessary
- Optimize partition switching operations to minimize impact
- Test partition maintenance procedures regularly
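The rolling-window automation mentioned above can be sketched as a small planning function: given the partitions that exist, decide which to create (up to one month ahead) and which to archive (beyond the retention window). The monthly naming convention and 12-month window are illustrative assumptions.

```python
# Sketch: deciding which monthly partitions to create and which to
# archive under a rolling-window retention policy. The table_YYYY_MM
# naming convention and the 12-month window are illustrative.

def partition_name(year, month):
    return f"sales_{year}_{month:02d}"

def months_back(year, month, n):
    """The (year, month) pair n months before the given month."""
    total = year * 12 + (month - 1) - n
    return total // 12, total % 12 + 1

def plan_maintenance(existing, today_year, today_month, window=12):
    """Return (to_create, to_archive) partition names for a window of
    `window` months back through one month ahead of today."""
    wanted = {partition_name(*months_back(today_year, today_month, n))
              for n in range(-1, window)}  # next month .. window-1 back
    to_create = sorted(wanted - set(existing))
    to_archive = sorted(set(existing) - wanted)
    return to_create, to_archive

# Hypothetical state: 14 monthly partitions already exist as of June 2024.
existing = [partition_name(*months_back(2024, 6, n)) for n in range(14)]
create, archive = plan_maintenance(existing, 2024, 6)
print("create:", create)    # next month's partition
print("archive:", archive)  # partitions older than the retention window
```

A scheduled job would feed the plan into CREATE TABLE ... PARTITION OF and DETACH/archive statements, which keeps the window maintenance idempotent.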
Shard Maintenance
- Develop procedures for adding new shards as data grows
- Implement monitoring for shard balance and performance
- Plan for resharding operations to rebalance data distribution
- Develop disaster recovery procedures for individual shards
- Consider automated tools for shard management
- Maintain consistent schema across all shards
- Establish backup and restore procedures for each shard
- Test shard failure and recovery scenarios regularly
Automate routine maintenance tasks wherever possible. Use database-specific maintenance jobs, scripts, or infrastructure-as-code tools to ensure consistent and reliable maintenance operations. Document all maintenance procedures and schedule regular reviews to adapt to changing workloads and data patterns.
Frequently Asked Questions
Can indexing, partitioning, and sharding be combined?
Yes, these strategies are complementary and often used together. For example, you might shard your database by customer ID, partition each shard by date, and add indexes for specific query patterns within each partition. This multi-layered approach can provide optimal performance for complex workloads.
How do I choose which columns to index?
Focus on columns that appear frequently in WHERE clauses, JOIN conditions, and ORDER BY statements. Prioritize columns with high cardinality (many unique values) and those used in highly selective queries. Use database monitoring tools to identify slow queries and missing indexes. For composite indexes, place the most selective columns first and consider the order of columns based on how they're used in queries.
When should I avoid partitioning?
Avoid partitioning when dealing with small tables (less than 10GB), when most of your queries access data across multiple partitions, when you have limited maintenance windows, or when your queries rarely filter on the potential partition key. Partitioning adds complexity and might not provide benefits for smaller datasets or workloads that don't align well with partitioning boundaries.
What are the biggest challenges with sharding?
The biggest challenges with sharding include:
- Cross-shard transactions and maintaining ACID properties
- Complexity in application design and data access patterns
- Schema changes and consistency across shards
- Resharding as data grows (rebalancing data)
- Operational complexity and monitoring multiple instances
- Join operations across different shards
- Maintaining referential integrity across shards
Using specialized middleware or frameworks can help mitigate some of these challenges.
How many indexes are too many for a single table?
There's no one-size-fits-all answer, but a good rule of thumb is to keep the number of indexes below 5-7 per table for typical OLTP workloads. Each index adds overhead to write operations and consumes storage. Focus on indexes that provide significant query performance improvements rather than creating indexes for every possible query pattern. Regularly monitor index usage and remove unused indexes.
How do cloud databases handle scaling and sharding?
Cloud databases often offer two scaling approaches:
- Vertical scaling (scaling up): Increasing resources (CPU, memory) on a single instance without architectural changes
- Horizontal scaling (scaling out): Adding more instances and distributing data across them
Many cloud database services handle sharding transparently, abstracting away the complexity. Services like Amazon Aurora, Azure Cosmos DB, and Google Cloud Spanner manage sharding behind the scenes, often providing simpler implementation with built-in tooling for monitoring and management. However, they may have limitations in terms of query flexibility compared to custom sharding solutions.
Summary: Choosing the Right Strategy
Selecting the right database optimization strategy depends on your specific workload, data volume, and performance requirements. Here's a summary to guide your decision-making:
Choose Indexing When
- You need to optimize specific query patterns
- Your dataset is small to medium-sized (<100GB)
- You need a low-complexity solution
- Read performance is more critical than writes
- You can't modify your application code
Choose Partitioning When
- Your tables are very large (100GB-1TB+)
- You need efficient maintenance operations
- Queries have clear filtering patterns (e.g., by date)
- You need to manage data lifecycle (archive/purge)
- A single database server can still handle your workload
Choose Sharding When
- You've reached the limits of a single server
- You need horizontal scale-out for reads and writes
- Your data volume is multi-terabyte scale
- You need geographic distribution
- You can handle increased application complexity
Best Practices
- Start with the simplest solution that meets your needs
- Measure performance before and after changes
- Combine strategies for complex workloads
- Consider your team's operational capabilities
- Plan for future growth from the beginning
- Automate maintenance wherever possible
Remember that database optimization is an iterative process. Start with a solid design, measure performance, make incremental improvements, and adjust as your workload evolves. The right strategy today may need to be revisited as your application grows and requirements change.