Mastering Data Distribution and Redundancy in Elasticsearch
Core Concepts of Sharding
Elasticsearch achieves horizontal scalability and fault tolerance through a dual-layer architecture built on shards and replicas. Understanding how data is partitioned and duplicated across cluster nodes is fundamental to deploying robust search infrastructures.
A shard functions as an independant Lucene instance that ...
Posted on Wed, 17 Jun 2026 16:35:01 +0000 by ShogunWarrior
Implementing Database Sharding and Read-Write Splitting with Mycat
Database Shardign Implementation
Mycat Installation and Configuration
Download Mycat from the official repository. For Linux environments, extract the package using:
tar -xvf Mycat-server-1.6.7.1-release-*.tar.gz
The extracted directory includes:
bin: executable scripts
conf: configuration files (schema.xml, server.xml, rule.xml)
lib: require ...
Posted on Thu, 04 Jun 2026 19:21:20 +0000 by visualed
Elasticsearch Distributed Architecture: Sharding, Routing, and Split-Brain Mitigation
Distributed Systems vs. ClusteringArchitecture TypeAnalogyPrimary BenefitClusterMultiple workers performing identical tasksSystem high availability and load distributionDistributedMultiple workers performing distinct specialized tasksCompute/storage scaling, decoupling, performance accelerationElasticsearch inherently abstracts the complexities ...
Posted on Tue, 02 Jun 2026 17:07:12 +0000 by newburcj
Avoiding Negative Hash Codes from Java's String.hashCode() in Sharding Scenarios
When sharding data by a non-integer primary key or a composite key (usually concatenated in to a string), developers often rely on Java's built-in String.hashCode() method to derive a hash value, which is then used for modulo-based sharding. While convenient, this approach can produce negative hash codes, leading to invalid shard numbers. Under ...
Posted on Tue, 19 May 2026 15:09:12 +0000 by brianbehrens
Analyzing MongoDB Count Inaccuracies in Sharded Clusters
When working with MongoDB sharded clusters, developers often notice that db.collection.count() returns enconsistent or incorrect results. While this is largely addressed in version 4.0 and later for queries with predicates, understanding the underlying causes is critical for maintaining data integrity in legacy systems (like MongoDB 3.6) and hi ...
Posted on Wed, 13 May 2026 23:44:57 +0000 by DavidT