Mastering Data Distribution and Redundancy in Elasticsearch

Core Concepts of Sharding Elasticsearch achieves horizontal scalability and fault tolerance through a dual-layer architecture built on shards and replicas. Understanding how data is partitioned and duplicated across cluster nodes is fundamental to deploying robust search infrastructures. A shard functions as an independant Lucene instance that ...

Posted on Wed, 17 Jun 2026 16:35:01 +0000 by ShogunWarrior

Implementing Database Sharding and Read-Write Splitting with Mycat

Database Shardign Implementation Mycat Installation and Configuration Download Mycat from the official repository. For Linux environments, extract the package using: tar -xvf Mycat-server-1.6.7.1-release-*.tar.gz The extracted directory includes: bin: executable scripts conf: configuration files (schema.xml, server.xml, rule.xml) lib: require ...

Posted on Thu, 04 Jun 2026 19:21:20 +0000 by visualed

Elasticsearch Distributed Architecture: Sharding, Routing, and Split-Brain Mitigation

Distributed Systems vs. ClusteringArchitecture TypeAnalogyPrimary BenefitClusterMultiple workers performing identical tasksSystem high availability and load distributionDistributedMultiple workers performing distinct specialized tasksCompute/storage scaling, decoupling, performance accelerationElasticsearch inherently abstracts the complexities ...

Posted on Tue, 02 Jun 2026 17:07:12 +0000 by newburcj

Avoiding Negative Hash Codes from Java's String.hashCode() in Sharding Scenarios

When sharding data by a non-integer primary key or a composite key (usually concatenated in to a string), developers often rely on Java's built-in String.hashCode() method to derive a hash value, which is then used for modulo-based sharding. While convenient, this approach can produce negative hash codes, leading to invalid shard numbers. Under ...

Posted on Tue, 19 May 2026 15:09:12 +0000 by brianbehrens

Analyzing MongoDB Count Inaccuracies in Sharded Clusters

When working with MongoDB sharded clusters, developers often notice that db.collection.count() returns enconsistent or incorrect results. While this is largely addressed in version 4.0 and later for queries with predicates, understanding the underlying causes is critical for maintaining data integrity in legacy systems (like MongoDB 3.6) and hi ...

Posted on Wed, 13 May 2026 23:44:57 +0000 by DavidT