Performance Comparison: GridGain 9.1.1 vs GridGain 8.9.20

Below is a performance comparison between GridGain versions 9.1.1 and 8.9.20.

About the Benchmark

These performance results were obtained using the YCSB (Yahoo! Cloud Serving Benchmark) — a widely adopted framework for measuring the throughput and latency of key-value and cloud-serving data stores.

YCSB provides:

Pre-defined core workloads that exercise different mixes of operations (read, update, insert, scan).
Flexible workload definition for modeling application-specific access patterns.
Client-side load generation, keeping the system under test free from load-generation overhead.
Detailed statistical reports that summarize throughput, latency, and distribution metrics.

For more information about YCSB, visit the YCSB Wiki.

Environment Specifications

In each test, the cluster consists of a given number of server nodes and the load is generated by client benchmark drivers running on separate hosts. Each server and client resides on its own machine to eliminate resource contention.

Benchmarks were executed under the following configurations:

Storage	Servers	Replicas
RAM+Disk	2	1
RAM+Disk	3	3
RAM+Disk	5	1
RAM+Disk	5	3
RAM Only	2	1
RAM Only	3	3
RAM Only	5	1
RAM Only	5	3

Each host has the following hardware configuration:

System	PowerEdge R650
CPU	32 x Intel(R) Xeon(R) Silver 4314 CPU @ 2.40GHz (2 Sockets)
RAM	256Gb
NVMe	Dell Ent NVMe CM7 U.2 MU 1.6TB
SSD	2 * 450Gb
LAN	Ethernet 10Gb
OS	Linux 6.8.12-4-pve
JDK	Java(TM) SE Runtime Environment (build 21.0.4+8-LTS-274)

The YCSB workload is configured as follows:

Client hosts	1
Threads	16
Client type	thick (GG 8), thin (GG 9)
Field length	100 bytes
Fields count	10
Entries count	15 millions
Batch size	1000 entries

Benchmark Results

Summary

Benchmark results reveal a significant jump in both read and write throughput with the tested configuration.

The performance lift in GridGain 9.1.1 stems from the sweeping architectural upgrades introduced in the 9.x line—most notably the revamped replication and transactional protocols, along with deep storage-level optimizations.

Storage	Servers	Replicas	Throughput, 9.1.1 vs 8.9.20
			PUT	GET
RAM+Disk	2	1	+298%	+143%
RAM+Disk	3	3	+324%	+89%
RAM+Disk	5	1	+434%	+43%
RAM+Disk	5	3	+438%	+51%
RAM Only	2	1	+348%	+145%
RAM Only	3	3	+356%	+119%
RAM Only	5	1	+441%	+50%
RAM Only	5	3	+439%	+67%

Results List

Below is a table of benchmark results for each tested configuration.

"Average latency" column represents the average latency for 1 batch operation in microseconds (less is better).

"Throughput" column represents the amount of batches processed per 1 second (more is better).

Product version	Servers	Replicas	Storage	Operation	Average latency (μs)	Throughput, 1000 ops/s
8.9.20	2	1	RAM+Disk	put	165501	92
9.1.1	2	1	RAM+Disk	put	35441	366
					+367%	+298%
8.9.20	3	3	RAM+Disk	put	262476	59
9.1.1	3	3	RAM+Disk	put	56053	250
					+368%	+324%
8.9.20	2	1	RAM only	put	128223	117
9.1.1	2	1	RAM only	put	23053	524
					+456%	+348%
8.9.20	3	3	RAM only	put	173196	88
9.1.1	3	3	RAM only	put	32180	401
					+438%	+356%
8.9.20	2	1	RAM+Disk	get	50218	315
9.1.1	2	1	RAM+Disk	get	20332	766
					+147%	+143%
8.9.20	3	3	RAM+Disk	get	35282	446
9.1.1	3	3	RAM+Disk	get	18344	844
					+92%	+89%
8.9.20	2	1	RAM only	get	52053	304
9.1.1	2	1	RAM only	get	20940	744
					+149%	+145%
8.9.20	3	3	RAM only	get	36597	431
9.1.1	3	3	RAM only	get	16344	946
					+124%	+119%
8.9.20	5	1	RAM+Disk	put	140340	108
9.1.1	5	1	RAM+Disk	put	19664	577
					+614%	+434%
8.9.20	5	3	RAM+Disk	put	214742	71
9.1.1	5	3	RAM+Disk	put	33908	382
					+533%	+438%
8.9.20	5	1	RAM only	put	130110	116
9.1.1	5	1	RAM only	put	17561	628
					+641%	+441%
8.9.20	5	3	RAM only	put	164566	92
9.1.1	5	3	RAM only	put	23973	496
					+586%	+439%
8.9.20	5	1	RAM+Disk	get	22919	684
9.1.1	5	1	RAM+Disk	get	15805	980
					+45%	+43%
8.9.20	5	3	RAM+Disk	get	25365	619
9.1.1	5	3	RAM+Disk	get	16668	932
					+52%	+51%
8.9.20	5	1	RAM only	get	23379	670
9.1.1	5	1	RAM only	get	15318	1005
					+53%	+50%
8.9.20	5	3	RAM only	get	25446	615
9.1.1	5	3	RAM only	get	15085	1024
					+69%	+67%

Details Per Configuration

RAM+Disk, 2 Servers, 1 Replica

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aipersist	put	165501.2	245247	317183	92
9.1.1	aipersist	put	35441.0	70783	112639	366
			+367%	+246%	+182%	+298%
8.9.20	aipersist	get	50218.2	52639	55071	315
9.1.1	aipersist	get	20331.6	33663	49887	766
			+147%	+56%	+10%	+143%

RAM+Disk, 5 Servers, 1 Replica

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aipersist	put	140339.8	170751.0	201983.0	108
9.1.1	aipersist	put	19664.1	32927.0	61375.0	577
			+614%	+419%	+229%	+434%
8.9.20	aipersist	get	22919.4	25759.0	28383.0	684
9.1.1	aipersist	get	15804.8	26575.0	34783.0	980
			+45%	-3%	-23%	+43%

RAM Only, 2 Servers, 1 Replica

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aimem	put	128223.0	167807	206591	117
9.1.1	aimem	put	23052.9	61759	91647	524
			+456%	+172%	+125%	+348%
8.9.20	aimem	get	52052.6	55007	58527	304
9.1.1	aimem	get	20940.0	39295	55711	744
			+149%	+40%	+5%	+145%

RAM Only, 5 Servers, 1 Replica

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aimem	put	130110.2	151679.0	165887.0	116
9.1.1	aimem	put	17561.3	35487.0	72063.0	628.0
			+641%	+327%	+130%	+441%
8.9.20	aimem	get	23378.6	25151.0	28655.0	670
9.1.1	aimem	get	15317.6	25839.0	43679.0	1005
			+53%	-3%	-52%	+50%

RAM+Disk, 3 Servers, 3 Replicas

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aipersist	put	262476.2	397567	541183	59
9.1.1	aipersist	put	56053.3	92671	147455	250
			+368%	+329%	+267%	+324%
8.9.20	aipersist	get	35281.6	37631	42207	446
9.1.1	aipersist	get	18344.4	32031	59487	844
			+92%	+17%	-41%	+89%

RAM+Disk, 5 Servers, 3 Replicas

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aipersist	put	214742.3	293631.0	447743.0	71
9.1.1	aipersist	put	33907.8	58783.0	76543.0	382
			+533%	+400%	+485%	+438%
8.9.20	aipersist	get	25364.9	28063.0	30799.0	619
9.1.1	aipersist	get	16668.2	29551.0	54687.0	932
			+52%	-5%	-78%	+51%

RAM Only, 3 Servers, 3 Replicas

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aimem	put	173196.0	224127.0	265471.0	88
9.1.1	aimem	put	32180.2	70655.0	119871.0	401
			+438%	+217%	+121%	+356%
8.9.20	aimem	get	36596.8	38879.0	43167.0	431
9.1.1	aimem	get	16343.6	25983.0	51295.0	946
			+124%	+50%	-19%	+119%

RAM Only, 5 Servers, 3 Replicas

Product	Storage engine	Operation	Latency (us)			Throughput, 1000 ops/s
Product	Storage engine	Operation	Average	95th %	99th %	Throughput, 1000 ops/s
8.9.20	aimem	put	164565.7	198655.0	222847.0	92
9.1.1	aimem	put	23973.2	60383.0	101183.0	496
			+586%	+229%	+120%	+439%
8.9.20	aimem	get	25446.4	27983.0	30383.0	615
9.1.1	aimem	get	15085.3	25311.0	36031.0	1024
			+69%	+11%	-19%	+67%