# CloudDS

2048 requests in sequence - percentiles captured in an HDRHistogram (a capture sketch follows the setup list below).
All CloudDS caching options disabled.

- Cluster size: 7
- Replication factor: 3
- Row size in columns: 10
- Row size/payload: 2.5KB
- Column family size in rows: 4MM
- Cluster configuration: half of those 7 nodes are commodity HW with 8GB RAM, the others blade-class nodes with 4GB RAM. Gigabit interconnect.
- Cluster in active/heavy use by many other production services (not idle).
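For reference, here is a minimal C++ sketch of the capture loop, using a plain sorted vector as a stand-in for the HdrHistogram (the client call is hypothetical; only the percentile reporting mirrors the tables below):

```cpp
#include <algorithm>
#include <chrono>
#include <cstdint>
#include <cstdio>
#include <vector>

int main() {
    constexpr size_t kRequests = 2048;
    std::vector<uint64_t> samples;
    samples.reserve(kRequests);

    for (size_t i = 0; i != kRequests; ++i) {
        const auto start = std::chrono::steady_clock::now();
        // issue_get(...);   // hypothetical client call; requests go out one at a time
        const auto end = std::chrono::steady_clock::now();
        samples.push_back(std::chrono::duration_cast<std::chrono::microseconds>(end - start).count());
    }

    // Report the same percentiles as the tables below (values in microseconds)
    std::sort(samples.begin(), samples.end());
    for (const double p : {1.0, 5.0, 10.0, 20.0, 50.0, 75.0, 90.0, 95.0, 97.0, 99.0}) {
        const auto idx = std::min(samples.size() - 1, size_t(p / 100.0 * samples.size()));
        std::printf("%.0f%% => %llu\n", p, (unsigned long long)samples[idx]);
    }
    return 0;
}
```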
This is for wide columns - that is, each row can have 0+ columns, and when updating a row you can update/add any columns, with no need for RCU (read-copy-update). That in turn means each GET requires (unless the optimizer deems it unnecessary) fetching the relevant column content from all recorded row updates, from files and in-memory maps, in order to compile the final response.

CloudDS supports an alternative storage engine (KV) where the value is opaque data (a blob) and those semantic requirements do not apply; there it is enough to locate the most recent update among all recorded updates - no merge needed. That should probably give a 10-15% performance improvement for this benchmark, but the numbers here are for the wide-columns storage engine.
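A rough sketch of that read-path difference, with illustrative types that are not the actual CloudDS internals: a wide-column GET merges every recorded update of the row (newest version per column wins), while the KV engine only needs the single most recent update:

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

struct ColumnUpdate {
    std::string name;
    std::string value;
    uint64_t ts;  // update timestamp
};

// One recorded row update (from an SSTable or an in-memory map)
using RowUpdate = std::vector<ColumnUpdate>;

// Wide-column GET: merge all updates, keeping the newest version of each column
std::map<std::string, std::string> merge_row(const std::vector<RowUpdate> &updates) {
    std::map<std::string, std::pair<uint64_t, std::string>> newest;
    for (const auto &u : updates)
        for (const auto &c : u)
            if (auto it = newest.find(c.name); it == newest.end() || c.ts > it->second.first)
                newest[c.name] = {c.ts, c.value};

    std::map<std::string, std::string> row;
    for (const auto &[name, v] : newest)
        row[name] = v.second;
    return row;
}

// KV GET: the value is opaque, so the highest-timestamp update wins outright
const std::string *kv_get(const std::vector<std::pair<uint64_t, std::string>> &updates) {
    const std::pair<uint64_t, std::string> *best = nullptr;
    for (const auto &u : updates)
        if (!best || u.first > best->first)
            best = &u;
    return best ? &best->second : nullptr;
}
```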
GET:

Consistency Level: QUORUM

1% => 440
5% => 463
10% => 470
20% => 480
50% => 507
75% => 533
90% => 565
95% => 589
97% => 608
99% => 668
Values are in microseconds. So at the 99th percentile we need, worst case, about 0.7 milliseconds.
---

Same test, this time with Consistency Level: ONE

1% => 202
5% => 206
10% => 209
20% => 214
50% => 238
75% => 251
90% => 270
95% => 279
97% => 295
99% => 382
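The roughly 2x gap between the two runs follows from the replica math: with a replication factor of 3, a QUORUM read has to wait for two replica responses, while ONE returns after the first. A tiny sketch of that arithmetic:

```cpp
#include <cstdio>

// Majority quorum for a given replication factor
constexpr int quorum(int replication_factor) {
    return replication_factor / 2 + 1;
}

int main() {
    static_assert(quorum(3) == 2, "RF=3 -> wait for 2 replicas");
    std::printf("RF=3: ONE waits for 1 replica, QUORUM for %d\n", quorum(3));
    return 0;
}
```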
# CloudDS - PUT

Same setup, number of requests, cluster, etc.

Consistency Level: QUORUM

1% => 461
5% => 482
10% => 499
20% => 518
50% => 562
75% => 608
90% => 658
95% => 695
97% => 727
99% => 781

Consistency Level: ONE

1% => 252
5% => 263
10% => 270
20% => 279
50% => 294
75% => 305
90% => 320
95% => 330
97% => 338
99% => 382
When the planned optimizations are implemented, this should be at least 20-30% more efficient.
# MySQL

Same benchmark for a MySQL query on a table of 1MM rows (keys appropriately set).

Query: `SELECT * FROM table WHERE user = 'markpapadakis'`
1% => 774
5% => 817
10% => 830
20% => 841
50% => 874
75% => 912
90% => 959
95% => 1002
97% => 1043
99% => 1158
Thanks for the comments, Jason :)
Right, it's mostly pulled from kernel page cache, for 4 distinct SSTables.
CloudDS SSTables require at worst two disk seeks: one to locate the value's offset in the SSTable and another to read the value. The first seek is a cache hit 99% of the time (a LUT-backed binary search reduces it to a binary search over no more than 16 elements, so it's pretty fast and contained within a single 4K disk page). Data is read directly; mmap is the default access interface, with pread64()-based access also supported - configurable on a per-CF basis, so you can choose how to spend your memory there.
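A hypothetical illustration of such a LUT-backed index lookup (names and layout are assumptions, not the actual CloudDS on-disk format): a small table keeps every 16th key of the sorted index, so a lookup is one search over the LUT plus a binary search over at most 16 entries:

```cpp
#include <algorithm>
#include <cstdint>
#include <string>
#include <vector>

struct IndexEntry {
    std::string key;
    uint64_t offset;  // offset of the value inside the SSTable data file
};

struct SSTableIndex {
    std::vector<IndexEntry> entries;  // must be sorted by key
    std::vector<std::string> lut;     // entries[i * 16].key for each bucket

    void build_lut() {
        lut.clear();
        for (size_t i = 0; i < entries.size(); i += 16)
            lut.push_back(entries[i].key);
    }

    // Returns the value offset, or -1 if the key is absent
    int64_t lookup(const std::string &key) const {
        // 1) narrow to a bucket via the LUT (last LUT key <= search key)
        auto b = std::upper_bound(lut.begin(), lut.end(), key);
        if (b == lut.begin()) return -1;
        const size_t bucket = size_t(b - lut.begin()) - 1;

        // 2) binary search over the <= 16 entries of that bucket
        const auto first = entries.begin() + bucket * 16;
        const auto last = entries.begin() + std::min(entries.size(), (bucket + 1) * 16);
        const auto it = std::lower_bound(
            first, last, key,
            [](const IndexEntry &e, const std::string &k) { return e.key < k; });
        return (it != last && it->key == key) ? int64_t(it->offset) : -1;
    }
};
```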
However, please note:
Those numbers are from our active cluster, which is heavily in use - it is not a cluster of idle nodes. Compactions are ongoing, many read/write operations are in flight, etc.
The compaction process is throttled, both at the OS level and at the application level, so in practice it doesn't affect request processing enough to be an issue. If it comes close to becoming one, a heuristic adjusts the throttling scheme accordingly.
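For illustration only - the actual heuristic isn't public - an application-level throttle along these lines could meter compaction I/O from a bytes-per-second budget and tighten it when foreground latency degrades:

```cpp
#include <algorithm>
#include <chrono>
#include <thread>

class CompactionThrottle {
    double bytes_per_sec;  // current compaction I/O budget
    std::chrono::steady_clock::time_point last = std::chrono::steady_clock::now();
    double available = 0;  // token balance, in bytes

public:
    explicit CompactionThrottle(double rate) : bytes_per_sec(rate) {}

    // Block until `bytes` of compaction I/O fit in the budget
    void acquire(double bytes) {
        for (;;) {
            const auto now = std::chrono::steady_clock::now();
            available += std::chrono::duration<double>(now - last).count() * bytes_per_sec;
            available = std::min(available, bytes_per_sec);  // cap burst at ~1s of budget
            last = now;
            if (available >= bytes) { available -= bytes; return; }
            std::this_thread::sleep_for(std::chrono::milliseconds(10));
        }
    }

    // Heuristic hook: shrink the budget while p99 request latency is degraded
    void observe_p99_us(double p99_us, double target_us) {
        if (p99_us > target_us)
            bytes_per_sec = std::max(bytes_per_sec * 0.5, 1.0 * 1024 * 1024);    // floor 1MB/s
        else
            bytes_per_sec = std::min(bytes_per_sec * 1.1, 256.0 * 1024 * 1024);  // cap 256MB/s
    }
};
```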
Indeed, I am curious about PUT vs GET timings myself, especially since a PUT is, as you said, a simple matter of storing into a concurrent map-like structure (the memory table) and appending to a file (the commit log). I will investigate further.
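A minimal sketch of that PUT path, assuming a simple mutex-guarded map in place of the concurrent map-like structure:

```cpp
#include <fstream>
#include <map>
#include <mutex>
#include <string>

class WritePath {
    std::ofstream commit_log;
    std::map<std::string, std::string> memtable;  // real impl: a concurrent map
    std::mutex mu;

public:
    explicit WritePath(const std::string &log_path)
        : commit_log(log_path, std::ios::binary | std::ios::app) {}

    void put(const std::string &key, const std::string &value) {
        std::lock_guard<std::mutex> lock(mu);
        // 1) durability first: append the mutation to the commit log
        commit_log << key << '\0' << value << '\0';
        commit_log.flush();  // a real impl may batch/fsync via group commit
        // 2) then make it visible: insert into the memtable
        memtable[key] = value;
    }
};
```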
Also, it turns out approximately 70% of that time is I/O request scheduling and task scheduling overhead. As this is improved, overall throughput will go up and latency will drop further.
However, there is not much you can do when you are dealing with the kernel: when a thread is put to sleep, the kernel needs to wake it up, and that takes time. So I think one of the key ways to solve this is to keep as few threads around as possible, to avoid having to put any to sleep (currently the system adjusts between 2 and 64 threads, and it is almost always down to 2 - it doesn't need more for the kind of request volume it processes).
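As a toy illustration of that sizing policy (the adjustment rule here is invented; only the [2, 64] range comes from the text):

```cpp
#include <algorithm>
#include <cstddef>

// Pick a worker count from observed queue depth, clamped to [2, 64],
// so idle workers (which the kernel would have to sleep/wake) are avoided.
size_t desired_workers(size_t queued_tasks, size_t current_workers) {
    size_t target = current_workers;
    if (queued_tasks > current_workers * 4)   // falling behind: grow
        target = current_workers * 2;
    else if (queued_tasks < current_workers)  // mostly idle: shrink
        target = current_workers / 2;
    return std::clamp<size_t>(target, 2, 64);
}
```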
Thanks again :)
Clearly, all the data is in the kernel cache (and mmapped), so I guess what you are paying for is the system call to read the mmap data, followed by any merging of the data (as it comes from separate/multiple sstables?) that is necessary. I don't have any real-world production experience with mysql (nor do I want any 😄), so I can't comment on its performance.
I think these are nice, low metrics you've got, but, tbh, you're reading everything from page cache, which isn't a great measure of a database. How do those numbers look when you have to start fetching data from disk? Especially if you've only got 8GB of RAM, I suspect you have to go to disk a lot - unless you don't have much data. And then, how do things like compaction/repair muck up the I/O contention for reads? If you have a distributed, LSM-based system, surely those things will have some impact on the system and end-user response times.
As a side note, it's interesting that PUTs take more time than GETs, but I suspect it's related to a combo of commit log and memtable insertion.
HTH,
-Jason