Morning all,

I’ve got a project that generates around a million rows of time series data per day. The data is all received in order, and typically contains simple integers and/or floating point numbers. At present, I use MySQL for storage, with a “live” table that holds the realtime data, and then a series of other tables that hold averaged data aggregated over various time periods (e.g. hourly, daily, weekly, etc.). MySQL is starting to creak under the load, and I’m keen to start looking at alternatives (particularly as I anticipate there will be hundreds of millions of new rows per day in the next few months).

I don’t need to store data at full granularity indefinitely - I’d be perfectly happy with a sliding scale, where we store full granularity for a short period (e.g. 1 day), all the way back to daily averages when looking over a 1+ year period. (Not dissimilar to what the round-robin databases in RRDtool provide.)

Does this sound like a reasonable use case for kdb+? Would anyone recommend any open source alternatives too? Or if I’m barking up the wrong tree entirely, please say so.

Thanks,
Sam
Hi Sam,
Yes, this is definitely the type of use case that kdb+ is well suited to.
With reasonable hardware, that volume of data should not be a problem. Also, your proposal of aggregating the data into various time buckets can be done pretty easily.
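To illustrate the bucketed aggregation, here is a minimal q sketch. The `trade` table and its `time`/`price` columns are hypothetical stand-ins for your data; the idiom is q's `xbar`, which rounds temporal values down to bucket boundaries:

```q
/ hypothetical in-memory table of time series data
trade:([] time:09:00:00.000+1000*til 600; price:600?100f)

/ average price in 1-minute buckets
select avgPrice:avg price by minute:1 xbar time.minute from trade

/ average price in hourly buckets (round minutes down to 60)
select avgPrice:avg price by hour:60 xbar time.minute from trade
```

The same pattern extends to daily and weekly rollups by grouping on `xbar` over dates, so each of your aggregate tables is essentially one query.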
Typically, the amount of data that you store historically will be limited by the amount of disk space you have. The amount of intraday data you can hold will be limited by the amount of physical memory you have.
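The usual pattern is to accumulate the day's data in memory and write it out as a date partition at end of day. A sketch, with an illustrative `trade` table (the column names are assumptions, not a prescription):

```q
/ hypothetical in-memory trade table built up during the day
trade:([] sym:`a`b`a`b; time:4?09:00:00.000; price:4?100f)

/ at end of day, save today's data as one date partition of an
/ on-disk database rooted at /db, parted by sym
.Q.dpft[`:/db; .z.D; `sym; `trade]
```

Historical queries then read only the partitions (dates) they need from disk, which is what keeps the disk-bound history and memory-bound intraday data cleanly separated.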
Regards,
Nathan