Groupbys and joins favor dbs that don't rely on segregated tables and especially when single file, such as embedded dbs. It's not really clear to me what duckdb actually is last time I checked and ran some tests by running it inside my host app. Is it based on rmbs concepts? Kind of a bit, olap yes, but also not quite. I got the impression it tries to be a bit of everything with focus on ease of use. It's certainly not optimized for time series. The tests described in the link look to me very biased to favor small footprint storage technology that is directly integrated along the host application. A completely different use case than the issues applications like clickhouse or timebase are targeting. And certainly not optimized for financial data or timeseries.
TimeBase is open source and so are all its client libraries. And so is clickhouse.
TimeBase is open source and so are all its client libraries. And so is clickhouse.
https://duckdb.org/2023/04/14/h2oai.html
You can question the objectivity but it's a standardised test.
DuckDB used to be much slower but it has changed in the last year or so. Comparing with the same test from 2021. TimeBase seems proprietary and for that reason I will avoid it like the plague. I might choose to go commercial with what I'm working on, so locking myself into a proprietary platform is a no-go.
I will only need to interface using Python, backends of the packages I use are in C, C++, Rust so there's not much of a speed issue.
Another tool to keep in mind is Ibis. They intend to make it so working with any backend db system is standardised. So whether it's DuckDB or ClickHouse, wouldn't mean changing any part of the code outside of spinning up the server / library imports.
Last edited: