Distinguishing between a technological lead and ineffective competition is also important. An example is database engine technology. Some proprietary databases are orders of magnitude more efficient/scalable than any open source comparable, which looks qualitative, but is widely recognized as a product of design quality rather than any technological lead. (see also: Google’s data infrastructure)
Seems untrue to me, and I’ve benchmarked dozens of databases for dozens of problems.
In the column-store space (optimized for aggregate analytics… distributed execution of aggrgated queries, quick filtering based on ordering and data compression) Clickhouse is the best there is in my experience… I made that point 4 years ago, but now you can find plenty of other benchmarks for it. It’s used by many large scale search engines and advertisers except google, and among others, by CERN.
In wide column storage space, and more broadly in the “heavy filtering, large amounts of data space” cassandra (, facebook) and now Scylla seem to lead. I’ve never had to put dozens of petabytes in a database, but the few people that do need this seem to agree.
In the transactional space I haven’t seen anyone bring a significant gain over postgres and mariadb yet.
Kv store and in memory caching you have aerospike, rocksdb and stuff that’s based on tikv more recently… All slightly different trade-offs, all open source. I’m not even aware of proprietary products here to be honest.
Those 4 combined cover most use cases a db has.
So, not saying I’m convinced I’m correct, but could you provide some examples to back up your claims ? Name some names, or, ideally, provide some uecases/domain where one could find benchmarks that demonstrate a proprietary database gas the upper hand.
Seems untrue to me, and I’ve benchmarked dozens of databases for dozens of problems.
In the column-store space (optimized for aggregate analytics… distributed execution of aggrgated queries, quick filtering based on ordering and data compression) Clickhouse is the best there is in my experience… I made that point 4 years ago, but now you can find plenty of other benchmarks for it. It’s used by many large scale search engines and advertisers except google, and among others, by CERN.
In wide column storage space, and more broadly in the “heavy filtering, large amounts of data space” cassandra (, facebook) and now Scylla seem to lead. I’ve never had to put dozens of petabytes in a database, but the few people that do need this seem to agree.
In the transactional space I haven’t seen anyone bring a significant gain over postgres and mariadb yet.
Kv store and in memory caching you have aerospike, rocksdb and stuff that’s based on tikv more recently… All slightly different trade-offs, all open source. I’m not even aware of proprietary products here to be honest.
Those 4 combined cover most use cases a db has.
So, not saying I’m convinced I’m correct, but could you provide some examples to back up your claims ? Name some names, or, ideally, provide some uecases/domain where one could find benchmarks that demonstrate a proprietary database gas the upper hand.