maxmcd's comments | Hacker News

"Reply in the tone of Wikipedia" has worked pretty well for me

> > You can force an fsync after each messsage [sic] with always, this will slow down the throughput to a few hundred msg/s.

Is the performance warning in the NATS docs something that could be improved on? Couldn't you still run fsync on an interval and queue up a certain number of writes to be flushed at once? I could imagine latency suffering, but batch throughput could be preserved to some extent.


> Is the performance warning in the NATS docs something that could be improved on? Couldn't you still run fsync on an interval and queue up a certain number of writes to be flushed at once? I could imagine latency suffering, but batch throughput could be preserved to some extent.

Yes, and you shouldn't even need a fixed interval. Just queue up any writes while an `fsync` is pending; then do all those in the next batch. This is the same approach you'd use for rounds of Paxos, particularly between availability zones or regions where latency is expected to be high. You wouldn't say "oh, I'll ack and then put it in the next round of Paxos", or "I'll wait until the next round in 2 seconds then ack"; you'd start the next batch as soon as the current one is done.
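
A minimal sketch of that pattern in Go (the write type and channel are illustrative, not anything from NATS): drain whatever has queued up while the previous fsync was running, write it all, fsync once, then ack the whole batch.

    // Sketch: batch writes so a single fsync covers everything that
    // arrived while the previous fsync was in flight.
    package groupcommit

    import "os"

    type write struct {
        data []byte
        done chan error // receives the result once the write is durable
    }

    func writer(f *os.File, ch <-chan write) {
        for w := range ch {
            batch := []write{w}
        drain: // grab everything already queued, without blocking
            for {
                select {
                case next := <-ch:
                    batch = append(batch, next)
                default:
                    break drain
                }
            }
            var err error
            for _, b := range batch {
                if _, werr := f.Write(b.data); werr != nil && err == nil {
                    err = werr
                }
            }
            if serr := f.Sync(); serr != nil && err == nil {
                err = serr
            }
            for _, b := range batch {
                b.done <- err // ack the whole batch after one fsync
            }
        }
    }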


Yes, this is a reasonably common strategy. It's how Cassandra's batch and group commit modes work, and Postgres has a similar option. Hopefully NATS will implement something similar eventually.


This will block threads while waiting for other threads to write. That might work great for your threading model, but I usually end up putting the writer in one thread and having the other threads send writes to the writer thread.
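
As a sketch of that pattern in Go (the job/Writer names are made up, and it assumes a database/sql handle for the write connection): one goroutine owns the writes and everyone else hands it work over a channel.

    // Sketch: a single writer goroutine owns the write connection;
    // other goroutines submit statements over a channel instead of
    // blocking on the write lock themselves.
    package writequeue

    import "database/sql"

    type job struct {
        query string
        args  []any
        done  chan error
    }

    type Writer struct {
        jobs chan job
    }

    func NewWriter(db *sql.DB) *Writer {
        w := &Writer{jobs: make(chan job, 128)}
        go func() {
            for j := range w.jobs {
                _, err := db.Exec(j.query, j.args...)
                j.done <- err
            }
        }()
        return w
    }

    // Exec hands a statement to the writer goroutine and waits for it.
    func (w *Writer) Exec(query string, args ...any) error {
        done := make(chan error, 1)
        w.jobs <- job{query: query, args: args, done: done}
        return <-done
    }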


I do open 2 connections:

First one for writing with flags:

    SQLITE_OPEN_CREATE | SQLITE_OPEN_READWRITE | SQLITE_OPEN_FULLMUTEX
Second one for reading with flags:

    SQLITE_OPEN_READONLY | SQLITE_OPEN_FULLMUTEX
As you can see, I have SQLITE_OPEN_FULLMUTEX on both of them. Should I only have it on the writing one?


Oh nice, yes, I think your threads should be able to perform reads concurrently when the write lock is not held. I would make sure you are in WAL mode as well, since I think that will improve your concurrency.
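
For reference, a sketch of enabling it (assuming a database/sql handle; the pragma itself is standard SQLite and only needs to run once per database):

    // Switch the journal to WAL so readers don't block the writer
    // (and the writer doesn't block readers).
    package walmode

    import "database/sql"

    func enableWAL(db *sql.DB) error {
        _, err := db.Exec("PRAGMA journal_mode=WAL;")
        return err
    }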

Just that row should be locked, since it's "FOR UPDATE SKIP LOCKED".

I agree the concurrency limitation is rough, but it's also somewhat elegant because you don't have to implement a timeout/retry mechanism. You're certainly still exposed to the possibility of double-sending, so yes, it's probably much nicer to update the row to "processing" and re-process those rows on a timeout.
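
A rough sketch of that "mark as processing" variant in Go, assuming a hypothetical jobs table with id, status, and updated_at columns behind a Postgres database/sql handle:

    // Sketch: claim one pending job. SKIP LOCKED makes concurrent
    // workers skip rows another transaction already holds instead of
    // blocking, so each worker claims a different row.
    package jobs

    import "database/sql"

    func claimJob(db *sql.DB) (int64, error) {
        tx, err := db.Begin()
        if err != nil {
            return 0, err
        }
        defer tx.Rollback() // no-op if the commit below succeeds

        var id int64
        err = tx.QueryRow(`
            SELECT id FROM jobs
            WHERE status = 'pending'
            ORDER BY id
            LIMIT 1
            FOR UPDATE SKIP LOCKED`).Scan(&id)
        if err != nil {
            return 0, err // sql.ErrNoRows: nothing to claim right now
        }
        _, err = tx.Exec(
            `UPDATE jobs SET status = 'processing', updated_at = now() WHERE id = $1`,
            id,
        )
        if err != nil {
            return 0, err
        }
        return id, tx.Commit()
    }

A separate sweep can then reset rows stuck in "processing" past some timeout back to "pending" so they get re-processed.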



Are there any open sourced sharded query planners like this? Something that can aggregate queries across many duckdb/sqlite dbs?


Not DuckDB directly (though I think it could be connected to it), but I think Apache DataFusion Ballista[0] would be a typical modern open source benchmark here.

[0]: https://datafusion.apache.org/ballista/contributors-guide/ar...


DeepSeek released smallpond

0 - https://github.com/deepseek-ai/smallpond

1 - https://www.definite.app/blog/smallpond (overview for data engineers, practical application)


There are a few different styles: https://github.com/orgs/community/discussions/16925


Thanks for the link. These are nice additions.


Do they mention transactions anywhere? Maybe it will be OLAP?


I think this is the comparable view: https://www.energydashboard.co.uk/live

