-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support checkpointing on interval only, not batch completion #30
Comments
Also interested in any feature that can improve costs of storage accounts. I'm surprised there is no way to use local filesystem or centralized cache/document storage like redis or mongo to save checkpoints... |
@mbrancato Thinking about it again. Can't we achieve this by setting very large |
I had other limitations on batch sizes. But batch sizes don’t let Logstash wait for the batch queue to fill up. If you are using Azure storage, be sure to use V1 storage accounts since the transaction costs are 90% less than V2. |
@mbrancato IC, Thanks! it does not look that MS planing to End Of Life V1, definitely gonna try it out. |
+1 for any feature that can reduce the storage costs associated with check-pointing. |
Also I think the checkpointing interval at the moment is not serving any purpose. Every event is checkpointed. I made the following change in my fork to fix it. |
Hello,
|
hello, |
Hi, |
Please add support for the checkpoint interval to be the only influence on writing a checkpoint to blob storage. Currently, checkpointing occurs when any batch completes as well. The problem is that some outputs are constrained by the batch size, and that might lead to smaller batches, causing significant write operations. I've seen this be very expensive even for smaller environments. Even then, most batches at the default size of 50 are going to cause large reads/writes for checkpointing.
That said, our use case would be fine with a purely time-based checkpoint, not batch based. By reducing checkpoint intervals to 30 seconds or so, a significant cost savings could be realized.
The text was updated successfully, but these errors were encountered: