I recently found out that my weaviate db is increasing in size everytime I do batch update (adding items and remove items), although the total number of items remains relatively the same. The issues seems to lie in commitlog.d folder, where it keeps increasing in size, with many .condensed files. How do I clean up old logs (that I know is not needed anymore), or reduce the size of the logs.
I tried to remove the commitlog.d folder altogether when restarting weaviate, It works fine, but I'm not sure about the consequences of that.
I also searched for ways to clean up weaviate logs, or about the .condensed files in weaviate docs, but no luck
You cannot remove the commit log files as this has the graph operations (edge changes) applied to HNSW.
Weaviate has a commit log combiner for HNSW that will merge commit logs together and remove duplicate information reducing the size.
What is the current disk utilization of your cluster and sizes of each commit log? Please feel free to directly contact me (on community Slack) as I'd need more info to debug further.