pythonparsingloggingdrain3

How to force drain3 to keep log event defining parameter as a static?


I am trying to parse log lines for log anomaly detection, but two log lines are too similar for the parser to keep them apart:

[Something] VM Started
[Something] VM Paused

it parses it to VM <*> grouping the event into the same cluster. I tried masking Started and Paused to force it to detect it but maybe haven't found the right masking option yet.

I am aware that in general the parser is doing a fine job at finding the variable content, but in this case I would like to keep it separated.

Last idea I have is to mask the entire line and replace it but I wonder whether a better way exists.


Solution

  • Those log sequences are pretty short and we would need to know your similarity threshold st configuration value . In the original paper the similarity is defined as follows.