-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
grok should have a replace functionality like we have for mutate #103
Comments
Grok is primarily for parsing, not modifying data. The mutate filter (since it does text replacement already), or a new filter, feels like a better place to implement this proposal. |
Otherwise, I am in favor of this feature. |
Thanks @jordansissel for supporting this issue. Since this issue interests you, I have 1 more point to make it more interesting. Currently, we do write 1 custom regex pattern on each line like below.../my_pattern_directory/my_pattern_file
CCN_MASTER [1-2]{16}
CCN_VISA [2-3]{15}
CCN_AMEX [3-4]{14}
CCN_MAESTRO [4-5]{13} Inorder to apply above patterns on a log message, we need to write filter like something as shown belowfilter
{
grok {
patterns_dir => ["/logstash/patterns"]
match => { "message" => "%{CCN_MASTER}" }
add_field => { "Infosec_Pattern_Found" => "CCN" }
}
grok {
patterns_dir => ["/logstash/patterns"]
match => { "message" => "%{CCN_VISA}" }
add_field => { "Infosec_Pattern_Found" => "CCN" }
}
} As we can see, the grok count will increases as we have more no of patterns. Also, the "Infosec_Pattern_Found" field getting added redundantly here. Proposed solutionInstead of identifying custom patterns individually, we can group them like below.../my_pattern_directory/my_pattern_file
CCN
{
MASTER [1-2]{16}
VISA [2-3]{15}
AMEX [3-4]{14}
MAESTRO [4-5]{13}
} And the corresponding filter looks something like below. filter
{
grok {
patterns_dir => ["/logstash/patterns"]
match => { "message" => "%{CCN}" }
add_field => { "Infosec_Pattern_Found" => "CCN" }
}
}
OR
filter
{
grok {
patterns_dir => ["/logstash/patterns"]
match => { "message" => "%{CCN.MASTER}" }
add_field => { "Infosec_Pattern_Found" => "CCN" }
}
} This way, we will achieve:
Please do consider this point as well if it seems feasible. Let me know if we can track this altogether in a different ticket. |
You can do this today:
|
Usecase
I have few custom regex patterns which looks for some sensitive information in the log messages like credit card number, social security number etc.
I have applied these patterns inside grok and matching each log message for regex's I wrote in a file inside patterns folder.
Log message which has a matching pattern would be added with a custom field named "Infosec_Pattern" with matching pattern values like "CCN, SSN" etc.
Logstash version 2.3.1
Below is the sample filter config
This works perfect. Now what I want is:
Replace a matched string with some value like "XXXXXXXX" in the message since the matching string contains sensitive information.
In order to do this, I need to make use of mutate where I have to again find the pattern in log message and replace it with desired value using gsub.
Below is the sample filter config (with mutate section)
Output after applying above sample config
Parsed log message without having mutate section looks like below:
Parsed log message having mutate section looks like below:
As we can clearly see, we need to match a pattern twice if I want to replace the matched string in the original message field.
I tried to use overwrite inside grok but that is not helping much as sensitive data can be present anywhere in the string. And also I would not be able to replace the data with some desired value like "XXXX" using overwrite.
Expectation
OR
Option 1 seems to be a best fit for this.
The text was updated successfully, but these errors were encountered: