fix: prevent invalid commands matches on 5 characters or less (932220 PL-2, 932230 PL-1, 932232 PL-3, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932238 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1) #3735

EsadCetiner · 2024-06-18T09:38:42Z

There have been many reports of false positives with the 932260 family of rules, most of them stem from invalid commands being matched (such as identity matching id). This pr solves most of these issues by preventing matches of invalid commands on 5 characters or less, I didn't prevent invalid matches for all commands to avoid the regex size exploding in size. I've modified the definition of @ from [\s<>&|)] to (?:[\s<>&|)]|$) and adding @ to all commands with 5 characters or less. This results in either matching a command without arguments, or an command with an argument without matching permutations of a commands (For example, sudo matching sudoaaaa).

Since this is a big change, I likely may have missed some commands that were intended to be matched (For example, ip matching iptables and ip6tables) but were shortened for performance or other reasons.

Some new attacks are detected with this PR, but some false negatives are introduced. The unix rules need a complete overhaul however this PR should be fine as an short to medium term fix.

Fixes false positives with english words:
identity
unique
time express

This should fix false positives with random data such as UUID, session cookies, tokens, and base64 data.

This might cause some new false positives because some commands are being matched with no arguments when they previously weren't, but this shouldn't be as bad as the false positives it fixes. I've excluded some commands to try and prevent the most common instances of these false positives (including some old ones, for example matching id=1).

closes: #3711
closes: #3631
closes: #3725
closes: #3812
closes: #3985

… PL-1, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1)

theseion · 2024-06-19T18:56:35Z

I'll need a couple of days before I can look at this PR in detail.

fzipi · 2024-06-20T12:42:32Z

Since this is a big change, I likely may have missed some commands that were intended to be matched (For example, ip matching iptables and ip6tables) but were shortened for better performance.

Just as a note: ip is a command, not a shortened version.

fzipi · 2024-06-20T12:54:58Z

(?:[\s<>&|)]|$)

Maybe using a word boundary instead? Like (?:[\s<>&|)])?\b.

EsadCetiner · 2024-06-22T06:49:06Z

@fzipi

Maybe using a word boundary instead? Like (?:[\s<>&|)])?\b

I've further simplified this by just using a word boundry and getting rid of the optional non-capturing group. The effect is pretty much the same, detection might be slightly better, but the generated regexes have reduced in size dramatically with this change (especially 932236).

For some reason only using a word boundry for rule 932237 crashes Apache, I couldn't figure out the cause so I just added a workaround and comment.

theseion · 2024-06-29T18:30:34Z

Thank you very much @EsadCetiner for this PR. I think the idea of looking for word boundaries is great. What I don't like is that we now need to add @ everywhere. It was bad enough already.

I have an idea for a different approach that I think makes more sense in the long run. I think we should simplify the command line processor in crs-toolchain and remove the logic for @ and ~. Instead, a block processed with the command line processor should end with a \b. Consider the following block:

##!> cmdline unix
  gcc
  sudo
##!<

The result would look as follows (simplified):

(?:gcc|sudo)\b

This still leaves ~. I think we can take care of that by using \w+ instead. Consider the following:

##!> cmdline unix
  python\w+
##!<

This would result in the following (simplified):

(?:python(?:\w+))\b

This would match python2 and python3, for example, but not python . I would make these replacements in the list of commands directly, as there are only a couple of them anyway.

In conclusion:

we would get rid of @ and ~
we could remove the replacements for @ and ~ in the assembly files for rules that don't use the command line processor
we would fix most of the false positives that have been reported over recent weeks where a command is matched as a prefix
we would reduce the sizes of many regular expressions significantly

@EsadCetiner Please open a separate PR with the additions (I saw a couple svn related commands).

theseion · 2024-06-30T19:06:39Z

I realise your PR solves an issue we have now and will prevent more issues from being filed. I propose to go ahead with your PR and implement my proposal later on.

To complete your PR, @EsadCetiner, you'll have to also adapt toolchain.yaml.

EsadCetiner · 2024-07-01T16:26:01Z

@theseion

I have an idea for a different approach that I think makes more sense in the long run. I think we should simplify the command line processor in crs-toolchain and remove the logic for @ and ~. Instead, a block processed with the command line processor should end with a \b.

Agreed, I had a general idea of something like this but I wasn't sure on how to approach this.

I realise your PR solves an issue we have now and will prevent more issues from being filed. I propose to go ahead with your PR and implement my proposal later on.

4.5.0 is coming out in 3 weeks or so, I'll try and address all of your feedback before then. I'd rather get it out of the way now so it's not forgotten.

Please open a separate PR with the additions (I saw a couple svn related commands).

These commands are already detected in the current release, but they stopped being detected once I prevented the invalid match. Shouldn't those additions be part of this PR so those commands are still covered, therefore avoiding some regressions?

To complete your PR, @EsadCetiner, you'll have to also adapt toolchain.yaml.

I'm not entirely clear on what this does, I see what looks to be@ here, but I don't see ~.

  anti_evasion_suffix:
    # - <>: redirection, e.g., `cat<foo`
    # - ,: brace expansion, e.g., `""{nc,-p,777}`
    ## - &|: logical operators in headers, e.g., `a=nc&&$a -nlvp 555`
    unix: |
      [\s<>,&|)].*

I assume you want me to modify the unix evasion suffix to this, then remove all references to @.

    unix: |
-      [\s<>,&|)].*
+      \b

I think your saying here:

Instead, a block processed with the command line processor should end with a \b

This still leaves ~. I think we can take care of that by using \w+ instead.

To modify the affected regex assembly files to something like this, while keeping ~:

##!> assemble
  ##!> cmdline unix
    ##!> include-except unix-shell-upto3 unix-shell-fps-pl1 -- ~ \w+
##!<

I hope this wasn't too convoluted, is this the general idea of what you want me to do?

theseion · 2024-07-02T18:46:19Z

These commands are already detected in the current release, but they stopped being detected once I prevented the invalid match. Shouldn't those additions be part of this PR so those commands are still covered, therefore avoiding some regressions?

Yes.

I'm not entirely clear on what this does, I see what looks to be@ here, but I don't see ~.

~ is anti_evasion_no_space_suffix. Don't replace the wildcard match .*.

I assume you want me to modify the unix evasion suffix to this, then remove all references to @.

Modify, yes. We still need @, @ will tell the toolchain to add that suffix.

I think your saying here:

Instead, a block processed with the command line processor should end with a \b

This still leaves ~. I think we can take care of that by using \w+ instead.

To modify the affected regex assembly files to something like this, while keeping ~:

##!> assemble
##!> cmdline unix
##!> include-except unix-shell-upto3 unix-shell-fps-pl1 -- ~ \w+
##!<

You can ignore that part. That's my proposal for the change of the processor. It has nothing to do with your PR.

EsadCetiner · 2024-07-03T23:16:28Z

@theseion done, ready for review

EsadCetiner · 2024-07-09T22:19:16Z

I've gone back to my original solution of resolving this false positive without a word boundry (?:[\s<>&|)]|$), the word boundry was causing a fair few false positives. Added tests to make sure those false positives don't come up again.

theseion · 2025-01-12T16:02:34Z

Do you think the \b makes sense for ~? Could you give me an example where \b makes a difference?

EsadCetiner · 2025-01-12T23:49:56Z

@theseion

\b is needed to prevent matching permutations of commands beyond 5 characters when ~ is used, it acts similar to adding @ to a command. The quantifier is primarily there since I'm not sure how long the permutations of some commands can be, I'm just being conservative with causing too many false negatives. I can remove it if you think it's redundant.

I've checked over my work again this morning and I think this PR is ready to be merged if nobody else has objections.

theseion · 2025-01-13T19:12:53Z

Ok. I think it makes sense but I think 5 might be too conservative. For example, docker-compose would require 7. How about we increase it to 10? That should suffice for almost anything.

Please also update the anti_evasion_no_space_suffix definitions for both windows and unix to use the same restriction.

EsadCetiner · 2025-01-14T05:46:38Z

@theseion these look machine generated, is there another file I'm supposed to edit then compile the regex or should I just replace them completely? It's hard to read the regex and understand what it's trying to match.

theseion · 2025-01-14T07:09:44Z

No, I created them manually. But you're right. I'll handle it.

Match at most 10 consecutive characters

theseion · 2025-01-15T19:44:51Z

I've updated the expressions @EsadCetiner, please take a look. The expressions mainly consist of the anti_evasion and anti_evation_suffix patterns, with a few additions. Once you see that, they're not hard to understand.

EsadCetiner · 2025-01-16T06:37:27Z

@theseion I think I have a better understanding of the regex now, the extra comments helped clear up some confusion.

I've taken a look at your changes and found a regression with the anti_evasion_no_space_suffix for UNIX, it's not doing what it says in the tin (It's now matching python space when it shouldn't be).

I've pushed a fix, but the fix felt a little too easy, can you double check it?

theseion · 2025-01-16T20:06:59Z

Duh. You're right. \s mustn't be part of that group. That's exactly why we have the ~ suffix :)

theseion · 2025-01-16T20:07:32Z

LGTM

theseion · 2025-02-06T19:30:45Z

@EsadCetiner are you waiting for me to perform the merge?

EsadCetiner · 2025-02-06T23:12:33Z

@theseion no, I was waiting for somebody else to review it since it's a big change. I'll merge in a few days if nobody has any objections.

EsadCetiner added 6 commits June 15, 2024 10:45

fix: prevent invalid commands matches on 5 characters or less (932230…

f9be489

… PL-1, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1)

fix: copy paste error

b870dc9

fix: invalid output in tests

1e56e28

test: enable tests to detect new attacks

f97a09c

Merge branch 'coreruleset:main' into fix-invalid-command-matches

cdd86d7

test: enable tests to detect new attacks

b583f22

EsadCetiner requested a review from theseion June 18, 2024 09:38

fix: correct description for 932250-4

ac55e54

EsadCetiner added 2 commits June 22, 2024 16:33

Merge branch 'main' into fix-invalid-command-matches

259cb1f

perf: use word boundry to prevent invalid matches

8e001e5

EsadCetiner added 2 commits June 22, 2024 21:50

test: add test for id command

b893999

fix: invalid output for tests

2355eb7

EsadCetiner added the ➕ False Positive label Jun 25, 2024

fzipi mentioned this pull request Jul 1, 2024

Monthly Chat Agenda July 2024 (2024‐07‐01 and 2024‐07‐15) #3728 #3753

Closed

EsadCetiner added 5 commits July 3, 2024 06:51

Merge branch 'main' into fix-invalid-command-matches

7101a8f

fix: add missing line break

2d54859

fix: invalid test format

40f9b4b

chore: update toolchain

1dbcd8f

test: enable tests for newly detected attacks

a07ffa4

fix: don't use word boundry to prevent invalid matches

98a8cf2

EsadCetiner added 2 commits January 12, 2025 13:52

chore: update unix-shell.data

9bf8e99

fix: typos

7038308

chore: update anti_evasion_no_space_suffixes

20ecdd8

Match at most 10 consecutive characters

theseion force-pushed the fix-invalid-command-matches branch from 256d481 to 20ecdd8 Compare January 15, 2025 19:43

EsadCetiner added 2 commits January 16, 2025 17:33

fix: regression with unix evasion suffix no space

8466d83

chore: update regex

e20192d

theseion approved these changes Jan 16, 2025

View reviewed changes

This was referenced Jan 25, 2025

CRS 3.0 Blocking cf_clearence cookie from cloudflare #3985

Closed

Shell false positives for rules 932260 and 932236 #3631

Closed

fzipi mentioned this pull request Feb 2, 2025

Monthly Chat Agenda February 2025 (2025-02-03) #3990

Closed

Merge branch 'main' into fix-invalid-command-matches

8cffb1a

EsadCetiner added this pull request to the merge queue Feb 10, 2025

Merged via the queue into coreruleset:main with commit 26bec41 Feb 10, 2025
6 checks passed

EsadCetiner deleted the fix-invalid-command-matches branch February 10, 2025 05:42

theseion mentioned this pull request Feb 10, 2025

Rule 932370 has false positive for "At" after newline #3953

Open

1 task

EsadCetiner mentioned this pull request Feb 15, 2025

False positives with 932235 PL1 Remote Command Execution: Unix Command Injection (command without evasion) #3932

Closed

fzipi added the release:fix label Mar 1, 2025

fzipi mentioned this pull request Mar 3, 2025

Monthly Chat Agenda March 2025 (2025-03-03) #4033

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix: prevent invalid commands matches on 5 characters or less (932220 PL-2, 932230 PL-1, 932232 PL-3, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932238 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1) #3735

fix: prevent invalid commands matches on 5 characters or less (932220 PL-2, 932230 PL-1, 932232 PL-3, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932238 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1) #3735

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix: prevent invalid commands matches on 5 characters or less (932220 PL-2, 932230 PL-1, 932232 PL-3, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932238 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1) #3735

fix: prevent invalid commands matches on 5 characters or less (932220 PL-2, 932230 PL-1, 932232 PL-3, 932235 PL-1, 932236 PL-2, 932237 PL-3, 932238 PL-3, 932239 PL-2, 932250 PL-1, 932260 PL-1) #3735

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!