Closed
Description
Describe the bug
While reviewing #2333, noticed two things:
- the 920120 regexp is not "disassembled". We should disassemble it so it is easier to compare with the no-backtracking one.
- Looks like the regexp is missing the
á
meta. It should match using&[aeEiIoOuUyY]acute
. We should also review all these html metas so see if we are missing others, just in case.
Steps to reproduce
Read the data files for regexp 920120.
Expected behaviour
In PL1 html metas like á
should not match.
Actual behaviour
It is matching.
Additional context
Your Environment
- CRS version (e.g., v3.2.0): v3.4.0-dev