I want to detect positive and negative integers but not hyphenated numbers. For example:
From 2020-2022, the frequency of COVID-19 increased when the temperature fell between 10 and -10 degrees. (2)
I want to match 10
, -10
, and 2
, but not -2022
or -19
.*
\b-?\d+\b
is catching ALL the numerics because -
is a non-word character\B-\d+\b
will catch ONLY the negatives.(\B-|(?<!\w-)\b)\d+\b
should work, but my engine cannot use negative lookbehinds. sigh*I would rather not match the 2020
too but "integers not years" is beyond the scope of regular expressions.
You can first match what you don't want; then use a pipe; and then match what you want; and finally use the latter.
For example, you can use the following without SKIP FAIL
:
import re
s = """
From 2020-2022, the frequency of COVID-19 increased when the temperature fell between 10 and -10 degrees. (2) From 2020 - 2022,
the frequency of COVID-19 increased when the temperature fell between 10 and -10 degrees. (2)
"""
print(tuple(filter(None, re.findall(r'(?im)(?:\d+\s*|[a-z0-9]+)-\s*\d+|([-+]?\d+)', s))))
('10', '-10', '2', '10', '-10', '2')
Further details are elaborated on in this link.