c++ regex boost-regex

Regex to detect negative numbers but not hyphenated numbers


I want to detect positive and negative integers but not hyphenated numbers. For example:

From 2020-2022, the frequency of COVID-19 increased when the temperature fell between 10 and -10 degrees. (2)

I want to match 10, -10, and 2, but not -2022 or -19.*

*I would rather not match the 2020 either, but "integers, not years" is beyond the scope of regular expressions.


Solution

  • Match what you don't want first, without capturing it; then, after a pipe (|), match what you do want inside a capture group; finally, keep only the captured group.

    For example, in Python you can use the following, which does not need the (*SKIP)(*FAIL) verbs:

    import re
    
    s = """
    From 2020-2022, the frequency of COVID-19 increased when the temperature fell between 10 and -10 degrees. (2) From 2020 - 2022, 
    the frequency of COVID-19 increased when the temperature fell between 10 and -10 degrees. (2)
    
    """
    
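    # Alternative 1 (not captured): hyphenated tokens such as "2020-2022" or "COVID-19".
    # Alternative 2 (captured): a signed integer.
    # findall() yields '' for alternative-1 matches; filter(None, ...) drops them.
    print(tuple(filter(None, re.findall(r'(?im)(?:\d+\s*|[a-z0-9]+)-\s*\d+|([-+]?\d+)', s))))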
    
    
    ('10', '-10', '2', '10', '-10', '2')
    

    Further details are elaborated on in this link.
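
Since the question is tagged C++, here is a rough sketch of the same "match the unwanted part first, capture the wanted part" idea using std::regex from the standard library. The helper name find_integers is hypothetical, and the character class is written as [A-Za-z0-9] instead of relying on the inline (?i) flag, because the ECMAScript grammar used by std::regex has no inline flags. Treat it as an illustration under those assumptions rather than a drop-in Boost.Regex solution.

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper: returns the signed integers in `text`, skipping
// numbers that are part of a hyphenated token ("2020-2022", "COVID-19").
std::vector<std::string> find_integers(const std::string& text) {
    // Alternative 1 (no capture group) consumes hyphenated tokens.
    // Alternative 2 (capture group 1) matches an optionally signed integer.
    static const std::regex pattern(
        R"((?:\d+\s*|[A-Za-z0-9]+)-\s*\d+|([-+]?\d+))");

    std::vector<std::string> result;
    for (std::sregex_iterator it(text.begin(), text.end(), pattern), end;
         it != end; ++it) {
        // Group 1 only participates when alternative 2 matched.
        if ((*it)[1].matched) {
            result.push_back((*it)[1].str());
        }
    }
    return result;
}
```

Calling find_integers on the example sentence from the question should give 10, -10 and 2, mirroring the Python output above.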