pythonregexlinuxfiledirectory

How do I search directories and find files that match regex?


I recently started getting into Python and I am having a hard time searching through directories and matching files based on a regex that I have created.

Basically I want it to scan through all the directories in another directory and find all the files that ends with .zip or .rar or .r01 and then run various commands based on what file it is.

import os, re

rootdir = "/mnt/externa/Torrents/completed"

for subdir, dirs, files in os.walk(rootdir):
    if re.search('(w?.zip)|(w?.rar)|(w?.r01)', files):
        print "match: " . files

Solution

  • import os
    import re
    
    rootdir = "/mnt/externa/Torrents/completed"
    regex = re.compile('(.*zip$)|(.*rar$)|(.*r01$)')
    
    for root, dirs, files in os.walk(rootdir):
      for file in files:
        if regex.match(file):
           print(file)
    

    The code below answers the question in the following comment:

    That worked really well, is there a way to do this if match is found on regex group 1 and do this if match is found on regex group 2 etc ? – nillenilsson

    import os
    import re
    
    regex = re.compile('(.*zip$)|(.*rar$)|(.*r01$)')
    rx = '(.*zip$)|(.*rar$)|(.*r01$)'
    
    for root, dirs, files in os.walk("../Documents"):
      for file in files:
        res = re.match(rx, file)
        if res:
          if res.group(1):
            print("ZIP",file)
          if res.group(2):
            print("RAR",file)
          if res.group(3):
            print("R01",file)
    

    It might be possible to do this in a nicer way, but this works.