pythonperformancereturngeneratoryield

Which is generally faster, a yield or an append?


I am currently in a personal learning project where I read in an XML database. I find myself writing functions that gather data and I'm not sure what would be a fast way to return them.

Which is generally faster:

  1. yields, or
  2. several append()s within the function then return the ensuing list?

I would be happy to know in what situations where yields would be faster than append()s or vice-versa.


Solution

  • yield has the huge advantage of being lazy and speed is usually not the best reason to use it. But if it works in your context, then there is no reason not to use it:

    # yield_vs_append.py
    data = range(1000)
    
    def yielding():
        def yielder():
            for d in data:
                yield d
        return list(yielder())
    
    def appending():
        lst = []
        for d in data:
            lst.append(d)
        return lst
    

    This is the result:

    python2.7 -m timeit -s "from yield_vs_append import yielding,appending" "yielding()"
    10000 loops, best of 3: 80.1 usec per loop
    
    python2.7 -m timeit -s "from yield_vs_append import yielding,appending" "appending()"
    10000 loops, best of 3: 130 usec per loop
    

    At least in this very simple test, yield is faster than append.