seleniumselenium-webdriverweb-scrapingwebdriverdiskcache

Setting Disk Cache size in Selenium, while webscraping multiple websites?


From the available information I understood that setting disk cache size in selenium will help in faster loading of the web pages, when we are doing the scraping or anything on single website. But my question is what good will it do if we set the disk cache size while dealing with multiple websites?

Or is it in fact bad to set disk cache size? When scraping multiple web pages i.e. in a way the websites can trace that we are scraping?


Solution

  • Disk Cache is a cache memory that is used to speed up the process of storing and accessing data from the host machine hard disk. It enables faster processing during reading/writing, issuing commands and other I/O process between the hard disk, the memory and computing components. A disk cache is also referred to as a disk buffer or cache buffer.


    Chromium disk cache

    The disk cache stores resources fetched from the web so that they can be accessed quickly at a latter time if needed. The main characteristics are:


    Conclusion

    To conclude, by default will be configured with the default value for the which users can configure as per their respective usecases.


    Changing Chrome Cache size on Windows 10

    There is only one method that can be used to set and limit Google Chrome’s cache size.

    Change-Chrome-Cache-size.png

    Google-Chrome-Properties-446x600.png

    Here 2147483648 is the size of the cache in bytes which is equal to 2 Gigabytes.