
> The implementation of the scraper is entirely contained in a single GitHub Actions workflow.

It's interesting that you can run a scraper at fixed intervals on a free, hosted CI like that. If the scraped content grows beyond a single JSON file, will GitHub have a problem with it?
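
For context, the pattern in question is roughly a scheduled workflow that fetches data and commits it back to the repo. A minimal sketch (the endpoint URL, cron schedule, filename, and commit message here are all made up for illustration):

    name: scrape
    on:
      schedule:
        - cron: '0 * * * *'    # run hourly
      workflow_dispatch:       # allow manual runs too
    permissions:
      contents: write          # let the workflow push commits
    jobs:
      scrape:
        runs-on: ubuntu-latest
        steps:
          - uses: actions/checkout@v4
          - name: Fetch the latest data
            run: curl -s https://example.com/api/data -o data.json
          - name: Commit only if the data changed
            run: |
              git config user.name "github-actions[bot]"
              git config user.email "github-actions[bot]@users.noreply.github.com"
              git add data.json
              git diff --cached --quiet || git commit -m "Update data"
              git push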



GitHub repos appear to have a "soft" size limit of about 1GB - I feel completely comfortable keeping up to that much content in a free repo.

Once you get above 5GB I believe GitHub Support may send you a quiet, polite email asking you to reconsider!

https://docs.github.com/en/repositories/working-with-files/m... has some more information on limits - they suggest keeping individual files below 50MB (and definitely below 100MB).


I occasionally scrape results from Brazilian lotteries. Their official websites have internal APIs which simply return JSON data, so I download the JSON and commit it to the repository. Right now I have 5504 files totalling 22 MB. GitHub hasn't complained yet.
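
The accumulation pattern is roughly a step like this, one new file per run rather than overwriting a single file (a sketch; the URL and directory name are hypothetical):

    - name: Save today's results
      run: |
        mkdir -p results
        curl -s "https://example.com/loteria/latest" \
          -o "results/$(date -u +%Y-%m-%d).json"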



