I am writing a function that will download an HTML post.
Having downloaded the string I will:
- strip off the html tags
- remove special encoding characters like n
- remove trailing white spaces
My question is: should I first download all the content and then proceed with the modifications or should I apply the modifications right after the string has been downloaded (and only then download the next string) ?
This question is mostly about architecture and good practices. In this particular case, speed is not an issue because I don’t have more than 1000 posts and this procedure will run once a month. The language is Python.
What is the best practice for this case? And what is the best practice in general ?