Recently I had some free time and decided to automate some of my common tasks. And let me tell you honestly, I hate screen scraping. It’s an annoying, tedious task: writing a regex for this and a regex for that, only to find out my hours were wasted because that regex won’t work on another site.
That’s a thing of the past.
I’m super excited about this find. Maybe I’m the last to discover it, but it’s just too awesome to pass up.
The project is called PHP Simple HTML DOM Parser.
This takes almost all of the frustration out of screen scraping. Here’s an example from a quick-and-dirty script that logs in and grabs my stats from ESEA.
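What follows is only a rough sketch of the parsing half of that kind of script, not the actual ESEA code. It assumes `simple_html_dom.php` from the project sits next to the script, and the stats table markup (the `stats` id, the `name` and `value` classes, the numbers) is made up for illustration. The login step is left out entirely.

```php
<?php
// Assumption: simple_html_dom.php (from PHP Simple HTML DOM Parser)
// is in the same directory as this script.
require_once __DIR__ . '/simple_html_dom.php';

// Hypothetical markup standing in for a stats page fetched after logging in.
$source = <<<HTML
<html><body>
<table id="stats">
  <tr><th>Stat</th><th>Value</th></tr>
  <tr><td class="name">Kills</td><td class="value">120</td></tr>
  <tr><td class="name">Deaths</td><td class="value">95</td></tr>
</table>
</body></html>
HTML;

// Parse the string into a DOM object.
$html = str_get_html($source);

// Walk the rows with CSS-style selectors instead of regexes.
$stats = [];
foreach ($html->find('table#stats tr') as $row) {
    $name  = $row->find('td.name', 0);   // null when the row has no such cell
    $value = $row->find('td.value', 0);
    if ($name && $value) {
        $stats[trim($name->plaintext)] = trim($value->plaintext);
    }
}

// Free the parser's memory once done.
$html->clear();

print_r($stats);
```

For a page that doesn't need a login, the library's `file_get_html()` can load straight from a URL in place of `str_get_html()`.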
And that’s it.
Now take a look at that, and realize how much stuff I’m not forced to do.
I know this isn’t some great new invention, loading the source into a DOM object and parsing it, but man, this almost eliminates the need to think about screen scraping entirely.
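For comparison, here is a minimal sketch of that same load-it-into-a-DOM idea using only PHP's built-in DOM extension (`DOMDocument` and `DOMXPath`) instead of Simple HTML DOM Parser. The markup and the `links` id are made up for illustration.

```php
<?php
// Hypothetical markup; in a real scrape this would be the fetched page source.
$source = '<html><body><ul id="links">'
    . '<li><a href="/news">News</a></li>'
    . '<li><a href="/stats">Stats</a></li>'
    . '</ul></body></html>';

// Real-world HTML is rarely valid, so collect parser warnings
// internally instead of printing them.
libxml_use_internal_errors(true);

$dom = new DOMDocument();
$dom->loadHTML($source);
libxml_clear_errors();

// Query the DOM with XPath rather than matching the raw text.
$xpath = new DOMXPath($dom);
$links = [];
foreach ($xpath->query('//ul[@id="links"]//a') as $a) {
    $links[$a->textContent] = $a->getAttribute('href');
}

print_r($links);
```

It works, but the XPath query is noticeably wordier than a CSS-style selector, which is a big part of why the library feels so much nicer to use.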