Scraper of District Court Website
Our client needed a flexible, real-time scraper that collects cases from a District Court website. The website is protected by a captcha, so handling that protection was one of the main challenges, and we resolved it successfully. We implemented the scraper together with an admin interface for managing its settings. All scraped cases are presented in a spreadsheet with sorting and filtering options, and a date-range selector lets the user view cases for a specific period and export them to CSV. Because the client wanted to monitor multiple districts at the same time, the scraper is multithreaded: it behaves as several concurrent users and combines their results into a single database, which improves throughput and data availability. The scraper also has multi-level failure protection so that no scraped data is lost.
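
The overall flow (one worker per district, retries on failure, a single shared database, and a date-range CSV export) can be illustrated with a minimal Python sketch. This is not the project's actual code: the SQLite storage, the `cases` table layout, and every function name here are assumptions made for illustration, and `fetch_cases` is a hypothetical stand-in that returns fixed sample rows instead of contacting any website.

```python
import csv
import io
import sqlite3
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def fetch_cases(district):
    """Hypothetical stand-in for the real scraping call; returns sample rows."""
    return [
        (f"{district}-001", district, "2024-01-10"),
        (f"{district}-002", district, "2024-02-15"),
    ]


def fetch_with_retry(district, fetch=fetch_cases, attempts=3, delay=0.0):
    """Retry a failing fetch a few times before giving up (one protection layer)."""
    for attempt in range(1, attempts + 1):
        try:
            return fetch(district)
        except Exception:
            if attempt == attempts:
                raise
            time.sleep(delay)


def make_db():
    """Create an in-memory database (assumed schema) shared by all workers."""
    conn = sqlite3.connect(":memory:", check_same_thread=False)
    conn.execute(
        "CREATE TABLE cases (case_id TEXT PRIMARY KEY, district TEXT, filed_on TEXT)"
    )
    return conn


def scrape_all(districts, conn, fetch=fetch_cases):
    """Scrape each district in its own thread and merge rows into one table."""
    lock = threading.Lock()  # serializes writes on the shared connection

    def worker(district):
        rows = fetch_with_retry(district, fetch)
        with lock:
            # INSERT OR IGNORE keeps re-scraped cases from being duplicated.
            conn.executemany("INSERT OR IGNORE INTO cases VALUES (?, ?, ?)", rows)
            conn.commit()

    with ThreadPoolExecutor(max_workers=max(1, len(districts))) as pool:
        list(pool.map(worker, districts))  # list() surfaces worker exceptions


def export_csv(conn, start, end):
    """Return cases filed between start and end (ISO dates, inclusive) as CSV text."""
    rows = conn.execute(
        "SELECT case_id, district, filed_on FROM cases "
        "WHERE filed_on BETWEEN ? AND ? ORDER BY filed_on, case_id",
        (start, end),
    ).fetchall()
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["case_id", "district", "filed_on"])
    writer.writerows(rows)
    return buf.getvalue()
```

A typical call sequence under these assumptions would be `conn = make_db()`, then `scrape_all(["north", "south"], conn)`, then `export_csv(conn, "2024-01-01", "2024-01-31")`. A production system would likely add further protection layers beyond the simple retry shown here, such as persistent storage and logging of failed districts.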