I'm an expert in regular expressions, spiders/crawlers (web scraping / data mining) and databases (MySQL, MSSQL, PL/SQL, H2, SQLite) with more than 15 years of experience collecting large amounts of data from the web (tens of millions of pages per month). My experience includes collecting data from large sites like LastFM, Goodreads, IMDB, Tripadvisor, Yelp, Booking, Google Maps, etc.
The output data can be delivered as SQL (for import), CSV, TSV, JSON, XML or any other data format, as long as you give me a specification for it.
Just give me an example with 2-3 records in your data format, along with the URLs this data comes from, and I can start.
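To illustrate what such a sample might look like, here is a minimal Python sketch that writes the same two records as CSV, TSV and JSON. Everything in it is a placeholder: the field names (`title`, `rating`, `source_url`), the values and the example.com URLs are hypothetical and only show the shape of a spec, not data from any real project.

```python
import csv
import io
import json

# Hypothetical sample records: field names, values and URLs are placeholders.
records = [
    {"title": "Example Item A", "rating": "4.5",
     "source_url": "https://example.com/items/1"},
    {"title": "Example Item B", "rating": "3.8",
     "source_url": "https://example.com/items/2"},
]


def to_csv(rows, delimiter=","):
    # Serialize rows to delimited text; pass a tab as delimiter for TSV.
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=list(rows[0].keys()),
        delimiter=delimiter,
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def to_json(rows):
    # Serialize rows to a pretty-printed JSON array.
    return json.dumps(rows, indent=2)


if __name__ == "__main__":
    print(to_csv(records))                  # CSV
    print(to_csv(records, delimiter="\t"))  # TSV
    print(to_json(records))                 # JSON
```

A sample of this size, in whichever of these formats you prefer, is usually enough to pin down column names, value formats and the source pages.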