535480 Scraping multiple websites & Consolidating Data
Ni upoštevno
V teku
Objavljeno pred približno 13 leti
Ni upoštevno
Plačilo ob dostavi
A php script which scrapes Television Series information. Specially the Upcoming, new seasons, new episodes. To achieve a complete database, scrape the 4 websites below, and then unite the data for each series into one entry
The websites are:
[login to view URL]
fields: (title, year, imdb website, description, categories (drama, comedy, etc..), actors
[login to view URL]
fields: title, original run date (start and end if available), official website, imdb website, first 3 actors (might be better formatted on imdb site)
[login to view URL]
fields: title, imdb web site, dates, aired from date, last date, actors (if available)
[login to view URL]
fields: title, imdb website, description
Most importantly, consolidate the data, so that each series has only one entry. Take the title series as the primary key, and make sure each series has only one title.
Output is to be an sql table with title as primary key. Each TV series to have only one title.