imdb-scraper

A simple Node.js IMDb Title Scraper

Don't know how to use Node.js? Try my live demo API.

☁️ Installation

# Clone it
-> git clone https://github.com/baderproductions/imdb-scraper.git
-> cd imdb-scraper
-> npm install

# Use it
-> npm start
-> Go to http://localhost:6005/[IMDb title ID here], for example http://localhost:6005/tt0145487
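
If you would rather call the running scraper from code than from the browser, a minimal sketch could look like the following. The names `BASE_URL`, `buildScraperUrl` and `fetchTitle` are hypothetical and not part of this repository; only the port and the path shape come from the steps above. It assumes Node.js 18 or newer (for the built-in `fetch`) and that the server was started with `npm start`.

```javascript
// Hypothetical helpers; only the port (6005) and the URL shape come from this README.
const BASE_URL = "http://localhost:6005";

// IMDb title IDs are "tt" followed by digits, e.g. "tt0145487".
const buildScraperUrl = (titleId) => {
  if (!/^tt\d+$/.test(titleId)) {
    throw new Error(`Not an IMDb title ID: ${titleId}`);
  }
  return `${BASE_URL}/${titleId}`;
};

// Requests one title from the locally running scraper and parses the JSON body.
const fetchTitle = async (titleId) => {
  const response = await fetch(buildScraperUrl(titleId));
  if (!response.ok) {
    throw new Error(`Scraper responded with HTTP ${response.status}`);
  }
  return response.json();
};

// Usage: fetchTitle("tt0145487").then((movie) => console.log(movie.title));
```

Validating the ID before sending the request keeps obviously malformed input from reaching the scraper at all.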

FAQ

1. What response should I expect from the scraper?

Example from http://localhost:6005/tt0145487:

{
  "link": "https://www.imdb.com/title/tt0145487",
  "title": "Spider-Man (2002)",
  "year": "2002",
  "rating": "7.3",
  "duration": "2h 1min",
  "genre": "Action",
  "poster": "https://m.media-amazon.com/images/M/MV5BZDEyN2NhMjgtMjdhNi00MmNlLWE5YTgtZGE4MzNjMTRlMGEwXkEyXkFqcGdeQXVyNDUyOTg3Njg@._V1_UX182_CR0,0,182,268_AL_.jpg",
  "plot": "When bitten by a genetically modified spider, a nerdy, shy, and awkward high school student gains spider-like abilities that he eventually must use to fight evil as a superhero after tragedy befalls his family."
}
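
Every value in the example response is a string, including `year`, `rating` and `duration`. If you need numbers on the consuming side, a conversion along these lines may help. `durationToMinutes` and `normalizeTitle` are hypothetical helpers, not part of the scraper, and the duration pattern is assumed from the single example above ("2h 1min").

```javascript
// Hypothetical helper: turns "2h 1min", "2h" or "45min" into a number of minutes.
// Returns null when the string does not have that shape.
const durationToMinutes = (duration) => {
  const match = /^(?:(\d+)h)?\s*(?:(\d+)min)?$/.exec(duration.trim());
  if (!match || (match[1] === undefined && match[2] === undefined)) {
    return null;
  }
  return Number(match[1] || 0) * 60 + Number(match[2] || 0);
};

// Hypothetical helper: copies the response and converts the numeric fields.
const normalizeTitle = (movie) => ({
  ...movie,
  year: Number.parseInt(movie.year, 10),
  rating: Number.parseFloat(movie.rating),
  durationMinutes: durationToMinutes(movie.duration),
});

// Usage: normalizeTitle({ year: "2002", rating: "7.3", duration: "2h 1min" })
```

The original string fields other than `year` and `rating` are left untouched, and the parsed duration is added under a new key so nothing from the response is lost.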

2. What if I want to scrape more fields?

index.js defines the function scrapeIt(), where you can add or remove fields as you wish. Inspect the HTML of the IMDb title page to find the selectors for the elements you need.

 const axios = require("axios");
 const cheerio = require("cheerio");

 // Fetches an IMDb title page and extracts the fields returned by the API.
 // Resolves to null when the request or the parsing fails.
 const scrapeIt = async (url) => {
   try {
     // A failed request rejects here and is handled by the catch block below.
     const { data } = await axios.get(url);
     const selector = cheerio.load(data);
     const link = url;
     const title = selector(".title_wrapper > h1").text().trim();
     const year = selector(".title_wrapper > h1 > span > a").text();
     // attr() returns undefined when the element is missing, so fall back to "".
     const poster = selector(".poster > a > img").attr("src") || "";
     const rating = selector(".ratingValue > strong > span").text();
     const duration = selector(".title_wrapper > div > time").text().trim();
     const genre = selector(".title_wrapper > div > a").first().text();
     const plot = selector(".plot_summary > .summary_text").text().trim();
     return { link, title, year, rating, duration, genre, poster, plot };
   } catch (err) {
     console.error(
       `ERROR: An error occurred while trying to fetch the IMDb url: ${url}`,
       err.message
     );
     return null;
   }
 };
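
Selectors can stop matching whenever IMDb changes its markup, so new fields are easier to maintain if a missing element yields null. The sketch below shows one way to add fields under that assumption. `textOf`, `attrOf` and `extraFields` are hypothetical helpers, `selector` is the function returned by `cheerio.load(data)` inside scrapeIt(), and `.credit_summary_item > a` is only a placeholder selector that you would need to verify by inspecting the page.

```javascript
// Hypothetical helpers; `selector` is the function returned by cheerio.load(data).

// Returns the trimmed text of the first match, or null when nothing matches.
const textOf = (selector, css) => {
  const text = selector(css).first().text().trim();
  return text === "" ? null : text;
};

// Returns an attribute of the first match, or null when it is missing.
const attrOf = (selector, css, name) => {
  const value = selector(css).first().attr(name);
  return value === undefined ? null : value;
};

// Extra fields that could be merged into the object returned by scrapeIt().
// ".credit_summary_item > a" is a placeholder, not verified against IMDb.
const extraFields = (selector) => ({
  director: textOf(selector, ".credit_summary_item > a"),
  poster: attrOf(selector, ".poster > a > img", "src"),
});

// Usage inside scrapeIt(), after `const selector = cheerio.load(data);`:
//   return { link, title, year, ...extraFields(selector) };
```

Because the helpers only take the loaded selector as an argument, they can be exercised with a saved HTML file instead of a live request.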
