Python Developer Needed for Daily News Scraping

₹1500-12500 INR

In Progress

Posted 17 days ago

Paid on delivery

I'm seeking a skilled Python developer to help with web scraping from various news sites. The objective is to gather headlines and articles on a daily basis.

Key Requirements:
- Web scraping experience, particularly with news sites
- Proficiency in Python
- Ability to deliver data in JSON format
- Understanding of ethical web scraping practices
- Experience with error handling and retries in web scraping tasks
- Ability to integrate and retrieve data from APIs if scraping fails
- Proficiency in data cleaning and normalization
- Knowledge of scheduling scraping jobs using tools like cron
- Knowledge of using proxies to prevent IP blocking

Please include examples of relevant past projects in your bid.

Please scrape from major news sites like CNN, BBC, and Reuters, and from tech news sites as well. When sending your bid, please describe your approach to this problem. I have around 20-30 sites for which I want a cron job to run in a flash every day and scrape the main content of each article.
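The "error handling and retries" and "retrieve data from APIs if scraping fails" requirements can be illustrated with a minimal sketch. The function names `fetch_with_retries` and `get_article` are hypothetical, and the `scrape` and `api_fallback` arguments stand in for whatever per-site fetchers a real solution would provide; nothing here comes from the client's or any bidder's actual code.

```python
import json
import time
from typing import Callable, Optional


def fetch_with_retries(fetch: Callable[[], str], attempts: int = 3,
                       base_delay: float = 1.0,
                       sleep: Callable[[float], None] = time.sleep) -> Optional[str]:
    """Call fetch() up to `attempts` times, doubling the wait after each failure.

    Returns None if every attempt raised an exception.
    """
    for attempt in range(attempts):
        try:
            return fetch()
        except Exception:
            # Back off before the next try, but not after the final one.
            if attempt < attempts - 1:
                sleep(base_delay * 2 ** attempt)
    return None


def get_article(scrape: Callable[[], str], api_fallback: Callable[[], str],
                **retry_kwargs) -> dict:
    """Try the scraper first; if it keeps failing, fall back to the API fetcher."""
    source = "scrape"
    body = fetch_with_retries(scrape, **retry_kwargs)
    if body is None:
        source = "api"
        body = fetch_with_retries(api_fallback, **retry_kwargs)
    # source is None when both paths failed, so the caller can log the gap.
    return {"source": source if body is not None else None, "content": body}


def to_json(record: dict) -> str:
    """Serialize one record as JSON, keeping non-ASCII headlines readable."""
    return json.dumps(record, ensure_ascii=False)
```

Passing the fetchers in as callables keeps the retry logic independent of any particular HTTP library, so the same wrapper could sit around a Requests call, a Scrapy-free parser, or a news API client.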

Project ID: 38878616

About the project

13 proposals

Remote project

Active 17 days ago

Awarded to:

@sarbtech123

As a seasoned Python developer and web enthusiast with strong Data Mining skills, I'm confident that I possess the necessary expertise to tackle your daily news scraping project.

₹6,000 INR in 7 days

4.9

(34 reviews)

4.8

13 freelancers are bidding on average ₹7,885 INR for this job

@jinufeb14

With extensive experience in the field of data mining and processing, I am well-prepared to tackle your daily news scraping project. My proficiency with Python, including web scraping and data manipulation, ensures that I can deliver the scraped news content in your desired JSON format accurately and in a timely manner. Ethical practices in web scraping are important for the success of your project, and I have a solid understanding of these principles that ensures the project remains within legal boundaries. My years of experience have allowed me to effectively handle errors and retries within web scraping tasks, as well as integrate and retrieve data from APIs when traditional scraping may not be feasible. Given my extensive background with cloud technology and utilizing related tools for optimal performance and scalability, I can ensure the reliability and efficiency of your daily news scraping process. In addition, my knowledge of scheduling scraping jobs using cron provides hassle-free automated data collection at your desired times. Proxies will be used efficiently to avoid any IP blocking issues during the process. In conclusion, my technical expertise combined with my commitment to delivering high-quality outputs makes me an ideal fit for this project. With me onboard, you can rest assured that your scraping tasks are performed consistently and reliably day in and day out, regardless of challenges that may arise.

₹25,000 INR in 7 days

5.0

(45 reviews)

5.9

@vijaychouhan

Hello! With extensive experience in web scraping and Python development, I am confident in delivering a reliable, efficient, and ethical solution tailored to your needs.

Proposed Approach:

1. Target Site Analysis:
-> Analyze the structure of the 20–30 news sites (e.g., CNN, BBC, Reuters, and tech news sites) to understand their layouts, URLs, and data points to scrape.
-> Use libraries like BeautifulSoup, Scrapy, or Selenium for scraping depending on site complexity.

2. Web Scraping Implementation:
-> Develop custom scraping scripts to extract headlines and main article content from each site.
-> Implement error handling and retry mechanisms to ensure robustness against connectivity issues or unexpected changes in website structure.

3. Ethical Scraping Practices:
-> Respect the websites’ terms of service and implement rate-limiting and delays where necessary.
-> Use proxies and user-agent rotation to prevent IP blocking and avoid detection.

Thanks
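The rate-limiting and terms-of-service points in this bid could look roughly like the sketch below. It uses the standard library's `urllib.robotparser` to honor a site's robots.txt and a small per-host limiter to space out requests. The `HostRateLimiter` class, the `news-bot` user agent, and the example URLs are hypothetical, not taken from the bid.

```python
import time
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser


def build_robots(robots_txt: str) -> RobotFileParser:
    """Parse the text of a robots.txt file that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


class HostRateLimiter:
    """Enforce a minimum interval between requests to the same host."""

    def __init__(self, min_interval: float, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = {}  # host -> time of the most recent request

    def wait(self, url: str) -> float:
        """Sleep if the host was hit too recently; return the seconds waited."""
        host = urlparse(url).netloc
        waited = 0.0
        last = self._last.get(host)
        if last is not None:
            remaining = self.min_interval - (self._clock() - last)
            if remaining > 0:
                self._sleep(remaining)
                waited = remaining
        self._last[host] = self._clock()
        return waited


# Usage sketch: check permission, then wait before each request.
robots = build_robots("User-agent: *\nDisallow: /private/\n")
limiter = HostRateLimiter(min_interval=2.0)
url = "https://example.com/world/story"
if robots.can_fetch("news-bot", url):
    limiter.wait(url)  # the first call for a host returns immediately
```

Keeping the limiter keyed by host means 20–30 different sites can be crawled in the same run without one slow site's delay holding up the others.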

₹7,000 INR in 7 days

4.4

(5 reviews)

4.6

@syedfaiq12

I have 4 years of web scraping experience in Python. I have scraped sites like Instagram and Zillow and delivered data in formats like JSON.

₹7,000 INR in 7 days

5.0

(9 reviews)

3.1

@Khelifa90

Hello, I have good experience scraping a variety of websites and cleaning data with Python. Contact me to discuss more project details. Thanks.

₹7,000 INR in 7 days

5.0

(12 reviews)

2.9

@FatmaGamalShams

I am an experienced Python developer specializing in web scraping and automation, and I propose building a robust solution to scrape headlines and article content daily from 20–30 news sites like CNN, BBC, Reuters, and Tech news platforms. Using tools like Scrapy and BeautifulSoup for static sites, and Selenium or Playwright for JavaScript-heavy pages, I will ensure accurate and reliable data extraction. To prevent IP blocking, I will implement rotating proxies and user agents while handling failures with retries or API integration where available. The data will be cleaned, normalized, and delivered in JSON format, stored locally or in a database as needed. I will schedule the scraping jobs using cron to run automatically every day and optimize performance with multi-threading or asyncio for efficiency. My solution will include error handling, easy-to-update modular code, and a fallback mechanism to ensure consistency. I have successfully completed similar projects, including news aggregation scrapers and large-scale data extraction pipelines, and I am confident I can deliver a reliable and scalable solution within the desired timeframe.
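The "asyncio for efficiency" idea in this bid can be sketched as bounded concurrency: fetch many sites at once, but never more than a fixed number at a time. This is an illustrative sketch only; `gather_limited` is a hypothetical helper, and each job is assumed to be an async callable that returns the page body.

```python
import asyncio
from typing import Awaitable, Callable, Dict


async def gather_limited(jobs: Dict[str, Callable[[], Awaitable[str]]],
                         limit: int = 5) -> dict:
    """Run every job concurrently, with at most `limit` in flight at once.

    Returns a dict mapping each job name to its result, or to the exception
    it raised, so one failing site does not abort the whole run.
    """
    semaphore = asyncio.Semaphore(limit)

    async def run(name: str, job: Callable[[], Awaitable[str]]):
        async with semaphore:
            try:
                return name, await job()
            except Exception as exc:
                return name, exc

    pairs = await asyncio.gather(*(run(name, job) for name, job in jobs.items()))
    return dict(pairs)
```

A caller would build `jobs` with one entry per news site and run it with `asyncio.run(gather_limited(jobs, limit=5))`; the limit of 5 is an arbitrary placeholder.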

₹9,000 INR in 7 days

5.0

(2 reviews)

2.0

@namitajain118

My name is Namita, and I am a Python developer with 6 years of expertise in designing and implementing web scraping solutions. I am excited to assist in building a system to gather daily headlines and articles from 20–30 news sites, including CNN, BBC, and Reuters, and deliver structured JSON data.

Approach:
- Utilize Scrapy for large-scale sites, Selenium for dynamic pages, and Requests for lightweight tasks.
- Set up cron jobs for automated daily scraping.
- Apply proxies and user-agent rotation to avoid IP bans.
- Add fallback mechanisms using APIs for seamless operation.
- Ensure the data is thoroughly cleaned and standardized.

Experience:
- Built scalable scrapers for diverse domains like news, e-commerce, and jobs.
- Skilled in error handling, scheduling, and anti-bot techniques.

Timeline: Prototype within 5–7 days, full solution in 15 days.

I can share sample data or a demo to showcase the approach. Let’s collaborate to make your project a success!

Best regards,
Namita Jain
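The cron step in this approach could be wired up along the following lines. The sketch shows a daily entry point that runs every site scraper, records failures instead of stopping, and writes one dated JSON file. The crontab line in the comment, the file paths, and the names `run_daily` and `output_path` are all hypothetical.

```python
# Hypothetical crontab entry (edit with `crontab -e`) to run the job at 06:00 daily:
#   0 6 * * * /usr/bin/python3 /opt/news/run_daily.py >> /var/log/news_scraper.log 2>&1
import json
from datetime import date
from pathlib import Path
from typing import Callable, Dict, List, Optional


def output_path(out_dir: Path, day: date) -> Path:
    """One JSON file per day, e.g. news-2024-12-01.json."""
    return out_dir / f"news-{day.isoformat()}.json"


def run_daily(scrapers: Dict[str, Callable[[], List[dict]]], out_dir: Path,
              day: Optional[date] = None) -> Path:
    """Run each site's scraper and write all results to a dated JSON file."""
    day = day or date.today()
    articles, failures = [], []
    for name, scrape in scrapers.items():
        try:
            articles.extend(scrape())
        except Exception as exc:
            # Keep going: one blocked or changed site should not lose the day's run.
            failures.append({"site": name, "error": str(exc)})
    out_dir.mkdir(parents=True, exist_ok=True)
    path = output_path(out_dir, day)
    payload = {"date": day.isoformat(), "articles": articles, "failures": failures}
    path.write_text(json.dumps(payload, ensure_ascii=False, indent=2), encoding="utf-8")
    return path
```

A real script would end by calling `run_daily` with a dict of per-site scraper functions; writing the failures into the same file gives a simple daily record of which sites need attention.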

₹3,000 INR in 7 days

5.0

(3 reviews)

2.2

@TaemAllah

Being an experienced Python developer, I believe I'm more than qualified for this project. With my proficient coding abilities, specifically in Python and Java, I’ve successfully dealt with numerous data processing and scraping projects in the past. My approach to web scraping is not just limited to extracting the data you need but also covers ethical practices, efficient error handling and retries, and integration with APIs as a backup plan if scraping fails. My knowledge of data cleaning and normalization is crucial when it comes to providing you with clean and structured data in JSON format. This means that the information you receive from me will be easy to work with, saving you time and effort in organizing it later. Moreover, my mastery of using proxies will prevent any IP blocking issues that may arise during the web scraping process. Finally, I'm familiar with scheduling tasks using cron, which can help automate your daily flash scraping requirement from major news sites. As I've extracted information from around 200 different websites already, scraping content from approximately 20-30 news sites won't just be a task for me but rather one of my specialties. Consider me your reliable Python freelancer who'll ensure your headlines and articles are scraped swiftly and accurately every single day.
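As a rough illustration of what "data cleaning and normalization" into structured JSON might involve, the sketch below decodes HTML entities, normalizes Unicode, collapses whitespace, and converts timestamps to UTC. The field names (`site`, `headline`, `content`, `published`) are assumptions for the example, not a schema from the project, and timestamps without an offset are assumed to be UTC.

```python
import html
import json
import re
import unicodedata
from datetime import datetime, timezone


def clean_text(raw: str) -> str:
    """Decode HTML entities, normalize Unicode, and collapse runs of whitespace."""
    text = html.unescape(raw)
    text = unicodedata.normalize("NFKC", text)  # e.g. non-breaking space -> space
    return re.sub(r"\s+", " ", text).strip()


def normalize_article(raw: dict) -> dict:
    """Map one scraped record onto a consistent shape with a UTC timestamp."""
    published = datetime.fromisoformat(raw["published"])
    if published.tzinfo is None:
        # Assumption for this sketch: naive timestamps are already in UTC.
        published = published.replace(tzinfo=timezone.utc)
    return {
        "site": raw["site"].strip().lower(),
        "headline": clean_text(raw["headline"]),
        "content": clean_text(raw["content"]),
        "published_utc": published.astimezone(timezone.utc).isoformat(),
    }


sample = {
    "site": " BBC ",
    "headline": "  Markets&nbsp;rally &amp; recover\n",
    "content": "Line one.\n\n  Line two.",
    "published": "2024-12-01T10:30:00+05:30",
}
print(json.dumps(normalize_article(sample), ensure_ascii=False, indent=2))
```

Normalizing every site's output to one shape is what makes a single JSON file per day practical across 20–30 differently structured sources.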

₹1,500 INR in 1 day

5.0

(4 reviews)

1.4

@ShekharTyagi345

I am a Python developer with 8 years of experience in web scraping using Selenium, Scrapy, and Requests. I propose building a robust scraper for 20–30 news sites like CNN, BBC, and Reuters, delivering daily headlines and articles in JSON format.

Approach:
- Use Scrapy for scalable sites, Selenium for dynamic content, and Requests for lightweight tasks.
- Schedule daily scraping with cron jobs.
- Integrate proxies and user-agent rotation to prevent IP blocking.
- Implement error handling with API fallback if scraping fails.
- Clean and normalize data for consistent output.

Experience:
- Delivered scalable web scraping projects with automation and error handling.
- Expert in integrating proxies and scheduling scrapers for high-traffic sites.

Timeline: Prototype in 5–7 days, full delivery in 15 days.

I can provide a sample JSON output for review. Let’s discuss further to tailor the solution to your needs.

₹7,000 INR in 7 days

5.0

(4 reviews)

1.0

@sachinmajithia

With over two decades of experience in web development and strong proficiency in Python, I'm confident that I am the perfect match for your project needs. I have a deep understanding of web scraping, having used it extensively during my career and even implemented AI techniques such as machine learning, ensuring that I can handle the complexities of scraping from major news sites like CNN, BBC, Reuters, and Tech news sites. To further strengthen your faith in my abilities, let me tell you about some of my relevant past projects - from contributing to successful government initiatives to working with private organizations just like yours, I’ve consistently delivered high-quality results using web scraping. On top of my skills in data normalization, cleaning, and delivering data in JSON format, I’m capable of implementing error handling and retries for a smoother scraping experience. Also noteworthy is my knowledge of integrating and retrieving data from APIs when traditional scraping fails - this ensures that you'll receive accurate and timely news data every single day. I understand the importance of ethical practices in web scraping and have avoided any IP blocking through proper proxy usage. Finally, my expertise in scheduling jobs using tools like cron would help streamline the entire scraping process.

₹7,000 INR in 7 days

3.5

(3 reviews)

0.8

@ippililokesh72

I believe that with my expertise in Python development and web scraping, I can deliver a high-quality solution that meets your needs. I'm excited about the opportunity to collaborate on this project and look forward to discussing it further.

Technical Vision:
- Programming Language: Python
- Libraries/Frameworks: BeautifulSoup, Requests, Scrapy (if needed)
- Database Technology: MS SQL or MongoDB
- Deployment: Cloud-based solution (AWS, Azure) for scalability.

₹7,000 INR in 7 days

0.0

(0 reviews)

0.0

@MAtHCodEiuhiu

During my school years, I not only systematically studied core courses such as statistics, data mining, and machine learning, but also actively participated in multiple data analysis projects. From data collection, cleaning, analysis to report writing, I am proficient in using Python including Pandas, NumPy, SciPy and other libraries, R language, and SQL for data processing and analysis, effectively extracting valuable information and providing scientific basis for decision-making. My carefulness and rigor ensure the accuracy and timeliness of every analysis.

₹7,000 INR in 7 days