Website data extraction perl jobs
Batch OCR Text Extraction Upload an entire comic chapter (10–100+ pages) in one go. Extract text from all pages (speech bubbles, captions, annotations) into a single structured text file. Supported input formats: PNG, JPG, WEBP, and PDF. Automated Translation Use extracted text to auto-translate into the desired language using AI APIs (e.g., OpenAI GPT, DeepSeek, Google Gemini). Allow users to download the translated text file in multiple formats (e.g., DOCX, TXT). Glossary and Learning Save all character names, locations, and specific terms into a glossary database, linked with the respective manga/comic series. Utilize the glossary to ensure consistent translations across chapters and series. AI integration to learn and improve translations over time based on user feedbac...
I'm looking for a freelancer who can extract customer data from Apollo for me. The requirements are as follows: - Data Count: I'm looking for a dataset that contains between 50,000 to 100,000 customer records. - Specific Details: The data should primarily include contact information. Other customer details such as purchase history or demographic information are not required. - Delivery Format: The contact information should be delivered in an Excel format. Ideal skills for this project include data mining, Excel proficiency, and experience using Apollo's data extraction tools. Please provide a realistic timeline for data extraction and any prior experience you have with similar projects. Hi, I’m looking to gath...
I'm looking for assistance with extracting text data from PDF documents. Ideal candidate should have: - Proficient data entry skills - Experience in data extraction from PDF - Attention to detail to ensure accuracy - Ability to meet deadlines
...establish a system that not only stores these invoices, but also processes and analyzes them. Key requirements include: - **Data Extraction**: The system should be capable of pulling out relevant information from the invoices. - **Automatic Categorization**: Invoices should be sorted into appropriate categories automatically. - **Validation Checks**: The system should perform checks to validate the information extracted from the invoices. Invoices come in various formats, including PDF, Image files (JPEG/PNG), and Excel/CSV. Ideal candidates for this project should have: - Extensive experience with AWS and invoice management systems. - Strong skills in data extraction and automatic categorization. - Ability to set up comprehensive validation checks. - Fa...
I am currently facing some issues with my Python script that's used for web scraping text data. The script needs debugging and optimization to ensure smooth extraction of data. Ideal Skills: - Python - Web Scraping - Debugging - Data Extraction Experience: - Prior experience with Python-based web scraping - Proven track record of fixing bugs in Python scripts - Familiarity with text data extraction Please provide examples of similar projects you have completed.
Objective: Develop a Python script to extract Solana contract addresses from screenshots. Input: Screenshots containing text overlays that include Solana contract addresses. Output: A list of extracted Solana contract addresses printed to the console. Requirements: Proficient in Python programming. Strong understanding of image processing techniques (e.g., OCR, image manipulation). Experience with extracting specific text from images (e.g., using libraries like OpenCV, Pytesseract). Familiarity with the Solana network and the format of Solana contract addresses. Exclusions: The script should not handle QR codes or photographs. The focus is solely on extracting addresses from screenshots with text overlays. Deliverables: A functional Python script that accurately extracts Solana contra...
I'm looking for a freelancer who can meticulously extract a large mixed data (numerical and text) table from a statistical website and transfer it into Excel without missing any details. The ideal candidate should ensure that the data is formatted in Excel in the exact same way as it appears on the website. Skills and Experience: - Proficient in data extraction and Excel - Attention to detail to ensure no information is missed - Ability to format data in Excel as per the client's request
I need a skille...need a skilled Python programmer who can help in extracting table data from PDFs and converting it into an Excel file. The tables in the PDFs have varying structures, so your ability to handle different table layouts is essential. Key requirements: - Extract tables from multiple PDFs - Convert the data into a well-structured Excel file with 4 different formats across 2 sheets - Ensure the data is accurately represented as mixed data (both numeric and text) Ideal Skills: - Proficiency in Python, with experience in data manipulation using Pandas DataFrame - Excellent attention to detail for accurate data extraction and conversion - Prior experience in handling PDFs and Excel files With your expertise, I hope to streamline...
Current System ● Backend: Utilizes the Spring Boot framework to manage application logic, APIs, and business processes. ● Frontend: ○ Mobile App: Developed using Flutter, ensuring cross-platform compatibility. ○ Web Interface: Built with Angular, providing a responsive and dynamic user experience. ● Database: PostgreSQL serves as the primary data storage system, handling all transactional data. ● Notifications: Firebase manages push notifications, keeping users informed about tender updates and other relevant activities. ● WhatsApp Integration: Twilio’s WhatsApp Business API facilitates communication and interactions via WhatsApp. ● Hosting: Deployed on DigitalOcean Cloud infrastructure, leveraging its scalability and reliability features. ---------------- Proposed Enh...
...for Image Cropping & OCR Data Extraction Description: We are looking for a Python developer to help automate the extraction of data from screenshots. The project involves two key tasks: Screenshot Cropping Script: Develop a script to crop screenshots and isolate the relevant data sections, focusing on the area below the column headers. The cropping areas may vary slightly between different screenshots. OCR Data Extraction and Excel Conversion: Use OCR tools (like Tesseract) to extract text from the cropped images. Convert the extracted data into a structured format, and save it into an Excel file, with each piece of data placed into corresponding columns. Requirements: Proficiency in Python (Pillow, OpenCV, pytesser...
I'm looking for a freelancer to help with a data entry task involving PDF files. The task primarily involves extracting text from PDFs and helping with editing and proofreading. Ideal candidates should have: - Excellent attention to detail - Strong editing and proofreading skills - Experience with data entry from PDFs - Proficiency in English
...cases, and client services. • N8N Automation: An orchestration tool for workflow automation across all platforms. • SMS & Calling Tool: A communication solution for appointment scheduling and follow-ups. • Court Record Scraper: A data extraction tool for real-time public eviction record collection. 2. Workflow Steps Step 1: Court Records Scraping and Lead Generation Objective: Automate lead generation by scraping public court eviction records. Process: 1. Use a scraper to extract eviction case details (e.g., property address, landlord, case status). 2. Enrich extracted data by appending missing contact information (emails, phone numbers) via APIs. 3. Automatically upload leads to Ligna CRM using N8N. Output: Verified, enriched leads ready for foll...
I'm seeking a modern logo for my company, "ONE" (Original Natural Extract). This company specializes in extracting natural products using CO2 extraction machines. The logo should emphasize the company name and integrate a unique graphic element. Key Requirements: - The logo should cater to a minimalist yet modern aesthetic. - It should highlight both the company name and a graphic element. - The overall design should convey a sense of innovation. Ideal Skills: - Graphic Design - Logo Design - Illustrator - Creative Conceptualization - Modern Design Aesthetics
...posted on the IGEM website. Key Tasks: - Process an Excel spreadsheet containing hundreds of project names. - Identify and extract the corresponding project team member' names (thousands) from a specific section of each project's page on the IGEM website. - Prepare a new excel spreadsheet with project name, team members and a like to their linkedin profile. Ideal skills for this project: - Experience with data processing and Excel. - Attention to detail to ensure accurate identification of project owners. - Ability to follow instructions and work independently. Link to the iGEM team pages (see for example year 2024): Deliver the extracted data in an Excel file. Please use manual extraction of team member names
I need assistance in extracting data from the website using the search term ONEU9092093. Requirements: - Get all available data from the topmost row of the search results - Paste the data into a Google Sheet, which I will provide access to Ideal Skills: - Web scraping - Proficiency in Google Sheets - Attention to detail Looking forward to your bids.
I am looking for a skilled data scraping professional with experience in extracting text content from websites. - Primary Source: The data needs to be scraped from a specific website. - Data Type: The focus is primarily on text content. - Frequency: The data needs to be scraped periodically. Ideal skills for the job would include proficiency in web scraping tools and programming languages such as Python, familiarity with data extraction from websites and understanding of web data mining. Previous experience with similar projects will be an advantage.
I'm looking for a freelancer to help with a data entry task involving PDF files. The task primarily involves extracting text from PDFs and helping with editing and proofreading. Ideal candidates should have: - Excellent attention to detail - Strong editing and proofreading skills - Experience with data entry from PDFs - Proficiency in English
I'm looking for an expert in data extraction and analysis. I need reviews and location data gathered from around 47,000 locations based on input data I provide (company name, geolocation, and address). The extracted data will be used for analysis and insights, so the successful freelancer will be highly skilled in data scraping (Python, data manipulation). They should be able to specify a reproducible method and tool for this task. The data I need includes: 1. Reviews: with details on date and reviews. 2. Additional data from the location: including opening times and popular times. The list of locations will be provided in a CSV/xlsx file. Please include in your proposal the method and tool you plan to use, along wi...
...for Image Cropping & OCR Data Extraction Description: We are looking for a Python developer to help automate the extraction of data from screenshots. The project involves two key tasks: Screenshot Cropping Script: Develop a script to crop screenshots and isolate the relevant data sections, focusing on the area below the column headers. The cropping areas may vary slightly between different screenshots. OCR Data Extraction and Excel Conversion: Use OCR tools (like Tesseract) to extract text from the cropped images. Convert the extracted data into a structured format, and save it into an Excel file, with each piece of data placed into corresponding columns. Requirements: Proficiency in Python (Pillow, OpenCV, pytesser...
I need assistance in extracting data from the website using the search term ONEU9092093. Requirements: - Get all available data from the topmost row of the search results - Paste the data into a Google Sheet, which I will provide access to Ideal Skills: - Web scraping - Proficiency in Google Sheets - Attention to detail Looking forward to your bids.
I'm in need of a Java programmer who can create a small program for me. The program's purpose is to convert a PDF file into an Excel file in a tabular format. Key Requirements: - The program...in need of a Java programmer who can create a small program for me. The program's purpose is to convert a PDF file into an Excel file in a tabular format. Key Requirements: - The program should be able to extract all data from the PDF and convert it into Excel. - The conversion should be straightforward, without any data processing or formatting applied to the Excel file. Ideal Skills: - Proficiency in Java - Experience with PDF to Excel conversion - Familiarity with data extraction from PDFs This is a simple project, but it requires attention to deta...
...ERP systems. The software will enable users to analyze technical drawings, extract relevant properties, remove logos, and save structured data for further use. **Software Objectives:** - **Logo Recognition and Removal:** Automatic detection and removal of logos using two technologies: template matching and deep learning (e.g., YOLO). If no logo is detected, the user can manually mark it. - **Property Recognition:** Automatic extraction of critical information such as material, surface finish, and tolerances. Extracted data will be highlighted in the drawing and presented in a structured format (e.g., JSON). Users can review and adjust the data manually. - **ERP Integration:** Create new articles in the ERP system **Acumatica X360** based ...
Data Analysis Project Using SQL, Excel, and Power BI Dashboard Description: I am looking for a skilled data analyst to assist with a project that involves data processing, analysis, and visualization using SQL, Excel, and Power BI. The project will require: Data Extraction & Cleaning: Extract and clean data from multiple sources using SQL. Ensure the data is accurate, complete, and properly formatted for analysis. Data Analysis in Excel: Perform statistical and trend analysis. Create pivot tables, charts, and perform calculations to uncover insights. Power BI Dashboard: Build an interactive and visually appealing Power BI dashboard. Include key performance indicators (KPIs), filters, and drill-down capabilities for detailed in...
I need a comprehensive list of Defence canteen dealer contact numbers from specific cities in Madhya Pradesh - namely Bhopal, Indore, Gwalior, and Bhind. The information needs to be meticulously compiled into an Excel sheet. Ideal Skills and Experience: - Proficient in internet research and data extraction - Excel spreadsheet skills - Ability to work independently and self-motivate - Previous experience in similar data collection tasks is a plus.
I require a skilled coder for a data extraction project focused on text data from various websites. Ideal Skills and Experience: - Proficiency in web scraping techniques and tools. - Strong coding skills, preferably in Python or similar languages. - Experience in data extraction from websites. - Attention to detail and ability to handle large volumes of data.
I'm looking for someone to help me extract text from a PDF document. Ideal Skills and Experience: - Proficient in data entry and text extraction - Experienced with PDF documents - Skilled in Microsoft Word - Detail-oriented to ensure all text is accurately transcribed
...BUDGET) For the website backend we have a short timeline, within 5 days to setup and the rest as per the project scope. Key Features: - The app should be Windows and Android compatible. - It must facilitate Single sign-on (SSO) for user authentication. - The order details management features should include automated data extraction and order history tracking, reporting, Sms alert messaging to Travellers and Whatsapp messaging regarding trip details. - Backend order management - Invoicing with auto-email, Billing, CRM - Downloading reports in excel, pdf in preferred template - Generate booking voucher with QR code - Trip Estimate feature for b2b partners with login, to see their reports, data - Live chat integration The app will fetch booking details da...
I'm in need of a tool that can take engine sound ramps (from idle to redline) and extract constant RPM sounds into looped samples. Key Requirements: - Input: RAW WAV engine sound ramps - Output: Separate files for each RPM, specifically every 500 RPM intervals Ideal Skills: - Experience with audio processing and sound en...redline) and extract constant RPM sounds into looped samples. Key Requirements: - Input: RAW WAV engine sound ramps - Output: Separate files for each RPM, specifically every 500 RPM intervals Ideal Skills: - Experience with audio processing and sound engineering - Proficiency in software development for audio tools - Familiarity with engine sound characteristics and requirements for RPM sound extraction. example:
I have a project requiring data extraction from a PDF file and input into an Excel spreadsheet. The data includes: - Text - Numbers - Tables The challenge is that each page of the PDF has a different layout. Your tasks will involve interpreting the varied formats and accurately transferring the information into a single sheet in Excel. Ideal Skills: - Proficient in data entry - Excellent attention to detail - Familiarity with Excel and PDF handling - Able to interpret varied data formats Experience: - Previous experience in data extraction and entry - Able to complete task in timely manner A big factor in winning this contest will be who gets me the data back by 5pm EST 1/19/2025 or sooner
Python and Visual Studio Code Data Processing Data Entry Web Scraping Data Mining The project involves creating a script to mine text data . Key Responsibilities: - Develop a text mining script that can efficiently extract and process data from various fields in the database. - The script should be able to handle large volumes of data and return results in a timely manner. Ideal Skills: - Proficiency in SQL and data mining techniques. - Experience with developing scripts for database text extraction. - Ability to work with large datasets.
I am in need of an experienced Mearn stack developer. Technical Requirements. * The software must be developed using React.js, Node.js, and TypeScript. Below are the key features and requirements:- File upload/ downloads and Folder Management Document Preview: * Automatic Data Extraction Search and Filters User Account Management Software Requirements. * Independence: The software must be developed fully in-house and should not rely on third-party services for core functionalities. * Responsiveness: The tool should be responsive and offer a seamless user experience across different devices and screen sizes. * UI/UX: A clean, intuitive, and user-friendly interface is essential for ease of use. The design should prioritize simplicity and efficiency in document management. Tech...
I'm looking for a Power Automate automation to help with document information extraction from PDF resumes. The automation should be able to extract: - Contact Details - Work Experience Ideal Skills: - Extensive experience with Power Automate - Proficiency in document processing automation - Familiarity with PDF file manipulation - Knowledge of data extraction techniques Please provide examples of similar projects you have completed.
...primarily phone numbers and LinkedIn profiles. - The data should be delivered in a Google Sheets format. Ideal Skills and Experience: - Previous experience in lead generation and data extraction. - Familiarity with the construction industry in Bangalore. - Proven track record of sourcing high-quality prospects. - Proficient in using Google Sheets for data organization. Requirement: 100 Profiles Industry: Construction Tools: LinkedIn, Lusha, ZoomInfo, LinkedIn Recruiter, Job Overview: We are looking for a skilled Data Extraction Specialist to source and extract contact details of Inside Sales Representatives, Telecallers, Inside Sales Managers, and Sales Executives from top construction companies in Bangalore. The extracted data will b...
...answer questions based on provided or fed data. Key Requirements: 1. Platform Compatibility: • Must run efficiently on Raspberry Pi 5 hardware. • Optimize for limited computing resources (e.g., CPU, RAM). 2. Model Features: • Ability to process audio input (speech-to-text) and visual input (camera feed for image recognition or text extraction). • Provide text-based answers derived from the fed data or context. 3. Performance: • Model should be lightweight with minimal latency. • High accuracy in understanding and generating responses based on context. 4. Input/Output Specifications: • Audio input: Use microphone or pre-recorded audio files. • Visual input: Accept image or video feed from the Pi camera. • Out...
I'm looking for a freelancer to help with a data entry task involving PDF files. The task primarily involves extracting text from PDFs and helping with editing and proofreading. Ideal candidates should have: - Excellent attention to detail - Strong editing and proofreading skills - Experience with data entry from PDFs - Proficiency in English
I need a skilled data entry operator to help extract data from text documents. Key Responsibilities: - Extracting data from various text documents sourced from web pages. Ideal Skills: - Proficient in data extraction techniques. - Experienced in working with text documents. - Familiar with navigating and extracting data from web pages. Please note, experience with web data extraction is highly preferred.
...seeking assistance to extract data from a conference website. The task involves gathering information on all attendees and their respective companies. Key Requirements: - Extract a list of attendees and their information (Name, Title, and Company) and export it into a CSV file. - Create a secondary CSV file that lists all the companies in attendance (one column: Company). The data needs to be extracted from a publicly accessible section of the conference website. The task needs to be completed before noon EST on January 21st. I will make a decision on whom to select by 9:00pm EST on January 19th. Ideal Candidate: - Previous experience with data extraction and web scraping is preferred. - Proficient in using tools like Python, Excel, or any ...
I hav...I need a freelancer with strong expertise in Python, data manipulation and AI to develop an AI model that will evaluate new loan requests and predict their approval status based on the training from this dataset. Key aspects of the project: - Data Preprocessing: The data will require normalization, handling of missing values, and feature extraction. - Model Development: Using Python, you'll build an AI model to train on the database. - Prediction: The model will assess new loan requests and indicate if they should be accepted or rejected. The ideal candidate will have extensive experience with AI model development, and will be proficient in Python, and data manipulation. Your understanding of data preprocessing and feature extra...
I need a simple BAQ coded in Epicor to report on total unreceived quantities. The report should include the following specific details: - Part number - Number of POs - Total unreceived qty - Site location In addition, the report should be able to filter by: - Date range - Supplier - Warehouse location Ideal Skills and Experience: - Proficient in coding BAQ in Epicor - Experienced in data extraction and report generation - Able to implement custom filters in reports
I'm seeking a professional with deep expertise in Puppeteer and the Google API. I am open to various tasks including web scraping, automation of tasks, and data extraction and analysis. Your role will involve employing Puppeteer for web automation and using the Google API for data management. Ideal Candidates Should Have: - Extensive experience with Puppeteer - Proficiency in using various Google APIs - Strong skills in web scraping and data analysis - Able to automate tasks efficiently - Good understanding of working with structured, text, and image data.
I'm looking for an expert in Make automation who can help me automate the process of extracting data from my one-page guest satisfaction surveys. These surveys are available as scanned pictures (jpg). Key Requirements: - Files will be placed in a Google Drive Folder and processed immediately, one by one - Reading and extracting data from the picture - Parsing different types of data fields which include text fields, number ratings, multiple choice options, and handwritten entries - Uploading the extracted data to a Google Sheet Experience with Make automation and data extraction is a must. The ideal freelancer for this job should have a keen eye for detail to ensure all data is accurately captured and transferred.
I'm looking for a skilled data extractor with LinkedIn experience. The task is to extract specific information from user profiles on LinkedIn. Key Requirements: - Extract user profiles data from LinkedIn, focusing primarily on contact information. - Deliver the extracted data in CSV format. Ideal Skills: - Proficient in data extraction and web scraping. - Familiar with LinkedIn's platform and its user profiles. - Experience in delivering data in CSV format.
I'm looking for someone to capture data from tables on website. In the "Pending Stock List 17Jan2025" excelsheet, I have listed down all the URLs from which data must be captured. The data must be captured in format as given in "Stock Data Format" excelsheet. Bonus if you deliver the script also that fetches the data. STEPS FOR UPDATING THE DATA: Go to one of the URL in the excelsheet: Pending Stock List Lets say, the URL is and the corresponding stock id is 918 Now on this URL, you will see data for 5 years Similarly, for other URLs on the same stock id, you will see data for 5 years You will create 1 row in the excelsheet Stock Data for each year for stock
I'm seeking assistance to extract email addresses from a dynamic website. I only have frontend access and this task is a one-time requirement. Ideal skills and experience for this job would include: - Proficiency in web scraping tools and techniques - Understanding of dynamic websites and their structures - Ability to extract data efficiently and accurately Please note, the task requires a thorough and meticulous approach to ensure all relevant email addresses are captured. website: (follow instructions by this video:
Please Sign Up or Login to see details.
Abstract: This research develops an anomaly detection system for sequential log data from distributed services by integrating sequential and contextual features. Sequential features capture the temporal dependencies between log entries, allowing the model to identify unusual patterns in the sequence of events that may precede an anomaly. Contextual features, such as word embeddings, are used to extract the semantic meaning from the content of the log messages, helping the model understand the context behind each log entry. This is crucial for detecting anomalies that may not be evident from temporal patterns alone, such as subtle variations in system behavior or unusual error descriptions. The inclusion of semantic meaning enables the detection of early-stage anomalies, which may ot...
I'm looking for someone to help me with manual data entry from web pages. This is a straightforward task, but it requires attention to detail and the ability to follow instructions carefully. Ideal skills and experience: - Proficient in manual data entry - Familiar with web data extraction - Detail-oriented - Able to follow instructions accurately
I'm looking for someone to help me with manual data entry from web pages. This is a straightforward task, but it requires attention to detail and the ability to follow instructions carefully. Ideal skills and experience: - Proficient in manual data entry - Familiar with web data extraction - Detail-oriented - Able to follow instructions accurately
I'm seeking a professional in web scraping to help me gather contact information for lead generation. The data will primarily be sourced from Company Websites and Industry Directories. - Specific Data: I need the following contact information: - Email addresses - Phone numbers - Website links Ideal candidates should have extensive experience in complex web scraping, with a proven track record of successfully generating leads. Proficiency in data extraction tools and software, along with excellent attention to detail, is crucial.