only for @prayogo803 Automated Extraction and Structuring of Recipes from PDF Cookbooks

Đã hoàn thành Đã đăng vào 4 tháng trước Thanh toán khi bàn giao
Đã hoàn thành Thanh toán khi bàn giao

I have a collection of approximately 50 PDF cookbooks, containing an estimated total of 13,000 recipes. Some of these cookbooks have already been split into individual recipe files, while others remain in their original format. I am seeking a freelancer to automate the process of separating the remaining recipes, using the most appropriate tools, potentially including DALL-E or other advanced OCR and text recognition software.

Scope of Work:

The selected freelancer will be responsible for the following tasks:

1. Text Recognition:

• Extract text from the recipes, even if they are embedded in images within the PDF files.

• Ensure the recognition process accurately captures all text, including any non-standard fonts or formats used in the cookbooks.

2. Recipe Identification:

• Detect and separate individual recipes, even if they span across multiple pages.

• Ensure that each recipe is fully captured, without splitting across pages unless the recipe itself does.

3. Data Conversion:

• Convert the extracted text from each recipe into a structured JSON format.

• The JSON should include fields such as Title, Title of the Book, Short Description, Ingredients, Cooking Process, Categories, and Tags.

• Categories and Tags will be provided by me.

4. Language Consistency:

• Ensure that all extracted recipes are in English. For any non-English recipes, a translation process may be necessary.

5. Database Creation:

• Input the JSON-formatted recipes into a database platform, such as Airtable, which I will provide access to.

• Each recipe entry in the database must include a unique identification number and all the relevant fields.

Deliverables:

• A fully populated Airtable database containing all the recipes, accurately categorized and tagged.

• JSON files for each recipe, stored in a systematic folder structure.

• A report detailing the process, including any challenges encountered and how they were addressed.

To ensure that we can be autonomous in future projects involving new books, it is essential that the deliverables include all the supporting files used throughout the process. This should encompass everything from initial materials to final versions, as well as any relevant documentation explaining the workflow and configurations used. These files will enable us to replicate and adapt the processes independently in the future, ensuring continuity and efficiency in our publishing projects.

Skills Required:

• Expertise in OCR technology and text extraction from PDFs, especially where text is embedded in images.

• Experience with tools like DALL-E or similar for image and text recognition.

• Strong knowledge of JSON formatting and database management.

• Familiarity with Airtable or similar database platforms.

• Fluency in English, with experience in translation if necessary.

Timeline:

Please provide an estimated timeline for completing this project, considering the volume of work involved.

Budget:

I am open to bids, but please provide a detailed breakdown of costs, including any software licenses or tools that may be required.

Application Requirements:

• Please provide examples of similar projects you have completed.

• A brief outline of the tools and methods you would use to accomplish this project.

• Your proposed timeline and budget.

Dall-E Data Collection JSON OCR Python

ID dự án: #38527493

Về dự án

36 đề xuất Dự án từ xa 4 tháng trước đang mở

Được trao cho:

prayogo803

⭐Good day⭐Thanks for your job posting because it well fits to my skills. If you want perfect result, please ping me asap. thanks, Prayogo

$250 USD trong 7 ngày
(6 Đánh Giá)
4.1

36 freelancer chào giá trung bình$530 cho công việc này

AwaisChaudhry

Hello Good evening , I hope you are doing great. Just finished reading the brief details of your job . I see you have been looking for a freelancer who has experience with JSON, Data Collection, OCR, Python and Dall-E Thêm

$750 USD trong 14 ngày
(19 Nhận xét)
7.4
ZohaibRoy

As a Python developer with over 5 years of experience, I can provide the expertise you need to automate the extraction and structuring of recipes from your collection of PDF cookbooks. I have extensive knowledge and pr Thêm

$250 USD trong 1 ngày
(58 Nhận xét)
6.8
AITSoft

Hello, Upon reading the job details I would say that all the required skills Dall-E, Python, Data Collection, OCR and JSON fall under my skills. I work on freelancer full time and I believe I can do this job if I get Thêm

$750 USD trong 13 ngày
(40 Nhận xét)
6.3
OutsourceMan

⭐⭐⭐⭐⭐OCR is one of the essential tools in our repertoire. We have extensive experience working with OCR technology, especially where text is embedded in images, using a variety of software and libraries. Additionally, Thêm

$500 USD trong 7 ngày
(7 Nhận xét)
5.8
FaizalShaik80

Hello Dear! Here is the best freelancer to automate the process of separating the recipes, using the most appropriate tools, potentially including DALL-E or other advanced OCR and text recognition software. I will shar Thêm

$1000 USD trong 11 ngày
(8 Nhận xét)
5.0
hotanloc

Hi... Nice to meet you.(OCR EXPERT) I am have full experiences in extraction numeric data from pdf or scanned image and convert this to csv file or txt file format using python automatically. In this project, we have t Thêm

$250 USD trong 3 ngày
(16 Nhận xét)
5.3
usmanhassan123

Hey Jose M C. , I just finished reading the job description and I see you are looking for someone experienced in JSON, Dall-E, Data Collection, OCR and Python. This is something I can do, Please review my profile to c Thêm

$250 USD trong 3 ngày
(9 Nhận xét)
4.6
islamamer6

Hi there, I understand the complexity and importance of accurately extracting and organizing the 13,000 recipes from your PDF cookbooks. The key challenge here lies in ensuring precise text recognition, even when reci Thêm

$2404 USD trong 7 ngày
(17 Nhận xét)
4.2
nikolas41

Hi there, How are you? I wanted to express my strong interest in your project, particularly given my extensive experience. I am confident in my ability to deliver the project to your exact specifications and to the hig Thêm

$1000 USD trong 10 ngày
(1 Nhận xét)
2.6
markor2

Hey Jose M C. I have over extensive experience in Data Collection, JSON, Dall-E, OCR and Python, so I'm super excited about the chance to work on your project-"only for @prayogo803 Automated Extraction and Structuring Thêm

$250 USD trong 7 ngày
(1 Nhận xét)
1.6
jmvdesignsohio

Hi I've read your description carefully and I am sure I can deliver the result you want. I would like to discuss your project in more detail via chat. Regards, Joseph

$500 USD trong 7 ngày
(0 Nhận xét)
0.0
jacobb1997

Dear Jose M C., I went through your project description and it seems like I am a great fit for this job. I am an expert full stack developer with 7+ years of experience in software development. With years of experie Thêm

$500 USD trong 7 ngày
(0 Nhận xét)
0.0
Intelliglance786

Hello @prayogo803, I understand that you require automation for extracting and structuring recipes from PDF cookbooks using tools like DALL-E and OCR technology. As an experienced Python developer with expertise in OC Thêm

$250 USD trong 6 ngày
(0 Nhận xét)
0.0
paul396

Hello. Expert here! I have rich experience in your project. ✔️Opportunities don't just happen. I create them.✔️ If u hire me, I will do my best and deliver perfect result. Thanks. Paul

$500 USD trong 7 ngày
(1 Nhận xét)
0.0
kevina6

Hi, I went through your project description and it seems like I am a great fit for this project. With a strong background in Python, I’m confident that I can deliver exactly what you’re looking for. The algorithms I Thêm

$500 USD trong 7 ngày
(0 Nhận xét)
0.0
sumbal26

Hi there, I can help automate the process of extracting and organizing recipes from your PDF cookbooks. I'll start by using tools like Tesseract for OCR to read and extract text from the PDFs, even from images or trick Thêm

$500 USD trong 7 ngày
(0 Nhận xét)
0.0
ehsan758

Hi there, I'm Suhaib, I hope this message finds you well. I noticed you're looking for an experienced OCR and Text Recognition Specialist—I'm ready to step in and assist. Your project involves the extraction, identi Thêm

$500 USD trong 3 ngày
(0 Nhận xét)
0.0
owenm28

Hello, dear. I read your requirements carefully and very interesting. I have rich experience about web development and can implement your requirements perfectly. Please contact me and let us discuss in detail. Best reg Thêm

$500 USD trong 7 ngày
(0 Nhận xét)
0.0
bobanp3

I'm no stranger to large-scale data extraction and I couldn't resist reaching out to your project. With over a decade of experience, I have honed my skills in Machine Learning, Deep Learning Algorithms, and Natural Lan Thêm

$500 USD trong 5 ngày
(0 Nhận xét)
0.0
enriquem29

I am writing to express my strong interest in your project. With my expertise in JSON, Dall-E, Data Collection, OCR and Python, I am sure I can deliver the best solutions and high-quality results for your needs. I g Thêm

$500 USD trong 4 ngày
(0 Nhận xét)
0.0