Our project involves the following top level processes.
1. Scrape a site with PDF Files. This will require some intelligent scraping and masking process either with Proxy or randomly. We need not to get blocked.
2. A. Take the PDF and extract Text. B. Use OCR to extract Image file Text and Digits that are masked in the PDF on purpose. The PDF has both Text and Images as attached.
3. Take the results and create our JSON file format and send to the endpoint on schedule.
This project requires you to start now.
20 freelancers are bidding on average $560 for this job
Hi I can scrape the PDFs and use OCR to extract the required data. Can work on a demo if you like Relevant Skills and Experience Scrape - OCR Proposed Milestones $888 USD - Full Can work right now
Hello! We will start the project right now as you [url removed, login to view] request you please check few samples before awarding the project. Stay tuned, I'm still working on this proposal.
Have a self built scraping app which works faster than other mainstream scrapers. I can start working right now. Jus inbox for more details. Stay tuned, I'm still working on this proposal.