[mdlug] vWorker Project: Extract Data from a PDF

Mark Thuemmel ldaphelp at thuemmel.com
Wed Sep 19 21:29:06 EDT 2012


On 09/19/2012 08:42 PM, gib at juno.com wrote:
> I saw the following in the http://vWorker.com site.  It made me think about the command we saw in our Mug.org meeting this month.  Remember?  It just seems to me someone could get a quick $850 by taking that command and a bit of grep magic to build this solution. Too bad they want "Visual Studio .net".  I would guess the command source code could be used as a template for the solution.
>
>   Extract Data from PDF's
> 	TITLE: Extract Data from a PDF Technology: Visual Studio .net Deadline: 3 weeks. Overview We need to download PDFs from a public source of data and extract key information from the PDFs. We will be able to provide specific text parameters to search for and require the sentences/statements that follow those text parameters to be extracted and written to a database table. Outputs We require the source code so that we can make minor edits ourselves. NOTE: We do NOT need a sophisticated front end as the code will be inherited by programmers able to make the ... (see project for full description)
> 	
>
>      	By: Jason Ed Co  (0 ratings). Viewed 1199 times since Sep 16, 2012 2:55:36 PM EDT
>      	Estimated size: $500 - $4,999.99. Payment model: Fixed-price. Sourcing type: Outsourcing.
>      	Phase: Bidding open. Max bid: $850.00 USD.
>      	

Unless the PDF is an "image" and does not actually contain anything but 
pixels.



More information about the mdlug mailing list