I have a HP Scanner / MFP which is bundled with DSS software. I can scan the file to my ftp server (Linux Server). It is in an XML file format. The XML files shows the PDF filename & meta data (CDATA) that was captured at the device. Many files will start to reside in the ftp folder the XML file along with PDF attachment. These files need to be parsed into MySQL database with the metadata populating the required fields as well as the attachment. The attachment field won't be a blob. The ftp folder need to be cleaned (deleted) after the insert into the Mysql has been parsed except the PDF file types.
This process need to be automated (perhaps cron job). Attached is a sample of the xml file.
NB. this is only a sample XML. The scanner is capable of doing OCR and that metadata could also be added into the XML.
I already have the web front end designed with searching etc.
Good day. Fancy meeting a fellow South African! I have got the skills required for this job, and I have done many parsing projects in PHP/MySQL before. This will be no challenge. See pm for details. Regards, Bennie Swart.