Create a webform that accepts 2 required values (e-mail and website URL), 2 optional values that must be entered together (username and password), and an 'execute' process button.
For example, the e-mail address provided is <john@[login to view URL]> and the URL is:
<[login to view URL]>.
Username: Jeffery_Fac7s Password: H%bYq@)
All e-mail and URL values should undergo basic validation. The username and password fields must accept special characters.
After entering both required values, the user clicks the 'execute' button.
Also create a web API that accepts the same four values.
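The input rules above could be checked along these lines. This is only a minimal sketch: the function name `validate_work_order`, the error strings, and the deliberately loose e-mail pattern are assumptions for illustration, and `example.com` stands in for the real domain.

```python
import re
from urllib.parse import urlsplit

# Deliberately loose pattern (an assumption): one "@", no whitespace,
# and at least one dot in the domain part.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate_work_order(email, url, username="", password=""):
    """Return a list of error messages; an empty list means the input passes."""
    errors = []

    if not EMAIL_RE.match(email or ""):
        errors.append("invalid e-mail address")

    try:
        parts = urlsplit(url or "")
        url_ok = parts.scheme in ("http", "https") and bool(parts.hostname)
    except ValueError:
        # urlsplit raises ValueError for some malformed hosts.
        url_ok = False
    if not url_ok:
        errors.append("invalid website URL")

    # Username and password are optional, but must be supplied together.
    # No character filtering is applied, so special characters are accepted.
    if bool(username) != bool(password):
        errors.append("username and password must be entered together")

    return errors
```

The same function could back both the webform and the web API, so the two entry points would reject the same inputs.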
Store email address
Crawl website URL (<[login to view URL]>) with no depth limit within the domain ([login to view URL])
The crawler must also be able to enter password-protected areas using the supplied username/password credentials, whether they are requested through a login text box or passed within the URL (<http://username:password@[login to view URL]>)
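One detail worth noting for the credentials-in-URL form: a password such as `H%bYq@)` contains `%` and `@`, which have to be percent-encoded before they can sit inside a URL. The sketch below shows one way to do that with the standard library; the helper names `embed_credentials` and `extract_credentials` are hypothetical, and `example.com` is a placeholder domain.

```python
from urllib.parse import quote, unquote, urlsplit, urlunsplit


def embed_credentials(url, username, password):
    """Return url with percent-encoded credentials placed before the host."""
    parts = urlsplit(url)
    # safe="" forces encoding of every reserved character, including "@" and ":".
    userinfo = f"{quote(username, safe='')}:{quote(password, safe='')}"
    host = parts.hostname or ""
    if parts.port is not None:
        host = f"{host}:{parts.port}"
    return urlunsplit(
        (parts.scheme, f"{userinfo}@{host}", parts.path, parts.query, parts.fragment)
    )


def extract_credentials(url):
    """Return (username, password) decoded from a URL, or None if absent."""
    parts = urlsplit(url)
    if parts.username is None:
        return None
    return unquote(parts.username), unquote(parts.password or "")


protected = embed_credentials("http://example.com/private", "Jeffery_Fac7s", "H%bYq@)")
print(protected)
print(extract_credentials(protected))
```

This simple version does not handle IPv6 literal hosts, and it only covers credentials carried in the URL; a login form rendered as a text box would need separate handling.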
Convert HTML, images, CSS, and script output (PHP/XML) into PDF. In other words, generate a PDF 'snapshot' of what a browser would display.
Combine all these PDF pages into a single PDF document.
Name the file <[login to view URL]>-<mm-dd-yyyy>-<24hr:min:sec>.pdf
Upload the PDF document via FTP to the supplied web server.
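The naming pattern could be produced as in the sketch below. The first component of the name is not visible in this posting, so it is passed in as a hypothetical `site_label` parameter rather than guessed; `snapshot_filename` is likewise an illustrative name.

```python
from datetime import datetime


def snapshot_filename(site_label, when=None):
    """Build '<site_label>-<mm-dd-yyyy>-<24hr:min:sec>.pdf'."""
    when = when or datetime.now()
    date_part = when.strftime("%m-%d-%Y")   # mm-dd-yyyy
    time_part = when.strftime("%H:%M:%S")   # 24-hour clock
    return f"{site_label}-{date_part}-{time_part}.pdf"


print(snapshot_filename("example.com", datetime(2009, 3, 7, 14, 5, 9)))
# prints: example.com-03-07-2009-14:05:09.pdf
```

Colons are valid in Linux file names but not on Windows, so a Windows deployment would likely need a different time separator.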
If work order entered through webform:
Generate retrieval URL
Send the retrieval URL to the stored e-mail address (<john@[login to view URL]>) originally provided in step 1, with a unique transaction number in the subject line and body.
If work order entered through API:
Return the PDF payload over the open HTTP connection.
In case of timeout, fall back to email delivery described above.
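The e-mail delivery used by the webform path (and by the API timeout fallback) might be assembled as follows. This is a sketch only: the sender address, the subject wording, the function name `build_retrieval_email`, and the use of a random UUID as the transaction number are all assumptions, and actually sending the message (for example through `smtplib`) is left out.

```python
import uuid
from email.message import EmailMessage


def build_retrieval_email(to_addr, retrieval_url,
                          sender="noreply@example.com", transaction_id=None):
    """Compose the notification with the transaction number in subject and body."""
    # A random UUID is one simple way to get a practically unique number.
    transaction_id = transaction_id or uuid.uuid4().hex

    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = to_addr
    msg["Subject"] = f"PDF snapshot ready (transaction {transaction_id})"
    msg.set_content(
        f"Transaction number: {transaction_id}\n"
        f"Retrieval URL: {retrieval_url}\n"
    )
    return msg


message = build_retrieval_email("john@example.com", "http://example.com/get/abc")
print(message["Subject"])
```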
Support:
We can provide server support, a Windows or Linux (CentOS) platform, and software tools. A Linux implementation is preferred.
## Deliverables
Suggestions:
You may want to first create a complete URL list for the website.
- There is no limit to the depth of links, but any links that don't match the primary domain ([login to view URL]) will be omitted.
With the URL list in hand, open each URL in turn and then execute the conversion.
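Building the URL list could look roughly like the sketch below. It uses an iterative breadth-first queue rather than literal recursion, which avoids recursion-depth problems on large sites. The `fetch` callable is a hypothetical parameter that returns the HTML for a URL, so the crawl logic can be tried without network access; `build_url_list` and `LinkExtractor` are illustrative names, and `example.com` is a placeholder domain.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlsplit


class LinkExtractor(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def build_url_list(start_url, fetch):
    """Return every same-domain URL reachable from start_url, in crawl order."""
    domain = urlsplit(start_url).hostname
    seen = {start_url}
    queue = deque([start_url])
    ordered = []
    while queue:
        url = queue.popleft()
        ordered.append(url)
        parser = LinkExtractor()
        parser.feed(fetch(url))
        for href in parser.links:
            # Resolve relative links and drop "#fragment" parts.
            absolute = urldefrag(urljoin(url, href)).url
            parts = urlsplit(absolute)
            same_site = parts.scheme in ("http", "https") and parts.hostname == domain
            if same_site and absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return ordered


# Tiny in-memory "site" used in place of real HTTP requests.
pages = {
    "http://example.com/": '<a href="/a">A</a><a href="http://other.org/x">X</a>',
    "http://example.com/a": '<a href="/">home</a><a href="b#top">B</a>',
    "http://example.com/b": "",
}
print(build_url_list("http://example.com/", lambda u: pages.get(u, "")))
```

The `seen` set is what keeps an unlimited-depth crawl from looping forever on pages that link back to each other.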
There are some websites that may help:
URL List: <[login to view URL]>
HTML to PDF conversion:
<[login to view URL]>
<[login to view URL]>
Adobe Acrobat 9+ also provides a mechanism to convert websites into PDF files.