WE NEED A SERVER SIDE SITES GRABBING MODULE WHICH SHOULD BE ABLE TO DO AUTOMATICALLY THE FOLLOWING OPERATIONS:
1. GRABBING HTML CODE AND PICTURES (the way teleport does it) FROM URLS THAT ADMIN CAN SPECIFY in the backend - IT IS SITE GRABBING.
2. STORING THIS CODES IN THE SERVER as we will use them on other sites.
3. PARCING HTML CODE IN ORDER TO FIND SIMILARITIES AND GROUP THE SAME PAGES OF THE SITE BY VERSIONS AND DELETEING UNNECESSARY PARTS OF CODE SUCH AS PHONE NUMBERS AND REPLACING EMAIL ADDRESSES.
4. Scan links to specified domain in order to determine best version.
5. ADD HTML AND JAVA SCRIPT CODES IN THE HEADER – RIGHT SIDE – END OF THE BODY TEXT AND FOOTER OF EACH PAGE ACCOURDING TO OUR INSTRUCTION. GENERATE .HTACCESS FILE ACCOURDING TO OUR INSTRUCTION.
6. Upload main version and old ones to the domain hosting account via ftp - .
7. CREATE ADMIN INTERFACE IN ORDER FOR THE ADMIN TO BE ABLE TO HELP MODULE OR CORRECT – TO SEE GRABBING PROGRESS, TO START REGRABBING IN CASE OF A FAILURE .
WE HAVE detailed instruction for each task.
Did you do before something similar or as complex as this ? can you show me the links.
WE NEED TO talk to you on the phone .