I need an application or program of any type, running on Windows or on the web, that converts the job listings on my website into an XML feed with the specifications below. An example is available at: [login to view URL]
The XML feed needs to have the following tags:
<source> (will encase everything else)
<publisher> (website/company name)
<publisherurl> (website url)
<lastBuildDate> (job posting date)
<job> (encasing the job information)
Everything from here until noted otherwise sits inside the job tags, and each value must be wrapped in a CDATA section, e.g. <title><![CDATA[Sales Executive]]></title>
<title> (job title)
<date> (job date)
<referencenumber> (job id number)
<url> (job url)
<city> (job city)
<state> (job state)
<country> (job country)
<description> (job description; please truncate to roughly 450 characters)
<salary> (job salary)
<education> (job degree)
End of the job tag. Each further job follows in its own job tag immediately afterward.
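To make the layout above concrete, here is a minimal C++ sketch of a feed writer. It is only an illustration under assumptions: the Job struct and the function names (xml_escape, cdata, truncate_text, element, build_feed) are hypothetical, dates are passed through as plain strings because no date format is specified, the utf-8 declaration is assumed, and the 450-character limit is counted in bytes.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical record type; field names mirror the tags listed above.
struct Job {
    std::string title, date, referencenumber, url, city, state, country,
                description, salary, education;
};

// Escape the characters that are not allowed in plain element text.
std::string xml_escape(const std::string& text) {
    std::string out;
    for (char c : text) {
        switch (c) {
            case '&': out += "&amp;"; break;
            case '<': out += "&lt;";  break;
            case '>': out += "&gt;";  break;
            default:  out += c;
        }
    }
    return out;
}

// Wrap text in a CDATA section. A literal "]]>" inside the text would close
// the section early, so it is split across two adjacent CDATA sections.
std::string cdata(const std::string& text) {
    std::string out = "<![CDATA[";
    std::size_t pos = 0, hit;
    while ((hit = text.find("]]>", pos)) != std::string::npos) {
        out.append(text, pos, hit - pos);
        out += "]]]]><![CDATA[>";
        pos = hit + 3;
    }
    out.append(text, pos, std::string::npos);
    return out + "]]>";
}

// Cut text to at most `limit` bytes, preferring the last space before the
// limit and never splitting a UTF-8 multi-byte sequence.
std::string truncate_text(const std::string& text, std::size_t limit = 450) {
    assert(limit > 0);
    if (text.size() <= limit) return text;
    std::size_t cut = limit;
    while (cut > 0 && (static_cast<unsigned char>(text[cut]) & 0xC0) == 0x80) --cut;
    const std::size_t space = text.rfind(' ', cut);
    if (space != std::string::npos && space > 0) cut = space;
    return text.substr(0, cut);
}

// One job field: <tag><![CDATA[value]]></tag>
std::string element(const std::string& tag, const std::string& value) {
    return "<" + tag + ">" + cdata(value) + "</" + tag + ">\n";
}

std::string build_feed(const std::string& publisher,
                       const std::string& publisher_url,
                       const std::string& last_build_date,
                       const std::vector<Job>& jobs) {
    std::string xml = "<?xml version=\"1.0\" encoding=\"utf-8\"?>\n<source>\n";
    xml += "<publisher>" + xml_escape(publisher) + "</publisher>\n";
    xml += "<publisherurl>" + xml_escape(publisher_url) + "</publisherurl>\n";
    xml += "<lastBuildDate>" + xml_escape(last_build_date) + "</lastBuildDate>\n";
    for (const Job& j : jobs) {
        xml += "<job>\n";
        xml += element("title", j.title);
        xml += element("date", j.date);
        xml += element("referencenumber", j.referencenumber);
        xml += element("url", j.url);
        xml += element("city", j.city);
        xml += element("state", j.state);
        xml += element("country", j.country);
        xml += element("description", truncate_text(j.description));
        xml += element("salary", j.salary);
        xml += element("education", j.education);
        xml += "</job>\n";
    }
    return xml + "</source>\n";
}
```

Calling build_feed with the publisher details and a list of Job records returns the whole feed as one string, which can then be written to a file.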
The application will need to find tags in a website, copy out the information within those tags, and recreate it as an XML feed.
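As a rough illustration of the "find tags and copy the information out" step, the C++ sketch below pulls the text between an opening and a closing marker out of a page that has already been downloaded. The function name extract_all and the marker strings are hypothetical. Plain string matching like this is an assumption that only holds for consistently generated pages; a real HTML parser would be more robust.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Return the text between every occurrence of `open` and the next `close`.
// Both markers must be non-empty, otherwise the search could not advance.
std::vector<std::string> extract_all(const std::string& page,
                                     const std::string& open,
                                     const std::string& close) {
    assert(!open.empty() && !close.empty());
    std::vector<std::string> found;
    std::size_t pos = 0;
    while (true) {
        std::size_t start = page.find(open, pos);
        if (start == std::string::npos) break;
        start += open.size();                       // skip past the opening marker
        const std::size_t end = page.find(close, start);
        if (end == std::string::npos) break;        // unterminated marker: stop
        found.push_back(page.substr(start, end - start));
        pos = end + close.size();                   // continue after the closing marker
    }
    return found;
}
```

For example, if job titles on the site were wrapped in a heading with a fixed class attribute, that heading's opening and closing tags could be passed as the two markers, and each returned string would become one title value.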
It would be preferable (I would like the features below, but they are not absolutely necessary) to:
1. Be able to specify a directory on a website, give it the tags to find and replace, then have it scrape the entire site for them and convert the results to the specifications above.
2. Have it be scheduled to do this automatically.
3. Have it automatically upload the XML file by FTP or HTTP POST to the specified website and directory.
4. Have it be able to do all of this very quickly or using multiple connections while not using a lot of processor/memory.
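For item 4, one possible approach is a small fixed pool of worker threads that share a list of page URLs, so the number of simultaneous connections stays bounded. The C++ sketch below is only an outline: fetch_all is a hypothetical name, and the fetch callback is a stand-in for whatever HTTP library ends up being used (nothing here performs real network access). It assumes the callback does not throw.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <thread>
#include <vector>

// Fetch every URL using at most `workers` threads. `fetch` is a hypothetical
// callback that takes a URL and returns the page body.
std::vector<std::string> fetch_all(
        const std::vector<std::string>& urls,
        const std::function<std::string(const std::string&)>& fetch,
        std::size_t workers = 4) {
    assert(workers > 0);
    std::vector<std::string> pages(urls.size());
    std::atomic<std::size_t> next{0};   // index of the next URL to claim

    // Each worker claims the next unclaimed index until none are left.
    // Workers write to distinct elements of `pages`, so no lock is needed.
    auto run = [&] {
        for (std::size_t i = next.fetch_add(1); i < urls.size();
             i = next.fetch_add(1)) {
            pages[i] = fetch(urls[i]);
        }
    };

    std::vector<std::thread> pool;
    const std::size_t count = std::min(workers, urls.size());
    for (std::size_t k = 0; k < count; ++k) pool.emplace_back(run);
    for (std::thread& t : pool) t.join();
    return pages;                       // same order as `urls`
}
```

The worker count caps both the number of open connections and the memory held by in-flight requests, which is the trade-off item 4 asks for.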
I would like to have the C++ if that is okay with you.
I can provide more details and the website address upon request.