I need to do a very lightweight screen scrape, parse, find a url, then scrape that page for a stock number.
See page1 and page2 in the samples for reference:
Regexp should parse out the links preceeded by a "nopicture" image, and follow the link to find it's stock number (easily parsed on page2). It would then skip past the link with a named picture, to find the next "nopicture".
The end result, should be a script that lists all of the product names, in their respective categories, whether or not they have a photo and their corresponding stock#s. It should also have a querystring toggle to only show the items needing photos. I also expect it to be flexible with the surrounding html content, as it may change slightly from time to time. I'd prefer well-documented variables be used for static content, so it can be tweaked in the future if need be.
## Deliverables
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
4) Developer guarantees the product and all components to be free of viruses, "spyware" or any other type of remote communication, "calling home", timebomb or other malicious data modification. Any type or remote communication to any other server or service must be authorized by the buyer.
## Platform
Windows 2000/2003, IIS, ASP.NET, coded in C#. I expect clean and documented code. For example:
See the "linksExpression" for a good example of what I consider well-docuemented regexp: <[login to view URL]>