There are many free data on the net, but nobody collect and use them. I am starting a project to collect various data on the net. Using regular expression provided by .NET, I think it is quite simple to collect web data. I have put related works on http://www.derivativepower.com and http://www.jumboguide.com