Web Wiz - Green Windows Web Hosting

  New Posts New Posts RSS Feed - Web Developer needed
  FAQ FAQ  Forum Search   Events   Register Register  Login Login

Web Developer needed

 Post Reply Post Reply Page  <123>
Author
NPRI View Drop Down
Newbie
Newbie
Avatar

Joined: 17 November 2004
Location: United Kingdom
Status: Offline
Points: 13
Post Options Post Options   Thanks (0) Thanks(0)   Quote NPRI Quote  Post ReplyReply Direct Link To This Post Posted: 11 January 2005 at 6:55pm
Thanks for that. I have looked into the Google option but have decided that that isnt the way to go as there are way to many commercial sites on there. The whole reason for setting up the paranormal search engine was so that the visitors will get 100% relevent search results.
 
I have been checking out my servers and apparently I can have Mysql installed.
 
I know that this is a long shot but as my good ol mum always says "If you dont ask you dont get"
Back to Top
Gullanian View Drop Down
Senior Member
Senior Member
Avatar

Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
Post Options Post Options   Thanks (0) Thanks(0)   Quote Gullanian Quote  Post ReplyReply Direct Link To This Post Posted: 11 January 2005 at 8:33pm
You might find a developer to help with small things for free, but as I understand writting a web crawler that parses out all relevant information is pretty difficult and resource consuming.
Back to Top
dpyers View Drop Down
Senior Member
Senior Member


Joined: 12 May 2003
Status: Offline
Points: 3937
Post Options Post Options   Thanks (0) Thanks(0)   Quote dpyers Quote  Post ReplyReply Direct Link To This Post Posted: 11 January 2005 at 11:58pm
Actually, writing a web crawler isn't a big thing. I've done them in perl, c++ and java. The perl one was a dog though as they really need to be multi-threaded and that wasn't an option at the time. IIRC, I wrote the java one as part of a java 101 type exercise in threading.
 
You could probably do a .net one if you can handle the web server timeout issue. My java crawler ran through an app server, not a web server and the pearl and c++ ones hit outbound tcp ports directly, didn't go through a web or an app server.
 
The issue isn't so much one of crawling pages, but what do you do with the data on those pages. What words and phrases do you extract? How do you id content instead of something like java script. Do you ignore meta tags? Comments? Finally, how do you rationalize the data into a db that allows you the ability to scan it fast with some degree of relevance to the search term.

Lead me not into temptation... I know the short cut, follow me.
Back to Top
Phat View Drop Down
Senior Member
Senior Member


Joined: 23 February 2003
Status: Offline
Points: 386
Post Options Post Options   Thanks (0) Thanks(0)   Quote Phat Quote  Post ReplyReply Direct Link To This Post Posted: 12 January 2005 at 5:37am
In other words just use google.
Back to Top
Gullanian View Drop Down
Senior Member
Senior Member
Avatar

Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
Post Options Post Options   Thanks (0) Thanks(0)   Quote Gullanian Quote  Post ReplyReply Direct Link To This Post Posted: 12 January 2005 at 8:55am
I think you missunderstood me Wink I said that writting one that parses information is difficult Smile
Back to Top
dpyers View Drop Down
Senior Member
Senior Member


Joined: 12 May 2003
Status: Offline
Points: 3937
Post Options Post Options   Thanks (0) Thanks(0)   Quote dpyers Quote  Post ReplyReply Direct Link To This Post Posted: 12 January 2005 at 9:18am
Embarrassed

Lead me not into temptation... I know the short cut, follow me.
Back to Top
Bluefrog View Drop Down
Senior Member
Senior Member


Joined: 23 October 2002
Location: Korea, South
Status: Offline
Points: 1701
Post Options Post Options   Thanks (0) Thanks(0)   Quote Bluefrog Quote  Post ReplyReply Direct Link To This Post Posted: 12 January 2005 at 10:58am
Originally posted by Gullanian Gullanian wrote:

I think you missunderstood me Wink I said that writting one that parses information is difficult Smile


Not difficult if you are really lazy and only return crap results. Rocket Science if you return relevant results...

3 expensive technologies: Search, Compression, Encryption. Those core things are all very difficult and not within the scope of a single developer. They require large teams to come up with something good, or a LOT of time.

Back to Top
NPRI View Drop Down
Newbie
Newbie
Avatar

Joined: 17 November 2004
Location: United Kingdom
Status: Offline
Points: 13
Post Options Post Options   Thanks (0) Thanks(0)   Quote NPRI Quote  Post ReplyReply Direct Link To This Post Posted: 12 January 2005 at 11:52am
Hi guys,
             Thanks for all your input on this. I guess I had better qualify this a bit more. The sider is needed to index meta tags and links on the spidered sites. The ranking system does not have to be too far fetched, my idea was to have it rank pages by keywords in the meta tags and by the ammount of internal and external links, however if this is too much then ranking just by the keywords in the meta tags will do just fine.
 
All the best,
Neal
Back to Top
 Post Reply Post Reply Page  <123>

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 12.08
Copyright ©2001-2026 Web Wiz Ltd.


Become a Fan on Facebook Follow us on X Connect with us on LinkedIn Web Wiz Blogs
About Web Wiz | Contact Web Wiz | Terms & Conditions | Cookies | Privacy Notice

Web Wiz is the trading name of Web Wiz Ltd. Company registration No. 05977755. Registered in England and Wales.
Registered office: Web Wiz Ltd, Unit 18, The Glenmore Centre, Fancy Road, Poole, Dorset, BH12 4FB, UK.

Prices exclude VAT at 20% unless otherwise stated. VAT No. GB988999105 - $, € prices shown as a guideline only.

Copyright ©2001-2026 Web Wiz Ltd. All rights reserved.