| Author |
Topic Search Topic Options
|
Gullanian
Senior Member
Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
|
Post Options
Thanks(0)
Quote Reply
Topic: Google query Posted: 30 October 2004 at 9:49pm |
|
I don't understand how google is allowed to:
- Have text in search results taken directly from a website
- Cache pages
- Cache images
Surely this is breaking the law? I have copyrighted my material
and Google (or any other search engine) is without permission holding
it on their servers? Not that I mind, this is more of a query.
|
 |
dpyers
Senior Member
Joined: 12 May 2003
Status: Offline
Points: 3937
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 1:02am |
Search engines, including Google have addressed this issue some time ago which is why they follow orders from robots.txt files. You also have the means to use 301 and 302 return codes to redirect old links to new content, and can use 403's, 404's, etc. to block access.
Caching pages is really no different from what my browser, or perhaps your isp does. At least google eventually expires them. Try checking a site you had a few years ago from the wayback machine at http://www.archive.org/web/web.php if you want the sins of your youth to haunt you.
|
Lead me not into temptation... I know the short cut, follow me.
|
 |
Gullanian
Senior Member
Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 7:49am |
|
Hmm ok, still seems a little grey to me though. Have there been any law suits over it or anything?
|
 |
dpyers
Senior Member
Joined: 12 May 2003
Status: Offline
Points: 3937
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 9:42am |
Wouldn't be surprised if there's been lawsuits, but who do you sue? There's been some sort of indexed search capability since at least 1986. Some cache pages some don't. Even I've written bots to search for things.
Don't forget, that the stuff that's cached by search engines, browsers, and isp's like NetZero is publically available and contains full text - including copyright notices. If it's not publically available, it's not cached.
|
Lead me not into temptation... I know the short cut, follow me.
|
 |
Gullanian
Senior Member
Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 11:54am |
|
It just sounds to me like someones written a book, and someone says I'm
going to take your book and republish it unless you have another page
in it telling me specificaly not to.
Correct me if I'm wrong, but here in the UK whatever I write is
automaticaly covered by copyright laws, and no-one can take any of it
without my permission.
|
 |
dpyers
Senior Member
Joined: 12 May 2003
Status: Offline
Points: 3937
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 4:46pm |
No one can take it and claim ownership or publish without your permission. Google is less like a book, and more like a library index.
As far as cached pages go, they are publically accessible. The only fidfference between them in by browsers cache and them in googles cache is how long they stay there.
If you have information that should not be accessible to the general public, you can either put it on a page that is not named after your server's default page, and have no links to it from any other web page - which brings up the question of why do you want it on the web. Or you can put it into a password protected area.
You don't have to let google into your site. A two line robots.txt file will block it.
|
Lead me not into temptation... I know the short cut, follow me.
|
 |
Gullanian
Senior Member
Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 6:00pm |
|
Thanks for the information, it's not something I'm trying to solve just curiosity.
|
 |
Gullanian
Senior Member
Joined: 04 January 2002
Location: England
Status: Offline
Points: 4373
|
Post Options
Thanks(0)
Quote Reply
Posted: 31 October 2004 at 6:08pm |
|
|
 |