Does Google Index Content in "The Cloud" (Amazon S3, etc)

May 14, 2008 - 7:56 am 1 by

Cloud computing is becoming more and more popular amongst webmasters and site owners. In short, companies like Amazon, RackSpace, Google and others are offering hosting services where you upload your content (html, images, videos, pdfs, etc.) to a web server, that web server then replicates that content onto other web servers - so if you think about it, your content is not just on one server, with limited resources and bandwidth, but on dozens (or more) of servers with virtually unlimited bandwidth and resources.

Duplicate content issue? Nope. There is only one URL for that content (unless you generate multiple URLs for the same content yourself) but Amazon S3, for example, doesn't create a duplicate content issue.

One webmaster at WebmasterWorld is complaining that Google Image search doesn't seem to be indexing the images he has hosted over at Amazon S3. But honestly, I think it is just a timing issue for him.

If you conduct a site command on site:s3.amazonaws.com, the location of the S3 content, you will find hundreds of thousands of results returned. If you conduct the same site command search at Google Image search, you find many images from S3 included in the Google Image Search index.

So, it does appear Google is indexing content in the cloud. Specifically from Amazon S3. Does something have to happen on the Amazon side for Google to index your content? I personally cannot find any hints to Amazon blocking any content from search engines on the technical docs or the FAQs. So maybe it is just a timing thing?

Forum discussion at WebmasterWorld.

 

Popular Categories

The Pulse of the search community

Search Video Recaps

 
- YouTube
Video Details More Videos Subscribe to Videos

Most Recent Articles

Search Forum Recap

Daily Search Forum Recap: February 21, 2025

Feb 21, 2025 - 10:00 am
Search Video Recaps

Search News Buzz Video Recap: Google Ranking Volatility, In-Content Learning, Google AI With Ads, Local & More

Feb 21, 2025 - 8:01 am
Google Ads

Google Response Search Ads (RSAs) Second Headline In Sitelinks & More

Feb 21, 2025 - 7:51 am
Google

Google Hotel Results Tests Book With Official Site Box

Feb 21, 2025 - 7:41 am
Bing Search

Bing Copilot AI Answers Tabbed Carousel Card

Feb 21, 2025 - 7:31 am
Google Ads

Google Ads To Stop Placing Your Ads On Parked Domains By Default

Feb 21, 2025 - 7:21 am
Previous Story: In 2008, Is The NoArchive Tag a Red Flag in SEO?