Scott on Writing

Musings on technical writing...

Google Base - Search Crap Easier!

The 'blogosphere' is abuzz with Google's latest pre-pre-Alpha release, Google Base (the first pre- is in there because the product isn't even released yet, save for http://base.google.com, which is up sometimes, down othertimes, and doesn't really do anything other than generate a fevered pitch among bloggers). Google Base. From a good ArsTechnica article on this pre-pre-Alpha service, “ Google Base is Google's database into which you can add all types of content. We'll host your content and make it searchable online for free.” In a nutshell, purportedly this service will allow Sally Housecoat and Joe Meatball to upload and add their own content to Google Base, which is sort of going to be this online database.

To me, it appears as if the aim of Google Base is to be that big ol' database in the sky. “Have some data? Go ahead and stick up in that cloud up there.  Need to search that data? Here are the web service APIs, knock yourself out.” Users will be able to tag their uploaded information with labels, geographic information, and other categories, and the results, one would imagine, would be integrated into other Google services, such as Google Maps, Froogle, Google Local, and so on. (For example, one could say, “Show me all garage sales going on in my zip code in the next two weeks.”)

Some are calling it the craigslist and eBay killer; some see this as the potential end of classified ads in newspapers.

I am not so excited about this product for a couple of reasons. First, it's pre-pre-Alpha... If this is at all like the Google Reader released a month ago or so, then even when Google Base is officially released, it'll still need a lot of work. Second, and more importantly, they're allowing users to add to this database. If they just let any ol' person add any old thing, the quality of Google Base will quickly approach zero. Look at USENET - except in moderated or very specific forums visited only by a small sect of people, a large percentage of the stuff posted there is crap. Google Base will need some sort of moderation or community involvement that will keep this data pure. And how many people are going to keep using Google Base when they do a search for garage sales in their area and show up only to find that they moved it to next weekend, but forgot to update Google Base?

Let's just say I'm pretty pessimistic when it comes to any service that basically trusts the general public to add to their catalog, and I'd hope Google would know this better than anyone else... a-hem. Let's call this the Scott Mitchell theorem: The quality of any piece of information is inversely proportional to how many people contribute to it.”

posted on Wednesday, October 26, 2005 2:41 PM

Feedback

# re: Google Base - Search Crap Easier! 10/27/2005 9:08 AM Juan Pablo

"The quality of any piece of information is inversely proportional to how many people contribute to it.”

Doesn't Wikipedia refute that theorem ?

# re: Google Base - Search Crap Easier! 10/27/2005 9:13 AM Scott Mitchell

Juan, the quality of Wikipedia is subjective, IMO. Yes, there are some great entries that are very informative and correct, but there are some that are pretty weak.

Here's an article discussing some of the quality issues in Wikipedia.
http://www.theregister.co.uk/2005/10/18/wikipedia_quality_problem/

Let me ask you this - if you were in a contest where you could win a million dollars for answering one question correctly, and you could have EITHER a copy of a published encyclopedia or access to the Wikipedia site to use a reference, which one would you choose?

# re: Google Base - Search Crap Easier! 10/27/2005 9:04 PM Jake

Craigslist is moderated by each community and it's a great site. Granted, you do find a lot of crap on Craigslist but there's some great stuff too. If Google allows the community to moderate Google Base then I think it may work.

# re: Google Base - Search Crap Easier! 12/9/2005 8:58 AM Justin

That's like saying "the internet is useless because it isn't moderated." Yet, Google search finds relevant results.

# re: Google Base - Search Crap Easier! 12/9/2005 9:01 AM Scott Mitchell

I think you make my point, Justin, seeing as 99% of the content on the Internet is crap. That is, for every informational, interesting, factual site, there's 99 sites like this - http://www.angelfire.com/super/badwebs/

:-p

Title:  
Name:  
Url:
Protected by Clearscreen.SharpHIPEnter the code you see:
Comments   

Add To Your Reader

My Links

Archives

Post Categories

 

I am a Microsoft MVP for ASP.NET.
I am an ASPInsider.
<May 2008>
SMTWTFS
27282930123
45678910
11121314151617
18192021222324
25262728293031
1234567

Comment Stats

DayTotal% of Total
Sunday 1866.8%
Monday 37913.9%
Tuesday 45316.7%
Wednesday 50418.5%
Thursday 53519.7%
Friday 49418.2%
Saturday 1666.1%
Total 2717100.0%

Hour1Total% of Total
12:00 AM 652.4%
1:00 AM 682.5%
2:00 AM 622.3%
3:00 AM 742.7%
4:00 AM 572.1%
5:00 AM 1033.8%
6:00 AM 1084.0%
7:00 AM 1585.8%
8:00 AM 1716.3%
9:00 AM 1475.4%
10:00 AM 1716.3%
11:00 AM 1816.7%
12:00 PM 1886.9%
1:00 PM 1696.2%
2:00 PM 1605.9%
3:00 PM 1324.9%
4:00 PM 1073.9%
5:00 PM 923.4%
6:00 PM 913.3%
7:00 PM 963.5%
8:00 PM 833.1%
9:00 PM 782.9%
10:00 PM 792.9%
11:00 PM 772.8%
Total 2717100.0%

Comments by Blog Entry Date/Time

Day Entry MadeAvg.Total
Sunday 5.54144
Monday 5.22339
Tuesday 4.28419
Wednesday 7.67637
Thursday 6.90607
Friday 5.48411
Saturday 5.33160
Total 5.842717

Hour1 Entry MadeAvg.Total
12:00 AM 5.0035
1:00 AM 1.002
5:00 AM 0.000
7:00 AM 7.0035
8:00 AM 5.35107
9:00 AM 6.32278
10:00 AM 6.47246
11:00 AM 4.41181
12:00 PM 6.88330
1:00 PM 3.00111
2:00 PM 5.41222
3:00 PM 8.64285
4:00 PM 4.0589
5:00 PM 5.92154
6:00 PM 4.52113
7:00 PM 9.67174
8:00 PM 9.80147
9:00 PM 5.05111
10:00 PM 5.4265
11:00 PM 4.5732
Total 5.842717

Learn More About Comment Stats
1 - All times GMT -8...


Blog Stats

Favorite Web Sites

My Books

My MSDN Articles