Scott on Writing

Musings on technical writing...

Google Base - Search Crap Easier!

The 'blogosphere' is abuzz with Google's latest pre-pre-Alpha release, Google Base (the first pre- is in there because the product isn't even released yet, save for http://base.google.com, which is up sometimes, down othertimes, and doesn't really do anything other than generate a fevered pitch among bloggers). Google Base. From a good ArsTechnica article on this pre-pre-Alpha service, “ Google Base is Google's database into which you can add all types of content. We'll host your content and make it searchable online for free.” In a nutshell, purportedly this service will allow Sally Housecoat and Joe Meatball to upload and add their own content to Google Base, which is sort of going to be this online database.

To me, it appears as if the aim of Google Base is to be that big ol' database in the sky. “Have some data? Go ahead and stick up in that cloud up there.  Need to search that data? Here are the web service APIs, knock yourself out.” Users will be able to tag their uploaded information with labels, geographic information, and other categories, and the results, one would imagine, would be integrated into other Google services, such as Google Maps, Froogle, Google Local, and so on. (For example, one could say, “Show me all garage sales going on in my zip code in the next two weeks.”)

Some are calling it the craigslist and eBay killer; some see this as the potential end of classified ads in newspapers.

I am not so excited about this product for a couple of reasons. First, it's pre-pre-Alpha... If this is at all like the Google Reader released a month ago or so, then even when Google Base is officially released, it'll still need a lot of work. Second, and more importantly, they're allowing users to add to this database. If they just let any ol' person add any old thing, the quality of Google Base will quickly approach zero. Look at USENET - except in moderated or very specific forums visited only by a small sect of people, a large percentage of the stuff posted there is crap. Google Base will need some sort of moderation or community involvement that will keep this data pure. And how many people are going to keep using Google Base when they do a search for garage sales in their area and show up only to find that they moved it to next weekend, but forgot to update Google Base?

Let's just say I'm pretty pessimistic when it comes to any service that basically trusts the general public to add to their catalog, and I'd hope Google would know this better than anyone else... a-hem. Let's call this the Scott Mitchell theorem: The quality of any piece of information is inversely proportional to how many people contribute to it.”

posted on Wednesday, October 26, 2005 2:41 PM

Feedback

# re: Google Base - Search Crap Easier! 10/27/2005 9:08 AM Juan Pablo

"The quality of any piece of information is inversely proportional to how many people contribute to it.”

Doesn't Wikipedia refute that theorem ?

# re: Google Base - Search Crap Easier! 10/27/2005 9:13 AM Scott Mitchell

Juan, the quality of Wikipedia is subjective, IMO. Yes, there are some great entries that are very informative and correct, but there are some that are pretty weak.

Here's an article discussing some of the quality issues in Wikipedia.
http://www.theregister.co.uk/2005/10/18/wikipedia_quality_problem/

Let me ask you this - if you were in a contest where you could win a million dollars for answering one question correctly, and you could have EITHER a copy of a published encyclopedia or access to the Wikipedia site to use a reference, which one would you choose?

# re: Google Base - Search Crap Easier! 10/27/2005 9:04 PM Jake

Craigslist is moderated by each community and it's a great site. Granted, you do find a lot of crap on Craigslist but there's some great stuff too. If Google allows the community to moderate Google Base then I think it may work.

# re: Google Base - Search Crap Easier! 12/9/2005 8:58 AM Justin

That's like saying "the internet is useless because it isn't moderated." Yet, Google search finds relevant results.

# re: Google Base - Search Crap Easier! 12/9/2005 9:01 AM Scott Mitchell

I think you make my point, Justin, seeing as 99% of the content on the Internet is crap. That is, for every informational, interesting, factual site, there's 99 sites like this - http://www.angelfire.com/super/badwebs/

:-p

Title:  
Name:  
Url:
Protected by Clearscreen.SharpHIPEnter the code you see:
Comments   

My Links

Ads Via DevMavens

Archives

Post Categories

 

I am a Microsoft MVP for ASP.NET.
I am an ASPInsider.
<March 2010>
SMTWTFS
28123456
78910111213
14151617181920
21222324252627
28293031123
45678910

Comment Stats

DayTotal% of Total
Sunday 2056.8%
Monday 42514.1%
Tuesday 51917.2%
Wednesday 55618.4%
Thursday 58019.2%
Friday 54718.1%
Saturday 1886.2%
Total 3020100.0%

Hour1Total% of Total
12:00 AM 782.6%
1:00 AM 812.7%
2:00 AM 682.3%
3:00 AM 822.7%
4:00 AM 692.3%
5:00 AM 1264.2%
6:00 AM 1193.9%
7:00 AM 1816.0%
8:00 AM 1926.4%
9:00 AM 1585.2%
10:00 AM 1886.2%
11:00 AM 1936.4%
12:00 PM 2016.7%
1:00 PM 1846.1%
2:00 PM 1695.6%
3:00 PM 1354.5%
4:00 PM 1153.8%
5:00 PM 1073.5%
6:00 PM 1013.3%
7:00 PM 1073.5%
8:00 PM 923.0%
9:00 PM 882.9%
10:00 PM 913.0%
11:00 PM 953.1%
Total 3020100.0%

Comments by Blog Entry Date/Time

Day Entry MadeAvg.Total
Sunday 5.00160
Monday 4.80384
Tuesday 4.04477
Wednesday 7.39680
Thursday 6.26676
Friday 5.07466
Saturday 4.78177
Total 5.403020

Hour1 Entry MadeAvg.Total
12:00 AM 5.2937
1:00 AM 1.002
5:00 AM 0.000
7:00 AM 3.8550
8:00 AM 3.72134
9:00 AM 6.06297
10:00 AM 5.63276
11:00 AM 4.22194
12:00 PM 6.16351
1:00 PM 3.09133
2:00 PM 4.89230
3:00 PM 7.67322
4:00 PM 4.00108
5:00 PM 6.07170
6:00 PM 4.64116
7:00 PM 8.95188
8:00 PM 8.63164
9:00 PM 5.00115
10:00 PM 6.31101
11:00 PM 4.5732
Total 5.403020

Learn More About Comment Stats
1 - All times GMT -8...


Blog Stats

Favorite Web Sites

My Books

My MSDN Articles