Building a search engine

Interesting article at Kuro5hin: bq. Without a doubt this has been one of the most absurd and strangest projects I have started so far. Not long ago the idea that I could build a search engine capable of indexing the Internet as a whole seemed so far away. Now it is becoming a reality. Without further ado I wish to announce the early release of mozdex.com an Open Search Engine. bq. Mozdex.com was dreamed up from the belief that searching should be more of a science and a factual process rather then a proprietary and secretive process. Through the beauty of open source and the hard work of the Nutch team we have been able to use Nutch build a beta test index of nearly 50 million pages. bq. What we want to do is provide a search system where you can see how the algorithm ranks pages. The ability to see incoming anchors and references to the pages gives more insight into the results. We feel that by working with an open API and Algorithm that the mass of great minds on the Internet can work together to come up with an algorithm that doesn�t lend itself so much to being cheated by �spammy� sites. The premise being that a well thought out algorithm can understand the basic tricks of the trade and more quickly react to new hacks & cheats used to "spam" indexes. This is a severe case of re-inventing the wheel but it is an interesting one. Google does publish details on its page-rank system but not the source. I don't know that this project would have the financial resources that Google does to maintain the server farm and the bandwidth but it will be interesting to see. Another search engine I like is here Kartoo

October 2022

Sun Mon Tue Wed Thu Fri Sat
            1
2 3 4 5 6 7 8
9 10 11 12 13 14 15
16 17 18 19 20 21 22
23 24 25 26 27 28 29
30 31          

Environment and Climate
AccuWeather
Cliff Mass Weather Blog
Climate Depot
Ice Age Now
ICECAP
Jennifer Marohasy
Solar Cycle 24
Space Weather
Watts Up With That?


Science and Medicine
Junk Science
Life in the Fast Lane
Luboš Motl
Medgadget
Next Big Future
PhysOrg.com


Geek Stuff
Ars Technica
Boing Boing
Don Lancaster's Guru's Lair
Evil Mad Scientist Laboratories
FAIL Blog
Hack a Day
Kevin Kelly - Cool Tools
Neatorama
Slashdot: News for nerds
The Register
The Daily WTF


Comics
Achewood
The Argyle Sweater
Chip Bok
Broadside Cartoons
Day by Day
Dilbert
Medium Large
Michael Ramirez
Prickly City
Tundra
User Friendly
Vexarr
What The Duck
Wondermark
xkcd


NO WAI! WTF?¿?¿
Awkward Family Photos
Cake Wrecks
Not Always Right
Sober in a Nightclub
You Drive What?


Business and Economics
The Austrian Economists
Carpe Diem
Coyote Blog


Photography and Art
Digital Photography Review
DIYPhotography
James Gurney
Joe McNally's Blog
PetaPixel
photo.net
Shorpy
Strobist
The Online Photographer


Blogrolling
A Western Heart
AMCGLTD.COM
American Digest
The AnarchAngel
Anti-Idiotarian Rottweiler
Babalu Blog
Belmont Club
Bayou Renaissance Man
Classical Values
Cobb
Cold Fury
David Limbaugh
Defense Technology
Doug Ross @ Journal
Grouchy Old Cripple
Instapundit
iowahawk
Irons in the Fire
James Lileks
Lowering the Bar
Maggie's Farm
Marginal Revolution
Michael J. Totten
Mostly Cajun
Neanderpundit
neo-neocon
Power Line
ProfessorBainbridge.com
Questions and Observations
Rachel Lucas
Roger L. Simon
Samizdata.net
Sense of Events
Sound Politics
The Strata-Sphere
The Smallest Minority
The Volokh Conspiracy
Tim Blair
Velociworld
Weasel Zippers
WILLisms.com
Wizbang


Gone but not Forgotten...
A Coyote at the Dog Show
Bad Eagle
Steven DenBeste
democrats give conservatives indigestion
Allah
BigPictureSmallOffice
Cox and Forkum
The Diplomad
Priorities & Frivolities
Gut Rumbles
Mean Mr. Mustard 2.0
MegaPundit
Masamune
Neptunus Lex
Other Side of Kim
Publicola
Ramblings' Journal
Sgt. Stryker
shining full plate and a good broadsword
A Physicist's Perspective
The Daily Demarche
Wayne's Online Newsletter

About this Entry

This page contains a single entry by DaveH published on May 3, 2004 10:22 AM.

Guide to Buying HDTV was the previous entry in this blog.

Sliding down the slope of Dhimmitude in Spain is the next entry in this blog.

Find recent content on the main index or look in the archives to find all content.

Monthly Archives

Pages

OpenID accepted here Learn more about OpenID
Powered by Movable Type 5.2.9