Sphider: A Free PHP Search Engine For Your Website

March 7th, 2008 · No Comments

Sphider - A Free PHP Search EngineI’ve just installed Sphider, a free PHP search engine, on this very website. I thought that Wordpress’ search function lacked a lot of features like indexing content outside the blog itself, advanced search capabilities, etc.

I used to use phpDig a lot but it’s been a long time since an update has been made available. While it’s (or used to be) a very good search engine, it was kind of tricky to get it back on track if it got messed up.

The installation is pretty easy so I won’t go into details about this here. I got this thing working within 10 minutes and I gotta say it can crawl a site really fast. It’s even better if you use the command line rather than your browser to initiate the crawling process.

Sphider also has the capability of indexing Acrobat Reader files (.pdf) as well as Microsoft Office documents (.doc) if you have the right converters installed. To convert PDF documents to text on Linux, you will need the pdftotext utility bundled with Xpdf.

Unfortunately I found out that the catdoc project, the MS Word to text converter, has been dead for quite a while and that even though you can still find it on the web, it won’t be able to convert documents created with Office XP and 2003. I know that there’s a command line utility that comes with OpenOffice but since I don’t have X installed, I couldn’t try. And I don’t plan to install OpenOffice on my server just for that purpose.

If you find a good text converter for MS Word documents, let us know!

0 responses so far ↓

There are no comments yet...Kick things off by filling out the form below.

Leave a Comment




Posted in Articles · PHP · Tutorials | No Comments

Dedicated Servers
 
VPS
Website Hosting
 

Recent Comments

Recent Webmasters

Hosting Type :
Monthly Price :
Storage :
Transfer :
Sort By :
Search