- SharePoint 2010 - 5 Hot Features to Look Forward To
- Selecting a CMS: How to Build a Short List
- Alterian Drops Immediacy, Morello Web CMS Brands
- Installing SharePoint 2010 on Windows 7
- How SharePoint 2010’s Metadata Services Increase Usability
- Gartner’s Top 5 BPM Predictions for 2010 And Beyond
- Open Text Reports Good Q2, Vignette Contributes to YoY Spike
Lucene Finds its Way to the Top
The Apache Software Foundation (ASF) has recently reclassified the Lucene search engine project from a Jakarta sub-project to a top-level ASF effort.
Lucene is a full text search engine that provides an API and a set of libraries enabling powerful search functionality to be included in all types of Java applications. Doug Cutting is the project's primary developer.
Lucene is offered as a developer toolkit, and requires a certain amount of Java development to implement or integrate a functional search solution.
As an example, for web search, a developer would need to write their own web site spider that populated the Lucene index with Lucene documents.
On the retrieval side, the developer would then need to provide a form handler and query parser that called into the Lucene API for search hits and formatted the results for web presentation.
Given this, its best to think of Lucene as a developer resource and not as a ready to run search engine.
There are several ports of Lucene to other languages. Of note are DotLucene (C# .NET) and Plucene (PERL).
Plucene is currently used by Technorati, is embedded in the Eclipse IDE, and is part of www.furl.com's tools.
2 Reader Comments
Leave a Response
From our Job Board View all jobs
|
Jobs RSS feed
| Post a job right now
- Web Dev Badass at InterWorks
- Front-end Engineer at isocket
- Platform Architect at MyWire
- IT Business Development Manager / Sales Executive at ISIS Papyrus
- Product Support Engineer at Digitech Systems
- UI Designer at Mochi Media
- SharePoint Developer at Metalogix
- Database Kernel Architect / Technical Lead at Quantivo
Featured Events View all events
|
Events RSS feed
| Add your event
- Feb 17, 2010 – Webinar: 4 Essential Strategies for Advancing Your Website's Business Impact
- Feb 26, 2010 – Intelligent Content 2010
- Apr 21, 2010 – Drupalcon San Francisco 2010
- May 5, 2010 – CMS Expo 2010 (Evanston)
- Oct 7, 2010 – HartmanEVENT 2010 - Social Media & Mobile Usability

Get the Newsletter
Email It
Stumble It
Add RSS
Processing...


It's Lucene, not PLucene that's used by Technorati. The developer does not have to provide a query parser, Lucene has a good default query parser.
Thanks for the correction.
You are right that there is a query parser as part of Lucene and Plucene. However, there are some very common query sytax expressions that will cause problems with that parser. I strong doubt one would put the "stock" parser into production.
I my experience, its a much more common practice to implement an intermediary parser that handles more syntax cases and one that is often tuned to what the given audience needs/expects.
-Brice