图书标签: lucene 搜索引擎 信息检索 java IR Lucene 自然语言处理 计算机科学
发表于2025-02-22
Lucene in Action, Second Edition pdf epub mobi txt 电子书 下载 2025
HIGHLIGHT New edition of top-selling book on the new version of Lucene--the core open-source technology behind most full-text search and "Intelligent Web" applications. DESCRIPTION When Lucene first hit the scene five years ago, it was nothing short of amazing. By using this open-source, highly scalable, super-fast search engine, developers could integrate search into applications quickly and efficiently. A lot has changed since then--search has grown from a "nice-to-have" feature into an indispensable part of most enterprise applications. Lucene now powers search in diverse companies including Akamai, Netflix, LinkedIn, Technorati, HotJobs, Epiphany, FedEx, Mayo Clinic, MIT, New Scientist Magazine, and many others. Some things remain the same, though. Lucene still delivers high-performance search features in a disarmingly easy-to-use API. Due to its vibrant and diverse open-source community of developers and users, Lucene is relentlessly improving, with evolutions to APIs, significant new features such as payloads, and a huge increase (as much as 8x) in indexing speed with Lucene 2.3. And with clear writing, reusable examples, and unmatched advice on best practices, Lucene in Action, Second Edition is still the definitive guide to developing with Lucene. KEY POINTS * Completely revised and updated to current Lucene 2.3 APIs. * Practical coverage, like how to index MS Word, PDF, HTML, and XML. * Full introduction to Intelligent Web topics like smart searching, sorting, and filtering.
MICHAEL MCCANDLESS has been building search engines for over a decade. In 1999,with three other people, he founded iPhrase Technologies, a startup providing usercentric enterprise search engine software, written in Python and C++. After IBM acquired iPhrase in 2005, Michael became involved in Lucene and started contributing patches, becoming a committer in 2006 and PMC member in 2008. Michael received his B.S., M.S and Ph.D. from MIT, and now lives in Lexington, MA along with his wonderful wife, Jane, and four delightful kids, Mia, Kyra, Joel and Kyle. Michael’s blog is at http://chbits.blogspot.com.
ERIK HATCHER codes, writes, and speaks on technical topics that he finds fun and challenging. He has written software for a number of diverse industries using many different technologies and languages. Erik coauthored Java Development with Ant (Manning,2002) with Steve Loughran, a book that has received industry acclaim. Since the release of Erik’s first book, he has spoken at numerous venues including the No Fluff, Just Stuff symposium circuit, JavaOne, O’Reilly’s Open Source Convention, JavaZone, devoxx, user groups, and even sometimes webinars. As an Apache Software Foundation member, he is an active contributor and committer on several Apache projects including Lucene and Solr. Erik proudly presents his favorite technologies passionately, recently notables are Solr, Solritas, Flare, Blacklight, and solr-ruby—preferring to dabble at the intersection of user experiences and Solr. Erik cofounded Lucid Imagination, where he helps carry the torch for open-source search goodness. Erik keeps fit and serene in central Virginia.
OTIS GOSPODNETIC ′ has been a Lucene developer since before Lucene became Apache Lucene. He is the co-founder of Sematext, a company that focuses on providing services and products around search (focusing on Lucene, Solr, and Nutch) and analytics (think BigData, Hadoop, etc.). Otis has given talks about Lucene and Solr over the years and some of his previous technical publications include articles about Lucene, published by O’Reilly Network and IBM developerWorks. Years ago, Otis also wrote To Choose and Be Chosen: Pursuing Education in America, a guidebook for foreigners wishing to study in the United States; it’s based on his own experience. Otis currently lives in New York City where he runs the NY Search & Discovery Meetup.
很好的书,现在还在看
评分附录B关于Lucene索引格式的说明非常棒
评分Lucene make search easy.
评分拿出来重新细读。
评分Manning出版的XXX In Action系列的书翻译的都不是很好,读书不如静下心来看源码,有时候甚至源码要比翻译的文字清晰的多。
不错的一本书,对Lucene,或者说,Search中的一些关键点都有详细的讲述。 看完后再去看源代码,可以做到事半功倍。
评分我们team一直用lucene,不过把lucene用的跟关系表似的 汗一个 搜索引擎三大块,索引查找和打分 这本书索引讲的不够深入,其实lucene索引的内部的数据结构还是很经典的 打分写的太浅,应该找个例子更深入一些 查找部分我个人认为是写的可以的, 可作为入门书,一定要记得学习下...
评分书写得挺好,全面介绍了Lucene这个非常流行的java全文搜索引擎的框架。 英文不难,条理清晰,读起来挺有味道。 遗憾的是示例的API过时了。例如 现在Lucene3.0 中的 Field的创建方式与本书中所说的相差很大;IndexWriter的构造函数也有变化。 相信还有其他deprecated 的地方...
评分很久以前见百度的人用过这个,感觉是一本圣书。但是,初次看的时候,很失望。 书中就是对lucene的几个基本接口作了介绍,举了一些例子。但是对实现的细节没有做说明。 要彻底认识lucene还得从阅读源代码入手,结合lucene in action中介绍的API, 沿着数据处理流...
评分做Lucene也只有这本书能参考了,没啥选择。还不错,全面,重要的细节也讲了,做Lucene必备参考书。
Lucene in Action, Second Edition pdf epub mobi txt 电子书 下载 2025