Taming Text pdf epub mobi txt 電子書下載2025

簡體網頁||繁體網頁

☆☆☆☆☆

出版者:Manning Publications

作者:Grant S. Ingersoll

出品人:

頁數:320

译者:

出版時間:2013-1-24

價格:USD 44.99

裝幀:Paperback

isbn號碼:9781933988382

叢書系列:

圖書標籤:

文本分析
TextMining
計算機
IR
計算機科學
數據挖掘
NLP
Programming
文本處理
自然語言處理
信息提取
文本挖掘
機器學習
Python
數據科學
文本分析
NLP
文本分類

下載連結在頁面底部

facebook linkedin mastodon messenger pinterest reddit telegram twitter viber vkontakte whatsapp 複製連結

想要找書就要到小美書屋

book.quotespace.org

立刻按 ctrl+D收藏本頁

你會得到大驚喜!!

具體描述

It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and software engineers who want to make their text-based applications more useful and user-friendly. Whether you're building a search engine for a corporate website, automatically organizing email, or extracting important nuggets of information from the news, dealing with unstructured text can be a daunting task.

Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are bulit.

著者簡介

Grant Ingersoll is an independent consultant developing search and natural language processing tools. Prior to being a consultant, he was a Senior Software Engineer at the Center for Natural Language Processing at Syracuse University with 11 years of hands-on experience developing Java applications, many of which have been spent working on text processing applications. At the Center and, previously, at MNIS-TextWise, Grant worked on a number of text processing applications involving information retrieval, question answering, clustering, summarization, and categorization. Grant is a committer, as well as a speaker and trainer, on the Apache Lucene Java project and a co-founder of the Apache Mahout machine-learning project. He holds a master's degree in computer science from Syracuse University and a bachelor's degree in mathematics and computer science from Amherst College.

Thomas Morton writes software and performs research in the area of text processing and machine learning. He has been the primary developer and maintainer of the OpenNLP text processing project and Maximum Entropy machine learning project for the last 5 years. He received his doctorate in Computer Science from the University of Pennsylvania in 2005, and has worked in several industry positions applying text processing and machine learning to enterprise class development efforts. Currently he works as a software architect for Comcast Interactive Media in Philadelphia.

圖書目錄

讀後感

評分☆☆☆☆☆

还是那句话，有英文版的就绝不要读中文版的，特别是对于技术书籍。翻译的低级错误真是太多了。我就读了中文版不到一章就发现好多坑。吐槽开始：中文版77、81页：3.6.1 数量判定 3.6.2 判断数量这他么玩文字游戏呢！换个位置就好了？！对应的英文版是3.6.1 Judging qualit...

評分☆☆☆☆☆

偏重实践的书，理论部分略有欠缺。最重要的是：只讨论了Java。现在NLP应该Python是主流。 ---------------------------------- ---------------------------------- ---------------------------------- ---------------------------------- ---------------------------------...

評分☆☆☆☆☆