挖掘社交網絡

挖掘社交網絡 pdf epub mobi txt 電子書 下載2025

出版者:東南大學齣版社
作者:Matthew A·Russell
出品人:
頁數:332
译者:
出版時間:2011-5
價格:78.00元
裝幀:
isbn號碼:9787564126865
叢書系列:
圖書標籤:
  • 數據挖掘
  • 社會網絡
  • O'Reilly
  • mining
  • 計算機
  • 互聯網
  • sns
  • 計算機及軟件
  • 社交網絡分析
  • 數據挖掘
  • 網絡科學
  • 社會網絡
  • 圖論
  • Python
  • 機器學習
  • 信息傳播
  • 社區發現
  • 影響力分析
想要找書就要到 小美書屋
立刻按 ctrl+D收藏本頁
你會得到大驚喜!!

具體描述

《挖掘社交網絡(影印版)》,本書簡潔而且具有操作性的書將為你展示如何迴答這些甚至更多的問題,你將學到如何組閤社交網絡數據、分析技術,如何通過可視化幫助你找到你一直在社交世界中的內容。

著者簡介

馬修·羅塞爾(Matthew A.Russell),Digital Reasoning Systems公司的技術副總裁和Zaffra公司的負責人,是熱愛數據挖掘、開源和Web應用技術的計算機科學傢。他也是《Dojo: The Dofinitive Guide》(O'Reilly齣版社)的作者。在LinkedIn上聯係他或在Twitter上關注@ptwobrussell,可隨時關注他的最新動態。

圖書目錄

preface
1. introduction: hacking on twitter data
installing python development tools
collecting and manipulating twitter data
tinkering with twitter's apl
frequency analysis and lexical diversity
visualizing tweet graphs
synthesis: visualizing retweets with protovis
closing remarks
2. microformats: semantic markup and common sense collide
xfn and friends
exploring social connections with xfn
a breadth-first crawl of xfn data
geocoordinates: a common thread for just about anything
wikipedia articles + google maps = road trip?
slicing and dicing recipes (for the health of it)
collecting restaurant reviews
summary
3. mailboxes: oldies but goodies
.mbox: the quick and dirty on unix mailboxes
mbox + couchdb = relaxed email analysis
bulk loading documents into couchdb
sensible sorting
map/reduce-inspired frequency analysis
sorting documents by value
couchdb-lucene: full-text indexing and more
threading together conversations
look who's talking
visualizing mail "events" with simile timeline
analyzing your own mail data
the graph your (gmail) inbox chrome extension
closing remarks
4. twitter: friends, followers, and setwise operations
restful and oauth-cladded apis
no, you can't have my password
a lean, mean data-collecting machine
a very brief refactor interlude
redis: a data structures server
elementary set operations
souping up the machine with basic friend/follower metrics
calculating similarity by computing common friends and followers
measuring influence
constructing friendship graphs
clique detection and analysis
the infochimps "strong links" apl
interactive 3d graph visualization
summary
5. twitter: the tweet, the whole tweet, and nothing but the tweet
pen: sword∷ tweet: machine gun (?!?)
analyzing tweets (one entity at a time)
tapping (tim's) tweets
who does tim retweet most often?
what's tim's influence?
how many of tim's tweets contain hashtags?
juxtaposing latent social networks (or #justinbieber versus #teaparty)
what entities co-occur most often with #justinbieber and #teaparty
tweets?
on average, do #justinbieber or #teaparty tweets have more
hashtags?
which gets retweeted more often: #justinbieber or #teaparty?
how much overlap exists between the entities of #teaparty and
#justinbieber tweets?
visualizing tons of tweets
visualizing tweets with tricked-out tag clouds
visualizing community structures in twitter search results
closing remarks
6. linkedln: clustering your professional network for fun (and profit?)
motivation for clustering
clustering contacts by job title
standardizing and counting job titles
common similarity metrics for clustering
a greedy approach to clustering
hierarchical and k-means clustering
fetching extended profile information
geographically clustering your network
mapping your professional network with google earth
mapping your professional network with dorling cartograms
closing remarks
?. 6oogle buzz: tf-idf, cosine similarity, and collocations
buzz = twitter + blogs (???)
data hacking with nltk
text mining fundamentals
a whiz-bang introduction to tf-idf
querying buzz data with tf-idf
finding similar documents
the theory behind vector space models and cosine similarity
clustering posts with cosine similarity
visualizing similarity with graph visualizations
buzzing on bigrams
how the collocation sausage is made: contingency tables and scoring
functions
tapping into your gmail
accessing gmail with oauth
fetching and parsing email messages
before you go off and try to build a search engine.
closing remarks
8. blogs et al.: natural language processing (and beyond)
nlp: a pareto-like introduction
syntax and semantics
a brief thought exercise
a typical nlp pipeline with nltk
sentence detection in blogs with nltk
summarizing documents
analysis of luhn's summarization algorithm
entity-centric analysis: a deeper understanding of the data
quality of analytics
closing remarks
9. facebook: the all-in-one wonder
tapping into your social network data
from zero to access token in under 10 minutes
facebook's query apis
visualizing facebook data
visualizing your entire social network
visualizing mutual friendships within groups
where have my friends all gone? (a data-driven game)
visualizing wall data as a (rotating) tag cloud
closing remarks
10. the semantic web: a cocktail discussion
an evolutionary revolution?
man cannot live on facts alone
open-world versus closed-world assumptions
inferencing about an open world with fuxi
hope
index
· · · · · · (收起)

讀後感

評分

如果你希望从这本书里边学到任何软件使用方法以外的东西,我觉得你会失望的。 因为从第七章开始才讲算法,还将得各种悲剧。直接看wikipedia都能理解得更快。 之前的章节都是各个社交网络API的介绍和工具使用介绍,还算行吧。 里边提到的工具目录里边基本都有,直接上官方站...  

評分

虽然使用的语言是python,而且分析的网站都是国内被禁的网站,但是读完这本书后,感到很受启发,其实如果你懂了这本书中的内容,分析其他社交网站也会得心应手,比如说像国内的sina微博,人家提供的API也很有价值啊,你读完这本书,收获会很大。  

評分

Facebook、Twitter和LinkedIn产生了大量宝贵的社交数据,但是你怎样才能找出谁通过社交媒介正在进行联系?他们在讨论些什么?或者他们在哪儿?这本简洁而且具有可操作性的书将揭示如何回答这些问题甚至更多的问题。你将学到如何组合社交网络数据、分析技术,如何通过可视化帮助你...  

評分

原本是想学些数据分析的算法和思想,但是拿到这本书之后挺失望。看到第四章,全在讲如何使用twitter等社交网站的api。 只能当拓展知识面看看,了解下书里面讲到的开源工具。 另外,书的价格还不算便宜。  

評分

yes, damn beaver -,-# 社交网站的DM需要用直推来隐藏看似复杂却又简单,做起来简单却确实不是随便谁都能做好的工作。 UPLOAD YOUR SOUL TO THE ULTIMATE INTERNET!哈哈哈哈!

用戶評價

评分

做挖掘的入門好書,如果你想挖掘其它的,啓發也不小。

评分

做挖掘的入門好書,如果你想挖掘其它的,啓發也不小。

评分

做挖掘的入門好書,如果你想挖掘其它的,啓發也不小。

评分

做挖掘的入門好書,如果你想挖掘其它的,啓發也不小。

评分

做挖掘的入門好書,如果你想挖掘其它的,啓發也不小。

本站所有內容均為互聯網搜索引擎提供的公開搜索信息,本站不存儲任何數據與內容,任何內容與數據均與本站無關,如有需要請聯繫相關搜索引擎包括但不限於百度google,bing,sogou

© 2025 book.quotespace.org All Rights Reserved. 小美書屋 版权所有