[go: up one dir, main page]

Skip to content

Utilities & scripts to collect and find insight from social network data and users.

Notifications You must be signed in to change notification settings

IHJpc2V1cCAK/socint

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Social Intelligence

Utilities & scripts to collect and find insight from social network data.

Store a local cache of data from social media sites in a Postgres database for analysis. Report on interesting trends, user patterns, etc..

Quick Example

Lets see what a user's been doing lately:

(env)~/reddit$ ./user_stats spez
INFO:main:obtaining user history: spez
INFO:main:querying user schedule
INFO:main:querying user_comments
INFO:main:generate schedule graph
INFO:main:obtaining user history

/u/spez history has 966 posts
 announcements         475 (49.2%)
 reddit.com             76 ( 7.9%)
 IAmA                   52 ( 5.4%)
 cscareerquestions      47 ( 4.9%)
 programming            34 ( 3.5%)
 modnews                34 ( 3.5%)
 technology             26 ( 2.7%)
 [...snip...]

spez posting schedule

Get Started

  1. Clone the repo
  2. Get a Postgres database running
  3. Dive into the platform-specific directories (e.g. Reddit, or Twitter)
  4. Follow platform-specific instructions found in that README
  5. Run some scripts under ./collect ... then check out what's available under ./report

Where are we going with this?

The project is static. Reddit changed their search functionality so cloudsearch no longer works. This leaves ./reddit/collect/subreddit_comments.py broken. The rest of it works as far as I know so you can still stream and get a redditors history. TODO: explore pushshift.io which looks like a better source for historical data from Reddit.

Releases

No releases published

Packages

No packages published

Languages