//
Twitterlyzer

Twitterlyzer

Github repository: https://github.com/plotti/twitterlyzer

Description

Twitterlyzer is an application that helps researchers to collect Twitter data. It is build using  Rails 2.3.7 and most of the API calls are done through delayed_jobs in order to parallalyze the requests. The Twitter API is wrapped using grackle and twitter gems

  • Persons are organized in projects that contain a set of people.

Persons

  • collect one person
  • collect multiple persons based on a csv import
  • collect the egonetwork of a given person
  • calculate statistics of the people collected (friends, follower distributions, origin etc..)

Connections between persons

  • Connections between persons are stored not in the DB but on the HD in a PStore
  • THis allows to extremly fast access millions of edges, instead of going through DB tables

Tweets

  • collect the tweets of a person
  • collect the tweets of all persons
  • collects tweets based on a csv list
  • collect all retweets of all collected tweets
  • export all tweets into a csv
  • calculate statistics on the tweets (links used, keywords, timeline)

Networks

  • export the networks of the collected persons in a project the formats UCINET, Gephi
  • export the retweet networks of persons
  • export the @ networks between persons
  • export the person stats
  • export the Twitter links of persons

Scraping

  • Using scrapers under tasks directory you can scrape the following websites Murack.com, Google, Twellow, Wefollow

Sentiment

  • It can compute sentiment for german tweets using a bayesian classifier  (see tasks directory)

Screenshots

Discussion

No comments yet.

Leave a comment