Published by

Nicole Radziwill

Image Credit: Doug Buckley at http://hyperactive.to

So yesterday, I set up an #AmazonGiveaway for my new R book at https://giveaway.amazon.com/p/ea32d421d8d7672d — but I had my 10 year old input the number that will determine every nth person who gets the printed copy delivered to them, so that I’d be surprised too when it happened. Well I got surprised today, because nothing’s happened yet… he must have set the number pretty high. I’m a little impatient, so I decided that today I’d like to randomly sample my most recent 100 Twitter followers and send 3 eBooks to whoever comes up.

Turns out, it was pretty simple once I found the right documentation. This was the first time I’ve successfully accessed information from Twitter within R; when I tried other times, the documentation I encountered was problematic and the authentication never worked. But I finally converged on excellent documentation which helped to solve my problem at http://geoffjentry.hexdump.org/twitteR.pdf.

First, I went to https://apps.twitter.com/app/new and set up an application called “random-new-followers”. I think the choice of name is totally arbitrary… Twitter just wants a way to track who’s using their API and how it’s being used. I gave this form my name and web site URL as well. Next, I went to the “Keys and Access Tokens” tab. I had to click the “Create Access Token” button at the bottom. Once I did, I had 4 different access keys to work with, each of which looks like an unwieldy string of numbers and letters and has one of the following four names:

Consumer Key (API Key)
Consumer Secret (API Secret)
Access Token
Access Token Secret

Then I went into my R console. First, I wanted to make sure I had the most recent copy of the httr and twitteR packages. After using .libPaths() to find out where my R packages are installed, I went into those directories and blew away the old folders containing httr and twitteR. Then I went into R and re-installed the two packages:

> install.packages("httr")
> library(httr)
> install.packages("twitteR")
> library(twitteR)

At this point, authenticating and getting access to the Twitter data was pretty straightforward:

setup_twitter_oauth("Consumer Key", "Consumer Secret",access_token="Access Token", access_secret="Access Token Secret")

(When YOU use the above code, be sure to plug in the long, unwieldy number-and-letter combinations you saw on your “Keys and Access Tokens” tab. They are unique to you… and this is what unlocks twitteR so you can get real data!) Next, you can specify the user whose information you’d like to obtain, and using getFollowers() you can find out information about their followers:

Notice that I only wanted to randomly sample from my most recent 100 followers, hence the [1:100]. The names that pop up in quotation marks are my random sample of followers. (If you wanted to sample from all your Twitter followers, just leave out the [1:100].) After I got my three random followers, I checked out their Twitter pages. The bad news? I think they are all bots 😦

(So then I filtered out anyone who had “811” or “#AmazonGiveaway” in their recent Tweets.)

There is a lot more information that’s obtained when you try getUser() from the twitteR package. If you’ve gotten this far, check out all of the interesting information (and methods) that are contained in the results by typing this:

> str(me)

Rate this:

7 responses to “Randomly Sample Twitter Followers in R”

Nicole Radziwill

April 27, 2015

Having a heck of a time with WordPress on this post… it keeps violating my code and turning it into nonsense.

Reply
corynissen

April 27, 2015

Highlight.js is great for this. There’s a wordpress plugin… https://wordpress.org/plugins/wp-highlightjs/

Reply
1. Nicole Radziwill
  
  April 27, 2015
  
  My installation is hosted, so I’m not in control of the plugins 😦 But thank you! We do have a couple blogs that we administer ourselves and this will be really helpful.
  
  Reply
corynissen

April 27, 2015

Highlight.js is good for this. There’s a wordpress plugin… https://wordpress.org/plugins/wp-highlightjs/

Reply
Nicole Radziwill

April 28, 2015

WordPress violated my code AGAIN — unexpectedly and about 12 hours after updating the post — so I just turned the problematic portion into image. Take that, WordPress!

Reply
Bryan

April 30, 2015

How did you account for sampling your 100 “most recent” followers as opposed to just the first 100 followers returned in the vector? Does the Twitter API sort followers in a chronological manner? I’m not asking to point anything out in your approach, I’m asking because this could be useful to me in the future.

Thanks!

Reply
1. Nicole Radziwill
  
  April 30, 2015
  
  So I did a really unsophisticated browse through my last 200 new followers, and found that *pretty much* — the response that you get from getFollowers() is organized according to time, in reverse. The end of the data structure definitely corresponded to my oldest and first followers. Didn’t find any documentation definitively confirming the order, but from my extremely unscientific sample of 1, it worked and met my needs. Post back here if you find a definitive answer… this would be useful to me too!
  
  Reply

I’m Nicole

Since 2008, I’ve been sharing insights and expertise on Digital Transformation & Data Science for Performance Excellence here. As a CxO, I’ve helped orgs build empowered teams, robust programs, and elegant strategies bridging data, analytics, and artificial intelligence (AI)/machine learning (ML)… while building models in R and Python on the side. In 2025, I help leaders drive Quality-Driven Data & AI Strategies and navigate the complex market of data/AI vendors & professional services. Need help sifting through it all? Reach out to inquire – check out my new book that reveal the one thing EVERY organization has been neglecting – Data, Strategy, Culture & Power.

More About Me or HIRE ME OR MY PEOPLE

Let’s connect

Get Notifications

Stay updated with our latest ideas, books, and courses, or follow me on LinkedIn.

Quality and Innovation

Randomly Sample Twitter Followers in R

Rate this:

7 responses to “Randomly Sample Twitter Followers in R”

Leave a reply to Nicole Radziwill Cancel reply

I’m Nicole

Let’s connect

Get Notifications

Recent posts

Sausage Program

Psychological Forces in Data Management

How Data Loses Value Over Time

Process in Service of Value

Looking Ahead to 2025

The Scariest Part of Corporate Halloween

Rate this:

Share this:

7 responses to “Randomly Sample Twitter Followers in R”

Leave a reply to Nicole Radziwill Cancel reply

I’m Nicole

Let’s connect

Get Notifications

Recent posts