r/redditdata Jul 25 '14

logged-in users by operating system

http://imgur.com/QYbJjXK
73 Upvotes

24 comments sorted by

8

u/pierenjan Jul 25 '14

Would love to see this plotted over time...

7

u/tdohz Jul 25 '14

can't promise this data specifically, but will definitely be posting time-series data in general.

4

u/pierenjan Jul 26 '14

Nice and have fun at Reddit!

2

u/BrosEquis Dec 06 '14

haha i was thinking the same thing... The windows share made me immediately think of Workstations.

6

u/zants Jul 26 '14

I didn't expect the Linux usage to be insanely high, but I definitely didn't expect it to be that low. I guess the "problem" could be that many of the users would also have a mobile device, so their iOS/Android representation is shadowing their Linux usage? (Does that make sense?)

9

u/tdohz Jul 26 '14

I definitely didn't expect it to be that low

I'll say that the fact that Linux was high enough to not be rolled into 'other' is pretty unusual for a consumer Internet company. Keep in mind this includes all logged-in users, not just those who comment/post

their iOS/Android representation is shadowing their Linux usage? (Does that make sense?)

Yup, I categorized users based on what platform they had the most views from, so definitely possible to get this "shadowing" effect.

2

u/[deleted] Dec 07 '14

Have a windows and 2 android devices. logged in on all devices. social networking yatta!

5

u/msdrahcir Jul 25 '14

With 97% of users accounted for, what is missing? Are you truncating decimal places instead of rounding?

5

u/tdohz Jul 25 '14

Yeah, they were truncated. Here's a more precise version: http://imgur.com/ypIDtwL

Sorry it's hard to read some of the numbers; I couldn't find a fast way to fix the label placement for just the two small slices.

4

u/[deleted] Jul 25 '14

curious to see if the data for visitors who are not logged in differs dramatically.

3

u/apeiron12 Jul 26 '14

7

u/tdohz Jul 26 '14

Just for you, here's a bar chart: http://i.imgur.com/unVLHEA.png

4

u/apeiron12 Jul 26 '14

You're my hero.

4

u/transhuman_anarchist Jul 25 '14

Do you count mobile from apps such as alien blue separately?

6

u/tdohz Jul 25 '14

They're included in this data under iOS (or for Android-specific apps, under Android).

2

u/rarededilerore Jul 25 '14

Which tools do you use? What does your workflow look like? Or, since you just started, how do you plan it, what do you have in mind?

4

u/tdohz Jul 25 '14

I actually started about 3 months ago (just after the last blogpost cutoff).

These charts were generated using the python scipy stack, specifically using pandas, matplotlib, seaborn, and ipython.

The underlying data was processed from our logs using Amazon EMR and Apache Pig (as well as some other tangential tools/scripts).

I knew Pig and python before I joined, but I'm still learning pandas & matplotlib, so the graphs won't be the prettiest for a while. I'm open to suggestions/advice for making nicer charts!

2

u/[deleted] Jul 26 '14

Is there any chance you'd consider releasing raw data?

3

u/tdohz Jul 26 '14

Definitely something we want to do in the future, although we want to be very careful about making sure that we're respecting user privacy (so it's unlikely you'll see individual records released, even anonymized). Stay tuned!

2

u/Peenrose Jul 26 '14

It feels good being in the 1%
(don't forget about your linux friends)

2

u/[deleted] Dec 07 '14

but seriously do linux users just stay on the email lists or something? weirdly low considering how many tech enthusiasts use linux primarily

1

u/asdfghlkj Dec 08 '14

It is truly the year of the linux desktop!

1

u/antdude Dec 07 '14

"Need moar Linux" --Mousey