Dan Gray
Author Archive

Dan Gray

Dan has worked in online media and marketing across a variety of industries, from data science to interactive entertainment, building online communities around technology interests and innovative startups.

Alibaba
Uncategorized

Today we feature Alibaba‘s use of big data. The company filed for an IPO on Tuesday, and is expected to raise more than Facebook’s $16 billion in 2012. Alibaba dominates the e-commerce market in China. It can be considered a mix of Amazon, eBay and Paypal, adapted for the Chinese

Read More
Big Data Universe
infographics

The infographic leverages data from Wikibon’s Big Data report. It encapsulates the Big Data universe in single image, depicting the revenue breakdown derived from Big Data. These range from Hadoop pure-plays to legacy vendors to NoSQL, as well as companies less associated with the Big Data space. This infographic is

Read More
DRM and China's Knockoff Economy
Uncategorized

The United States gave the world the iPhone, China gave it the HiPhone. China has created knockoffs of almost every imaginable product, including the White House and entire European villages. Estimates of the cost of Chinese piracy to the US economy was estimated at $50 billion in 2011 alone. Another

Read More
Hadoop is free, but what's the catch?
Uncategorized

Hadoop is designed to make storing and processing large amounts of data affordable. It does this in two ways: Through hardware, by allowing use of normal processing chips, and Through software, with the open-source Apache licence Open-source software essentially means Hadoop is free, or more specifically no licence fee needs

Read More
The Simpsons' paradox
Data Science

Simpson’s paradox happens when behaviour at the group level is different from the behaviour of its subgroups. For example, consider a mobile app with 10,000 Android users and 5,000 on iOS. If we make the assumption that all users spend equally, then it may make more sense to prioritize development

Read More
Big Data Helping Us Run Faster
Uncategorized

Previously we had a look at how data-centric approaches were used in baseball and cricket to identify undervalued players. Today we review studies on absolute limits on how fast humans and animals can run. While human runners continue to set new speed records, racehorses appear to have reached a plateau.

Read More
Twitter Credibility Problem
Uncategorized

Twitter has a credibility problem. Fake information and photos proliferate the platform, especially after a natural disaster. For example, Hurricane Sandy photos depicted flooding of the whole East Coast and the Statue of Liberty being felled by a tsunami. For every problem, there’s a startup. Tweetcred appends a ‘credibility ranking’,

Read More
US middle class no longer the world richest
infographics

Last week, we had a quick look at Thomas Piketty’s Capital in the Twenty-First Century, and data-driven research that highlights rising inequality globally. The following infographic featured today further shows how the American middle class is no longer the most affluent in the world. Economic figures, such as GDP per

Read More
Big Data takes over the Humanities
Machine Learning

Google Translate is an excellent example of the foray of machine learning into languages. Can we extend this notion into the study of the humanities in general? Enter the digital humanities. There has been a lot of enthusiasm on how technology will open up new avenues of research, as well

Read More
big data be all end all
Uncategorized

This week, we will look at the three best articles that speak to the power of big data. There is a lot of literature on the topic. We find these articles best encapsulate how we can manipulate data and extract meaningful insights like never before. Next week, we will present

Read More