Category Archives: Big Data

How Ancestry.com Manages Generations Of Big Data

How Ancestry.com Manages Generations Of Big Data

Jeff Bertolucci Over the past year, the genealogy site’s repository of family historical data has more than doubled in size. Here’s how Ancestry managed its growth. Businesses often use — or overuse — the term “big data” to describe all sorts of data-related products and services, but the buzzword […]

Yelp Graph: Business Clustering Based on Check-In Data

Yelp Graph: Business Clustering Based on Check-In Data

Recently, Yelp made available a sample dataset from the greater Phoenix metropolitan area including around 11,000 businesses, 8,000 check-in sets, 43,000 users and 230,000 user reviews. With the help of this data, data scientists can execute real-life experiments with various data mining/machine learning algorithms. In our case, we are […]

How Is Hadoop Like Teenage Sex? [Infographic]

How Is Hadoop Like Teenage Sex? [Infographic]

Hortonworks_Hadoop_Summit_Infographic.jpg How is Hadoop like teenage sex? It’s an old riddle whose answer is changing quickly. If you don’t already know what it is, read on. And if you do, read on anyway (and check out the Infographic) because we have some marvelous visual statistics to share. How Is […]

Are VCs Getting Duped by Hadoop?

Are VCs Getting Duped by Hadoop?

Robert Mullins Always keeping an eye on the horizon While Hadoop is touted as the next big thing, the next, NEXT big thing is something called the “semantic Web,” according to Charles Silver, CEO of Algebraix, which has developed what Silver calls the first commercially available, high-performance platform for […]

Splunk Enterprise & Hunk for Hadoop at Cisco Labs

Splunk Enterprise & Hunk for Hadoop at Cisco Labs

At the end of October, Splunk announced the release of new product called Hunk: Splunk Analytics for Hadoop . Once you get over the awesome name, you realize how much of a game-changer it is to give individuals across the organization the ability to interactively explore, analyze and visualize […]

Designing Machine Learning Frameworks: Flexibility and Sound Design

Designing Machine Learning Frameworks: Flexibility and Sound Design

As a response to my last post , people mentioned mlbase to me as a potential candidate for bringing scalability and machine learning closer together. I took a closer look and wasn’t really that impressed. Don’t get me wrong, this is not a bad project, but it is still […]

The ‘Deutsche Bahn’ (German Railway Corp.) is always late!!!! Or is it? And if, why?

The ‘Deutsche Bahn’ (German Railway Corp.) is always late!!!! Or is it? And if, why?

(This article was first published on Rcrastinate , and kindly contributed to R-bloggers) The biggest German railway company, the ‘Deutsche Bahn’, is subject of frequent emotional discussions about being late all the time. A big German newspaper, the Süddeutsche Zeitung built the so-called ‘train monitor’ (Zugmonitor). The data is […]

Introducing CrimeMap – A Web App Powered by ShinyApps!

Introducing CrimeMap – A Web App Powered by ShinyApps!

A few months ago I did a mini project using open crime data and R to create crime visualisations . At that time, I was already thinking about a web app using Shiny but I couldn’t justify the time to develop the app and then set up a server […]

Five ways to handle Big Data in R

Five ways to handle Big Data in R

Five strategies to tackle big data with R Big data was one of the biggest topics on this year’s useR conference in Albacete and it is definitely one of today’s hottest buzzwords. But what defines “Big Data”? And on the practical side: How can big data be tackled in […]

Hedge Funds Pick Nuggets From Online Social Conversations

Hedge Funds Pick Nuggets From Online Social Conversations

Kishore Jethanandani Hedge funds don’t dismiss the gaggle on social media sites as mere white noise but another source of data to gain alpha. Machine learning and natural language processing algorithms aid in sifting through the chaff of social media data and zero down on the valuable gems. “Social […]