Tag Archives: mining

Big Data Gets a Little More Manageable

Big Data Gets a Little More Manageable

Processing advancements are getting a lot of attention for their use with Big Data. Big Data isn’t just Hadoop and in-memory anymore. Big data technologies and tools have grown significantly…

Big data's biggest impact is not on Silicon Valley

Big data’s biggest impact is not on Silicon Valley

As interesting as big data can be within Silicon Valley’s elite companies, it’s having a bigger impact on Middle America. Most Hadoop jobs can be found in Silicon Valley, where…

“Little Data” Matters, Too

“Little Data” Matters, Too

Back when I was working at the advertising agency JWT, one of our clients—a U.S. Marine Corps colonel—said something that has stuck with me ever since. “Look,” he said, “if I’m on a battlefield trying to defend a hill and I get a piece of intelligence, even if I’m […]

Yelp Graph: Business Clustering Based on Check-In Data

Yelp Graph: Business Clustering Based on Check-In Data

Recently, Yelp made available a sample dataset from the greater Phoenix metropolitan area including around 11,000 businesses, 8,000 check-in sets, 43,000 users and 230,000 user reviews. With the help of this data, data scientists can execute real-life experiments with various data mining/machine learning algorithms. In our case, we are […]

Five ways to handle Big Data in R

Five ways to handle Big Data in R

Five strategies to tackle big data with R Big data was one of the biggest topics on this year’s useR conference in Albacete and it is definitely one of today’s hottest buzzwords. But what defines “Big Data”? And on the practical side: How can big data be tackled in […]

Full Metal Hadoop - Christian Prokopp | Big Data Republic

Full Metal Hadoop – Christian Prokopp | Big Data Republic

Initially do-it-yourself distributions like Cloudera, MapR, and Hortonworks made up a great part of the market. In recent years, following the success of Amazon Elastic MapReduce (EMR), Hadoop/data services like Qubole have become popular. Qubole in particular has highlighted advantages over EMR. (See my From Zero to Big Data […]

Why Canada still needs census in age of data mining

Why Canada still needs census in age of data mining

Could data mining ever replace the National Household Survey? It’s a question prompted by this week’s stalled release of data  from the 2011 National Household Survey, over what Statistics Canada admitted were “serious errors.” The National Household Survey (NHS) replaced Canada’s longer-form mandatory census in 2011. Those errors, along with rising survey […]

Marketing: The Most Profitable Place for Big Data Analytics

Marketing: The Most Profitable Place for Big Data Analytics

Customer experience, digital Marketing: The Most Profitable Place for Big Data Analytics There’s currently a big talent war being fought between old school advertising agencies and big tech companies like Google, Facebook and Twitter. As April Dembosky outlined in her recent Financial Times article , both sides are snatching […]

Text Mining the Complete Works of William Shakespeare

Text Mining the Complete Works of William Shakespeare

I am starting a new project that will require some serious text mining. So, in the interests of bringing myself up to speed on the tm package, I thought I would apply it to the Complete Works of William Shakespeare and just see what falls out. The first order […]

Go Back to the Future With the Big Data Big Warehouse

Go Back to the Future With the Big Data Big Warehouse

Jeff Morris Although we don’t build data warehouses, or even data malls (Remember them?), along with the star schema model and metadata layer, we’re still seeking the same kinds of answers, and finding them just as elusive. What has changed is that we have eliminated the data-modeling step — […]