Tag Archives: data

MapR Moves to Secure Hadoop

MapR Moves to Secure Hadoop

Slide Show Big Data: Not Just for Big Business Anymore When it comes to anything relating to Big Data, concerns about security are never far away. After all, concentrating massive amounts of data in one place can make for a very tempting target for hackers. At the Strata Conference […]

Main Big Data Technologies: NoSQL

Main Big Data Technologies: NoSQL

cap-theorem Share this content: When considering the technologies required to approach the problem of Big Data, it’s only natural to consider the database management system first. Most of the most widely used databases are already optimized to store and handle large data volumes. For some years now, systems based […]

Stinger Initiative Brings SQL Users to Hadoop Via Apache Hive

Stinger Initiative Brings SQL Users to Hadoop Via Apache Hive

Slide Show Eight Ways to Put Hadoop to Work in Any IT Department Hadoop is big, but there’s no doubt that the game changer will be marrying SQL— the primary language used by business analysts for ad hoc analysis—with Hadoop. If you don’t want the information in Hadoop to […]

Creating a .NET-based Visual Monitoring System for Hadoop | .NET Zone

Creating a .NET-based Visual Monitoring System for Hadoop | .NET Zone

Summary Generic Hadoop doesn’t provide any out-of-the-box visual monitoring systems that report on the status of all the nodes in a Hadoop cluster. This JNBridge Lab demonstrates how to create a .NET-based monitoring application that utilizes an existing Microsoft Windows product to provide a snapshot of the entire Hadoop […]

Using Amazon’s Elastic MapReduce to Compute Recommendations with Apache Mahout 0.8

Using Amazon’s Elastic MapReduce to Compute Recommendations with Apache Mahout 0.8

Apache Mahout is a “scalable machine learning library” which, among others, contains implementations of various single-node and distributed recommendation algorithms. In my last blog post, I described how to implement an on-line recommender system processing data on a single node. What if the data is too large to fit […]

Big Data: Just Because You Can Measure It, Doesn't Mean You Should

Big Data: Just Because You Can Measure It, Doesn’t Mean You Should

Customer Experience, Big Data: Just Because You Can Measure It, Doesn’t Mean You Should Big data is hotter than ever, but are we losing sight of our goals by getting wrapped up in the tool? Big data is in every conversation, blog post and business strategy — and rightfully […]

Image Search with Splunk and Hunk

Image Search with Splunk and Hunk

One of the sexy new features Hunk brings to the Splunk 6 smorgasbord, is preprocessing data. Since Hunk is built on top of Hadoop’s MapReduce framework, we can utilize it’s preprocessing framework. Basically, now you can take any data, write a piece of code that turns it into text, […]

First Look: Facebook’s Oregon Cold Storage Facility

First Look: Facebook’s Oregon Cold Storage Facility

October 16th, 2013 By: Jordan Novet fb-hdds-470 Tweet PRINEVILLE, Ore. – Last Thursday, Facebook began migrating data – primarily pictures – to its newly constructed cold storage facility, within walking distance from its two huge data halls containing thousands of servers. On Tuesday, it opened the facility to reporters, […]

A first step towards R from spreadsheets

A first step towards R from spreadsheets

Move your data analysis to a computing environment specifically designed for it. Why R and not spreadsheets? Here are three reasons: Spreadsheets are easily overwhelmed.  Very complex things can be done in spreadsheets — it is just that complex spreadsheets are inefficient and dangerous . Graphics should be considered vital […]

Introducing: The Splunk App for Okta

Introducing: The Splunk App for Okta

I alluded to this last week in my post about Okta-ing Splunk –we’re now Splunking Okta ! Today, the Splunk App for Okta went live on Splunk Apps and we’ve already gained value from looking at how our Splunkers are logging into apps. Earlier this week, I was sitting […]