Category Archives: Big Data

Alpine Data Analytics App Works Directly Against Hadoop

Alpine Data Analytics App Works Directly Against Hadoop

Slide Show Big Data: Not Just for Big Business Anymore The popular perception is that anything involving petabytes of data requires a lot of IT people and at least one data scientist to analyze. In reality, however, analytics applications are scaling to the point where analysts can now analyze […]

UPS Nets Huge Fuel Savings With Analytics

UPS Nets Huge Fuel Savings With Analytics

5 Big Wishes For Big Data Deployments (click image for larger view and for slideshow) Constructive dissatisfaction. That’s what UPS calls its ongoing quest for process improvement that brought about ORION, an On-Road Integrated Optimization and Navigation system that will save the shipper 1.5 million gallons of fuel in […]

Handling Big Data Backup & Recovery

Handling Big Data Backup & Recovery

John Edwards The fact that big data systems and applications must be supported by a fast and powerful recovery strategy is undeniable. There is clearly a growing need for more efficient ways to move massive data volumes over a WAN and to manage backup restoration from a holistic, enterprise-wide […]

Spatial Clustering With Equal Sizes

Spatial Clustering With Equal Sizes

Cluster Map This is a problem I have encountered many times where the goal is to take a sample of spatial locations and apply constraints to the algorithm.  In addition to providing a pre-determined number of K clusters a fixed size of elements needs to be held constant within […]

MapR Moves to Secure Hadoop

MapR Moves to Secure Hadoop

Slide Show Big Data: Not Just for Big Business Anymore When it comes to anything relating to Big Data, concerns about security are never far away. After all, concentrating massive amounts of data in one place can make for a very tempting target for hackers. At the Strata Conference […]

Main Big Data Technologies: NoSQL

Main Big Data Technologies: NoSQL

cap-theorem Share this content: When considering the technologies required to approach the problem of Big Data, it’s only natural to consider the database management system first. Most of the most widely used databases are already optimized to store and handle large data volumes. For some years now, systems based […]

Stinger Initiative Brings SQL Users to Hadoop Via Apache Hive

Stinger Initiative Brings SQL Users to Hadoop Via Apache Hive

Slide Show Eight Ways to Put Hadoop to Work in Any IT Department Hadoop is big, but there’s no doubt that the game changer will be marrying SQL— the primary language used by business analysts for ad hoc analysis—with Hadoop. If you don’t want the information in Hadoop to […]

Creating a .NET-based Visual Monitoring System for Hadoop | .NET Zone

Creating a .NET-based Visual Monitoring System for Hadoop | .NET Zone

Summary Generic Hadoop doesn’t provide any out-of-the-box visual monitoring systems that report on the status of all the nodes in a Hadoop cluster. This JNBridge Lab demonstrates how to create a .NET-based monitoring application that utilizes an existing Microsoft Windows product to provide a snapshot of the entire Hadoop […]

Using Amazon’s Elastic MapReduce to Compute Recommendations with Apache Mahout 0.8

Using Amazon’s Elastic MapReduce to Compute Recommendations with Apache Mahout 0.8

Apache Mahout is a “scalable machine learning library” which, among others, contains implementations of various single-node and distributed recommendation algorithms. In my last blog post, I described how to implement an on-line recommender system processing data on a single node. What if the data is too large to fit […]

Big Data: Just Because You Can Measure It, Doesn't Mean You Should

Big Data: Just Because You Can Measure It, Doesn’t Mean You Should

Customer Experience, Big Data: Just Because You Can Measure It, Doesn’t Mean You Should Big data is hotter than ever, but are we losing sight of our goals by getting wrapped up in the tool? Big data is in every conversation, blog post and business strategy — and rightfully […]