Splunk Digs Into the Year of the Big Data Application

Thursday Jan 2nd 2014 by Mike Vizard. This year will be the one in which businesses discover the applications that turn all that data into something of business value. If 2013 was the year that most organizations discovered what […]

CMWire's Top 10 Hits of 2013: Big Data

Yes, Big Data was a Big Buzzword in 2013. The technology and business press — and even mainstream media — got a piece of the action, churning out article after article about what Big Data means to you. And that’s part of the problem. Big Data means lots of […]

Using Amazon’s Elastic MapReduce to Compute Recommendations with Apache Mahout 0.8

Apache Mahout is a “scalable machine learning library” which, among other things, contains implementations of various single-node and distributed recommendation algorithms. In my last blog post, I described how to implement an online recommender system that processes data on a single node. What if the data is too large to fit […]

Response Time Percentiles for Multi-server Applications

In a previous post, I applied my rules-of-thumb for response time (RT) percentiles (or, more accurately, residence time in queueing theory parlance), viz., 80th percentile: $R_{80}$, 90th percentile: $R_{90}$ and 95th percentile: $R_{95}$, to a cellphone application and found that the performance measurements were not completely consistent. Since […]

Make Big Data Portable: the Basics

By Soam Acharya. If you’re reading this, then you probably know that we’re very much pro Hadoop-as-a-Service. Obviously, many organizations we speak to have concerns about the logistics of transporting all their data. While at first glance this process can appear intimidating, it’s actually a lot easier than many suspect, […]

Big Data's Dark Underbelly

By Doug Miles. According to the new AIIM survey report, "Big Data and Content Analytics: measuring the ROI," while big data analysis is recognized as a core organizational competence, 60% of organizations admit that their current BI (business intelligence) reporting capability is "inadequate," with an even larger […]

Logging, Processing and Monitoring Data using Talend, ElasticSearch, Logstash and Kibana

Your mission-critical projects need complex event processing, real-time management and monitoring. Talend 5.4 (released in December 2013, https://www.talend.com) offers a great new feature: Talend Event Logging. It allows logging, processing and monitoring of all technical events and business data. In this article, I will focus on how to […]

Cloudera's Enterprise Data Hub Rises to the Call of Amazon's AWS

Someone joked at Strata and Hadoop World earlier this year that Cloudera was ahead of its time when it chose its name. “You should have called it On-premise era,” said the would-be comedian, referring to the fact that Cloudera and most other enterprise-grade Hadoop distros live […]

Hadoop gets native R programming for big data analysis

Sensing a growing interest in big data-style analysis, software provider Revolution Analytics has updated its flagship package of R statistical functions so it can be run with the Hadoop data processing platform. Revolution R Enterprise 7 (RRE 7), to be made available on Monday, also features the ability […]

Intel Goes Graph with Hadoop Distro

December 17, 2013 by Alex Woodie. Intel will be targeting big retail operations with a new graph database that it unveiled today as part of its Intel Distribution for Apache Hadoop version 3 […]