Tag Archives: big

Splice Machine: When traditional RDBMS hits the big data performance wall

Splice Machine: When traditional RDBMS hits the big data performance wall

Legacy relational databases can’t cope with big data demands, so Splice Machine built a Hadoop-based alternative for running operational, real-time applications. San Francisco-based startup Splice Machine markets itself as the…

EnterpriseDB Says JSON Toolkit Nixes NoSQL Drawbacks

EnterpriseDB Says JSON Toolkit Nixes NoSQL Drawbacks

NoSQL databases start out easy, but you’ll later struggle with data logic, says EnterpriseDB. JSON toolkit promises best of NoSQL and RDBMS. EnterpriseDB on Tuesday introduced a free developer kit…

Data scientist: Your mileage may vary

Data scientist: Your mileage may vary

For those madly scrambling to hire data scientists, make sure you’re hiring the right kind. Getting it wrong can be very expensive. Finding a good data scientist just got even…

This analysis of modern history is a prime example of why big data really matters

This analysis of modern history is a prime example of why big data really matters

Summary: A researcher has analyzed the past 35 years of world history in order to find periods most similar to present-day Ukraine and Arab-Spring-era Egypt. It’s a great example of…

Ethernet Interfaces Transform Object Storage

Ethernet Interfaces Transform Object Storage

Commentary Jim O’Reilly New direct Ethernet interfaces for object-oriented storage will change the rules of storage, allowing for large performance gains. The idea of direct Ethernet drive interfaces dates back to at least 2001, although very little interest was generated among the very conservative storage clientele. A product platform […]

What the Internet of Things Can Learn from IBM's Smarter Cities Initiative

What the Internet of Things Can Learn from IBM’s Smarter Cities Initiative

Infographic Smarter Cities. Turning Big Data into Insight.jpg If you take a step back and look at the bigger picture, it’s clear that many of the concepts driving the Internet of Things (IoT) have been around for some time now. It took a while for the penny to drop, but […]

Wanna Race? Cloudera Says Impala is Faster than Hive and Proprietary RDMS

Wanna Race? Cloudera Says Impala is Faster than Hive and Proprietary RDMS

race cars.jpg Cloudera made a big splash at O’Reilly Strata + Hadoop World 2013  in New York City last October when it announced its Enterprise Data Hub strategy. It wants it to be the place where companies park all of their data, regardless of its format, and from which […]

Pivotal Hadoop Distribution and HAWQ Realtime Query Engine

Pivotal Hadoop Distribution and HAWQ Realtime Query Engine

PivotalHD_ArchitectDiagram Introduction SQL on Hadoop and the support for interactive, ad-hoc queries in Hadoop is in increasing demand and all the vendors are providing their answer to these requirements. In the open source world Cloudera’s Impala, Apache Drill (backed by MapR), Hortonworks’s Stinger initiatives are competing in this market, […]

Spark: Low Latency, Massively Parallel Processing Framework

While Hadoop fits well in most batch processing workloads, and is the primary choice of big data processing today, it is not optimized for other types of workloads due to its following limitation:  For a more detail elaboration of the Hadoop limitation , refer to my previous post . […]

Semantic Web Business: Going Nowhere Slowly

Semantic Web Business: Going Nowhere Slowly

Seth Grimes The semantic web vision persists, but the tools and processes don’t stand up to today’s data chaos. I’ve been a semantic web skeptic for years. SemWeb is a narrowly purposed replica of a subset of the World Wide Web. It’s useful for information enrichment in certain domains, […]