The number of enterprise-level deployments of Hadoop MapReduce is rising quickly, driven by a need to understand and potentially adopt this new business analytics platform for business applications. We note that pilot Hadoop projects are underway within many of the … Continued
Introduction: The demand for Big Data analytics processes by enterprise executives and business group leaders is growing at a rapid pace. However, the challenge to enterprise IT from the Big Data analytics perspective lies in capturing data from multiple new … Continued
In this whitepaper, John Webster covers enhancements to Hadoop that can be realized by Veritas Cluster File Systems (CFS) users and discusses Symantec’s new solution for Enterprise Hadoop, which proposes to replace HDFS. Introduction: Hadoop MapReduce is now commonly used … Continued
In this Technical Insight, John Webster explores data movement in the context of data analytics.
In this whitepaper, John Webster discusses storage data growth and the various ways that IT management can develop new and more sustainable practices and processes to adequately manage that data growth. Introduction: The cost of acquiring, managing and maintaining all … Continued
In this completed four-part series, John Webster provides tips on managing big data in an organization. The first article covers how big data analytics differs from traditional data warehousing and introduces the distributed compute cluster as a foundation to big … Continued
John Webster’s presentation from SNW spring 2012 in Dallas, Texas. Description: Big Data Analytics differs from traditional data warehousing in that it encompasses diverse data formats including unstructured data. A number of platforms have emerged including Hadoop and NoSQL that … Continued
Apache Hadoop has gained considerable attention from the enterprise IT community as a data analytics alternative to traditional BI systems and data warehousing. And while this is not the only alternative currently available, it has become highly visible. However, with … Continued
We’re seeing dramatic growth in the use of Big Data database architectures (Hadoop MapReduce for example). While these are best known in the context of web-based applications and development activities, they are no longer confined to the web. Cloudera, EMC … Continued
Now that the cloud computing bandwagon is out of gas, vendors have jumped on the next one to roll down the pike: Big Data. And as with previous hype cycles, Big Data is now a source of confusion for users … Continued
In a summary of articles from CNet/Data-Driven, John Webster highlights the future of storage supporting “Big Data Storage” and “Big Data Analytics.”
Now that the cloud computing bandwagon is out of gas, vendors have jumped on the next one to roll down the pike: Big Data. And as with previous hype cycles, Big Data is now a source of confusion for users as … Continued
The practitioners of Big Data Analytics processes are generally hostile to shared storage. They prefer direct-attached storage (DAS) in its various forms, from solid-state disk (SSD) to high-capacity SATA disk buried inside parallel processing nodes. The perception of … Continued
John Webster’s presentation from SNW 2011 on Big Data Analytics and its impact on storage.
Discussion on Big Data Analytics and EMC’s Greenplum announcement
On July 6, 2010, EMC announced that it would acquire Greenplum, a data warehousing and business analytics software firm, in an all-cash transaction. This document summarizes the details and discusses the market implications.